International Journal of Computers, Communications & Control
Vol. II (2007), No. 1, pp. 37-38

International Conference on Virtual Learning – ICVL 2006

The current issue of the journal contains seven extended papers published in the "Proceedings of the 1st International Conference on Virtual Learning, October 27-29, 2006, Bucharest, Romania (ICVL 2006)" (M. Vlada, G. Albeanu & D.M. Popovici, Eds.). The first edition of the International Conference on Virtual Learning, ICVL 2006, was organized by the University of Bucharest, Faculty of Mathematics and Computer Science, in association with the European INTUITION project (the INTUITION Network of Excellence in Europe, http://www.intuition-eunetwork.net/) and in conjunction with the fourth National Conference on Virtual Learning, CNIV 2006, to celebrate one hundred years from the birth of the great Romanian mathematician Grigore Moisil (1906-1973), recipient of the "Computer Pioneer Award" of the IEEE Computer Society (1996). The ICVL was structured to provide a vision of European e-learning and e-training policies, to take notice of the situation existing today in the international community, and to work towards developing a forward-looking approach to virtual learning from the viewpoint of modelling methods and methodological aspects (M&M), information technologies (TECH) and software solutions (SOFT).
The conference established a large area of topics covering the following subjects, but not limited to: innovative teaching and learning technologies; web-based methods and tools in traditional and online education and training; collaborative e-learning; e-pedagogy; design and development of online courseware; information and knowledge processing; knowledge representation and ontologies; cognitive modelling and intelligent systems; algorithms and programming for modelling; advanced distributed learning technologies; web, virtual reality/AR and mixed technologies; mobile e-learning; communication technology applications; computer graphics and computational geometry; intelligent virtual environments; new software environments for education and training; streaming multimedia applications in learning; scientific web-based laboratories and virtual labs; soft computing in virtual reality and artificial intelligence; avatars and intelligent agents.

Initially, 72 abstracts were received and 55 of them were selected. Finally, only 34 papers were accepted for presentation at the ICVL and for publication in the Proceedings of the ICVL, Bucharest University Press (ISBN 978-973-737-218-5). Participants coming from Europe, Japan, Australia and Canada discussed during the conference various aspects concerning future developments in the virtual learning field. Four invited papers, addressing trends in professional learning; time series modelling, analysis and forecasting in e-learning environments; AeL, the e-learning universal platform; and the teaching-through-projects methodology, were presented as plenary lectures. Ten papers proposed different software solutions, while twenty papers were dedicated to modelling methods and methodological aspects. Some ICVL papers were considered for publication in the current issue of the International Journal of Computers, Communications and Control. Let us present an introduction to the selected papers.

The paper of A.
Andreatos is dedicated to defining and classifying virtual communities and their importance for informal learning, and to examining their social impact and the resulting trends in technology management. A bibliographical review and some case studies illustrate the aforementioned tasks. A. Anohina, in her paper, considers intelligent tutoring systems (including architecture topics based on a two-layer approach) powered by adaptive support for learners in order to solve practical problems; the minimax algorithm is considered as a practical illustration. The paper of N. Doukas and A. Andreatos presents a computer-aided assessment system (e-Xaminer, a web-based interface system) based on parametrically designed questions that uses meta-language concepts to automatically generate tests. In their paper, I. Kitagaki and his colleagues present an algorithm for groupware modelling for a collaborative learning environment using mobile terminals. They show not only the grouping algorithm but also some considerations about discussion in a classroom. M. Lambiris presents the concepts and a technique used to design a methodology for providing individualised computer-generated feedback to students. Such an approach can also be used to provide detailed and highly accurate information to the instructor about the performance of the whole group. The paper of G. Moise describes a software system for online learning using intelligent agents (an execution agent and a supervisor agent) and conceptual maps. Experimental results are also considered. M. Oprea presents in her paper a multi-agent system design procedure to be applied to university course timetable scheduling, which is a difficult administrative task. A preliminary evaluation of the proposed multi-agent system is presented in order to show the benefits obtained when a university uses such an approach.

Copyright © 2006-2007 by CCC Publications
Considering the success of the ICVL 2006 event, the scientific community shows great interest in the second edition (ICVL 2007, 26-28 October), which will take place at Ovidius University of Constanta, Romania.

Grigore Albeanu
Guest Editor, ICVL Technical Program Chair
UNESCO Chair in Information Technologies
University of Oradea, University Street, No. 1
410087, Oradea, Bihor, Romania

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844
Vol. V (2010), No. 5, pp. 613-615

Writing as a Form of Freedom and Happiness: Celebrating the 60th Birthday of Gheorghe Păun

Gabriel Ciobanu
Romanian Academy, Institute of Computer Science, and A.I. Cuza University of Iasi, Romania
E-mail: gabriel@info.uaic.ro

Gheorghe Păun (born on December 6, 1950) graduated from the Faculty of Mathematics of the University of Bucharest in 1974 and got his PhD at the same faculty in 1977. He has won many scholarships in Germany, Finland, the Netherlands, Spain, etc. Presently he is a senior researcher at the Institute of Mathematics of the Romanian Academy, Bucharest, and a Ramon y Cajal research professor at Sevilla University, Spain. Since 1997 he has been a corresponding member of the Romanian Academy, and since 2006 a member of Academia Europaea. His main research fields are formal language theory (regulated rewriting, contextual grammars, grammar systems), automata theory, combinatorics on words, computational linguistics, DNA computing, and membrane computing (this last area was initiated by him in 1998). He has (co)authored and (co)edited more than fifty books in these areas, and he has (co)authored more than 400 research papers. In the last two decades he has visited many universities in Europe, the USA, Canada and Japan, also participating in many international conferences, several times as an invited speaker. He is a member of the editorial boards of numerous computer science journals and professional associations.
Figure 1: A copy cropped from [1]

Essentially, writing is a form of thinking on paper, and a way of learning. According to Winston Churchill, writing a book is an adventure: "To begin with, it is a toy and an amusement; then it becomes a mistress, and then it becomes a master, and then a tyrant. The last phase is that just as you are about to be reconciled to your servitude, you kill the monster, and fling him out to the public." On the other hand, writing can be a form of freedom, a way of escaping the madness of a period and reducing anxiety. In many situations authors write to save themselves, to survive as individuals. Gheorghe Păun is an example of a person affirming his own existence by writing. He is a prolific writer with a huge number of works: tens of scientific books, hundreds of articles, several novels, poems, and books on games. A list of his scientific publications is posted at http://www.imar.ro/~gpaun/papers.php [2], while his books are listed at http://www.imar.ro/~gpaun/books.php [1]. His way of distributing information is not by speaking, but by writing. Gheorghe Păun did not like very much to teach in universities. He preferred a form of "teaching by researching", combining ideas with nice metaphors and distributing his knowledge in articles and books. In this way he wrote several papers with a high impact on the scientific community. His seminal paper "Computing with Membranes", published in the Journal of Computer and System Sciences in 2000, and his fundamental book on computation theory, "Membrane Computing" (Springer, 2003), have over 1,000 citations [6] (and their author was recognized as an "ISI Highly Cited Researcher" [5]). He has defined new branches, new theories. The field of membrane computing was initiated by Gheorghe Păun as a branch of natural computing [3]; P systems are inspired by the hierarchical membrane structure of eukaryotic cells [4].
An impressive handbook of membrane computing was published recently (2010) by Oxford University Press. After 1990 he became a travelling scientist, visiting several countries and receiving many research fellowships and awards, with fruitful scientific collaborations at Magdeburg University (Germany) and at the University of Turku (Finland). The trio Gheorghe Păun, Grzegorz Rozenberg and Arto Salomaa is well known for several successful books. The last years were spent in Spain, first in Tarragona and now in Sevilla. Several collaborations were possible during his trips, and there are over 100 co-authors from many countries. His scientific reputation is related to the large number of invited talks given at many international conferences and universities. He is a member of the editorial boards of several international journals, a corresponding member of the Romanian Academy (from 1997), and a member of Academia Europaea (from 2006). It is not possible to understand the personality of Gheorghe Păun without mentioning his activity as a writer of novels and poems; he has been a member of the Romanian Writers Association for a long time. Another aspect of his life is related to the intellectual seduction of games; he was the promoter of Go in Romania, writing many books about Go and other "mathematical" games. Personally, I am impressed by the speed of his mind (it is enough to say a few words about some new results, and he is able to complete quickly the whole approach), his wide-ranging curiosity and intelligence, rich imagination and humor, talent and passion. He is highly motivated by challenging projects, and works hard to conclude them successfully. There are very few scientists with such an interesting profile, and I am very happy to learn a lot from him. Celebrating his 60th birthday, we wish him good health, long life, and new interesting achievements!

Figure 2: G.
Păun, an ISI Highly Cited Researcher (a copy cropped from [5])

Bibliography

[1] Gheorghe Păun, One More Universality Result for P Systems with Objects on Membranes, International Journal of Computers, Communications & Control, 1(1): 25-32, 2006 (free access at http://www.journal.univagora.ro/download/pdf/21.pdf)
[2] http://www.imar.ro/~gpaun/
[3] http://esi-topics.com/
[4] http://ppage.psystems.eu/
[5] http://hcr3.isiknowledge.com/
[6] http://interaction.lille.inria.fr/~roussel/projects/scholarindex/index.cgi

International Journal of Computers, Communications & Control
Vol. II (2007), No. 1, pp. 103-105

Management Information Systems: Managing the Digital Firm, 9th Edition
Authors: Kenneth C. Laudon and Jane P. Laudon
(Book Review)

Florin G. Filip

Management Information Systems: Managing the Digital Firm, 9th Edition
Authors: Kenneth C. Laudon and Jane P. Laudon
Pearson Prentice Hall, Pearson Education, Inc., Upper Saddle River, New Jersey 07458
ISBN 0-13-153841-1; xxxii+641+22+14+2+21 pages

This is the 9th edition of a successful textbook. The authors are two well-known and productive writers. K.C. Laudon, a professor of information systems at the Stern School of Business of New York University, took his B.A. in economics from Stanford and his Ph.D. from Columbia University. He is the author of twelve books and over forty articles about the social, organizational and management impacts of information systems, privacy, ethics, and multimedia technology. Jane P. Laudon, a management consultant in the information systems area, took her M.A. from Harvard University and her Ph.D. from Columbia University, and has authored seven books. Her main scientific interests are systems analysis, data management and software evaluation.
The background, scientific interests and expertise of the authors and their previous works had an obvious impact on the manner in which this book was conceived, written and accompanied by auxiliary materials (the CD-ROM and the companion web site). The authors start from the premise that, nowadays, "information systems knowledge is essential for creating successful, competitive firms, managing global corporations, adding business value and providing useful products and services to customers" (p. xix). Moreover, they state that "in many industries, survival and even existence without the extensive use of IT is inconceivable" (p. 31). An important development the authors remark upon is the emergence of the digital firm, "where nearly all core business processes and relationships with customers, suppliers and employees are digitally enabled" (p. 31). In the book, management information systems (MIS) is defined at large as "the study of [computer-based] information systems in business and management" (p. 44). Besides, the authors adopt a broader view of information systems (IS), "which encompasses an understanding of the management and organizational dimensions as well as technical dimensions of the systems as information systems literacy" (p. 20). Consequently, this book can be viewed as an effort made by the Laudons to contribute to building up and consolidating such an information systems literacy for current and future managers, who are to be confronted with several major challenges concerning: a) "information system investments", b) "strategic business", c) "globalization", d) "information infrastructure", and e) "ethics and security" (p. 28). The authors have noticed a "user-designer communication gap". In Table 15.3 (p. 552) they give several examples of that gap.
While the user is concerned with problem-solving-related questions such as "Will the system deliver the information I need for my work?", "How quickly can I access the data?", ..., "How will the system fit into my daily business schedule?", the designer is preoccupied with finding optimal answers to technology-oriented questions such as "How many lines of program code will it take to perform this function?", ..., "What database management system should we use?" In order to help future managers successfully face the major challenges mentioned above and to bridge the possible communication gap, the authors adopt a sociotechnical view and style of presentation. They combine technical aspects (drawn from computer science, management science, and operations research) with behavioral elements (drawn from economics, sociology, psychology). Throughout the book, the presentation method chosen contains, besides the introduced MIS concepts, facts about real-world experiences (circa 200 examples are given), new technologies, and various exercises. In accordance with the sociotechnical perspective adopted, the book is organized in four parts, as follows.

Part One, which is made up of five chapters, addresses the organizational and managerial foundations of information systems. It introduces real-world systems and highlights their relationship with organizations and management. The concept of the digital firm is explained together with the fashionable concepts of e-business and e-commerce. Various types of information systems are reviewed (and they are presented in detail in the remaining chapters). The four major business functions (sales and marketing, production and manufacturing, human resources, and finance and accounting) are introduced with a view to being resorted to in an ending section of all remaining chapters. A special chapter is dedicated to ethical and social issues in the digital firm.
Part Two, which is made up of five chapters, addresses the various facets of the information technology infrastructure (ITI). ITI is viewed as a set of technology resources (hardware and software) and as a set of services (computing platform services, telecommunication services, data management services, application software services, IT management services, standards services, education and research-and-development services), composed of both human and technical capabilities (p. 186). A particular emphasis is laid on describing leading-edge wireless technologies, and security and control aspects.

Part Three, which is made up of three chapters, describes several types of information systems. One by one, the main concepts of enterprise applications (including enterprise resource planning (ERP), customer relationship management (CRM), and supply chain management (SCM)), knowledge management systems, and [group and executive] decision support systems are presented.

Part Four, which is made up of three chapters, presents the process of building, implementing and managing the systems in organizations. Several important topics, such as development approaches (based on lifecycle or prototyping), managing the necessary changes in the organizations, and various methods for evaluating the business value of the project, are reviewed. The last chapter addresses the specific aspects of managing international information systems.

In comparison with previous editions, the present edition contains much up-to-date information about leading-edge technologies. The chapters which address the "wireless revolution", enterprise applications, and knowledge management systems are new. Other chapters were rewritten and/or completed with new topics. Throughout the book, all chapters are organized in the same manner. Each chapter opens with the statement of five learning objectives.
Then, five (in a few cases, four) subchapters of text follow, providing the MIS concepts and real-world examples related to the learning objectives. Other features common to all chapters are: a) the opening case, which describes a real-world example, and the corresponding diagram used to analyze the case in terms of the management, organization and technology model; b) the concluding section on management opportunities, challenges and solutions related to the theme of the chapter; c) the "Make IT Your Business" section, placing the concepts described in the chapter in relation with the major business functions; d) the "Chapter Summary", organized in accordance with the chapter objectives; e) the list of key terms; f) the review questions; g) the application software exercise, for developing solutions to real-world business problems; h) the running case project (on a simulated firm entitled Dirt Bikes). The book also contains an impressive list of references organized by chapters, a glossary, three appendices of "hands-on" type (for analyzing a case study, designing a database, and SQL), an index of terms and an index of organizations. The text about information systems is supplemented by two IT-based learning aids: a) the companion web site (to guide interactive study, to facilitate Internet connections, and to provide additional case studies), and b) the interactive multimedia CD-ROM (to be used either as an interactive study guide or as an alternative to the text).

To conclude this review, I think this book, together with the CD-ROM and companion web site, is an excellent, dynamic and active learning environment. I recommend it to be used as a textbook for undergraduate information systems courses in business administration departments.
It can also be utilized by students in computer science as a complementary text, which can help them build a broader view of information systems.

Florin-Gheorghe Filip
Romanian Academy
125, Calea Victoriei
010071 Bucharest-1, Romania
E-mail: ffilip@acad.ro

Received: December 31, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844
Vol. V (2010), No. 5, pp. 946-952

Contributions to the Study of Semantic Interoperability in Multi-Agent Environments: An Ontology-Based Approach

Iulian-Florin Toma
University of Pitesti
Romania, 110040 Pitesti, 1 Targu din Vale
E-mail: tif@tif.ro

Abstract: This paper details the results of our work in the field of multi-agent ontology-based environment simulation. We analyze the impact of introducing some techniques for the alignment/translation/mapping of agent ontologies, which allow for a collaborative understanding of distributed ontologies. In the end, we analyze the difficulties/gaps that have to be addressed for an actual deployment of the technology/concept in a real-life environment.

Keywords: ontology, semantic interoperability, agents.

1 Introduction

The ability to communicate depends on understanding the syntax and the semantics of a language. We used an ontology model to facilitate semantic interoperability in a simulated multi-agent environment. Current studies reveal the fact that using ontologies associated with agents allows for building semantics-aware distributed multi-agent systems. An ontology is a formal specification for the shared understanding of a domain of interest and offers the possibility of building a formal and machine-manipulable model of a domain of interest. Ontologies describe entities and the relations between them, classes of objects and their attributes, and are complemented by logical rules that constrain the meaning assigned to the terms.
These constraints are represented by inference rules that can be used by agents to perform the reasoning on which the autonomy and proactiveness of the agents are based. Thus, semantic interoperability facilitates an increase in the autonomy of agents. The reasoning process of agents in multi-agent ontology-based environments is mainly focused on the alignment of their own ontologies with the ontologies exposed by other agents. Having considered the multi-agent environment as heterogeneous, we analyse the introduction of guidelines for ontology development and evolution which should facilitate ontology reuse and may underpin a usage model for ontologies.

2 Multi-Agent Environment

The multi-agent environment we propose is part of an e-commerce scenario where a user's agent AgU is activated to find, request and gather offers for a specific product from a series of suppliers' agents AgS1, ..., AgSn. In our implementation, the suppliers' agents (AgS agents) are published in the FIPA-compliant Directory Facilitator (DF) of the JADE platform. FIPA is an IEEE Computer Society standards organization that promotes agent-based technology and the interoperability of its standards with other technologies. JADE is a software framework for developing agent applications in compliance with the FIPA specifications for interoperable intelligent multi-agent systems. We propose the implementation of a mediator agent (AgM) to carry out the interactions with the DF and to handle the negotiation process between agents. The AgS agents (suppliers' agents) use their own ontologies. Suppliers can adhere to centralized/standardized/publicly available ontologies, can extend such ontologies based on their needs, but can also build their own ontologies from scratch.
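The yellow-pages role the Directory Facilitator plays for the AgS agents can be sketched, ignoring JADE specifics, as a simple service registry. The following is only an illustrative Python sketch: the class and method names are invented here and do not correspond to JADE's Java API.

```python
# Minimal sketch of a Directory Facilitator (DF)-style registry.
# All names are illustrative; JADE's real DF is accessed through its Java API.

class DirectoryFacilitator:
    """Yellow-pages service: agents register under a service type."""
    def __init__(self):
        self._registry = {}  # service type -> list of agent names

    def register(self, agent_name, service_type):
        self._registry.setdefault(service_type, []).append(agent_name)

    def search(self, service_type):
        return list(self._registry.get(service_type, []))

# Suppliers' agents AgS1..AgSn publish themselves as book sellers;
# the user's agent AgU (through the mediator AgM) looks them up.
df = DirectoryFacilitator()
for i in range(1, 4):
    df.register(f"AgS{i}", "book-selling")

print(df.search("book-selling"))  # ['AgS1', 'AgS2', 'AgS3']
```

In the paper's scenario the mediator agent AgM would perform the `search` step on behalf of AgU before the negotiation begins.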
Facing this heterogeneity of the ontologies used by the AgS agents, the AgU agent has to achieve semantic interoperability in order to negotiate with each AgS agent. FIPA recommends using an external ontology agent (AgO) for handling the interoperability problems in heterogeneous multi-agent environments [9]. This agent handles the tasks of matching ontologies and translating the values of some attributes (translating notions based on their specified language, converting currencies, converting units of measure). Figure 1 illustrates the architecture of the proposed system.

Figure 1: Ontology-based multi-agent environment

3 Ontologies

The semantic description of an entity offers a better understanding of the entity and puts the entity in a context. The idea is not necessarily to achieve the level of understanding and association to other entities that a human subject would have for that entity, but to improve the understanding a piece of software has of that entity, just enough to satisfy specific goals. The main heterogeneity problem in our simulated environment is that the agents will typically use different ontologies. Ontology-based interoperation provides a solution in environments with heterogeneous semantics. Ontologies can capture both the structure and the semantics of information environments [7]. An ontology-based search agent like AgU can handle both simple keyword-based queries and complex queries on structured data. Figure 2 contains some sample ontologies exposed by two book suppliers' agents [1].

Figure 2: Sample ontologies [1]

The ontologies can be designed in any ontology editor, such as the Protégé software [6] (Figure 3). A large variety of formal languages exists for describing ontologies (e.g., OWL [2] [8]).

Figure 3: Ontology editing in Protégé 2000

4 Implementation

The test simulation can be implemented using the JADE environment.
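The sample ontologies of Figure 2 are not reproducible here, but the kind of mismatch they illustrate can be sketched in a few lines. The attribute names below are invented for illustration; they are not taken from the paper's figure.

```python
# Illustrative stand-in for Figure 2: the same "book" concept described
# by two suppliers with different attribute names and structures.
# Attribute names are hypothetical, not the paper's.

supplier1_book = {
    "title": "Ontology Matching",
    "author": "J. Euzenat; P. Shvaiko",
    "price_eur": 89.95,
}

supplier2_book = {
    "name": "Ontology Matching",
    "writtenBy": ["J. Euzenat", "P. Shvaiko"],
    "cost": {"amount": 99.0, "currency": "USD"},
}

# AgU cannot compare these records directly: "price_eur" vs. nested "cost",
# "author" vs. "writtenBy".  Resolving this mismatch (matching attributes,
# converting currencies) is exactly the job assigned to the AgO agent.
shared_keys = set(supplier1_book) & set(supplier2_book)
print(shared_keys)  # set() -> no attribute names in common
```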
The JADE environment has direct support for ontologies only if they are represented as Java classes. These classes can be written by hand or generated using the BeanGenerator plug-in of the Protégé software. Although the Java classes can be generated automatically, this approach has the disadvantage of being very rigid when conceptual changes are necessary. Under this approach, modifications of the ontology in Protégé have to be followed by the regeneration of the Java classes using the BeanGenerator tool, their re-integration into the main project, and recompilation. Any modification in an ontology represented using Java classes may also require modifications in the Java code of the agent. A solution that simplifies this process is to use the Jena framework for writing ontology-based applications. This framework allows keeping the ontologies in the OWL format generated by Protégé, eliminating any further need to transform the ontology into another format. The parser of this framework has direct access to the OWL files and provides access to the classes of the ontology from the Java language. Using this framework, the agent can access the instances of the classes, retrieve the values of attributes, determine subclasses, etc.

5 Negotiation

The negotiation is implemented using the FIPA Contract Net interaction protocol [10]. The workflow for this protocol is represented in Figure 4.

Figure 4: The workflow of the FIPA Contract Net interaction protocol

The semantic analysis of the ontologies takes place between the call for proposals (represented as "cfp") and the receipt of the refusals/proposals from the suppliers' agents. The semantic analysis is performed by the dedicated agent AgO. The suppliers' agents AgS1...AgSn (denoted AgS) receive the cfp. If the concept/item is understood by AgS, it starts to evaluate the conditions/constraints contained in the cfp.
If the conditions match, AgS returns a proposal to the AgU agent; otherwise it returns a reject-proposal message. The messages may be sent using predicates defined in a separate common/generalized/standard centralized ontology (OntReqStd) specific to the process of requesting offers of products. Agents AgU and AgS could adhere to this ontology, and AgS could then respond with a more descriptive message such as "AvailableOnlyWithPreorder" or "StockEmpty", rather than responding with a simple yes/no message. The types of ontologies proposed for our model are represented in Figure 5.

Figure 5: The role of the AgO (ontology agent)

The ontology matching analysis starts if the AgS does not "understand" the concepts/attributes inside the cfp. In this case, the AgS agents call the AgO agent. The AgO agent requests contact details from AgS in order to contact the AgU agent. Then AgO queries the AgU agent about its exposed ontology. AgO starts a matching process to obtain an alignment between the AgU request (cfp), the AgU ontology (OntU) and the AgS ontology. If the matching succeeds (with a high confidence mark), AgO reformulates the cfp to the AgS, and the AgS may start the simple quantitative evaluation and send an answer to AgU according to the result. AgO may also play the role of converting units of measure and currencies, translating notions between languages, etc., and use the converted values to reformulate the cfp.

6 Ontology Mapping

The same concept/product may be described using different attributes, structures and relations, leading to distinct ontologies. Even so, taking into consideration that the concept is part of a specific domain, there exists a set of common characteristics that should be mandatory when representing a specific product, like "width", "height", "material", etc. Thus, lexical measures can be used to compare attributes and relations between concepts.
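As a concrete illustration of such a lexical measure, the Levenshtein edit distance can score how close two attribute names are. The sketch below is a textbook implementation, not the paper's code; the normalization into a [0, 1] similarity is one common convention.

```python
# Levenshtein edit distance: a lexical measure usable for comparing
# attribute names during ontology matching (illustrative sketch).

def levenshtein(a: str, b: str) -> int:
    """Minimum number of insertions, deletions and substitutions
    needed to turn string a into string b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]

def similarity(a: str, b: str) -> float:
    """Normalize the distance into [0, 1]: 1.0 means identical strings."""
    m = max(len(a), len(b))
    return 1.0 if m == 0 else 1.0 - levenshtein(a, b) / m

# Comparing attribute names from two suppliers' ontologies:
print(levenshtein("colour", "color"))           # 1
print(round(similarity("colour", "color"), 3))  # 0.833
```

A matcher like AgO would typically combine several such scores (n-gram, Jaro-Winkler, token-based measures) before accepting an alignment with a high confidence mark.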
Some of the lexical measures that can be applied to achieve ontology matching are n-gram similarity, Hamming distance, Levenshtein distance, the Jaro measure, the Jaro-Winkler measure, and token-based distances such as cosine similarity and term frequency-inverse document frequency (TFIDF), described in [1]. The AgO agent can also make use of language-based methods by calling external resources like lexicons and thesauri. WordNet [3] is such an electronic lexical database for English, based on the notion of synsets, or sets of synonyms. A synset denotes a concept or a sense of a group of terms. WordNet also provides a hypernym (superconcept/subconcept) structure, as well as other relations such as meronymy (part-of relations). It also provides textual descriptions of the concepts (glosses) containing definitions and examples. There are many techniques available for analyzing similarities using WordNet [4] [5]. In addition to comparing their names or identifiers, the structure of the entities found in ontologies can be compared. This comparison can be subdivided into a comparison of the internal structure of an entity (i.e., besides its name and annotations, its properties or, in the case of OWL ontologies, the properties which take their values in a datatype) and a comparison of the entity with other entities to which it is related. Using these well-known techniques, the AgO agent can run a detailed analysis of the matching of the OntU and OntS ontologies. If an alignment is achieved, the AgO agent can reformulate the cfp (call for proposals) to match the OntS ontologies of the AgS agents.

7 Conclusions and the Future of Semantic Interoperability

In this paper, we proposed a feasible implementation of a multi-agent environment which makes use of ontologies and ontology mapping to achieve semantic interoperability.
the ontology-based multi-agent environments and semantic-enabled communications are, at present (2009-2010), under theoretical study only. there are numerous attempts to bring this technology to public use, e.g. building public libraries of ontologies, but this field still resides only within the boundaries of the scientific community. to bring it to public use, tools should emerge that facilitate the semantic annotation and semantic description of entities, by any internet user, in various environments like web services, web pages, distributed software, reusable components etc. these tools should be easy to use by any internet user. they should incorporate automatic validation based on the recommendations and restrictions of the ontology engineering discipline. only then, when the advantages of these technologies overcome the difficulties of defining and maintaining the ontologies, will semantic-enabled communications be a part of common software and human activities. bibliography [1] j. euzenat, p. shvaiko, ontology matching, springer, 2007. [2] owl 2 web ontology language, w3c recommendation, http://www.w3.org/tr/owl2overview/, october 27, 2009. [3] g.a. miller, wordnet: a lexical database for english, communications of the acm, 38(11):39-41, 1995. [4] t. pedersen, s. patwardhan, j. michelizzi, wordnet::similarity - measuring the relatedness of concepts, proceedings of the nineteenth national conference on artificial intelligence (aaai-04), pp. 1024-1025, july 25-29, 2004, san jose, ca (intelligent systems demonstration). [5] a. budanitsky, g. hirst, semantic distance in wordnet: an experimental, application-oriented evaluation of five measures, proceedings of the workshop on wordnet and other lexical resources, second meeting of the north american chapter of the association for computational linguistics, pittsburgh, usa, 2001. [6] j. gennari, m.a. musen, r.w. fergerson, w.e. grosso, m. crubezy, h. eriksson, n.f. noy, s.w.
tu, the evolution of protege: an environment for knowledge-based systems development, technical report, smi report number smi-2002-0943, 2002. [7] m.p. singh, m.n. huhns, service-oriented computing: semantics, processes, agents, wiley, 2005. [8] a. gómez-pérez, m. fernández-lópez, o. corcho, ontological engineering: with examples from the areas of knowledge management, e-commerce and the semantic web, springer, 2004. [9] ontology service specification, fipa-oss, http://www.fipa.org/specs/fipa00086. [10] fipa contract net interaction protocol specification, fipa tc communication, sc00029h, http://www.fipa.org/specs/fipa00029/sc00029h.html. international journal of computers, communications & control vol. iii (2008), no. 1, pp. 51-59 mobile message passing using a scatternet framework brendan j. donegan, daniel c. doolan, sabin tabirca abstract: the mobile message passing interface is a library which implements mpi functionality on bluetooth enabled mobile phones. it provides many of the functions available in mpi, including point-to-point and global communication. the main restriction of the library is that it was designed to work over bluetooth piconets. piconet based networks provide for a maximum of eight devices connected together simultaneously. this limits the library's usefulness for parallel computing. a solution to this problem is presented that provides the same functionality as the original mobile mpi library, but implemented over a bluetooth scatternet. a scatternet may be defined as a number of piconets interconnected by common node(s). an outline of the scatternet design is explained and its major components discussed. keywords: bluetooth, scatternet, message passing, network formation 1 introduction mobile technology is one of the fastest growing fields of technology, with over one billion mobile phones shipped during 2006 [8].
the power of mobile devices is also growing quickly: in october 2005 arm announced the arm cortex-a8 [1][2], having a clock speed of 1ghz. this increase in both availability and performance makes mobile devices a prime candidate for parallel computing systems. with a multitude of mobile devices available, one can take advantage of the situation by performing complex computational tasks across several devices. the original mmpi library was restricted in terms of its world size by the upper bound of the piconet network standard; this limits the maximum number of nodes in an mmpi world to eight. the evolution of the mmpi library to allow for scatternet formation requires significantly more work behind the scenes, both to set up the network infrastructure and to allow for complete inter-device communication. 1.1 need for mobile parallel computing the mmpi library was created to allow for parallel computing across mobile devices [6]. mobile devices have very limited resources and processing capabilities. the nokia 6630 and 6680 [14] mobile phones have 220mhz processors and just a few megabytes of memory. along with such limited capabilities they also have a finite amount of battery power. running processor intensive applications on such devices drains battery power at a higher rate than standard phone usage. therefore if one device does not have sufficient battery power it may not be capable of solving a complex problem; such a problem would also take too long to process from the user's perspective. splitting the processing among several devices not only speeds up the processing, but also distributes the battery drain across all devices, allowing more computationally intensive tasks to be performed. 1.2 review of scatternet formation algorithms the bluetooth specification [3] describes the concept of a scatternet. a scatternet (figure 1) is defined as two or more piconets joined together through the mechanism of a common node (bridging node).
significant research has been conducted on scatternets, much of it focusing on how they can be optimized. miklos [10] described some of the aspects of scatternet formation, including the performance bottleneck caused by the bridging node switching between the frequency hopping patterns of its masters. copyright © 2006-2008 by ccc publications figure 1: scatternet topology the dynamp project [13] is a mobile parallel computing architecture that uses bluetooth scatternets as the underlying infrastructure. the application area for this body of work is in mobile robotics, specifically that of the cybot toy. the main reason for the parallel computing architecture is real-time image processing of data acquired from the robots' sensor system. to avoid upgrading the processing capabilities of the robots, the path of distributing the processing requirements across a group of robots was investigated. the target architecture is the tini from dallas semiconductor, which uses java as its native environment. the ipaq pda is also used, running a linux based os and the kaffe jvm. unlike native message passing libraries such as the c and fortran based mpi that use a static number of nodes, a far more dynamic approach is required within the mobile environment. the head node is therefore responsible for discovering all the active nodes within the network. this is achieved by broadcasting a process request packet throughout the network. an example of a simple node topology is given in the paper that shows a linear-like architecture, whereby communication between distant nodes would certainly require one or more intermediary nodes to forward on the message. bluetrees [15] generates a tree-like scatternet. the network formation algorithm is initiated by a single node that forms the root of the tree. the root node begins by acquiring slave devices that are within its vicinity. these in turn page their own neighbours.
in order to limit the number of slave devices connected to another node at the level above, some branch reorganization may be required. jayanna & zaruba [9] use an approach whereby all nodes maintain a dynamically generated list of their neighbours based on the number of hops required to reach the node in question. the strategy ensures that if neighbour information is included for one and two-hop piconets, then two adjacent piconets can share only one bridging node. therefore all nodes of one piconet share a single path to all nodes of a neighbouring piconet. 1.3 mobile message passing interface the mobile message passing interface (mmpi) [6] provides many of the functions that can be found in a standard mpi [11][12] implementation. the main difference between mmpi and standard mpi implementations is that it is designed for mobile devices that communicate via bluetooth [4][5]. the library has one drawback that makes it less than an ideal candidate as a platform to support parallel computation on mobile devices: it was designed to operate only on a bluetooth piconet of up to eight devices. in a parallel computing system one can generally achieve faster computation by throwing more processors at the problem. 2 enhanced mmpi library structure the problem of creating scatternets and performing communication within them after they have been created is a complex one. the original mmpi library could not simply be adapted, as the differences between communication in a piconet and communication in a scatternet are too great. a new solution was therefore designed from the ground up, whilst retaining all of the original functionality. the enhanced library comprises a number of distinct components that provide an abstraction from the lower level layers. 2.1 components of the library the enhanced mmpi library is made up of a number of components.
the most important parts of the architecture are shown in figure 2 (figure 2: library components). the commscenter class forms the heart of the library. its role is to receive raw data from the network and translate it into mmpi messages. the mmpinode class provides the interface between the commscenter and the mmpi class and also performs the initial device discovery. finally, the mmpi class is the interface to the library as a whole, and is the only class whose methods are exposed to the developer using mmpi. relevant data is fed up through the hierarchy, starting at the commscenter and continuing up to the mmpi class. this simplifies matters greatly, as information that is relevant at a particular level of the hierarchy is available only at that level. 2.2 messages on receipt of a message the commscenter must inspect the header to identify the type of the message it has received and perform the appropriate action. the message types are categorised as follows:
• bridge: the node should take up the role of bridging node.
• master: the node should take up the role of master.
• slave: the node should be a slave exclusively.
• confirm: the network formation is complete.
• data: the message contains a data payload.
the first four messages are used only during the formation of the scatternet and the data message is used only after the scatternet is formed. messages will only be processed by the node to which they are addressed; otherwise they will be forwarded by the message routing system. 2.3 scatternet formation formation of the scatternet is initiated at a chosen node (the 'root node') by first performing an inquiry for devices that provide the mmpi service. the root node then determines how many piconets are required to support this number of nodes. if, for example, thirteen mmpi capable devices are discovered, then two piconets will be created.
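the paper does not spell out the piconet-count rule, but under the assumption that each piconet holds at most eight devices (one master plus seven slaves) and that adjacent piconets share one bridge node, p piconets can hold at most 8p - (p - 1) = 7p + 1 devices, which reproduces the thirteen-device example:

```python
import math

def piconets_needed(n_devices: int) -> int:
    """assumed model: each piconet holds at most 8 devices; adjacent
    piconets share one bridge node, so p piconets hold at most 7p + 1
    devices, giving p = ceil((n - 1) / 7)."""
    if n_devices <= 1:
        return 1 if n_devices == 1 else 0
    return math.ceil((n_devices - 1) / 7)

print(piconets_needed(8))   # a single piconet suffices
print(piconets_needed(13))  # two piconets, as in the paper's example
```

this is a sketch of one plausible rule, not the library's actual formation logic.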
the root node selects a device to be the bridge node and sends it a list of addresses of the other nodes in the network (algorithm 1).
data: list of all mmpi capable devices
initialize list of devices to send to bridge node;
foreach device in the list do
    if bridge node then
        create slave connection;
        add bridge node to routing table;
    else if prior to bridge node then
        create slave connection;
        add slave node to routing table;
        send slave message;
    else if after bridge node then
        add device to list of devices to send to bridge node;
        add node to routing table;
add number of devices to bridge message;
add list of devices to the bridge message;
send bridge message;
algorithm 1: establishing initial connections at the root node
the bridge node then chooses one of these to be the master of the other piconet and sends it the list minus the device chosen (algorithm 2). the second master makes connections to each device on the list, completing the scatternet formation. it then sends a confirmation message which propagates through the network via the message routing system. 2.4 message routing in order for a message to reach a recipient with which the sender has no direct connection, the message must be routed through the network. the library uses a simple message routing table (table 1) where each node keeps a table of all the nodes except itself. each entry in the routing table contains an index that is used to navigate between nodes.
data: list of devices sent by root node
initialize list of devices to send to additional master;
foreach device in the list do
    if second master then
        create master connection;
        add second master to routing table;
    else
        add node to routing table;
add number of devices to master message;
add list of devices to the master message;
send master message;
algorithm 2: establishing connections at the bridge node
node  node 2  node 0  node 5  node 6
0     0               0       0
1     0       0       0       0
2             1       0       0
3     0       2       0       0
4     0       3       0       0
5     0       4               0
6     0       4       1
7     0       4       1       1
8     0       4       1       2
9     0       4       1       3
table 1: routing tables for nodes 2, 0, 5 & 6 of a ten node scatternet
in order to send a message from node 2 to node 8 (figure 3), the following steps are taken. node 2 looks up node 8 in its routing table (table 1). since node 2 has only one link, the index will be 0, and the message will be sent through that link to node 0. node 0 then looks up its routing table, finding the index 4, and sends the message through that link to node 5. node 5 looks up its routing table again and sends the message through link 1 to node 6. node 6 then looks up node 8, finding the index 2, and sends the message to it through that link. in summary, for a slave node of piconet one (on the left) (figure 3) to communicate with a slave node of piconet two (on the right), all communications traffic is first routed through the master node of piconet one and forwarded on to the bridge between the two networks. this is then picked up by the master node of piconet two and again forwarded on to the correct end point. for a node such as the master node of piconet one to communicate with the master node of piconet two, data need only be forwarded directly through the bridging node. 3 using the library using the library to write message passing programs is nearly identical to the original library. the operations that were supported in the original version of the library are still supported.
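the hop-by-hop lookup described above can be simulated with a small sketch (a toy model, not the library's code; the routing tables are taken from table 1, while the per-node link orderings are assumptions consistent with the node 2 to node 8 example):

```python
# per-node routing tables: destination -> outgoing link index (from table 1)
routes = {
    2: {d: 0 for d in (0, 1, 3, 4, 5, 6, 7, 8, 9)},   # everything via link 0
    0: {1: 0, 2: 1, 3: 2, 4: 3, 5: 4, 6: 4, 7: 4, 8: 4, 9: 4},
    5: {0: 0, 1: 0, 2: 0, 3: 0, 4: 0, 6: 1, 7: 1, 8: 1, 9: 1},
    6: {0: 0, 1: 0, 2: 0, 3: 0, 4: 0, 5: 0, 7: 1, 8: 2, 9: 3},
}
# per-node link lists: link index -> neighbour node id (assumed ordering)
links = {
    2: [0],             # node 2 has a single link, to the piconet-one master
    0: [1, 2, 3, 4, 5], # piconet-one master; link 4 leads to the bridge (node 5)
    5: [0, 6],          # bridge node between the two piconets
    6: [5, 7, 8, 9],    # piconet-two master
}

def route(src: int, dst: int) -> list[int]:
    """follow routing-table lookups hop by hop until the destination."""
    path = [src]
    node = src
    while node != dst:
        link = routes[node][dst]   # index lookup, as in table 1
        node = links[node][link]   # traverse that link
        path.append(node)
    return path

print(route(2, 8))  # [2, 0, 5, 6, 8]
```

the printed path reproduces the four forwarding steps walked through in the text.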
3.1 creating an mmpi node to create an mmpi node, the developer first calls the initialize(...) method of the mmpi class. a handle to the midlet which is using the mmpi library needs to be passed, as does a boolean value indicating whether the node is the root node (figure 3: structure of a ten node scatternet topology). the developer must determine how this value will be set, and it should be ensured that all non-root nodes are set up before the root node. this is because calling initialize(...) activates the bluetooth device and registers the mmpi service, allowing the node to be discovered. the root node's reaction to the initialize(...) call will be to create the network. the non-root nodes' reaction will be to wait until this event has occurred and completed. when network formation is complete, the program can continue execution. 3.2 message passing operations after calling initialize, the communications world has been created, giving the developer the ability to work with both point-to-point and global communication methods (listing 1). unlike many methods of the original mmpi library (and the mpi libraries), several of the new communications methods return the actual data received as an object. this is quite different from the other libraries, which would pass a reference into the method for population with the received data.
int broadcast(object buffer, int count, int datatype, int root, int tag)
int scatter(object sendbuf, int sendcnt, int sendtype, int rcvcnt, int rcvtype, int root)
int send(object buffer, int count, int datatype, int dest, int tag)
object receive(int count, int datatype, int source, int tag)
object gather(object sendbuf, int sendcnt, int sendtype, int recvcnt, int recvtype, int root)
object reduce(object sendbuf, int count, int datatype, int op, int root)
listing 1: mmpi communication method prototypes
4 evaluation to fully test the library one needs a significant number of mobile devices (far in excess of eight). such resources were unavailable at the time of implementation, but the system was developed and tested using the j2me emulators of the sun wireless toolkit. the system performed very well on the emulator, and several applications were developed for test purposes, including such classical applications as the mandelbrot set. using the mandelbrot set as an example, it takes on average 139,360ms to generate an image of 200² pixels at 500 iterations on a single device using the wtk emulator. this is significantly slower than a real world phone such as the nokia 6630, capable of generating the same image in 52,344ms. when multiple instances of the emulator are running and carrying out a complex processing task they do not appear to achieve the total parallelism one would expect. many of the emulated devices however do give processing times as expected, on the order of 13 to 14 seconds, when the application is executed with a set of ten phones. 4.1 other applications although we have developed the framework mainly to alleviate the node restriction on the existing version of mmpi, it is not limited to this use only. mmpi itself has been used as a communications library for bluetooth gaming [7] and this framework could be used to increase the number of players that may participate in the game.
5 conclusion this paper has outlined a library for creating scatternet based applications, which is capable of parallel computing on ad hoc bluetooth networks of more than eight devices, using the scatternet framework. the structure and operation of the library has been outlined. it was found that there is a performance overhead associated with message routing and parsing. the framework can be used for a myriad of applications, such as multiplayer gaming or chat applications, quite easily. the most important aspect of the framework is that it can be deployed to any mobile device with midp 2.0 and bluetooth functionality; therefore it is capable of running on a significant number of today's mobile devices. 6 acknowledgment development of the mmpi library was funded under the "irish research council for science, engineering and technology" funded by the "national development plan". bibliography [1] arm. arm cortex-a8, 2005. http://www.arm.com/products/cpus/arm_cortex-a8.html. [2] arm. arm introduces industry's fastest processor for low-power mobile and consumer applications, oct 2005. http://www.arm.com/news/10548.html. [3] bluetooth-sig. bluetooth specification version 1.1. [4] bluetooth.com. the official bluetooth website. http://www.bluetooth.com/. [5] bluetooth.org. the official bluetooth membership site. http://www.bluetooth.org/. [6] d. c. doolan, s. tabirca, and l. t. yang, mobile parallel computing, in 5th international symposium on parallel and distributed computing (ispdc06), pp. 161-167, timisoara, romania, july 2006. [7] k. duggan, d. c. doolan, s. tabirca, and l. t. yang, single to multiplayer bluetooth gaming framework, in 6th international symposium on parallel and distributed computing (ispdc07), hagenberg, austria, july 2007. [8] itfacts. 1.019 billion mobile phones shipped in 2006. http://www.itfacts.biz/index.php?id=p8049. [9] d. jayanna, g.
zaruba, a dynamic and distributed scatternet formation protocol for real-life bluetooth scatternets, in proceedings of the 38th annual hawaii international conference on system sciences (hicss05), 2005. [10] g. miklos, a. racz, z. turanyi, a. valko, and p. johansson, performance aspects of bluetooth scatternet formation, in first annual workshop on mobile and ad hoc networking and computing (mobihoc 2000), pages 147-148, 11 aug. 2000. [11] mpi. the message passing interface (mpi) standard. http://www-unix.mcs.anl.gov/mpi/. [12] mpich. mpich free implementation of mpi. http://www-unix.mcs.anl.gov/mpi/mpich/. [13] r. shepherd, j. story, s. mansoor, parallel computation in mobile systems using bluetooth scatternets and java, in proceedings of the international conference on parallel and distributed computing and networks, 2004. [14] symbian freak. nokia 6680 is loosing the battle to 6630. http://www.symbian-freak.com/news/0305/6680.htm. [15] g. zaruba, s. basagni, i. chlamtac, bluetrees - scatternet formation to enable bluetooth based ad hoc networks, in ieee international conference on communications (icc2001), pp. 273-277, 2001. brendan j. donegan, daniel c. doolan, sabin tabirca university college cork department of computer science college road, cork, ireland e-mail: d.doolan@cs.ucc.ie, tabirca@cs.ucc.ie received: october 18, 2007 brendan j. donegan was a postgraduate student in the computer science department of university college cork between 2005-2006, studying mobile networking and computing. he is now working as a graduate software engineer for symbian software limited in london. this work is the result of the msc project titled "mobile parallel computing" he undertook in the mobile multimedia group. daniel c. doolan is currently in the final stages of completing a phd in the area of mobile computing.
he holds a bsc in computer applications, an msc in multimedia technology, and over half a dozen other degrees at certificate and diploma level in the areas of business and computing. he has authored approximately 40 publications, including 6 book chapters, covering topics such as mobile computing, computer graphics and parallel processing. sabin tabirca is a lecturer in the department of computer science of national university of ireland, cork. his main research interest is in mobile and parallel computing for scientific problems. he has published more than 100 articles in the areas of mobile multimedia, parallel computation, number theory and combinatorial optimization. international journal of computers, communications & control vol. i (2006), no. 3, pp. 25-32 pathogen variability. a genomic signal approach paul dan cristea abstract: the conversion of genomic symbolic sequences into digital signals has been applied to the analysis of pathogen variability. results are given on the variability of the human immunodeficiency virus, type 1, subtype f, isolated in romania, and of the type a avian influenza virus h5n1, for which sequences have been downloaded from genbank [1]. nucleotide sequence analysis is corroborated with techniques based on the genomic signal approach to detect pathogen resistance to antiretroviral treatment. in the case of protease (pr) inhibitors, it is found that the treatment induces single nucleotide polymorphisms (snps) in specific sites. for moderate resistance, the changes affect the pr enzyme only at the level of the protein, whereas for multiple drug resistance, the rna gene secondary structure also changes. keywords: genomic signals, pathogen variability, hiv, influenza, orthomyxoviridae, drug resistance 1 introduction as shown in a series of previous papers [2-4], the conversion of nucleotide and amino acid sequences into digital signals offers the possibility to apply signal processing methods for the analysis of genomic data.
the genomic signal conversion used in our work is a one-to-one mapping of symbolic genomic sequences into complex signals, as described in [2]. the idea is to conserve all the information in the initial symbolic sequence, while bringing to the foreground features significant for the subsequent processing and analysis. this direct method has proven its potential in revealing large scale features of dna sequences, maintained at the scale of whole genomes or chromosomes, including both coding and non-coding regions. one of the most conspicuous results is that the unwrapped phase of dna complex genomic signals varies almost linearly along all investigated chromosomes, for both prokaryotes and eukaryotes. the slope is specific for various taxa and chromosomes. this regularity of the genomic signals reveals a corresponding large scale regularity in the distribution of pairs of successive nucleotides, which is similar to chargaff's first order rules for the frequencies of occurrence of the nucleotides [5]. we applied the same genomic signal approach for studying the variability of several pathogens, including the human immunodeficiency virus, type 1 (hiv-1), subtype f, isolated from romanian patients at the national institute of infectious diseases "prof. dr. matei bals", bucharest [3], and the avian influenza virus type a, based on genomic sequences downloaded from genbank [1]. we have used mainly the phase analysis of the complex genomic signals attached to the nucleotide sequences describing viral genes, as well as the analysis of the corresponding secondary rna structure and of the phylogenetic neighbor-joining trees for some of these genes. the focus of the study is primarily on the enzyme changes involved in generating pathogen resistance to multiple drug treatment. a novel methodology for describing sets of related genomic signals, based on a common reference and on individual differences, has been developed.
variability signals with respect to average, median and maximum flat references, and digital derivatives of genomic signals, are applied to this purpose. applying this method, it has been found that the mutations in the genes of the analyzed viruses occur only in some specific, well defined locations, while the largest part of their genome remains unchanged. the mutations conferring drug resistance are a subset of all mutations occurring in the studied viruses. on the other hand, for the case of hiv protease, it has been shown that the changes in response to the antiretroviral drug treatment occur not only at the level of the final enzyme product, preventing the action of the drug on the active protease catalytic site, but also at the level of the protease gene rna secondary structure. these types of changes have been found only for multiple drug resistant viruses. (copyright © 2006 by ccc publications; selected from icccc 2006, invited paper) 2 symbolic sequence conversion for convenience we repeat here the mapping used in our work for the representation of the nucleotides [2]: a = 1 + j, c = -1 - j, g = -1 + j, t = 1 - j (1) apart from the mapping of the four nucleotides (a, c, g, t), the complete genomic signal representation of nucleotide sequences also comprises the mapping of all the other iupac symbols for nucleotide classes: s = {c, g} strongly bonded, w = {a, t} weakly bonded, r = {a, g} purines, y = {c, t} pyrimidines, m = {a, c} amine, k = {g, t} ketone, b = {c, g, t} = ¬a, d = {a, g, t} = ¬c, h = {a, c, t} = ¬g, v = {a, c, g} = ¬t, and n = {a, c, g, t} [2]. these symbols occur in the nucleotide sequences generated by genotyping because of the multiplicities determined either by the variability within the virus population or by noise. but this is not the case for the consensus sequences downloaded from genbank [1], which are curated to contain only the (a, c, g, t) nucleotide symbols.
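a minimal sketch of equation (1) and of the two phase signals used later in the paper (cumulated phase as the running sum of the sample phases, unwrapped phase as numpy-style phase unwrapping); this illustrates the mapping, it is not the authors' code:

```python
import numpy as np

# equation (1): complex mapping of the four nucleotides
MAP = {'a': 1 + 1j, 'c': -1 - 1j, 'g': -1 + 1j, 't': 1 - 1j}

def genomic_signal(seq: str) -> np.ndarray:
    """map a nucleotide string to its complex genomic signal."""
    return np.array([MAP[ch] for ch in seq.lower()])

def cumulated_phase(seq: str) -> np.ndarray:
    """running sum of the sample phases; its endpoint equals
    (pi/4) * [3(ng - nc) + (na - nt)] (first order statistics)."""
    return np.cumsum(np.angle(genomic_signal(seq)))

def unwrapped_phase(seq: str) -> np.ndarray:
    """unwrapped argument of the signal (second order statistics)."""
    return np.unwrap(np.angle(genomic_signal(seq)))

# a balanced sequence has zero final cumulated phase
print(cumulated_phase("acgt")[-1])
```

the proportionality follows directly from the mapped phases: a contributes +pi/4, t contributes -pi/4, g contributes +3pi/4 and c contributes -3pi/4.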
the mapping in equation (1) has the advantage of conserving all the information in the initial symbolic sequence, as it uses a bijective mapping, while being as little biased as possible. 3 representation by reference and variation to study the variability of the genomic signals in a given set, for example the signals for multiple resistant viruses, it is convenient to use a description comprising two types of components: (1) the reference: a signal considered to best describe the common variation of all components in the considered cluster; (2) the difference of each signal in the cluster with respect to the common reference. in such an approach, it is important to introduce into the common reference as much as possible of the variation shared by all the signals, and to keep in the individual differences only the variations actually belonging to each signal, without external variation. the reference can be chosen as one of the following possibilities:
• average (mean) of the signals, or another linear combination of the signals;
• median: the signal in the central position, or the average of the pair of signals placed centrally;
• maximum flat signal: a modified median that better keeps local variations on the signals where they occur, avoiding spurious transfers onto other signals.
when the reference equals the average, the dispersion of the cluster of signals is minimum, i.e., the sum of the squares of the individual differences between each signal and the reference is minimized. but the average, as any other linear combination, has the important disadvantage that a localized variation of only one of the signals is transmitted to the reference, so that all the other signals will show an apparent variation of opposite sign at that point. the median reference performs better, being a nonlinear function of the signals in the cluster, so that it decouples the common reference from the local variations of each of the individual signals.
the median reference minimizes the sum of the absolute values of the differences between each signal and the reference. a variation localized on only one of the signals is no longer transmitted to the reference, so that it does not affect the variation with respect to the reference of the other signals. the exception occurs when the signal on which the localized variation occurs is just the median. the maximum flat (maxflat) reference is equal to the median wherever the median has no variations which are not shared by other signals. elsewhere, the maxflat reference assumes the minimal variation that corresponds to its trend, if possible remaining constant. consequently, the variation signals show better the changes that occur in each individual signal, with less "crosstalk". the digital derivatives of the variation signals show only the actual changes, caused by the variability in each of the signals, and, for genomic signals, correspond directly to the snps. 4 hiv-1 subtype f variability a phase analysis has been performed on a segment of about 1302 base pairs, approximately aligning with the standard sequence of hiv-1 (nc001802) in genbank [1] over the interval 1799..3100 bp. this segment, which is currently used for the standard identification and assessment of hiv-1 strains, comprises the protease (pr) gene and almost two thirds of the reverse transcriptase (rt) gene. the pr and rt segments are contiguous and have been analyzed both together, as one entity, and independently, as two distinct encoding regions. the pr gene has the length 297 bp and is located in the first interval (1..297 bp) of the sequenced dna segment, respectively along the 1799..2095 bp region of the nc001802 sequence. the rt encoding segment that has been analyzed has a length of 1005 bp and is located in the second interval (298..1302 bp) of the analyzed dna segment, respectively along the 2096..3100 bp region of the nc001802 sequence.
the entire rt gene has 1680 bp, located in the interval 2096..3775 of the sequence. figures 1 and 2 show the cumulated and unwrapped phase of genomic signals for the protease (pr) genes from nine instances of hiv type 1, f clade [1, 6]. three cases come from treatment-naïve patients (s, sensitive), three from patients that developed resistance to one of the drugs (r), and three with multiple resistance to the antiretroviral treatment (m). the cumulated phase is proportional to the imbalance in the number of nucleotides (statistics of first order) along the nucleic acid strand, given by 3(ng − nc) + (na − nt), up to a π/4 factor, whereas the unwrapped phase is proportional to the difference between the number of direct and inverse nucleotide transitions (statistics of second order) along the nucleic acid strand (n+ − n−), with a π/2 factor [2]. figures 3 and 4 give the same information for the segment comprising 1005 bp of the reverse transcriptase (rt) genes, out of the total of 1680 bp in this gene, for the same isolates as in figs. 1 and 2. as expected, the cumulated phase varies less than the unwrapped phase for these instances, as all mutations are of the snp type and affect the nucleotide pair distribution more than the nucleotide distribution itself. even for the unwrapped phase, the variation of the signal along the strand is quite similar for most of the sequences, but the local changes cumulate along the strands. because the mutations are local, the general shape of the phase signals is similar. it is also to be noticed that all the genomic material in these sequences is encoding and uses the same reading frame. the vertical strips in these figures mark the positions of the mutations (snps) that induce resistance to protease inhibitors (indinavir, ritonavir, saquinavir, nelfinavir, and amprenavir) [1]. the mutations that lead to multiple drug resistance are concentrated in several sites.
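as a plain illustration of the first-order statistic above (not the actual genomic signal toolchain, and omitting the π/4 scaling factor), the running value of 3(ng − nc) + (na − nt) can be computed directly from a nucleotide string:

```python
def cumulated_phase(sequence):
    """running value of 3*(nG - nC) + (nA - nT) along a dna strand."""
    counts = {"A": 0, "C": 0, "G": 0, "T": 0}
    phase = []
    for base in sequence.upper():
        if base in counts:  # ignore ambiguous symbols such as n
            counts[base] += 1
        phase.append(3 * (counts["G"] - counts["C"]) + (counts["A"] - counts["T"]))
    return phase

print(cumulated_phase("GATC"))  # [3, 4, 3, 0]
```

a balanced sequence (equal g/c and a/t counts) returns to zero, which is why snps, preserving overall composition, barely move this curve.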
in most of the remaining genome, the viruses have the same longitudinal structure. the sequences display mutations in several other locations. the effect of the mutations can be seen more easily on the unwrapped phase, which is more sensitive to snps. the successive mutations of the snp type do not induce the divergence that could be expected, so that the signals do not actually diverge from one another. on the contrary, the signals tend to cluster, as the variations tend to compensate each other, so that the overall span of the signals does not increase directly with the number of mutations and the number of signals. this is another proof of the fact that, from the structural point of view, a genomic sequence satisfies more restrictions than a "plain text", which need only correspond to a certain semantics and to certain grammar rules, and resembles more a "poem", which additionally obeys rules of symmetry, giving its "rhythm" and "rhyme". the recurrence of such patterned structures is reflected in simple mathematical rules satisfied by the corresponding genomic signals. the representation can be improved by using the reference-difference description, choosing the maximum flat (maxflat) reference, as shown in fig. 5 for the unwrapped phase in fig. 4. in this case, the largest possible part of the common behavior of the signals is introduced in the reference signal, whereas each individual variation signal maintains only the changes occurring in that particular signal, or in the class it belongs to. the reference signal is no longer necessarily equal, in each interval, to one of the signals, even when the number of signals is odd. the digital derivatives of the difference signals, shown in fig. 6, show only the actual changes caused by the variability in each of the signals. in the case of hiv, these changes correspond directly to the snps.
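the digital derivative mentioned above is simply the first difference of a variation signal; in sketch form (illustrative only), its nonzero pulses mark the positions where a signal actually changes:

```python
def digital_derivative(signal):
    """first difference: nonzero pulses mark the actual change points (e.g. snps)."""
    return [b - a for a, b in zip(signal, signal[1:])]

# a step-like variation signal produces pulses only where the level changes
print(digital_derivative([0, 0, 4, 4, 0]))  # [0, 4, 0, -4]
```

applied to the difference signals of fig. 5, this is what isolates the individual mutations from the slowly varying common trend.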
for multiple resistant strains, the pulses correspond to the sites known from the literature to confer resistance to various drugs.

figure 1: cumulated phase expressed by 3(ng − nc) + (na − nt) [2] for the protease (pr) gene of nine isolates of hiv-1, subtype f, showing sensitivity (s), resistance (r) and multiple resistance (m) to drugs.

figure 2: unwrapped phase expressed by n+ − n− [2] for the protease gene of the isolates of hiv-1 in fig. 1.

hiv-1 makes many of its proteins in one long chain, and protease (pr) has the essential role of cutting this 'polyprotein' into the proper pieces, with the proper timing. consequently, pr has been chosen as an important target for current anti-hiv drug therapy. pr is a small enzyme, comprising two identical peptide chains, each 99 amino acids long, which are encoded by the same gene of 297 nucleotides. the two chains form a tunnel that holds the polyprotein, which is cut at an active site located in the center of the tunnel. drugs bind to pr, blocking its action.

figure 3: cumulated phase of rt genomic signals for the isolates shown in figs. 1 and 2.

figure 4: unwrapped phase for the rt gene in the isolates shown in figs. 1 and 2.

studying the estimated secondary structure of the pr rna for the nine virions previously analyzed, it can be shown [3] that the structures are quite similar for drug sensitive and single drug resistant viruses. this result is consistent with the generally accepted model stating that the genomic changes of hiv, which induce resistance to drugs, operate at the level of the protein (the final protease enzyme), preventing the blocking of its catalytic site. on the other hand, the remarkable fact is found that, for multiple drug resistant strains, there is a significant change in the rna secondary structure. large loops and bulges are replaced with similar, but smaller, less vulnerable, closed-loop structures.
these results indicate that there is a certain action of the drug at the level of the protease rna, an effect that becomes evident when mutations conferring multiple drug resistance occur.

figure 5: the unwrapped phase in fig. 4 shown with respect to the maxflat reference.

figure 6: digital derivatives of the variation signals in fig. 5.

5 variability of the hemagglutinin gene of the influenza h5n1 virus

the influenza virus envelope embeds two specific antigenic glycoproteins that project out of the virion surface, the hemagglutinin (ha) and the neuraminidase (na). many different combinations of ha and na proteins are possible, but only the h1n1 (spanish epidemic), h1n2 (asian epidemic), and h3n2 (hong kong epidemic) subtypes have circulated worldwide among humans. the ha protein selectively binds to the sialic acid of the host cell surface receptors, thus recognizing the cells that the virus can invade [4, 6]. figure 7 gives the cumulated phase of the ha gene for h5n1 viruses isolated from two humans (af046080, af046097) and one chicken (af046088), in hong kong, in 1997 [6, 7]. the genes for viruses isolated close in time are similar, even when crossing the inter-species barrier, whereas a large variation can be seen for genes isolated at larger time intervals. only several snps are found in fig. 8, which gives the difference cumulated phases with respect to the maxflat reference. the same result has been obtained for all the genes in the eight segments of the h5n1 virus [4, 6].

figure 7: cumulated phase of the ha gene, h5n1 virus (accessions af046080, 88, 97 [1, 6]).

figure 8: differences of the ha gene cumulated phases in fig. 7 with respect to the maxflat reference.
6 further work

further work will be focused on:
• the dynamics of influenza type a viruses that have so far crossed the species barrier from birds to humans, and which hold the potential to become highly contagious and highly lethal in humans, including the h5n1 subtype,
• extending the study from the nucleotide to the amino acid level, which could be more significant from the phenotypic point of view,
• using genomic signals to help cluster viruses into classes.

acknowledgments

the sequences of hiv presented in this paper have been genotyped by dr. dan otelea from the national institute of infectious diseases "prof. dr. matei bals", bucharest, romania. results referring to the study of hiv variability have been previously jointly published [3].

references

[1] national center for biotechnology information, national institutes of health, national library of medicine, genbank, http://www.ncbi.nlm.nih.gov/genoms
[2] p. d. cristea, "representation and analysis of dna sequences", in genomic signal processing and statistics, editors e.g. dougherty, i. shmulevici, jie chen, z. j. wang, book series on signal processing and communications, hindawi, 2005, pp. 15-65.
[3] p. d. cristea, d. otelea, rodica tuduce, "study of hiv variability based on genomic signal analysis of protease and reverse transcriptase genes", embc'05, sept. 2005, shanghai, china.
[4] p. d. cristea, "genomic signal analysis of pathogen variability", spie, bo24, paper 5699-52, san jose, jan. 2005, 12 pp.
[5] e. chargaff, "structure and function of nucleic acids as cell constituents", fed. proc., 10, pp. 654-659, 1951.
[6] d.l. suarez et al., "comparisons of highly virulent h5n1 influenza a viruses isolated from humans and chickens from hong kong", j. virology, vol. 72 (8), pp. 6678-6688, 1998. (af046080-99).
[7] e. ghedin, n. sengamalay, m. shumway et
al., "large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution", nature, vol. 437, oct. 2005, pp. 1162-1166.
[8] m.s. hirsch et al., "antiretroviral drug resistance testing in adult hiv-1 infection: recommendations of an international aids society-usa panel", jama, vol. 283, no. 18, may 10, 2000, pp. 2417-2426.

paul dan cristea
university politehnica of bucharest, biomedical engineering center
address: spl. independentei 313, sect. 6, 060042 bucharest, romania
e-mail: pcristea@dsp.pub.ro

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 2, pp. 216-223

synthetic genes for artificial ants. diversity in ant colony optimization algorithms

s.c. negulescu, i. dzitac, a.e. lascu

sorin c. negulescu
"lucian blaga" university of sibiu, "hermann oberth" faculty of engineering
10, victoriei bd, 550024 sibiu, romania
e-mail: sorin.negulescu@ulbsibiu.ro

ioan dzitac
"aurel vlaicu" university of arad, faculty of exact sciences, department of mathematics-informatics, str. elena dragoi, nr. 2, complex universitar m, arad, romania
and r&d agora ltd. oradea [cercetare dezvoltare agora], 8, piata tineretului, 410526 oradea, romania
e-mail: ioan.dzitac@uav.ro

alina e. lascu
"lucian blaga" university of sibiu, "hermann oberth" faculty of engineering
10, victoriei bd, 550024 sibiu, romania
e-mail: alina.lascu@ulbsibiu.ro

abstract: inspired by the fact that real-world ants within a colony are not clones (although they may look alike, they are different from one another), in this paper the authors present an adapted ant colony optimisation (aco) algorithm that incorporates methods and ideas from genetic algorithms (ga). following the first (introductory) section of the paper, the history and the state of the art are presented, beginning with the stigmergy and genetic concepts and ending with the latest aco algorithm variants as multiagent systems (mas).
the rationale and approach sections aim at presenting the problems with current stigmergy-based algorithms and at proposing a (possible, yet to be fully verified) solution to some of the problems ("synthetic genes" for artificial ants). a model used for validating the proposed solution is presented in the next section, together with some preliminary simulation results. some of the conclusions regarding the main subject of the paper (synthetic genes: agents within the mas with different behaviours) that close the paper are: a) the convergence speed of the aco algorithms can be improved using this approach; b) these "synthetic genes" can be easily implemented (as local variables or properties of the agents); c) the mas is self-adapting to the specific problem that needs to be optimized.
keywords: ant colony optimization, genetic algorithms, multiagent systems, stigmergy.

1 introduction

there are about ten thousand species of ants living on earth, grouped in colonies, all of them having one or (at most) two queens and sterile female workers (all the other ants). as the research conducted by biologists showed [5], there is no central control or management whatsoever inside a colony, but the behaviour of the colony changes (as it gets older and larger). it was also determined that, in harvester ant colonies, there are four types of worker ants [5]: foraging ants (out on a foraging trail searching for food or bringing food back), patrolling ants (out early in the morning before the foragers are active, somehow choosing the directions that the forager ants will go, and just by coming back to the nest indicating to the foragers that it is safe to go out), maintenance ants (working inside the nest with moist soil for building the walls), and midden workers (putting some kind of territorial chemical in the garbage they create). another interesting fact is that 50% of the ants are doing nothing, acting as a reserve for exceptional situations.
one class of multi-agent systems (mas) copies some of the characteristics of the foraging ants within the colonies (depending on how much these characteristics are helpful in solving a particular type of problem), such as [10]:
• being a distributed society of autonomous individuals/agents;
• having fully distributed control among the agents;
• having localized communications among the individuals;
• taking stochastic decisions;
• having system-level behaviours that transcend the behavioural repertoire of the single (minimalist) agent;
• following simple interaction rules.
as a result, the overall very important features of these systems are: robustness, adaptability and scalability. the rest of this paper is arranged as follows: the history section presents the stigmergy and genetic concepts and variants of aco algorithms as mas. the rationale and approach sections aim at presenting the problems with current stigmergy-based algorithms and at proposing a (possible, yet to be fully verified) solution to some of the problems ("synthetic genes" for artificial ants). a model used for validating the proposed solution is presented in the next section, together with some preliminary simulation results. some of the conclusions regarding the main subject of the paper, presented in the last section, are: a) the convergence speed of the aco algorithms can be improved using this approach; b) these "synthetic genes" can be easily implemented (as local variables or properties of the agents); c) the mas is self-adapting to the specific problem that needs to be optimized.
2 history and related work

swarm intelligence systems (inspired by biological systems such as ant colonies, bird flocking, animal herding, bacterial growth, and fish schooling) are in fact made of simple interrelated agents, able to communicate with one another and to interact with their environment. they follow simple rules and there is no centralized control [9]. in swarm intelligence systems the focus is not on modelling, inspired by biologic (sub-symbolic) paradigms, but rather on simulating biologic behaviour [8]. the ant colony optimization algorithm is a probabilistic technique for solving computational problems which can be reduced to finding good paths through graphs [17]. this technique was initially proposed by marco dorigo in 1992 in his phd thesis [6] [7] and it aimed at searching for an optimal path in a graph (i.e. the travelling salesperson problem, tsp) based on the behaviour of ants seeking a path between their colony and a source of food [4]. the algorithm was further developed and, as a result, some common extensions appeared, such as the elitist ant system (eas), where the ant that found a better solution can deposit an extra amount of pheromone, and the max-min ant system (mmas) [16], where the pheromone intensity is bounded to certain minimum and maximum values (tmin, tmax), and where only the ant that found a better solution can deposit pheromone. the mmas was the first algorithm that constantly checked its convergence to a solution; all the edges in the graph were initialized to tmax and reinitialized to tmax when the algorithm was near stagnation [17]. another version of the aco is the rank-based ant system (asrank), where all solutions are ranked according to their fitness. the amount of pheromone deposited is then weighted for each solution, such that the solutions with better fitness deposit more pheromone than the solutions with worse fitness [17].
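the mmas update rule described above can be sketched as follows. this is a simplified illustration with assumed parameter names (not stützle and hoos's code): only the best ant deposits pheromone, and every intensity is clamped to [tmin, tmax].

```python
def mmas_update(pheromone, best_path, best_length, rho=0.1, t_min=0.01, t_max=1.0):
    """pheromone: dict edge -> intensity; best_path: edges of the best tour found."""
    for edge in pheromone:
        pheromone[edge] *= (1.0 - rho)           # evaporation on every edge
    for edge in best_path:
        pheromone[edge] += 1.0 / best_length     # deposit by the best ant only
    for edge in pheromone:                       # bound intensities to [t_min, t_max]
        pheromone[edge] = min(t_max, max(t_min, pheromone[edge]))
    return pheromone

trails = {("a", "b"): 1.0, ("b", "c"): 0.005}
print(mmas_update(trails, best_path=[("a", "b")], best_length=2.0))
```

the clamping step is what prevents stagnation: no edge can become so strong (or so weak) that the stochastic edge selection degenerates into a deterministic one.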
besides these common variants of the algorithm [12], the authors proposed some different approaches, such as:
• human-driven stigmergic control [3]: aimed at exploring the relationship between stigmergy and synergy, based on the threshold principle. specifically, focusing on affordability and keeping a definite engineering perspective, the purpose was to save computer resources in applying stigmergic control to industrial problems by exploring the relationship between the number of digital ants and problem complexity. the long-range target was to follow the analogy to superconductivity: moving the threshold in order to improve performance and/or save computing resources. the research proved that the threshold exists and that it depends on problem type and complexity; the same solution quality can be obtained with fewer ants.
• user-driven heuristics [1] [12]: the approach emphasised the significance of the environment in the agent system and the attention that must be paid to the dynamics that emerges from the indirect interactions of the agents. finding out the form and parameters that influence the system behaviour is non-trivial for humans, and the research investigated methods that would enable user-driven solutions in dealing with system tuning. the proposed methods were based on two essential functional requirements: a) at the macro level, the need to monitor and represent in an intuitive way the system behaviour and b) at the micro level, the need to control and track the system state space. the approach proved to be workable on usual configurations and effective in dealing with combinatorial explosion.
• (sub-)symbolic inferences in multi-agent systems [2]: the research aimed at showing that new logics, already useful in modern software engineering, become necessary mainly for mas, despite obvious adversities.
the features asked for by the paradigm of computing as intelligent interaction, based on "nuances of nuanced reasoning", that should be reflected by agent logics were outlined. by injecting symbolic reasoning into systems based on sub-symbolic "emergent synthesis", the authors presented the way in which quantifiable synergy can be reached even in advanced challenging domains, such as stigmergic coordination. the original idea has diversified to solve a wider class of numerical problems and, as a result, several approaches have emerged, drawing on various aspects of the behaviour of ants, such as solving combinatorial optimization, classification and image processing problems [15]. the authors successfully applied the aco algorithms to solve difficult problems such as: vehicle route allocation (with multiple constraints) [13] [14] and optimal capacitor banks placement in power distribution networks [15], where the criterion of the mathematical optimization model was a nonlinear function based on costs and the model imposed equality constraints described by the network operating equations and inequality constraints required to maintain within admissible limits the parameters characterizing the system state.

3 rationale and approach

the motive for proposing synthetic genes for artificial ants and for promoting diversity in ant colony optimization algorithms is that the convergence speed of current variations of aco algorithms is strongly dependent on the problem type and size. the real-world ants within a colony are not clones, as current aco algorithms model them; although they may look alike, they are different from one another.
the vast majority of aco algorithms have general variables that determine the overall system behaviour, such as:
• pheromone evaporation speed when updating the pheromone intensities in the graph (ρ);
• pheromone quantity deposited on the graph (q);
• pheromone influence on edge selection (α);
• distance influence on edge selection (β);
• the multiplication value for the pheromone intensity of the elitist ant (ε), only for eas.
when a certain problem must be solved, the values of the above presented parameters are set and, after a few runs, they are adjusted in order to improve the overall convergence speed or solution quality. when dealing with dynamic optimisation problems the above mentioned approach accentuates once more its weak points, as it is almost impossible to determine good parameter values. a possible manner to combat these limitations is to adopt some ideas from the ga field. although hybrid approaches between mas (such as aco or eas) and ga are not new, the technique proposed herein is innovative because:
• it does not absolutely require implementing all the ga concepts (like selection, crossover and mutation);
• the fitness function may or may not exist (depending on problem type);
• the generations from ga can be associated with the iteration number from mas;
• the global parameters belonging to the algorithm are converted to local parameters belonging to the ants.
as depicted in figure 1b, each virtual ant within the colony has its own governing parameters, in the same manner that real ants exhibit different behaviour. more details about the way in which such a system should work are presented in the next section.

4 the model

since the model proposed herein imposes that each virtual ant within the colony must have its own parameters that determine its behaviour, it is important for the user to have the ability to specify some general limits within which these parameters may vary.
in this regard, some other general parameters must be defined, as follows:
• minimum and maximum values for the exploitation of the good old found paths: αmin, αmax;
• minimum and maximum values for the exploration of the other paths within the graph: βmin, βmax;
• minimum and maximum values for the multiplication value for the pheromone intensity of the (elitist) ants that found a better solution: εmin, εmax.

figure 1: exemplification of a) global parameters versus b) local parameters for the ant system

depending on the problem to be solved, some other global parameters can be defined to enforce the limits for the local parameters (e.g. for the vehicle route allocation problem [13] a local history parameter was used, with hmin, hmax as global parameters). after finding a better solution, different actions can be taken by the algorithm to auto-tune the ant local parameters:
• the parameters will be adjusted for some, all or none of the ants;
• the parameters can be adjusted using different types of equations, e.g. from simple average, weighted average (depending on solution quality), or other linear functions to polynomial or exponential functions.
the initialization of the ant parameters can be easily done in a random manner, as shown in figure 2, but some other initialization methods can be applied as well (e.g. gaussian, centred on the average value: min+(max-min)/2). although the research on the proposed model is in progress, the results so far are promising. the algorithm is self-tuning to the problem type and size.
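the random initialization of per-ant parameters described above can be sketched as follows. the gene names and limit values are hypothetical illustrations, not the paper's code; each ant draws its own α, β and ε uniformly from the user-set limits instead of sharing global values.

```python
import random

def init_colony(n_ants, limits, rng=None):
    """limits: dict gene -> (min, max); returns one independent gene set per ant."""
    rng = rng or random.Random()
    return [{gene: rng.uniform(lo, hi) for gene, (lo, hi) in limits.items()}
            for _ in range(n_ants)]

# hypothetical limits for the three genes discussed above
limits = {"alpha": (0.5, 2.0), "beta": (1.0, 5.0), "epsilon": (1.0, 3.0)}
colony = init_colony(10, limits, rng=random.Random(42))
# every ant now carries its own alpha, beta and epsilon within the user-set limits
```

a gaussian initialization centred on min+(max-min)/2 would only change the draw inside the comprehension (e.g. `rng.gauss(...)` clipped to the limits).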
some other methods must be checked, such as starting with a big number of ants and reducing it to a minimum in time, with a twofold advantage:
• having a large genetic pool at the beginning will assure that the algorithm can adapt to very diverse problems;
• reducing the number of ants will increase the computation speed; this number can be drastically reduced, as presented in [3].

figure 2: the proposed pseudocode for the "synthetic genes" enhanced eas.

5 conclusions and future work

since this paper presents an ongoing research, the conclusions will be presented taking into consideration two time ranges: a) short (current relevant results) and b) medium-long (directions of future work).
a1. the proposed aco algorithms have the ability to self-adapt to different optimisation problem types and sizes.
a2. the convergence speed of the aco algorithms can be improved using this approach.
a3. the "synthetic genes" can be easily implemented (as local variables or properties of the agents); only some other global variables that will impose the limits of variation must be defined.
a4. different types of initialisation of the gene values can be used (e.g. random). ideas for this particular subject can be taken from the ga field.
a5. the gene values can be automatically adjusted for some, all or none of the ants. moreover, this can be achieved using different types of equations, e.g. from simple average, weighted average (depending on solution quality), or other linear functions to polynomial or exponential functions.
b1. some other methods of initialization/algorithm evolution must be tested. as presented in the previous section, the algorithm can start with a big number of ants and reduce it to a minimum in time.
b2.
other concepts from ga can be implemented, taking into account both the advantages (solution quality) and the disadvantages (computing time), like selection, crossover or mutation.
b3. finding a solution to the stagnation problem that is intrinsic to both aco and ga approaches. another possible technique would be to reinitialize the local parameters while keeping the pheromone intensities on the graph intact.

bibliography

[1] bǎrbat b.e., moiceanu a., pleşca s., negulescu s.c., affordability and paradigms in agent-based systems, computer sc. j. of moldova, 15, 2(44):178-201, 2007.
[2] bǎrbat b.e., negulescu s.c., from algorithms to (sub-)symbolic inferences in multi-agent systems, international journal of computers, communications and control, 1(3):5-12, 2006.
[3] bǎrbat b.e., negulescu s.c., zamfirescu c.b., human-driven stigmergic control. moving the threshold, in n. simonov (ed.), proc. of the 17th imacs world congress (scientific computation, applied mathematics and simulation), paris, pp. 86-92, 2005.
[4] bonabeau e., dorigo m., theraulaz g., swarm intelligence: from natural to artificial systems, new york: oxford university press, 1999.
[5] deborah gordon (2008), digs ants, retrieved 11.2009, from ted. ideas worth spreading. web site: http://www.ted.com/talks/lang/eng/deborah_gordon_digs_ants.html
[6] dorigo m., maniezzo v., colorni a., the ant system: optimisation by a colony of cooperating agents, ieee transactions on systems, man, and cybernetics, part b, 26(1):29-42, 1996.
[7] dorigo m., optimization, learning and natural algorithms, phd thesis, italy: politecnico di milano, 1992.
[8] dzitac i., bǎrbat b.e., artificial intelligence + distributed systems = agents, international journal of computers, communications and control, 4(1):17-26, 2009.
[9] dzitac i., moisil i., advanced ai techniques for web mining, proceedings of the 10th wseas international conference on mathematical methods, computational techniques and intelligent systems, corfu, pp. 343-346, 2008.
[10] gambardella l.m., di caro g. (2005), the ant colony optimization (aco) metaheuristic: a swarm intelligence framework for complex optimization tasks, retrieved 2008, from university of bologna: first summer school on aspects of complexity. web site: http://www.cs.unibo.it/fioretti/ac/ac2005/docs/slides_dicaro.pdf
[11] negulescu s.c., bǎrbat b.e., enhancing the effectiveness of simple multi-agent systems through stigmergic coordination, in icsc-naiso (ed.), fourth international icsc symposium on engineering of intelligent systems (eis 2004), canada: icsc-naiso academic press, pp. 149-156, 2004.
[12] negulescu s.c., zamfirescu c.b., bǎrbat b.e., user-driven heuristics for nondeterministic problems, studies in informatics and control (special issue dedicated to the 2nd romanian-hungarian joint symp. on applied computational intelligence), 15(3):289-296, 2006.
[13] negulescu s.c., kifor c.v., oprean c., ant colony solving multiple constraints problem: vehicle route allocation, international journal of computers, communications and control (ijccc), 3(4):366-373, 2008.
[14] negulescu s.c., oprean c., kifor c.v., carabulea i., elitist ant system for route allocation problem, in mastorakis n.e. et al. (eds.), proceedings of the 8th conference on applied informatics and communications, greece: world scientific and engineering academy and society (wseas), pp. 62-67, 2008.
[15] secui d.c., dzitac s., bendea g.v., dzitac i., an aco algorithm for optimal capacitor banks placement in power distribution networks, studies in informatics and control, 18(4):305-314, 2009.
[16] stützle t., hoos h.h., max-min ant system, future generation computer systems, 16(8):889-914, 2000.
[17] wikipedia, the free encyclopedia (2009), ant colony optimization, retrieved 11.2009, from wikipedia. web site: http://en.wikipedia.org/wiki/ant_colony_optimization

sorin c. negulescu has a m.sc. degree in quality management (2005), a m.sc. degree in computer science (2003), "lucian blaga" university, sibiu, and a ph.d. in industrial engineering. his research interests include multi-agent systems with applications in manufacturing systems using stigmergic coordination, factory automation and the holonic paradigm.

ioan dzitac (b. february 14, 1953) received his m.sc. in mathematics (1977) and phd in information sciences (2002) from "babes-bolyai" university of cluj-napoca. now he is full professor of informatics at the mathematics-informatics department, faculty of exact sciences, "aurel vlaicu" university of arad, romania, and director of research & development agora ltd. of oradea, romania. his current research interests include different aspects of artificial intelligence and distributed systems. he has (co-)authored 18 books and more than 70 papers, has more than 50 conference participations, and has been a member of the international program committees of more than 40 conferences and workshops.

alina e. lascu has a m.sc. degree in international relations and european studies (2008), is currently in the last year of her master studies in the domain of computer science and in the first year of her doctoral studies in the domain of computers and ict. as her educational background shows, her research interests are very much trans-disciplinary: language technology, intelligent agents, agent orientation, computer-aided semiosis, cultural and interdisciplinary studies.

international journal of computers, communications & control vol. i (2006), no. 1, pp. 9-15

visualising infinity on a mobile device

daniel c.
doolan, sabin tabirca

abstract: this paper discusses how a java 2 micro edition (j2me) application may be developed to visualise a wide variety of differing fractal types on a mobile phone. a limited number of j2me applications are available that are capable of generating the mandelbrot set. at present there are no j2me applications capable of generating a multitude of fractal image types.
keywords: j2me, mobile phone, fractals

1 introduction

it has been shown that mobile devices are capable of generating high quality images of infinite detail [2]. the generated images have generally been limited to the mandelbrot set. since the late 19th century fractals have been a favourite topic for mathematicians. since the dawn of modern day microprocessor based computing the study of fractals has taken a radical leap, as it is within the computing domain that all the nuances of fractal type images can be visualised. benoit mandelbrot made a huge contribution in the late 1970s with the discovery of the mandelbrot set (an index for all the possible julia sets). the dawn of the 21st century has seen a radical change in what we consider a computer to be. it has seen the widespread uptake of mobile phones throughout the world. devices that started life in the latter years of the 20th century as a mobile communications medium have evolved and mutated into mobile computing devices of considerable processing power. no longer are mobile phones used for just telecommunications, but for just about any type of application that a standard desktop machine is capable of. the 2d and 3d visualisation of fractal images is one interesting topic that is coming into the realm of reality within the mobile computing domain. current high end phones have processing speeds in the region of 100 to 200 mhz [7] [1], typically running arm9 type processors. the next evolution in processing power will see such devices fitted with arm11 processor cores with speeds as high as 500 mhz.
mobile devices clearly have a huge processing potential, especially if the combined processing power of the many millions of phones around the world were put to task on a single problem. examples of such computation are already in existence: for example, "distributed fractal generation across a piconet" [3] demonstrates that the combined processing power of several mobile devices may be used to distribute the processing load between several devices that are connected together over a bluetooth network. 1.1 mobile phone market penetration the uptake of mobile devices around the world is staggering. in september 2004 the market penetration stood at 89% in ireland [10]; by march 2005 it stood at 94% [11]. this is a huge increase when in 2001 penetration stood at only 67% [8]. ireland achieved 100% penetration in september 2005, showing an increase of over 11% in a twelve month period. this allows ireland to join spain, finland, the netherlands and austria in having a 100% penetration rate. luxembourg is currently on top with a rate of 156% [12]. it is expected that western europe will exceed 100% usage by 2007 [15]. the year 2015 should see half the world's population (four billion people) using mobile phones as a communications medium [9]. 1.2 primary aims the primary purpose of this paper is the development of an application capable of running on a single mobile device that has the ability to generate a variety of two dimensional fractal images. one of the chief aims is that the application should be easy to use, which would allow it to be used as a teaching tool. to achieve this, each section of the application has an easy-to-use graphical user interface (gui) to allow the user to specify the parameters for the image generation process. once the user is happy with the image parameters they can press a button to begin the image generation process. the resultant output will be a fractal image based on the input parameters.
copyright © 2006 by ccc publications 2 mandelbrot & julia set generation the discovery of the julia set in 1918 by gaston julia, described in his paper "mémoire sur l'itération des fonctions rationnelles", proved to be a most important work at the time. it was almost forgotten until benoit mandelbrot brought it back to the forefront with his discovery of the mandelbrot set. this opened a new field of research that became known as fractal geometry. both the julia and mandelbrot set images can be generated by the repeated iteration of a simple function (see figure 2). the mandelbrot set is an index into the julia sets; in other words, every possible julia set can be represented within the mandelbrot set (see figure 1). figure 1: index of the julia sets (the mandelbrot set).

$$J_c = \left\{ z_0 \in \mathbb{C} \,\middle|\, \lim_{n\to\infty} z_n \neq \infty \right\} \quad \text{where } z_0 = c,\; z_{n+1} = f(z_n),\; n \geq 0$$

$$M = \left\{ c \in \mathbb{C} \,\middle|\, \lim_{n\to\infty} z_n \neq \infty \right\} \quad \text{where } z_0 = 0,\; z_{n+1} = f(z_n),\; n \geq 0$$

figure 2: julia & mandelbrot set definitions. fractal images are usually obtained when the generating function f(z) is non-linear. the mandelbrot set is obtained by iterating the function f(z) = z^2 + c. when the generating function has the form f(z) = z^u + c^v many other mandel-like sets may be produced. algorithms 1 and 2 show how both the julia and mandelbrot sets can be generated.

algorithm 1: the julia set algorithm
for each (x, y) in [xmin, xmax] × [ymin, ymax]
    construct z0 = x + i·y;
    find the orbit of z0 [first niter elements]
    if (all the orbit points are under the threshold) draw (x, y)

algorithm 2: the mandelbrot set algorithm
for each (x, y) in [xmin, xmax] × [ymin, ymax]
    c = x + i·y;
    find the orbit of z0 = 0 while under the threshold r
    if (all the orbit points are under the threshold) draw (x, y)

2.1 implementation a simple-to-use graphical user interface (gui) is provided within the application to allow the user to enter various parameters detailing the type of image to be generated (see figure 3).
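the escape-time iteration used by algorithms 1 and 2 can be sketched in a few lines of python (an illustrative sketch, not the paper's j2me code; the generating function f(z) = z^2 + c and an escape radius of 2 are assumed):

```python
def in_mandelbrot(cr, ci, niter=50, r=2.0):
    """algorithm 2: does the orbit of z0 = 0 under f(z) = z^2 + c stay under the threshold?"""
    zr = zi = 0.0
    for _ in range(niter):
        zr, zi = zr * zr - zi * zi + cr, 2 * zr * zi + ci
        if zr * zr + zi * zi > r * r:
            return False          # orbit escaped: (x, y) is not drawn
    return True

def in_julia(zr, zi, cr, ci, niter=50, r=2.0):
    """algorithm 1: does the orbit of z0 = x + iy under f(z) = z^2 + c stay under the threshold?"""
    for _ in range(niter):
        zr, zi = zr * zr - zi * zi + cr, 2 * zr * zi + ci
        if zr * zr + zi * zi > r * r:
            return False
    return True
```

scanning (x, y) over [xmin, xmax] × [ymin, ymax] and drawing the points for which the test returns true reproduces the two algorithms.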
the parameters dealing with the fractal image itself include: the image size, number of iterations, radius, cpower, zpower, formula type and image inversion. the other options are for the rate of fractal zoom, and the accuracy of the crosshair (seen in the image output screen (canvas)). figure 3: mandelbrot settings gui and output image. figure 4: julia set image & results output screen. a thread is used for the generation of the fractal image to allow for user interaction with the image as it is being generated. the output image is redrawn at regular intervals so the user can see the progress of the image generation process. the fractal image itself is generated as an array of integers (listing 1). this array is then passed to the createRGBImage(...) method of the image class to generate the image object that is ultimately displayed within the onscreen canvas. the canvas has a crosshair present to allow the user to navigate around the image and select an area to zoom in on. the crosshair is controlled by the directional keys of the mobile device. the user may also view the corresponding julia set (figure 4) for any point that the crosshair is currently indicating, by selecting the "view julia set" option from the mandelbrot set canvas menu.

for (int j = 0; j < sizey; j++) {
    for (int i = 0; i < sizex; i++) {
        double cr = xmin + i * (xmax - xmin) / sizex;
        double ci = ymin + j * (ymax - ymin) / sizey;
        double zr = 0, zi = 0;
        for (int k = 0; k < niter; k++) {
            double t = zr * zr - zi * zi + cr;
            zi = 2 * zr * zi + ci;
            zr = t;
            if (zr * zr + zi * zi > r) {        // orbit escaped the radius
                red = c[k % l][0];
                g = c[k % l][1];
                b = c[k % l][2];
                color = b + (g << 8) + (red << 16) + alpha;
                pixels[(j * sizex) + i] = color;
                break;
            }
        }
    }
}

listing 1: code listing of the mandelbrot set function. the application allows a variety of mandel-like images to be generated, of the form f(z) = z^u + c^v, f(z) = z^u − c^v and f(z) = z^u + c^v + z. the cpower and zpower parameters of the gui dictate the values of u and v. the inverted representation of each form may also be generated by selecting the appropriate option from the gui, producing images of the form f(z) = z^u + inv(c^v), f(z) = z^u − inv(c^v) and f(z) = z^u + inv(c^v) + z.
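the packing of the colour components into a single integer, as performed in listing 1 before the array is handed to createRGBImage(...), can be checked in isolation (a small python sketch; the opaque alpha value 0xFF000000 is an assumption):

```python
ALPHA = 0xFF000000            # assumed fully opaque alpha channel

def pack_rgb(r, g, b):
    """pack 8-bit colour components into a single 0xAARRGGBB integer, as in listing 1."""
    return b + (g << 8) + (r << 16) + ALPHA

def unpack_red(color):
    """recover the red component from a packed pixel."""
    return (color >> 16) & 0xFF
```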
2.2 processing results the application was tested using a number of image sizes as well as varying the number of iterations (see table 1). the 6680 was unable to generate an image of 600 pixels square. a nokia 3220 was unable to generate an image of 200² pixels. it was, however, capable of generating images of 100² and 150² pixels, but with considerable processing times: 56,503ms and 298,365ms for 100² at 50 and 500 iterations respectively.

device       iter   100×100    200×200    300×300     400×400     500×500     600×600
nokia 6630    50    3,000ms    9,812ms    19,484ms    33,812ms    53,359ms    76,859ms
nokia 6680    50    3,141ms    12,516ms   21,859ms    36,234ms    60,859ms    n/a
nokia 6630   500    13,281ms   52,344ms   111,484ms   196,125ms   301,344ms   429,047ms
nokia 6680   500    12,359ms   48,000ms   103,219ms   187,797ms   290,062ms   n/a

table 1: image generation times for the mandelbrot set (xmin, ymin = −2.0; xmax, ymax = 2.0). 3 prime number fractal generation mathematicians have been studying prime numbers for thousands of years, with their origins going all the way back to ancient greece and the period of euclid. prime numbers have remained a mystery throughout the intervening centuries. there is currently ongoing work to find a prime number with ten million digits or more [4]. the largest prime found to date was discovered on 15th december 2005 by dr. curtis cooper and dr. steven boone: the 43rd mersenne prime, 2^30,402,457 − 1, being 9,152,052 digits in length. the second largest was discovered by dr. martin nowak on 18th february 2005: the 42nd mersenne prime, 2^25,964,951 − 1, containing 7,816,230 digits. the next largest was discovered on 15th may 2004 by josh findley (the 41st mersenne prime, 2^24,036,583 − 1, containing 7,235,733 digits). these discoveries were part of the great internet mersenne prime search (gimps) project. it is likely that within the next year or so a ten million digit prime number will be found.
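the mersenne numbers above are tested inside gimps with the lucas–lehmer test, which is short enough to sketch here (illustrative only, not part of the paper's application):

```python
def lucas_lehmer(p):
    """for an odd prime p, m = 2^p - 1 is prime iff s_(p-2) == 0,
    where s_0 = 4 and s_(k+1) = s_k^2 - 2 (mod m)."""
    m = (1 << p) - 1
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % m
    return s == 0

# 2^7 - 1 = 127 is prime, while 2^11 - 1 = 2047 = 23 * 89 is not
```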
figure 5: pnf examples with 5 & 20 million primes. figure 6: prime number fractal user interface. the visualisation of prime numbers is again of interest to many in the mathematical community. probably the best known visualisation was discovered by stanislaw ulam (the ulam spiral) in 1963 while doodling during a scientific meeting [14]. an alternative visualisation is the prime number fractal (figure 5), where the resultant image has a central area of brightness and typically resembles a gaseous nebula or some other cosmic object. it is generally recognised that adrian leatherland from monash university in australia constructed the first prime number fractal [6]. the application has a very simple user interface with only a few components (figure 6). the options are: the sieve size, image size, and initial coordinates for x and y. 3.1 theoretical background the most important and evident feature of a prime number fractal image is the central area of brightness. this results from the pixels in the vicinity of the central area being visited more often than pixels around the periphery of the image. the movements in the up, down, left, right directions occur randomly, but each direction produces approximately the same number of moves (see table 3). hence the trajectory of moves is random, generally staying around the central area of brightness. theorem 1. the numbers of up, down, left and right movements are asymptotically equal. proof. dirichlet's theorem assures us that if a and b are relatively prime then there are infinitely many primes in the set {a·k + b, k > 0}. this means that the random walk has an infinity of up, down, left and right moves. if π_{a,b}(x) denotes the number of primes of the form a·k + b less than x, then we know from a very recent result of weisstein [16] that

$$\lim_{x\to\infty} \frac{\pi_{a,b}(x)}{\mathrm{li}(x)} = \frac{1}{\varphi(a)}$$

where li(x) is the logarithmic integral function and φ(a) the euler totient function.
the particular case a = 5 gives

$$\lim_{x\to\infty} \frac{\pi_{5,k}(x)}{\mathrm{li}(x)} = \frac{1}{\varphi(5)} = \frac{1}{4}, \quad \forall k \in \{1, 2, 3, 4\},$$

which means that π_{5,1}(x) ≈ π_{5,2}(x) ≈ π_{5,3}(x) ≈ π_{5,4}(x) ≈ li(x)/4. clearly the two dimensional prime number fractal algorithm has asymptotically the same number of up, down, left and right moves. even on desktop systems the process of generating primes can take a significant amount of time, for example 21,157ms on an amd athlon xp2600 system (table 4). the generation of the primes examines all the numbers between 0 and n, the input size. as the prime numbers become larger their distribution becomes far sparser (table 2); there should be at least one prime between n and 2n.

# primes      sieve size
1 million     15,485,865
2 million     32,452,850
5 million     86,028,130
10 million    179,424,680

table 2: sieve size required for a given number of primes.

#primes   1 million   5 million   10 million   20 million
left      249,934     1,249,832   2,499,755    4,999,690
right     205,015     1,250,079   2,500,284    5,000,241
up        250,110     1,250,195   2,500,209    5,000,270
down      249,940     1,249,893   2,499,751    4,999,798

table 3: distribution of moves for a two dimensional prime number fractal.

#primes             1 million   5 million   10 million
athlon xp2600       1,781ms     9,752ms     21,157ms
amd 500mhz system   4,084ms     23,946ms    51,552ms

table 4: processing times to generate primes on desktop systems. 3.2 the fractal algorithm the generation of the fractal image requires iterating through a loop (algorithm 3) for all numbers from 1 to n. the first requirement of the algorithm is to determine whether the number i is prime or not. the sieve of eratosthenes is a time-efficient method for determining the primality of a sequence of numbers. its time complexity is o(n · log log n); however, the space requirement is o(n). the algorithm iterates through the sequence from 2 to n, crossing off all numbers > 2 that are divisible by 2. it then moves on to the smallest remaining number and removes all of its multiples. the process of moving on and removing multiples continues until all numbers up to √n have been crossed off.
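the sieve described above can be written down directly (a standard python sketch, not the paper's implementation):

```python
def sieve(n):
    """sieve of eratosthenes: is_prime[i] is True iff i is prime, for 0 <= i <= n."""
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    d = 2
    while d * d <= n:                 # crossing off stops once d exceeds sqrt(n)
        if is_prime[d]:
            for m in range(d * d, n + 1, d):
                is_prime[m] = False   # remove every multiple of d
        d += 1
    return is_prime

primes_to_30 = [i for i, p in enumerate(sieve(30)) if p]
```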
in java one may use an array of boolean values to indicate whether the number at index i of the array is prime. a boolean in j2se requires one byte for storage; however, the bitset class may be used, which reduces the storage requirements to o(n/8) bytes. as mobile devices have such limited memory this approach cannot be used, hence the need for an alternative method. determination of primality is achieved by calling the method isprime(i) (algorithm 4), which returns a boolean value indicating whether the number is prime or not. if the number is prime then the process of plotting the prime takes place. firstly the direction is calculated by carrying out modular division by 5, yielding p mod 5 ∈ {1, 2, 3, 4}; this mapping can be clearly seen in table 5. next, depending on the direction, the current pixel location is updated to reflect the direction indicated by the prime. lastly the colour of the pixel at the new cell location is incremented.

algorithm 3: 2d prime number fractal algorithm
for i = 0 to n − 1 do
begin
    if (isprime(i)) then
        dir = i mod 5;
        if dir = 1 then x−−;
        if dir = 2 then x++;
        if dir = 3 then y−−;
        if dir = 4 then y++;
        pixels[x, y]++;
    end if;
end for;

algorithm 4: the isprime() algorithm
boolean isprime(long p)
begin
    long d;
    if (p = 1) return false;
    if (p = 2 || p = 3) return true;
    if (p mod 2 = 0 || p mod 3 = 0) return false;
    for (d = 3; d <= sqrt(p); d = d + 2)
        if (p mod d = 0) return false;
    return true;
end

1. left:  p mod 5 = 1 ⇒ (x, y) goes to (x − inc, y)
2. right: p mod 5 = 2 ⇒ (x, y) goes to (x + inc, y)
3. up:    p mod 5 = 3 ⇒ (x, y) goes to (x, y − inc)
4. down:  p mod 5 = 4 ⇒ (x, y) goes to (x, y + inc)

table 5: mapping of direction values to pixel movement. 3.3 processing results as is the case with desktop systems, the generation of the primes is the most computationally expensive operation. the results (table 6) show that a significant amount of time is required to generate the prime numbers.
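algorithm 3, algorithm 4 and the mapping of table 5 combine into a short sketch (python for illustration; the image size, the central starting point and the wrap-around at the image border are assumptions):

```python
def is_prime(p):
    """trial division, as in algorithm 4."""
    if p < 2:
        return False
    if p in (2, 3):
        return True
    if p % 2 == 0 or p % 3 == 0:
        return False
    d = 3
    while d * d <= p:
        if p % d == 0:
            return False
        d += 2
    return True

def prime_fractal(n, size=201):
    """algorithm 3: each prime p moves the cursor according to p mod 5
    and brightens the pixel it lands on."""
    pixels = [[0] * size for _ in range(size)]
    x = y = size // 2                  # start the walk at the image centre (assumption)
    for i in range(n):
        if is_prime(i):
            d = i % 5
            if d == 1:   x -= 1        # left
            elif d == 2: x += 1        # right
            elif d == 3: y -= 1        # up
            elif d == 4: y += 1        # down
            x %= size                  # wrap at the border (assumption)
            y %= size
            pixels[y][x] += 1
    return pixels
```

the total of all visit counts equals the number of primes examined, so the brightness concentrates where the walk lingers.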
this limits the computation of the image to just a few hundred thousand primes so that the image may be generated and displayed to the user in a reasonable amount of time.

#primes      20,000     40,000      60,000      80,000      100,000
nokia 6630   44,797ms   129,235ms   240,313ms   373,437ms   524,703ms
nokia 6680   44,984ms   129,000ms   240,078ms   372,875ms   524,046ms

table 6: processing times to generate the pnf on mobile devices. 4 plasma fractal generation plasma fractals are often referred to as "fractal clouds", and the resultant image typically has a cloud-like appearance (figure 7). figure 7: plasma fractal examples (grain 1.2, 2.2, 5.8, 9.6). the generation of this type of fractal uses a recursive algorithm known as random midpoint displacement. applying the exact same algorithm to height values in 3d space will result in the generation of fractal terrain. an example of this method being used for the generation of terrain in the film industry is star trek ii "the wrath of khan", where random midpoint displacement was used to generate the terrain of a moon, the scene being called the "genesis sequence". the procedure for generating a "fractal cloud" (algorithm 5) begins by assigning a colour to each of the four corners of a blank image. executing the "divide(...)" algorithm will firstly find the colour for the central point of the image by calculating the average value of the four corners. the colour value at the central point is then randomly displaced. the image area is then divided into four smaller sections by recursively calling "divide(...)" again for each of the four quadrants. this division process continues until the image cannot be further broken down, by which time the sub-quadrants have reached the pixel level. several example applets that use this procedure may be found on the internet [13] [5].
algorithm 5: plasma fractal algorithm
divide(x, y, w, h, tlc, trc, brc, blc)
    float nw = w / 2, nh = h / 2;
    if (w > 1 || h > 1)
        int displace = displace(nw, nh);
        color top    = avgcolors(tlc, trc) + displace;
        color right  = avgcolors(trc, brc) + displace;
        color bottom = avgcolors(brc, blc) + displace;
        color left   = avgcolors(blc, tlc) + displace;
        color centre = avgcolors(tlc, trc, brc, blc) + displace;
        divide(x, y, nw, nh, tlc, top, centre, left);
        divide(x + nw, y, nw, nh, top, trc, right, centre);
        divide(x + nw, y + nh, nw, nh, centre, right, brc, bottom);
        divide(x, y + nh, nw, nh, left, centre, bottom, blc);
    else
        drawpixel(x, y)

4.1 processing results the results show that mobile devices are capable of generating respectably sized plasma fractal images in a reasonable amount of time (table 7).

image size   100×100   200×200   300×300   400×400   500×500   600×600
nokia 6630   672ms     2,219ms   6,672ms   6,328ms   6,156ms   23,453ms
nokia 6680   671ms     2,328ms   7,468ms   7,344ms   6,828ms   25,329ms

table 7: processing times to generate plasma fractals. 5 further work the primary focus of this paper was on the generation of two dimensional fractals on mobile devices. the obvious progression from this application is to expand the generating functions into the third dimension. in the case of the plasma fractal this will yield a randomly generated terrain if the height values of a plane are randomly displaced instead of the colour values of a two dimensional image. 6 conclusion an integrated tool has been developed to generate a variety of two dimensional fractals, the fractals in question being the mandelbrot set, julia set, prime number fractal and the plasma fractal. it has been shown that the mandelbrot and plasma fractal images can be generated in reasonable time. the generation of the pnf image, however, does involve significant computational resources, especially when the number of primes is greater than 100 thousand. references [1] a.
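algorithm 5 can be sketched recursively in the same spirit (a python sketch on one colour channel over a (2^k + 1)-point grid; the displace() helper shrinking with the region size and the seeded rng are assumptions, this is not the paper's j2me code):

```python
import random

def plasma(size, grain=2.2, seed=0):
    """random midpoint displacement on a (size+1) x (size+1) grid; size is a power of two."""
    rng = random.Random(seed)
    img = [[None] * (size + 1) for _ in range(size + 1)]
    for cx in (0, size):                      # seed the four corners with random colours
        for cy in (0, size):
            img[cy][cx] = rng.uniform(0, 255)

    def displace(w):
        # displacement shrinks with the region size, scaled by the grain parameter
        return (rng.random() - 0.5) * w * grain

    def divide(x, y, w, tl, tr, br, bl):
        if w <= 1:                            # sub-quadrants have reached the pixel level
            return
        nw = w // 2
        top    = (tl + tr) / 2 + displace(nw)
        right  = (tr + br) / 2 + displace(nw)
        bottom = (br + bl) / 2 + displace(nw)
        left   = (bl + tl) / 2 + displace(nw)
        centre = (tl + tr + br + bl) / 4 + displace(nw)
        img[y][x + nw] = top
        img[y + nw][x + w] = right
        img[y + w][x + nw] = bottom
        img[y + nw][x] = left
        img[y + nw][x + nw] = centre
        divide(x, y, nw, tl, top, centre, left)                # top-left quadrant
        divide(x + nw, y, nw, top, tr, right, centre)          # top-right quadrant
        divide(x + nw, y + nw, nw, centre, right, br, bottom)  # bottom-right quadrant
        divide(x, y + nw, nw, left, centre, bottom, bl)        # bottom-left quadrant

    divide(0, 0, size, img[0][0], img[0][size], img[size][size], img[size][0])
    return img
```

the recursion fills every grid point, so the returned array can be mapped to pixel colours directly.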
baker, "mini review enhancements in nokia 6680," http://www.i-symbian.com/forum/images/articles/43/mini_review-nokia_6680_enhancements.pdf, 2005. [2] d. doolan, s. tabirca, "interactive teaching tool to visualize fractals on mobile devices," proceedings of eurographics ireland chapter workshop, dublin, ireland, pp. 7–12, 2005. [3] d. doolan, s. tabirca, "distributed fractal generation across a piconet," proceedings of sigrad05 mobile computer graphics conference, lund, sweden, pp. 63–68, 2005. [4] gimps, "mersenne prime search," http://mersenne.org/. [5] j. lawlor, "plasma fractal notes," http://charm.cs.uiuc.edu/users/olawlor/projects/2000/plasma/, 2001. [6] a. leatherland, "pulchritudinous primes: visualizing the distribution of prime numbers," http://yoyo.cc.monash.edu.au/~bunyip/primes. [7] e. murtazine, "review gsm/umts smartphone nokia n90," http://www.mobile-review.com/review/nokia-n90-en.shtml, 2005. [8] rte, "internet and mobile penetration still rising," http://www.rte.ie/business/2001/0308/odtr.html, 2001. [9] rte, "half the world to have mobile phones by 2015," http://www.rte.ie/business/2004/0225/phones.html, 2004. [10] rte, "mobile penetration now stands at 89%," http://www.rte.ie/business/2004/0907/comreg.html, 2004. [11] rte, "mobile penetration now stands at 94%," http://www.rte.ie/business/2005/0318/comreg.html, 2005. [12] rte, "mobile penetration now stands at 100%," http://www.rte.ie/news/2005/1220/mobilephones.html, 2005. [13] j. seyster, "plasma fractals," http://www.ic.sunysb.edu/stu/jseyster/plasma/, 2002. [14] hermetic systems, "prime number spiral," http://www.hermetic.ch/pns/pns.htm. [15] c. taylor, "mobile penetration to hit 100% in europe," http://www.enn.ie/news.html?code=9604990, 2005. [16] e. weisstein, "arbitrarily long progressions of primes," http://mathworld.wolfram.com/news/2004-04-12/primeprogressions/, 2004. daniel c.
doolan, sabin tabirca, university college cork, department of computer science, cork, ireland. e-mail: {d.doolan, tabirca}@cs.ucc.ie international journal of computers communications & control issn 1841-9836, 9(4):430-438, august, 2014. distributed compressed sensing algorithm for hierarchical wsns n. jiang, h. you, f. jiang, l. liu, y. he nan jiang* college of information engineering, east china jiaotong university, nanchang, 330013, jiangxi, p.r. china; state key laboratory breeding base of nuclear resources and environment, east china institute of technology, nanchang, 330013, jiangxi, p.r. china *corresponding author: jiangnan1018@gmail.com hui you, lingfeng liu college of information engineering, east china jiaotong university, nanchang, 330013, jiangxi, p.r. china youhui106@gmail.com, liulingfeng1983@gmail.com feng jiang school of computer science and technology, harbin institute of technology, harbin, 15001, heilongjiang, p.r. china fjiang@hit.edu.cn yueshun he college of information engineering, east china institute of technology, nanchang, 330013, jiangxi, p.r. china hys8418@163.com abstract: in the traditional theory of cs, each sensor node in a wireless sensor network (wsn) sends its information directly to the sink node, and only the correlation within each node's own information is considered at recovery; this leads to the loss of node information and also causes too much energy consumption. in this paper, combined with the deec protocol and based on the simultaneous orthogonal matching pursuit (somp) algorithm, we propose a novel distributed compressed sensing algorithm for hierarchical wireless sensor networks (the dcsh algorithm). this algorithm uses the spatial correlation between the nodes and the joint sparse model jsm-2 to compress and recover the sensor node information according to the somp algorithm.
simulation results show that the dcsh algorithm can not only obtain accurate reconstruction values of the node information and greatly reduce the energy consumption, but can also prolong the network lifetime. keywords: distributed compressed sensing; deec algorithm; jsm-2 model; cluster architecture; wireless sensor networks (wsns). 1 introduction in the past few years the capability of receiving and transmitting data in wireless sensor networks has been constantly enhanced; simultaneously, the amount of data which needs to be handled is also growing very quickly. traditional nyquist sampling theory requires the sampling rate of a signal to be not less than 2 times the bandwidth of the signal; undoubtedly, this places high requirements on signal processing capability and hardware equipment. therefore, how to deal with bigger and faster data and find a new method has attracted more and more attention. in 2004, donoho and candes et al. proposed compressed sensing (cs) theory, a novel theory which makes full use of the sparseness and compressibility of the signal [1] [2]. this theory shows that when the signal is sparse or compressible, the signal can be reconstructed exactly by collecting the projection values of a small amount of the signal. copyright © 2006-2014 by ccc publications. different from the traditional data processing method, sampling and compression of the signal can be conducted at the same time at a low rate; the sampling process also completes the compression, so the amount of sampled data is greatly reduced. nyquist sampling is evenly spaced sampling, while compressed sampling is random sampling. generally, compressed sensing theory studies how to use intra-signal correlation for compression.
considering the distributed deployment and limited capability of sensor nodes in wireless sensor networks, it is necessary to exploit inter-signal correlation with distributed compressed sensing (dcs). d. baron gave the basic concept and theory of distributed compressed sensing [3], and then proved the upper and lower bounds on the number of measurements required for decoding [4]. in [5], the author presented another joint sparse model for application scenarios such as mimo communication and speech signals, and designed the corresponding joint decoding algorithm. however, data fusion technology for wireless sensor networks based on dcs theory is still at the starting stage. therefore, it is necessary to study how dcs theory does joint reconstruction through the observation vectors of each node in a cooperative way, by defining the joint sparsity of the nodes based on spatially correlated data. our contributions are shown as follows: • based on the jsm-2 model, the signals are related to each other and each signal has different coefficient values, while they are all made up from the base vectors of the same sparse coefficient set. • by using the somp algorithm, the decoding process can be simplified and the number of corresponding measured values can also be reduced; on the decoder side, the reconstruction efficiency is improved. • cluster architecture is widely used in wsns, for example the classic leach, heed, deec and so on. we try to bring clustering into cs theory: on one hand it will save transmission energy; on the other hand, the sink node will receive more complete information given the limited sensor node processing capacity.
2 simultaneous orthogonal matching pursuit (somp) algorithm 2.1 distributed source coding [11] according to the characteristics of distributed sensor networks, a number of distributed coding algorithms have been developed. these algorithms all involve the cooperation of sensor nodes, including predictive coding, the distributed klt and the distributed wavelet transform, and also the three dimensional wavelet algorithm which uses inter-signal and intra-signal correlation at the same time. figure 1: distributed source coding. figure 1 shows distributed source coding of two different sources of information. processing of the signal y is completely independent and the transmitted information is h(y), while processing of the signal x needs to use its correlation with the signal y. at the receiving side, the decoder can first restore the original signal y from h(y); then, using y as side information and the correlation between x and y, only the conditional information h(x|y) needs to be sent to complete the joint decoding of the source information x. the sum of the amounts of information transmitted for x and y can thus be less than h(x) + h(y), which ensures the compression effect on the source information. in one word, for distributed source coding theory the key issue is how to use the inherent correlation of the signals x and y, and then to do the coding independently and the decoding jointly. 2.2 somp algorithm and jsm-2 model jsm-2 [3]: the signals in this model have different coefficient values, while they are all made up from the base vectors of the same sparse coefficient set. this model can be applied to some important applications, such as mimo telecommunication and audio signal arrays. the signals which accord with the jsm-2 model may be sparse in the fourier domain, for example the degenerate frequency components caused by different propagation paths.
in the jsm-2 model the original signals are expressed by the following formula:

$$x_j = \Psi \theta_j, \quad j \in \{1, 2, \ldots, J\} \qquad (1)$$

here the coefficient vector θ_j corresponds to a set of indices i_j ⊂ {1, 2, ..., n}, and i_j has only k elements, namely ∥θ_j∥_{ℓ0} = k; the sparsity of each original signal x_j is k. paper [7] presented a greedy algorithm to recover the related signals, called the simultaneous orthogonal matching pursuit (somp) algorithm. the algorithm is very similar to the omp algorithm, but with some small changes. the somp algorithm is based on the concept of distributed source coding. the basic idea of the somp algorithm is: assume that the sensor nodes in the wsn are all consistent with the jsm-2 model, namely each original signal includes only part of the information and has a sparse representation on the same sparse basis, but the sparsities of the original signals are not the same. suppose that there are b distributed sampled signals y_1, y_2, ..., y_b; the second step of the omp algorithm, which finds the index, is changed so that the index is found by solving the following simple optimization problem:

$$\max_{j \in \Omega} \sum_{k=1}^{B} \left| \langle r_{k,t-1}, \varphi_j \rangle \right| \qquad (2)$$

in which r_{k,t−1} is the residual of the k-th distributed sampled signal at iteration t − 1; the rest of the steps are the same as in the omp algorithm. when b = 1 this process reduces to the standard omp algorithm. in conclusion, the somp algorithm is divided into the following steps: 1) initialization: residual r_0 = y, number of iterations t = 1, index set Λ_0 = ∅; 2) find the index λ_t by solving the simple optimization problem (2); 3) update the index set Λ_t = Λ_{t−1} ∪ {λ_t}, and update the selected column matrix Θ_t = [Θ_{t−1}, φ_{λ_t}].
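the index-selection step (2) is the only change with respect to omp; a pure-python sketch of just that step, assuming the b residuals are given as vectors and the measurement matrix is stored column by column:

```python
def select_index(residuals, phi_columns):
    """somp atom selection, formula (2): argmax_j of sum_k |<r_k, phi_j>|."""
    def score(col):
        # sum over the b residuals of the absolute inner product with column phi_j
        return sum(abs(sum(r_i * c_i for r_i, c_i in zip(r, col)))
                   for r in residuals)
    return max(range(len(phi_columns)), key=lambda j: score(phi_columns[j]))
```

with b = 1 (a single residual) this reduces to the selection rule of standard omp.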
by convention, Θ_0 is an empty matrix; 4) solve a least squares problem to obtain a new estimate of the signal; 5) calculate the new approximation a_t = Θ_t x_t and the new residual r_t = y − a_t; 6) t = t + 1; if t < m, go back to step 2), otherwise go to step 7); 7) if ∥θ_t∥_∞ > γ choose h_1, otherwise choose h_0; in which γ is a nonzero threshold, h_0: θ_s = 0, h_1: θ_s ≠ 0, and θ_s is the sparse coefficient. 3 deec clustering algorithm deec [9] is a classical clustering algorithm which can save energy effectively and prolong the network lifetime. it is able to construct an optimized cluster structure for data aggregation through the exchange of messages between the local nodes, balances network energy consumption effectively, and is thus better adapted to sensor network applications that periodically collect data. simulation results show that deec can prolong the network lifetime over leach by about 45% with a small network latency. figure 2: the structure of deec (aggregator, head and relay nodes). clusters in the deec protocol contain four types of nodes: the sensor, the head, the aggregator and the sender. nodes in the network choose themselves as cluster head nodes with a certain probability to form a cluster head set h = {head_i}; a cluster head node is responsible for the formation of its cluster. the distance between each of the sensor nodes v_ij in cluster i and the head nodes satisfies the following condition:

$$\min_{1 \leq k \leq |H|} \mathrm{dist}(v_{ij}, head_k) \qquad (3)$$

but the head is not responsible for data aggregation, fusion or sending information to the base station immediately; it plays the role of a "caller" in the first place. in the process of cluster formation we determine the aggregator and the sender through calculation.
the former is used to gather the information which is sent by the cluster members and then do the data processing and fusion; the latter accepts the information transmitted by the former and then sends it to the base station. of course, the three types of task (forming the cluster, gathering data, and sending information to the base station) may be borne by one or two nodes. the process of head selection in deec is similar to leach: each node generates a random number, and if it is less than the threshold the node becomes a cluster head. but the determination of the threshold here is different from leach: deec considers neither the effect of the current round nor whether the node has been a cluster head within several rounds; for deec, the cluster head set h only has the effect of dividing the network into several clusters. the threshold of deec is the probability of each node becoming cluster head, p_head = k_opt/n, in which k_opt is the optimal number of clusters in the network and n is the number of nodes in the network at initialization. the derivation of k_opt has a related discussion in the literature [8]; here we only assume that it is a system parameter set at the initialization time of the network. when a node is selected as a cluster head it broadcasts an invite message to all other nodes; the nodes which receive the message choose the nearest cluster head to join and send a join(id, position) message to their cluster head. in this way the entire network is divided into several clusters. 4 dcsh algorithm in the above algorithm, the capability of sensor nodes in the wsn is limited, so a node can only receive the information sent by a small number of sensor nodes, and each sensor node has only its local information; therefore the joint decoding side cannot recover the full information, and this leads to the loss of the target source information at recovery.
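the head election and cluster joining described above can be sketched as follows (a python sketch with p_head = k_opt/n as in the text; the node coordinates, the rng and the euclidean distance are assumptions):

```python
import math
import random

def elect_heads(nodes, k_opt, rng):
    """each node elects itself cluster head with probability p_head = k_opt / n."""
    p_head = k_opt / len(nodes)
    return [i for i in range(len(nodes)) if rng.random() < p_head]

def join_clusters(nodes, heads):
    """every node joins the nearest head: min over 1 <= k <= |H| of dist(v, head_k)."""
    clusters = {h: [] for h in heads}
    for i, (x, y) in enumerate(nodes):
        nearest = min(heads, key=lambda h: math.hypot(x - nodes[h][0], y - nodes[h][1]))
        clusters[nearest].append(i)
    return clusters
```

running elect_heads each round with a fresh rng reproduces the per-round randomness of the election; join_clusters implements condition (3).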
in this paper, based on our previous works [12] [13] [14], we propose an improved dcs reconstruction scheme (the dcsh algorithm) based on somp. its basic idea is: use the deec algorithm to divide the sensor nodes into clusters and select the cluster heads, so that the information of the nodes is gathered at the cluster heads; the information on a cluster head is then the information of all the nodes within the cluster. the sensor nodes send their information to the cluster head, where it is fused, and the somp algorithm is then used to recover the information. on one hand, the dcsh algorithm reduces the number of nodes which transfer information directly to the joint decoding end, avoiding the loss of transmitted information; on the other hand, by using the advantages of the deec protocol, our algorithm achieves the goal of saving energy. the system model for the dcsh algorithm is shown in figure 3. figure 3: system model for the dcsh algorithm. suppose that n sensor nodes are randomly distributed in the wireless sensor network, while the sink node is at the centre of the network area. there is a target source in the network; the information of the target source is sent to all nodes in the network, any sensor node can communicate with the sink node directly, and the signals all satisfy the joint sparse model jsm-2. first, divide the n nodes into m clusters, then elect the cluster heads c_1, c_2, ..., c_m of the m clusters respectively, the numbers of nodes in the clusters being n_1, n_2, ..., n_m. transmit the cluster information to each cluster head and then use the dcs algorithm to encode the information on the cluster head, obtaining the measured values y_1, y_2, ..., y_m which are transmitted to the sink node for joint decoding. at last, the reconstruction values x̂_1, x̂_2, ..., x̂_n are obtained, thereby recovering the target source information accurately.
distributed compressed sensing algorithm for hierarchical wsns 435

the dcsh algorithm can be stated as follows:
1) based on the deec protocol, divide the sensor nodes into clusters and choose the cluster heads. the deec threshold phead = kopt/n is the probability of each node becoming a cluster head, where kopt is the optimal number of clusters in the network and n is the number of nodes in the initialized network.
2) determine the aggregator: once the cluster head knows the location information of all nodes, the aggregator can be determined by the simple algorithm find_aggregator [10].
3) determine the sender, using the algorithm find_sender [10].
4) transmit the information of the sensor nodes to the cluster head, and encode it independently on the cluster head using the original measurement matrices.
5) use the somp algorithm for joint reconstruction of the node information, thus recovering the full information of the network.

5 the results of simulation and analysis

this paper uses matlab as the simulation tool. 100 sensor nodes are randomly distributed in the deployment area, the cluster head is at the center of the area, and the information of each node conforms to the jsm-2 model. first of all, we verify the performance of the somp algorithm. the reconstruction value x̂1 of the sensor node information x1 is obtained by the somp algorithm and by the omp algorithm; the reconstruction errors are 5.1672 × 10−15 and 8.1596 × 10−15 respectively. figures 4 and 5 compare the two algorithms and show clearly that the somp algorithm reconstructs the signal excellently; compared with omp, somp performs well and achieves higher accuracy. figure 4: comparison of the somp algorithm (original versus recovered signal).
suppose that the leach and deec protocols have the same number of original data nodes; figure 6 studies the performance under the condition of the same number of cluster nodes. in this figure, the two curves represent the leach and deec protocols, and the specific results show that the deec protocol has fewer dead nodes after the algorithms have run for the same time. figure 5: comparison of the omp algorithm (original versus recovered signal). figure 6: the comparison of the lifecycle (data transactions of leach and deec over time). given that the initial energy of the nodes is 50 j and that the energy of the nodes decreases with the number of election rounds, figure 7 shows the relationship between the rounds and the active nodes of the deec algorithm. we can conclude that, under the same conditions, the network lifetime of deec is longer than that of the traditional leach protocol, improving the network cycle to a great extent. for further study of the two reconstruction algorithms, we also compare the average absolute error of the somp and omp algorithms. figure 8 shows the relationship between the absolute errors of the two algorithms: with the same number of measured values, the absolute error of the somp algorithm is lower than that of the omp algorithm. to sum up, the comparison and analysis in figure 8 lead to the conclusion that, under the same conditions, the somp algorithm can ensure the accuracy of the reconstruction while reducing the number of measured values. in this way we can save network energy and improve the efficiency of signal reconstruction.
6 conclusions

this paper proposes a distributed compressed sensing algorithm for hierarchical wireless sensor networks (the dcsh algorithm), which uses the deec algorithm to obtain accurate information by recovering the information of the sensor nodes in the network. figure 7: the remaining energy of the nodes (j) versus the election rounds, for leach and deec. figure 8: the average absolute error versus the number of measured values, for the omp and somp algorithms. the sensor nodes choose the cluster head by the deec protocol. we take the residual energy of the nodes into consideration in the process of choosing the cluster head, preventing some nodes from exhausting their energy prematurely; this not only extends the network lifetime effectively but also reduces the number of nodes that transmit data directly to the gathering node. then, based on the spatial correlation of the cluster nodes and the joint sparse model (jsm-2), the distributed compressed sensing reconstruction algorithm (somp) accurately recovers the original node information from a small amount of information. the simulation results show that the algorithm performs well: it improves the network lifetime and is well adapted to wsn applications that collect data periodically.

acknowledgments

this work is supported by the national natural science foundation of china under grants no. 61063037 and no. 51364001, the open program of the state key laboratory breeding base of nuclear resources and environment under grant no. nre1102, the key projects in the science and technology pillar program of jiangxi province of china under grants no.
20111bbg70031-2 and 20133bbe50033, the educational commission of jiangxi province of china under grants no. gjj13335 and gjj13354, and the foundation for young scientists of jiangxi province of china under grant no. 20133bcb23016.

bibliography

[1] donoho d. l. (2006); compressed sensing, ieee transactions on information theory, issn 0018-9448, 52(4): 1289-1306.
[2] candès e. (2006); compressive sampling, in: proceedings of the international congress of mathematicians, isbn 978-3-03719-022-7, 1433-1452.
[3] d. baron et al. (2005); distributed compressive sensing, technical report, pre-print.
[4] d. baron et al. (2005); an information theoretic approach to distributed compressed sensing, in: conference on communication, control, and computing, isbn 9781604234916.
[5] m. f. duarte et al. (2005); distributed compressed sensing of jointly sparse signals, in: proceedings of the 39th asilomar conference on signals, systems and computers, issn 1058-6393, 1537-1541.
[6] j. tropp; a. gilbert; m. strauss (2006); algorithms for simultaneous sparse approximation, part i: greedy pursuit, signal processing, issn 0165-1684, 86: 572-588.
[7] w. dai; o. milenkovic (2009); subspace pursuit for compressive sensing signal reconstruction, ieee transactions on information theory, issn 0018-9448, 55(5): 2230-2249.
[8] l. zhao; l. q. lian (2005); distributed and energy efficient self-organization for on-off wireless sensor networks, international journal of wireless information networks, issn 1068-9605, 12(1): 211-215.
[9] l. qing; q. zhu; m. wang (2006); design of a distributed energy-efficient clustering algorithm for heterogeneous wireless sensor networks, computer communications, issn 0140-3664, 29(12): 2230-2237.
[10] y. x. liu et al. (2010); regularized adaptive matching pursuit algorithm for signal reconstruction based on compressive sensing, journal of electronics and information, issn 1009-5896, 32(11): 2713-2717.
[11] d. slepian; j. k.
wolf (1973); noiseless coding of correlated information sources, ieee transactions on information theory, issn 0018-9448, 19(4): 471-480.
[12] nan jiang (2014); wdem: weighted dynamics and evolution models for energy-constrained wireless sensor networks, physica a: statistical mechanics and its applications, issn 0378-4371, 404: 323-331.
[13] nan jiang; sixin jin; yan guo; yueshun he (2013); localization of wireless sensor network based on genetic algorithm, international journal of computers communications & control, issn 1841-9844, 8(6): 825-837.
[14] nan jiang; rigui zhou; qiulin ding (2009); dynamics of wireless sensor networks, international journal of distributed sensor networks, issn 1550-1329, 5(6): 693-707.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. iii (2008), no. 2, pp. 116-124

e-learning multimedia applications: towards an engineering of content creation

maria dolores afonso suarez, cayetano guerra artal, francisco mario tejera hernandez

abstract: just as e-learning applications are becoming increasingly important at the university, there are still some critical questions that should be solved in order to exploit the potential offered by current web technologies. the creation of contents capable of capturing the attention and interest of the students, and their delivery in an appropriate way, constitute the main purpose of this work. the teaching content engineering expounded here shows the different stages that should form part of the process. a development team, composed of different professional profiles, works together with the lecturers of the subject for which the contents are being created, i.e. multimedia videos and interactive applications. this process should be developed according to a methodology that assures the use of appropriate resources; all tasks suitable for it should be modularized and factorized.
this paper presents the experience acquired in the development and use of multimedia contents for e-learning applications, created for some of the subjects of the degree in computer science engineering. these contents are delivered over the internet using video streaming techniques. the results of the work show the students' satisfaction, including their comments. keywords: teaching content engineering, didactic objectives, active learning, passive learning.

1 introduction

during recent years university education has experienced important progress with respect to the resources used for lecturing. techniques have changed from traditional blackboard methods to the use of overhead transparencies (slides) and, later on, the use of a video projector and powerpoint to present the content of the lessons. nowadays, the use of these technologies has become commonplace. as far as content availability for the student is concerned, it has moved from traditional notes taken in classrooms, books and photocopies to digital versions of books or classroom notes, available on the subject's web page. in fact, universities tend to place courses on the web focusing on information delivery rather than learning. the proposal of the european higher education area provides the framework to take an important step forward in the techniques used in lectures and in the methodology used to create them. for this reason, new methodological approaches are introduced in order to promote active learning [1] as opposed to traditional passive learning, and to value the effort that students must make to assimilate knowledge rather than only the number of hours of class attendance. the use of e-learning techniques makes this easier; its wide range of application allows education quality and the delivery of information to be increased.

2 e-learning in the ehea

the european higher education area proposes the setting up of a convergence process for educational material in europe.
all this is made easier by using an e-learning [2] approach, whose extensive field of work, covering the whole range of subjects, allows the quality of education and its availability to increase [3]. copyright © 2006-2008 by ccc publications. the creation of multimedia contents for e-learning could be developed together with other european universities, and their use could be shared within the same knowledge areas. this opens a new common line of work: it will be possible to promote european cooperation to guarantee the quality of higher education using comparable methodologies and criteria.

3 teaching content engineering

by this concept we understand the whole methodological corpus that allows e-learning development technologies to be inserted into the production of didactic solutions with appropriate investments in production time, resources and person-hours. in this manner, the creation of contents will be developed in a systematic way, planning the process and using adequate techniques. the integration of these techniques and the extent of the teaching material development project require a multidisciplinary team. this multidisciplinary team, composed of a group of different professional profiles, will carry out different tasks using a wide variety of technologies. these tasks will come together to create a production line that maximizes results. the improvement of productivity and quality in teaching content creation constitutes the main objective of this engineering [4].
the proposed engineering of teaching content is carried out by means of a methodology that divides the multimedia production process into phases and assigns tasks to each member of the professional development team. these phases include research on the suitability of the content for e-learning, a preliminary analysis to select the multimedia techniques to apply, the development of the contents based on the project design and, finally, maintenance. the multidisciplinary team will be composed of a project manager, programmers, designers and multimedia experts, as well as the lecturers of each subject.

4 methodology

the main purpose of the proposed methodology is to obtain an optimal workflow [5] and an optimal use of resources by the different production tasks. to that end, we define the following production phases.

4.1 suitability of the content for e-learning

creating multimedia material for subjects using e-learning techniques requires a considerable investment of resources. therefore, some factors that help to decide on its profitability should be taken into consideration. in this first phase a suitability study is made according to the factors below: the validity of the content; the period of time in which the teaching contents will not need to be updated; the number of students who attend classroom training (this factor reflects the amount of resources used in their learning); and the modularity and reusability of independently operable units, which become part of the total structure for creating more contents.

4.2 contents analysis

the creation of multimedia contents is carried out by dividing the teaching contents into didactic objectives. in this phase the modularization of the content takes place.
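the suitability factors of phase 4.1 can be illustrated with a toy scoring function. the paper gives no formula, so the weights, thresholds and function name below are invented purely for this example; a real study would calibrate them per institution.

```python
def elearning_suitability(validity_years, students_per_year,
                          reusable_modules, total_modules):
    """Hypothetical score (0..1) combining the suitability factors named in
    the text: content validity period, audience size, and module reusability.
    All weights and normalization constants are illustrative assumptions."""
    longevity = min(validity_years / 5.0, 1.0)       # stable for ~5 years -> 1.0
    audience = min(students_per_year / 200.0, 1.0)   # large enrolments amortize cost
    reuse = reusable_modules / total_modules if total_modules else 0.0
    return round(0.4 * longevity + 0.35 * audience + 0.25 * reuse, 3)
```

a first-year mathematics course with stable content, large enrolment and half its units reusable would score high, flagging it as a good candidate for multimedia production.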
the extraction of the lecturers' knowledge will be necessary for the selection and proposal of the different multimedia didactic elements that will form part of these multimedia didactic units. it is advisable to follow a methodology from knowledge engineering, where the knowledge is produced according to the lecturer's subject-specific knowledge and contains the knowledge and analytical skills of one or more human experts. to achieve this aim, the following tasks are introduced: meetings with the lecturers, in which the lecturers expound the teaching objectives of the subject, explain the lesson content and the current way of portraying the contents to the students; the selection of lessons for multimedia format, where the project manager, together with the lecturers, selects the lessons for which multimedia teaching content will be created; and proposal and technique selection [6], where, according to the techniques selected and the module repository, the project manager makes a proposal to the lecturers.

4.3 development

starting from the analysis of the previous phase, a formal design is carried out that identifies the activities and work planning to be done under the supervision of the project manager [7]. this phase is structured as follows: the formal design includes all the multimedia didactic elements to be used; in accordance with this design, both human and material resources are managed, in a process of problem solving and planning to reach the objectives of the project. the development of interactive applications follows a development methodology from software engineering. the creation of videos is divided into script writing, recording, postproduction and codification [8]. finally, the on-line disposition of the multimedia contents makes them accessible to the students.
in this process, the creativity of the development team is considered fundamental.

4.4 maintenance

once the project is finished, it is difficult to ensure that it will work properly unless it is tested. in order to produce a high-quality solution, testing throughout each phase of the process is proposed. the project team should be involved in the maintenance phase: they are expected to work on known issues and prepare a new release of the created material. in order to detect defects and deficiencies in the multimedia material, several tasks are introduced: interviews with lecturers and students, to learn first-hand how they feel about the new content in various aspects (accessibility, management, design); questionnaires, to recover statistical results about all the aspects to be evaluated; and a comparison of academic results, in which an evaluation report is produced and academic results are compared with those of other groups of students who have used traditional learning methodologies. all these tasks will help to obtain constructive feedback and to enhance and optimize this multimedia material in the different aspects evaluated. a thorough study of the time and staff required for project development leads to suitable planning and hence to an appropriate economic investment; cost and resources directly affect the amortization of the investment made in the creation of contents.

5 resources

the means used to carry out this technology comprise both human and material resources. the competences of the multidisciplinary team are defined by different profiles: lecturers, whose main tasks include structuring the subject program, script writing, and the proposal and agreement with the project manager on the multimedia didactic units.
project manager, who advises the lecturers, coordinates the efforts of the development team and assigns tasks according to the planning made. designers, who carry out tasks of graphic design, user interface development, and 2d/3d element creation. analysts/programmers, who analyze and develop software applications and programs. multimedia technicians, in charge of recording, editing and postproduction. the material resources comprise the multimedia recording studio and the laboratory room for the development of software applications and the editing of contents. the workstations of the editing room in the laboratory are fully equipped with the computer material necessary for creating and editing audiovisual content. figure 1: on the left, the recording studio used for the recording of lessons; on the right, a workstation of the editing room.

6 other methodologies

there are different strategies for the creation of teaching content for e-learning, and these approaches often use a similar set of stages or phases for content development. among them we find the "waterfall philosophy", characterized by a number of strictly ordered phases such that each phase begins once the previous one is finished, and the "evolutionary philosophy" or "prototype-based philosophy", characterized by assuming from the beginning that, although the project starts out with a set of requirements, these requirements will change as the project develops. however, the methodological proposal made in this work for an e-learning content engineering focuses on lecturing, which implies a set of differences, listed below: scope. traditional methodologies are generally directed towards content development for corporate e-learning; the proposed methodology is focused on content development for lecturing. contents modularity.
in each knowledge area, we propose creating independently operable units intended for reuse in the same area or in a different one. this requires a thorough analysis of the knowledge area in order to make the relocation of these operable units possible when creating new teaching programs. extent of involvement. lecturers should be part of the work team that creates the multimedia contents for e-learning; they should be involved not only in didactic advisory tasks, according to the teaching objectives of the subject, but should also take part in the conception of the whole set of tools proposed by the project manager. resources. the wide variety and quantity of multimedia resources used are essential in the proposed methodology for e-learning content creation; multimedia technology and the development of interactive applications provide suitable capabilities for developing quality contents at the university level.

7 lecturing experiences

for the evaluation of e-learning techniques in lecture content, some subjects have been selected. specifically, the subjects correspond to bachelor and graduate degrees in "computer science engineering": multimedia, and automaton theory and formal languages ii. the subject of multimedia has the particular feature of covering two fundamental aspects of the teaching of computer science knowledge: on the one hand, purely theoretical and mathematical content supporting audio compression, image and video technologies; on the other, content referring to the use of multimedia tools and web programming languages. in particular, the subject of multimedia has a teaching timetable of 30 theoretical hours (twice a week) in traditional classrooms and another 30 hours of practical lessons in the laboratory. the prepared material covers half of the classroom timetable. the content generated for automaton theory and formal languages ii corresponds to the practical exercises of the subject.
as lecturers can now count on this new multimedia teaching material, the methodology followed until now in multimedia has changed substantially. firstly, the students still have the same number of learning hours; however, they receive some lessons by means of video from a dedicated server. this means that they receive, weekly, one hour in the classroom and another through video streaming. this new material allows the traditional lessons to be divided into two groups; in this way the lesson in the classroom is repeated and each student attends the most convenient class. therefore, not only do the students keep the opportunity to ask the lecturer about their doubts, but the number of students per class is also reduced, allowing more personalized treatment. with respect to automaton theory and formal languages ii, the prepared material only provides support for the personal training of the student. the results of this experience have been very positive. students have unanimously rated this new teaching model as completely recommendable. the students have even taken part in the improvement of the material, contributing their own ideas, needs and wishes, such as including random questions in order to implement interactivity [9]. from the evaluation questionnaire the following issues are recovered: the multimedia videos are highly appreciated; no comments were made about accessibility or reproduction difficulties, or even misunderstandings, although the students consider it necessary to include the subject's content in a pdf file and an index to facilitate direct access to each unit of the lesson; and only one student pointed out that the teacher should propose more practical material in the classroom. the evaluation questionnaire results are represented in table 1. as we can observe, the percentage for the last question is quite good (figure 2).
with respect to the academic efficiency revealed in the assessments, the percentage of success in the exams remains constant compared with the results of past years, when lessons were taught in the traditional way. table 1: results from bachelor and graduate degrees in "computer science engineering". figure 2: graphical representation of the general assessment. however, a particular fact, known by all lecturers when the exam dates come close, should be commented on: in the weeks before the exam dates the students come more frequently, and in many cases only on those dates, to the tutorials. it is remarkable that during this teaching experience there have been practically no consultations in tutorials in the whole academic year, not even on the dates coming up to the exams.

8 future research

the experience acquired in the development of multimedia contents for e-learning applications shows that this emergent task should be tackled through a systematic, disciplined, quantifiable approach to the development, management and maintenance of this teaching content material. in the near future, we will test the process in the development of a new project of the university of las palmas de gran canaria. this project involves the creation of multimedia material for subjects in three different areas: the technical area, the health care science area, and the human, society and legal area. the selection of the subjects has been made according to the factors expounded in the suitability phase. the technical department has preferred to choose subjects of the first courses of various degrees with similar contents, which implies a considerable number of lessons in common.
in this way there will be subjects with 85 or 90 per cent of their content delivered with multimedia material, which is more profitable than having only one or two subjects with 100 per cent of the content delivered with this new material. the subject is named mathematics in some degrees and calculus in others. something similar has happened in the other two knowledge areas: in the health care science area anatomy has been chosen, a subject of the first course of the "medical degree" with a huge number of students; for the human, society and legal area constitutional law has been chosen, which corresponds to two different degrees, "law" and "economics and business sciences", also with a large number of students in the first courses. in this way we will delve into each phase of this new engineering, focusing on improving the efficiency and cost-effectiveness of the development of multimedia contents for e-learning in the selected subjects.

9 conclusions

new information technologies and e-learning will be indispensable tools for lecturing in the near future. the increasing bandwidth available for internet connections and the multimedia capacities of current computers allow them to be used as a completely valid medium for teaching. however, the lack of multimedia content production is an important cause limiting the spread of e-learning. the creation of large-scale teaching content for lecturing needs a production methodology that assures the optimization of resources and, therefore, a reduction in costs. this work intends to contribute the methodological lines applied in the production of multimedia teaching content for lecturing. new teaching methodologies require new teaching aims; the role of lecturers should be restated in the whole educational process. lecturers should reduce the time dedicated to teaching lessons using traditional blackboard methods.
this activity could be largely replaced by e-learning techniques; lecturers could then dedicate their time to activities that offer better quality in teaching, transmitting motivation about the subject being studied and guiding the students in their studies [10]. the future of teaching in european society should see an improvement in the quality of education, in its availability, and a lower cost for students as well as for educational organizations.

10 acknowledgment

we wish to thank the program committee of icvl 2007, which recommended the publication of an extended version of this paper.

bibliography

[1] a. m. barry, creating and sustaining an effective learning environment, aishe conference 2006.
[2] http://ec.europa.eu/education/programmes/elearning/index_en.html
[3] http://ec.europa.eu/education/policies/educ/higher/higher_en.html
[4] b. shackelford, project managing e-learning, astd, usa, 2002.
[5] r. pedrosa and s. petitz, production of interactive contents for training in e-learning environments - procedures and methodologies, iadis international conference e-learning 2007.
[6] w. horton, k. horton, e-learning tools and technologies, 1st edition, wiley, usa, 2003.
[7] l. dublin, j. cross, implementing e-learning, astd, usa, 2002.
[8] j. casteleira and p. leão, production and distribution of contents in digital video of quality for the distance learning, iadis international conference e-learning 2007.
[9] r. rheeder, r. diseko and g. lautenbach, the design of interactivity for a web-based learning environment at a higher education institution, iadis international conference e-learning 2007.
[10] k. fraser, a blended learning approach to teaching "introduction to multimedia" - the e bit!, aishe conference 2006.
maria dolores afonso suarez, cayetano guerra artal, francisco mario hernandez tejera, instituto universitario de sistemas inteligentes y aplicaciones numericas en ingenieria (iusiani), universidad de las palmas de gran canaria, departamento de informatica y sistemas, edif. de informatica y matematicas, campus universitario de tafira, 35017 las palmas, spain. e-mail: mafonso@dis.ulpgc.es, cguerra@iusiani.ulpgc.es, mhernandez@iusiani.ulpgc.es. received: november 30, 2007. maria dolores afonso suarez graduated from the university of las palmas de gran canaria in computer science. she has worked in private enterprises developing software and as an associate professor at the computer science and systems department of the university of las palmas de gran canaria. her main research fields are multimedia and e-learning. cayetano guerra artal graduated from the university of las palmas de gran canaria in computer science. he has been teaching multimedia for seven years at the computer science and systems department of the university of las palmas de gran canaria. he received his ph.d. in computer vision. his main research fields are computer vision, computer graphics, multimedia and e-learning. francisco mario hernandez tejera graduated in electrical engineering and received his ph.d. in computer science at the university of las palmas de gran canaria. he is an assistant professor of computer science and engineering at the computer science and systems department of the university of las palmas de gran canaria. he is a member of the institute of electrical and electronics engineers (ieee), the international association for pattern recognition (iapr), the asociacion española de reconocimiento de formas y analisis de imagenes (aerfai) and the asociacion española para la inteligencia artificial (aepia).
his current research interests are in autonomous systems, knowledge-based systems, active vision, visual learning, scene analysis, mobile robotics and interactive robotic systems (that is, robots that interact with people), as well as multimedia and e-learning.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 783-791

an algorithm for customer order fulfillment in a make-to-stock manufacturing system

danica lečić-cvetković, nikola atanasov, sladan babarogić
university of belgrade, faculty of organizational sciences
jove ilića 154, 11000 belgrade, serbia
e-mail: {danica,nikola.atanasov,sladjan}@fon.rs

abstract: in a competitive environment, many manufacturers are increasingly focusing on designing systems that help them manage variable demand and supply situations. dynamic allocation of demand is very important in the case of customer order allocation. order promising and allocation can be based on a simple sequence that enables a manufacturing company to accept orders unless there are other priority orders. a manufacturing company can also manage allocations of supply to key customers and channels, thereby ensuring that it can meet contractual agreements and service levels in the priority that yields better profit. this paper focuses on a make-to-stock order fulfillment system facing random demand with random orders from different classes of customers. available-to-promise (atp) quantities, calculated from the master production schedule (mps), express the availability of finished goods that can be used to support customer order allocation. this order allocation system is adapted to the mts (make-to-stock) production model, and all orders are treated according to a policy of maximizing customer service. it allows incoming purchase orders as well as existing inventory on hand to be selected and allocated to customer sales orders and back orders.
the system then automatically allocates the available stock to the selected sales orders. we developed an integrated system for allocating inventory in anticipation of the service required by high-priority customers and for order promising in real time. our research exhibits three distinct features: (1) we explicitly classify customers into groups based on target customer service levels; (2) we define a higher level of customer selection, directly derived from the company strategy of developing small and medium customers; (3) we consider backorders that the manufacturing company has to fulfill in order to maximize the overall customer service for certain customers.
keywords: manufacturing system, order allocation, customer service.

1 introduction

production companies are facing the dilemma of whether to increase or decrease the capacities of production lines and/or production plants. every day, production managers face the dilemma of whether to buy a new machine, increase the number of shifts within the production process, or employ new workers, in spite of the fact that such decisions also affect middle-term and long-term production planning and should not be made hastily. sales forecasting and the identification of an increase in demand represent the starting point for production capacity planning, and also for making a decision on capacity expansion. until the available capacities are increased, a production company has the goal of satisfying the market demand by means of the available production capacities. the allocation of available finished products to customer orders requires an efficient system of allocation aimed at improving the overall operating efficiency. operating efficiency is directly related to the quantities of goods produced and the profit from sales.
in addition to the profit-oriented decision on the selection of orders to be fulfilled, one needs to consider customer service, since the overall operation of a production company depends on its customers. customers who account for a large share of the sales results require special attention and the fulfillment of every order. there is also a group of customers who constantly increase their orders and thus expect adequate and better service. small but numerous customers also shape the overall sales of a production company; some of them represent future potential for an increase in sales, and hence in the incomes of the production company. the abovementioned facts highlight the importance of the decision made when selecting the orders to be fulfilled, and the necessity of having an algorithm that will efficiently and effectively perform the allocation of limited quantities of available finished products. this paper is structured as follows: the second chapter presents relevant approaches of other authors to the observed problem. the third chapter defines the problem of allocation of limited resources in production companies; the problem refers to the distribution of a limited amount of production to customer orders, with the goal of maximizing the percentage of fulfilled orders. aiming at an efficient and effective allocation, the fourth chapter describes the development of an algorithm for fulfilling orders in the make-to-stock production system. in the conclusion, the authors cite the main advantages of the proposed allocation algorithm, as well as possible directions for developing and upgrading it with the aim of improving the performance of the production company.

2 related work

providing the ordered quantities of products represents the key function in planning operations within the entire supply chain.
the system of providing stocks is recognized as a key challenge in production companies. the basic goal of the atp/ctp process of providing stocks (atp – available-to-promise, ctp – capable-to-promise) is to provide a reliable response to customer demands, taking into consideration the wide range of information and limitations that exist in the distribution channel network. key measures of performance within production companies, according to [6], are recognized in measuring the level of customer service and customer satisfaction. traditional approaches to order provision, adjusted to the mts production system, were described in [1], considering the available stocks of finished products for order fulfillment in line with the first come – first served (fcfs) principle, without awarding priority to either customers or orders. the priority of received orders also represents a significant factor in providing stocks for order fulfillment. the optimization of orders is carried out based on the maximization of profit for the entire operation, through the system of production planning presented in [9]. priority is given to orders in the following way: (1) orders are given priority if they are already in the sales forecast, or (2) orders are given high priority if their delivery should bring higher profit to the operation of the production company. the general application of the atp/ctp decision system was implemented with the aim of improving profit and system performances, in line with [6]. by considering advanced planning systems (aps), it is possible to identify various approaches to defining the priority of customers, based on which the incoming orders will be fulfilled. the essential idea of the approach described in [9] is the segmentation of customers with the goal of increasing the overall income of the production company through order acceptance and the delivery of the orders with the greatest profit.
the atp model allows for such a system, since it provides quantities in advance in accordance with certain customer segments, satisfying only the orders of priority customers. the atp system is based on mixed-integer programming, with the goal of optimizing the utilization of limited production capacities in order to provide timely information regarding the fulfillment of a customer's order. in addition, the abovementioned problem is, according to [2], classified amongst the dynamic models of order management with limited capacities based on profitability analyses. if a production company has limited production capacities, it is clear that the company will decide to reject some of the incoming orders, which directly affects the profitability of the operation. the decision on rejecting orders is based on the comparison of orders, since the company rejects those that yield lower profit. an atp system based on providing quotes for important customers within a defined timeframe, in order to provide timely information on the delivery date, is described in [9]. atp is defined in such a way that orders yielding higher profit for the producer must be fulfilled with priority, in spite of the fact that less profitable orders are still 'on hold'. meyr focused atp on the mts system through the assumption that the supply of finished products is fixed by the available stocks and the ongoing production, which will be available in a short period of time. the model of allocation of limited quantities of finished products through customer segmentation, developed by kilger and meyr, is based on an assessment of the profit generated by fulfilling orders in accordance with their priority. new orders can be fulfilled by the allocation of quantities granted to the group of customers they belong to, or, if there is no 'free stock' in that group, from the quantities granted to lower-priority groups.
such a method prevents the orders of low-priority customers from being fulfilled prior to the orders of high-priority customers, who help in generating higher profit. the atp allocation system is based on defining priority classes with the goal of maximizing the overall profit of the production company. on the grounds of various studies, according to [9], it was determined that the fcfs system for customer orders brings the best results with limited production capacities, in case the production is realized based on forecast accuracy. if classes are defined in accordance with the priority of customers, the quantity awarded to a certain class within a cycle, if not allocated, stays booked for the same class in the following cycle. the same author in [10] shows that the provisional allocation of stocks for certain customer classes within a defined period of time can give a significant contribution if the customer demand within the class can be anticipated with guaranteed accuracy.

3 order fulfillment problem definition

production companies operating on markets with irregular demand often face the problem of insufficient stocks of finished products. until it makes a decision on the expansion of production capacities, a production company has the goal of fulfilling as many orders as possible with the available production capacities and quantities of finished products. the allocation of finished products to customer orders is realized through the process of allocation within the mts (make-to-stock) production system. the problem of allocation of available products to incoming orders is listed among the problems of allocation of limited resources. in case the incoming customer orders within one cycle do not exceed the available stocks of finished products, the allocation is complete and all orders are fulfilled.
in the opposite case, when the sum of all orders is greater than the quantity of available stock, there is a need to define the way of product allocation, i.e. rules must be introduced to guide the allocation of products to incoming customer orders. with the goal of maximizing the effect of the allocation of available products, the need for the development of an allocation algorithm has been perceived, to enable the fulfillment of orders in line with their priority. the fulfillment of orders of high-priority customers contributes to the maintenance of preferred service for those customers who rank high because of their share in the incomes and the profit of the production company. the order fulfillment algorithm refers to products of the fmcg (fast moving consumer goods) industry, and with minor modifications it is possible to apply it to the problem of product allocation in other industries too.

4 algorithm

the problem of allocation of limited quantities of finished products to customer orders can be solved by creating an algorithm that systematizes the allocation process. the proposed allocation algorithm tends to maximize the customer service, expressed by the number of fulfilled orders and the percentage of order fulfillment, through the processing of orders in accordance with previously defined groups and partitions of customers. the criterion for dividing customers into groups is based on the consideration of revenue per customer, profit per customer, development potential, service rate per customer, strategic partnership with the customer, etc. with the goal of providing strategic potential for the development of a customer network, customers are classified into partitions that provide a guaranteed quantity of products for certain groups of customers which would otherwise be marginalized. the proposed algorithm has polynomial complexity.
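the bookkeeping used throughout this section (the customer-group matrix mcg, the group-partition matrix mgp, the protective quotas kp_k, and the per-group and per-partition order totals defined in tables 1 and 2 below) can be set up in a few lines. the following is our sketch with invented toy numbers, not the authors' implementation:

```python
import numpy as np

# Toy instance (all numbers invented for illustration):
# 5 customers, 3 groups, 2 partitions.
# MCG[i, j] = 1 if customer i belongs to group j,
# MGP[j, k] = 1 if group j belongs to partition k.
MCG = np.array([[1, 0, 0],
                [1, 0, 0],
                [0, 1, 0],
                [0, 0, 1],
                [0, 0, 1]])
MGP = np.array([[1, 0],
                [1, 0],
                [0, 1]])
kp = np.array([0.8, 0.2])      # protective quotas kp_k, sum to 1
PT = 100.0                     # total available production PT_l
OC = np.array([40.0, 30.0, 20.0, 15.0, 10.0])   # customer orders OC_i

OT = OC.sum()                  # total order OT_l
OG = OC @ MCG                  # total order per group OG_j
OP = OG @ MGP                  # total order per partition OP_k
AP = PT * kp                   # quantity allocated per partition AP_k
```

with these numbers the group orders OG come out as [70, 20, 25], the partition orders OP as [90, 25], and the quota allocation AP splits the production as [80, 20]; since OT = 115 exceeds PT = 100, the algorithm below applies.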
table 1: variable descriptions

variable | description
PT_l | total available production in iteration l (l = 1, ..., r)
OT_l | total order in iteration l (l = 1, ..., r)
P_k  | partition (k = 1, ..., p)
G_j  | group (j = 1, ..., m)
C_i  | customer (i = 1, ..., n)

the algorithm is applicable only when OT_l > PT_l. at the beginning of the year, or at the beginning of an iteration:

1. form the list of customers with parameters (i = 1, ..., n). the list comprises basic data on customers and their business indicators for the previous year, the last three years and the last five years.

2. define the number of groups (j = 1, ..., m). every identified group has its priority; a smaller ordinal number of a group means a higher priority. the recommendation is to form three groups, although the algorithm functions even with only two groups.

3. cluster customers into groups. at the beginning of the year the company disposes of the list of customers and revises the business results from the previous period for every customer, which is the base for the classification into groups (figure 1a). every customer within a group is equal to the other members of the group when it comes to product allocation. clustering (classification) into groups can be done based on the results of applying methods such as the abc method of prioritizing:

C_i ∈ G_j, for i ∈ {1, ..., n} and j ∈ {1, ..., m}

information on which customer belongs to which group is kept in the matrix customer-group (MCG), as shown in figure 1b. if a new customer emerges at the very beginning of a new iteration, he is added to the system and attached to one of the existing groups:

C_{n+1} ∈ G_j, for j ∈ {1, ..., m}

4. define the number of partitions with protective percentage quotas (k = 1, ..., p). partitions are introduced with the aim of protecting small customers and giving them an opportunity to grow in the future.
granting a quota to the partition whose groups contain small customers provides a certain quantity of products for small customers who have great potential for development and for an increase in the quantities to be ordered in the future. the protective quota, i.e. the percentage portion of the overall production in iteration l that will be available to the observed partition, is defined for every partition:

0 \le kp_k \le 1, \text{ for } k \in \{1, \ldots, p\}, \quad \sum_{k=1}^{p} kp_k = 1

in addition, the number of partitions is smaller than or equal to the number of groups. the concept of a partition loses its purpose if only one partition has been defined.

5. cluster groups into partitions. every defined partition must have at least one group, even though such a case counts as if every group has been awarded a protective quota (figure 1a):

G_j ∈ P_k, for j ∈ {1, ..., m} and k ∈ {1, ..., p}

information on which group belongs to which partition is kept in the matrix group-partition (MGP), as shown in figure 1c.

figure 1a: customers, groups, partitions; figure 1b: matrix customer-group; figure 1c: matrix group-partition

if we mark the demand of customer i with OC_i, then the total order for products in iteration l is:

OT_l = \sum_{i=1}^{n} OC_i, \quad \text{or} \quad OT_l = \sum_{j=1}^{m} \sum_{i=1}^{n} OC_i \cdot MCG_{ij}, \quad l \in \{1, \ldots, r\}

the variable AP_k is then introduced to keep the amount of products allocated to partition k in iteration l:

AP_k = PT_l \cdot kp_k, \quad PT_l = \sum_{k=1}^{p} AP_k, \quad k \in \{1, \ldots, p\}, \; l \in \{1, \ldots, r\}

where PT_l is the total production in iteration l.

table 2: variable descriptions

variable | description
OC_i | order of customer i in iteration l
AC_i | allocated quantity of customer i in iteration l
NC_j = \sum_{i=1}^{n} MCG_{ij} | number of customers in group j
OG_j = \sum_{i=1}^{n} MCG_{ij} \cdot OC_i | total order of group j
NG_k = \sum_{j=1}^{m} MGP_{jk} | number of groups in partition k
OP_k = \sum_{j=1}^{m} \sum_{i=1}^{n} OC_i \cdot MCG_{ij} \cdot MGP_{jk} | total order of all customers in partition k
B_l | buffer in iteration l
MC_i | remembered quantity of the unfulfilled part of the order of customer i in iteration l
GMT_k | ordinal number of the group in partition k where the token is set

figure 2: the algorithm represented as a uml 2.0 activity diagram

the diagram in figure 2 shows that in every iteration the allocation algorithm starts by checking whether AP_k > OP_k. the check goes from the last partition to the first one, and if this condition is met, the allocated surplus is transferred to a joint buffer, whereas the amount of finished products allocated to partition k now equals the overall demand in the given partition (first set B_l = B_l + (AP_k − OP_k), then assign AP_k = OP_k). this situation might happen in partitions that gather customers with small orders, in case a higher quota than necessary was granted in the beginning. this is a sign that one has to revisit the analysis of the partition of 'small' customers and award a smaller, more reasonable protective quota (fine tuning). within the algorithm, all amounts transferred to the buffer are added to the allocated amount of finished products of the first partition (AP_1 = AP_1 + B_l). one of the most important concepts of the proposed algorithm is the group memory token (gmt). it is introduced in order to designate a group of customers whose order has not been fulfilled during the observed iteration. there can be only one group memory token (pointer) in each partition.
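the per-iteration surplus check AP_k > OP_k and the buffer transfer AP_1 = AP_1 + B_l described above can be sketched as a single backwards pass over the partitions. this is our simplified reading of the activity diagram, not the authors' code:

```python
def redistribute_surplus(AP, OP):
    """One pass of the check AP_k > OP_k, scanning from the last
    partition to the first: every surplus AP_k - OP_k is moved to the
    buffer B_l and AP_k is capped at OP_k; the collected buffer is
    finally added to the first (highest-priority) partition."""
    AP = list(AP)          # per-partition allocated quantities AP_k
    B_l = 0.0              # joint buffer B_l
    for k in range(len(AP) - 1, -1, -1):
        if AP[k] > OP[k]:
            B_l += AP[k] - OP[k]
            AP[k] = OP[k]
    AP[0] += B_l           # AP_1 = AP_1 + B_l
    return AP, B_l
```

for example, with AP = [80, 20] and OP = [90, 5], the second partition's surplus of 15 moves through the buffer to the first partition, giving ([95.0, 5.0], 15.0).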
at the end of one allocation iteration, the token stays in a certain group with unfulfilled demand, which gives priority to that group in the following iteration, so that its customers get the unfulfilled part of their orders fulfilled with priority. in order to remember the unfulfilled order of the group of customers holding the gmt, the variable MC (memory of customer) is introduced. we need to stress that MC obtains a value for every customer from the groups that kept the token in the observed iteration, and it is used only during the allocation process in the following iteration. the formula for calculating the unfulfilled part of an order is:

MC_i = \frac{OC_i}{OG_j} \cdot (OG_j - AP_k) \cdot MCG_{ij}

where MC_i is the remembered quantity of the unfulfilled part of the order of customer i in iteration l that is transferred to the following iteration. if the gmt stays for several iterations in the same group of the first partition of important customers, the necessity of expanding capacities becomes clear.

5 conclusion

the goal of this paper was to create an algorithm for solving the problem of allocation of limited stocks to incoming orders. order fulfillment is carried out by letting the allocation algorithm grant stocks in line with the priority of customers, with the goal of providing a high level of customer service.
the algorithm proposed in this paper provides the following: (1) the classification into groups establishes the order of allocation, with the primary focus on satisfying the customers that are important for the company, in accordance with previously defined criteria; (2) the application of partitions helps certain lower-priority groups within protected partitions to be involved in the allocation, so that low-ranked customers are at least partially satisfied, which keeps all the customers in the system and sends a useful signal to the company's management that there is still unsatisfied demand and the production capacities need to be enlarged; (3) the introduction of the gmt allows all customers within a group to be delivered the backorders from the previous cycle, with an extended delivery lead time, which improves the overall customer service. the main differences between the allocation algorithm presented in this paper and the models described in the related work are: (1) this allocation algorithm directly affects long-term operating results by making customer service the primary goal, unlike other models that focus only on short-term profit generation, and (2) there is a tendency to bind customers of different priorities for the long term, i.e. to keep the customers within the system. in addition, there are differences in the basic goals of the models: the primary goal of the proposed algorithm is the maximization of customer service, unlike other systems that are primarily oriented towards profit maximization. further research will focus on testing and improving the algorithm based on real data obtained from other companies and different industries. there are also possibilities for further improvement of the allocation algorithm by creating a reporting system that would support management in making timely decisions on changes in operating strategy.
bibliography
[1] cederborg o., rudberg m.: customer segmentation and capable-to-promise in a capacity constrained manufacturing environment, 16th international annual euroma conference, göteborg, sweden, june 14-17, 2009.
[2] chan f.t., chung s.h.: a modified multi-criterion genetic algorithm for order fulfillment in manufacturing network, proceedings of the 9th asia pacific industrial engineering & management system conference, apiems, indonesia, 2008.
[3] chen j.h., lin j., wu y.s.: order promising rolling planning with atp/ctp reallocation mechanism, iems, apiems, vol. 7, no. 1, 2008.
[4] kaschel h., bernal l.m.s.: importance of flexibility in manufacturing systems, international journal of computers, communications & control, vol. i, no. 2, 2006.
[5] lawrence g.: introducing aps: getting production in lock step with customer demand, automotive manufacturing & production, vol. 110, issue 5, 1998.
[6] lin j., chen j.h.: enhance order promising with atp allocation planning considering material and capacity constraints, journal of the chinese institute of industrial engineers, vol. 22, no. 4, 2005.
[7] lupse v., dzitac i., dzitac s., manolescu a., manolescu m.-j.: crm kernel-based integrated information system for a sme: an object-oriented design, international journal of computers communications and control, issn 1841-9836, suppl. s, 3(s): 375-380, 2008.
[8] makatsoris h.c., chang y.s., richards h.d.: design of a distributed order promising system and environment for a globally dispersed supply chain, int. j. computer integrated manufacturing, vol. 17, no. 8, 2004.
[9] meyr h.: customer segmentation, allocation planning and order promising in make-to-stock production, or spectrum, vol. 31, no. 1, 2009.
[10] rudberg w., wikner j.: mass customization in terms of the customer order decoupling point, production planning & control, vol. 15, no. 4, 2004.
[11] wikner j., rudberg m.: introducing a customer order decoupling zone in logistics decision-making, international journal of logistics: research and applications, vol. 8, no. 3, 2005.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 2, pp. 125-134

design of a mimo pid robust controller using moments based approach

abdelmadjid bentayeb, nezha maamri, jean-claude trigeassou

abstract: in this paper we present a new technique for the synthesis and reduction of robust mimo controllers, based on a reference model and the moments approach, intended to control a mimo thermal system. the reference model allows one to specify the performance requirements for the closed loop and to improve the controller robustness, while the moments tool (frequency and time moments) is used to reduce the controller structure using a nonlinear optimization. the implementation on the real system associates this methodology of mimo pid controller synthesis with broïda's identification technique in order to carry out an auto-tuning procedure [2][11].
keywords: pid control, reference model, moments, optimization, robustness, broïda's identification technique.

1 introduction

the method of moments was introduced in a previous contribution [1] to show how we can treat single-input single-output (siso) control systems. we saw that the moments represent a good tool to obtain a reduced robust controller in order to bring the closed-loop behaviour close to that of a reference model. the reference model gathers all the performance requirements, such as response time and overshoot. an extension of this method to multivariable (multi-input multi-output) systems is presented in this paper; in this case the fundamental idea is to choose a diagonal reference model to make the input-output pairing [10], where the diagonal reference transfer functions are chosen using the same method as in the siso case.
in this paper we present our control methodology for mimo systems with an application to a thermal system. the control of the mimo thermal system is realized with the help of a pid controller using the moments-based approach and a reference model. the aim of this technique is to synthesize a reduced robust controller (a pid, for example) for the implementation. the identification was realized thanks to broïda's method, an elementary technique widely used in industry, in order to develop an auto-tuning procedure requiring minimum intervention from the user. the controller achieving the performances for the worst-case model is called the ideal controller [10]; it will be reduced using the moments-based approach to obtain a pid structure for implementation [4]. the reduced controller must preserve the same performances as the ideal one; this relies on the frequency and time moments, which represent a good tool for synthesis and analysis [7]. concretely, the reduction procedure is based on a nonlinear optimization whose initialization is given by the least squares algorithm [8]. the paper is organized as follows: in section 2 we give a complete description of the application; in section 3 we develop our synthesis methodology; in sections 4 and 5 we describe the moments theory and the reduction method; we finish this communication by presenting the different results obtained and a conclusion.

2 description

the general diagram of the temperature control device is illustrated by fig. 1 (figure 1: the general diagram of the system). it is composed of a radiator on which we placed two power transistors t1 and t2, controlled by the voltages u1 and u2, which vary between 0 and 10 volts. near these power transistors, which represent the heat source, two temperature sensors (pt 100) are laid out.
after adaptation and amplification, we obtain the voltages y1 and y2 (which vary between 0 and 10 volts), proportional to the temperatures θ1 and θ2. the voltage-temperature ratio is 0.02 volts/°c.

2.1 modelling

the aim is to define the model connecting the voltages delivered by the temperature sensors, y1 and y2, to the control voltages u1 and u2 applied to the transistors. the conduction of the heat produced by the power transistors through the radiator, towards the temperature sensors, is governed by the following diffusion equation:

\mu \frac{\partial^2 \theta}{\partial x^2} = \frac{\partial \theta}{\partial t}    (1)

where µ is the coefficient of the thermal diffusion process. consequently, the resulting physical model has distributed parameters, and we preferred a black-box model adapted to the synthesis of a control law, obtained from identification. let:

\begin{pmatrix} Y_1(s) \\ Y_2(s) \end{pmatrix} = \begin{pmatrix} G_{11}(s) & G_{12}(s) \\ G_{21}(s) & G_{22}(s) \end{pmatrix} \begin{pmatrix} U_1(s) \\ U_2(s) \end{pmatrix}    (2)

where G(s) is the transfer matrix of the system. notice that the system is symmetrical because of the arrangement of the heat sources and the position of the sensors. indeed, we should obtain:

G_{11}(s) = G_{22}(s), \quad G_{12}(s) = G_{21}(s)    (3)

in addition, since the sensors θ1 and θ2 are very close to the sources t1 and t2, the transfers G11 and G22 can reasonably be modelled by a first-order system, k_{11}/(1 + \tau_{11} s). on the other hand, θ2 is less sensitive to the effect of t1 than to that of t2 (so k21 < k11), with a definitely longer settling time. we can thus approximate this transfer by an aperiodic nth-order system with n >> 1, or more simply by a first-order system with a time delay, k_{12} e^{-\alpha_{12} s}/(1 + \tau_{12} s).

2.2 identification

several sophisticated identification techniques (minimization of a quadratic cost by least squares or nonlinear optimization; ljung [5] and walter [12]) can be used to estimate the parameters of G(s).
taking into account the fact that the aim is to synthesize pid controllers (with an auto-tuning objective), we preferred to choose a basic identification method requiring only step tests. our choice is thus broïda's method, which delivers an approximate model of the form k e^{-\alpha s}/(1 + \tau s), well adapted to the selected modelling [2][11]. let us recall that the coefficients {k, α, τ} are obtained with the help of the following formulas:

k_{ij} = y_\infty, \quad \alpha_{ij} = 2.8\, t_1 - 1.8\, t_2, \quad \tau_{ij} = 5.5\, (t_2 - t_1)    (4)

where y∞ is the final value of the system step response and t1 (respectively t2) is the time at which the output attains 28% (respectively 40%) of its final value. we made 5 tests, which provided:

        test1    test2    test3    test4    test5
k11     0.1221   0.1221   0.1172   0.1172   0.1318
τ11     577      414      449      451      457
k12     0.0634   0.0636   0.0586   0.0586   0.0684
τ12     910.5    862.95   684.2    683.1    863
α12     7        26.2     59.6     58.9     6
k21     0.083    0.0781   0.0684   0.0684   0.083
τ21     715      654.5    808.5    808.5    715
α21     113      99.8     73.4     73.4     113
k22     0.0977   0.0977   0.0977   0.0928   0.1074
τ22     450      445      447      447      600

the settling times of G11(s) and G22(s) are close and equal about 33 minutes.

2.3 the nominal model

the nominal model was obtained by averaging the 5 tests, thus:

G_{nom}(s) = \begin{pmatrix} \frac{0.122}{470s+1} & \frac{0.0625\, e^{-31.5s}}{801s+1} \\ \frac{0.0762\, e^{-94.5s}}{740s+1} & \frac{0.0987}{478s+1} \end{pmatrix}    (5)

from the values of the preceding table, we note that the theoretical symmetry is not verified in practice, although G11(s) and G22(s) are close (respectively G12(s) and G21(s)); these differences are the consequences of the noise disturbing the measurements and the imperfect reproducibility of the assembly.

2.4 taking uncertainties into account

identification uncertainties will deteriorate the performances of the control device if they are not taken into account during the synthesis.
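broïda's formulas (4) from the identification step above translate directly into code. the sketch below is ours (the function name and the synthetic test response are invented), not part of the original auto-tuning implementation; it reads t1 and t2 off a sampled step response:

```python
import numpy as np

def broida_identify(t, y, u_step=1.0):
    """Broida identification of k*exp(-alpha*s) / (1 + tau*s):
       k     = y_inf / u_step   (y_inf: final value of the response)
       alpha = 2.8*t1 - 1.8*t2
       tau   = 5.5*(t2 - t1)
    where t1 (t2) is the time at which the output reaches 28% (40%)
    of its final value."""
    y_inf = y[-1]
    t1 = t[np.argmax(y >= 0.28 * y_inf)]   # first sample above 28%
    t2 = t[np.argmax(y >= 0.40 * y_inf)]   # first sample above 40%
    return y_inf / u_step, 2.8 * t1 - 1.8 * t2, 5.5 * (t2 - t1)
```

on a simulated first-order-plus-delay response y(t) = 0.12·(1 − e^{−(t−10)/450}), for instance, the routine returns approximately k = 0.12, α = 10 and τ = 450, which is why the method suits the transfers identified here.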
the table recapitulating the 5 tests shows that two types of uncertainties are present:
• uncertainties on the d.c. gains (of low-frequency type).
• uncertainties on time constants and time delays, which cause modelling errors (of high-frequency type).

since the transfer matrix G(s) is theoretically symmetrical, we decided to base the synthesis on a symmetrical worst-case model G_p(s). using G_p(s), we took all the uncertainties into account by bounding:
• low-frequency uncertainties, by taking k_max.
• high-frequency uncertainties, by introducing a time delay e^{-\delta s} (see [1]).

remark 1. the worst-case model has been chosen to take into account all the identification uncertainties: low-frequency uncertainties, by choosing the maximum of the d.c. gains, and high-frequency uncertainties, by introducing time delays in all input-output transfers. thus, we obtain the following worst-case model:

G_p(s) = \begin{pmatrix} \frac{0.132\, e^{-5s}}{500s+1} & \frac{0.085\, e^{-125s}}{800s+1} \\ \frac{0.085\, e^{-125s}}{800s+1} & \frac{0.132\, e^{-5s}}{500s+1} \end{pmatrix}    (6)

3 synthesis

the control configuration is illustrated by fig. 2 (figure 2: unitary feedback configuration). C(s) is the controller to implement, G(s) is the plant, r represents the reference inputs, y the outputs to be controlled, u the control inputs, p_i and p_o the input and output disturbances, and e the tracking error. the synthesis of C(s) is based on internal model control [6][10], so:

C(s) = Q(s) \left( I - G(s) Q(s) \right)^{-1}    (7)

where Q(s) is any stable transfer matrix [3].

3.1 the reference model

Q(s) is calculated using G_p(s) in order to make the closed loop T_{yr}(s) approach a reference model T_{ref}(s), so:

T_{yr}(s) = G_p(s) Q(s) = T_{ref}(s)    (8)

the reference model gathers all the objectives of the synthesis in terms of stability and performances (i.e. input-output decoupling, settling time, etc.).
In our case, $T_{ref}(s)$ is given by:

$$T_{ref}(s) = \begin{pmatrix} \frac{t_2(s)\,e^{-\delta s}}{d(s)} & 0 \\ 0 & \frac{t_2(s)\,e^{-\delta s}}{d(s)} \end{pmatrix} \qquad (9)$$

where $t_2(s)$ represents the dominant poles fixing the closed-loop dynamics; $d(s)$ represents the auxiliary poles, which reduce the control input energy, confer robustness on the controller by providing a sufficient roll-off of $T_{yr}(s)$, and ensure its causality; finally, $e^{-\delta s}$ represents the singularity of $G_p(s)$, which is integrated into $T_{ref}(s)$ so that $Q(s)$ is stable and realizable. So $Q(s)$ is given by:

$$Q(s) = G_p(s)^{-1}\,T_{ref}(s) \qquad (10)$$

Since the singularities of $G_p(s)$ (i.e. time delays and RHP zeros) are integrated in the reference model $T_{ref}(s)$, the stability of $Q(s)$ is guaranteed [3]. $t_2(s)$ and $d(s)$ are given by:

$$t_2(s) = \frac{\omega_n^2}{s^2 + 2\xi\omega_n s + \omega_n^2},\qquad d(s) = (1 + \eta s)^n \qquad (11)$$

where $\omega_n$ and $\xi$ fix the settling time and the overshoot of the outputs. We can summarize our reference model by saying that it imposes input-output decoupling and the same dynamics for the two outputs of $G(s)$.

3.2 Robustness and the reference model

It is well known that the relative modelling errors $\Delta_r(s)$ modify the dynamics of the system, with a possibility of instability, because:

$$S_{yp_o}(s) = I - G(s)\,Q(s)\,(I + \Delta(s)\,Q(s))^{-1} = S_{nom}(s)\,(I + \Delta_r(s)\,T_{ref}(s))^{-1} \qquad (12)$$

So robustness of stability and performance is guaranteed if:

$$\|\Delta_r(s)\,T_{ref}(s)\|_\infty \le \delta_{sr}^{-1} \qquad (13)$$

where $\delta_{sr}$ represents the stability margin; notice that, from (13), we can act on the auxiliary poles of $T_{ref}(s)$ to ensure robustness [3].

3.3 The ideal controller

From (10), we deduce the ideal controller $C(s)$:

$$C(s) = Q(s)\,(I - G_p(s)\,Q(s))^{-1} \qquad (14)$$

It is clear that the implementation of $C(s)$ is very hard to carry out because of the time delays; for this reason we will reduce its structure to get an implementable one, which must preserve the same performance as that ensured by the ideal controller. The moments tool, the particularity of our approach, is used for the reduction.
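At a fixed frequency $s = j\omega$, equations (8), (10) and (14) reduce to complex matrix algebra, so the IMC design can be checked numerically. The sketch below is ours: it evaluates the worst-case model (6), a reference model of the form (9) (taking $\omega_n$, $\xi$ and $d(s)$ from the experimental section, and assuming $\delta = 125$, the largest delay of $G_p$), then forms $Q(j\omega)$ and the ideal controller $C(j\omega)$:

```python
import numpy as np

s = 1j * 0.01   # evaluation frequency w = 0.01 rad/s (our choice, illustrative)

def fopdt(k, tau, alpha):
    """Frequency response of k*exp(-alpha*s)/(1+tau*s) at the fixed s above."""
    return k * np.exp(-alpha * s) / (1 + tau * s)

# Worst-case model Gp(jw) of eq. (6)
Gp = np.array([[fopdt(0.132, 500, 5),   fopdt(0.085, 800, 125)],
               [fopdt(0.085, 800, 125), fopdt(0.132, 500, 5)]])

# Decoupling reference model Tref(jw) of eq. (9), with wn=0.02, xi=0.7, d(s)=(1+3s)^3
wn, xi = 0.02, 0.7
t2 = wn**2 / (s**2 + 2 * xi * wn * s + wn**2)
tref = t2 * np.exp(-125 * s) / (1 + 3 * s)**3     # delta = 125 s assumed
Tref = np.diag([tref, tref])

Q = np.linalg.inv(Gp) @ Tref                      # eq. (10)
C = Q @ np.linalg.inv(np.eye(2) - Gp @ Q)         # ideal controller, eq. (14)
```

By construction $G_p Q = T_{ref}$, and one can verify that the resulting closed loop $G_p C (I + G_p C)^{-1}$ equals $T_{ref}$ as well, which is the IMC identity behind (7) and (14).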
4 The moments

Let us consider a linear SISO system, characterized by its transfer function $g(s)$, analytic in the RHP (i.e. $\mathrm{Re}(s) > 0$), and let $g(t)$ be its impulse response:

$$g(s) = \int_0^\infty g(t)\,e^{-st}\,dt \qquad (15)$$

The transfer function is given by the following (not necessarily minimal) state-space realization:

$$g(s) = \begin{bmatrix} A & B \\ C & D \end{bmatrix} = C\,(sI - A)^{-1}B + D \qquad (16)$$

where $A \in \mathbb{R}^{n\times n}$, $B \in \mathbb{R}^{n\times 1}$, $C \in \mathbb{R}^{1\times n}$ and $D \in \mathbb{R}^{1\times 1}$.

4.1 Computing the moments using the state-space realization

Using the following equality:

$$(sI - A)\left(-A^{-1} - sA^{-2} - s^2A^{-3} - \cdots\right) = I \;\Rightarrow\; (sI - A)^{-1} = -\sum_{n=0}^{\infty} s^n A^{-(n+1)} \qquad (17)$$

Frequency moments. Making the change of variable $\mu = j\omega - j\omega_0$, equation (15) becomes:

$$g(\mu) = \sum_{n=0}^{\infty} (-1)^n \mu^n\, a_{n,\omega_0}(g) \qquad (18)$$

and (16):

$$g(\mu) = C\,(\mu I - (-j\omega_0 I + A))^{-1}B + D \qquad (19)$$

So we get:

$$a_{0,\omega_0}(g) = -C\,(-j\omega_0 I + A)^{-1}B + D \qquad (20)$$
$$a_{n,\omega_0}(g) = (-1)^{n+1}\,C\,(-j\omega_0 I + A)^{-(n+1)}B, \quad n = 1,\dots,\infty \qquad (21)$$

Remark 2. The time moments are given by setting $\omega_0 = 0$. Both time and frequency moments can be calculated easily using the previous algorithm. Thanks to the computation of the moments from the state-space realization, we can also compute moments for MIMO (multi-input multi-output) systems.

5 The controller reduction

For more details of our model reduction procedure, please refer to [1]. It is interesting to have a reduced structure for the implemented controller (a PID, for example) [6]. Let:

$$\theta = \begin{pmatrix} \theta_{11}(s) & \cdots & \theta_{1m}(s) \\ \vdots & \ddots & \vdots \\ \theta_{m1}(s) & \cdots & \theta_{mm}(s) \end{pmatrix} \qquad (22)$$

be the matrix representing the controller's parameters to be calculated. $\theta_{ij}$ represents the parameter vector of the numerator and denominator of $c_{r,ij}(s)$, i.e. the reduced controller between the $j$th input and the $i$th output. Let us define our cost function $J$ as the 2-norm of the errors between the moments of the ideal controller and those of the reduced one.
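Formulas (20)-(21) make the moments directly computable from any state-space realization. A minimal numpy sketch (names are ours), checked on $g(s) = 1/(1+s)$, whose time moments are all equal to 1 since $1/(1+s) = \sum_n (-1)^n s^n$:

```python
import numpy as np

def moments(A, B, C, D, w0=0.0, N=3):
    """Moments a_{n,w0}(g) of g(s) = C(sI-A)^{-1}B + D around s = j*w0 (eqs. 20-21)."""
    M = -1j * w0 * np.eye(A.shape[0]) + A               # (-j*w0*I + A)
    Minv = np.linalg.inv(M)
    a = [(-C @ Minv @ B + D).item()]                    # a_0, eq. (20)
    for n in range(1, N + 1):                           # a_n, eq. (21)
        a.append(((-1) ** (n + 1) * C @ np.linalg.matrix_power(Minv, n + 1) @ B).item())
    return np.array(a)

# g(s) = 1/(1+s): A = -1, B = 1, C = 1, D = 0
A = np.array([[-1.0]]); B = np.array([[1.0]])
C = np.array([[1.0]]); D = np.array([[0.0]])
m = moments(A, B, C, D)
```

Taking $\omega_0 \neq 0$ gives the frequency moments used for the reduction around the critical pulsation.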
$$J = \sum_{n=0}^{N} \|\varepsilon_n\|_2^2 = \sum_{n=0}^{N} \|a_{n,\omega_0}(C) - a_{n,\omega_0}(C_r)\|_2^2 \qquad (23)$$

where $a_{n,\omega_0}(C_r)$ represents the $n$th-order moments matrix, which is a function of the parameters $\theta$:

$$a_{n,\omega_0}(C_r) = f_n(\theta) \qquad (24)$$

Let:

$$J = \sum_{n=0}^{N} \|a_{n,\omega_0}(C) - f_n(\theta)\|_2^2 \qquad (25)$$

The objective is to determine the estimated parameters $\hat{\theta}$ minimizing $J$ around $\omega_0$. This frequency $\omega_0$ is chosen in order to preserve the stability of the system (i.e. the critical pulsation); see the generalized Nyquist criterion [10].

5.1 Linear optimization

The first step consists in imposing the common denominator $d_r(s)$ of the reduced controller (for example, we can take the auxiliary poles of the reference model, $d_r(s) = d(s)$). Then only the zeros have to be determined; the function $f_n(\theta)$ is linear, so the minimization of $J$ is obtained by least squares. Let:

$$C_r(s) = C_{ls}(s) \qquad (26)$$

be the reduced controller which will be used to initialize the nonlinear programming algorithm.

Remark 3. By imposing the poles of the reduced controller, for example a pole with an integral action, we define a PID structure. For example, we can take $d_r(s) = s\,(1 + \eta s)$.

5.2 Nonlinear optimization

Imposing the poles of the reduced controller limits its performance and consequently that of the closed loop. So it is preferable to optimize the whole structure (i.e. poles and zeros); the function $f_n(\theta)$ is then nonlinear, and the estimation of $\theta$ is obtained by nonlinear programming [7]. We use Marquardt's algorithm, which is a good compromise between rapidity and convergence [8].
The parameters are updated with the help of the following algorithm:

$$\hat{\theta}_{i+1} = \hat{\theta}_i - \left[(J'' + \lambda_i I)^{-1} J'\right]_{\theta = \hat{\theta}_i} \qquad (27)$$

where:

$$J' = \frac{\partial J}{\partial \theta}: \text{gradient vector} \qquad (28)$$
$$J'' = \frac{\partial^2 J}{\partial \theta^2}: \text{Hessian matrix} \qquad (29)$$
$$\lambda_i: \text{coefficient to adjust} \qquad (30)$$

The initialization is given by the least-squares solution:

$$\hat{\theta}_0 = \hat{\theta}_{ls} \qquad (31)$$

6 Results and comments

The controller synthesis and reduction technique was applied to the thermal system with 2 inputs and 2 outputs. Recall that this thermal system is characterized by a long open-loop settling time (approximately 33 minutes); moreover, the elementary modelling used in a disturbed context involves important uncertainties. The objective of the temperature control, in an auto-tuning context, is twofold:

• considerably reduce the closed-loop settling time by using a controller of simplified structure: a PID controller;
• guarantee robustness in spite of the simplicity of the controller, the level of uncertainty and the reduction of the settling time.

The implementation of the PID controllers was ensured thanks to the xPC Target software of Matlab, with a sample time $T_e = 1$ s. Our objective is to accelerate the system reasonably, by a ratio of 6, thus passing from 33 min in open loop to 5 min in closed loop, while maintaining the relative overshoot around 5%; for that, we fixed:

$$\omega_n = 0.02\ \text{rad/s},\qquad \xi = 0.7$$

The auxiliary poles ensuring the robustness condition (13) are:

$$d(s) = (1 + 3s)^3$$

The PID controller obtained using the synthesis and reduction procedures described above is:

$$C_r(s) = \begin{bmatrix} c_{11}(s) & c_{12}(s) \\ c_{21}(s) & c_{22}(s) \end{bmatrix} \qquad (32)$$

with:

$$c_{11}(s) = c_{22}(s) = \frac{-3.162\,s^2 + 52.81\,s + 0.14}{s\,(1 + 34.87\,s)},\qquad c_{12}(s) = c_{21}(s) = \frac{-2019\cdot 10^{-7}\,s^2 + 4403\cdot 10^{-6}\,s - 7193\cdot 10^{-5}}{s\,(1 + 15.88\,s)} \qquad (33)$$

The implementation provides the results illustrated by Fig. 3 and Fig. 4.
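The Marquardt update (27) can be sketched on a toy criterion; the scalar quadratic example below is ours, not the paper's controller problem, but the update rule is exactly (27):

```python
import numpy as np

def marquardt_step(theta, grad, hess, lam):
    """One Marquardt iteration, eq. (27): theta - (J'' + lam*I)^(-1) J'."""
    H = hess + lam * np.eye(len(theta))
    return theta - np.linalg.solve(H, grad)

# Toy criterion j(theta) = (theta - 3)^2: gradient 2*(theta - 3), Hessian 2
theta = np.array([0.0])
lam = 1.0
for _ in range(20):
    grad = 2 * (theta - 3)
    hess = np.array([[2.0]])
    theta = marquardt_step(theta, grad, hess, lam)
    lam *= 0.5     # shrink lambda as the iterate improves (Gauss-Newton limit)
```

Large $\lambda_i$ makes the step a damped gradient descent, small $\lambda_i$ a Newton step, which is the "compromise between rapidity and convergence" mentioned above.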
Figure 3: step responses (outputs $y_1$ and $y_2$, in volts, compared with the reference model response $T_{ref}$, over 0-6000 s).

Figure 4: the control inputs ($u_1$ and $u_2$, in volts, over 0-6000 s).

Remark 4. We can check that the system's responses correspond perfectly to those of the reference model. We can also check the perfect decoupling of the system outputs, $y_2$ being insensitive to the reference input applied to $y_1$ and reciprocally. Finally, it is verified that the control input obtained is completely reasonable, both with respect to its initial magnitude and to its insensitivity to the output noise (of considerable level).

7 Conclusion

We presented in this communication a synthesis and reduction technique of robust controllers applied to the multivariable control of a thermal system. This methodology is based on a reference model integrating explicitly the desired performances. The experimental results show that the closed-loop system verifies well the performances described by the reference model. The moments approach, characteristic of this methodology, allows the reduction of the ideal controller to a PID structure, while guaranteeing the dynamic performances and especially the robustness, as the experimental results testify. Finally, recall that this synthesis and reduction methodology was associated with Broïda's identification technique to carry out the auto-tuning of multivariable PID controllers.

Bibliography

[1] A. Bentayeb, N. Maamri and J-C. Trigeassou, "The moments in control: a tool for analysis, reduction and design", International Journal of Computers, Communications and Control, Vol. II (2007).

[2] A. Anon, "Special edition on PID tuning methods",
Computing and Control Engineering Journal, 10(2), (1999).

[3] J.C. Doyle, B.A. Francis and A. Tannenbaum, "Feedback Control Theory", Macmillan, New York (1991).

[4] C.E. Garcia and M. Morari, "Internal model control: a unifying review and some new results", Ind. Eng. Chem. Proc. Des. Dev., 21(2) (1982).

[5] L. Ljung, "System Identification: Theory for the User", Prentice Hall (1987).

[6] M. Morari and E. Zafiriou, "Robust Process Control", Prentice Hall (1989).

[7] N. Maamri, A. Bentayeb and J-C. Trigeassou, "Design and iterative optimization of reduced robust controllers with equality constraints", ROCOND, Milan (2003).

[8] D.W. Marquardt, "An algorithm for least-squares estimation of nonlinear parameters", Journal of the Soc. Indust. Appl. Math., 11(2) (1963).

[9] Q.G. Wang, "Decoupling Control", Lecture Notes in Control and Information Sciences, Springer Verlag (2003).

[10] S. Skogestad and I. Postlethwaite, "Multivariable Feedback Control", John Wiley (1996).

[11] R. Toscano, "A simple robust PI/PID controller design via numerical optimization approach", Journal of Process Control, 15 (2005).

[12] E. Walter and L. Pronzato, "Identification of Parametric Models from Experimental Data", Communications and Control Engineering Series, Springer (1997).

A. Bentayeb, N. Maamri and J-C. Trigeassou
University of Poitiers
Laboratoire d'Automatique et d'Informatique Industrielle
40 avenue du Recteur Pineau, 86022 Poitiers
E-mail: abdelmadjid.bentayeb@gmail.com

Received: January 16, 2007

Abdelmadjid Bentayeb was born in 1977; he obtained his PhD in automatic control at the University of Poitiers in 2006. His current research interests include robust control and model reduction. Nezha Maamri is a lecturer at the University of Poitiers; her current research interests include identification and robust control. Jean-Claude Trigeassou is a professor at the University of Poitiers; his current research interests include identification and control.

Int. J. Comput. Commun., ISSN 1841-9836, Vol. 7 (2012), No. 3 (September), pp.
450-458

Data Consistency in Emergency Management

D. Ergu, G. Kou, Y. Peng, F. Li, Y. Shi

Daji Ergu
College of Electrical and Information Engineering
Southwest University for Nationalities, Chengdu, China, 610041
E-mail: ergudaji@163.com

Gang Kou, Yi Peng, Feixiong Li
School of Management and Economics
University of Electronic Science and Technology of China, Chengdu, China, 610054
E-mail: kougang@yahoo.com; pengyicd@gmail.com; lifx@uestc.edu.cn

Yong Shi
1. Research Center on Fictitious Economy and Data Sciences, Chinese Academy of Sciences, Beijing 100190, China, and
2. College of Information Science & Technology, University of Nebraska at Omaha, Omaha, NE 68182, USA
E-mail: yshi@gucas.ac.cn

Abstract: Timely response is extremely important in emergency management. However, cardinally inconsistent data may exist in a judgment matrix because of limited expertise, preference conflicts, and the complex nature of the decision problems. The existing inconsistent-data processing models for positive reciprocal matrices are either complicated or dependent on the priority weights, which delays the decision-making process in an emergency. In this paper, a geometric mean induced bias matrix (GMIBM), which is based only on the original matrix A, is proposed to quickly identify the most inconsistent data in the judgment matrix. The correctness and effectiveness of the proposed model are proved mathematically and illustrated by two numerical examples. The results show that the proposed model not only preserves most of the original information in matrix A, but also is faster than existing methods.

Keywords: cardinal inconsistency, positive reciprocal matrix, geometric mean induced bias matrix (GMIBM), inconsistency identification

1 Introduction

Emergency management is an interdisciplinary field, and is in essence a complex multi-objective optimization problem [1].
Multi-criteria decision making (MCDM) methods have therefore been extensively employed to study emergency management, for instance the ordered weighted averaging (OWA) operator [2], the technique for order preference by similarity to ideal solution (TOPSIS) [3], the preference ranking organization method for enrichment evaluations (PROMETHEE) [4], the analytic hierarchy process (AHP) ([5], [6] and [7]), the analytic network process (ANP) ([8], [9]), the decision making trial and evaluation laboratory (DEMATEL) [10], the fusion approach of MCDM (FAMCDM) methods [11], etc. Among these MCDM methods, AHP and ANP are two of the most popular for studying emergency management, and they are usually used to assess emergency management performance, select the best emergency response or recovery alternatives, allocate relief resources reasonably, etc.

Copyright © 2006-2012 by CCC Publications

In the AHP/ANP, the tangible and intangible attributes or criteria are measured by numerical data obtained through pairwise comparisons and displayed in a judgment matrix whose entries are positive and reciprocal. The data of a judgment matrix are usually provided by experts and collected through questionnaire surveys [12]. Therefore, the data may be inconsistent because of limited expertise, preference conflicts, and the complex nature of the decision problems ([13], [14]). For instance, assume there are three emergency response alternatives A, B and C, and the ith expert judges that alternative A is preferred to B 2 times and B is preferred to C 4 times, but that A is preferred to C 3 times instead of 8 times. This is called cardinal inconsistency; for an inconsistent judgment, the inconsistent data should be identified and adjusted before it is used to make a valid decision. Therefore, the issue of processing inconsistent data in a judgment matrix has been a hot topic since the introduction of the AHP/ANP.
Currently, there are many methods for identifying and adjusting inconsistent data, for example the auto-adaptive algorithms in ([14], [16]), the absolute-difference methods in ([17], [18]), the perturbation matrix method in [19], etc. However, the existing methods are either too complicated, which slows down the inconsistent-data analysis, or fail to preserve most of the original information in the judgment matrix, or depend heavily on the priority weights. Therefore, it is necessary to propose a cardinal-inconsistency analysis model that identifies the inconsistent data simply and effectively in order to make a valid decision, that is independent of the priority weights, and that preserves most of the original information in the judgment matrix. In an attempt to establish such models, the absolute differences of the geometric mean matrix in [20] and the induced bias matrix model (IBMM) in [21] were proposed to identify the possible inconsistent data in a judgment matrix.

In this paper, a geometric mean induced bias matrix (GMIBM), which is based only on the original matrix A, is established to identify the most inconsistent data in a judgment matrix. By observing and adjusting the largest bias entries in the induced bias matrix, the consistency ratio of the judgment matrix can be quickly improved to make a fast decision for emergency management. Besides, a general estimating formula for the identified cardinal inconsistent data of the GMIBM is provided.

The remaining parts of this paper are organized as follows. The next section briefly describes cardinal inconsistency in a judgment matrix. The theorems, corollaries and identification processes of the GMIBM are proposed in Section 3. Two numerical examples introduced in [21] are used to test the proposed model in Section 4. Section 5 concludes the paper.
2 Cardinal inconsistency

Let the judgment matrix be $A = [a_{ij}]_{n\times n}$, where $a_{ij} > 0$ and $a_{ij} = 1/a_{ji}$ for all $i, j$. If $a_{ij} = a_{ik}a_{kj}$ holds for all $i, j$ and $k$, then matrix $A$ is said to be perfectly cardinally consistent; otherwise, it is cardinally inconsistent. In practice, it is unrealistic to obtain a perfectly cardinally consistent matrix $A$; therefore the AHP allows a certain level of cardinal inconsistency of the judgment matrix. To measure the consistency of a judgment matrix and determine a certain acceptable level of inconsistency, Saaty proposed a consistency index (CI), denoted as:

$$CI = \frac{\lambda_{max} - n}{n - 1} \qquad (1)$$

where $\lambda_{max}$ is the maximum eigenvalue of matrix $A$, and $n$ is the order of matrix $A$. To define a unique consistency test index that does not rely on the order of judgment matrices, the consistency index (CI) was extended and the consistency ratio (CR) method was further proposed by Saaty [17]:

$$CR = \frac{CI}{RI} \qquad (2)$$

where CI is the consistency index shown in equation (1), while RI is the average random index based on the matrix size, as shown in Table 1.

Table 1: The average random index

n   1  2  3     4     5     6     7     8    9     10
RI  0  0  0.52  0.89  1.11  1.25  1.35  1.4  1.45  1.49

If the CR of a judgment matrix is less than or equal to 0.1 (CR ≤ 0.1), indicating that the inconsistency is relatively small, the judgment matrix is said to be of acceptable inconsistency. If CR is greater than 0.1 (CR > 0.1), the judgment matrix is of unacceptable cardinal inconsistency, and the decision makers are asked to revise their judgments. To effectively identify the cardinally inconsistent data and improve the consistency ratio of a judgment matrix, a geometric mean induced bias matrix (GMIBM) based on the Hadamard product is proposed in the following sections.
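Equations (1)-(2) can be sketched in a few lines (the function name is ours; the RI values are those of Table 1). A perfectly consistent matrix $a_{ij} = w_i/w_j$ has $\lambda_{max} = n$ and hence CR = 0:

```python
import numpy as np

RI = {1: 0, 2: 0, 3: 0.52, 4: 0.89, 5: 1.11, 6: 1.25,
      7: 1.35, 8: 1.4, 9: 1.45, 10: 1.49}   # Table 1

def consistency_ratio(A):
    """CR = CI / RI with CI = (lambda_max - n)/(n - 1), eqs. (1)-(2)."""
    n = A.shape[0]
    lam_max = np.max(np.linalg.eigvals(A).real)
    ci = (lam_max - n) / (n - 1)
    return ci / RI[n]

# Consistent 3x3 matrix built from weights w: a_ij = w_i / w_j, so CR = 0
w = np.array([1.0, 2.0, 4.0])
A = np.outer(w, 1.0 / w)
cr = consistency_ratio(A)
```

Perturbing any off-diagonal pair $(a_{ij}, 1/a_{ij})$ raises $\lambda_{max}$ above $n$ and CR above 0.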
3 Geometric mean induced bias matrix (GMIBM)

3.1 Theorems of the GMIBM

To identify the inconsistent data, a geometric mean induced bias matrix (GMIBM for short, hereinafter), which is based only on the original judgment matrix and is independent of the priority weights, is established to amplify the most inconsistent data. The most inconsistent data can then be identified by observing the largest entries in the induced bias matrix. The related theorems and corollaries are presented in this section.

Theorem 1. The geometric mean induced bias matrix (GMIBM) $C$ is the all-ones matrix $U$ if the judgment matrix $A$ is perfectly consistent, that is:

$$C = \bar{A} \circ A^T = (c_{ij}) = \left(\sqrt[n]{\prod_{k=1}^{n} a_{ik}a_{kj}}\cdot a_{ji}\right) = U \quad \text{if } a_{ik}a_{kj} = a_{ij} \qquad (3)$$

where $\bar{A} = (\bar{a}_{ij})_{n\times n} = \left(\sqrt[n]{\prod_{k=1}^{n} a_{ik}a_{kj}}\right)_{n\times n}$ is the $n\times n$ geometric mean matrix composed of the geometric means of $a_{ik}a_{kj}$ over $k$, for all $i$ and $j$; $U$ is the $n\times n$ matrix whose entries are all ones; $n$ denotes the order of $A$; $A^T$ is the transpose of $A$; and the symbol $\circ$ denotes the Hadamard product (i.e. $C = A \circ B$ means $c_{ij} = a_{ij}b_{ij}$ for all $i$ and $j$).

Proof: If the judgment matrix satisfies the perfect consistency condition, namely $a_{ik}a_{kj} = a_{ij}$ for all $i, j$ and $k$, then, since $a_{ij} = 1/a_{ji}$, we have:

$$c_{ij} = \sqrt[n]{\prod_{k=1}^{n} a_{ik}a_{kj}}\cdot a_{ji} = \sqrt[n]{\prod_{k=1}^{n} a_{ij}}\cdot a_{ji} = \sqrt[n]{a_{ij}^{\,n}}\cdot a_{ji} = a_{ij}a_{ji} = 1 \qquad (4)$$

Therefore, all entries of matrix $C$ are ones if the matrix is perfectly consistent, and $C$ is the all-ones matrix $U$. □

To simplify the computation of the GMIBM and make Theorem 1 easier to apply, it is transformed into the following theorem.
Theorem 2. The geometric mean induced bias matrix (GMIBM) $C$ is the all-ones matrix $U$ if the judgment matrix $A$ is perfectly consistent, that is:

$$C = (L \times R) \circ A^T = (c_{ij}) = \left(\sqrt[n]{\prod_{k=1}^{n} a_{ik}}\cdot\sqrt[n]{\prod_{k=1}^{n} a_{kj}}\cdot a_{ji}\right) = U \quad \text{if } a_{ik}a_{kj} = a_{ij} \qquad (5)$$

where $L = \left(\sqrt[n]{\prod_{k=1}^{n} a_{ik}}\right)_{n\times 1}$ is an $n$-by-1 column matrix composed of the geometric means of the rows of $A$, while $R = \left(\sqrt[n]{\prod_{k=1}^{n} a_{kj}}\right)_{1\times n}$ is a 1-by-$n$ row matrix composed of the geometric means of the columns of $A$.

Corollary 3. The geometric mean induced bias matrix (GMIBM) $C$ should be as close as possible to the all-ones matrix $U$ if the judgment matrix $A$ is approximately consistent.

Corollary 4. There must be some entries of the geometric mean induced bias matrix (GMIBM) $C$ deviating far from one if the judgment matrix is inconsistent.

Based on Corollary 4, we can identify the most inconsistent data in matrix $A$ by observing the entries deviating the most from 1 in the geometric mean induced bias matrix (GMIBM) $C$. Details of the inconsistency identification process are presented below.

3.2 Inconsistency identification and adjustment processes of the GMIBM

To present the inconsistency identification and adjustment processes of the GMIBM based on the above theorems and corollaries, the aforementioned $n$-by-$n$ judgment matrix $A = [a_{ij}]_{n\times n}$ is used in the following. The process consists of two major steps, inconsistency identification and inconsistency adjustment.

Step I: inconsistency identification

Step 1: Compute the column matrix $L$ and the row matrix $R$, composed of the geometric means of the rows and of the columns of $A$, respectively:
$$L = \left(\sqrt[n]{\prod_{k=1}^{n} a_{1k}},\ \cdots,\ \sqrt[n]{\prod_{k=1}^{n} a_{ik}},\ \cdots,\ \sqrt[n]{\prod_{k=1}^{n} a_{jk}},\ \cdots,\ \sqrt[n]{\prod_{k=1}^{n} a_{nk}}\right)^T$$
$$R = \left(\sqrt[n]{\prod_{k=1}^{n} a_{k1}},\ \cdots,\ \sqrt[n]{\prod_{k=1}^{n} a_{ki}},\ \cdots,\ \sqrt[n]{\prod_{k=1}^{n} a_{kj}},\ \cdots,\ \sqrt[n]{\prod_{k=1}^{n} a_{kn}}\right) \qquad (6)$$

Step 2: Compute the geometric mean matrix by the formula:

$$\bar{A} = L \times R \qquad (7)$$

The computation proceeds as follows:

$$\bar{A} = L \times R = \left(\sqrt[n]{\prod_{k=1}^{n} a_{ik}}\right)_{n\times 1} \times \left(\sqrt[n]{\prod_{k=1}^{n} a_{kj}}\right)_{1\times n} = \left(\sqrt[n]{\prod_{k=1}^{n} a_{ik}a_{kj}}\right)_{n\times n}$$

where $L$ is the $n$-by-1 column matrix composed of all the geometric means of the rows, and $R$ is the 1-by-$n$ row matrix composed of all the geometric means of the columns, as shown in formula (6). The geometric mean matrix $\bar{A}$ is thus easily computed by multiplying $L$ by $R$.
Step 3: Compute the geometric mean induced bias matrix (GMIBM) $C$ by the formula:

$$C = \bar{A} \circ A^T = (c_{ij}) = (\bar{a}_{ij}\cdot a_{ji}) = \left(\sqrt[n]{\prod_{k=1}^{n} a_{ik}a_{kj}}\cdot a_{ji}\right) \qquad (8)$$

Step 4: Identify the entry with the largest value, denoted $c^{max}_{ij}$, deviating from 1 in matrix $C$; the corresponding $a_{ij}$ is then regarded as the most inconsistent entry in matrix $A$. If there are other entries, say $c_{mn}$ and $c_{pq}$, whose values also deviate far from 1, then their corresponding entries $a_{mn}$ and $a_{pq}$ in matrix $A$ can also be considered possible inconsistent elements. Once the inconsistent data are identified, the following two steps are used to adjust them.

Step II: inconsistency adjustment

Step 1: Estimate the value of the identified inconsistent entry by the formula:

$$\tilde{a}_{ij} = \sqrt[n-2]{\prod_{k=1,\,k\neq i,j}^{n} a_{ik}a_{kj}} = \sqrt[n-2]{\frac{\bar{a}_{ij}^{\,n}}{a_{ij}^{2}}} = \bar{a}_{ij}\left(\frac{\bar{a}_{ij}}{a_{ij}}\right)^{\frac{2}{n-2}} \qquad (9)$$

where $\tilde{a}_{ij}$ denotes the estimated value of the most inconsistent entry $a_{ij}$, while $\bar{a}_{ij}$ is the geometric mean value located at the $i$th row and $j$th column of the geometric mean matrix $\bar{A}$.

Step 2: Test the consistency of the revised matrix $A$, obtained by replacing the inconsistent entries with the estimated values.

To summarize, the process of dealing with inconsistency by the GMIBM includes two major steps, namely inconsistency identification and inconsistency adjustment. The first two steps of Step I show the specific procedure for computing the geometric mean matrix $\bar{A}$; for simplicity, one can directly use the latter two steps of Step I to identify the most inconsistent data. To verify the effectiveness and accuracy of the GMIBM, two numerical examples introduced in [20] are used below to illustrate the proposed model.

4 Illustrative examples

To test the effectiveness and correctness of the proposed GMIBM, and to illustrate the proposed inconsistency identification and adjustment process on numerical examples, the two numerical examples of [21] are used in this paper.
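The identification procedure can be checked numerically. The sketch below (the helper name `gmibm` is ours, assuming numpy) verifies Theorem 1 on a perfectly consistent matrix and then applies Steps I-II to the 4×4 matrix of Example 1 in the next section; the numbers it produces match the text:

```python
import numpy as np

def gmibm(A):
    """Geometric mean induced bias matrix C = (L x R) o A^T, eqs. (5)-(8)."""
    n = A.shape[0]
    L = np.prod(A, axis=1) ** (1.0 / n)   # row geometric means (column matrix L)
    R = np.prod(A, axis=0) ** (1.0 / n)   # column geometric means (row matrix R)
    Abar = np.outer(L, R)                 # geometric mean matrix, eq. (7)
    return Abar * A.T, Abar               # Hadamard product with A^T, eq. (8)

# Theorem 1: for a perfectly consistent matrix a_ij = w_i / w_j, C is all ones
w = np.array([1.0, 3.0, 5.0, 9.0])
C_cons, _ = gmibm(np.outer(w, 1.0 / w))

# Step I on the inconsistent matrix of Example 1 (CR = 0.173)
A = np.array([[1, 1/9, 3, 1/5],
              [9, 1, 5, 2],
              [1/3, 1/5, 1, 1/2],
              [5, 1/2, 2, 1]])
C, Abar = gmibm(A)
i, j = np.unravel_index(np.argmax(C), C.shape)   # largest bias: c_31 = 2.5227

# Step II, eq. (9): estimate for the identified entry a_31
n = A.shape[0]
a_est = Abar[i, j] * (Abar[i, j] / A[i, j]) ** (2.0 / (n - 2))
```

Note that the whole procedure uses only products and one matrix transpose of the original $A$: no eigenvector or priority-weight computation is needed, which is the source of the claimed speed-up.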
Example 1. The first example used in [21], which was originally introduced in [20], is a 4×4 inconsistent pairwise comparison matrix $A$ with CR = 0.173 > 0.1:

$$A = \begin{pmatrix} 1 & 1/9 & 3 & 1/5 \\ 9 & 1 & 5 & 2 \\ 1/3 & 1/5 & 1 & 1/2 \\ 5 & 1/2 & 2 & 1 \end{pmatrix}$$

Apply the GMIBM to this matrix.

Step I: inconsistency identification

Step 1: Compute the column matrix $L$ and the row matrix $R$ by formula (6):

$$L = (0.5081,\ 3.0801,\ 0.4273,\ 1.4953)^T,\qquad R = (1.968,\ 0.3247,\ 2.3403,\ 0.6687)$$

Step 2: Compute the geometric mean matrix by formula (7):

$$\bar{A} = L \times R = \begin{pmatrix} 1 & 0.165 & 1.1892 & 0.3398 \\ 6.0615 & 1 & 7.2084 & 2.0598 \\ 0.8409 & 0.1387 & 1 & 0.2857 \\ 2.9428 & 0.4855 & 3.4996 & 1 \end{pmatrix}$$

Step 3: Compute the geometric mean induced bias matrix (GMIBM) $C$ by formula (8):

$$C = \bar{A} \circ A^T = \begin{pmatrix} 1 & 1.4848 & 0.3964 & 1.6990 \\ 0.6735 & 1 & 1.4417 & 1.0299 \\ 2.5227 & 0.6936 & 1 & 0.5715 \\ 0.5886 & 0.9710 & 1.7498 & 1 \end{pmatrix}$$

Step 4: Identify the largest value $c^{max}_{ij}$ in matrix $C$. Here $c^{max}_{ij} = c^{max}_{31} = 2.5227$, deviating from 1, so the corresponding element $a_{31}$ of matrix $A$ is regarded as the most inconsistent element, indicating that it is smaller than its average value.

Step II: inconsistency adjustment

Step 1: Estimate the proper value of $a_{31}$ using the estimating formula (9):

$$\tilde{a}_{31} = \sqrt[4-2]{\frac{\bar{a}_{31}^{\,4}}{a_{31}^2}} = \sqrt{\frac{0.8409^4}{(1/3)^2}} = 2.1213 \approx 2$$

Step 2: Test the consistency of the revised matrix $A$, obtained by replacing the inconsistent elements $a_{31}$ and $a_{13}$ with the estimated values 2 and 1/2. The revised matrix passes with CR = 0.0028 < 0.1. The identified inconsistent entry and its estimated value are the same as those in [19] and [20], but the proposed method finds the inconsistent element and estimates its value faster.

Example 2. The second example in [20] is a 4×4 inconsistent pairwise comparison matrix $A$ with CR = 1.0242 > 0.1.
$$A = \begin{pmatrix} 1 & 2 & 4 & 1/8 \\ 1/2 & 1 & 2 & 4 \\ 1/4 & 1/2 & 1 & 2 \\ 8 & 1/4 & 1/2 & 1 \end{pmatrix}$$

Apply the GMIBM to this matrix.

Step I: inconsistency identification

Step 1: Compute the column matrix $L$ and the row matrix $R$ by formula (6):

$$L = (1,\ 1.4142,\ 0.7071,\ 1)^T,\qquad R = (1,\ 0.7071,\ 1.4142,\ 1)$$

Step 2: Compute the geometric mean matrix by formula (7):

$$\bar{A} = L \times R = \begin{pmatrix} 1 & 0.7071 & 1.4142 & 1 \\ 1.4142 & 1 & 2 & 1.4142 \\ 0.7071 & 0.5 & 1 & 0.7071 \\ 1 & 0.7071 & 1.4142 & 1 \end{pmatrix}$$

Step 3: Compute the geometric mean induced bias matrix (GMIBM) $C$ by formula (8):

$$C = \bar{A} \circ A^T = \begin{pmatrix} 1 & 0.3536 & 0.3536 & 8 \\ 2.8284 & 1 & 1 & 0.3536 \\ 2.8284 & 1 & 1 & 0.3536 \\ 0.125 & 2.8284 & 2.8284 & 1 \end{pmatrix}$$

Step 4: Identify the largest value $c^{max}_{ij}$ in matrix $C$. Here $c^{max}_{ij} = c^{max}_{14} = 8$, deviating from 1, so the corresponding element $a_{14}$ of matrix $A$ is regarded as the most inconsistent element, indicating that it is smaller than its average value.

Step II: inconsistency adjustment

Step 1: Estimate the proper value of $a_{14}$ using the estimating formula (9):

$$\tilde{a}_{14} = \bar{a}_{14}\left(\frac{\bar{a}_{14}}{a_{14}}\right)^{\frac{2}{4-2}} = 1\cdot\left(\frac{1}{1/8}\right) = 8$$

Step 2: Test the consistency of the revised matrix $A$, obtained by replacing the inconsistent elements $a_{14}$ and $a_{41}$ with the estimated values 8 and 1/8. The revised matrix passes with CR = 0 < 0.1. The identified inconsistent entry and its estimated value are the same as those in [20], but the proposed method finds the inconsistent data and estimates the value faster.

5 Conclusions

In this paper we proposed a geometric mean induced bias matrix (GMIBM), which is based only on the original matrix and is independent of the way the priority weights are derived, to identify the cardinally inconsistent data in a judgment matrix. The inconsistent-data identification process includes two major steps, namely inconsistency identification and inconsistency adjustment.
The inconsistent data can be easily and quickly identified by observing the entries with the largest value(s) deviating from 1 in the induced bias matrix $C$. Besides, the identified data can be corrected by the estimating formula. Two examples were used to illustrate the proposed model. The results show that the proposed model identifies and adjusts the inconsistent data more easily and faster than existing models.

Acknowledgments

This research has been partially supported by grants from Academic Degree Programs Construction at Southwest University for Nationalities (#2012XWD-S1201), grants from the National Natural Science Foundation of China (#70901015 and #70921061), the Fundamental Research Funds for the Central Universities, and the Program for New Century Excellent Talents in University (NCET-10-0293). No additional external funding was received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Bibliography

[1] Tufekci, S., Wallace, W.A., The emerging area of emergency management and engineering, IEEE Transactions on Engineering Management, 45(2):103-105, 1998.

[2] Shan, L., Zhao, Z.P., Evaluation on emergency management system in supply chain based on trapezoidal fuzzy order weighted average (FOWA) operator, Applied Mechanics and Materials, 34-35:1170-1174, 2010.

[3] Chen, Y., Li, K., Xu, H.Y., A DEA-TOPSIS method for multiple criteria decision analysis in emergency management, Journal of Systems Science and Systems Engineering, 18(4):489-507, 2009.

[4] Zhang, K., Kluck, C., Achari, G., A comparative approach for ranking contaminated sites based on the risk assessment paradigm using fuzzy PROMETHEE, Environmental Management, 44(5):952-967, 2009.

[5] Tie, Y.B., Tang, C., Zhou, C.H.,
The application of AHP to emergency response capability assessment in urban disaster, Journal of Geological Hazards and Environment Preservation, 16(4):433-437, 2005.

[6] Ergu, D., Kou, G., Peng, Y., Shi, Y., Shi, Y., The analytic hierarchy process: task scheduling and resource allocation in cloud computing environment, Journal of Supercomputing, doi: 10.1007/s11227-011-0625-1, 2011.

[7] Wu, W., Kou, G., Peng, Y., Ergu, D., Improved AHP-group decision making for investment strategy selection, Technological and Economic Development of Economy, 18(2), 2012. doi: 10.3846/20294913.2012.680520

[8] Ergu, D., Kou, G., Shi, Y., Shi, Y., Analytic network process in risk assessment and decision analysis, Computers & Operations Research, 2011. doi: 10.1016/j.cor.2011.03.005

[9] Levy, J.K., Taji, K., Group decision support for hazards planning and emergency management: a group analytic network process (GANP) approach, Mathematical and Computer Modelling, 46(7-8):906-917, 2007.

[10] Zhou, Q., Huang, W., Zhang, Y., Identifying critical success factors in emergency management using a fuzzy DEMATEL method, Safety Science, 49(2):243-252, 2011.

[11] Peng, Y., Kou, G., Wang, G., Shi, Y., FAMCDM: a fusion approach of MCDM methods to rank multiclass classification algorithms, Omega, 39(6):677-689, 2011. doi: 10.1016/j.omega.2011.01.009

[12] Ergu, D., Kou, G., Questionnaire design improvement and missing item scores estimation for rapid and efficient decision making, Annals of Operations Research, doi: 10.1007/s10479-011-0922-3, 2011.

[13] Filip, F.G., Donciulescu, D.A., Filip, Cr.I., Towards intelligent real-time decision support systems for industrial milieu, Studies in Informatics and Control, 11(4):303-311, 2001.

[14] Filip, F.G., Decision support and control for large-scale complex systems, Annual Reviews in Control, 32(1):61-70, 2008.

[15] Xu, Z., Wei, C., A consistency improving method in the analytic hierarchy process, European Journal of Operational Research, 116:443-449, 1999.
[16] cao, d., leung, l.c., law, j.s., modifying inconsistent comparison matrix in analytic hierarchy process: a heuristic approach, decision support systems ,44:944–953, 2008. [17] saaty, t.l., the analytical hierarchy process, new york: mcgraw-hill, 1980. [18] saaty, t.l., how to make a decision: the analytic hierarchy process, interfaces, 24:19-43, 1994. [19] saaty, t.l., decision-making with the ahp: why is the principal eigenvector necessary, european journal of operational research, 145(1): 85-91, 2003. [20] yang, y.q., study on adjustment method for the inconsistency of the judgment matrix in ahp, operations research and management science, 8(3):12-16,1999 (in chinese). [21] ergu, d., kou, g., peng, y., shi, y., a simple method to improve the consistency ratio of the pairwise comparison matrix in anp, european journal of operational research, 213(1):246–259, 2011. international journal of computers, communications & control vol. i (2006), no. 2, pp. 41-51 a new technique for texture classification using markov random fields mauricio gomez, renato a. salinas abstract: this paper proposes, applies and evaluates a new technique for texture classification in digital images. the work describes, as far as possible in a quantitative way, the concept of texture in digital images. furthermore, we developed an innovative model that allows classifying and characterizing texture in digital images, to be used as a useful tool in noninvasive inspection of visual surfaces. the proposed methodology extracts the statistical order from an image of texture. the extraction of the high statistical order has been made using as a tool markov random fields. the backpropagation neural net is used for designing a classification module that will serve to test the performance of the configuration histograms, which are based on the statistical order. furthermore, the research suggests the evaluation of the proposed technique from a qualitative perspective. 
Keywords: texture, backpropagation, configuration histograms, classification, Markov random fields.

1 Introduction

Digital image processing is a current subject of research that has been developed for a long time. From all the invested effort, a multitude of applications has arisen in quite diverse fields such as industry, medicine and space, where very different tasks are performed, such as codification, edge detection and color processing. We are therefore facing a subject in constant growth and expansion, one that is giving rise to a large number of projects and publications. One of the characteristics almost always present in an image is texture, which generally can be associated with different elements of the scene. In order to apply the concept of texture to the digital analysis of images, a quantitative characterization of texture is required. Nevertheless, there exists neither a precise definition nor a mathematical expression for the quantitative description of this property. For the analysis of texture, there exist three widely used formulations in image processing: statistical [1], structural and spectral [2]. The statistical formulation indicates whether an image is smooth, rustic, granulated, etc. [1]. The structural technique, on the other hand, indicates the primitive features that exist in the image, such as the regularity of parallel lines; and the spectral technique is based on energy properties and is used mainly to detect global regularity in an image, indicated by small peaks of high energy in its spectrum [2]. These techniques can be used separately or jointly to detect different textures. The detection of textures has a wide spectrum of applications, mainly in the following areas:

• Medicine: in several types of images (X-rays, ultrasound, magnetic resonance, etc.), the properties of the texture are important for diagnosis.
Cancer is frequently characterized by texture analysis in different types of medical images; texture techniques have also been applied satisfactorily to the detection of abnormalities in mammographic images [3].

• Remote sensing: numerous formulations for the recognition of textures in remotely sensed images have been proposed. Their applications include land classification, cloud classification, recognition of seismic patterns, etc.

• Industrial inspection: in industrial processes, the detection of texture in defective manufactured products or natural materials is of crucial importance. Manual inspection is frequently tedious and laborious; for that reason, automation is very useful [4].

However, none of these classic descriptions leads to a universally accepted quantitative measurement of texture. Due to this difficulty, research in this field remains interesting until a method is obtained that is satisfactory for most applications. This research is an attempt to describe, in a quantitative way, the concept of texture in digitized images. Furthermore, it develops an innovative model that allows classifying and characterizing textures in digitized images, which can be used as a useful tool in noninvasive visual inspection.

2 Models of Texture

In the image processing literature, texture is often defined as the spatial interaction between the values of pixels. That is to say, texture analysis attempts to capture its visual characteristics in an analytical way, in order to model those spatial interactions mathematically. By modeling the visual characteristics successfully, we will be able to associate and discriminate different textures analytically. This type of analysis requires that the number and types of textures be known in advance.
This model only needs to capture sufficient textural characteristics on a training set to classify the texture; however, the procedure is best suited to textures that belong to the training set. Among the models for understanding texture, four stand out in the digital image processing literature.

Statistical models of texture: these are based on a set of features representing the characteristics of the texture of an image. Those features are contrast, correlation, entropy, etc. They are usually derived from gray-level measurements of the image, gray-level differences, or the co-occurrence matrix [1]. The features are selected heuristically; nevertheless, an image similar to the analyzed one cannot be recreated from some measurement of the feature set.

Structural models of texture: some textures can be seen as two-dimensional patterns composed of a set of primitives or sub-patterns, which are organized according to a certain placement rule. Examples of such textures are brick walls and mosaics; the primitives used are areas of constant gray level, lines, curves and polygons. The correct identification of those primitives is quite difficult. However, if the texture primitives are identified completely, then it is possible to recreate the texture from the primitives. A work using a structural model is described in [4].

Signal processing models of texture: psychological researchers have given evidence that the human brain performs a frequency analysis of images. Texture is especially suited to this type of analysis due to its properties. Most of these techniques process certain characteristics of filtered images, which are then used in classification and segmentation tasks. Among the methods we can mention filters in the spatial domain, Fourier analysis, wavelets and Gabor filters. Works using this type of formulation are given in [5].
Stochastic models of texture: a texture is assumed to be the realization of a stochastic process governed by some parameters. The analysis is performed by defining a model and estimating its parameters. In this way, the stochastic processes can be reproduced from the model and its associated parameters. The estimated parameters can serve to classify and segment textures. This type of model offers a good possibility of recreating realistic examples of natural textures. A stochastic formulation of texture is used in [6].

3 Our Model of Texture

As seen previously, there are several problems associated with modeling texture. A key requirement of our model of texture analysis is that it be able to distinguish between different textures. Conventional techniques have been based on prior knowledge of the number and types of textures to be analyzed. In our model we shall use the same philosophy of analysis. Therefore, we need to construct a model that can take a texture and capture the totality of the characteristics that distinguish that group of textures. But in what way can we capture the real characteristics that distinguish a group of textures? And what will be our approach toward constructing such characteristics? We believe that an understanding of the statistical order, applied to images of digitized textures, will allow us to capture the real characteristics that distinguish natural textures. The statistical understanding of texture was proposed by Julesz as early as 1962. The statistical order in the understanding of texture, applied to digital image processing, has been used in [1], where second-order statistics are used to define 14 types of textural features from the co-occurrence matrix.
Texture models based on auto-models and multilevel logistic models also use second-order statistics; nevertheless, they have not provided a base of understanding applicable to natural textures, although in [7] modest results were reached using a Gaussian Markov random field model that uses precisely second-order statistics. Models that have managed to recreate natural textures using several multiresolution filters do not use third or higher statistical orders and, moreover, become ill-defined if the selection of the filters is globally optimal for all the analyzed textures. Julesz had already indicated in 1962 the existence of textural information in the high statistical orders. Starting from the knowledge already generated and using the approach to human visual perception proposed by Julesz, we have raised our hypothesis applied to textures in digitized images: to model textures, models of different statistical orders are required. For this we have chosen Markov random fields as the basis of our texture model, due to their flexibility in defining the statistical dimension and order.

4 Markov Random Fields

The theory of Markov random fields (MRF) provides suitable tools for modeling context-dependent entities such as pixels or other spatially correlated characteristics [8], [9]. Markov random fields are defined with respect to a system of neighborhoods. The mathematical interpretation of the model is defined with respect to the corresponding set of cliques. In this research, a systematic method has been used for the total extraction of the set of local cliques from any neighborhood system on which the Markov random field can be defined.
The fundamental property of an MRF is that, given a point j on the lattice, the probability of that point given the values of all the other points of the lattice is the same as the probability of that point j given only the values of the points neighboring j [8, 9]. In other words, MRFs are characterized by a conditional probability function defined with respect to a system of neighborhoods. We denote a set of sites on a lattice by S and the system of neighborhoods on S as G = {G_s, s ∈ S}, where G_s is the set of neighbors of s, such that G_s ⊂ S and s ∉ G_s. Given the variable X_s at site s with value x_s, the conditional probability function of a Markov random field with its respective neighborhood system G is defined by the theorem of Hammersley and Clifford [9] as:

p(X_s = x_s | X_r = x_r, r ∈ G_s) = (1/Z_s) exp{−Σ_{c∈C_s} V_c(x)}   (1)

where Z_s is a constant and V_c is a potential function defined on clique c. The sum is over all cliques in the local set of cliques C_s.

Figure 1: Three different neighborhoods.

Figure 2: Tree diagram for the 3x3 neighborhood.

The theorem of Hammersley and Clifford indicates that, for a system of neighborhoods, it is necessary that s ∈ G_r ⟺ r ∈ G_s. Three different neighborhoods are shown in Figure 1. Given a system of neighborhoods G, a clique is a set c ⊆ S such that if s, r ∈ c and s ≠ r, then s ∈ G_r. That is, all pairs of distinct sites in a clique are neighbors. The local set of cliques for site s is thus defined as:

C_s = {c ⊆ S : s ∈ c}   (2)

The method used for the extraction of the local set of cliques from a neighborhood system of an MRF is based on a tree diagram structure [10].
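As an illustrative sketch (not the authors' implementation), the local set of cliques of Eq. (2) can be enumerated programmatically from any symmetric neighborhood system. The function below grows subsets of G_s and keeps those in which every pair of distinct sites are mutual neighbors; the dict-based encoding of the neighborhood system is an assumption made for this example.

```python
from itertools import combinations

def local_cliques(s, neighbours, max_order=3):
    """Enumerate the local set of cliques C_s = {c : s in c} up to a
    given order.  `neighbours` maps each site to its neighbour set and
    must be symmetric (r in neighbours[t] iff t in neighbours[r]).
    A clique is a set in which all pairs of distinct sites are mutual
    neighbours, so candidate subsets of G_s that fail the all-pairs
    test are discarded."""
    cliques = [{s}]                       # the singleton clique
    for k in range(1, max_order):
        for subset in combinations(sorted(neighbours[s]), k):
            cand = {s, *subset}
            if all(b in neighbours[a] for a, b in combinations(cand, 2)):
                cliques.append(cand)
    return cliques
```

For the first-order (4-neighbor) system of Figure 1, this yields only the singleton and the four pair cliques, since no two of the four neighbors are themselves neighbors.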
The root of the tree represents a single site, whereas the branches at the first level represent all the pairwise connections between the site and each of the components of the neighborhood. The branches at higher levels represent higher-order connections that form more complex cliques. Each level in the tree diagram is represented by a node, which is indicated by a number within the previously defined neighborhood. For example, the tree diagram for the neighborhood of Figure 1.b is shown in Figure 2.

5 Characterization of Texture Using Order Statistics

In order to characterize the texture of a digitized image, we use a vector called the directional homogeneity vector [10], which considers the relation between the pixel corresponding to a site s and all the remaining pixels of the neighborhood G_s. The relation to be established between the site and the other pixels of the neighborhood depends on the type of clique and the statistical order used. Figure 1 shows different types of neighborhood with site s. Let us suppose that X is a random field of the class described in [8, 9], with a discrete range space Z = {z_k, k = 1, ..., m}. Let us also suppose that we have an instance x of this random field that can be used for the estimation of the parameters of the distribution. Figure 3 shows a site (i, j) and its neighborhood system with the notation employed.

Figure 3: Site s and its neighborhood (neighbors u_1, ..., u_4 and v_1, ..., v_4).

Furthermore, we define the homogeneity function I such that:

I(z_1, ..., z_k) = −1, if z_1 = z_2 = ... = z_k; 1, otherwise   (3)

For cliques of first order, this homogeneity function is defined as:

I(s) = −1, if s = z_m; 0, otherwise   (4)

Using the terms defined so far, we can express the potential functions of a Gibbs distribution as

V(s, e, θ) = φ(s, e)^T θ   (5)

where θ is the vector of parameters and φ(s, e) is the vector that we will call the directional homogeneity vector. For example, for the neighborhood of Figure 3, the third-order component of the vector, using the tree of cliques (see Figure 2), is defined as:

φ_3 = ( I(s, u_2, v_2) + I(s, u_4, u_3) + I(s, u_1, v_4),
        I(s, u_4, v_3) + I(s, u_2, u_3) + I(s, u_1, v_1),
        I(s, u_2, v_1) + I(s, u_1, u_4) + I(s, u_3, v_3),
        I(s, u_1, u_2) + I(s, u_4, v_4) + I(s, u_3, v_2) )   (6)

The set of components φ_k of the vector φ represents the different statistical orders extracted from the image, where k is the statistical order, with k ∈ {1, 2, 3, 4} for a second-order neighborhood like the one shown in Figure 3.

6 Direct Measurement of the Statistical Order

This section presents our proposal for characterizing digitized texture images, which makes use of the properties of the vector φ at each point of the digitized image. Taking expressions (3), (4) and (5) for a second-order neighborhood (Figure 3) and applying them to the calculation of the components of φ, the values that the components can take are the following:

φ_1 ∈ {0, 1}   (7)
φ_2 ∈ {−2, 0, 2}   (8)
φ_3 ∈ {−3, −1, 1, 3}   (9)
φ_4 ∈ {−1, 1}   (10)

For the possible values of each component φ_k, we establish an equivalence between the value and its respective binary, ternary or quaternary digit, as appropriate. Table 1 illustrates the equivalence for the ternary case.

Table 1: Equivalence between value and ternary digit.
value: −2, 0, 2 → ternary digit: 0, 1, 2

Figure 4: Brodatz textures used in our experiments. Top row: D22, D103, D105 and D79; bottom row: D76, D37, D5 and D28.
Since our interest is only in the values taken by the components of the vector φ, we work with a decimal number instead of its vector array. In this way, given a digitized texture image and applying the different statistical orders through the different cliques used, we obtain histograms of frequency versus directionality, which are different and particular to each of the analyzed textures.

7 Classification of Textures Using Histograms of Configurations

The texture classification process is based on the configuration histograms proposed in the previous section. In the present work we have used a set of 8 texture classes obtained from the Brodatz texture album [11]. Each image has a dimension of 640x640 pixels with 256 gray levels (8 bits/pixel). The texture images used are shown in Figure 4. From each of these images, 32 sub-images of 64x64 pixels have been extracted, and for each sub-image the configuration histogram has been computed. Some of these histograms can be seen in Figures 6, 7 and 8. The way in which the 32 sub-images have been extracted is explained by the scheme of Figure 5, which represents the 32 sub-images taken from one of the images shown in Figure 4: the white squares represent the 32 sub-images and the dark background represents the larger texture image.

Figure 5: Layout employed to select the training and validation sub-images. Each cell represents a sub-image, and every cell containing an asterisk is used for training.

Figure 6: Texture D22 and its corresponding configuration histogram.

The sub-images marked with an asterisk have been taken to construct the characteristic vectors used to train the classification module. This procedure has been performed with the eight texture images shown in Figure 4.
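The decimal encoding and the histogram construction can be sketched as follows. The digit table extends the ternary equivalence of Table 1 to the four third-order values of Eq. (9); both that mapping and the base-4 packing of the four components into one code are our assumptions for illustration, not details taken from the paper.

```python
from collections import Counter

# Assumed digit equivalence, in the style of table 1, for the
# third-order values {-3, -1, 1, 3} of eq. (9).
DIGIT = {-3: 0, -1: 1, 1: 2, 3: 3}

def config_code(phi):
    """Collapse one third-order vector phi (four components) into a
    single decimal configuration number by base-4 packing."""
    code = 0
    for component in phi:
        code = code * 4 + DIGIT[component]
    return code

def configuration_histogram(phis):
    """Frequency histogram of configuration codes over all sites of a
    sub-image; `phis` is an iterable of phi vectors."""
    return Counter(config_code(p) for p in phis)
```

Each sub-image thus yields one frequency-versus-configuration histogram, and these histograms serve as the characteristic vectors of the following section.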
The sub-images without an asterisk in Figure 5 are used to test the performance of the texture classification module. The architecture developed for texture classification consists of four basic blocks (see Figure 9):

a) Extraction of the characteristic vector: this block constructs a vector made of the frequencies of the configuration histogram used;
b) Classification: this block classifies the different characteristic vectors, that is to say, the information contained in the digitized input image (texture); for this block a backpropagation (BP) neural network is used;
c) Decision: this block makes the final decision from the data generated by the classification block;
d) Database: it stores parameters and models to adaptively control the operation of the other blocks.

Once the configuration histograms for the different statistical orders have been determined for the 32 sub-images of each texture category, each configuration histogram is evaluated with respect to performance.

Figure 7: Texture D103 and its corresponding configuration histogram.

Figure 8: Texture D105 and its corresponding configuration histogram.

Figure 9: Block diagram of the proposed system (extraction of the characteristic vector, classification, decision, database).

Table 2: Proposed binary codes for the seven output nodes.
D22: 0000000; D5: 0000001; D79: 0000010; D76: 0000100; D28: 0001000; D37: 0010000; D105: 0100000; D103: 1000000

The classification module is built using a BP neural network trained with 16 sub-images and validated with the remaining 16 unused sub-images. A BP neural network is trained with a predefined set of input-output pairs.
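The codification of Table 2 and the decision block reduce to a pair of lookups; a minimal sketch (the 0.5 thresholding rule in `decode` is our assumption, not stated in the paper):

```python
# Target codes of table 2 for the seven output nodes.
CODES = {
    "D22": "0000000", "D5": "0000001", "D79": "0000010",
    "D76": "0000100", "D28": "0001000", "D37": "0010000",
    "D105": "0100000", "D103": "1000000",
}

def target_vector(texture):
    """Binary target vector for the BP net's seven output nodes."""
    return [int(bit) for bit in CODES[texture]]

def decode(output, threshold=0.5):
    """Decision block: threshold the net's seven outputs and look the
    resulting bit string up in table 2 (None if it matches no code)."""
    bits = "".join("1" if o > threshold else "0" for o in output)
    inverse = {code: name for name, code in CODES.items()}
    return inverse.get(bits)
```

An output bit string that matches none of the eight codes is rejected, which is one simple way for the decision block to flag an unreliable classification.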
Each input pattern (vector) is applied to the input layer and propagates through the successive layers until it generates an output. This output is compared against the target output to obtain the so-called error vector. These errors are back-propagated, beginning at the output layer and continuing through the hidden layers until the input layer is reached. Based on an error minimization procedure, the connection weights of each neuron are modified so as to reduce the final error the next time the same input pattern is presented to the network. For this research, a BP neural network with 3 layers has been used. The number of nodes in the input layer equals the number of components of the characteristic vector; for the hidden layer, from 12 to 16 nodes have been used; and for the output layer, 7 neurons with the proposed codification (Table 2) have been employed. The BP neural network has been programmed and trained using MATLAB. In particular, two weight-update algorithms have been evaluated: gradient descent (traingd) and Levenberg-Marquardt (trainlm).

Table 3: Results using a 3x3 neighborhood and second-order histograms (correctly classified sub-images out of 16 per texture).
D22: 16 (100%); D103: 14 (87.5%); D105: 11 (68.8%); D79: 12 (75%); D76: 13 (81.3%); D37: 16 (100%); D5: 11 (68.8%); D28: 10 (62.5%). Overall: 80.47%.

Table 4: Results using a 5x5 neighborhood and third-order histograms (correctly classified sub-images out of 16 per texture).
D22: 16 (100%); D103: 14 (87.5%); D105: 13 (81.3%); D79: 12 (75%); D76: 13 (81.3%); D37: 16 (100%); D5: 13 (81.3%); D28: 14 (87.5%). Overall: 86.72%.

8 Experimental Results Using Histograms of Configurations

Table 3 shows the classification results obtained using a 3x3 neighborhood and second-order configuration histograms. In this case the classification module was built with 16 hidden-layer neurons and trained for 500 epochs.
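The traingd variant described above can be sketched in a few lines of numpy: forward pass, error vector, back-propagated deltas, and plain gradient-descent weight updates. This is a minimal illustration; the layer sizes, learning rate and random initialization here are our assumptions, not the paper's MATLAB configuration.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_bp(x, t, hidden=16, epochs=500, lr=0.5):
    """One-hidden-layer backpropagation network trained by plain
    batch gradient descent (the 'traingd' variant)."""
    n_in, n_out = x.shape[1], t.shape[1]
    w1 = rng.normal(0, 0.5, (n_in, hidden))
    w2 = rng.normal(0, 0.5, (hidden, n_out))
    for _ in range(epochs):
        h = sigmoid(x @ w1)              # forward pass
        y = sigmoid(h @ w2)
        e = t - y                        # error vector
        d2 = e * y * (1 - y)             # output-layer delta
        d1 = (d2 @ w2.T) * h * (1 - h)   # back-propagated hidden delta
        w2 += lr * h.T @ d2              # gradient-descent updates
        w1 += lr * x.T @ d1
    return w1, w2

def predict(x, w1, w2):
    return sigmoid(sigmoid(x @ w1) @ w2)
```

In the paper's setting the input dimension is the length of the characteristic vector and the 7 outputs follow the codes of Table 2; Levenberg-Marquardt replaces the plain gradient step with a damped Gauss-Newton update and is not sketched here.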
Later on, the performance was evaluated with 5x5 neighborhoods and third-order histograms; Table 4 shows the results of this experiment. From the above results, we can confirm that the configuration histograms are sufficiently representative of the textures analyzed using different statistical orders. Furthermore, we achieve a classification rate of 86.72% when employing the third statistical order.

9 Summary and Conclusions

A new texture classification method has been developed, based on the extraction of the statistical order from a digitized texture image. This latter operation has been performed using the set of local cliques of a neighborhood. The use of configuration histograms in the texture classification process has been quite successful, achieving a classification rate of around 87%. The decision to employ a BP neural network was also adequate for the classification. The best results were reached when training the neural network with the Levenberg-Marquardt method. The proposed classification architecture requires just a few features, thus reducing training time and the computational effort of the classification stage. The work developed so far constitutes the initial stage of a series of applications and works related to texture and the non-destructive visual inspection of materials.

References

[1] Haralick R., Shanmugam K. and Dinstein I., Textural features for image classification, IEEE Trans. on Systems, Man, and Cybernetics, Vol. SMC-3, No. 6, pp. 610-621, 1973.
[2] Hsin H., Texture segmentation using modulated wavelet transform, IEEE Trans. on Image Processing, Vol. 8, No. 7, 2000.
[3] Hernández L., Torrealba V. and Reigosa A., Clasificación automática del carcinoma de la mama, mediante un sistema de reconocimiento basado en redes neuronales, Memorias II Congreso Latinoamericano de Ingeniería Biomédica, La Habana, Cuba, 2001.
[4] Mery D., Da Silva R., Calôba L. and Rebello J., Detección de fallas en piezas fundidas usando metodología de reconocimiento de patrones, 3rd Panamerican Conference for Nondestructive Testing, Rio de Janeiro, Brazil, 2003.
[5] Bader D., JáJá J. and Chellappa R., Scalable data parallel algorithms for texture synthesis using Gibbs random fields, IEEE Transactions on Image Processing, Vol. 4, No. 10, pp. 1456-1460, 1995.
[6] Pun C. and Lee M., Rotation invariant texture classification using a two stage wavelet packet features approach, IEE Proc. Vis. Image Signal Process., Vol. 148, No. 6, pp. 422-428, 2001.
[7] Manjunath B., Simchony T. and Chellappa R., Stochastic and deterministic networks for texture segmentation, IEEE Transactions on Acoustics, Speech and Signal Processing, No. 6, pp. 1039-1047, 1990.
[8] Li S., Modeling image analysis problems using Markov random fields, Handbook of Statistics, Vol. 20, 2000.
[9] Li S., Markov random field models in computer vision, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 866-869, 1994.
[10] Gómez M., Modelación y clasificación de texturas utilizando campos aleatorios de Markov, Master thesis, Electrical Engineering, Universidad de Santiago de Chile, Santiago, Chile, 2004.
[11] Brodatz P., Textures. New York: Dover, 1966.

Renato Salinas, Mauricio Gomez
Universidad de Santiago de Chile
Electrical Engineering Department
Ave. Ecuador 3519, Santiago, Chile
E-mail: rsalinas@lauca.usach.cl, mauriciogomez@esfera.cl

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. VI (2011), No. 1 (March), pp. 101-112

An Optimal Task Scheduling Algorithm in Wireless Sensor Networks

L. Dai, Y. Chang, Z.
Shen

Liang Dai, Yilin Chang, Zhong Shen
State Key Laboratory of Integrated Service Networks, Xidian University, Xi'an 710071, China
E-mail: ldai1981@gmail.com, ylchang@xidian.edu.cn, zhshen@mail.xidian.edu.cn

Abstract: Sensing tasks should be allocated and processed among sensor nodes in minimum time so that users can draw useful conclusions by analyzing the sensed data. Furthermore, finishing sensing tasks faster benefits energy saving, which is critical in the system design of wireless sensor networks. To minimize the execution time (makespan) of a given task, an optimal task scheduling algorithm (OTSA-WSN) in a clustered wireless sensor network is proposed based on divisible load theory. The algorithm consists of two phases: intra-cluster task scheduling and inter-cluster task scheduling. Intra-cluster task scheduling deals with allocating different fractions of the sensing task among the sensor nodes in each cluster; inter-cluster task scheduling involves the assignment of sensing tasks among all clusters in multiple rounds, to improve the overlap of communication with computation. OTSA-WSN is built on eliminating transmission collisions and idle gaps between two successive data transmissions. By removing the performance degradation caused by communication interference and idle time, a reduced finish time and improved network resource utilization can be achieved. With the proposed algorithm, the optimal number of rounds and the most reasonable load allocation ratio for each node can be derived. Finally, simulation results are presented to demonstrate the impact of different network parameters, such as the number of clusters, computation/communication latency, and measurement/communication speed, on the number of rounds, makespan and energy consumption.

Keywords: wireless sensor networks; divisible load theory; multi-round task scheduling.
1 Introduction & Motivation

Owing to the limited energy of wireless sensor network nodes, tasks should be completed within the shortest possible amount of time. Divisible load theory [1] provides an effective solution for task scheduling in wireless sensor networks [2-5]. Different from other heuristic solutions to the task scheduling problem in wireless sensor networks [6, 7], this scheme yields not only the optimal solution but also an analytic solution, thus ensuring the consistency of the scheduling results. Divisible load theory has been intensively studied in the past decades. It is mainly concerned with obtaining an optimal partitioning and scheduling strategy for a given task such that it can be processed in the shortest amount of time. Divisible load scheduling algorithms can be divided into single-round scheduling algorithms [2-5] and multi-round scheduling algorithms [8, 9]. A single-round scheduling algorithm is relatively simple, but its overlap of computation and communication is rather poor and its extra overhead is relatively large. Single-round scheduling algorithms were applied to wireless sensor networks in [2-5]. Although the authors derived closed-form solutions for the optimal finish time, the network topology discussed in those papers is a single-level tree structure, whereas in wireless sensor networks the clustered structure (multi-level tree structure) has many advantages over the single-level tree [10]. Multi-round scheduling algorithms achieve a better overlap of computation and communication, thus properly reducing the scheduling overhead. However, they are more difficult to analyze, so fewer results on multi-round scheduling are available, and the existing multi-round scheduling algorithms are designed for grid computing environments.
Therefore, we present a multi-round task scheduling algorithm (OTSA-WSN) for clustered wireless sensor networks. The goal of this algorithm is to minimize the overall execution time (hereafter called makespan) and to fully utilize network resources, by finding an optimal strategy for splitting the original tasks received by the sink into a number of sub-tasks and distributing these sub-tasks to the clusters in the right order.

2 Optimal Scheduling Algorithm

Wireless sensor networks construct clusters several times during their life cycle. Each cluster has a set-up phase and a steady-state phase [10]. We discuss our multi-round task scheduling algorithm in the steady-state phase. Scheduling of the original tasks received by the sink is divided into two stages: inter-cluster task scheduling and intra-cluster task scheduling. First, inter-cluster task scheduling partitions the entire task among the clusters, and then the sub-tasks in a cluster are assigned to each intra-cluster sensor node by intra-cluster task scheduling. To improve the overlap of communication with computation, inter-cluster task scheduling assigns sensing tasks among all clusters in multiple rounds. According to divisible load theory, to remove the performance degradation caused by communication interference, the sink sends each round's tasks to the cluster heads sequentially. After each cluster finishes its tasks and fuses the data, the cluster heads also send the round's results to the sink sequentially. That is, at any moment only one transfer is allowed: either the sink sends sub-tasks to a cluster head, or a cluster head returns fused data to the sink.

2.1 Intra-Cluster Task Scheduling

In order to ensure that the tasks are processed in an orderly manner, the sink allocates the tasks to each cluster according to the task-processing rate of each cluster, which guarantees that the task execution time of all clusters in each round remains the same.
Definition: the task-processing rate of a cluster is the average rate at which the cluster completes the intra-cluster tasks, that is, the number of tasks handled (measuring and reporting data) per unit of time. Assuming there are k nodes in a cluster, according to divisible load theory, the cluster's task-processing rate is:

S = (1 + Σ_{i=2}^{k} Π_{j=2}^{i} h_j) / (1/s_1 + 1/b_1)   (1)

where h_i = (1/s_{i−1}) / (1/s_i + 1/b_i), i = 2, ..., k; s_i is node i's measuring rate (the number of tasks completed per unit of time) and b_i is node i's transmitting rate to the cluster head (the number of tasks transmitted per unit of time).

Figure 1: Timing diagram for in-cluster task processing (measuring times α_i/s_i and data-reporting times α_i/b_i for nodes n_1, ..., n_k).

Proof: α_i is defined as the fraction of the sensing task assigned to node n_i by the cluster head. It is assumed that every node is assigned a non-zero task, i.e., 0 < α_i < 1, and that the task fractions of all nodes in the cluster sum to 1:

Σ_{i=1}^{k} α_i = 1   (2)

So the times for node n_i to measure its tasks and to report the results to the cluster head are α_i/s_i and α_i/b_i, respectively. One cluster head and a set of sensor nodes constitute a cluster in which each node is able to communicate with the cluster head directly. Sensor nodes measure data from the surroundings related to the given task and then report these data to the cluster head. To complete a certain amount of sensor readings in minimum finishing time, the sensing task should be allocated to each sensor node and scheduled so as to avoid transmission conflicts and idle time at the cluster head. Fig. 1 illustrates the timing diagram for a set of sensor nodes, indexed from n_1 to n_k, in one cluster. From Fig. 1, it can be observed that there is no time gap between any two successive nodes, because the divisible workload can be transferred within the cluster.
All sensor nodes start to measure data at the same time. As soon as the previous node finishes transmitting its data, the next one completes its measuring task and starts to report its data. As a result, the proposed timing diagram minimizes the finish time by scheduling the measuring time and the reporting time of each sensor node. Moreover, since the intra-cluster scheduling avoids transmission conflicts at the cluster head, energy that would be spent on retransmissions is conserved. The working time of a sensor node can be divided into two parts: measuring time and reporting time. From Fig. 1 one can set up the following recursive load distribution equations:

α_{i-1}/s_{i-1} = α_i/s_i + α_i/b_i,  i = 2, 3, ..., k    (3)

Rewriting this set of equations:

α_i = h_i α_{i-1}    (4)

where h_i = (1/s_{i-1}) / (1/s_i + 1/b_i), i = 2, 3, ..., k. Using Fig. 1, eq. (2) and eq. (4), the largest workload, assigned to the first sensor node, can be solved as:

α_1 = 1 / (1 + \sum_{i=2}^{k} \prod_{j=2}^{i} h_j)    (5)

Similarly, the workloads of the other sensor nodes follow from eq. (4):

α_i = \prod_{j=2}^{i} h_j / (1 + \sum_{l=2}^{k} \prod_{j=2}^{l} h_j)    (6)

Fig. 1 indicates that when the node with the largest measuring load finishes its transmission, the local cluster completes its assigned sensing task. The finish time for measuring and reporting the data of the whole cluster is therefore

t = (1/s_1 + 1/b_1) / (1 + \sum_{i=2}^{k} \prod_{j=2}^{i} h_j)    (7)

and we obtain the task-processing rate of the cluster:

s = (1 + \sum_{i=2}^{k} \prod_{j=2}^{i} h_j) / (1/s_1 + 1/b_1)    (8)

It is not difficult to see that in a homogeneous network environment every cluster has the same parameters, and the common task-processing rate is

s = (1 − h^k) / [(1/s + 1/b)(1 − h)]    (9)

where h = (1/s)/(1/s + 1/b). The theoretical analysis and simulation in this paper are based on the LEACH protocol family [10] (inter-cluster one-hop topology). For multi-hop networks, i.e. a multi-layer tree structure, the same calculations can be carried out at each parent node of the tree using eqs. (3)–(8).

L. Dai, Y. Chang, Z. Shen
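Under the same assumptions, eqs. (3)–(8) translate directly into code. The sketch below is our own illustration (names hypothetical) of computing the optimal load fractions and the cluster's task-processing rate:

```python
def cluster_rate(s, b):
    """Optimal load fractions and task-processing rate of one cluster.

    s[i], b[i]: measuring and reporting rates of node i (tasks per unit time).
    Returns (alphas, rate), following eqs. (3)-(8) with 0-based indexing.
    """
    k = len(s)
    # h_i = (1/s_{i-1}) / (1/s_i + 1/b_i), eq. (4)
    h = [None] + [(1.0 / s[i - 1]) / (1.0 / s[i] + 1.0 / b[i]) for i in range(1, k)]
    # cumulative products prod_{j=2}^{i} h_j
    prods, p = [], 1.0
    for i in range(1, k):
        p *= h[i]
        prods.append(p)
    denom = 1.0 + sum(prods)
    alphas = [1.0 / denom] + [pr / denom for pr in prods]   # eqs. (5), (6)
    rate = denom / (1.0 / s[0] + 1.0 / b[0])                # eq. (8)
    return alphas, rate
```

For homogeneous parameters the returned rate agrees with the closed form of eq. (9), which provides a quick consistency check of the derivation.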
According to divisible load theory, a conflict-free scheduling strategy is then calculated for each parent node in order to save energy and prolong the network lifetime. In a wireless sensor network the cluster head is responsible for the data exchange between the sink and the in-cluster nodes. In order to reduce the energy consumption caused by transmitting redundant data, to lower latency and to prolong the survival period, the cluster head needs to fuse the data [11]. A new estimation method for the data-fusion information utility constant [5] is introduced in this paper. The information utility constant is based on a technique for estimating information accuracy: by estimating the accuracy of the information, the cluster head can determine the approximate percentage of data fusion.

2.2 Inter-cluster task scheduling

The following notation is used throughout this paper:
• w_total: total amount of workload residing at the sink;
• w_{j,i}: number of tasks assigned to cluster i in round j;
• s_i: task-processing rate of cluster i, i.e. the number of tasks processed per unit time;
• b_i: downlink communication speed from the sink to cluster head i; b'_i: uplink transmission rate from cluster head i to the sink, i.e. the number of tasks transmitted per unit time;
• t_j: processing time of round j;
• w_j: size of the total load dispatched during round j;
• φ_i: information utility constant of cluster head i.

Figure 2: Process of the multi-round scheduling algorithm

Thus the time in which the entire load assigned to cluster i in round j can be sensed (measured and transmitted) is w_{j,i}/s_i. In round j, the time for the sink to send the tasks to cluster i and the time for cluster head i to send back the fused data are w_{j,i}/b_i and φ_i w_{j,i}/b'_i, respectively.
In a practical wireless sensor network environment, the communication and computing latency caused by pre-initialization is inevitable. Suppose the affine cost parameters are as follows:
• α_i: computing latency of cluster i, i.e. the time taken for initialization;
• β_i (resp. β'_i): communication latency incurred by the sink to initiate a data transfer to cluster head i (resp. the start-up time for communication from cluster head i to the sink).

Fig. 2 describes the procedure by which the sink dispatches the tasks to each cluster, each cluster measures and reports data, and the cluster heads report the fused data to the sink. In this paper we assume that there are n clusters in total in the stable stage, where c_i, i = 1, ..., n, denotes each cluster. Since the computational cost of each cluster remains the same, we have

α_i + w_{j,i}/s_i = t_j,  j = 0, ..., m − 1    (10)

where t_j depends only on the round index j, and m is the optimal number of scheduling rounds. The sum of the tasks allocated to the clusters in round j equals the load of round j:

w_j = \sum_{i=1}^{n} w_{j,i}    (11)

From eqs. (10) and (11) we can compute

w_{j,i} = a_i w_j + c_i    (12)

where a_i = s_i / \sum_{k=1}^{n} s_k and c_i = (s_i / \sum_{k=1}^{n} s_k) \sum_{k=1}^{n} s_k α_k − s_i α_i (we write c_i for the affine term to avoid confusion with the bandwidth b_i).

As shown in Fig. 2, in order to fully utilize the bandwidth and avoid clusters waiting between rounds, the sink must send the tasks allocated for round j + 1 to all the cluster heads and receive the fused data from all the cluster heads within round j, before cluster n finishes its tasks of round j.
When the time for the intra-cluster nodes to process the tasks of round j is exactly equal to the sum of the time for the sink to send the sub-tasks of round j + 1 to all cluster heads and the time to receive the fused data of round j from all cluster heads, the best bandwidth utilization is achieved, that is:

\sum_{i=1}^{n} [ w_{j+1,i}/b_i + β_i + φ_i w_{j,i}/b'_i + β'_i ] = t_j    (13)

Using eqs. (10), (12) and (13), we have

w_{j+1} = \frac{1 − \sum_{i=1}^{n} φ_i s_i/b'_i}{\sum_{i=1}^{n} s_i/b_i} w_j + \frac{\sum_{i=1}^{n} s_i α_i − (\sum_{i=1}^{n} s_i) \sum_{i=1}^{n} (β_i + c_i/b_i + β'_i + φ_i c_i/b'_i)}{\sum_{i=1}^{n} s_i/b_i}    (14)

Solving the recursion (14) gives

w_j = θ^j (w_0 − η) + η    (15)

where

θ = (1 − \sum_{i=1}^{n} φ_i s_i/b'_i) / \sum_{i=1}^{n} (s_i/b_i),
η = [ \sum_{i=1}^{n} s_i α_i − (\sum_{i=1}^{n} s_i) \sum_{i=1}^{n} (β_i + c_i/b_i + β'_i + φ_i c_i/b'_i) ] / [ \sum_{i=1}^{n} (s_i/b_i) + \sum_{i=1}^{n} φ_i s_i/b'_i − 1 ]

The total load equals the sum of the tasks allocated over all rounds:

\sum_{j=0}^{m−1} w_j = w_total    (16)

from which the following constraint relation is obtained:

g(m, w_0) = (w_0 − η)(1 − θ^m)/(1 − θ) + m η − w_total = 0    (17)

The problem of minimizing the total task finish time of the scheduling algorithm [8] is then described by

E(m, w_0) = \sum_{j=0}^{m−1} t_j + \frac{1}{2} \sum_{i=1}^{n} [ w_{0,i}/b_i + β_i + φ_i w_{m−1,i}/b'_i + β'_i ]    (18)

This minimization problem can be solved by Lagrange multipliers, giving

w_0 = (1 − θ)/(1 − θ^m) · (w_total − m η) + η    (19)

After solving for w_0 and m, the sizes of all the chunks w_{j,i} can be obtained using eqs. (12) and (15).

3 Wireless energy use

In this section the energy model of the OTSA-WSN algorithm is presented in detail and the energy consumption of the individual sensor nodes is derived. The model is based on the first-order radio model [10]. There are three kinds of energy consumption in a wireless sensor network: measurement, data fusion, and communication, because the nodes in the sensor network cooperate with each other via data transmission.
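Returning to the multi-round derivation, eqs. (15), (16) and (19) can be checked numerically. The sketch below is our own illustration with invented parameter values, not part of the paper:

```python
def round_sizes(w0, theta, eta, m):
    """Chunk sizes w_0..w_{m-1} from the recursion w_j = theta**j*(w0-eta)+eta (eq. 15)."""
    return [theta**j * (w0 - eta) + eta for j in range(m)]

def w0_for_total(w_total, theta, eta, m):
    """Eq. (19): choose w0 so that the m round sizes sum to w_total (eq. 16)."""
    return (1 - theta) / (1 - theta**m) * (w_total - m * eta) + eta

# made-up illustrative values: theta < 1 gives shrinking rounds, eta is the fixed point
theta, eta, m, w_total = 0.5, 2.0, 4, 100.0
w0 = w0_for_total(w_total, theta, eta, m)
rounds = round_sizes(w0, theta, eta, m)
assert abs(sum(rounds) - w_total) < 1e-9   # constraint (16)/(17) is satisfied
```

With θ < 1 the round sizes decrease geometrically toward η, so later rounds are small; this is the qualitative behaviour the geometric solution (15) predicts.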
Energy is consumed for communication at the sensor nodes, the cluster heads and the sink. The cluster heads and the sink do not need to perform any sensing task; thus there is no measurement energy cost for these nodes, while the additional energy cost of the cluster heads is attributed to data fusion. The energy needed to sense, fuse and transmit one unit of sensory data is denoted by e_s, e_p and e_tx, respectively. A node also consumes e_rx to receive one unit of data. The distance between the sender and the receiver is d. The energy use of each kind of node is outlined as follows.

Energy use of an individual sensor node j in cluster i:

e_{i,j} = α_{i,j}(e_s + e_tx d^2),  i = 1, ..., k,  j = 1, ..., n_i    (20)

Energy use of an individual cluster head:

e_i = α_i(e_rx + e_p + φ_i e_tx d^2),  i = 1, ..., k    (21)

Energy use of the sink:

e_sink = \sum_{i=1}^{k} α_i φ_i e_tx    (22)

4 Performance evaluation

In the sections above we obtained the optimal number of rounds m for a given sensing task and the energy use of the individual sensor nodes. In this section we investigate the effects of three network parameters (the number of clusters, the computation/communication latency, and the measurement/communication speed) on the number of rounds, the makespan and the energy consumption in a homogeneous network environment. In the simulation the following energy parameters are adopted: transmitting one unit of sensor reading over a unit distance takes e_tx = 200 nJ, receiving one unit of sensor reading consumes e_rx = 150 nJ, measuring one unit of sensor reading needs e_s = 100 nJ, fusing one unit of observation consumes e_p = 20 nJ, and the distance between the sender and the receiver is d = 100 m. There are 20 sensor nodes in each cluster. The simulation results are shown in Figs. 3 to 6. First, Fig. 3 plots the m values computed by OTSA-WSN versus the computation/communication latency as it varies between 0 and 1.0.
Assume the communication speeds of the uplink and downlink between the sink and a cluster head are identical, i.e. β = β'. As can be seen from Fig. 3, m decreases as either the communication or the computation latency increases, because fewer rounds result in less overhead: when the communication and computation latency increase, the tasks allocated to each round must increase in order to keep the bandwidth fully used, and therefore the number of rounds is reduced.

Figure 3: Impact of communication and computing latency on the number of rounds

Next, the makespan against the number of clusters is plotted in Fig. 4. Assume that the transmission rate of all links between nodes is the same, i.e. b = b'. In Fig. 4(a) the value of s is chosen from 4 to 10, while b is fixed at 1.0. The figure shows that the measurement speed hardly affects the makespan, because sensing takes only a small fraction of the entire execution time. Fig. 4(b) shows that when the communication speed of the sensor nodes increases, the makespan of a given task is reduced. It can also be seen that the four lines in Fig. 4(b) converge as the number of clusters becomes large. The third simulation concerns the energy consumption of the intra-cluster sensor nodes. The sink and the cluster heads are not taken into account because, in general, the sink has no energy constraint and the chosen cluster heads have enough energy. The network is configured with 20 clusters. Without loss of generality, the intra-cluster sensor nodes of the first cluster are chosen for the study of energy consumption, as shown in Fig. 5. Fig. 5 presents the energy consumption of all the nodes in the first cluster as given by eq. (20), where the intra-cluster nodes are indexed from 1 to 20. In each case the energy consumption of the sensor nodes decreases monotonically, due to the decreasing workload.
Fig. 5(a) shows that the higher the in-cluster nodes' measuring speed, the more evenly the tasks are allocated to the nodes and hence the smaller the energy consumption of the nodes. Fig. 5(b) shows that the larger the communication speed between the nodes, the smaller the energy consumption of the in-cluster nodes. Finally, to simulate the OTSA-WSN algorithm in heterogeneous network environments, the measuring speeds of the intra-cluster nodes are set to random values between 4 and 10 and the communication speeds of the links to random values between 0.4 and 1.0. Through 8 experiments, the performance of the scheduling algorithm in the heterogeneous network environment is analysed. Fig. 6(a) shows the impact of random measuring speed and communication speed on the makespan as the number of clusters increases in the heterogeneous network environment. Fig. 6(b) shows the impact of random measuring speed and communication speed on the energy consumption in the order of task allocation. Compared with Fig. 4 and Fig. 5, it can be seen that the impact of the measuring speed on the makespan and on the energy consumption is smaller than that of the communication speed. The makespan decreases with an increasing number of clusters, and the energy consumption decreases in the order of task allocation.

Figure 4: Impact of measuring speed and bandwidth on the makespan

Figure 5: Impact of measuring speed and bandwidth on the energy consumption of the in-cluster nodes

Figure 6: Impact of random measuring speed and bandwidth on the makespan and energy consumption

5 Conclusions

Since wireless sensor network nodes have limited energy, the tasks should be completed as quickly as possible and the network resources should be fully utilized.
In this paper we presented a multi-round task scheduling algorithm (OTSA-WSN) for clustered wireless sensor networks. The goal of this algorithm is to minimize the makespan and to fully utilize the network resources, by finding an optimal strategy for splitting the original load received by the sink into a number of chunks and for distributing these chunks to the clusters in the right order.

Bibliography

[1] V. Bharadwaj, D. Ghose, T. G. Robertazzi, Divisible load theory: a new paradigm for load scheduling in distributed systems, Cluster Computing, Vol. 6, No. 1, pp. 7-18, 2003.
[2] M. Moges, T. G. Robertazzi, Wireless sensor networks: scheduling for measurement and data reporting, IEEE Transactions on Aerospace and Electronic Systems, Vol. 42, No. 1, pp. 327-340, 2006.
[3] H. Liu, X. Yuan, M. Moges, An efficient task scheduling method for improved network delay in distributed sensor networks, Proceedings of TridentCom 2007, Orlando, FL, USA, pp. 1-8, 2007.
[4] H. Liu, J. Shen, X. Yuan, M. Moges, Performance analysis of data aggregation in wireless sensor mesh networks, Proceedings of Earth & Space 2008, Akron, OH, USA, pp. 1-8, 2008.
[5] C. Kijeung, T. G. Robertazzi, Divisible load scheduling in wireless sensor networks with information utility performance, Proceedings of IPCCC 2008, Austin, Texas, USA, pp. 9-17, 2008.
[6] Z. Zeng, A. Liu, D. Li, A highly efficient DAG task scheduling algorithm for wireless sensor networks, Proceedings of ICYCS 2008, Zhangjiajie, Hunan, China, pp. 570-575, 2008.
[7] J. Lin, W. Xiao, F. L. Lewis, Energy-efficient distributed adaptive multisensor scheduling for target tracking in wireless sensor networks, IEEE Transactions on Instrumentation and Measurement, Vol. 58, No. 6, pp. 1886-1896, 2009.
[8] Y. Yang, K. van der Raadt, H. Casanova, Multiround algorithms for scheduling divisible loads, IEEE Transactions on Parallel and Distributed Systems, Vol. 16, No. 11, pp. 1092-1102, 2005.
[9] C. Yeim-Kuan, W. Jia-Hwa, C. Chi-Yeh, C.
Chih-Ping, Improved methods for divisible load distribution on k-dimensional meshes using multi-installment, IEEE Transactions on Parallel and Distributed Systems, Vol. 18, No. 11, pp. 1618-1629, 2007.
[10] W. Heinzelman, A. Chandrakasan, An application-specific protocol architecture for wireless microsensor networks, IEEE Transactions on Wireless Communications, Vol. 1, No. 4, pp. 660-670, 2002.
[11] X. Tang, J. Xu, Optimizing lifetime for continuous data aggregation with precision guarantees in wireless sensor networks, IEEE/ACM Transactions on Networking, Vol. 16, No. 4, pp. 904-917, 2008.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 1, pp. 65-72

Bayesian Network Classifier for Medical Data Analysis

Beáta Reiz, Lehel Csató

Beáta Reiz
Biological Research Center, Central Labs, Bioinformatics Group
62 Temesvári krt., HU-6701 Szeged, Hungary
E-mail: beareiz@brc.hu

Lehel Csató
Babeş-Bolyai University, Faculty of Mathematics and Computer Science
1 Kogălniceanu str., RO-400084 Cluj-Napoca, Romania
E-mail: csatol@cs.ubbcluj.ro

Abstract: Bayesian networks encode causal relations between variables using probability and graph theory. They can be used both for prediction of an outcome and for interpretation of predictions based on the encoded causal relations. In this paper we analyse a tree-like Bayesian network learning algorithm optimised for classification of data, and we give solutions for the interpretation and analysis of the predictions. The classification of logical, i.e. binary, data arises specifically in the field of medical diagnosis, where we have to predict the survival chance based on different types of medical observations, or we must select the most relevant cause corresponding to a given patient record. Surgery survival prediction was examined with the algorithm: the bypass surgery survival chance must be computed for a given patient, given a data-set of medical examinations for patients.
Keywords: Bayesian networks, classification, medical data analysis, causal discovery.

Copyright © 2006-2009 by CCC Publications

1 Introduction

In this paper we analyse a tree-like Bayesian network (BN) implementation for medical data classification. We consider a general case for the data attributes, where the observations can be both continuous and discrete, and, as is common to almost all medical data, missing observations can also occur. We aim to establish causal relationships between variables representing medical examinations. While interested in good classification performance, we also want to interpret and analyse the predictions in terms of the encoded causal relations. The database we used consists of medical examinations of people, containing both discrete and continuous observations. The task is thus to predict the surgery survival chance based on the available data, i.e. the medical examinations [1, 2], and to analyse the impact of a specific examination on patient survival. The database is characterised by partial observability, with a number of missing values. Our aim is to predict the target variable's value (survival) for a particular patient and to obtain the "most relevant" variables affecting the output of the classifier. Of equal interest is to analyse the decisions in terms of the encoded relationships between data attributes; this analysis is usually done to provide support for physicians. We encode the dependencies between the class variable and the observations using a tree whose root node is the class variable. The other attributes are inside the tree, with corresponding conditional probability tables "learned" from the data-set. Finding the most appropriate structure is an extremely difficult task. We reduce the complexity by constructing the tree of immediate causal relationships between the class variable and the observations [3].
A tree-like Bayesian network structure is inferred from the data [4], where the root of the tree is the class variable and the remaining nodes are attributes. Direct causal relations between the attributes and the class variable are revealed in the first phase of the algorithm, constructing a naive Bayesian network. Attribute-attribute correlations are searched for, based on Chow and Liu's algorithm, in the second phase. In practical situations we also have to face the problem, common to almost all medical data, of missing observations for some patients, i.e. incomplete data items; this issue can also be handled in a principled way with a Bayesian network. The paper is organised as follows: we first present Bayesian networks, then a stochastic algorithm to extract plausible network structures from the data, and finally we analyse experimental results of applying the algorithm to real data-sets.

2 Bayesian networks

Bayesian networks (BNs) [5] are triplets (V, E, P), where (V, E) is a directed acyclic graph (DAG) with nodes V and edges E, and P is a set of probability distributions, called parameters, whose elements are assigned to the nodes of the graph. The nodes represent domain variables and the edges mark direct causal relations between these variables. The network encodes a joint probability distribution representative of the domain:

p(x) = \prod_{i=1}^{n} p(x_i | par(x_i))

where n is the number of domain variables, x_i is a node of the BN and par(x_i) is the set of x_i's parents. The acyclicity of the graph ensures that the product is well defined. We employ a tree-like topology for the BN in order to increase the efficiency of class-variable estimation and interpretation. In Section 3 we describe this algorithm, in which we construct a tree such that the root of the tree is the class variable and the remaining nodes are attributes.
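As a minimal illustration of this factorisation, consider a toy two-node tree with invented probabilities (not from the paper's data):

```python
# Joint probability of a tiny tree-like BN: p(y, x1) = p(y) * p(x1 | y).
# Y is the root (class variable); X1's only parent is Y.
p_y = {0: 0.6, 1: 0.4}                       # p(y)
p_x1_given_y = {0: {0: 0.9, 1: 0.1},         # p(x1 | y = 0)
                1: {0: 0.3, 1: 0.7}}         # p(x1 | y = 1)

def joint(y, x1):
    """p(y, x1) via the BN factorisation p(x) = prod_i p(x_i | par(x_i))."""
    return p_y[y] * p_x1_given_y[y][x1]

total = sum(joint(y, x1) for y in (0, 1) for x1 in (0, 1))
# total == 1.0: the factorisation defines a valid joint distribution
```

Summing the factorised joint over all configurations gives exactly 1, which is the property the acyclicity of the graph guarantees for the full product over all nodes.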
Direct causal relations encoded by the BN are interpreted via the maximum of the mutual information, respectively the conditional mutual information [6, 7, 8], between nodes. We now present the information-theoretic concepts [9] needed for our algorithm. We use the following notation: X and Y are random variables defined on the probability spaces Ω_X and Ω_Y with corresponding distribution functions p(x) and p(y); their joint and conditional probability functions are denoted p(x, y) and p(x|y), respectively. Information theory offers a numerical characterisation of the uncertainty in the domain variables. Uncertainty is measured by the information entropy of the respective variable. The information entropy can be understood as the average minimal message length that has to be sent over a channel to encode the message, and is defined as:

H(X) = − \sum_{x ∈ Ω_X} p(x) log p(x)

The mutual information is the quantity of information two random variables contain about each other, defined as:

I(X, Y) = \sum_{x ∈ Ω_X} \sum_{y ∈ Ω_Y} p(x, y) log [ p(x, y) / (p(x) p(y)) ]    (1)

In the next section we present a two-phase tree-like Bayesian network structure learning algorithm. The algorithm extends naive Bayesian structure learning with inner-structure learning for finding causal relations between the attributes.

3 Network topology learning

Bayesian network classification consists of emphasising the node corresponding to the class variable during inference. To optimise the learning process and inference, we construct the network topology in such a way as to maximise the efficiency of prediction and of estimating the impact of the data attributes on the target variable. We single out the class variable and, in the first phase, search for direct dependencies between the attributes and the class variable, thereby constructing a naive Bayesian network. Naive Bayes classifiers [10, 11] are widely used in classification problems.
They are called naive because of the independence assumption on the attributes. Although this is a strong assumption when facing real data-sets, naive Bayes classification is a powerful tool thanks to its simplicity, and it often gives good results. During the naive Bayesian network learning process, direct dependencies between the class variable and the attributes have to be found. The dependency relations are interpreted as class-variable specifiers, so an edge from X to the class variable Y means that variable X carries information about Y. The mutual information between the class variable and an attribute, conditioned on the attributes already placed among the direct dependencies of the class variable, gives the amount of new information the respective attribute carries about the class variable [12]. In terms of our problem, this solution means that we select certain medical examinations to place in the network and exclude the rest of the attributes. This is a strong restriction, considering that some examinations are replaced by others in different hospitals. The second phase of the algorithm applies the Chow-Liu algorithm [13] to learn the inner structure of the network and reveal attribute-attribute correlations. The naive Bayesian network consists of the class variable Y and the variables X directly linked to it. Our next task is to place the excluded attributes in the Bayesian network. We use mutual-information maximisation to discover the causal relations between the attributes already in the network and the excluded attributes; the class variable can be ignored here. Mutual-information maximisation is sufficient in this case, because dependency now has the meaning of replaceability: we are searching for excluded attributes that carry almost the same information about the class variable as attributes already placed in the network.
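A minimal sketch of this first, naive-Bayes phase is given below. It is our own illustration: plain empirical mutual information is used in place of the conditional mutual information of the full algorithm, and all names are hypothetical:

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Empirical mutual information (eq. (1)) of two discrete sequences."""
    n = len(xs)
    pxy = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    return sum(c / n * math.log((c / n) / ((px[x] / n) * (py[y] / n)))
               for (x, y), c in pxy.items())

def naive_phase(data, y, alpha):
    """Greedily attach the attribute carrying maximal information about the
    class until the gain drops below the threshold alpha.
    data: {attribute name: list of values}; y: list of class labels."""
    placed, remaining = [], dict(data)
    while remaining:
        best = max(remaining, key=lambda a: mutual_information(remaining[a], y))
        if mutual_information(remaining[best], y) < alpha:
            break
        placed.append(best)
        del remaining[best]
    return placed
```

An attribute identical to the class carries its full entropy of information and is selected, while an empirically independent attribute carries none and is excluded by the threshold.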
Before presenting the algorithm, we introduce the notation used in the following:

• X: the set of attributes not yet placed in the net;
• Z: the set of attributes in the net;
• x: one attribute from X;
• z_i: an element of Z;
• y: the class variable;
• I(X, Y): the mutual information of X and Y;
• I(X, Y | Z): the conditional mutual information of X and Y given Z.

Our algorithm introduces a threshold parameter, denoted α, which is the minimum "information" required when placing a new attribute in the network during naive Bayesian structure learning. This parameter controls the number of direct connections between the class variable and the attributes. The algorithm is presented in Algorithm 1. The threshold α ensures the selection of attributes relevant to the class variable, controlling the number of direct causal relations between the class variable and the attributes. The result is a tree-like Bayesian network as in Figure 2, where the root of the tree is the class variable and the other nodes are attribute variables. The edges are oriented from parent to child, thereby minimising the modification of the network parameters during a learning step. The algorithm above is deterministic in the sense that it always generates the same network from the same data. We introduce importance sampling [14] in order to avoid this determinism when selecting among equal information quantities. The distribution used for sampling is based on the mutual information and attains its maximum where the mutual information is maximal. We used two functions during the tests. The first function, denoted f_1, is the conversion of the mutual information into a distribution function, given in eq. (2) below.

Algorithm 1: Tree-like Bayesian network structure learning
1: place the class variable y in the network
2: Z = ∅
3: {naive Bayesian structure learning}
4: while max_{x ∈ X} I(x, y | Z) ≥ α do
5:    x̂ = argmax_{x ∈ X} I(x, y | Z)
6:    place x̂ in the network
7:    X = X − {x̂}
8:    Z = Z ∪ {x̂}
9: end while
10: {inner structure learning}
11: while X ≠ ∅ do
12:    [x̂, ẑ] = argmax_{x_i ∈ X, z_j ∈ Z} I(x_i, z_j)
13:    place an edge between x̂ and ẑ
14:    X = X − {x̂}
15:    Z = Z ∪ {x̂}
16: end while

Figure 2: Structure of a BN (a tree with root y and attribute nodes x_1, ..., x_9)

The conversion of the mutual information into a distribution function is:

f_1(x) = I(x, y) / \sum_{x' ∈ X} I(x', y)    (2)

Figure 3(b) illustrates which edges are inferred from artificial data when using importance sampling with function f_1; the generator network for the data is presented in Figure 3(a). On the horizontal plane is the adjacency matrix of the graph, and the non-zero columns represent the edges. Figure 3(a) shows the edges of the generator network, and Figure 3(b) shows the frequency of the learned edges over the tests. The second function, denoted f_2, uses the exponentiation of the mutual information. It has a parameter β, which can be understood as a temperature controlling the constructed distribution function: the higher β is, the higher the probability of selecting the maximal mutual information, while for lower values of β the selection probabilities approach the uniform distribution.

f_2(x) = exp(β · f_1(x)) / \sum_{x' ∈ X} exp(β · f_1(x'))    (3)

Figure 3 shows the histograms of the learned edges using the presented approaches, together with the generator network topology of the data in Figure 3(a). In each graph the horizontal plane is the adjacency matrix of the network topology, and the vertical columns represent the histogram of the edges.
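The two sampling functions can be sketched as follows (our own illustration; the attribute names and mutual-information scores are invented):

```python
import math
import random

def f2_distribution(mi_scores, beta):
    """Sampling distribution of eq. (3): a softmax over the normalised
    mutual-information scores f_1 of eq. (2). A high beta concentrates the
    mass on the attribute with maximal mutual information; beta = 0 gives
    the uniform distribution."""
    total_mi = sum(mi_scores.values())
    f1 = {a: mi / total_mi for a, mi in mi_scores.items()}      # eq. (2)
    z = sum(math.exp(beta * v) for v in f1.values())
    return {a: math.exp(beta * f1[a]) / z for a in f1}          # eq. (3)

def sample_attribute(mi_scores, beta, rng=random):
    """Importance-sampled selection of the next attribute to place."""
    probs = f2_distribution(mi_scores, beta)
    attrs, weights = zip(*probs.items())
    return rng.choices(attrs, weights=weights, k=1)[0]

probs = f2_distribution({"x1": 0.6, "x2": 0.3, "x3": 0.1}, beta=10.0)
# probs["x1"] is the largest probability, so x1 is selected most often
```

This reproduces the behaviour described in the text: raising β makes the selection nearly deterministic, while lowering it spreads the choice over the competing attributes.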
Figure 3: Generator network and histogram of BN edges: (a) generator network topology; (b) learned topologies with function f_1; (c) learned topologies with function f_2, β = ; (d) learned topologies with function f_2, β =

We consider the first attribute from the network as the class variable, so it is the root of the constructed tree; consequently there is no arc from any attribute to the class variable, and its column is 0 at each point. Figure 3(a) represents the generator network topology of the data, hence each edge appears exactly once. The other graphs, Figures 3(b), 3(c) and 3(d), represent the frequency of each edge in the inferred network topologies. Although importance sampling introduces randomness in our first approach, the generated structure is relatively stable across the iterations when learning with function f_1. The direct causal relations between the class variable and the attributes are almost the same across the test simulations; differences can be observed only in the causal relations between the attributes, more precisely on the third level of the tree, where in a number of cases a single edge is placed differently compared to the generator network. A somewhat less stable structure (Figure 3(c)) can be observed when learning with function f_2 at a lower β. This is due to the fact that the separation between lower and higher values of the mutual information is more sensitive for lower values of β. Figure 3(d) represents the learned structures for the higher β.
One can see that in this case the structure is fully stable, which confirms the former statement. In the next sections we analyse the convergence of the learning process and the use of the constructed network.

4 Results

In this section we present the learned topologies in the case of real data. To fully settle the bypass problem we had to binarise the data, owing to the very low number of data samples in the bypass database available for estimating the conditional probability distribution functions. We ran tests of the algorithm on the fully specified attributes of the database; Figure 4 presents the results of these tests. One can observe a high degree of uncertainty when learning with function f_2 at the lower β; this is reduced by increasing β. Function f_1 highlights almost the same dependency relations as f_2 with the higher β, but there is a disturbing level of uncertainty in these relations. Further analysis checked the stability of the algorithm on the real data-set. For this purpose the leave-one-out method was applied to the attributes: we eliminated one attribute from the data and tested the algorithm on the remaining ones.

Figure 4: Learned topologies of the fully specified attributes from the database: (a) histogram of edges with function f_1; (b) histogram of edges with function f_2, lower β; (c) histogram of edges with function f_2, higher β

The results are shown in Figure 5, where Figures 5(a), 5(c) and 5(e) depict the histograms of the learned edges, and Figures 5(b), 5(d) and 5(f) show the most frequent learned edges. Figures 5(a) and 5(b)
depict the histogram of learned edges and the most frequent learned edges, respectively, obtained with function f2, β = , for all fully specified attributes of the bypass database. One can see that there are two crucial attributes in the database, namely the third and the seventh, which are central players in the attribute dependence relations. The next sub-figures depict the learned BNs when these attributes are eliminated: Figures 5(c) and 5(d) for the elimination of the 3rd attribute, and Figures 5(e) and 5(f) for the elimination of the 7th attribute. One can see that most of the dependence relations are stable, although there is a reorganisation among the dependencies when a crucial attribute is eliminated. The most visible instability is that the edge  →  becomes the edge  →  when either the third or the seventh attribute is eliminated. However, when analysing the dependencies  →  and  → , one can see that there is no significant difference between the frequencies of the two edges when learning on all attributes.

5 Conclusions

In this paper we presented a tree-like Bayesian network classifier algorithm developed for medical decision making problems, together with a stochastic algorithm that finds the most appropriate structure of the network. We tried two functions for eliminating determinism from the algorithm, f1 and f2, defined by Eq. 2 and Eq. 3 respectively. The topologies learned with the presented algorithm and functions were shown both for artificial and for real data. In this section we present results concerning the inference, and compare them with logistic regression and SVM. Table 1 shows the efficiency of the presented algorithms compared to logistic regression. Comparing the first and the second approach described above, the results are surprising: despite the high degree of uncertainty in some cases, the efficiency results are similar in all cases.
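Accuracies such as those in Table 1 are typically estimated on small medical data sets by leave-one-out cross-validation. A minimal harness for such a comparison (the classifier shown is a hypothetical majority-class baseline, not the paper's BN, logistic regression, or SVM, which would be plugged in the same way):

```python
def loo_accuracy(xs, ys, fit_predict):
    """Leave-one-out accuracy: hold out each sample, train on the rest."""
    hits = 0
    for i in range(len(ys)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        hits += int(fit_predict(train_x, train_y, xs[i]) == ys[i])
    return hits / len(ys)

def majority_baseline(train_x, train_y, query):
    # Predict the most frequent class label in the training fold.
    return max(set(train_y), key=train_y.count)

# Toy data: 4 negative and 2 positive samples
xs = [[0]] * 6
ys = [0, 0, 0, 0, 1, 1]
acc = loo_accuracy(xs, ys, majority_baseline)
```

Each competing method then receives exactly the same folds, so the accuracy figures are directly comparable.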
one can observe that the tree-like bayesian networks constructed with the presented algorithm perform better than logistic regression, but support vector machines with linear kernel obtain higher prediction accuracy than bn-s. it has to be mentioned, that although better accuracy on svm, they don’t allow interpretation of predictions, while bns do, and interpretation in case of tree-like bayesian networks, bayesian network classifier for medical data analysis 71 1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5 6 7 8 9 10 11 12 0 50 100 150 childparent ed ge fr eq ue nc y (a) histogram of edges all attributes 1 2 3 4 5 6 7 8 9 10 11 12 (b) most frequent edges all attributes 1 2 3 4 5 6 7 8 9 10 11 1 2 3 4 5 6 7 8 9 10 11 0 50 100 150 200 child parent ed ge fre qu en cy (c) histogram of edges loo 3. attribute 1 2 4 5 6 7 8 9 10 11 12 (d) most frequent edges loo 3. attribute 1 2 3 4 5 6 7 8 9 10 11 1 2 3 4 5 6 7 8 9 10 11 0 50 100 150 child parent ed ge fr eq ue nc y (e) histogram of edges loo 7. attribute 1 2 3 4 5 6 8 9 10 11 12 (f) most frequent edges loo 7. attribute figure 5: leave-one-out results on attributes method accuracy bayesian network f .% bayesian network f, β =  .% bayesian network f, β =  .% bayesian network f, β =  .% logistic regression .% svm with linear kernel .% table 1: efficiency of presented algorithms constructed as above, can be done efficiently. as for a summaryy of results it has to be mentioned that the learned structure by the algorithm is generally stable; the interpretation of the results is possible and partial observability is not a problem in case of prediction and interpretation. acknowledgements we acknowledge the program committee of the international conference on computers, communication and control 2008 for the recommendation and also thank for the problem description and the medical database to béla vizvári from department of operations research, eötvös loránd university, budapest. 
This work was partially supported by the Romanian Ministry of Education and Research through grant 11-039/2007.

Bibliography

[1] Zs. Csizmadia, P. L. Hammer, B. Vizvári. Generation of artificial attributes for data analysis. RUTCOR Research Report RRR 42-2004, Rutgers Center for Operations Research, Rutgers University, 2004.
[2] Zs. Csizmadia, B. Vizvári. Methods for the analysis of large real-valued medical databases by logical analysis of data. RUTCOR Research Report RRR 42-2004, Rutgers Center for Operations Research, Rutgers University, 2004.
[3] Judea Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge, 2000.
[4] Beáta Reiz, Lehel Csató. Tree-like Bayesian network classifiers for surgery survival chance prediction. In Proceedings of the International Conference on Computers, Communications and Control, Vol. III, pp. 470-474, 2008.
[5] Kevin P. Murphy. Learning Bayes net structure from sparse data sets. Technical report, Comp. Sci. Div., UC Berkeley, 2001.
[6] Jie Cheng, David A. Bell, Weiru Liu. An algorithm for Bayesian belief network construction from data, 1997.
[7] Jie Cheng, David A. Bell, Weiru Liu. Learning belief networks from data: an information theory based approach. In CIKM, pages 325-331, 1997.
[8] Mieczysław A. Kłopotek. Mining Bayesian network structure for large sets of variables. In ISMIS, pages 114-122, 2002.
[9] Thomas M. Cover, Joy A. Thomas. Elements of Information Theory. Wiley-Interscience, New York, NY, USA, 1991.
[10] Nir Friedman, Dan Geiger, Moises Goldszmidt. Bayesian network classifiers. Machine Learning, 29(2-3):131-163, 1997.
[11] David Heckerman, Christopher Meek. Models and selection criteria for regression and classification. Technical Report MSR-TR-97-08, Microsoft Research, 1997.
[12] F. Fleuret. Fast binary feature selection with conditional mutual information. Journal of Machine Learning Research, 5:1531-1555, November 2004.
[13] C. Chow and C.
Liu. Approximating discrete probability distributions with dependence trees. IEEE Transactions on Information Theory, 14(3):462-467, 1968.
[14] Christian P. Robert, George Casella. Monte Carlo Statistical Methods (Springer Texts in Statistics). Springer-Verlag, New York, NY, USA, 2005.

Lehel Csató obtained his BSc and MSc degrees at the Babeş-Bolyai University, Cluj-Napoca, and his PhD degree from the Neural Computing Research Group at Aston University, United Kingdom. He is interested in applications of machine learning techniques, specifically in applying non-parametric methods to Bayesian inference. His thesis investigated methods to approximate solutions of Bayesian regression using stochastic Gaussian processes, centring on sparse solutions that approximate the Gaussian processes. He teaches at the Babeş-Bolyai University, where he heads the Data Mining Research Group, and is interested in applications of Bayesian techniques in modern data processing and in probabilistic methods in robotics. Web page: http://www.cs.ubbcluj.ro/~csatol

Beáta Reiz obtained her BSc and MSc degrees at the Babeş-Bolyai University, Cluj-Napoca. She is currently a PhD student at the University of Szeged, Hungary, working in the Bioinformatics Group of the Biological Research Center, Szeged, under the supervision of Sándor Pongor and János Csirik.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 4, pp. 9-19

New Discrete Tanaka Sugeno Kang Fuzzy Systems Characterization and Stability Domain

Mohamed Benrejeb, Dhaou Soudani, Anis Sakly, Pierre Borne

Abstract: In this paper, an analytical approach to the characterization of discrete Tanaka Sugeno Kang (TSK) fuzzy systems is presented. This characterization concerns the choice of an adequate conjunctive operator between the input variables of discrete TSK fuzzy models, a t-norm, and its impact on the estimation of the stability domain.
This new approach is based on stability conditions derived from vector norms, corresponding to a vector Lyapunov function. In particular, second order discrete TSK models are considered, and this work concludes that Zadeh's t-norm, the logic product min, gives the largest estimate of the stability domain.

Keywords: discrete nonlinear systems, discrete TSK fuzzy model, t-norm, stability domain, vector norm, arrow form matrix.

1 Introduction

Fuzzy control of systems presents a major interest in several applications, including industrial ones. However, closed loop system properties are not easily understood, and the design of the fuzzy system is generally based on intuitive approaches. Different fuzzy control strategies exist. In particular, the TSK fuzzy approach permits the description and the control of a system by defining different models related via a rule base. The stability of TSK systems has been one of the central issues and is the subject of many works, in both the continuous and the discrete case. To this end, different approaches are considered, mainly based on Lyapunov functions [5, 10, 12]. In particular, the linear matrix inequality (LMI) formulation is used [11] and, according to the considered system, permits the resolution of the stability problem. In our previous work [1], the approach used is based on the convergence of a regular vector norm [3]. The vector norm approach, based on the comparison and overvaluation principle, has the major advantage that it deals with a very large class of systems, since no restrictive assumption is made on the matrices of the state equations. Thus, in [1], the approach estimates the stability domain of continuous TSK fuzzy systems and its dependence on the choice of the conjunctive operator between inputs. In this direction, many authors have presented and analyzed several of these operators, defined by t-norms and so called t-operators [4, 6, 7].
In particular, in [6], six t-norms are used to represent this connective operator in inverse pendulum control. In another work [9], the same t-norms are used to study their impact on the fuzzy control performance of a second order process, by defining a performance criterion based on the error measurement of the closed loop system. There exist, however, other operators to represent this connective, such as the mean operators developed in [14]. With the exception of [1], these studies do not treat the influence of the choice of the conjunctive operator between inputs on the stability conditions obtained; often only the prod or min operators are used, without theoretical argument and merely for reasons of simplification. The present study therefore constitutes a generalization of [1] to the discrete case, motivated by the software implementation nature of this control strategy. By exploiting the conditions and hypotheses obtained from the stability analysis of discrete TSK fuzzy systems based on the vector norm approach, the influence of the choice of the t-norm is studied. As in [1], only second order TSK fuzzy models are considered, for reasons of simplification. The next section presents the structure of discrete TSK fuzzy models. Section 3 gives several definitions dealing with the notion of t-norm as well as its properties. Section 4 presents sufficient stability conditions for these TSK systems based on the vector norm approach. The impact of the choice of the t-norm on the proposed stability conditions is then considered in Section 5; in particular, second order TSK fuzzy models are considered. A DC motor is studied in Section 6 as an application example validating this study. Finally, some concluding remarks are pointed out.
2 TSK discrete models description

An r-rule TSK discrete fuzzy model of an n-th order nonlinear system to be controlled is given by a rule base in which the i-th rule has the form:

$R_i$: if $x_1$ is $G_1^i$ and $x_2$ is $G_2^i$ and ... and $x_n$ is $G_n^i$ then $x(k+1) = A_i x(k) + B_i u(k)$, $y(k) = C_i x(k)$, $i = 1, 2, \dots, r$   (1)

where $G_j^i$, $j = 1, 2, \dots, n$, is the i-th fuzzy set of the state variable $x_j$. The state vector is $x \in \Re^n$, the control input $u \in \Re$, and the matrices $A_i$, $B_i$ and $C_i$ are of appropriate dimensions. According to the parallel distributed compensation (PDC) concept [13], the rule base $R'_i$ of the fuzzy controller stabilizing the former system has the form:

$R'_i$: if $x_1$ is $G_1^i$ and $x_2$ is $G_2^i$ and ... and $x_n$ is $G_n^i$ then $u(k) = -K_i x(k)$, $i = 1, 2, \dots, r$   (2)

with $K_i = [k_i^1, k_i^2, \dots, k_i^n]$. Substituting $u$ in equation (1) above gives:

$x(k+1) = \sum_{i=1}^{r} \sum_{j=1}^{r} h_i h_j (A_i - B_i K_j)\, x(k)$   (3)

with:

$h_i = w_i \Big/ \sum_{i=1}^{r} w_i$   (4)

$w_i = T\big(G_1^i(x_1), G_2^i(x_2), \dots, G_n^i(x_n)\big)$   (5)

and $T$ a t-norm. When the linear models of the system to be controlled are considered in the controllable form given by:

$A_i = \begin{bmatrix} 0 & 1 & \cdots & 0 \\ \vdots & \ddots & \ddots & \vdots \\ 0 & \cdots & 0 & 1 \\ -a_i^1 & \cdots & -a_i^{n-1} & -a_i^n \end{bmatrix}$ and $B_i = B = \begin{bmatrix} 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix}$   (6)

relation (3) becomes:

$x(k+1) = \sum_{i=1}^{r} \sum_{j=1}^{r} h_i h_j (A_i - B K_j)\, x(k) = \sum_{i=1}^{r} h_i A_i x(k) - \sum_{j=1}^{r} h_j B K_j x(k)$

using $\sum_{i=1}^{r} h_i = 1$, and then finally:

$x(k+1) = \sum_{i=1}^{r} h_i (A_i - B K_i)\, x(k)$   (7)

3 t-norms

An important task in the design of TSK fuzzy systems is the choice of the conjunctive operator materializing the connective "and" between input variables in the rule base, corresponding to the intersection operation between the fuzzy subsets associated with the different inputs.
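Equation (7) is straightforward to evaluate numerically. A small sketch of one closed-loop step (the matrices and memberships are illustrative, not taken from the paper):

```python
import numpy as np

def tsk_step(x, As, Ks, B, memberships, tnorm=min):
    """One closed-loop step x(k+1) = sum_i h_i (A_i - B K_i) x(k), eq. (7).
    memberships[i] lists G_j^i(x_j) for rule i; tnorm combines them, eq. (5)."""
    w = np.array([tnorm(m) for m in memberships])  # rule firing strengths
    h = w / w.sum()                                # normalised weights, eq. (4)
    A_cl = sum(hi * (Ai - B @ Ki) for hi, Ai, Ki in zip(h, As, Ks))
    return A_cl @ x

# Two rules with zero feedback gains (K_i = 0) and equal firing strengths,
# so h = [0.5, 0.5] and the blended matrix is 0.5*I + 0.5*2I = 1.5*I:
As = [np.eye(2), 2 * np.eye(2)]
Ks = [np.zeros((1, 2)), np.zeros((1, 2))]
B = np.zeros((2, 1))
memberships = [(0.4, 0.6), (0.6, 0.4)]   # min gives w = [0.4, 0.4]
x1 = tsk_step(np.array([1.0, 1.0]), As, Ks, B, memberships)
```

Swapping `tnorm` (min, product, ...) changes the weights $h_i$ and hence the blended closed-loop matrix, which is exactly the effect studied in the following sections.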
Often this operator is defined by a t-norm $T$, whose definition and properties are presented as follows.

Definition 1. A triangular norm (t-norm) is a function $T : [0,1] \times [0,1] \to [0,1]$ verifying, for every $u$, $v$, $w$ and $t$ of $[0,1]$:

i) $T(u,v) = T(v,u)$ (commutativity)
ii) $T(u, T(v,w)) = T(T(u,v), w)$ (associativity)
iii) $T(u,v) \le T(w,t)$ if $u \le w$ and $v \le t$ (monotonicity)
iv) $T(u,1) = u$ (identity element 1)   (8)

The min operator is the largest of all possible t-norms: $T(u,v) \le \min(u,v)$. The intersection operation between fuzzy subsets is defined by a t-norm $T$ such that if $C = A \cap B$, then $\forall x \in X$: $\mu_C(x) = T(\mu_A(x), \mu_B(x))$   (9)

The commonly encountered t-norms are given in Table 1.

Table 1: main t-norms

$\min(u, v)$ | Zadeh (logical operators)
$uv$ | Bandler (probabilistic operators)
$\max(u + v - 1, 0)$ | Łukasiewicz (bounded operators)
$u$ if $v = 1$; $v$ if $u = 1$; $0$ otherwise | Weber (drastic operators)

Definition 2. A t-norm $T$ is said to be Archimedean if and only if $T(u,v)$ is continuous and $T(u,u) < u$ for each $u$ of $]0,1[$. Moreover, $T$ is said to be strictly Archimedean if and only if $T(u,v) < T(w,t)$ as soon as $u < w$ and $v < t$. A function $T : [0,1] \times [0,1] \to [0,1]$ is an Archimedean t-norm if and only if there exists a continuous decreasing function $f : [0,1] \to [0, +\infty[$ with $f(1) = 0$ satisfying:

$T(u,v) = f^{-1*}(f(u) + f(v))$   (10)

where $f^{-1*}$ is the pseudo-inverse of $f$, i.e. $f^{-1*}(w) = f^{-1}(w)$ if $w \in [0, f(0)[$ and $f^{-1*}(w) = 0$ if $w \in [f(0), +\infty[$. Moreover, $T$ is strict if and only if $f(0) = +\infty$. $f$ is called the additive generator of the t-norm $T$. The additive generators associated with the main parameterized Archimedean t-norms are presented in Table 2 [14].
Table 2: main parameterized t-norms and their additive generators

- Hamacher (strict Archimedean, $\gamma > 0$): $T(u,v) = \frac{uv}{\gamma + (1-\gamma)(u+v-uv)}$, generator $f(u) = \frac{1}{\gamma}\log\frac{\gamma + (1-\gamma)u}{u}$
- Schweizer & Sklar (strict Archimedean, $p > 0$): $T(u,v) = \big[(\frac{1}{u})^p + (\frac{1}{v})^p - 1\big]^{-1/p}$, generator $f(u) = (\frac{1}{u})^p - 1$
- Dombi (strict Archimedean, $\lambda > 0$): $T(u,v) = \big[1 + \big((\frac{1}{u}-1)^\lambda + (\frac{1}{v}-1)^\lambda\big)^{1/\lambda}\big]^{-1}$, generator $f(u) = (\frac{1}{u}-1)^\lambda$
- Yager (Archimedean, not strict, $\omega > 0$): $T(u,v) = 1 - \min\big[1, \big((1-u)^\omega + (1-v)^\omega\big)^{1/\omega}\big]$, generator $f(u) = (1-u)^\omega$
- Dubois & Prade (not Archimedean, $\alpha \in [0,1]$): $T(u,v) = \frac{uv}{\max(u, v, \alpha)}$

4 New stability conditions

In [2], a change of basis puts (6) into the arrow form:

$x(k+1) = \sum_{i=1}^{r} h_i M_i x(k)$   (11)

where $M_i$ is a matrix in the arrow form and $P$ is the corresponding passage matrix:

$M_i = P^{-1}(A_i - B K_i) P$   (12)

$M_i = \begin{bmatrix} \alpha_1 & 0 & \cdots & 0 & \beta_1 \\ 0 & \ddots & \ddots & \vdots & \vdots \\ \vdots & \ddots & \ddots & 0 & \vdots \\ 0 & \cdots & 0 & \alpha_{n-1} & \beta_{n-1} \\ \gamma_i^1 & \cdots & \cdots & \gamma_i^{n-1} & \gamma_i^n \end{bmatrix}$ and $P = \begin{bmatrix} 1 & 1 & \cdots & 1 & 0 \\ \alpha_1 & \alpha_2 & \cdots & \alpha_{n-1} & 0 \\ \alpha_1^2 & \alpha_2^2 & \cdots & \alpha_{n-1}^2 & \vdots \\ \vdots & \vdots & \cdots & \vdots & 0 \\ \alpha_1^{n-1} & \alpha_2^{n-1} & \cdots & \alpha_{n-1}^{n-1} & 1 \end{bmatrix}$   (13)

with:

$\beta_j = \prod_{q=1,\, q \ne j}^{n-1} (\alpha_j - \alpha_q)^{-1}$, $\forall j = 1, 2, \dots, n-1$

$\gamma_i^j = -P_i(\alpha_j)$, $\forall j = 1, 2, \dots, n-1$, where $P_i(\lambda) = \lambda^n + \sum_{q=0}^{n-1} (a_i^{q+1} + k_i^{q+1}) \lambda^q$

$\gamma_i^n = -(a_i^n + k_i^n) - \sum_{j=1}^{n-1} \alpha_j$   (14)

The application of the classical Borne-Gentina criterion [3] leads to the following theorem.

Theorem 3. The discrete system described by (7) is asymptotically stable if there exist $0 < \alpha_j < 1$, $\alpha_j \ne \alpha_k$ $\forall j \ne k$, such that $\forall x \in S$:

i) $1 - |\alpha_j| > 0$, $\forall j = 1, 2, \dots, n-1$
ii) $1 - \big|\sum_{i=1}^{r} h_i \gamma_i^n\big| - \sum_{j=1}^{n-1} \big|\sum_{i=1}^{r} h_i \gamma_i^j \beta_j\big|\,(1 - |\alpha_j|)^{-1} > 0$   (15)

If $S = \Re^n$, the stability is global. Furthermore, if there exist $\alpha_j$, $j = 1, 2, \dots, n-1$, such that:

i) $0 < \alpha_j < 1$, $j = 1, 2, \dots, n-1$
ii) $\sum_{i=1}^{r} h_i \gamma_i^n > 0$
iii) $\sum_{i=1}^{r} h_i \gamma_i^j \beta_j > 0$, $j = 1, 2, \dots, n-1$   (16)

then the previous theorem can be simplified to the following corollary.

Corollary 4.
The discrete system described by (7) is asymptotically stable if there exist $0 < \alpha_j < 1$, $\alpha_j \ne \alpha_k$ $\forall j \ne k$, such that $\forall x \in S$:

$B P H < 0$   (17)

where the matrices $B \in \Re^{n \times n}$, $P \in \Re^{n \times r}$ and $H \in \Re^{r \times 1}$ are such that:

$B = \mathrm{diag}(\beta_1, \dots, \beta_{n-1}, -1)$, $P = \begin{bmatrix} P_1(\alpha_1) & \cdots & P_r(\alpha_1) \\ \vdots & & \vdots \\ P_1(\alpha_{n-1}) & \cdots & P_r(\alpha_{n-1}) \\ P_1(1) & \cdots & P_r(1) \end{bmatrix}$ and $H = \begin{bmatrix} h_1 \\ \vdots \\ h_r \end{bmatrix}$   (18)

If $S = \Re^n$, the stability is global. According to the choice of the coefficients $\alpha_j$, different conditions on the parameters $h_i$ can be obtained.

5 t-norm influence on the proposed stability conditions

Let us consider the classical fuzzy partition of a second order TSK model given in Figure 1, with:

$\mu_i = G_i^1(x_i)$, $i = 1, 2, \dots, n$   (19)

and:

$G_i^1(x_i) = \min\Big(1, \max\Big(0, \frac{l_i - x_i}{2 l_i}\Big)\Big)$ and $G_i^2(x_i) = 1 - G_i^1(x_i)$   (20)

[Figure 1: fuzzy partition of the state variables for r = 2.]

The whole fuzzy state space considered is then the open subset $\Omega = ]-l_1, l_1[ \times ]-l_2, l_2[ \times \dots \times ]-l_n, l_n[$. Thus the notion of global stability, whenever used in the literature on fuzzy control, refers to local stability within such a domain [8]. For a second order TSK model, i.e. r = 2, we have $h_2 = 1 - h_1$ and:

$h_1 = \frac{w_1}{w_1 + w_2} = \frac{T(\mu_1, \mu_2, \dots, \mu_n)}{T(\mu_1, \mu_2, \dots, \mu_n) + T(1-\mu_1, 1-\mu_2, \dots, 1-\mu_n)}$   (21)

where $T$ is a t-norm.

Propositions [1]

1. Let $\mu_1, \mu_2, \dots, \mu_n$ and $\mu_{max} \in [0,1]$ be such that $\mu_1 \le \mu_{max}$, $\mu_2 \le \mu_{max}$, ..., and $\mu_n \le \mu_{max}$. Then:

$h_1(\mu_1, \mu_2, \dots, \mu_n) \le h_1(\mu_{max}, \mu_{max}, \dots, \mu_{max})$   (22)

This proposition allows the determination of the domain $S$ by its characteristic point $M$ in Figure 2.

[Figure 2: correspondence between the state space variable domain (the domain S inside Ω, with characteristic point M at (x1max, x2max)) and the membership value domain.]

2. Let us write $\varphi_n(\mu) = T(\mu_1, \mu_2, \dots, \mu_n)$ when $\mu_1 = \mu_2 = \dots = \mu_n = \mu$.
$\varphi_n$ is an increasing function on $[0,1]$ with $\varphi_n(0) = 0$ and $\varphi_n(1) = 1$. If the following conditions are satisfied:

i) $\varphi_n$ is continuous
ii) $\forall \mu > 0$, $\varphi_n(\mu) \ne 0$
iii) $\forall \mu_1, \mu_2 \in ]0,1]$ with $\mu_2 > \mu_1$: $\frac{\varphi_n(\mu_2)}{\mu_2} \ge \frac{\varphi_n(\mu_1)}{\mu_1}$   (23)

then we have:

$h_1(\mu, \mu, \dots, \mu) \ge \mu$ if $\mu \ge 0.5$, and $h_1(\mu, \mu, \dots, \mu) \le \mu$ if $\mu \le 0.5$   (24)

This proposition means that, for a given $\mu$, the smallest coefficient $h_1$ is obtained with the t-norm of Zadeh, the logic product min; therefore, for condition (22), the largest rectangular domain $S$ is obtained with this t-norm, by taking $\mu_{max} = c$.

3. Conditions (23) of Proposition 2 are satisfied for each strictly Archimedean t-norm such that $\varphi_n(\mu)$ is differentiable on $]0,1]$.

The proposed approach is illustrated by considering the following parameterized t-norms:

- the Hamacher one: $T(\mu_1, \mu_2) = \frac{\mu_1 \mu_2}{\gamma + (1-\gamma)(\mu_1 + \mu_2 - \mu_1 \mu_2)}$ for $\gamma > 0$,
- the Yager one: $T(\mu_1, \mu_2) = 1 - \min\big(1, ((1-\mu_1)^\omega + (1-\mu_2)^\omega)^{1/\omega}\big)$ for $\omega > 0$,
- and the Dubois one: $T(\mu_1, \mu_2) = \frac{\mu_1 \mu_2}{\max(\mu_1, \mu_2, \alpha)}$ for $\alpha \in [0,1]$.

6 Application example

As an example, let us consider, as in Figure 3, a DC motor described by two linear models $g_i(p) = \frac{1}{p(p+\delta_i)}$, $i = 1, 2$, with $\delta_1 = 0.5$ and $\delta_2 = 1$.

[Figure 3: DC motor discrete models; a zero order holder $b_0(p)$ in series with $g_i(p)$, input u, output x.]

$b_0(p) = \frac{1 - e^{-T_e p}}{p}$ is a zero order holder and $T_e = 0.2$ s is the sampling time. The discrete models of the DC motor are given by the following z-transmittances:

$g_i(z) = \frac{n_i(z)}{(z-1)(z-\psi_i)}$, $i = 1, 2$, with $\psi_i = e^{-\delta_i T_e}$ and $n_i(z) = \frac{\delta_i T_e + e^{-\delta_i T_e} - 1}{\delta_i^2}\, z + \frac{1 - e^{-\delta_i T_e}(1 + \delta_i T_e)}{\delta_i^2}$.
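The pole locations of these z-transmittances can be checked numerically: with $T_e = 0.2$ s, the denominator $(z-1)(z-\psi_i) = z^2 - (1+\psi_i)z + \psi_i$ determines the companion-form state matrices, since the last row of such a matrix carries the negated characteristic coefficients. A sketch, rounding to three decimals:

```python
import math

Te = 0.2  # sampling time (s)
rows = {}
for delta in (0.5, 1.0):
    psi = math.exp(-delta * Te)
    # (z - 1)(z - psi) = z^2 - (1 + psi) z + psi, so the controllable
    # (companion) form has last row [-psi, 1 + psi]
    rows[delta] = (round(-psi, 3), round(1 + psi, 3))
```

For `delta = 0.5` this gives the row (-0.905, 1.905) and for `delta = 1.0` the row (-0.819, 1.819).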
The two discrete DC linear models are considered in the following controllable form:

$A_1 = \begin{bmatrix} 0 & 1 \\ -0.905 & 1.905 \end{bmatrix}$, $A_2 = \begin{bmatrix} 0 & 1 \\ -0.819 & 1.819 \end{bmatrix}$ and $B = \begin{bmatrix} 0 \\ 1 \end{bmatrix}$

We suppose that, due to particular constraints, the choice of $K_i$ is imposed such that the pole placement is different for the two models, by taking:

$K_1 = [\,-0.7 \;\; 1.2\,]$ and $K_2 = [\,-1.1 \;\; 1.2\,]$

According to [3], the minimal overvaluing matrix relative to the regular vector norm $p(x) = [\,|x_1|, |x_2|\,]^T$ is:

$M(\cdot) = \begin{bmatrix} |\alpha| & 1 \\ |\gamma_1| & |\gamma_2| \end{bmatrix}$

with

$\gamma_1 = (0.486 - 0.086\alpha)\, h_1 + \alpha^2 - 0.619\alpha - 0.281$
$\gamma_2 = 0.086\, h_1 + 0.619 - \alpha$

Then the stability conditions deduced from the corollary are:

i) $0 < \alpha < 1$
ii) $0.086\, h_1 + 0.619 - \alpha > 0$
iii) $(0.486 - 0.086\alpha)\, h_1 + \alpha^2 - 0.619\alpha - 0.281 < 0$
iv) $0.4\, h_1 + 0.1 > 0$

When i) is satisfied, relations ii) and iv) are satisfied too. Condition iii) then leads to the inequality:

$h_1 < -\dfrac{\alpha^2 - 0.619\alpha - 0.281}{0.486 - 0.086\alpha} = c$

When $0.5 < c < 1$, let $S$ be a neighbourhood of the equilibrium point 0, included in $\Omega = ]-1,1[ \times ]-1,1[$, that verifies such a condition. $S$ is an overvaluing domain of the fuzzy system, and the estimation of a domain $S$ symmetrical with respect to 0 imposes: $1 - c < h_1 < c$. Now consider the impact of the t-norm $T$ on the width of the neighbourhood $S$ of the equilibrium point 0, and then the determination of the largest stability domain $D$ included in $S$ verifying the previous condition. For $\alpha = 0.345$ we obtain the maximal value of $c$, $c = 0.82$. The overvaluing matrix $M(\cdot)$ is then constant:

$M(\cdot) = M = \begin{bmatrix} \alpha & 1 \\ 0 & \alpha \end{bmatrix}$

which is triangular and not irreducible.
However, for $c = 0.80$ this reducibility is avoided, and for $\alpha = 0.345$ we obtain the overvaluing matrix

$M = \begin{bmatrix} 0.345 & 1 \\ 0.01 & 0.343 \end{bmatrix}$

which is irreducible, whose principal eigenvalue is $\lambda_m = 0.446$ with corresponding eigenvector $u_m = [\,9.88 \;\; 1\,]^T$, and the largest estimated stability domain $D$ is such that

$D = \{\, x \in S \;/\; p^T(x)\, u_m = 9.88\,|x_1| + |x_2| \le x_{max} \,\}$

where $S$ is the square form domain and $x_{max}$ is its width. $S$ depends on the condition $h_1 < c$ with $c = 0.8$, and therefore on the t-norm $T$.

Table 3: $\mu_{max}$ and $x_{max}$ corresponding to different t-norms

t-norm | $\mu_{max}$ | $x_{max}$
Zadeh | 0.80 | 0.60
Bandler | 0.67 | 0.34
Hamacher ($\gamma = 0$) | 0.75 | 0.50
Hamacher ($\gamma = \infty$) | 0.58 | 0.16
Yager ($\omega = 2$) | 0.62 | 0.24
Dubois ($\alpha = 0.5$) | 0.70 | 0.40

For n = 2, Table 3 gives $\mu_{max}$, corresponding to the membership value domain, and $x_{max}$, corresponding to the square form domain $S$, for the different t-norms. The results in Table 3 show that for n = 2 the greatest value of $\mu_{max}$, and hence the largest domain $S$, is obtained specifically for the logic product min. Figure 4 presents an estimation of the largest stability domain $D$ for the Zadeh and Bandler t-norms respectively.

[Figure 4: stability domains obtained for two different t-norms within Ω = ]-1,1[ × ]-1,1[. a) Zadeh's t-norm (min): S of half-width 0.6, D reaching 0.06 on x1; b) Bandler's t-norm (prod): S of half-width 0.34, D reaching 0.03 on x1.]

7 Conclusion

In this paper we are interested in the stability study of discrete TSK fuzzy systems and the corresponding domain estimation relative to the conjunctive operator used, materialized by a t-norm. The stability approach is based on vector norms. When second order TSK models are considered, a mathematical study permits the conclusion that Zadeh's t-norm, the min operator, yields the largest stability domain.
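This conclusion can be checked numerically for equal memberships: with $\mu > 0.5$, the min t-norm yields the smallest coefficient $h_1$ of eq. (21) among the listed t-norms, hence the weakest constraint $h_1 < c$ and the largest estimated domain. A sketch (the t-norm implementations here are n-ary extensions by associativity):

```python
def h1(mu, n, tnorm):
    """h1 of eq. (21) for equal memberships mu_1 = ... = mu_n = mu."""
    num = tnorm([mu] * n)
    den = num + tnorm([1 - mu] * n)
    return num / den

def prod(v):                  # Bandler (probabilistic) t-norm
    p = 1.0
    for x in v:
        p *= x
    return p

def lukasiewicz(v):           # bounded t-norm, n-ary form max(sum - (n-1), 0)
    return max(sum(v) - (len(v) - 1), 0.0)

mu, n = 0.8, 2
vals = {t.__name__: h1(mu, n, t) for t in (min, prod, lukasiewicz)}
```

For $\mu = 0.8$, $n = 2$: min gives $h_1 = 0.8$, the product gives about $0.94$, and the Łukasiewicz t-norm gives $1.0$, so min indeed produces the smallest $h_1$.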
A DC motor with two discrete models is considered as an illustrative example, and the estimated stability domains for the different t-norms confirm the former result. It is worth remarking, however, that this study can be generalized to r-rule TSK models, in both the continuous and the discrete case, with r greater than 2; one must then distinguish between the two cases, r odd or even. In both cases, the stability study can be carried out in different symmetrical rectangular zones surrounding the equilibrium point, from the smallest to the largest.

References

[1] Benrejeb M., Sakly A., Ben Othman K., Borne P.: Choice of conjunctive operator of TSK fuzzy systems and stability domain study. To appear in Mathematics and Computers in Simulation.
[2] Benrejeb M., Borne P., Laurent F.: Sur une application de la représentation en flèche à l'analyse des processus. RAIRO Automatique, 16(2), (1982) 133-146.
[3] Borne P., Richard J. P., Radhy N. E.: Stabilité, stabilisation, régulation: approche par les normes vectorielles. In: Systèmes non linéaires, tome 2: stabilité-stabilisation (coord. Fossard A. J., Normand-Cyrot D.), 45-90. Editions Masson, Paris (1993).
[4] Butkiewicz B. S.: Control error of fuzzy system with different reasoning and defuzzification methods. 7th International Fuzzy Systems Association World Congress (IFSA), III, Prague, Czech Republic (1997).
[5] Cao S. G., Rees N. W., Feng G.: Quadratic stability analysis and design of continuous-time fuzzy control systems. Int. J. Syst. Sci., 27(2), (1996) 193-203.
[6] Cardenas E., Castillo J. C., Cordon O., Herrera F., Peregrin A.: Applicability of t-norms in fuzzy control. BUSEFAL, 61, (1995) 28-37.
[7] Gupta M. M., Qi J.: Design of fuzzy logic controllers based on generalized t-operators. Fuzzy Sets and Systems, 40, (1991) 473-489.
[8] Palm R., Driankov D., Hellendoorn H.
: Model Based Fuzzy Control. Springer-Verlag, Berlin Heidelberg (1997).
[9] Sakly A., Ben Othman K., Benrejeb M.: Sur le choix de l'opérateur de conjonction en commande floue basé sur la robustesse de critères de performances. 2ème Conférence Internationale JTEA, tome 1, Sousse Nord (2002).
[10] Sugeno M.: On stability of fuzzy systems expressed by fuzzy rules with singleton consequences. IEEE Transactions on Fuzzy Systems, 7(2), (1999) 201-223.
[11] Tanaka K., Ikeda T., Wang H. O.: Fuzzy regulators and fuzzy observers: relaxed stability conditions and LMI-based designs. IEEE Transactions on Fuzzy Systems, 6(2), (1998) 1-16.
[12] Tanaka K., Sugeno M.: Stability analysis and design of fuzzy control systems. Fuzzy Sets and Systems, 45, (1992) 135-156.
[13] Tanaka K., Sano M.: A robust stabilization problem of fuzzy control systems and its application to backing up control of a truck-trailer. IEEE Transactions on Fuzzy Systems, II, (1994) 119-134.
[14] Yager R. R., Filev D. P.: Essentials of Fuzzy Modeling and Control. John Wiley & Sons, Inc., USA (1994).

Mohamed Benrejeb (1), Dhaou Soudani (1), Anis Sakly (1), Pierre Borne (2)
(1) Ecole Nationale d'Ingénieurs de Tunis, Unité de Recherche LARA-Automatique, BP 37, Le Belvédère, 1002 Tunis, Tunisie
(2) Ecole Centrale de Lille, Cité Scientifique, Laboratoire d'Automatique, Génie Informatique et Signal, BP 48, 59651 Villeneuve d'Ascq Cedex, France
E-mail: mohamed.benrejeb@enit.rnu.tn, dhaou.soudani@enit.rnu.tn, sakly_anis@yahoo.fr, pierre.borne@ec-lille.fr
Received: November 10, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 701-709

Stable Factorization of Strictly Hurwitz Polynomials

Ö. Eğecioğlu, B. S. Yarman

Ömer Eğecioğlu, Department of Computer Science, University of California, Santa Barbara, CA 93106, USA. E-mail: omer@cs.ucsb.edu

B.
Siddik Yarman, Department of Electric and Electronics Engineering, College of Engineering, Istanbul University, 34320 Avcilar, Istanbul, Turkey. E-mail: sbyarman@gmail.com

Abstract: We propose a stable factorization procedure to generate a strictly Hurwitz polynomial from a given strictly positive even polynomial. This problem typically arises in applications involving real frequency techniques. The proposed method does not require any root finding algorithm; rather, the factorization is carried out directly by solving a set of quadratic equations in multiple variables employing Newton's method. The selection of the starting point for the iterations is not arbitrary, and involves interrelations among the coefficients of the set of solution polynomials differing only in the signs of their roots. It is hoped that this factorization technique will provide motivation for the factorization of two-variable positive functions to generate scattering Hurwitz polynomials in two variables, for which root finding methods are not applicable.

Keywords: Routh-Hurwitz stability, Hurwitz polynomial, stable factorization, Newton's method.

1 Introduction

In many microwave communication system design, modeling and simulation problems, the description of lossless two-ports in one or two kinds of elements is essential [5]. In the design of microwave matching networks and amplifiers, or in the modeling of passive one-port devices such as antennas, lossless two-ports are described either in terms of driving point immittance or reflectance functions [6,7]. The methods known as real frequency techniques (RFT) are excellent tools for design and modeling [5, 8]. Once the independent descriptive parameters are selected, numerical implementation of real frequency techniques demands the construction of strictly Hurwitz polynomials.
For example, in the simplified real frequency technique (SRFT), the numerator polynomial $h(p) = h_0 + h_1 p + \cdots + h_n p^n$ of the driving point input reflectance $S_{11}(p) = \frac{h(p)}{g(p)}$ completely specifies the scattering parameters of the lumped element reciprocal lossless two-port as follows:

$S_{12} = S_{21} = \dfrac{f(p)}{g(p)}$ and $S_{22} = \dfrac{f(p)}{f(-p)} \cdot \dfrac{h(-p)}{g(p)}$   (1)

provided that the monic polynomial $f(p)$, which is constructed on the transmission zeros of the system under consideration, is pre-selected. In this representation, the denominator polynomial $g(p) = g_0 + g_1 p + \cdots + g_n p^n$ is generated as a strictly Hurwitz polynomial from the equation

$G(p^2) = g(p)\, g(-p) = h(p)\, h(-p) + f(p)\, f(-p) = G_0 + G_1 p^2 + \cdots + G_n p^{2n}$   (2)

which is obtained by means of the lossless condition. Once $f(p)$ is selected, (2) is specified in terms of the real coefficients $\{h_0, h_1, \dots, h_n\}$ of $h(p)$. For many practical problems it may be sufficient to choose $f(p) = p^k$, $k \le n$. In this case, (2) results in a set of quadratic equations such that

$G_0 = g_0^2 = h_0^2$
$G_1 = -g_1^2 + 2 g_0 g_2 = -h_1^2 + 2 h_0 h_2$
...
$G_i = (-1)^i g_i^2 + 2 \sum_{j=0}^{i-1} (-1)^j g_j g_{2i-j} = (-1)^i h_i^2 + 2 \sum_{j=0}^{i-1} (-1)^j h_j h_{2i-j}$ (with $g_m = h_m = 0$ for $m > n$)
...
$G_k = G_{(i=k)} + (-1)^k$ (the extra term being contributed by $f(p) f(-p) = (-1)^k p^{2k}$)
...
$G_n = (-1)^n g_n^2 = (-1)^n h_n^2$   (3)

It should be mentioned that the general form of $f(p) f(-p)$ may be described as

$F(p^2) = f(p)\, f(-p) = F_0 + F_1 p^2 + \cdots + F_n p^{2n}$   (4)

Then it is straightforward to revise (3) with the help of (4). At this point, the crucial issue is to generate $g(p)$ as a strictly Hurwitz polynomial, employing either (2) or (3). If one employs (2), it is sufficient to find the roots of $G(p^2)$ and then construct $g(p)$ on the left half-plane roots of $G(p^2)$, yielding $g(p) = g_0 + g_1 p + \cdots + g_n p^n$. This has been the common practice of the SRFT.
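This root-finding practice is easy to sketch for the single-variable case (a sketch assuming a strictly positive even polynomial, so no roots fall on the imaginary axis; coefficient lists are ascending):

```python
import numpy as np

def stable_factor(G):
    """Given G(z^2) = G0 + G1 z^2 + ... + Gn z^{2n} (ascending coefficients
    in w = z^2), return ascending coefficients of the strictly Hurwitz g(z)
    with g(z) g(-z) = G(z^2), via root finding."""
    w_roots = np.roots(G[::-1])              # np.roots expects descending order
    betas = []
    for a in w_roots:
        b = np.sqrt(complex(a))
        betas.append(b if b.real < 0 else -b)   # pick the left half-plane root
    gn = np.sqrt(abs(G[-1]))                 # |Gn| = gn^2
    g_desc = gn * np.poly(betas)             # monic product of (z - beta_i)
    return np.real(g_desc)[::-1]             # ascending; imaginary parts ~ 0

# G(z^2) = 1 - 2 z^2 + z^4 factors as g(z) = 1 + 2 z + z^2 = (z + 1)^2
g = stable_factor([1.0, -2.0, 1.0])
```

The product of the returned g(z) with g(-z) (odd-power signs flipped) reproduces the input, which is a convenient sanity check.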
However, if the problem under consideration demands the construction of lossless two-ports with two kinds of elements, then there is no way to carry out the computation by means of root finding techniques. In this case, one has to rewrite (2) in two variables as $G(p, \lambda) = g(p, \lambda)\, g(-p, -\lambda)$ and revise (3) accordingly. Eventually one needs to solve (3) to generate $g(p, \lambda)$ as a "two-variable scattering Hurwitz polynomial" [1,2]. In this representation the complex variable $p = \sigma + j\omega$ is associated with the first kind of elements, and the complex variable $\lambda = \sigma + j\omega$ is associated with the second kind of elements of the lossless two-port. Actually, posing the problem this way amounts to the factorization of the two-variable polynomial $G(p, \lambda)$ as $g(p, \lambda)\, g(-p, -\lambda)$, which in turn yields the scattering Hurwitz polynomial $g(p, \lambda)$. To the best of the authors' knowledge, there is no explicit solution for the factorization of two-variable polynomials in the current literature. For the single variable case, however, root finding techniques provide excellent results, as described within the SRFT. Therefore, in this paper, to provide insight into the general factorization problem, an attempt is made to develop a numerical procedure to solve (3) in the single variable case, with the hope that the procedure presented here may be extended to cover the two-variable factorization case.

2 Mathematical problem statement

Let $G(z^2) = G_0 + G_1 z^2 + G_2 z^4 + \cdots + G_n z^{2n}$ be a real polynomial with $G_0 > 0$. Consider a factorization of $G$ of the form

$G(z^2) = g(z)\, g(-z)$   (5)

for a real polynomial $g(z) = g_0 + g_1 z + g_2 z^2 + \cdots + g_n z^n$, as required in (3). Call (5) a stable factorization of $G$ if the polynomial $g$ is stable; that is, the real parts of the zeros of $g$ are strictly negative. We also refer to a stable polynomial as strictly Hurwitz.
from physical considerations that give rise to the problem, $G_0, G_1, \ldots, G_n$ are such that $G$ admits a stable factorization. our aim is to determine the coefficients of $g(z)$ as a function of $G_0, G_1, \ldots, G_n$.

2.1 on root finding

this one dimensional problem is theoretically solvable quite easily by root finding: since $G$ is a real polynomial, it can be factored as $G(z^2) = c(z^2 - \alpha_1)(z^2 - \alpha_2)\cdots(z^2 - \alpha_n)$ with $c > 0$ and the $\alpha_i$ complex. for $i = 1, 2, \ldots, n$, let $\beta_i = \pm\sqrt{\alpha_i}$, where the sign is picked so that $\beta_i$ has a negative real part. then $g(z) = \sqrt{c}\,(z - \beta_1)(z - \beta_2)\cdots(z - \beta_n)$, and the $g_i$ can be computed from this product. however, we wish to avoid this approach, as the real motivation behind the treatment of the one variable case is the factorization problem in two variables to generate scattering hurwitz polynomials, for which root finding techniques do not apply.

2.2 basic elements of routh-hurwitz stability

the conditions for a real polynomial

$$g(z) = g_0 + g_1 z + g_2 z^2 + \cdots + g_n z^n \qquad (6)$$

with $g_0 > 0$ to be strictly hurwitz are given in terms of the positivity of the hurwitz determinants

$$\Delta_i = \det\begin{bmatrix}
g_1 & g_3 & g_5 & \cdots & g_{2i-1}\\
g_0 & g_2 & g_4 & \cdots & g_{2i-2}\\
0 & g_1 & g_3 & \cdots & g_{2i-3}\\
0 & g_0 & g_2 & \cdots & g_{2i-4}\\
\vdots & & & \ddots & \vdots\\
0 & 0 & \cdots & & g_i
\end{bmatrix}.$$

the indices in each row increase by two and the indices in each column decrease by one. the term $g_j$ is taken to be zero if $j < 0$ or $j > n$. note that $\Delta_1 = g_1$.

theorem 1. (routh-hurwitz stability) a necessary and sufficient condition that the polynomial (6) is strictly hurwitz is that $\Delta_1, \Delta_2, \ldots, \Delta_n$ all be positive [3].

since $\Delta_n = g_n \Delta_{n-1}$, the condition that $\Delta_{n-1}$ and $\Delta_n$ be positive is equivalent to the requirement that $\Delta_{n-1}$ and $g_n$ be positive. furthermore, a necessary condition for (6) to be strictly hurwitz is that all coefficients $g_0$ through $g_n$ be positive.

3 the main quadratic system

comparing coefficients in (5), we derive a quadratic system of $n+1$ equations in the variables $g_0, g_1, \ldots$
$, g_n$:

$$G_k = \sum_{i+j=2k} (-1)^i g_i g_j, \qquad (k = 0, 1, \ldots, n) \qquad (7)$$

this is the system we are aiming to solve in the factorization problem. the additional constraint is that the polynomial (6) is stable. when $n = 5$,

$$\begin{aligned}
G_0 &= g_0^2\\
G_1 &= -g_1^2 + 2g_0 g_2\\
G_2 &= g_2^2 + 2g_0 g_4 - 2g_1 g_3\\
G_3 &= -g_3^2 - 2g_1 g_5 + 2g_2 g_4 \qquad (8)\\
G_4 &= g_4^2 - 2g_3 g_5\\
G_5 &= -g_5^2
\end{aligned}$$

so in this case the stable factorization problem is to find a solution $(g_0, g_1, g_2, g_3, g_4, g_5)$ of the quadratic system (8) in which each $g_i > 0$, and in which the constraints

$$\Delta_2 = \det\begin{bmatrix} g_1 & g_3\\ g_0 & g_2 \end{bmatrix} > 0, \quad
\Delta_3 = \det\begin{bmatrix} g_1 & g_3 & g_5\\ g_0 & g_2 & g_4\\ 0 & g_1 & g_3 \end{bmatrix} > 0, \quad
\Delta_4 = \det\begin{bmatrix} g_1 & g_3 & g_5 & 0\\ g_0 & g_2 & g_4 & 0\\ 0 & g_1 & g_3 & g_5\\ 0 & g_0 & g_2 & g_4 \end{bmatrix} > 0$$

are satisfied. in the general case $G_0, G_1, \ldots, G_n$ with $G_0 > 0$ are given as input. the required output is real $g_0, g_1, \ldots, g_n$, with $g_0 > 0$, such that $g_0, g_1, \ldots, g_n$ is a solution of the associated quadratic system (7) of $n+1$ equations and $g(z) = g_0 + g_1 z + \cdots + g_n z^n$ is strictly hurwitz. we assume that the $G_k$ are given so that the system has a solution of the required type.

3.1 newton's method

we consider the vector valued function $F : \mathbb{R}^{n+1} \to \mathbb{R}^{n+1}$ which has as its set of real zeros the solutions to the quadratic system (7). for $n = 5$, $F : \mathbb{R}^6 \to \mathbb{R}^6$ is $F = (f_0, f_1, \ldots, f_5)^T$ with

$$\begin{aligned}
f_0 &= x_0^2 - G_0\\
f_1 &= -x_1^2 + 2x_0 x_2 - G_1\\
f_2 &= x_2^2 + 2x_0 x_4 - 2x_1 x_3 - G_2\\
f_3 &= -x_3^2 - 2x_1 x_5 + 2x_2 x_4 - G_3\\
f_4 &= x_4^2 - 2x_3 x_5 - G_4\\
f_5 &= -x_5^2 - G_5
\end{aligned}$$

we compute the jacobian matrix as

$$J_F = 2\begin{bmatrix}
x_0 & 0 & 0 & 0 & 0 & 0\\
x_2 & -x_1 & x_0 & 0 & 0 & 0\\
x_4 & -x_3 & x_2 & -x_1 & x_0 & 0\\
0 & -x_5 & x_4 & -x_3 & x_2 & -x_1\\
0 & 0 & 0 & -x_5 & x_4 & -x_3\\
0 & 0 & 0 & 0 & 0 & -x_5
\end{bmatrix}.$$

we calculate by elementary operations

$$\det(J_F) = 2^6(-x_0 x_5)\det\begin{bmatrix}
-x_1 & x_0 & 0 & 0\\
-x_3 & x_2 & -x_1 & x_0\\
-x_5 & x_4 & -x_3 & x_2\\
0 & 0 & -x_5 & x_4
\end{bmatrix}
= 2^6(-x_0 x_5)\det\begin{bmatrix}
x_1 & x_3 & x_5 & 0\\
x_0 & x_2 & x_4 & 0\\
0 & x_1 & x_3 & x_5\\
0 & x_0 & x_2 & x_4
\end{bmatrix}$$

and $\det(J_F) = 2^6(-x_0)\Delta_5$. for general $n$ we have a similar identity relating the jacobian of $F$ and $\Delta_n$:

$$\det(J_F) = 2^{n+1}(-1)^n x_0 \Delta_n.$$
(9)

thus the jacobian $J_F$ does not vanish at $(g_0, g_1, \ldots, g_n)$ if $(g_0, g_1, \ldots, g_n)$ corresponds to a stable $g(x)$. in other words, starting from an initial point that is close enough to the stable solution, the jacobian of $F$ does not vanish. starting with an initial vector $x^{(0)} = (x_0, x_1, \ldots, x_n)^T$, we compute the iterates by newton's method as

$$x^{(m+1)} = x^{(m)} - J_F^{-1}(x^{(m)})\,F(x^{(m)})$$

until successive iterates are within a given tolerance. the invertibility of $J_F$ at the point $x^{(m)}$ is guaranteed for $x^{(m)}$ close to a stable solution $(g_0, g_1, \ldots, g_n)$. however, a real solution $g(z)$ of the quadratic system found by newton's method is not necessarily strictly hurwitz. the polynomial we want is obtained from $g(z)$ by flipping the sign of some of its roots so that each one has negative real part, even though we do not have access to the roots themselves.

example 1. suppose $G(x^2) = G_0 + G_1 x^2 + G_2 x^4 + G_3 x^6 + G_4 x^8$ with $G_0 = 9.244$, $G_1 = 72.286$, $G_2 = 217.183$, $G_3 = 296.638$, $G_4 = 155.673$. then $G(x^2) = g(x)g(-x)$, where $g(x) = g_0 + g_1 x + g_2 x^2 + g_3 x^3 + g_4 x^4$ with $g_0 = 3.040$, $g_1 = 2.289$, $g_2 = 12.749$, $g_3 = 4.637$, $g_4 = 12.476$ is strictly hurwitz. starting with the initial random vector of coefficients $(1.933, 2.008, 0.181, 0.870, 2.582)$ and tolerance 0.01, newton's method converges to the solution polynomial $3.040 + 0.004x + 11.887x^2 - 0.003x^3 + 12.477x^4$ of the quadratic system in 14 iterations. this polynomial is not stable: its roots are $-0.0928 \pm 0.6956j$ and $0.0929 \pm 0.6972j$.

3.2 an auxiliary problem

the necessity of being able to "flip" the sign of certain roots of a given real polynomial, as indicated above, results in the following auxiliary problem: given a real polynomial $g(x) = g_0 + g_1 x + \cdots + g_n x^n$ of degree $n$ with $g_0 > 0$, construct the real polynomial $h(x) = h_0 + h_1 x + \cdots + h_n x^n$ with $h_0 = g_0$, such that the roots of $h$ are $\pm$ roots of $g$ and $h$ is strictly hurwitz.
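the routh-hurwitz test of theorem 1, used to screen candidate solutions, can be sketched directly from the determinant definition. this is an illustrative implementation of ours (a routh-array version would be cheaper than evaluating each determinant):

```python
import numpy as np

def hurwitz_det(g, i):
    """i-th hurwitz determinant of g(z) = g[0] + g[1] z + ... + g[n] z^n:
    entry (r, c) is g_{2(c+1)-1-r}, taken as 0 outside 0..n."""
    n = len(g) - 1
    H = np.zeros((i, i))
    for r in range(i):
        for c in range(i):
            k = 2 * (c + 1) - 1 - r
            if 0 <= k <= n:
                H[r, c] = g[k]
    return np.linalg.det(H)

def is_strictly_hurwitz(g):
    """theorem 1: g0 > 0 and Delta_1, ..., Delta_n all positive."""
    n = len(g) - 1
    return g[0] > 0 and all(hurwitz_det(g, i) > 0 for i in range(1, n + 1))
```

on example 1, the test accepts the stable factor and rejects the unstable solution that newton's method produced.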
if we could generate the polynomials $h$ whose roots differ from the roots of $g$ only in their signs, then we could test each polynomial so generated by the routh-hurwitz criteria to see if it is stable. but there cannot be an analytic way involving radicals to do this: consider a generic fifth degree polynomial $g(x) = g_0 + g_1 x + \cdots + g_5 x^5$ with $g_0 > 0$. let $r$ be a real root of $g$, and let $h(x) = h_0 + h_1 x + \cdots + h_5 x^5$ be the polynomial with $h_0 = g_0$ which has the same roots as $g(x)$, except that as its fifth root it has $-r$ instead of $r$. since $g_4$ and $h_4$ are the negatives of the sums of the roots of $g(x)$ and $h(x)$ respectively (normalizing $g_5 = h_5 = 1$), we have $r = \frac{1}{2}(h_4 - g_4)$. we can then calculate $g(x)/(x - r)$ by synthetic division and compute the roots of this quartic by radicals. thus if there were a way of computing the coefficients of $h(x)$ from those of $g(x)$ by means of radicals, this would allow us to express the roots of a general fifth degree polynomial by radicals.

4 algorithmic approaches to finding g(x)

there are two essentially distinct approaches to finding the strictly hurwitz polynomial $g(x)$ given the input data $G_0, G_1, \ldots, G_n$ with $G_0 > 0$. both have a random component.

a1: generate an initial vector $x^{(0)} = (x_0, x_1, \ldots, x_n)^T$ and run newton's method starting with $x^{(0)}$. let the converged polynomial be $h(x)$. if $h(x)$ passes the routh-hurwitz criteria, then it is the strictly hurwitz polynomial desired and we are done. if not, generate another $x^{(0)}$ and continue.

a2: generate an initial vector $x^{(0)} = (x_0, x_1, \ldots, x_n)^T$ and run newton's method starting with $x^{(0)}$. let the converged polynomial be $h(x)$. if $h(x)$ passes the routh-hurwitz criteria, then it is the strictly hurwitz polynomial desired and we are done. if not, use the coefficients of $h(x)$ to generate another $x^{(0)}$ and continue.

a1 is simple to implement.
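algorithm a1 is essentially a newton iteration on the residual of (7), with restarts. a minimal sketch of the iteration (our own code, with the jacobian assembled directly from the quadratic form) is:

```python
import numpy as np

def residual_and_jacobian(x, Gc):
    """F_k = sum_{i+j=2k} (-1)^i x_i x_j - G_k, from (7), and its jacobian."""
    n = len(Gc) - 1
    F = -np.asarray(Gc, dtype=float)
    J = np.zeros((n + 1, n + 1))
    for k in range(n + 1):
        for i in range(max(0, 2 * k - n), min(2 * k, n) + 1):
            j = 2 * k - i
            s = (-1) ** i
            F[k] += s * x[i] * x[j]
            J[k, j] += s * x[i]      # d/dx_j of (-1)^i x_i x_j
            J[k, i] += s * x[j]      # d/dx_i of (-1)^i x_i x_j
    return F, J

def newton(Gc, x0, tol=1e-10, max_iter=200):
    """iterate x <- x - J^{-1} F until the newton step is below tol."""
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(max_iter):
        F, J = residual_and_jacobian(x, Gc)
        step = np.linalg.solve(J, F)
        x -= step
        if np.max(np.abs(step)) < tol:
            break
    return x
```

in a1 one would redraw $x^{(0)}$ (e.g. uniformly in $(0, \sqrt{G_0})$, as in example 2) and re-run until the converged polynomial passes the routh-hurwitz test.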
on the other hand, the number of executions of newton's method is smaller for a2, which essentially goes from a computed $h(x)$ to another polynomial whose roots are the negatives of some of the roots of $h(x)$. we shall indicate a number of methods for a2.

example 2. in [8], the data given for a monopole antenna is modeled using the linear interpolation technique proposed for positive real functions. we used this problem for the experimental evaluation of a1 and a2. for this model $G(x^2) = G_0 + G_1 x^2 + G_2 x^4 + G_3 x^6 + G_4 x^8$ with $G_0 = 9.244$, $G_1 = 72.286$, $G_2 = 217.183$, $G_3 = 296.638$, $G_4 = 155.673$. employing a1, the strictly hurwitz polynomial $g(x) = g_0 + g_1 x + g_2 x^2 + g_3 x^3 + g_4 x^4$ with $g_0 = 3.040$, $g_1 = 2.289$, $g_2 = 12.749$, $g_3 = 4.637$, $g_4 = 12.476$ was found. the initial vectors $x^{(0)} = (x_0, x_1, x_2, x_3, x_4)^T$ were generated by picking $x_i$ independently and uniformly in the range $0 < x_i < \sqrt{G_0}$. the average number of different starting points required for newton's method to converge to $g(x)$ with a tolerance of 0.001 is about 8, with a standard deviation of 5.

next we consider two properties of the family of polynomials which are solutions to (7) and differ only in the signs of their roots.

4.1 selection of starting points

for a2, we use the following idea. suppose we have two real solutions $g(x)$ and $h(x)$ to the quadratic system of equations (7). then $G(x^2) = g(x)g(-x) = h(x)h(-x)$, where $g(x)$ and $h(x)$ have the same roots up to signs. define $f(x) = h(x)/g(x)$. since $h(x)/g(x) = g(-x)/h(-x)$, $f(x)$ satisfies the functional equation

$$f(x)f(-x) = 1. \qquad (10)$$

put $f(x) = c_0 + c_1 x + c_2 x^2 + \cdots$ with $c_0 = 1$. from (10), we have

$$\begin{aligned}
1 &= c_0^2\\
0 &= -c_1^2 + 2c_0 c_2\\
0 &= c_2^2 + 2c_0 c_4 - 2c_1 c_3\\
0 &= -c_3^2 + 2c_0 c_6 - 2c_1 c_5 + 2c_2 c_4\\
0 &= c_4^2 + 2c_0 c_8 - 2c_1 c_7 + 2c_2 c_6 - 2c_3 c_5\\
&\;\;\vdots
\end{aligned}$$

the general form of the $k$-th equation for $k \ge 1$ is $0 = \sum_{i+j=2k} (-1)^i c_i c_j$. in this infinite system, each of $c_2, c_4, c_6, \ldots$ can be expressed in terms of the coefficients $c_1, c_3, c_5, \ldots$
in fact, we can represent $c_{2k}$ as a polynomial in $c_1, c_3, \ldots, c_{2k-1}$. from the second equation, $c_2 = \frac{1}{2}c_1^2$. using this with the third equation we get $c_4 = -\frac{1}{2}c_2^2 + c_1 c_3 = -\frac{1}{8}c_1^4 + c_1 c_3$, and $c_6 = \frac{1}{2}c_3^2 + c_1 c_5 - \frac{1}{2}c_1^3 c_3 + \frac{1}{16}c_1^6$. in the general case we can write

$$c_{2k} = \frac{1}{2}(-1)^{k+1} c_k^2 + \sum_{i=1}^{k-1} (-1)^{i+1} c_i c_{2k-i}. \qquad (11)$$

if in (11) we repeatedly substitute the expressions obtained for the earlier coefficients with even indices, we arrive at the expression of $c_{2k}$ in terms of $c_1, c_3, c_5, \ldots$. therefore

$$f(x) = 1 + c_1 x + \tfrac{1}{2}c_1^2 x^2 + c_3 x^3 + \left(c_1 c_3 - \tfrac{1}{8}c_1^4\right)x^4 + c_5 x^5 + \left(\tfrac{1}{2}c_3^2 + c_1 c_5 - \tfrac{1}{2}c_1^3 c_3 + \tfrac{1}{16}c_1^6\right)x^6 + c_7 x^7 + \cdots$$

thus

$$h(x) = g(x)\left(1 + c_1 x + \tfrac{1}{2}c_1^2 x^2 + c_3 x^3 + \left(c_1 c_3 - \tfrac{1}{8}c_1^4\right)x^4 + c_5 x^5 + \cdots\right)$$

for some real numbers $c_1, c_3, c_5, \ldots$. we can use the form of the coefficients of $f(x)$ to pick a new starting point if the solution $g(x)$ obtained from newton's method fails to be stable. for algorithm a2, we generate a new initial point $h(x)$ for newton's method from the current computed solution $g(x) = g_0 + g_1 x + \cdots + g_n x^n$ with $g_0 > 0$ by setting $h_k = \sum_{i=0}^{k} g_i c_{k-i}$ for $k = 0, 1, \ldots, n$, with $c_0 = 1$ and $0 = \sum_{i=0}^{k} g_i c_{k-i}$ for $k > n$. we then express the even indexed $c_i$ in terms of the odd indexed ones. after this stage, the $h_k$ involve only $c_1, c_3, \ldots, c_{n-1}$ (or up to $c_n$ if $n$ is odd). we pick random values for these $c_i$ satisfying these constraints.

example 3. for the data in example 2, we considered the experimental evaluation of a2. the initial vector $x^{(0)} = (x_0, x_1, x_2, x_3, x_4)^T$ was generated by picking $x_i$ independently and uniformly in the range $0 < x_i < \sqrt{G_0}$. after that, the algorithm jumps to the next initial vector using the ideas presented above. the average number of iterations needed to converge to the strictly hurwitz polynomial within a tolerance of 0.001 was 2, with a standard deviation of 1.
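the recurrence (11) for the even coefficients of $f$ is easy to mechanize; the following sketch is ours, with any odd coefficient not supplied taken as zero:

```python
def even_coefficients(c_odd, K):
    """c_0..c_{2K} of a series f with f(x)f(-x) = 1, built from the free
    odd coefficients c_1, c_3, ... via recurrence (11); c_0 = 1."""
    c = [0.0] * (2 * K + 1)
    c[0] = 1.0
    for t, v in enumerate(c_odd):       # place c_1, c_3, c_5, ...
        if 2 * t + 1 <= 2 * K:
            c[2 * t + 1] = v
    for k in range(1, K + 1):           # eq. (11), even indices in order
        c[2 * k] = 0.5 * (-1) ** (k + 1) * c[k] ** 2 + sum(
            (-1) ** (i + 1) * c[i] * c[2 * k - i] for i in range(1, k))
    return c
```

multiplying the current $g(x)$ by the truncation of such an $f(x)$ then yields the next starting point $h(x)$ for a2.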
4.2 a linear algebraic property

given a real polynomial $g(x) = g_0 + g_1 x + \cdots + g_n x^n$ of degree $n$, we briefly consider the problem of constructing a new polynomial $h(x) = h_0 + h_1 x + \cdots + h_n x^n$ whose roots depend on the roots of $g$, without actually finding the roots themselves. without loss of generality, $g_n = 1$. suppose the roots of $g$ are $\beta_1, \ldots, \beta_n$ and the required roots of $h$ are $p(\beta_1), \ldots, p(\beta_n)$ for some polynomial $p$. consider the companion matrix of $g$ defined by

$$C = \begin{bmatrix}
0 & 1 & & & \\
& 0 & 1 & & \\
& & \ddots & \ddots & \\
& & & 0 & 1\\
-g_0 & -g_1 & \cdots & & -g_{n-1}
\end{bmatrix}$$

the characteristic polynomial of $C$ is $\det(xI - C) = g(x)$. then $h$ can be expressed in terms of only the coefficients of $g$ as $\det(xI - p(C)) = h(x)$, without calculating the zeros $\beta_1, \ldots, \beta_n$. for example, for $g(x) = g_0 + g_1 x + x^2$ with zeros $\beta_1, \beta_2$,

$$C = \begin{bmatrix} 0 & 1\\ -g_0 & -g_1 \end{bmatrix}$$

and the characteristic polynomial of $C^2 - 3C$ has zeros $\beta_1^2 - 3\beta_1$, $\beta_2^2 - 3\beta_2$. we compute

$$C^2 - 3C = \begin{bmatrix} -g_0 & -3 - g_1\\ 3g_0 + g_0 g_1 & g_1^2 + 3g_1 - g_0 \end{bmatrix}$$

and therefore

$$h(x) = \det\begin{bmatrix} x + g_0 & 3 + g_1\\ -3g_0 - g_0 g_1 & x - g_1^2 - 3g_1 + g_0 \end{bmatrix} = g_0(g_0 + 3g_1 + 9) + (2g_0 - 3g_1 - g_1^2)\,x + x^2.$$

the reason this works is that $C$ is similar to an upper triangular matrix with $\beta_1, \ldots, \beta_n$ on the diagonal [4],

$$B^{-1} C B = \begin{bmatrix} \beta_1 & & *\\ & \ddots & \\ 0 & & \beta_n \end{bmatrix},
\qquad\text{so that}\qquad
B^{-1}\,p(C)\,B = \begin{bmatrix} p(\beta_1) & & *\\ & \ddots & \\ 0 & & p(\beta_n) \end{bmatrix}.$$

functions other than polynomials can be used for $p$ (e.g. mixtures of exponential and certain rational functions). however, for this approach to work, each $\beta_i$ must be transformed by the same function $p$, whereas we only want to change the sign of one of the $\beta_i$ at a time. let $I^{(i)}$ be the matrix obtained from the identity matrix by changing the $i$-th 1 to a $-1$. we would like to construct the matrix $s_i(C)$ such that

$$B^{-1}\,s_i(C)\,B = I^{(i)}\,B^{-1} C B. \qquad (12)$$

then the characteristic polynomial of $s_i(C)$ has zeros $\beta_1, \ldots, \beta_{i-1}, -\beta_i, \beta_{i+1}, \ldots, \beta_n$. from (12), $s_i(C) = B\,I^{(i)}\,B^{-1}\,C$. if $b_1, \ldots, b_n$ are the column vectors of $B$ and $c_1, \ldots$
$\ldots, c_n$ are the row vectors of $B^{-1}$, then $s_i(C)$ and $C$ are related by

$$s_i(C) = (I - 2\,b_i c_i)\,C. \qquad (13)$$

we do not need the matrix $B$ exactly (obtaining it may involve finding eigenvalues, which are not permitted in this approach). the characteristic polynomial of the perturbed matrix in (13) will only be used as a starting point for the next iteration of newton's method, so $b_i$ and $c_i$, and hence the outer product $b_i c_i$, can be approximate.

5 conclusions and further work

we have proposed a stable factorization procedure to generate a strictly hurwitz polynomial from a given strictly positive even polynomial. the factorization is carried out directly, by finding the solution of a set of quadratic equations in many variables employing newton's method. it is hoped that the method presented in this paper generalizes to two-variable polynomials. this would make possible the generation of scattering hurwitz polynomials, which are the two-dimensional analogues of strictly hurwitz polynomials.

bibliography

[1] aksen a., "design of lossless two-ports with mixed, lumped and distributed elements for broadband matching," phd dissertation, lehrstuhl für nachrichtentechnik, ruhr universitaet bochum, 1994.
[2] fettweis a., "on the scattering matrix and the transfer scattering matrix of multidimensional lossless two-ports," int. j. of communication, vol. 36, pp. 374-381.
[3] henrici p., applied and computational complex analysis, vol. ii, wiley, new york, 1977.
[4] lang s., linear algebra, addison-wesley, reading, ma, 1966, p. 183.
[5] yarman b. s., "broadband networks," wiley encyclopedia of electrical and electronics engineering, vol. ii, pp. 589-604, 1999.
[6] yarman b. s. and aksen a., "a reflectance-based computer aided modeling tool for high speed/high frequency communication systems," proc. ieee-iscas 2001; 4, pp. 270-273.
[7] yarman b. s., aksen a. and kilinc a., "immitance data modeling via linear interpolation techniques," proc. ieee-iscas 2002; 3, pp.
527-530. [8] yarman b. s., kilinc a. and aksen a., “immitance data modeling via linear interpolation techniques: a classical circuit theory approach," int. j. of circuit theory and applications, 2004; 32, pp. 537-563. [9] yarman b. s. and carlin h. j., “a simplified real frequency technique applied to broadband multi-stage amplifiers," ieee trans. mtt; 30, pp. 2216-2222, 1982. international journal of computers, communications & control vol. ii (2007), no. 4, pp. 355-366 robust predictive control using a gobf model for miso systems ali douik, jalel ghabi, hassani messaoud abstract: in this paper we develop a new method for robust predictive control for miso systems represented on the generalized orthonormal basis functions. unknown but bounded error approaches are used to update the uncertainty domain of the resultant model coefficients. this method uses a worst case strategy solved by a min-max optimization problem taking into account the constraints relative to parameter uncertainties and to measurement signals. keywords: predictive control, robust, generalized orthonormal basis functions, miso, ubbe. 1 introduction there has been interest in the use of orthogonal basis functions for the purposes of robust model predictive control (rmpc) [1, 2, 3, 4]. the most common model structure employing these bases is the well known fir one. however, the number of terms in the series expansion is high, and this may lead to poor accuracy in the estimated uncertainty domain parameter as well as the control strategy. another approach is to use laguerre or kautz models that are more suitable to represent systems having near or oscillating dynamics [5, 6]. moreover, using the popular armax model structure [7] involves a small number of parameters but the criterion to be minimized is not convex which may complicate the optimization problem. 
this paper is a contribution bridging these methods, by developing a new rmpc algorithm for a miso system represented on the generalized orthonormal basis functions (gobf) [8, 9]. the main features of using a gobf model in rmpc methods are that the common fir, laguerre and kautz model structures are special cases of this complete construction [10, 11, 12], that it is not sensitive to the choice of sampling interval, that it does not require prior knowledge of the system delay, and that it operates on a small number of parameters. furthermore, the criterion is convex on the uncertainty domain of the gobf model coefficients. the uncertainty domain is determined with unknown but bounded error (ubbe) approaches that update polytopes, orthotopes, parallelotopes, ellipsoids or limited complexity polytopes [13, 14, 15]. the optimal poles of the basis functions are estimated using a new pole estimation technique [16, 17]. the paper is organized as follows: in section 2 we present the state space model for the miso system represented on the gobf. the predictor output is expressed in section 3. in section 4, the robust predictive control method is detailed and the main results are developed. simulation examples are given in section 5 and, finally, some conclusions are drawn in section 6.

2 state-space model

this paper considers a miso system having $m$ input sequences $\{u_1(k), u_2(k), \ldots, u_m(k)\}$ and an output sequence $\{y(k)\}$ that are related according to:

$$y(k) = \sum_{j=1}^{m} G_j(q^{-1})\,u_j(k) + e(k) \qquad (1)$$

where $q^{-1}$ is the backward shift operator ($q^{-1}u_j(k) = u_j(k-1)$), the $\{G_j(q^{-1})\}$ describe the unknown system dynamics (assumed stable) and $e(k)$ is the model uncertainty.

copyright © 2006-2007 by ccc publications

the discrete time state-space model for a miso system represented on the gobf is defined by:

$$\begin{cases} x(k+1) = A\,x(k) + B\,u(k)\\ \hat{y}(k) = \theta^T x(k) \end{cases} \qquad (2)$$

where $u(k) \in \mathbb{R}^m$ and $\hat{y}(k)$ are the input signal vector and the model output respectively.
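to fix ideas, simulating a model of the form (2) is a one-line state recursion. the sketch below is ours, with $A$, $B$ and $\theta$ taken as given; the test uses a hypothetical single first-order laguerre-type section with pole $\xi$, purely as an example:

```python
import numpy as np

def simulate_gobf_model(A, B, theta, U):
    """run x(k+1) = A x(k) + B u(k), yhat(k) = theta^T x(k) from x(0) = 0.
    U holds one input vector u(k) per row; returns the model outputs."""
    x = np.zeros(A.shape[0])
    y = np.zeros(len(U))
    for k, u in enumerate(U):
        y[k] = theta @ x          # output before the state update
        x = A @ x + B @ u
    return y
```

with $A = [\xi]$, $B = [\sqrt{1-\xi^2}]$ and a unit step input, the state reproduces the closed-form partial sum $\sqrt{1-\xi^2}\,(1-\xi^k)/(1-\xi)$.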
$x(k)$ is an $N$ dimensional state vector with elements $\{x_{jn}(k)\}$, $j = 1, \ldots, m$, $n = 0, \ldots, n_j$, defined by:

$$x_{jn}(k) = z^{-1}\{B_{jn}(z, \xi_j)\}\,u_j(k) \qquad (3)$$

where $z^{-1}$ denotes the inverse $z$-transform, and $n_j$ and $\xi_j$ are the truncating order and the pole vector, respectively, of the $j$-th network of the gobf. $N = \sum_{j=1}^{m}(n_j + 1)$ is the number of gobf model parameters for the miso system, and the gobf expression is given by:

$$B_{jn}(z) = \frac{\sqrt{1 - |\xi_{jn}|^2}}{z - \xi_{jn}} \prod_{k=0}^{n-1}\left(\frac{1 - \bar{\xi}_{jk}\,z}{z - \xi_{jk}}\right) \qquad (4)$$

where $\xi_{jk}$ and its conjugate $\bar{\xi}_{jk}$ are the poles of the $k$-th filter of the gobf. $\theta \in \mathbb{R}^N$ is the parameter vector. $A$ and $B$ are $(N \times N)$ and $(N \times m)$ dimensional matrices respectively, defined by:

$$A = \mathrm{diag}(A_j)_{j=1,\ldots,m}, \qquad B = \mathrm{diag}(B_j)_{j=1,\ldots,m} \qquad (5)$$

where the $(1+n_j)\times(1+n_j)$ dimensional matrix $A_j$ and the $(1+n_j)$ dimensional vector $B_j$ are given by:

$$A_j(a, b) = \begin{cases} \xi_{j,a-1} & \text{if } a = b,\\ f_j(a, b) & \text{if } a > b,\\ 0 & \text{if } a < b, \end{cases} \qquad (6)$$

$$f_j(a, b) = (-1)^{a+b+1}\,\alpha_{j,a-1}\,(1 - \xi_{j,b-1}\bar{\xi}_{j,b-1}) \prod_{\ell=b+1}^{a-1} \alpha_{j,\ell-1}\,\bar{\xi}_{j,\ell-1} \qquad (7)$$

$$B_j(b) = (-1)^{b+1}\,\alpha_{j,b-1} \prod_{\ell=1}^{b-1} \alpha_{j,\ell-1}\,\bar{\xi}_{j,\ell-1}, \qquad (b = 1, \ldots, n_j + 1) \qquad (8)$$

where we set

$$\alpha_{j\ell} = \frac{\sqrt{1 - |\xi_{j\ell}|^2}}{\sqrt{1 - |\xi_{j,\ell-1}|^2}}, \qquad \alpha_{j0} = \sqrt{1 - |\xi_{j0}|^2}. \qquad (9)$$

3 step-ahead predictor equation

system (2) can be written in incremental form as:

$$\delta x(k+1) = A\,\delta x(k) + B\,\delta u(k) \qquad (10)$$
$$\hat{y}(k) = \hat{y}(k-1) + \theta^T \delta x(k) \qquad (11)$$

where:

$$\delta u(k) = u(k) - u(k-1), \qquad \delta x(k) = x(k) - x(k-1) \qquad (12)$$

when the error on the gobf model is unknown but bounded, the fourier coefficients are defined by uncertainty intervals. equation (11) can then be rewritten as:

$$\hat{y}(k) = \hat{y}(k-1) + \theta^T(\varepsilon)\,\delta x(k) \qquad (13)$$

where $\varepsilon \in \Omega$ is the vector of parameter uncertainties and $\Omega$ the parameter uncertainty domain.
from (13), the $p$-step ahead predictor can be written as:

$$\hat{y}(k+p/k) = \hat{y}(k+p-1/k) + \theta^T(\varepsilon)\,\delta x(k+p), \qquad p \ge 1 \qquad (14)$$

using (10) and successive substitutions, we can write:

$$\delta x(k+p) = A^p\,\delta x(k) + \sum_{q=1}^{p} A^{p-q} B\,\delta u(k+q-1) \qquad (15)$$

thus, by successive substitution of (15) into (14), we finally have:

$$\hat{y}(k+p/k) = \hat{y}(k) + \theta^T(\varepsilon)\,[K_p - I_N]\,\delta x(k) + \theta^T(\varepsilon)\sum_{q=1}^{p} K_{p-q} B\,\delta u(k+q-1) \qquad (16)$$

where $I_N$ is the identity matrix and $K_p$ is an $(N \times N)$ dimensional matrix defined by:

$$K_p = \begin{cases} \sum_{q=0}^{p} A^q & \text{for } p \ge 0\\ 0 & \text{for } p < 0 \end{cases} \qquad (17)$$

the $p$-step ahead predictor can be written as the sum of two components, the free part and the forced part:

$$\hat{y}(k+p/k) = \hat{y}_l(k+p/k) + \hat{y}_f(k+p/k) \qquad (18)$$

with:

$$\hat{y}_l(k+p/k) = \hat{y}(k) + \theta^T(\varepsilon)\,[K_p - I_N]\,\delta x(k) \qquad (19)$$
$$\hat{y}_f(k+p/k) = \theta^T(\varepsilon)\sum_{q=1}^{p} K_{p-q} B\,\delta u(k+q-1) \qquad (20)$$

we denote by $h_1$, $h_2$ and $h_u$ ($h_u < h_2$) the output prediction horizons and the control horizon, respectively, and we assume $h_1 = 1$. on the prediction horizon $[k+1, k+h_2]$, (18) can be written in matrix form as:

$$\hat{Y}(k, \varepsilon) = \hat{Y}_f(k, \varepsilon) + \hat{Y}_l(k, \varepsilon) \qquad (21)$$

where $\hat{Y}(k, \varepsilon)$ is the predictor vector of dimension $h_2$ defined by:

$$\hat{Y}(k, \varepsilon) = \begin{bmatrix} \hat{y}(k+1/k, \varepsilon)\\ \vdots\\ \hat{y}(k+h_2/k, \varepsilon) \end{bmatrix} \qquad (22)$$

the vectors $\hat{Y}_l(k)$ and $\hat{Y}_f(k)$ can be computed using (19) and (20) respectively, for $p = 1, 2, \ldots, h_2$. thus, we can write:

$$\hat{Y}_f(k, \varepsilon) = G(\varepsilon)\,\delta U(k) \qquad (23)$$

where $\delta U(k)$ is the control increment vector of dimension $m h_u$ defined by:

$$\delta U(k) = \begin{bmatrix} \delta u(k)\\ \delta u(k+1)\\ \vdots\\ \delta u(k+h_u-1) \end{bmatrix} \qquad (24)$$

with:

$$\delta u(k+p) = u(k+p) - u(k+p-1) \quad \forall p \in [0, h_u - 1] \qquad (25)$$
$$u(k+p) = \sum_{q=0}^{p} \delta u(k+p-q) + u(k-1) \qquad (26)$$

$G(\varepsilon)$ is an $h_2 \times (m h_u)$ dimensional matrix that collects the impulse response coefficients:

$$G(\varepsilon) = \begin{bmatrix}
g_1(\varepsilon) & 0 & \cdots & 0\\
g_2(\varepsilon) & g_1(\varepsilon) & \cdots & \vdots\\
\vdots & \vdots & \ddots & \vdots\\
g_{h_u}(\varepsilon) & \cdots & \cdots & g_1(\varepsilon)\\
\vdots & \vdots & \ddots & \vdots\\
g_{h_2}(\varepsilon) & \cdots & \cdots & g_{h_2-h_u+1}(\varepsilon)
\end{bmatrix} \qquad (27)$$

where $g_p^T(\varepsilon)$ is a vector of dimension $m$ given by:

$$g_p(\varepsilon) = \theta^T(\varepsilon)\,K_{p-1} B = \sum_{q=1}^{p} \theta^T(\varepsilon)\,A^{q-1} B, \qquad (p = 1, 2, \ldots, h_2) \qquad (28)$$

4 robust predictive control algorithm

4.1 constraints

the constraints result from the uncertainties on the gobf model coefficients and from bounds on the control signals and control increments over the control horizon $h_u$:

$$u_{\min} \le u(k+p) \le u_{\max} \quad \forall p \in [0, h_u - 1] \qquad (29)$$
$$\delta u_{\min} \le \delta u(k+p) \le \delta u_{\max} \quad \forall p \in [0, h_u - 1] \qquad (30)$$

where:

$$u_{\max} = \begin{bmatrix} u_{1\max}\\ \vdots\\ u_{m\max} \end{bmatrix}, \quad u_{\min} = \begin{bmatrix} u_{1\min}\\ \vdots\\ u_{m\min} \end{bmatrix} \qquad (31)$$
$$\delta u_{\max} = \begin{bmatrix} \delta u_{1\max}\\ \vdots\\ \delta u_{m\max} \end{bmatrix}, \quad \delta u_{\min} = \begin{bmatrix} \delta u_{1\min}\\ \vdots\\ \delta u_{m\min} \end{bmatrix} \qquad (32)$$

using (26), (29) and (30), we define the set $\delta\Psi$ of constraints on the control signals as follows:

$$\delta\Psi = \{\delta U \,/\, \Gamma\,\delta U \le V\} \qquad (33)$$

where $\Gamma$ is a $(4 m h_u) \times (m h_u)$ dimensional matrix and $V$ a vector of dimension $4 m h_u$:

$$\Gamma = \begin{bmatrix} I_{m h_u}\\ -I_{m h_u}\\ \Delta\\ -\Delta \end{bmatrix}, \qquad V = \begin{bmatrix} \delta U_{\max}\\ -\delta U_{\min}\\ U_{\max} - \varphi\\ -U_{\min} + \varphi \end{bmatrix} \qquad (34)$$

where $I_{m h_u}$ is the $(m h_u)$ dimensional identity matrix. the matrix $\Delta$ of dimension $(m h_u) \times (m h_u)$ and the vector $\varphi$ of dimension $m h_u$ are given by:

$$\Delta = \begin{bmatrix} 1 & 0 & \cdots & 0\\ 1 & 1 & \ddots & \vdots\\ \vdots & \ddots & \ddots & 0\\ 1 & \cdots & 1 & 1 \end{bmatrix}, \qquad \varphi(k-1) = \begin{bmatrix} u(k-1)\\ \vdots\\ u(k-1) \end{bmatrix} \qquad (35)$$

$U_{\max}$, $U_{\min}$, $\delta U_{\max}$ and $\delta U_{\min}$ are $(m h_u)$ dimensional vectors defined as:

$$U_{\max} = \begin{bmatrix} u_{\max}\\ \vdots\\ u_{\max} \end{bmatrix}, \quad U_{\min} = \begin{bmatrix} u_{\min}\\ \vdots\\ u_{\min} \end{bmatrix} \qquad (36)$$
$$\delta U_{\max} = \begin{bmatrix} \delta u_{\max}\\ \vdots\\ \delta u_{\max} \end{bmatrix}, \quad \delta U_{\min} = \begin{bmatrix} \delta u_{\min}\\ \vdots\\ \delta u_{\min} \end{bmatrix} \qquad (37)$$

4.2 optimization criterion

the robust predictive control algorithm using an uncertainty model is based on a worst case strategy that consists in solving the min-max optimization problem:

$$\min_{\delta U \in \delta\Psi}\ \max_{\varepsilon \in \Omega}\ J(\delta U, \varepsilon) \qquad (38)$$

the quadratic criterion to be minimized is defined by:

$$J(\delta U, \varepsilon) = \sum_{p=1}^{h_2} \left(\hat{y}(k+p) - r(k+p)\right)^2 + \sum_{j=1}^{m}\left\{\sum_{p=0}^{h_u-1} \lambda_j^p\,\delta u_j^2(k+p)\right\} \qquad (39)$$

with:

$$\delta u(k+p) = 0 \quad \text{for } p \ge h_u \qquad (40)$$

where $\lambda_j^p > 0$ ($j = 1, 2, \ldots, m$) is a weighting factor, generally considered constant and equal to $\lambda_j$.
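the prediction matrices can be assembled mechanically from (17) and (27)-(28). the following illustrative sketch is ours, using a nominal $\theta$ with no uncertainty:

```python
import numpy as np

def prediction_matrices(A, B, theta, h2, hu):
    """K_p = sum_{q=0}^p A^q (eq. 17) and the h2 x (m*hu) block-toeplitz
    matrix of coefficients g_p = theta^T K_{p-1} B (eqs. 27-28)."""
    N, m = B.shape
    K = [np.eye(N)]                       # K_0 = I
    for _ in range(1, h2):
        K.append(A @ K[-1] + np.eye(N))   # K_p = A K_{p-1} + I
    g = [theta @ Kp @ B for Kp in K]      # g[p] holds g_{p+1}, an m-vector
    G = np.zeros((h2, m * hu))
    for p in range(h2):                   # lower block-triangular fill
        for q in range(min(p + 1, hu)):
            G[p, q * m:(q + 1) * m] = g[p - q]
    return K, G
```

for a scalar system with $A = 0.5$, $B = 1$, $\theta = 1$, $h_2 = 3$, $h_u = 2$, the coefficients are $g_1 = 1$, $g_2 = 1.5$, $g_3 = 1.75$.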
$r(k+p)$ represents the reference signal defined on the prediction horizon $[k+1, k+h_2]$. the quadratic criterion $J(\delta U, \varepsilon)$ can be written in matrix form as:

$$J(\delta U, \varepsilon) = \left\|\hat{Y}(k, \varepsilon) - R(k)\right\|^2 + \left\|\Lambda^{1/2}\,\delta U(k)\right\|^2 \qquad (41)$$

from (41), we can write:

$$J(\delta U, \varepsilon) = \left(\hat{Y}(k, \varepsilon) - R(k)\right)^T\left(\hat{Y}(k, \varepsilon) - R(k)\right) + \delta U^T(k)\,\Lambda\,\delta U(k) \qquad (42)$$

where $R(k)$ is an $h_2$ dimensional reference vector defined by:

$$R(k) = \begin{bmatrix} r(k+1)\\ \vdots\\ r(k+h_2) \end{bmatrix} \qquad (43)$$

and $\Lambda$ is an $(m h_u \times m h_u)$ dimensional diagonal weighting matrix defined by:

$$\Lambda = \mathrm{diag}(\Lambda_0, \Lambda_1, \ldots, \Lambda_{h_u-1}), \qquad \Lambda_p = \mathrm{diag}(\lambda_1, \lambda_2, \ldots, \lambda_m), \quad p = 0, \ldots, h_u - 1 \qquad (44)$$

using (21), the matrix form (42) can be rewritten as:

$$J(\delta U, \varepsilon) = \delta U^T \Phi(\varepsilon)\,\delta U + 2\rho^T(\varepsilon)\,\delta U + \beta(\varepsilon) \qquad (45)$$

where $\Phi$ is an $(m h_u \times m h_u)$ dimensional positive definite matrix:

$$\Phi(\varepsilon) = G^T(\varepsilon)\,G(\varepsilon) + \Lambda \qquad (46)$$

$\rho$ is a vector of dimension $m h_u$:

$$\rho(\varepsilon) = G^T(\varepsilon)\,\left[\hat{Y}_l(k, \varepsilon) - R(k)\right] \qquad (47)$$

and $\beta$ is a scalar defined as follows:

$$\beta(\varepsilon) = \left[\hat{Y}_l(k, \varepsilon) - R(k)\right]^T \left[\hat{Y}_l(k, \varepsilon) - R(k)\right] \qquad (48)$$

since the criterion is convex over the parameter uncertainty set, the maximization problem over this set can be reduced to a maximization over its vertices. when the parameter set is an ellipsoid, it is approximated by the orthotope containing it. therefore the optimization problem (38) becomes:

$$\min_{\delta U \in \delta\Psi}\ \max_{\varepsilon \in S}\ J(\delta U, \varepsilon) \qquad (49)$$

where $S$ is the set of vertices of the orthotope. the number of constraints is given by:

$$L = 2^N + 4 m h_u \qquad (50)$$

where $2^N$ is the number of vertices of the domain $S$ for the miso system. the rmpc algorithm using a gobf model for a miso system can be summarized as follows.

off-line:
– compute the matrices $A$ and $B$ from (5),
– determine the set of vertices,
– select the parameters $h_2$ and $h_u$,
– select the weighting matrix coefficients,
– compute the matrices $K_p$ ($p = 1, \ldots, h_2$) from (17),
– compute the coefficients $g_p$ ($p = 1, \ldots, h_2$) from (28),
– compute the references.
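since each $J(\cdot, \varepsilon_s)$ in (45) is a convex quadratic, the inner maximum in (49) over the finite vertex set is a pointwise maximum of quadratics. the outer minimization can be sketched as below; this is our illustration only, with simple box constraints on $\delta U$ and a projected subgradient loop standing in for the qp solver one would use in practice:

```python
import numpy as np

def minmax_step(Phis, rhos, betas, lo, hi, iters=4000, lr=0.01):
    """minimize max_s (du^T Phi_s du + 2 rho_s^T du + beta_s), cf. (45),
    over the box lo <= du <= hi, by projected subgradient descent."""
    du = np.zeros(len(lo))
    for _ in range(iters):
        vals = [du @ P @ du + 2 * r @ du + b
                for P, r, b in zip(Phis, rhos, betas)]
        s = int(np.argmax(vals))              # current worst-case vertex
        grad = 2 * (Phis[s] @ du + rhos[s])   # gradient of the active J_s
        du = np.clip(du - lr * grad, lo, hi)  # project onto the box
    return du
```

with a single vertex, $\Phi = I$ and $\rho = (-1, -1)^T$, the unconstrained minimizer $(1, 1)^T$ is recovered; shrinking the box clips the solution accordingly.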
computation at each sampling period:
– compute the free component $\hat{Y}_l(k)$ using (19),
– compute the quadratic criterion using (45),
– determine the control increment vector using (49).

5 simulation examples

in this section we illustrate the utility of the robust predictive control method by presenting some simulation examples. to begin with, suppose we have a miso system with $m = 2$ input sequences and a data record of $h = 300$ points generated by the following model:

$$y(k) = \frac{0.102 z^{-1} - 0.751 z^{-2}}{1 - 0.745 z^{-1}}\,u_1(k) + \frac{-(0.152 z^{-1} + 0.255 z^{-2})}{(1 + 0.7047 z^{-1})(1 - 0.3547 z^{-1})}\,u_2(k) + e(k) \qquad (51)$$

where $u_1(k)$, $u_2(k)$, $y(k)$ and $e(k)$ are the inputs, the output and the model error respectively. the model error is assumed to be bounded such that $|e(k)| \le 4.51$, and the input signals are uniformly distributed sequences. in this simulation we approximate this model by the gobf model whose truncating order and optimal poles are: $n_{opt} = 4$; $\xi_{opt} = (0.7450,\ 0,\ 0.3547,\ -0.7047)$. the process output and the gobf model output are illustrated in figure 1.

[figure 1: process output and gobf model output]

the center and uncertainty intervals (ui) of the ellipsoid are given in table 1. the tuning parameters used in this simulation are: $h_2 = 8$, $h_u = 2$, $\lambda_1 = 1$ and $\lambda_2 = 1$.

table 1: ellipsoid performances
  ellipsoidal center:     -0.6326  -0.9135  -0.2266  -0.1260
  uncertainty intervals:   0.3797   0.9320   0.7085   1.9076

to validate the control method, we plot in figure 2 the gobf model output and the reference signal. the control signals and the control increment signals are illustrated in figures 3 and 4 respectively. the peaks of the control signals, as well as of the control increment signals, are due to the reference signal changing from -40 to +40 at iterations 100 and 200. we notice the rapid convergence of the model output to the reference signal.
this is predictable, since we optimize a tracking criterion. other simulation examples with different gobf models and reference signals have been studied and yielded the same results.

[figure 2: reference signal and gobf model output]
[figure 3: control signals]
[figure 4: control increment signals]

on the other hand, the influence of the error bounds on the gobf model output in the case of an ellipsoid domain is studied by considering 3 different snr (signal to noise ratio) values. table 2 gives the centers and the uncertainty intervals, while figure 5 illustrates the model outputs and a reference signal fixed arbitrarily. this figure shows the similar convergence of the model outputs to the reference signal. thus, we conclude that for different error bounds we obtain the same gobf model output. the control method has been tested with different reference signals and error bounds, and yielded the same results. finally, we study the influence of different uncertainty domains, such as an ellipsoid, an orthotope and a polytope. table 3 gathers the centers and the uncertainty intervals of these domains. the corresponding model outputs are shown in figure 6. examining this figure, we notice that the model outputs converge simultaneously to the reference signal. so we conclude that the type of parameter domain has no influence on this control method. other experiments with different reference signals and parameter domains have been carried out and yielded the same results.
table 2: ellipsoid performances for different error bounds
  snr=5   center: -0.5698  -1.0975  -0.2557  -0.0844   ui: 0.7915  1.9507  1.4634  3.9237
  snr=10  center: -0.6071  -0.9886  -0.2377  -0.1086   ui: 0.5517  1.3550  1.0246  2.7549
  snr=20  center: -0.6326  -0.9135  -0.2266  -0.1260   ui: 0.3797  0.9320  0.7085  1.9076

table 3: domain performances (snr=20)
  ellipsoid  center: -0.6326  -0.9135  -0.2266  -0.1260   ui: 0.3797  0.9320  0.7085  1.9076
  orthotope  center: -0.6950  -0.7551   0.1082  -0.1356   ui: 0.6403  1.6896  1.7069  4.2133
  polytope   center: -0.6924  -0.7556  -0.1968  -0.1754   ui: 0.0236  0.0307  0.0472  0.1095

[figure 5: model outputs for 3 different snr of an ellipsoid domain]
[figure 6: model outputs for different uncertainty domains]

6 conclusion

this paper has presented a new robust predictive control method based on the gobf model for a miso system. a min-max problem is solved, taking into account the uncertainties on the model coefficients and the constraints on the control signals. the uncertainty parameter domain can be an ellipsoid, an orthotope or a polytope, and the performance criterion is optimized with respect to constraints relative to parameter uncertainties and measurement constraints. the implication of these results in the context of system control is that the gobf can be used to deliver state space models suitable for synthesizing a robust predictive control without affecting the computational complexity or the performance of the method. finally, it should be noted that this control method provides good results and may be synthesized for a mimo system represented on the gobf.

bibliography

[1] g. oliveira, g. favier, g. dumont, and w.
Amara, Predictive controller based on Laguerre filters modeling, in Proc. 13th IFAC World Congress, San Francisco, USA, Vol. G, pp. 375-380, 1996.

[2] A. Mbarek, H. Messaoud and G. Favier, Robust predictive control using Kautz model, in Proc. 10th IEEE International Conf. Electronics, Circuits and Systems, pp. 184-187, December 2003.

[3] L.P. Wang, Discrete time model predictive control design using Laguerre functions, Journal of Process Control, 14:131-142, 2004.

[4] E.M. El Adel, M. Ouladsine, and L. Radouane, Predictive steering control using Laguerre series representation, in Proc. 2003 American Control Conf., Vol. 1, pp. 439-445, June 2003.

[5] B. Wahlberg, System identification using Laguerre models, IEEE Trans. on Automatic Control, 36(5):551-562, 1991.

[6] B. Wahlberg, System identification using Kautz models, IEEE Trans. on Automatic Control, 39(6):1276-1282, 1994.

[7] A. Gutierrez and E. Camacho, Robust adaptive control for processes with bounded uncertainties, in Proc. 3rd ECC, Roma, Vol. 2, pp. 1295-1300, 1995.

[8] J. Ghabi, A. Douik and H. Messaoud, A new modelling approach of MIMO linear systems using the generalized orthonormal basis functions, in Proc. of the 6th ISPRA'07, Greece, pp. 192-196, February 2007.

[9] J. Ghabi, A. Douik and H. Messaoud, New methods of modelling and parameter estimation for MIMO linear systems using generalized orthonormal basis functions, Journal on Systems and Control, Vol. 2, pp. 133-140, 2007.

[10] B. Ninness and F. Gustafsson, A unifying construction of orthonormal bases for system identification, IEEE Trans. on Automatic Control, 42(4):515-521, 1997.

[11] P. S. C. Heuberger, P. M. J. Van den Hof, and O. H. Bosgra, A generalized orthonormal basis for linear dynamical systems, IEEE Trans. on Automatic Control, 40(3):451-465, 1995.
[12] P. Van den Hof and B. Ninness, System identification with generalized orthonormal basis functions, in Modelling and Identification with Rational Orthogonal Basis Functions, Springer-Verlag, Chap. 4, pp. 61-102, 2005.

[13] G. Favier and L. Arruda, Review and comparison of ellipsoidal bounding algorithms, in M. Milanese et al. (Eds.), Bounding Approaches to System Identification, Plenum Press, New York, Chap. 4, pp. 43-68, 1996.

[14] H. Messaoud and G. Favier, Recursive determination of parameter uncertainty intervals for linear models with unknown but bounded errors, in Proc. 10th IFAC Symposium on System Identification, Copenhagen, Denmark, pp. 365-370, 1994.

[15] S. Maraoui and H. Messaoud, Design and comparative study of limited complexity bounding error identification algorithms, IFAC Symposium on System Structure and Control, Prague, Czech Republic, pp. 29-31, 2001.

[16] J. Ghabi, A. Douik and H. Messaoud, A new estimation method of the poles for the generalized orthonormal bases filters, in Proc. of the 6th ISPRA'07, Greece, pp. 197-202, February 2007.

[17] J. Ghabi, A. Douik and H. Messaoud, A new technique of poles estimation for generalized orthonormal basis functions, Journal on Systems and Control, Vol. 2, pp. 125-132, 2007.

Ali Douik, Jalel Ghabi and Hassani Messaoud
École Nationale d'Ingénieurs de Monastir (ENIM)
Département de Génie Électrique, Laboratoire ATSI
Rue Ibn El Jazzar, 5019 Monastir, Tunisie
E-mail: ali.douik@enim.rnu.tn, jalel.ghabi@yahoo.fr, hassani.messaoud@enim.rnu.tn

Received: June 30, 2007

Ali Douik was born in Tunis, Tunisia. He received the Master's degree in 1990 and the Ph.D. degree in Electrical Engineering in 1996 from the "École Normale Supérieure de l'Enseignement Technique de Tunis". He is currently "Maître Assistant" at the "École Nationale d'Ingénieurs de Monastir" and director of the "Département de Génie Électrique".
His research is related to signal processing and automatic control.

Jalel Ghabi was born in Kairouan, Tunisia. He received the diploma of graduate engineer in Electrical Engineering from the "École Nationale d'Ingénieurs de Monastir" in 1997. He obtained his Master's degree in Automatic and Industrial Data Processing from the "École Nationale d'Ingénieurs de Sfax" in 2003. He is currently preparing the Ph.D. degree in automatic control in the ATSI laboratory ("Automatique, Traitement de Signal et Imagerie"). His research is related to the robust predictive control of MIMO systems using generalized orthonormal basis functions.

Hassani Messaoud was born in Mahdia, Tunisia. He received the Master's degree from the "École Normale Supérieure de l'Enseignement Technique de Tunis" in 1985 and the Ph.D. degree in automatic control from the "École Centrale de Lille", France, in 1993. In 2001, he received the Habilitation degree from the "École Nationale d'Ingénieurs de Tunis". He is presently Professor at the "École Nationale d'Ingénieurs de Monastir" and director of the ATSI laboratory. He has supervised several Ph.D. theses and is author or co-author of several journal articles. His research is related to automatic control and signal processing.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 2, pp. 148-157

Applying RBF Neural Nets for Position Control of an Inter/Scara Robot

Fernando Passold
Pontifical Catholic University of Valparaíso
College of Electrical Engineering
Avenida Brasil 2147, Valparaíso, Chile
E-mail: fernando.passold@ucv.cl

Abstract: This paper describes experimental results applying artificial neural networks to perform the position control of a real SCARA manipulator robot. The general control strategy consists of a neural controller that operates in parallel with a conventional controller based on the feedback error learning architecture.
The main advantage of this architecture is that it does not require any modification of the previous conventional controller algorithm. MLP and RBF neural networks trained on-line have been used, without requiring any previous knowledge about the system to be controlled. This approach has performed very successfully, with better results obtained with the RBF networks when compared to PID and sliding mode positional controllers.

Keywords: manipulator robots, position-force control, neural networks.

1 Introduction

This paper describes and discusses practical results obtained with the use of computational intelligence techniques, specifically artificial neural networks, applied to the position control of a real manipulator robot. The neural controller proposed in this work was applied to a real SCARA robot installed at the Industrial Automation Laboratory of the Federal University of Santa Catarina, Brazil. The work is the result of a cooperation between the Automation and Mechanical Engineering departments. The robot was manufactured by the Institute of Robotics (IfR) of the ETH (Swiss Federal Institute of Technology, http://www.ifr.mavt.ethz.ch/). Differently from most industrial manipulator robots, this one has an open architecture, which allows the implementation of any type of control law. The main purpose was to evaluate new algorithms for position/force control, since the robot is also equipped with a force sensor. Fig. 1 shows the Inter/Scara robot. Manipulator robots are non-linear, time-varying systems. Conventional controllers, such as PD and PID, are used, among other advanced and robust controllers that require some knowledge about the dynamic model of the system under control. In the case of manipulator robots, it is difficult to obtain some parameters, such as the inertia matrix and the mass centers of the joints, with sufficient accuracy.
Therefore, either adaptive controllers are required to overcome these inaccurate parameters, or control laws based on Lyapunov functions are developed to guarantee some kind of stability [1]. Both approaches require some knowledge about the system. This work explores a computational intelligence technique, namely artificial neural networks, to deal with this situation. In particular, control algorithms based on neural networks and fuzzy logic techniques are considered intelligent control approaches that do not require any previous knowledge about the system to be controlled. The main goal of this work is to explore an effective computational intelligence technique to control this complex system using a structure as simple as possible, with preference for one that requires few changes to the conventional controller already installed in the system, does not overload the main processor, and is robust against disturbances and load variations.

Copyright © 2006-2009 by CCC Publications

[Figure 1: Inter/Scara robot and its coordinate system. Workspace: 18.45 cm ≤ z ≤ 44.55 cm, 25 cm ≤ r ≤ 50 cm.]

2 Neural Controller Types

Artificial neural networks (NNs) have been applied to several cases of control systems, showing special adequacy when we have to deal with complexity, non-linearity or uncertainty [13]. The neural approach is interesting, notably in the cases where: a) mathematical models of the process are poor or do not even exist, and linear models are inadequate to represent the system with sufficient accuracy; b) the system works satisfactorily with an existing controller but its performance deteriorates substantially when high performances (e.g. speeds) are required, so non-linear controllers become necessary.
NNs have proved their ability to approximate arbitrary nonlinear maps, and the availability of methods for adjusting their parameters on the basis of input and output data makes them particularly attractive in identifiers and controllers [2]. Narendra also comments that it seems to be valuable to keep linear and non-linear components working in parallel, where the neural networks represent the non-linear components [2]. He also mentions the brief learning time periods and the increase of accuracy that result from this kind of integration. In the control systems area, a few neural models have proved to be more suitable than others, mainly: 1) multilayer perceptron networks (MLP), and 2) radial basis function networks (RBF). Among several ways to apply NNs in a control scheme, we can cite: (i) inverse identification (requires undesirable off-line training), (ii) reference model structure, (iii) internal model control, (iv) predictive control (uses two NNs, one of them trained off-line to identify the system), (v) optimal control (also requires two nets: the first one is used to quantize the state space of the system and the next acts as a classifier). Katič and Vukobratovič [3] and Morris and Khemaissia [4] discuss two learning architectures that seem to be the most appropriate and promising: a) the feedback-error learning architecture; and b) the adaptive learning architecture. The feedback error learning approach is characterized by the NN inserted in the feedback path to capture the nonlinear characteristics of the system. The ANN weights are tuned on-line with no off-line learning phase and, when compared with the adaptive technique, it does not require any knowledge of the robot dynamics, linearity in the unknown system parameters, or the tedious computation of a regression matrix. Thus, this approach is model-free and represents an improvement over adaptive techniques [5].
2.1 Multilayer Perceptron Net

This network consists of a set of sensory input units (source nodes) that constitute the input layer, one or more hidden layers and an output layer. The input signal propagates through the network in a forward direction, on a layer-by-layer basis [6]. Multilayer perceptrons can be trained in a supervised manner using the back-propagation algorithm. Back-propagation is based on the error-correction learning rule and uses a least-mean-square error algorithm to adjust the connection weights. Error back-propagation learning consists of two stages. First, a forward phase, when the input vector is applied to the sensory nodes of the network and its effect propagates through the network layer by layer until an output set is produced as the current response of the network; during the forward phase the weights of the network are kept unchanged. In the second phase, the backward phase, an error signal is propagated backward through the network against the direction of the synaptic connections, and the weights are adjusted to make the current response of the network move closer to the desired response based on a steepest descent algorithm, or back-propagation weight update rule, also called the generalized delta rule [6].

2.2 Radial Basis Function Net

The basic structure of the RBF network consists of three layers. Different from the MLP networks, the layers here have different tasks. The first layer is passive and only connects the model to the real world. The second layer is the unique hidden layer; it performs a non-linear transformation from the input vector space to the internal vector space. The last layer is the output layer, which transforms the internal vector space into the output vector space in a linear manner. There are several algorithms available to train the network [6, 7, 8]. These two types of neural nets can be universal function approximators [6, 8].
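The two training phases just described can be sketched in a few lines. This is an illustrative minimal implementation, not the controller code of this paper: the layer sizes, the sigmoid hidden activation and the linear output layer are assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class MLP:
    """Minimal one-hidden-layer MLP trained by the generalized delta rule."""

    def __init__(self, n_in, n_hidden, n_out, eta=0.2, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.5, (n_hidden, n_in))
        self.W2 = rng.normal(0.0, 0.5, (n_out, n_hidden))
        self.eta = eta

    def forward(self, x):
        # Forward phase: weights stay fixed while the signal propagates.
        self.x = x
        self.h = sigmoid(self.W1 @ x)
        self.y = self.W2 @ self.h          # linear output layer
        return self.y

    def backward(self, d):
        # Backward phase: the error signal propagates against the synaptic
        # connections (steepest descent on the squared error).
        e_out = self.y - d
        e_hid = (self.W2.T @ e_out) * self.h * (1.0 - self.h)
        self.W2 -= self.eta * np.outer(e_out, self.h)
        self.W1 -= self.eta * np.outer(e_hid, self.x)
        return 0.5 * float(e_out @ e_out)  # current squared error
```

Repeated forward/backward passes on the same sample drive the squared error toward zero, which is all the feedback-error-learning scheme of Section 3 needs from the network side.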
3 Controller Proposed

The controller proposed here uses a conventional PD or PID that performs in parallel with an artificial neural network trained on-line. This kind of architecture for neural controllers is known as feedback error learning, because the net uses the output signal generated by the conventional controller as its own error signal, which is back-propagated for learning purposes [2, 5]. Fig. 2 shows the architecture applied in this case. Both types of NNs tested deal with the same input data vector: x_nn = [q, q̇, q̈_d], where q is the vector of current joint positions; q̇ is the vector of current joint speeds (obtained through numerical differentiation); and q̈_d corresponds to the desired joint accelerations (as in other manipulators, there is no accelerometer available for each joint, so the desired acceleration computed by the path generator was used). These inputs were bounded to their maximum and minimum possible operational values for this robot and then scaled into the neural input range -0.9 to +0.9, but only for the MLP nets.

[Figure 2: Feedback-error-learning neural controller used.]

Note that, different from MLP nets, the RBF nets do not need a scaling procedure, since they can deal directly with raw data, but it was necessary to organize the input data into three different classes: 1) joint positions, 2) joint speeds and 3) joint accelerations (see Fig. 3). The idea behind organizing the data in different classes comes from previous experience with fuzzy inference systems. RBF networks can be compared with fuzzy inference systems [9]. Each class of input data can be understood as a linguistic operation of a fuzzy system.
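The bounding-and-scaling step applied to the MLP inputs can be sketched as follows. The helper name and the example bounds are illustrative, not taken from the paper:

```python
import numpy as np

def scale_input(v, vmin, vmax, lo=-0.9, hi=0.9):
    """Clip each signal to its operational bounds [vmin, vmax], then map
    it linearly into the neural input range [lo, hi] (MLP nets only)."""
    v = np.clip(v, vmin, vmax)
    return lo + (v - vmin) * (hi - lo) / (vmax - vmin)
```

For example, a joint position at its lower bound maps to -0.9, one at its upper bound maps to +0.9, and out-of-range readings are clipped first.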
Each class of data is mapped using m Gaussian functions and can be compared to the m membership functions that would be used in a fuzzy system. Finally, the rules, and the way they are evaluated in a fuzzy system, are performed by the output layer of an RBF network. Each synaptic connection of the output layer can be compared to the fuzzy if-then rules. The overall outputs are derived from the weighted sum done by the output layer [10]. Hence, m Gaussian functions were created to categorize each vector of each class of input data, as can be seen in Fig. 3. It could be argued that the massive amount of data required for the input layer (3 classes × 4 d.o.f. × m Gaussian functions) is a drawback of this approach, but this solution was related to the final application in mind in this case. Motions in the xy plane are done by the first two joints of the robot. Height (z) and final orientation (θ) are performed by the last two joints of this robot, but there is a mechanical coupling between them (a ball-screw-spline system), i.e., changes only in the final orientation of the robot result in a small change in the final height reached by the robot. That represents an extra challenge to develop an effective controller for this kind of robot. The centers x_i of the m desired Gaussian functions are fixed, based on the range (minimal and maximal values) of the input vector. That allows defining the maximum Euclidean distance d_max between the Gaussian centers as d_max = (x_max − x_min) / (m − 1), and then fixing the standard deviation (or spread) of each Gaussian function according to:

  σ = d_max / √(2m)    (1)

The traditional back-propagation algorithm, expanded with a momentum term, was used to adjust the weights of the MLP and RBF networks in real time [6].
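The uniform placement of the centers and the spread of Eq. (1) can be sketched as follows. This is an illustrative reading of the text; the one-dimensional, uniformly spaced layout per input signal is an assumption:

```python
import numpy as np

def gaussian_centers_and_spread(xmin, xmax, m):
    """Place m Gaussian centers uniformly over [xmin, xmax] and fix the
    common spread sigma = d_max / sqrt(2 m), with d_max the distance
    between adjacent centers, as in Eq. (1)."""
    centers = np.linspace(xmin, xmax, m)
    d_max = (xmax - xmin) / (m - 1)
    sigma = d_max / np.sqrt(2.0 * m)
    return centers, sigma

def rbf_activations(x, centers, sigma):
    """Gaussian activations of a scalar input against all m centers."""
    return np.exp(-((x - centers) ** 2) / (2.0 * sigma ** 2))
```

With m = 5 on the range [0, 4], the centers are 0, 1, 2, 3, 4, d_max = 1, and an input sitting exactly on a center activates that Gaussian with value 1.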
The addition of the momentum term to the delta rule traditionally used to update the weights of the net (based on the method of descending the gradient of the error signal) speeds up this algorithm, damps the oscillations arising from the computation of the gradient, and prevents the net from getting trapped in a local (rather than global) minimum of its error surface [6]. Both networks end with 4 neurons, one to evaluate the torque needed to command each joint motor of the robot. The final torque sent to each joint is defined as:

  τ_final = τ_fb + τ_nn    (2)

where τ_fb is the torque evaluated by the conventional feedback controller that performs in parallel with one of these networks. PD and PID controllers working in the joint space have been used. The equation for the PID used is given by:

  τ_fb = B̂(q) [ K_p q̃ + K_i ∫ q̃ dt + K_d q̃̇ ]    (3)

where B̂(q) refers to the (estimated) inertia matrix of the robot; q̃ = q_d − q represents the error between the desired and the actual joint position; q̃̇ refers to the velocity error; K_p is the vector of proportional gains for each joint; K_i refers to the integral gain vector; and K_d is the derivative gain vector for each joint. To get a PD action over the system, the K_i vector was not used (set to zero).

[Figure 3: Structure of the RBF net implemented.]

4 Experimental Results

The proposed controllers were implemented on the Inter/Scara robot (Fig. 1). The first two links of the Inter/Scara robot act like an xy planar robot; each one is 0.5 meters long, and its mechanical transmissions use harmonic drives (HD) to reduce motor-joint friction to a minimal level.
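The delta rule expanded with a momentum term can be sketched as below. This is the generic textbook form, not the authors' code; the default η and α are the MLP values reported later in Section 4:

```python
import numpy as np

def momentum_update(w, grad, velocity, eta=0.035, alpha=0.5):
    """Generalized delta rule with momentum: the new step adds a fraction
    alpha of the previous weight change to the plain gradient step, which
    speeds up descent and damps gradient oscillations."""
    velocity = alpha * velocity - eta * grad
    return w + velocity, velocity
```

When the gradient keeps the same sign, successive steps accumulate (accelerating descent); when it alternates, the momentum term averages the oscillation out.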
The last two joints use a ball-screw-spline mechanical scheme that allows movement in the z direction and the definition of the final orientation of this robot (angle θ). This robot can be manipulated in the z direction between 18.45 cm and 44.55 cm. Tab. 1 shows the Denavit-Hartenberg parameters, among others, of this robot.

Table 1: Denavit-Hartenberg parameters of the Inter/Scara robot.

  Joint   α_i      a_i    d_i    q_i    m_li      τ_max_i
  0       0        250    665    q0     ≅ 6.3     333.0
  1       180°     250    0      q1     ≅ 19.5    157.0
  2       0        0      q2     0      *         877.0
  3       0        0      0      q3     *         16.7
        (degrees)  (mm)   (mm)          (kg)      (Nm)

  Note: m_l1 includes the last 2 joints (*).

The Inter/Scara uses the XO/2 real-time object-oriented operating system (www.xo2.org) developed by the Institute of Informatics of the Swiss Federal Institute of Technology (ETH). It includes a high-level object-oriented programming language called Oberon, a kind of successor of Modula-2 and Pascal (see http://www.oberon.ethz.ch). The robot runs an XO/2 version on a 200 MHz PowerPC equipped with 16 MBytes of memory, which communicates with its I/O devices using an industrial VME bus (67 MHz). The user interface is done through a TCP/IP connection with a PC running the Oberon System 3 for Windows Win32 2.3 (http://www.oberon.ethz.ch/) over Windows 2000 Pro. The user develops the whole control system of the robot (including the text command interface with the user, initialization and security functions) and, through a cross compiler, the executable program is downloaded to the CPU of this robot (up to 4 MBytes of code). The algorithms for the proposed neural controllers were implemented as a real-time task running at a sampling rate of 1 millisecond. Every millisecond, the action of the conventional controller and the forward phase of the neural net are evaluated, and the backward phase as well whenever the trajectory error q̃ > 0.0001 (rad | m). The controller was tested over the trajectory shown in Fig.
4. Table 2 shows the joint positions, speeds and accelerations developed for each joint. All four joints were moved simultaneously.

[Figure 4: Trajectory used for the tests, from point A to point B (step = 0.246 s): (a) xyθ view, (b) xz view.]

The PD/PID controllers were digitally implemented using a 1 ms sampling rate, and an anti-windup scheme was introduced to limit the integrative values to 10000. Table 3 shows the parameters used for the tested PD/PID controllers. Their gains were initially set based on Ziegler-Nichols parameters and then fine-tuned using a trial-and-error method. Fig. 5 shows the output torques developed by the different controllers for joint 3 (the last and fastest). Note the different performances developed by the PD+MLP1c (MLP with one single hidden layer), PD+MLP2c (MLP with two hidden layers), PD+RBF(5) (RBF with 5 Gaussian functions) and PID+RBF controllers tested.

Table 2: Parameters of the trajectory used for the tests.

  Parameter   Joint 0    Joint 1    Joint 2    Joint 3
  q_a          2.45       -1.85      -0.25      -1.57
  q_b          1.25        0.75      -0.46       0.00
              [rad]       [rad]      [m]        [rad]
  q̇_max       -0.71        1.19      -0.22       0.85
              [rad/s]     [rad/s]    [m/s]      [rad/s]
  q̈_max        0.85        1.00       0.47      -0.93
              [rad/s²]    [rad/s²]   [m/s²]     [rad/s²]

Table 3: PD/PID gains used.

         Joint 0   Joint 1   Joint 2   Joint 3
  K_p     4900     12100     90000     14400
  K_d      140       220       600       240
  K_i      478      1200      9200      1410

Fig. 6 shows the trajectory error over the whole run. During the initial one-third of the robot configuration change period, the NNs are in their learning time and the conventional controller still predominates in the joint control. But even before the end of this period, it can be seen that the NN output torque takes predominance over the final torque evaluated (Fig. 6).
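The discrete joint-space PID with the anti-windup clamp described above can be sketched as follows. This is an illustrative reading: the inertia-matrix factor B̂(q) of Eq. (3) is omitted (taken as identity), and only the gain values, the 1 ms sampling rate and the 10000 integral limit come from the text:

```python
import numpy as np

# Gains from Table 3 (joints 0..3).
KP = np.array([4900.0, 12100.0, 90000.0, 14400.0])
KD = np.array([140.0, 220.0, 600.0, 240.0])
KI = np.array([478.0, 1200.0, 9200.0, 1410.0])

TS = 0.001        # 1 ms sampling period
I_LIMIT = 10000.0 # anti-windup bound on the integral state

class JointPID:
    """Discrete PID of Eq. (3) with B_hat(q) = I, plus anti-windup."""

    def __init__(self):
        self.integral = np.zeros(4)

    def step(self, q_err, qdot_err):
        # Accumulate the integral of the position error, clamped so it
        # cannot wind up beyond +/- I_LIMIT.
        self.integral = np.clip(self.integral + q_err * TS,
                                -I_LIMIT, I_LIMIT)
        return KP * q_err + KI * self.integral + KD * qdot_err
```

Setting KI to zero recovers the PD action used in some of the tests; in the feedback-error-learning scheme, the torque returned here is τ_fb, to which the network output τ_nn is added.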
It can be seen that the NN learns the dynamic behavior of the system and then performs the dynamic compensation that results in higher performance compared to a conventional controller. The best results for the MLP nets were achieved with learning rate η = 0.035 and momentum term α = 0.5. For the RBF nets, the best learning parameters found were η = 0.005 and α = 0.5. Note that a PID performing with an RBF net achieves the best performance (Fig. 5(d) and 5(e)), followed by the PD+RBF, PD+MLP and, finally, the single PD. It was noted that the use of two hidden layers for the MLP NN does not imply a better performance and, moreover, adds a small residual memory effect in the presence of a disturbance on the system (this behavior was observed in tests where an elastic string was placed in the middle of a linear trajectory). Experimental results have also demonstrated that the addition of more than 5 Gaussian functions in the RBF NN controller only slightly increases the performance, at the expense of a significantly higher computational cost.

5 Conclusions

This paper has presented a practical and successful application of a neural controller performing in parallel with a conventional controller for the position and trajectory-following control of a real robot. The use of a conventional controller performing in parallel with the NNs is advantageous to maintain the robustness of the system when the NN becomes saturated (due to high learning rates), and it is important to force the readjustment of the synaptic weights of the NN when the robot changes its configuration. As soon as the NN captures the dynamic behavior of the system, the final torque is given almost totally by the NN and a higher performance can be achieved. Both the MLP and RBF ANNs perform very well,
[Figure 5: Output torques for joint 3: (a) PD, (b) PD+MLP1c, (c) PD+MLP2c, (d) PD+RBF(5), (e) PID+RBF(5) controllers.]

[Figure 6: Joint 3 tracking error for the PD, PD+MLP2c, PD+RBF and PID+RBF controllers.]

but RBF does it better and faster than an MLP, mainly when it works in parallel with a PID controller. An additional and unexpected advantage was achieved with the PID+RBF controller: robot motion with the lowest noise levels (quite silent). On the other hand, there is also a drawback: the NN requires more processing power to work in parallel with the conventional controller; see Table 4 and Fig. 7, which summarize the results achieved by the different controllers.

Table 4: Computer processing resources required.

  Controller      Min    Max
  PD              104    104
  PD + MLP2c      194    385
  PD + RBF(5)     333    579
  PD + RBF(7)     425    679
  PD + RBF(9)     104    809

  Note: processing times expressed in microseconds (µs).

Even if computer resources are short, a simple PD+MLP with one hidden layer allows better results than a PD controller. Otherwise, if higher processing power is available, a PID+RBF achieves the best results.
Since the neural controllers proposed here have performed very successfully, future directions of this work intend to establish an integrated position/force control over a hybrid control architecture, to deal with robot applications that imply some contact with the environment.

[Figure 7: Relationship between computational cost and benefit of the controllers tested (log(MSE) vs. FLOPS, for trajectories "A" and "B" and a test with disturbance).]

Bibliography

[1] Sciavicco, L. and Siciliano, B., Modeling and Control of Robot Manipulators, McGraw-Hill, 1996.

[2] Narendra, K.S., Neural networks for real-time control, in 36th IEEE Conference on Decision and Control CDC'97, 1026-1031, San Diego, California, USA, 1997.

[3] Katič, D. and Vukobratovič, M., Connectionist based robot control: an overview, in 13th IFAC, Volume 1b-05 6, 169-174, San Francisco, USA, 1996.

[4] Morris, A.S. and Khemaissia, S., Artificial neural network based intelligent robot dynamic control, in A.M.S. Zalzala and A.S. Morris (Eds.), Neural Networks for Robotic Control - Theory and Applications, Chapter 2, 26-63, Ellis Horwood, Great Britain, 1996.

[5] Kim, Y.H. and Lewis, F.L., Neural network output feedback control of robot manipulators, IEEE Transactions on Robotics and Automation, 15(2), 301-309, 1999.

[6] Haykin, S., Neural Networks: A Comprehensive Foundation, Prentice Hall, New Jersey, USA, 2nd edition, 1999.

[7] Gabrijel, I. and Dobnikar, A., Adaptive RBF neural network, in SOCO'97 Conference, 164-170, Nimes, France, 1997. URL http://cherry.fer.uni-lj.si:80/~gabriel/soco97/soco97.zip

[8] Girosi, F.
and Poggio, T., Networks and the best approximation property, in M.M. Gupta and D.H. Rao (Eds.), Neuro-Control Systems, Theory and Applications, 257-264, IEEE Press, Piscataway, New Jersey, USA, 1993.

[9] Fritzke, B., Incremental neuro-fuzzy systems, in Applications of Soft Computing, SPIE International Symposium on Optical Science, Engineering and Instrumentation, San Diego, 1997.

[10] Kiguchi, K. and Fukuda, T., Intelligent position/force controller for industrial robot manipulators - application of fuzzy neural networks, IEEE Transactions on Industrial Electronics, 44(6), 753-761, 1997.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 4, pp. 374-383

Computation Results of Finding All Efficient Points in Multiobjective Combinatorial Optimization

Milan Stanojević, Mirko Vujošević, Bogdana Stanojević

Abstract: The number of efficient points in the criteria space of multiple objective combinatorial optimization problems is considered in this paper. It is concluded that, under certain assumptions, that number grows polynomially, although the number of Pareto optimal solutions grows exponentially with the problem size. In order to perform experiments, an original algorithm for obtaining all efficient points was formulated and implemented for three classical multiobjective combinatorial optimization problems. Experimental results with the shortest path problem, the Steiner tree problem on graphs and the traveling salesman problem show that the number of efficient points is much lower than a polynomial upper bound.

Keywords: multiple objective optimization, combinatorial optimization, complexity of computation.

1 Introduction

For all combinatorial problems, the cardinality of the feasible solution set grows exponentially with the problem size. For one group of combinatorial problems (e.g. the shortest path problem, the shortest spanning tree problem, the assignment problem etc.)
algorithms that can find a solution of a single-criterion problem in polynomial time are known. This problem class is denoted by P. For other combinatorial problems (e.g. the traveling salesman problem, the Steiner tree problem, the knapsack problem etc.), so-called non-deterministic polynomial algorithms exist. These problems belong to the class denoted by NP. For this problem class it has not been proved whether polynomial algorithms exist or not. There is a third class of problems (e.g. finding all spanning trees of a given graph) for which it is known that they can be solved only by exponential algorithms. In such problems, the result usually consists of an exponential amount of data, so exponential time is needed just to represent it. Let us denote this class by E. In the literature concerning multiobjective combinatorial optimization (MOCO), mostly the set of Pareto optimal solutions is observed, and it has been stated that its size can grow exponentially with the problem size. Moreover, with a sufficient number of uncorrelated criteria it is possible that every feasible solution is Pareto optimal [1]. This implies that there is no efficient method of determining all Pareto optimal solutions for problems of bigger dimensions, because such a procedure would not even be NP-hard, but strictly exponential, i.e. it would belong to class E. Such results are confirmed for many known problems, such as the shortest spanning tree, the shortest path problem, the traveling salesman problem, the assignment problem and the knapsack problem [2, 3, 4, 5, 7]. In Section 2, some multiobjective optimization models are introduced. Section 3 presents the implemented algorithm, which is applied in order to find all efficient points of multiobjective combinatorial optimization problems. Experimental results described in Section 4 show that the number of efficient points is even much lower than a polynomial upper bound. In Section 5, some concluding remarks are formulated.
2 Multiobjective Optimization Models

The general model of a multiobjective optimization problem can be briefly formulated as follows:

  min_{x ∈ X} f(x)   (1)

where x is the decision variable, X is the feasible solution set, and f = (f_1, ..., f_p) is the p-dimensional vector of objective functions.

A solution x* ∈ X of problem (1) is Pareto optimal if there is no other solution x ∈ X such that f_k(x) ≤ f_k(x*) for all k = 1, ..., p, where at least one of these inequalities is strict. If x* is a Pareto optimal solution of problem (1), the point y* = f(x*) is called an efficient point in the criteria space. The set of all Pareto optimal solutions is called the Pareto set and is denoted by X_Par. The set of all efficient points in the criteria space is called the efficient point set and is denoted by Y_eff.

A solution obtained by minimizing the objective function f_k(x) over X is called the marginal solution for the k-th criterion of problem (1). A marginal solution which is Pareto optimal is called a Pareto optimal marginal solution. If x is a Pareto optimal marginal solution, then f(x) is called an efficient marginal point.

Remark 1. For each criterion k = 1, ..., p at least one Pareto optimal marginal solution exists. It can be obtained by lexicographic optimization, giving the k-th criterion the first priority.

All notions regarding problem (1) apply to the following models, which are defined as its particular cases. The model of a multiobjective combinatorial optimization problem can be formulated as follows:

  min_{x ∈ χ} f(x)   (2)

where x is the decision variable, χ is the feasible solution set, which is a subset of the power set of a finite set E (i.e. χ ⊂ ℘(E), E = {e_1, ..., e_m}), and f = (f_1, ..., f_p) is the p-dimensional vector of objective functions.
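The Pareto optimality definition above can be checked mechanically. The following is a minimal Python sketch (the function names are illustrative, not from the paper) that filters the criteria-space image of a problem down to its efficient points:

```python
def dominates(y, z):
    """True when point y Pareto-dominates point z (all criteria are
    minimized): y is no worse in every criterion and strictly better
    in at least one."""
    return all(a <= b for a, b in zip(y, z)) and any(a < b for a, b in zip(y, z))

def efficient_points(image):
    """Keep exactly those points of the criteria-space image that no
    other point dominates, i.e. the set Y_eff."""
    return [y for y in image if not any(dominates(z, y) for z in image)]
```

For instance, `efficient_points([(1, 3), (2, 2), (3, 1), (2, 3)])` keeps the first three points and discards the dominated `(2, 3)`.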
Moreover, x ∈ χ can be represented by a binary m-dimensional variable (x_1, x_2, ..., x_m) ∈ {0,1}^m, where x_k = 1 if and only if the corresponding element e_k ∈ E belongs to x, and x_k = 0 otherwise.

The experiments related to the number of efficient points of multiobjective optimization problems were done on three types of multiple objective network problems: the multiobjective shortest path problem (SPP), the multiobjective Steiner tree problem on graphs (STP) and the multiobjective traveling salesman problem (TSP). For all three problems, an undirected graph G = (V, E) with |V| = v vertices, |E| = m edges and weight functions on the edges w_k : E → Z+, k = 1, 2, ..., p, is given. In addition, for SPP a starting vertex s ∈ V and a target vertex t ∈ V, and for STP a set of terminal vertices T ⊂ V, are given.

For each of the problems, each feasible solution represents a specific graph structure. For SPP it is a path from s to t, and the feasible set χ is the set of all such paths. For STP, χ is the set of all Steiner trees of graph G with terminal nodes T. For TSP, χ is the set of all Hamiltonian cycles. For any of the problems, a feasible solution x ∈ χ is a set of edges that belong to a feasible graph structure. Alternatively, a feasible solution is a vector (x_1, x_2, ..., x_m) ∈ {0,1}^m that satisfies a set of constraints specific to each problem.

The goal functions in each of the above problems can take two forms, depending on the type of the k-th criterion:

• When a criterion represents length or weight, the corresponding goal function is linear,

  f_k(x) = Σ_{j=1}^{m} c_j^k x_j   (3)

and it is minimized.

• When a criterion represents capacity, the corresponding goal function is

  f_k(x) = min_{1≤j≤m} c_j^k x_j   (4)

and it is maximized.
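As an illustration of the two goal-function types, assuming a solution is given by its incidence vector and reading the capacity in (4) as the smallest weight among the selected elements (the helper names below are hypothetical):

```python
def minisum(c, x):
    """Type (3): the sum of the weights c_j of the elements with
    x_j = 1; this objective is minimized."""
    return sum(cj for cj, xj in zip(c, x) if xj)

def maximin(c, x):
    """Type (4): the capacity of the solution, i.e. the smallest
    weight among the selected elements; this objective is maximized."""
    return min(cj for cj, xj in zip(c, x) if xj)
```

For example, with weights c = [3, 5, 2] and incidence vector x = [1, 0, 1], the minisum value is 5 and the maximin value is 2; negating the weights turns the maximin objective into a minimization, as in transformation (5).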
Problems of minimizing a function of type (3) are called minisum problems, while problems of maximizing a function of type (4) are called maximin or bottleneck problems.

The coefficients c_j^k, j = 1, ..., m, k = 1, ..., p, are in the general case real numbers that represent the length, height, weight or capacity of the elements of set E. In our experiments it is presumed that the coefficients have integer values. Practically, this does not mean a loss of generality, because in real-life problems the coefficients are rational numbers which can be transformed to integers. In one multiobjective combinatorial optimization problem both kinds of goal function can coexist. If functions of type (4) exist, they can be transformed to

  f_k(x) = max_{1≤j≤m} (−c_j^k x_j)   (5)

which has to be minimized, in order to match the general formulation (2).

3 Applied Algorithm

In order to find all efficient points of the mentioned problems, an original algorithm was formulated and implemented. The algorithm is based on the ε-constraint method, which is usually used in the a priori approach (when the decision maker's preferences about the criteria are known before optimization). Our method is developed for the a posteriori approach (when optimization is done in order to present information about nondominated solutions to the decision maker).

Figure 1: Sketch of applying the algorithm for finding all efficient points of a combinatorial optimization problem with two objective functions min{(f_1(x), f_2(x)) | x ∈ X}. (The figure shows points A, B, C, D in the (f_1, f_2) plane and the ε bound on f_1.)

The following algorithm was applied.

Algorithm 2. Input: parameters of the problem.

1. For both criteria find the efficient marginal points f^k = (f_1^k, f_2^k), k = 1, 2.

2. Identify the index s such that |f_s^1 − f_s^2| = min{|f_1^1 − f_1^2|, |f_2^1 − f_2^2|}. Use f_s as the search criterion and the other one, denoted by f_o, as the optimization criterion.
3. Initialize Y_eff = {(f_1^1, f_2^1), (f_1^2, f_2^2)} and ε_s = f_s^o − 1.

4. Solve the problem min{f_o(x) | x ∈ X, f_s(x) ≤ ε_s} and denote by x* its solution.

5. Y_eff = Y_eff ∪ {(f_1(x*), f_2(x*))}.

6. Put ε_s = f_s(x*) − 1. If ε_s > f_s^s then go to step 4, otherwise stop.

Output: the set Y_eff.

An example of a part of the criteria space and the efficient points obtained by Algorithm 2 is graphically represented in Figure 1. Points A and B represent the efficient marginal points (f_1^1, f_2^1) and (f_1^2, f_2^2) of the problem. They were obtained by independent optimization of criteria f_1 and f_2 respectively, and they are efficient points of the multiobjective problem. In this example f_1 is the search criterion and f_2 is the optimization criterion. Points C and D are also efficient points of the problem; they were obtained in two successive iterations. Efficient point C is obtained in the first iteration. In the next iteration, point D is obtained by optimizing the second objective function over the initial feasible set reduced by the additional constraint f_1(x) ≤ ε_1.

Remark 3. Algorithm 2 can be extended to problems with more than two criteria by letting one criterion be the optimization criterion and all others be search criteria. The efficiency of such a procedure is questionable, because all search criteria have to be checked recursively, which implies time and resource consumption problems. Also, the efficient marginal points can be difficult to determine in this case.

4 Experiments

Problem SPP, in its single objective version, belongs to the class P, and its multiobjective version represents one of the most studied problems of MOCO. Problem STP in its single objective version with a minisum criterion belongs to the class NP, but if the objective function is of maximin type, it belongs to the class P [9]. The single objective TSP, with any type of objective function (minisum or maximin), is an NP-hard problem.
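Algorithm 2 can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the single-objective subproblems are solved by brute force over an explicitly listed criteria-space image, integer objective values are assumed (so that ε_s can be decreased by 1), and all names are illustrative.

```python
def lex_min(points, o, s, eps):
    """Minimize criterion o over the points whose s-th value is at most
    eps; ties are broken on criterion s, so the returned point is
    efficient.  Returns None when the constrained problem is infeasible."""
    feasible = [p for p in points if p[s] <= eps]
    if not feasible:
        return None
    best = min(p[o] for p in feasible)
    return min((p for p in feasible if p[o] == best), key=lambda p: p[s])

def all_efficient_points(points):
    """Enumerate all efficient points of a bi-objective (minimization)
    problem whose criteria-space image is the list `points` of integer
    pairs (f1, f2), following the steps of Algorithm 2."""
    inf = float("inf")
    # Step 1: efficient marginal points f^1 and f^2 (lexicographic optima).
    f1 = lex_min(points, 0, 1, inf)
    f2 = lex_min(points, 1, 0, inf)
    # Step 2: the search criterion s is the one with the smaller span.
    s = 0 if abs(f1[0] - f2[0]) <= abs(f1[1] - f2[1]) else 1
    o = 1 - s
    # Step 3: initialize Y_eff and the first epsilon bound.
    eff = {f1, f2}
    eps = (f2 if s == 0 else f1)[s] - 1   # eps_s = f_s^o - 1
    stop = (f1 if s == 0 else f2)[s]      # f_s^s, the minimum of f_s
    # Steps 4-6: tighten the constraint until the marginal value is reached.
    while eps > stop:
        x = lex_min(points, o, s, eps)
        if x is None:
            break
        eff.add(x)
        eps = x[s] - 1
    return sorted(eff)
```

For instance, `all_efficient_points([(1, 5), (2, 4), (3, 3), (4, 2), (5, 1), (3, 5), (5, 5)])` returns the five nondominated pairs from `(1, 5)` through `(5, 1)`.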
4.1 Description of the Developed Computer Programs

For each of the problems a specific computer program was developed by the authors. Each program determines all efficient points (and one solution for each of them) using Algorithm 2 adapted to the corresponding problem. The programs for SPP and STP support instances with both kinds of objective functions, minisum and maximin. The program for TSP supports only instances with minisum criteria.

For all the experiments, random instances with certain characteristics were generated. All instances had two non-correlated criteria. Each experimental result is obtained as an average over 10 randomly generated instances with the same characteristics.

For SPP, the instances had a specific structure in order to ensure that paths have at least √m edges. Graph density varied between 36% and 60%, depending on the number of vertices; instances with a smaller number of vertices had higher density. Instances for the STP problem were generated so that each vertex has a certain probability of being connected to any of the other vertices. Both too dense and too sparse graphs would be improper for this kind of problem. The average vertex degree was 7 for all dimensions, i.e. the average density was 35% for graphs with 20 vertices and 14% for graphs with 50 vertices. The number of terminals was 5 for all instances. TSP instances were generated in a similar way as STP, but the density was higher, 50% for all graph dimensions.

4.2 Example

In order to illustrate the procedure of performing experiments, a short example of finding all efficient points of a small STP instance is given below. The instance has the following characteristics: an undirected graph with 10 vertices and 20 edges (consequently, graph density is 44% and the average vertex degree is 4), 5 terminal nodes, and two criteria of minisum type. Edge weights were generated randomly from the interval [10, 99].
The criteria were not correlated. The upper triangular matrix of edge weights (first criterion / second criterion) is given below, row by row; only the nonzero entries of each row are listed:

  0 98/95 45/17 92/12 41/45
  0 67/58 12/89 32/47
  0 65/98 22/61 71/67 19/32
  0 33/49 83/32
  0 18/70 11/93 51/47 10/53
  0 44/51
  0 12/29
  0
  0 15/25
  0

The results obtained by the developed program using Algorithm 2 are presented in Table 1.

Table 1: Output results

  No.  First criterion  Second criterion  Remarks
  1    130              286               supported point
  2    174              276
  3    176              236               supported point
  4    199              234
  5    232              231
  6    236              200               supported point
  7    275              186               supported point

Seven efficient points were obtained; their coordinates in the criteria space are given in columns 2 and 3 of Table 1. For each efficient point it is also determined whether it is a supported or a non-supported point (column 4 of Table 1).

4.3 Types of the Performed Experiments

Three groups of experiments were performed.

The first group was inspired by the supposition that the upper bound for the number of efficient points depends on the length of the intervals from which the edge lengths take integer values. Three such intervals were defined: I1 = [10, 99], I2 = [100, 999] and I3 = [1000, 9999] (the sets of two-, three- and four-digit numbers), containing 90, 900 and 9000 integer values, respectively. Instances were generated so that edge lengths take random values from a certain interval, independently for each criterion. All combinations of the intervals for the first (C1) and second (C2) criterion were checked. All experiments in this group were performed on graphs with 20 vertices. Both criteria were of minisum type. For all problems, SPP, STP and TSP, two values were observed: the upper bound (UB) and the actual number of efficient points (Eff).
The upper bound was calculated by the formula

  UB = min{f_1^2 − f_1^1, f_2^1 − f_2^2} + 1   (6)

where (f_1^k, f_2^k), k = 1, 2, are the efficient marginal points. Both UB and Eff are calculated as averages over 10 randomly generated instances.

In paper [6] a different upper bound was also considered. Although that upper bound grows polynomially with the problem size, it is rougher than UB; on the other hand, only the instance parameters are needed for its calculation. The upper bound UB presented in this paper is far more precise, but in order to obtain it, p^2 optimizations per instance must be performed to obtain the efficient marginal points. The upper bound and the actual number of efficient points were analyzed and compared in order to get an idea about the orders of magnitude and the relation between UB and Eff. The results of this group of experiments are given in Table 2.

Table 2: Dependence of the upper bound and the number of efficient points on the range of edge lengths for SPP, STP and TSP

  Prob:        SPP           STP           TSP
  C1   C2     UB     Eff    UB     Eff    UB     Eff
  I1   I1     136    9.6    133    7.5    483    40.5
  I1   I2     147    6.6    172    8.9    546    44.1
  I1   I3     166    7.6    159    7.9    498    39.3
  I2   I2     1027   6.5    905    8.7    4951   44.0
  I2   I3     1870   9.8    1804   9.3    5489   45.1
  I3   I3     13967  8.5    13461  8.0    47606  39.1
  Average            8.1           8.4           42.0

The second group of experiments was performed in order to check the dependence of the number of efficient points on the graph size. Because of the exponential complexity of the algorithms for finding all efficient points for all three problem types when the objective functions are of minisum type, the experiments were performed on instances with up to 50 vertices. During the experiments it was concluded that the number of efficient points for STP, with both criteria types minisum and maximin, depends significantly more on the number of terminal vertices than on the total number of vertices in the graph.
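The upper bound (6) used throughout the tables can be computed directly from the two efficient marginal points; a small sketch (the function name is illustrative):

```python
def upper_bound(f1, f2):
    """Upper bound (6) on the number of efficient points of a
    bi-objective problem with integer criteria, where f1 = (f_1^1, f_2^1)
    and f2 = (f_1^2, f_2^2) are the efficient marginal points."""
    return min(f2[0] - f1[0], f1[1] - f2[1]) + 1
```

For the example instance of Table 1, with marginal points (130, 286) and (275, 186), the bound is min(145, 100) + 1 = 101, while only 7 efficient points actually exist.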
Because of that, instances of problems SPP and TSP with minisum criteria were compared separately. The experimental results for this group are presented in Table 3.

In order to demonstrate the "independence" of the number of efficient points from the graph size for STP, some experimental results are given for maximin criteria. For this problem the algorithm is polynomial, so instances with a much bigger number of vertices could be solved. These results are given in Table 4.

The third group of experiments considered the types of criteria. Three combinations were observed: both criteria of minisum type (s/s), the first criterion of maximin type and the second of minisum type (m/s), and both criteria of maximin type (m/m). The TSP problem was excluded from these experiments because the available software did not support solving TSP with a maximin criterion. Instances with both 20 and 50 vertices were observed.

Table 3: Dependence of the number of efficient points on the graph size for SPP and TSP

  Problem     SPP           TSP
  v          UB     Eff    UB     Eff
  10         622    3.5    771    4.7
  20         1096   8.3    4809   43.9
  30         2158   15.0   8365   105.5
  40         2333   19.1   14116  223.5
  50         2371   16.8   18521  373.9

Table 4: Dependence of the number of efficient points on the graph size for maximin STP

  Intervals:  I1           I2           I3           Average for size
  v          UB    Eff    UB    Eff    UB    Eff    Eff
  50         35    5.1    224   3.9    3928  7.1    5.4
  100        33    5.7    235   3.6    4269  7.2    5.5
  200        24    4.0    267   5.5    3233  5.0    4.8
  500        26    4.3    181   3.1    2357  3.7    3.7
  1000       13    2.3    153   4.1    1858  3.9    3.4
  2000       14    3.6    161   4.9    1830  4.1    4.2
  Average for interval  4.2        4.2        5.2    4.5

The results of the third group of experiments are presented in Table 5.
Table 5: Dependence of the number of efficient points on the type of criteria for SPP and STP

  Criteria type:   s/s          m/s          m/m
  Problem   v     UB     Eff   UB     Eff   UB     Eff
  SPP       20    2886   8.1   938    5.4   309    2.6
  SPP       50    5689   20.0  1141   12.8  739    5.4
  STP       20    2772   8.4   908    7.8   667    5.5
  STP       50    2815   8.9   818    9.6   674    5.8

5 Conclusion

The following conclusions are based on the results presented in Tables 2, 3 and 5.

Although the problems SPP, STP and TSP have very different natures, their numbers of efficient points show very similar characteristics. The most interesting values are those in the Eff columns: first, they are surprisingly small, and second, the influence of the observed parameters on them is very low or even insignificant.

Observing Table 2, it is obvious and expected that the upper bound UB grows with the size of the intervals from which the edges take their values. It is very unexpected that the actual number of efficient points does not show any dependence on the size of the intervals, for all three problems. It is also obvious that TSP has about five times more efficient points than SPP and STP, which have similar numbers. The explanation is that, for the instances we used, TSP solutions contain more edges than the solutions of SPP and STP. Consequently, the distance between the two efficient marginal points is bigger, so UB is also bigger, and a bigger number of efficient points can be expected between them.

Observing the results in Table 3, a small deviation can be noticed for SPP between graphs with 40 and 50 vertices: the number of efficient points starts to decrease at this size. However, we concluded that the deviation is accidental, especially because in the next group of experiments, on different instances with the same characteristics (SPP, 50 vertices, s/s), the value 20.0 was obtained (shown in Table 5), which matches the value expected from Table 3.
In Table 3 the number of efficient points is bigger for TSP and, moreover, it grows faster than for SPP. This is in accordance with the previous explanation, because the number of edges in a TSP solution grows linearly with the number of vertices, while for SPP it is approximately √m.

Finally, the results in Table 5 show that the number of efficient points is smaller for the maximin than for the minisum type of criteria, i.e. the more criteria are of maximin type, the smaller the number of efficient points. A slight deviation from that rule appears in the last row, where the m/s combination of criteria types yields a bigger number of efficient points than s/s; still, we consider this an accidental deviation. As mentioned before and presented in Table 4, the number of efficient points for STP does not depend much on the number of vertices, which is also obvious from the last two rows of Table 5.

In all considerations in this paper it was assumed that the criteria are not correlated. On the other hand, the number of efficient points decreases as the correlation between criteria increases. Since correlation is known to exist between many criteria used in practice (the length, time and price of a path; price and reliability; etc.), we can expect an even smaller number of efficient points in practical problems.

Bibliography

[1] M. Ehrgott, Multicriteria Optimization, Springer-Verlag, 2000.

[2] M. Ehrgott, X. Gandibleux, A survey and annotated bibliography of multiobjective combinatorial optimization, OR Spektrum, Vol. 22, pp. 425-460, 2000.

[3] V.A. Emelichev, V.A. Perepelitsa, On cardinality of the set of alternatives in discrete many-criterion problems, Discrete Math. Appl., Vol. 2, pp. 461-471, 1992.

[4] H.W. Hamacher, G. Ruhe, On spanning tree problems with multiple objectives, Ann. Oper. Res., Vol. 52, pp. 209-230, 1994.

[5] I.V. Sergienko, V.A. Perepelitsa, Finding the set of alternatives in discrete multicriterion problems, Cybernetics, Vol. 23, pp. 673-683, 1987.
[6] M. Stanojević, M. Vujošević, B. Stanojević, Number of efficient points in some multiobjective combinatorial optimization problems, International Journal of Computers, Communications & Control, Vol. III (suppl. issue), pp. 497-502, 2008.

[7] M. Visée, J. Teghem, M. Pirlot, E.L. Ulungu, Two-phases method and branch and bound procedures to solve the bi-objective knapsack problem, J. Glob. Optim., Vol. 12, pp. 139-155, 1998.

[8] M. Vujošević, M. Stanojević, Multiobjective traveling salesman problem and a fuzzy set approach to solving it, in: D. Ivanchev, M.D. Todorov (eds.), Applications of Mathematics in Engineering and Economics, Heron Press, Sofia, pp. 111-118, 2002.

[9] M. Vujošević, M. Stanojević, A bicriterion Steiner tree problem on graph, Yugosl. J. Oper. Res., Vol. 13, pp. 25-33, 2003.

Milan Stanojević
University of Belgrade, Faculty of Organizational Sciences
154 Jove Ilića, 11000 Belgrade, Serbia
E-mail: milans@fon.bg.ac.yu

Mirko Vujošević
University of Belgrade, Faculty of Organizational Sciences
154 Jove Ilića, 11000 Belgrade, Serbia
E-mail: mirkov@fon.bg.ac.yu

Bogdana Stanojević
Transilvania University of Braşov, Department of Computer Science
50 Iuliu Maniu, Braşov, Romania
E-mail: bpop@unitbv.ro

Milan Stanojević was born in Belgrade, Serbia, in 1965. He graduated from the University of Belgrade, Faculty of Organizational Sciences, in 1990 and obtained his doctoral degree at the same faculty in 2005. Since 1993 he has worked at the Faculty of Organizational Sciences, at the beginning as a teaching assistant and now as an assistant professor of operational research. He has published more than 40 papers in national and international journals and conference proceedings in the field of operational research.
His research interests include multiobjective optimization, combinatorial optimization and software for operational research.

Mirko Vujošević was born in Podgorica, Yugoslavia, in 1951. He graduated in electrical engineering at the University of Belgrade, where he also finished his postgraduate studies and earned his doctorate. From 1976 to 1995 he worked at the Mihailo Pupin Institute, Belgrade, and now he holds the chair of operational research and statistics at the Faculty of Organizational Sciences, University of Belgrade. He has published more than 150 professional papers on different topics of operational research, reliability, maintenance, inventory control and applied mathematics. He is author or co-author of two monographs (one of them published by Elsevier), five textbooks, and several chapters in monographs. He is a member of the editorial boards of several scientific and professional journals and an associate editor of YUJOR (Yugoslav Journal of Operational Research). He is a member of the programme committees of several national and international conferences, as well as of several professional societies: DOPIS (Yugoslav Operational Research Society, whose president he was for eight years), PRIM (Serbian Society for Applied and Industrial Mathematics), IEEE (Institute of Electrical and Electronics Engineers), ORS (Operational Research Society, U.K.), ISIR (International Society for Inventory Research) and INFORMS (Institute for Operations Research and the Management Sciences). He is a member of the Academy of Engineering Sciences of Serbia.

Bogdana Stanojević was born in Oradea, Romania, in 1972. She graduated in mathematics and computer science at the "Transilvania" University of Braşov in 1995 and obtained her doctoral degree in mathematics in 2003 from the Romanian Academy. Currently she is an associate professor at the Computer Science Department of the Transilvania University of Braşov.
Her research interests include different aspects of fuzzy optimization, multiple objective optimization and the mathematical fundamentals of computers.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 4, pp. 326-334

A Framework for Enhancing Competitive Intelligence Capabilities Using a Decision Support System Based on Web Mining Techniques

I. Anica-Popa, G. Cucui

Ionuţ Anica-Popa
Academy of Economic Studies of Bucharest, Romania
E-mail: ionut.anica@ase.ro

Gabriel Cucui
Valahia University of Targoviste, Romania

Abstract: Nowadays competitive intelligence (CI) represents one of the most important pieces in the strategic management of organizations seeking to sustain and enhance a competitive advantage over competitors. Some studies claim that successful strategic management is influenced by the accuracy of the evaluation of the external environment and, at the same time, that correct and complete business strategies need to be sustained by competitive advantage; yet until the beginning of the 1980s things were quite different. This paper presents the evolution and the objectives of CI, the results of using CI in organizations, and how the CI process can be improved using tools and techniques provided by business intelligence (BI). The study proposes a framework for a decision support system based on web mining techniques in order to enhance the competitive intelligence capabilities of an organization.

Keywords: competitive intelligence, information system, decision making process, web mining

1 Introduction

Several studies ([13], [27], [30], [28], [32]) consider that, starting with the second half of the 20th century, a series of researches focused on enhancing the competitiveness of the organization based on the concept of intelligence as a process.
[3] and [37] state that the accuracy of the evaluation of the external environment has a big influence on successful strategic management, and that correct and complete business strategies must be sustained by competitive advantage. In the context of using methods and tools provided by information technology and communication (ITC), the increasing significance of using information and knowledge in order to obtain a competitive advantage, and the enhanced ability of the organization to maximize the efficiency of using internal and external linkages through new data and information flows [32], represent an activity which must be closely supervised in every organization.

Studies claim that competitive intelligence (CI), as a research area, has attracted growing attention over the last two decades and especially in the last years ([7], [5]). Some of the main factors which had a great influence on this evolution are: (1) the spectacular development of the information technology and communication (ITC) domain; (2) the methods and tools provided by ITC; (3) the exponentially growing use of the Internet, many CI researchers considering that the Internet represents one of the major information data sources used by CI ([36], [41], [5]).

The research objectives of this study are outlined below:

• presenting a short overview of the evolution of competitive intelligence (CI) and possible ways of its future development;

• identifying the objectives of using competitive intelligence (CI) in organizations;

• presenting a short overview of some specific tools and techniques provided by business intelligence (BI) which can be used in competitive intelligence;

• identifying the results of using competitive intelligence (CI) in organizations;

• proposing a framework for a decision support system using web mining techniques.
An extensive review of the literature was conducted in order to accomplish the goals of this study.

2 Definition and Evolution of Competitive Intelligence

The literature related to intelligence suggests that there are three directions in this area [5]:

• the first is represented by the military domain, where one of the earliest studies regarding intelligence can be identified: The Art of War by Sun Tzu (translated by Griffith in 1963);

• the second is represented by national security as a policy issue [4];

• the third is represented by the economic domain, where intelligence determines an increasing competitiveness of organizations [12].

[21] consider that information is factual, because it consists of numbers, indicators, and data about competitors, customers and suppliers regarding their past actions, whereas intelligence represents a collection of cleaned, filtered and analyzed information that supports managers in the decision making process. The modern approach to competitive intelligence (CI) appeared after the Second World War and started to become important in the 1980s [15], the indicators sustaining this being:

• the number of conferences on the topic of competitive intelligence (CI), which is increasing constantly;

• the increasing number of books, articles, studies, papers and other types of publications about competitive intelligence (CI);

• the increasing number of academic courses and professional programs dedicated to competitive intelligence (CI);

• the developing and increasing role of the Society of Competitive Intelligence Professionals (SCIP), a global organization with the declared mission to "be the global organization of choice for professionals engaged in competitive intelligence and related disciplines" and to "be the premier advocate for the skilled use of intelligence to enhance business decision-making and organizational performance" (www.scip.org), and with the declared vision "better decisions through competitive intelligence".
Starting with 1980, an important number of researchers tried to define the concept of competitive intelligence (CI). CI is represented by information which describes how competitive an organization is and which, at the same time, makes it possible to predict the moves of the business environment actors (competitors, customers, suppliers, government, etc.) [16]. CI represents an iterative and systematic process for gathering and analyzing data and information about the activities of competitors, the business environment and business trends in order to fulfill the goals of the organization [21]. CI represents a process that predicts the behavior and moves of the actors (competitors, customers, suppliers, government, etc.) which interact with the organization or influence either the business environment or the behavior of the organization [6]. The results provided by CI are used to identify potential business opportunities and to minimize the possibility of unpredictable situations appearing; these facts define the goal of CI: to predict future situations (what is going to happen) rather than debate past situations (what did happen). CI is a process that aims to monitor the external business environment of the organization in order to identify information relevant for the decision making process [9].

Competitive intelligence represents a process that consists of two levels [25]:

• accessing legal and ethical data sources (public or semi-public) in order to gather data for building new data sources regarding competitors, competition, environmental conditions, past, present or future trends, etc.;

• analyzing those new data sources in order to transform data into usable and valuable information and knowledge that will support the decision making process.
Summarizing all those points of view, we consider that competitive intelligence represents a continuous process of gathering data, information and knowledge about the actors (competitors, customers, suppliers, government, etc.) which interact with the organization in the business environment, in order to support the decision making process and enhance the competitiveness of the organization. The CI process consists of the following steps: monitoring the business environment (external data, information and knowledge), then gathering, analyzing, filtering and disseminating intelligence that will support the decision making process in order to increase the competitiveness and improve the position of the organization.

Figure 1: Competitive intelligence (CI) process

[33] considers that the evolution of CI can be divided into the following stages: (1) "competitive intelligence gathering", which occurred through the '60s and '70s; (2) "industry and competitor analysis", between 1980 and 1987; (3) "competitive intelligence for strategic decision making", between 1987 and 2000; (4) "competitive intelligence as a core capability", which represents the present stage in CI evolution.

In the competitive intelligence gathering stage of CI, the key defining event was the book Competitive Strategy published by Michael Porter in 1980. Personnel involved in CI activities were located mostly in the library or the marketing department, and the primary skill was the capability to find information [33]. Despite the fact that organizations collected large amounts of data, statistical analyses were rarely applied to the gathered data. Another important aspect was the very weak connection between CI and the decision making process. The development of skills in information acquisition was the key issue of this stage.

In the industry and competitor analysis stage of CI, the key defining event was the founding of the Society of Competitive Intelligence Professionals (SCIP).
personnel involved in ci activities were located mostly in the planning or marketing department, quantitative analyses were applied to the gathered data, and the connection between ci and the decision making process was weak [33]. building a business case for ci, the spy image and analytical skill development were the key issues of this stage.

a framework for enhancing competitive intelligence capabilities using decision support system based on web mining techniques

in the competitive intelligence for strategic decision making stage, the key defining event was the establishment of the competitive intelligence review. personnel involved in ci activities were located mostly in the planning or marketing department or in a ci unit, quantitative and qualitative analyses were applied to the gathered data, and the connection between ci and the decision making process was strong. demonstrating bottom-line input, the role of information technology, ci technology, international ci, demand- vs. supply-driven ci and counter-intelligence were the key issues of this stage [33]. in the competitive intelligence as a core capability stage, the key defining event is represented by ci courses taught in universities and business schools across the world. personnel involved in ci activities are located mostly in ci units, planning or marketing, analyses with a qualitative emphasis are applied to the gathered data, and the information and knowledge provided by ci represent raw material for decision making. managing the parallel process, intelligence infrastructures for multinationals, ci as learning and network analysis are the key issues of this stage.

3 the objectives and results of using ci in organizations

from the beginning it is important to emphasize that it is not necessary for every organization to develop and implement an organized ci activity.
in many cases, formal information gathered from external sources (internet, media, government etc.) and informal data and information collected, analyzed and filtered by managers represent a very flexible and efficient informal ci. but for some organizations an informal ci is not enough to support the tactical and strategic decision making process, a fact that leads to the development and implementation of an organized ci system. ci can be used to increase the capabilities of the organizational memory (om) of the enterprise, because the om contains organizational memory information systems which are based on knowledge management techniques [39]. in recent years, enterprises have been constantly working to improve and enhance their operations [40]. the reasons behind the decision to implement ci in an organization are various [21]:
• the globalization process, which determines an increasing level of competition among organizations, because the number of competitors is increasing and, at the same time, the quantity and quality of the goods and services they provide grow continuously;
• the new products, services, methods and tools provided by the information technology and communication domain;
• the rapid change of the business environment, where new business opportunities appear and disappear very fast and the period of time allocated to decision making processes is constantly decreasing;
• political changes, which affect and influence the business environment as well as the evolution of organizations.
a review of the literature related to the objectives of using ci in organizations reveals quite a wide and ambitious arrangement of objectives for ci [31].
[1] considers that the objectives of using an organized information system of ci are: (1) identifying and analyzing new business opportunities or market trends; (2) developing or updating software using the latest technologies, methods and tools for software development; (3) maximizing revenues and minimizing expenses; (4) identifying, understanding and analyzing strategies, already implemented or in the phase of implementation. [31] considers that the objectives of using an organized information system of ci are: (1) helping the organization to gain a competitive edge; (2) revealing opportunities and threats by surveying weak signals; (3) processing and combining data, information and knowledge in order to produce new knowledge about competitors, customers, suppliers etc.; (4) providing useful information for managers in the decision making process and reducing the time consumed by the decision making process. in our opinion, the objectives of using an organized information system of ci in organizations are:
• enhancing the organization’s competitiveness;
• predicting, with a high level of trust, the business environment’s evolution, competitors’ actions, customers’ requirements, and even influences generated by political changes;
• providing ever better support for the strategic decision making process.
there are studies that identify some benefits derived from using an organized information system of ci [16]: (1) increasing managers’ analytical skills and the ability to anticipate the moves of the other actors in the organization’s business environment; (2) sharing ideas and knowledge inside the organization in order to develop new ideas or knowledge or to integrate existing ones into the organization.
some authors have identified further benefits of using an organized information system of ci [21]: (1) discovering new potential competitors or customers and supporting the start of new businesses; (2) identifying and analyzing new technologies, products and processes that influence the organization’s activities and behavior; (3) identifying and analyzing political or legislative standards or regulations that influence the organization’s activities and behavior; (4) identifying and analyzing situations, of competitors, customers, suppliers or others, that evolved into successes or into failures.

4 decision support system and web mining

in recent years, due to the economic evolution, the amount of data and information which must be used in order to make better decisions has increased extraordinarily; therefore, the decision making process in organizations has evolved spectacularly [42]. companies gather data from various sources and deposit them in data warehouses, which represent collections of subject-oriented, integrated, time-variant and non-volatile data used to support the process of decision making [20]. each person involved in the process of decision making can and must be supported in performing his activities by specialized it tools [14]. [8] considers that decision support systems (dss) represent a specific class of information systems that support business and organizational decision making activities. [19] states that business intelligence represents the ability of companies to analyze and study behavior and actions from previous periods of time in order to understand the current position of the organization and to predict, or try to change, what is going to happen in the future. [35] considers that business intelligence is composed of the following components: (1) extraction, transformation and loading (etl); (2) data mining; (3) online analytical processing (olap); (4) enterprise reporting.
in the last decade, the growth of the internet has been spectacular, the amount of information residing on it is huge, and nowadays the world wide web represents one of the major sources of data and information for all activities [11]. in order to extract relevant information from the web, it is necessary to examine and analyze the content, structure and usage of web resources using data mining techniques [26]. because the scope of data mining is to identify patterns hidden in large databases [29], the web mining technologies developed in the last decade have determined an increase in the volume of valuable material which can easily be browsed and identified over the internet [18]. web data mining represents a group of three concepts (figure 2): web content mining, web structure mining and web usage mining, and it is very important to make a clear distinction between these concepts ([10], [24], [22]). [34] considers that web content mining represents "the process of searching for contextually relevant sources of web data whose embedded information can be extracted and used to generate actionable knowledge". [24] considers that "web structure mining tries to discover the model underlying the web hyperlink structure". [34] considers that web usage mining represents "the process of searching for user behavior patterns by mining the data stored in referrer logs, server access logs, and other web user behavior data repositories".
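as an illustration of the raw material of web structure mining, the hyperlinks embedded in a page can be harvested with a few lines of code. this is a minimal sketch using python's standard html.parser; the page content and urls are invented for the example, not drawn from the paper:

```python
from html.parser import HTMLParser

class LinkExtractor(HTMLParser):
    """collect href targets of <a> tags: the links whose graph
    web structure mining analyzes."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

# hypothetical page of a competitor's site
page = ('<html><body>'
        '<a href="http://competitor.example/products">products</a>'
        '<a href="http://competitor.example/contact">contact</a>'
        '</body></html>')

extractor = LinkExtractor()
extractor.feed(page)
print(extractor.links)
```

collecting such links over many pages yields the hyperlink graph on which structure mining models are built.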
figure 2: structure of web data mining

there are studies ([23]; [38]) which consider that web content mining and web structure mining can be used to gather data about business competitors in order to enhance business decision making and organizational performance.

5 proposed framework for a decision support system using web mining capabilities

a modern company’s information system must also contain data and information acquired from the web, which are extremely dynamic. at the same time, during the decision making process, the information system must be updated with the latest data, information and knowledge. this section presents a framework for a decision support system using web mining techniques, based on an architecture consisting of three tiers (figure 3): the decision tier, the logic tier and the data tier. the decision tier, which is the top-most level of the model, receives the request from the user and, after translating it, invokes the logic tier to resolve it. after receiving the results from the logic tier, the decision tier transposes them into a format that helps the user understand and use them. the logic tier, which is the middle level of the model, has the following functions:
• gathering data, information and knowledge from the company’s information system, invoking data mining algorithms, performing calculations, making evaluations and finally transforming the obtained data into useful information [2];
• gathering data, information and knowledge from the web using specific web mining techniques and providing them to the data tier.
the data tier stores and retrieves information from all the databases contained in the company’s information system and transmits it to the logic tier for processing.
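the interplay of the three tiers can be sketched in code. the class names, the query format and the sample data below are illustrative assumptions, not part of the framework's specification:

```python
class DataTier:
    """bottom tier: stores and retrieves records from the company's databases."""
    def __init__(self):
        # hypothetical database of competitor records
        self.databases = {"competitors": [{"name": "acme", "revenue": 120}]}

    def retrieve(self, source):
        return self.databases.get(source, [])

class LogicTier:
    """middle tier: gathers data from the data tier and turns it into information."""
    def __init__(self, data_tier):
        self.data_tier = data_tier

    def resolve(self, query):
        records = self.data_tier.retrieve(query["source"])
        # stand-in for the data/web mining and evaluation steps
        return [r for r in records if r["revenue"] >= query["min_revenue"]]

class DecisionTier:
    """top tier: translates the user's request, invokes the logic tier
    and formats the results for the decision maker."""
    def __init__(self, logic_tier):
        self.logic_tier = logic_tier

    def request(self, source, min_revenue):
        results = self.logic_tier.resolve({"source": source,
                                           "min_revenue": min_revenue})
        return [f"{r['name']}: revenue {r['revenue']}" for r in results]

dss = DecisionTier(LogicTier(DataTier()))
print(dss.request("competitors", 100))
```

the point of the layering is that the decision tier never touches storage directly: it talks only to the logic tier, which in turn is the only client of the data tier.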
6 conclusions

understanding business environment dynamics and predicting its evolution is a challenging and difficult task for every information system of ci, but it represents, at the same time, one of the most expected needs and requirements of organizations. an information system of ci used in an efficient way will determine an increased level of efficiency of the adopted and applied decisions. an information system of ci is recommended to be developed, implemented and used in order to create a competitive advantage over competitors, and it has become more and more evident that a ci process is necessary in every organization which intends to increase its existing level of intelligence.

figure 3: framework for dss based on web mining techniques

bibliography
[1] s. alampalli, role of ci in opportunity assessment (online version), 2002, available from: http://www.scip.org/publications/cimarticledetail.cfm?itemnumber=985 (downloaded: 21 september 2006).
[2] l. anica-popa, i. anica-popa, decision support in global organization, accounting and management information system journal, supplement 2006, pp. 536-54, 2006.
[3] t.s. bateman, s.a. snell, management: competing in the new era, 5th ed., new york, ny, mcgraw-hill higher education, 2002.
[4] b.d. berkowitz, a.e. goodman, strategic intelligence for american national security, princeton, nj, princeton university press, 1989.
[5] f. bouthillier, t. jin, competitive intelligence and webometrics, journal of competitive intelligence and management, volume 3, no. 3, pp. 19-39, 2005.
[6] j. calof, increasing your ciq: the competitive intelligence edge, ecdevjournal (online version), 1998, available from: http://www.ecdevjournal.com (downloaded: 24 june 2006).
[7] c. carr, g.s. erickson, h.n. rothberg, intellectual capital, competitive intelligence and the economic espionage act, international journal of learning and intellectual capital, volume 1, pp. 152-164, 2004.
[8] m. castellano, g. mastronardi, a. aprile, m. minardi, p. catalano, v. dicensi, g. tarricone, a decision support system base line flexible architecture to intrusion detection, journal of software, volume 2, no. 6, pp. 30-41, 2007.
[9] h. chen, m. chau, d. zeng, ci spider: a tool for competitive intelligence on the web, decision support systems, volume 34, pp. 1-17, 2002.
[10] r. cooley, b. mobasher, j. srivastava, web mining: information and pattern discovery on the world wide web, in proceedings of the 9th ieee international conference on tools with artificial intelligence (ictai’97), newport beach, ca, 1997.
[11] i. dzitac, i. moisil, advanced ai techniques for web mining, proceedings of the 10th wseas international conference on mathematical methods, computational techniques and intelligent systems, corfu, greece, pp. 343-346, 2008.
[12] r. ecells, p. nehemkis, corporate intelligence and espionage: a blueprint for executive decision making, new york, ny, macmillan, 1984.
[13] w.r. fair, the corporate cia: a prediction of things to come, management science, volume 12, no. 10, pp. b489-b503, 1966.
[14] f.g. filip, decision support and control for large-scale complex systems, annual reviews in control, volume 32, no. 1, pp. 61-70, 2008.
[15] c.s. fleisher, an introduction to the management and practice of competitive intelligence, in c.s. fleisher, d.l. blenkhorn (eds.), managing frontiers of competitive intelligence, westport, ct, quorum books, pp. 3-18, 2001.
[16] b. gilad, the role of organized competitive intelligence in corporate strategy, columbia journal of world business, volume 24, no. 4, pp. 29-36, 1989.
[17] s.e. griffith, sun tzu: the art of war, new york, oxford university press, 1963.
[18] l. hâncu, data-mining techniques for supporting merging decisions, international journal of computers, communications & control, vol. iii, suppl. issue: proceedings of icccc 2008, pp. 322-326, 2008.
[19] c. imhoff, m. galemmo, j.g. geiger, mastering data warehouse design: relational and dimensional techniques, indianapolis, in, wiley publishing, 2003.
[20] w. inmon, building the data warehouse, fourth edition, indianapolis, in, wiley publishing, 2005.
[21] l. kahaner, competitive intelligence: from black ops to boardrooms: how businesses gather, analyze, and use information to succeed in the global marketplace, new york, ny, 1996.
[22] j.s. kim, customized recommendation mechanism based on web data mining and case-based reasoning, in intelligent agents for data mining and information retrieval, hershey, pa, idea group publishing, 2004.
[23] b. liu, y. ma, p.s. yu, discovering unexpected information from your competitors’ web sites, proceedings of the seventh acm sigkdd international conference on knowledge discovery and data mining, san francisco, usa, 2001.
[24] s. madria, s.s. bhowmick, w.k. ng, e.p. lim, research issues in web data mining, proceedings of data warehousing and knowledge discovery, first international conference, florence, italy, 1999.
[25] j.j. mcgonagle, c.m. vella, a case for competitive intelligence, the information management journal, july/august edition, vol. 36, no. 4, pp. 35-40, 2002.
[26] b. mobasher, web mining overview, in encyclopedia of data warehousing and mining, hershey, pa, idea group, 2006.
[27] d.b. montgomery, g.l. urban, marketing decision-information systems: an emerging view, journal of marketing research, volume 7, pp. 226-234, 1970.
[28] d.b. montgomery, c.b. weinberg, toward strategic intelligence systems, journal of marketing, volume 43, pp. 41-52, 1979.
[29] m. pater, d.e. popescu, multi-level database mining using afopt data structure and adaptive support constrains, international journal of computers, communications & control, vol. iii, suppl. issue: proceedings of icccc 2008, pp. 437-441, 2008.
[30] f.t. pearce, business intelligence systems: the need, development and integration, industrial marketing management, volume 5, pp. 115-138, 1976.
[31] m. peltoniemi, e. vuori, competitive intelligence and co-evolution within an organisation population, proceedings of the 6th european conference on knowledge management, september 8-9, 2005, university of limerick, ireland, 2005.
[32] m.e. porter, v.e. millar, how information gives you competitive advantage, harvard business review, volume 63, no. 4, pp. 149-161, 1985.
[33] j. prescott, the evolution of competitive intelligence: designing a process for action, proposal management, pp. 37-52, 1999.
[34] d.s. soper, a framework for automated web business intelligence systems, proceedings of the 38th annual hawaii international conference on system sciences, usa, 2005.
[35] z.h. tang, j. maclennan, data mining with sql server 2005, indianapolis, in, wiley publishing, 2005.
[36] t.s.h. teo, w.y. choo, assessing the impact of using the internet for competitive intelligence, information and management, volume 39, no. 1, pp. 67-83, 2001.
[37] a.a. thompson, a.j. strickland, strategic management: concepts and cases, 13th ed., new york, ny, mcgraw-hill higher education, 2003.
[38] l. vaughan, j. you, mining web hyperlink data for business information: the case of telecommunications equipment companies, proceedings of the first ieee international conference on signal-image technology and internet-based systems, yaoundé, cameroon, 2005.
[39] m. vrincianu, l. anica-popa, i. anica-popa, organizational memory: an approach from knowledge management and quality management of organizational learning perspectives, amfiteatru economic, no. 26, pp. 473-81, 2009.
[40] s. wadhwa, j. madaan, a multi criteria decision model for alternative selection in reverse logistics system, studies in informatics and control, vol. 16, no. 3, pp. 271-282, 2007.
[41] g. windle, how can competitive intelligence practitioners avoid over-relying on the internet, in c.s. fleisher, d.l. blenkhorn (eds.), controversies in competitive intelligence, westport, pp. 85-97, 2003.
[42] p. zarate, decision making process: a collaborative perspective, studies in informatics and control, vol. 17, no. 2, pp. 225-230, 2008.
[43] *** http://www.scip.org

international journal of computers, communications & control vol. ii (2007), no. 3, pp. 269-278

efficient variable length block switching mechanism

jaidhar c.d, a.v. reddy

abstract: the most popular and widely used packet switch architecture is the crossbar. its attractive characteristics are simplicity, non-blocking operation and support for simultaneous transmission of multiple packets across the switch. a special version of the crossbar switch is the combined input crossbar queue (cicq) switch. it overcomes the limitations of the un-buffered crossbar by employing buffers at each crosspoint in addition to buffering at each input port. the adoption of crosspoint buffers (cb) simplifies the scheduling complexity and suits the distributed nature of scheduling; as a result, a matching operation is not needed. moreover, it supports the transmission of variable length packets without segmentation. however, native switching of variable length packets results in unfairness. to overcome this unfairness, a fixed length block transfer mechanism has been proposed. it has the following drawbacks: (a) fragmented packets are reassembled at the crosspoint buffer (cb); hence, the minimum buffer requirement at each crosspoint is twice the maximum size of the block. when the number of ports is large, such a switch is infeasible due to the restricted memory available in the switch core. (b) the reassembly circuit at each crosspoint adds to the cost of the switch. (c) a packet is eligible for transfer from the cb to the output only when the entire packet has arrived at the cb, which increases the latency of fragmented packets in the switch.
to overcome these drawbacks, this paper presents a variable length block transfer mechanism. it does not require internal speedup or segmentation and reassembly circuits. using simulation, it is shown that the proposed mechanism is superior to the fixed length block transfer mechanism in terms of delay and throughput.
keywords: crossbar switch, un-buffered crossbar switch, buffered crossbar switch, combined input crossbar queue switch.

1 introduction

packet switching technology has become the predominant technology for high speed data networks and has begun to be used for applications such as voice communication which have traditionally relied on circuit switching. recently, many large-scale fast routers have used input queued (iq) switches. in an input queued switch, variable length packets arriving at the inputs are segmented into fixed size packets known as cells for transmission over the switch and reassembled into packets at the output before being transmitted. the cell transmission time is fixed and is called the cell time or time slot. when a packet size is not an integral multiple of the cell size, padding bytes are needed for the last fragment; this is called segmentation overhead. a smaller cell size generates more cells per packet and leads to a larger switch header overhead. in high-speed switches/routers, segmentation results in a heavy load. moreover, if an optical switching technology is introduced, it is even more difficult to segment packets in the optical domain. in high speed optical networks, it is more reasonable that incoming ip packets pass through a switch fabric based on a packet by packet switching scheme. variable length packet switching considers the entire ip packet as a single switching unit, doing away with padding bytes and reassembly buffers. for high speed switches/routers, the non-existence of reassembly buffers and circuits is an attractive feature.
in ip networks, the speed at which a scheduler can switch cells is not really significant, because even if only one cell remains for switching in an input queue, it is impossible to reassemble a complete ip packet in the reassembly buffer. hence, it is reasonable to compare packet switching with cell switching from the point of view of quality of service (qos).

copyright © 2006-2007 by ccc publications

in general, the latency problem in a packet switching scheme is worse than in a cell switching scheme because of a decrease in the statistical multiplexing effect. however, from the packet latency viewpoint, the packet switching scheme performs better than the cell switching scheme. in addition, packet latency matters much more than cell latency for qos requirements, because each packet represents complete information. variable length packets dominate network traffic (e.g., ip packets in ethernet frames). thus, a study of high-speed switches that support variable length packet switching is needed. a high performance variable length packet switching mechanism is proposed for efficiently switching variable length ip packets. the performance of the proposed mechanism is evaluated in terms of packet latency and throughput. overall, the performance of our mechanism is better than the earlier one.

2 previous work

buffers in an input queued (iq) switch can be organized as a single queue or multiple queues. the simplest is a single first in first out (fifo) queue, in which cells are served according to their arrival order, i.e., on a first come first served (fcfs) basis. due to the head of line (hol) blocking problem, the fcfs service discipline limits the maximum throughput to 58.6% [1]. to overcome the hol blocking problem, several techniques have been proposed. one of them is the virtual output queue (voq) [2], in which each input maintains a separate queue for each output. the voq approach overcomes the hol blocking problem. however, it creates an input contention.
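the difference between a single fifo and voqs can be sketched in a few lines: with one queue per output, a busy output blocks only the packets destined to it. the queue contents and port count below are invented for the example:

```python
from collections import deque

N = 3  # number of output ports (invented for the example)

# virtual output queues at one input: a separate queue per output
voq = [deque() for _ in range(N)]

def enqueue(packet, output):
    voq[output].append(packet)

def schedulable(busy_outputs):
    """outputs this input could serve now: a busy output blocks only its own
    queue, whereas in a single fifo its hol packet would block everything."""
    return [j for j in range(N) if voq[j] and j not in busy_outputs]

enqueue("p1", 0)  # hol packet destined for output 0
enqueue("p2", 1)  # in a single fifo this would be stuck behind p1

print(schedulable(busy_outputs={0}))  # output 0 busy, but output 1 is reachable
```

with a single fifo, p2 would wait behind p1 even though output 1 is idle; with voqs it remains schedulable, which is exactly how voqs remove hol blocking.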
output contention occurs when more than one input wishes to send a packet to the same output at the same time. to resolve both input and output contentions, schedulers are used, whose function is to find a one to one conflict-free match between inputs and outputs in every time slot. recently, many scheduling algorithms have been proposed for input queued switches [3-8]. to cope with the overheads and the scheduler inefficiencies, internal speedup is used as an alternative solution. the switch speedup s is defined as the ratio of the switch bandwidth to the bandwidth of the line rates. internal speedup is a good solution, but it incurs significant cost. if the speedup is n for an n × n switch, cells arriving at the inputs get immediately transferred to their corresponding outputs. hence, buffers are used only at the output side, and this kind of architecture is called an output queued (oq) switch. an oq switch can provide qos guarantees to individual data flows or groups of data flows [9]. however, the oq switch is inherently less scalable when the number of ports is large or the link rate is high. this makes the iq switch an attractive candidate when the line rate is high or the number of ports is large. but the iq switch fails to provide guaranteed qos. to satisfy both requirements, high switching capacity and guaranteed qos, simultaneously, the buffered crossbar switch was proposed as an alternative. the buffered crossbar (bc) switch overcomes the limitation of the un-buffered crossbar switch by employing buffers at each crosspoint. the adoption of crosspoint buffers (cb) drastically improves the overall performance of the switch. the first bc switch was implemented as a large multi-cabinet unit [10]. a pure buffered crossbar (pb) switch employs buffers only at each crosspoint and nowhere else. incoming cells at the input enter directly into the switch core to reside in their corresponding cb. a pb switch consisting of a fifo cb preceded by an address filter (af) was proposed in [11].
yet another pb switch, with a restricted cb of size 16 kb, was introduced in [12]. there is a possibility of packet loss due to the restricted cb size. to prevent packet loss completely, a larger cb is needed, but the affordable cb size is inversely proportional to the switch size: when the port count increases, a larger memory is forced onto the switch core. the requirement for a larger cb can be minimized by employing buffers at the input side in addition to the cbs. such an architecture is known as the combined input crossbar queue (cicq) switch. it separates input and output contentions. schedulers at each input and output port work independently and in parallel. a cicq switch using fifo input buffers was proposed in [13, 14]. using simulation, it is shown that the throughput of the fifo input buffered cicq switch is limited to 91% due to the hol blocking problem [13]. to overcome the hol blocking problem, voqs have been used in most of the proposed cicq architectures. in the rest of the paper, the virtual output queued cicq switch is referred to as the cicq switch. the cicq switch with voqs was first proposed in [15]. to schedule the cells, the oldest cell first (ocf) selection strategy is employed at all the contention points. a cicq switch in which the cb size is one cell has been proposed in [16], in which longest queue first (lqf) scheduling is used at the input side and round robin (rr) scheduling is used at the output side. yet another cicq switch architecture, where the cb is restricted to a single cell, has been proposed in [17], in which rr scheduling is used at all the contention points. all these scheduling algorithms were simple mappings of earlier algorithms, proposed for the un-buffered crossbar switch, onto the new cicq switch architecture.

3 combined input crossbar queue switch (cicq) architecture

our n × n cicq switch model is shown in fig. 1 and has the structure described below.
figure 1: combined input crossbar queue switch with round robin scheduling at all the contention points.

input queue: there are n voqs at each input, one for each output. packets arriving at input i destined for output j are stored in voqi,j. the internal fabric consists of n² crosspoint buffers. the cbi,j stores the packets coming from input i destined to output j, where i, j = 1, 2, . . . , n, and its size is set to 2250 bytes.
scheduler: each input and output port has its own scheduler, and each of them works independently and in parallel. at all the contention points, round robin (rr) scheduling is used. the input scheduler selects a voq from among the active voqs. a voqi,j is said to be active, i.e., eligible in the input scheduling process, if it is not empty and the corresponding cbi,j has enough space to accommodate the incoming block, i.e., the value of its credit counter is greater than or equal to the size of its head block. the output scheduler is responsible for: (a) selecting the next eligible flow (cb) in its column; (b) initiating the transmission of packets to the specific switch output and sending a credit back to the appropriate input credit scheduler. a flow is eligible when the corresponding cb is not empty. if there is more than one eligible flow, the output scheduler selects one of them in rr fashion.
flow control: a credit based flow control mechanism is used in order to provide lossless transmission between the input port and the cb [18]. each input i maintains n credit counters, one for each voq. initially, the value of these counters is set to the cb size. whenever an input scheduler forwards a block from a voq, it decrements the respective credit counter by the size of the forwarded block. when a block departs from cbi,j, the corresponding output scheduler sends a credit back to the respective voqi,j.
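the input scheduler and the credit based flow control described above amount to simple bookkeeping, which can be sketched as follows. the queue contents, the two-output setup and the function names are invented for the illustration; the real schedulers are hardware arbiters:

```python
from collections import deque

CB_SIZE = 2250  # crosspoint buffer size in bytes, as in the model above
N = 2           # outputs at this input (invented for the example)

# voq[j] holds the sizes (bytes) of blocks queued for output j (invented sizes)
voq = [deque([1500, 700]), deque([2250])]
credit = [CB_SIZE] * N   # one credit counter per voq, initialised to the cb size
pointer = 0              # round robin pointer of the input scheduler

def input_schedule():
    """forward the head block of the next active voq in rr order; a voq is
    active when non-empty and its credit covers the size of its head block."""
    global pointer
    for k in range(N):
        j = (pointer + k) % N
        if voq[j] and credit[j] >= voq[j][0]:
            block = voq[j].popleft()
            credit[j] -= block          # the block now occupies space in cb(i,j)
            pointer = (j + 1) % N
            return j, block
    return None                         # nothing active: empty voqs or no credit

def credit_back(j, block):
    """invoked when a block of the given size departs from cb(i,j)."""
    credit[j] += block

print(input_schedule())   # (0, 1500): voq 0 is served first
print(input_schedule())   # (1, 2250): rr moves on; voq 1 has exactly enough credit
print(input_schedule())   # (0, 700): back to voq 0, 750 bytes of credit remain
```

because a block is forwarded only when the credit counter covers it, the crosspoint buffer can never overflow, which is what makes the input-to-cb link lossless.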
3.1 variable length packet switching

a significant advantage of the cicq switch is its capacity to directly switch variable length packets without segmenting them. all the input and output schedulers operate independently and in parallel; hence, there is no global "time-frame" that constrains the system to transmit fixed size packets. this does not hold for the un-buffered crossbar switch. native switching of variable length packets eliminates the internal speedup and the reassembly circuits at the outputs. however, native switching of variable length packets results in unfairness. consider two voqs at a port i, where voqi,1 is saturated with large packets and voqi,2 is saturated with small packets. in this scenario, rr polling alternately selects voqi,1 and voqi,2 regardless of packet size, and voqi,1 achieves a higher transfer rate than voqi,2. fig. 2 shows the unfairness caused by the rr selection strategy for a 2 × 2 switch.

figure 2: native switching of variable length packets.

to overcome this unfairness, a fixed length block transfer mechanism has been proposed [19]. in this mechanism, a voq is eligible to transfer a predefined block of bytes (i.e., 1500 b) to its cb when it gets the chance. each block consists of a set of entire packets and/or packet segments, and packet reassembly is performed at the cb. hence, the minimum cb requirement at each crosspoint is twice the maximum size of the block. due to the restricted memory available in the switch core, this does not work when the port count is large. a packet is eligible for transfer from the cb to the output port only when the entire packet has arrived at the cb. as a result, the latency of fragmented packets is increased. consider the worst case scenario where packets p1 of size 40 b and p2 of size 1500 b alternately arrive at the input port. each block contains a fragmented packet. the fragmented packet is delayed at the cb until the arrival of the complete packet, even when there is no output contention.
moreover, the number of reassembly circuits is the square of the switch size, which adds to the cost of the switch. to overcome these problems, a variable length block transfer mechanism is proposed in this paper. the variable length block transfer mechanism transfers up to a block of bytes of packet data from a selected voq to the cb. the block size may vary from block to block, and its maximum size is restricted to 2250 bytes. a block may contain a set of entire packets or a single complete packet. packets that share a common destination are packed inside the block continuously, one after the other. when an entire packet cannot be accommodated in a block, it is packed into a new block instead of being fragmented. unlike cell switching, our mechanism does not use padding bytes to fill the block; hence, speedup is eliminated. fig. 3 shows the variable length block transfer mechanism for input port 1. block 1 at voq1,1 consists of the entire packets p1,1 and p1,2. the sum of the sizes of p1,1, p1,2 and p1,3 is greater than 2250 bytes; hence, packet p1,3 is packed into a new block, block 2, without fragmenting it. under heavy load, the block size may be maximized; as a result, the header overhead is reduced and the crossbar operates very close to maximum efficiency. table 1 compares the proposed mechanism and the fixed length block transfer mechanism.

figure 3: variable length block transfer mechanism.

table 1: comparison of the proposed mechanism and the fixed length block transfer mechanism.
characteristics                        variable length block transfer   fixed length block transfer
cross-point buffer size                2250 bytes                       3000 bytes
segmentation and reassembly circuit    no                               yes
scheduler                              round robin                      round robin
packet segmentation                    no                               yes
block size                             variable                         fixed

4 simulation experiment

three simulation experiments were designed in order to compare the performance of the proposed mechanism with the earlier ones.
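the block packing rule of section 3.1, which the experiments below compare against fixed length blocks, can be sketched as follows. the packet sizes are hypothetical (chosen to mirror the fig. 3 scenario, whose exact sizes are not given); only whole packets ever enter a block:

```python
MAX_BLOCK = 2250  # maximum block size in bytes, as in the proposed mechanism

def pack_blocks(packet_sizes):
    """pack whole packets into blocks of at most MAX_BLOCK bytes; a packet
    that does not fit into the current block opens a new one, so no packet
    is ever fragmented and no padding bytes are added."""
    blocks, current = [], []
    for size in packet_sizes:
        if current and sum(current) + size > MAX_BLOCK:
            blocks.append(current)
            current = []
        current.append(size)
    if current:
        blocks.append(current)
    return blocks

# hypothetical sizes in the spirit of fig. 3: the first two packets fit into
# block 1, the third would overflow it and therefore starts block 2
print(pack_blocks([1000, 1200, 900]))  # [[1000, 1200], [900]]
```

note that a block is never padded up to 2250 bytes; under heavy load the blocks simply fill up, which is why the mechanism needs no internal speedup.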
For all experiments, a 32 × 32 switch, a port speed of 10 Gbps, no internal speedup and a single priority are assumed. The round trip time (RTT) between the line cards and the switch fabric has been set to 40 B times (corresponding to 32 ns at a 10 Gbps line rate), which is the sum of the following delays: (a) input arbitration, (b) the transmission of a packet from an input port to the switch crossbar, (c) output arbitration and (d) the transmission of the flow control information back from the crossbar to the input port. Delay is measured as the time interval between the first byte of a packet arriving at the input port and its first byte departing from the output port; the reported delay is averaged over all packets.

Experiment #1: Poisson arrivals of variable length packets are assumed, and each of the 32 input ports chooses an output port with a uniform distribution over the 32 output ports (λ_i,j = ρ/N for all i and j). Every input port has an identical offered load, ranging from 80 to 98%.

Experiment #2: Each input i hosts two active flows, flow i → i and flow i → (i + 1) mod N. The former consumes two thirds (2/3) of the incoming load and the latter the remaining one third (1/3). Poisson arrivals of variable length packets are assumed, and the offered load ranges from 50 to 98% and from 80 to 98% for small and large packets respectively.

Experiment #3: Both the variable length and the fixed length block transfer mechanisms are modelled under non-uniform (unbalanced) traffic as defined in [17]. This model uses a probability w as the fraction of the input load directed to a single predetermined output, while the rest of the input load is directed to all outputs with a uniform distribution. Let us consider input port s, output port d, and the offered load ρ_s,d for each input port.
The traffic load from input port s to output port d, ρ_{s,d}, is given by

\rho_{s,d} = \begin{cases} \rho\left(w + \dfrac{1-w}{N}\right), & \text{if } s = d \\[4pt] \rho\,\dfrac{1-w}{N}, & \text{otherwise} \end{cases}

When w = 0 the offered traffic is uniform; when w = 1 the traffic is completely directional from input i to output j, i.e., i = j. Poisson arrivals of variable length packets are assumed, and throughput is measured as a fraction of the maximum possible one (320 Gbps in our simulation).

5 Experimental Results

Fig. 4(a) shows the results of experiment #1 under a bimodal packet size distribution in which packets of 40 B and 1500 B arrive alternately at the input ports. The average delay of the proposed mechanism is lower than that of the fixed length block transfer mechanism. In the fixed length mechanism, each block contains a fragmented packet; when the block arrives at the CB, reassembly of the fragmented packet is delayed until the next packet arrives at the CB. This segmentation and reassembly delay increases packet latency. Such delays do not exist in our mechanism; hence, our mechanism exhibits a shorter delay than the earlier one. A block of data bytes is eligible for transfer from the CB to the output as soon as the CB gets the chance; if there is no output contention in a column, the block of data bytes can be transferred immediately, without waiting for the next packet. Fig. 4(b) shows the results of experiment #1 with a uniform packet size. Our mechanism again shows a lower average delay than the fixed length block transfer mechanism owing to the absence of segmentation and reassembly delay. Figs. 5(a) and 5(b) show the results of experiment #2 for small and large packets respectively. The mean delay of the block transfer mechanism is higher for large packets than that of the output queued switch, but smaller for small packets. The presence of two-stage buffering in a CICQ switch introduces a priority based on packet length.
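The unbalanced traffic model used in experiment #3 can be checked numerically. This is our sketch, not the authors' simulator, under the assumption that each input's predetermined output is d = s:

```python
def unbalanced_load(N, rho, w):
    """Return the N x N matrix of per-pair offered loads rho_{s,d}:
    a fraction w of each input's load goes to its predetermined output,
    the remainder is spread uniformly over all N outputs."""
    return [[rho * (w + (1.0 - w) / N) if s == d else rho * (1.0 - w) / N
             for d in range(N)] for s in range(N)]

load = unbalanced_load(N=32, rho=1.0, w=0.5)
# Every input still offers exactly rho in total, for any w:
print(sum(load[0]))  # -> 1.0
```

Summing a row gives ρ(w + (1−w)/N) + (N−1)ρ(1−w)/N = ρ, so varying w redistributes the load between the "hot" output and the uniform background without changing the total offered load.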
The transfer time required for a packet from a VOQ to a crosspoint (CP) is proportional to the size of the packet; thus, a smaller packet requires less transfer time from a VOQ to a CP. Suppose the transfer of a small packet from port 1 to CB_1,1 and the transfer of a large packet from port 2 to CB_2,1 begin at the same time. The small packet arrives at CB_1,1 before the large packet arrives at CB_2,1; thus, the small packet will be transmitted to the output link before the large packet, provided the remaining CB_i,1 for all ports i are empty. The effect is demonstrated by the mean delay for small packets being smaller than that of the OQ switch at high offered load. The block transfer mechanism further adds a packet-size-based priority within each port: multiple small packets of a single block are eligible for transfer from a single VOQ in one RR polling round at an input port, giving smaller packets higher priority than larger packets within the input port. This explains why the block transfer mechanism achieves a lower mean delay for small packets than the output queued switch for all offered loads considered. Fig. 6 shows the results of experiment #3: our mechanism achieves higher throughput than the earlier ones. Under heavy load the block size is maximized and, as a result, header overhead is reduced.

6 Conclusions and Future Work

In this paper, a variable length block transfer mechanism is proposed to overcome the limitations of the fixed length block transfer mechanism. Packets arriving at the inputs are not segmented; as a result, padding bytes and internal speedup are not required.
Figure 4: Delay performance of the proposed mechanism and the fixed length block transfer mechanism under uniform traffic: (a) bimodal packet sizes; (b) uniform packet sizes (with output queued switching for comparison).

Figure 5: Results of the diagonal experiment: (a) small packets; (b) large packets (output queued switch, fixed length and variable length block transfer mechanisms).

Figure 6: Throughput under unbalanced traffic at 100% load, as a function of the unbalanced probability factor w.

In addition, it eliminates the reassembly circuits and buffers, which reduces the cost of the switch. The block size is maximized under heavy load, resulting in a reduction of header overhead, scheduler rate and power consumption.
The mean switch delay of the proposed mechanism is lower than that of the earlier one because segmentation and reassembly are absent. The memory requirement at each crosspoint is 25% lower than in the earlier mechanism, and the mechanism is feasible to implement. Through simulation, the proposed mechanism was compared with the earlier one and found to be superior in terms of switch throughput and packet mean delay. Our mechanism may produce unfairness in the service rate when different VOQs have different arrival rates; we intend to rectify this in future work. The unfairness problem can be overcome by maintaining a service counter for each VOQ and CB (VSC_i,j and CBSC_i,j). The initial value of these counters is zero, and the values change at scheduling time. During scheduling, the input port scheduler examines the service counter VSC_i,j. If VSC_i,j is greater than or equal to a threshold value, VOQ_i,j is not eligible in the current round and its counter is updated as VSC_i,j = VSC_i,j − threshold. Otherwise, VOQ_i,j is eligible in the current round and its counter is updated as VSC_i,j = VSC_i,j + VOQ_i,j[block size] − minimum block size.

Bibliography

[1] M. J. Karol, M. G. Hluchyj and S. P. Morgan, Input versus output queueing on a space-division packet switch, IEEE Transactions on Communications, Vol. 35, pp. 1347–1356, 1987.
[2] T. Anderson, S. Owicki, J. Saxe and C. Thacker, High speed switch scheduling for local area networks, ACM Transactions on Computer Systems, Vol. 11, pp. 319–352, 1993.
[3] N. McKeown, The iSLIP scheduling algorithm for input-queued switches, IEEE/ACM Transactions on Networking, Vol. 7, pp. 188–201, 1999.
[4] D. N. Serpanos and P. I. Antoniadis, FIRM: A class of distributed scheduling algorithms for high-speed ATM switches with multiple input queues, Proceedings of the IEEE INFOCOM, pp. 548–555, 2000.
[5] H. J. Chao and J. S.
Park, Centralized contention resolution schemes for a large-capacity optical ATM switch, Proceedings of the IEEE ATM Workshop, 1998.
[6] Y. Li, S. Panwar and H. J. Chao, On the performance of a dual round-robin switch, Proceedings of the IEEE INFOCOM, pp. 1688–1697, 2001.
[7] N. McKeown, V. Anantharam and J. Walrand, Achieving 100% throughput in an input-queued switch, Proceedings of the IEEE INFOCOM, pp. 296–302, 1996.
[8] A. Mekkittikul and N. McKeown, A starvation-free algorithm for achieving 100% throughput in an input-queued switch, Proceedings of the IEEE ICCCN, pp. 226–231, 1996.
[9] G. Keridis and N. McKeown, Output-buffer ATM packet switching for integrated services communication networks, Proceedings of the IEEE International Conference on Communications, Montreal, Canada, 1997.
[10] E. Rathgeb, T. Theimer and M. Huber, Buffering concepts for ATM switching networks, Proceedings of the IEEE GLOBECOM, pp. 1277–1281, 1988.
[11] A. K. Gupta, L. O. Barbosa and N. D. Georganas, 16×16 limited intermediate buffer switch module for ATM networks, Proceedings of the IEEE GLOBECOM, pp. 939–943, 1991.
[12] A. L. Gupta and N. D. Georganas, Analysis of a packet switch with input and output buffers and speed constraints, Proceedings of the IEEE INFOCOM, pp. 694–700, 1999.
[13] A. K. Gupta, L. O. Barbosa and N. D. Georganas, Limited intermediate buffer switch modules and their interconnection networks for B-ISDN, Proceedings of the IEEE International Conference on Communications, pp. 1646–1650, 1992.
[14] M. Lin and N. McKeown, The throughput of a buffered crossbar switch, IEEE Communications Letters, Vol. 9, pp. 465–467, 2005.
[15] M. Nabeshima, Performance evaluation of a combined input and crosspoint queued switch, IEICE Transactions on Communications, Vol. E83-B, pp. 737–741, 2000.
[16] T. Javidi, R. Magill and T. Hrabik, A high throughput scheduling algorithm for a buffered crossbar switch fabric, Proceedings of the IEEE International Conference on Communications, pp.
1586–1591, 2001.
[17] R. Rojas-Cessa, E. Oki, Z. Jing and H. J. Chao, CIXB-1: Combined input-one-cell-crosspoint-buffered switch, Proceedings of the IEEE Workshop on High Performance Switching and Routing, pp. 324–329, 2001.
[18] H. T. Kung and R. Morris, Credit-based flow control for ATM networks, IEEE Network Magazine, Vol. 9, pp. 40–48, 1995.
[19] K. Yoshigoe and K. J. Christensen, An evaluation of crossbar switches with virtual output queuing and buffered cross points, IEEE Network, Vol. 17, pp. 48–56, 2003.

Jaidhar C. D.
Department of Computer Science and Engineering
National Institute of Technology
Tiruchirappalli – 620 015, India
E-mail: csk0305@nitt.edu

A. V. Reddy
Department of Computer Applications
National Institute of Technology
Tiruchirappalli – 620 015, India
E-mail: reddy@nitt.edu

Received: March 30, 2007
Revised: May 19, 2007

International Journal of Computers, Communications & Control
Vol. II (2007), No. 2, pp. 174–184

Ant Systems & Local Search Optimization for Flexible Job Shop Scheduling Production

Noureddine Liouane, Ihsen Saad, Slim Hammadi, Pierre Borne

Abstract: The problem of efficiently scheduling production jobs on several machines is an important consideration when attempting to make effective use of a multi-machine system such as a flexible job shop scheduling production system (FJSP). In most of its practical formulations, the FJSP is known to be NP-hard [8][9], so exact solution methods are unfeasible for most problem instances, and heuristic approaches must therefore be employed to find good solutions within a reasonable search time. In this paper, two closely related approaches to the resolution of the flexible job shop scheduling production problem are described. These approaches combine the ant system optimisation meta-heuristic (AS) with local search methods, including tabu search. The efficiency of the developed method is compared with others.
Keywords: flexible production, ant colony, tabu search, job shop scheduling, makespan, optimisation.

1 Introduction

Modern hybrid heuristics are by their nature non-exhaustive, so there is often scope for different approaches to improve on previous solution methods in terms of execution speed or the quality of feasible solutions. Traditional approaches to solving the FJSP are as varied as the different formulations of the problem, and include fast, simple heuristics [2][12], tabu search [15], evolutionary approaches [5] and modern hybrid meta-heuristics that consolidate the advantages of several different approaches [1][13]. Ant colony optimisation (ACO) was described by Dorigo in his PhD thesis [6] and was inspired by the ability and organisation of real ant colonies, which use external chemical pheromone trails as a means of communication. Ant system algorithms have since been widely employed on NP-hard combinatorial optimisation problems, including problems related to continuous design spaces [4] and job shop scheduling [16]; however, they have not previously been applied to the FJSP described in what follows. Local search methods encompass many optimisation approaches, and their combination with an ant system approach has been shown to be efficient [7]. The approach described in this paper for the FJSP shows the quality of solutions found, using benchmark problems; its performance is evaluated and compared with the results obtained by other methods. In this paper, an application of ant system algorithms combined with the tabu search heuristic is proposed for solving the FJSP. The FJSP is described and formulated in Section 2. Then, in Section 3, the suggested approach combining ACO with tabu search is described. An illustrative example is given in Section 4. The last section is devoted to the presentation of some results and conclusions relating to this research work.
2 Problem Formulation

The FJSP may be formulated as follows. Consider a set of n independent jobs J = {J_1, J_2, ..., J_n}, 1 ≤ j ≤ n, which are carried out by K machines M = {M_1, M_2, ..., M_K}, 1 ≤ k ≤ K. Each job J_j consists of a sequence of n_j operations O_{i,j}, i = 1, 2, ..., n_j, and each routing has to be performed to achieve the job. The execution of each operation i of a job J_j requires one resource selected from a set of available machines; the assignment of the operation O_{i,j} to the machine M_k ∈ M entails the occupation of the latter during a processing time, noted p_{i,j,k}. The problem is thus to determine both an assignment scheme and a sequence of the operations on all machines that minimize some criteria.

• A set of n independent jobs is given.
• Each job is characterized by an earliest starting time r_j and a latest finishing time d_j.
• pt_{i,j} and r_{i,j} denote respectively the processing time and the ready date of the operation O_{i,j}; p_{i,j,k} represents the processing time pt_{i,j} on the machine M_k.
• A started operation cannot be interrupted.
• No machine can perform more than one operation at the same time.
• The objective is to find an operation ordering satisfying a cost function under the problem constraints; in this paper, the considered objective is to minimize the makespan C_max.

3 ACO and Tabu Search for FJSP Scheduling

This section describes the application of combined ant systems and tabu search techniques to the resolution of the FJSP.

3.1 Construction Graph and Constraints

Generally, the FJSP can be represented by a bipartite graph with two categories of nodes: O_{i,j} and M_k. A task is mapped to an O_{i,j} node; a machine is mapped to an M_k node.
There is an edge between the O_{i,j} node and the M_k node if and only if the corresponding task can be assigned to the corresponding machine while respecting the availability of the machine and the precedence constraints among the operations of the different jobs. The cost of an assignment is directly related to the processing time of the task on the machine. To model the process in a more straightforward manner, we use a construction graph derived from the utilization matrix. Below is a sample construction graph.

Table 1: Construction graph of 4 machines and 7 tasks.

       M1  M2  M3  M4
O1,1   10   7   6  13
O2,1    4   5   8  12
O3,1    9   5   6  12
O1,2   15  12   8   6
O2,2    9   5   7  13
O1,3    7  16   5  11
O2,3    9  16   8  11

With this construction graph, we can transform the FJSP into a travelling ant problem. Specifically, the representative table has n rows and m columns, and each of its cells is associated with p_{i,j,k}, representing the distance between O_{i,j} and M_k. An ant travels across the table in such a way that the following constraint is satisfied: one and only one cell is visited in each row. In the rest of this paper, "tour" and "solution" are used interchangeably; a pair (operation, machine) means that the operation is assigned to that machine, as in Table 2.

Table 2: A solution of the construction graph of Table 1.

       M1  M2  M3  M4
O1,1            6
O2,1    4
O3,1        5
O1,2                6
O2,2        5
O1,3            5
O2,3            8

3.2 Ant Systems Scheduling

The ant system approach was inspired by the behaviour of real ants. Ants deposit a chemical pheromone when they move in their environment, and they are also able to detect and follow pheromone trails. In our case, the pheromone trail describes how the ant system builds a solution of the FJSP. The probability of choosing a branch at a certain time depends on the total amount of pheromone on the branch, which in turn is proportional to the number of ants that have used the branch until that time.
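The construction graph of Table 1 can be written down directly as a processing-time matrix. As an illustration (ours, not the authors' code), letting a single "greedy ant" visit one cell per row, always taking the fastest machine, reproduces the cells of the sample solution in Table 2:

```python
# Processing times of Table 1 (rows: operations, columns: machines M1..M4).
COST = {
    "O1,1": [10, 7, 6, 13],
    "O2,1": [4, 5, 8, 12],
    "O3,1": [9, 5, 6, 12],
    "O1,2": [15, 12, 8, 6],
    "O2,2": [9, 5, 7, 13],
    "O1,3": [7, 16, 5, 11],
    "O2,3": [9, 16, 8, 11],
}

def greedy_tour(cost):
    """Assign each operation to the machine with minimum processing time
    (a plain heuristic baseline, not the ant tour itself)."""
    return {op: min(range(len(row)), key=row.__getitem__)
            for op, row in cost.items()}

tour = greedy_tour(COST)                 # operation -> machine index
chosen = {op: COST[op][k] for op, k in tour.items()}
print(chosen)
# -> {'O1,1': 6, 'O2,1': 4, 'O3,1': 5, 'O1,2': 6, 'O2,2': 5, 'O1,3': 5, 'O2,3': 8}
```

The real ants replace this deterministic minimum with the probabilistic choice of Equation (1) below, which is what allows exploration beyond the locally cheapest cell.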
Let p^f_{ijk} denote the probability that an ant f will assign an operation O_{i,j} of job J_j to an available machine M_k. Each ant builds a solution using a combination of the information provided by the pheromone trail τ_{ijk} and by the heuristic function defined by η_{ijk} = p_{i,j,k}. Formally, this probability is given in Equation 1:

p^f_{ijk} = \begin{cases} \dfrac{(\tau_{ijk})^{\alpha}\,(\eta_{ijk})^{-\beta}}{\sum_{l \in D} (\tau_{ijl})^{\alpha}\,(\eta_{ijl})^{-\beta}}, & \text{if } j \in D \\[4pt] 0, & \text{if } j \notin D \end{cases} \qquad (1)

In this equation, D denotes the set of available, non-executed operations, and α and β are parameters that control the relative importance of trail versus visibility; note that the negative exponent on η_{ijk} favours machines with short processing times. The transition probability is therefore a trade-off between visibility and trail intensity at the given time.

3.3 Updating the Pheromone Trail

To allow the ants to share information about good solutions, the pheromone trail must be updated. After each iteration of the ant system algorithm, Equation 2 describes the pheromone update used once all ants have completed their own scheduling solutions; L_ants denotes the length of an ant's tour. In order to guide the ant system towards good solutions, a mechanism is required to assess the quality of the best solution; the obvious choice is the best makespan L_min = C_max over all solutions given by the set of ants:

\Delta\tau^f_{ijk} = \begin{cases} \dfrac{L_{min}}{L_{ants}}, & \text{if } (i, j, k) \in L_{ants} \\[4pt] 0, & \text{otherwise} \end{cases} \qquad (2)

After all of the ants have completed their tours, the trail levels on all of the arcs need to be updated. The evaporation factor ρ ensures that pheromone does not accumulate infinitely and denotes the proportion of 'old' pheromone that is carried over to the next iteration of the algorithm.
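Equation (1) can be sketched as follows. This is our illustration, under the assumption that, for the operation being placed, tau and eta hold one value per candidate machine (so the sum over D runs over those machines), with η_{ijk} = p_{i,j,k}:

```python
import random

def transition_probs(tau, eta, alpha, beta):
    """Per-machine selection probabilities of Equation (1); the negative
    exponent -beta biases ants toward short processing times."""
    weights = [t ** alpha * e ** (-beta) for t, e in zip(tau, eta)]
    total = sum(weights)
    return [w / total for w in weights]

def pick_machine(tau, eta, alpha=0.5, beta=0.5, rng=random):
    """Sample one machine index according to those probabilities."""
    probs = transition_probs(tau, eta, alpha, beta)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

# O1,1's row of Table 1 with a uniform initial trail tau_0 = 0.01:
probs = transition_probs([0.01] * 4, [10, 7, 6, 13], alpha=0.5, beta=0.5)
print(probs.index(max(probs)))  # -> 2: the fastest machine, M3, is most likely
```

With a uniform trail, the choice is driven entirely by the visibility term; as pheromone accumulates on good tours, τ progressively reshapes these probabilities.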
Then, for each edge, the pheromone deposited by each ant that used the edge is added up, resulting in the following pheromone-level update equation:

\tau_{ijk} = \rho \cdot \tau_{ijk} + \sum_{f=1}^{nba} \Delta\tau^f_{ijk} \qquad (3)

where nba defines the number of ants in the colony.

3.4 Tabu Search Optimisation

A simple tabu search was also implemented for this FJSP optimisation problem. The proposed approach allows the ants to build their solutions as described in Section 3.2; the resulting solutions are then taken to a local optimum by the local search mechanism, and each of these ant solutions is then used in the pheromone update stage. Because the local search is performed on every ant solution at every iteration, it needs to be fairly fast. For the FJSP, the method is to pick the machine responsible for C_max and check whether any operations O_{i,j} could be swapped to other machines so as to lower the makespan. Following this concept, the local search considers one problem machine at a time and attempts to swap one operation from the problem machine to any other (non-problem) machine in the solution. The ants are thus used to generate promising scheduling production solutions, and the tabu search algorithm is used to try to improve these solutions. The tabu search is performed on each problem machine and continues until there is no further improvement in the makespan of the solution. An example of this algorithm is shown in Section 4.

3.5 The Set-up Parameter Values

The set-up parameter values used in the ant system scheduling algorithm are often very important in getting good results; however, the appropriate values are very often entirely problem dependent [7] and cannot always be derived from features of the problem itself:

• α determines the degree to which the pheromone trail is used as the ants build their solutions.
The lower the value, the less 'attention' the ants pay to the pheromone trail, while higher values mean the ants perform too little exploration; after testing values in the range 0.1–0.75, this algorithm works well with relatively high values (around 0.5–0.75).

• β determines the extent to which heuristic information is used by the ants. Again, values between 0.1 and 0.75 were tested, and a value around 0.5 appeared to offer the best trade-off between following the heuristic and allowing the ants to explore the search space.

• τ_0 is the value to which the pheromone trail values are initialized. Initially the value should be moderately high to encourage exploration, while the pheromone evaporation procedure gradually stabilises the pheromone trail.

• ρ is the pheromone evaporation parameter and is always set in the range 0 < ρ < 1. It defines how quickly the ants 'forget' past solutions; a higher value makes for a more aggressive search, and values around 0.5–0.75 were found to give good solutions.

• nba defines the number of ants in the colony. A low value speeds the algorithm up because less search is done; a high value slows the search down, as more ants run before each pheromone update is performed. A value of 10 appeared to be a good compromise between execution speed and the quality of the solution achieved.

It is interesting to note that for each parameter setting, the ant system scheduling meta-heuristic yields a good solution; moreover, its convergence speed depends essentially on the number of ants nba.

3.6 Building a Solution: Steps

The main steps of the FJSP strategy by ant systems and the tabu search algorithm are given below.

• Initialize the parameters nba, α, β, τ_0, ρ.
• Create an initial solution and an empty tabu list of a given size.
In order to generate feasible and diverse solutions, the initial ants are built from heuristic rules [12] (SPT, DL, FIFO, etc.) and a random method; these heuristics are used to approximate an optimal solution as closely as possible.
• Repeat the following steps until the termination criteria are met:
  – Find a new solution by the ant system scheduling procedure given in Section 3.2.
  – Evaluate the quality of the new solution.
  – If the new solution is an improvement, the current best solution becomes the new solution;
  – otherwise, apply the tabu search optimisation given in Section 3.4.
  – Add the solution to the tabu list; if the tabu list is full, delete its oldest entry.
  – Apply the pheromone trail update procedure given in Section 3.3.
• End repeat.

4 Illustrative Example

Let us consider a flexible job shop scheduling problem in which three jobs J_j (j = 1, 2, 3) are executed on six machines M_k (k = 1, ..., 6), as described in Table 3. Applying the ant system meta-heuristic, the simulation proposes four different schedules with C_max = 19 ut (units of time), shown in Tables 4 to 7. The solution given in Table 7 has a makespan of 19 ut, and machine M5 is the cause of this value. To improve it, the tabu search optimisation is applied to this solution. The method finds that the operation O_{2,2} of job J_2 on M2 can be swapped to another machine, which reduces the makespan to 18 ut: the operation O_{1,3} of job J_1 executed by M2 is swapped with M5, which then executes the operation O_{2,2} of job J_2. The solution obtained by the tabu search is thus better than before (Table 8).

Table 3: Example benchmark, 3 jobs and 6 machines: processing times and operation ordering.
         M1  M2  M3  M4  M5  M6
J1 O1,1  10   7   6  13   5   1
   O2,1   4   5   8  12   7  11
   O3,1   9   5   6  12   6  17
   O4,1   7   8   4  10  15   3
J2 O1,2  15  12   8   6  10   9
   O2,2   9   5   7  13   4   7
   O3,2  14  13  14  20   8  17
J3 O1,3   7  16   5  11  17   9
   O2,3   9  16   8  11   6   3
   O3,3   6  14   8  18  21  14

Table 4: Solution 1 for the benchmark (nba = 10; α = 0.5; β = 0.5; τ_0 = 0.01; ρ = 0.5).

S1      O1          O2          O3           O4
J1  M6: [0,1]   M1: [1,5]   M2: [11,16]  M6: [16,19]
J2  M4: [0,6]   M2: [6,11]  M5: [11,19]  ***
J3  M3: [0,5]   M3: [5,13]  M1: [13,19]  ***

Table 5: Solution 2 for the benchmark (nba = 10; α = 0.75; β = 0.25; τ_0 = 0.01; ρ = 0.5).

S2      O1          O2          O3           O4
J1  M6: [0,1]   M1: [1,5]   M2: [5,10]   M6: [10,13]
J2  M4: [0,6]   M5: [6,10]  M5: [10,18]  ***
J3  M3: [0,5]   M3: [5,13]  M1: [13,19]  ***

Table 6: Solution 3 for the benchmark (nba = 10; α = 0.25; β = 0.75; τ_0 = 0.01; ρ = 0.5).

S3      O1          O2          O3           O4
J1  M6: [0,1]   M1: [1,5]   M5: [5,11]   M6: [11,14]
J2  M4: [0,6]   M2: [6,11]  M5: [11,19]  ***
J3  M3: [0,5]   M6: [5,8]   M1: [8,14]   ***

Table 7: Solution 4 for the benchmark (nba = 10; α = 0.3; β = 0.7; τ_0 = 0.01; ρ = 0.5).

S4      O1          O2          O3           O4
J1  M6: [0,1]   M1: [1,5]   M5: [5,11]   M6: [10,13]
J2  M4: [0,6]   M2: [6,11]  M5: [11,19]  ***
J3  M3: [0,5]   M6: [5,8]   M1: [8,14]   ***

Table 8: Tabu search optimisation solution.

        O1          O2          O3           O4
J1  M6: [0,1]   M1: [1,5]   M2: [5,10]   M6: [10,13]
J2  M4: [0,6]   M5: [6,10]  M5: [11,18]  ***
J3  M3: [0,5]   M6: [5,8]   M1: [8,14]   ***

Table 9: Results on the benchmark problem sets.
Problem sets from [11]:

Problem set   Kacem et al.   GENACE   Ant systems + tabu search
FJSP 4-5          16            11            11
FJSP 10-7         15            12            11
FJSP 10-10         7             7             7
FJSP 15-10        23            12            12

Problem sets from [3]:

FJSP 10-6         32            29            28
FJSP 10-15        86            68            68

Table 10: Result example, FJSP 10-10 from [11] (nba = 10; α = 0.1; β = 0.9; τ_0 = 0.1; ρ = 0.25).

        O1           O2           O3
J1   M7: [0,2]    M3: [2,3]    M4: [3,4]
J2   M1: [1,3]    M10: [3,4]   M10: [4,6]
J3   M10: [0,1]   M8: [1,2]    M8: [2,4]
J4   M9: [0,1]    M3: [3,6]    M4: [6,7]
J5   M9: [1,3]    M9: [3,4]    M4: [4,5]
J6   M6: [1,3]    M9: [4,6]    M9: [6,7]
J7   M1: [0,1]    M3: [1,2]    M4: [2,3]
J8   M5: [0,2]    M2: [2,5]    M2: [5,7]
J9   M3: [0,1]    M7: [2,3]    M6: [3,4]
J10  M6: [0,1]    M4: [1,2]    M7: [3,5]

Table 11: Result example, FJSP 10-6 from [3] (nba = 10; α = 0.25; β = 0.75; τ_0 = 0.2; ρ = 0.5).

        O1           O2           O3           O4           O5           O6
J1   M6: [0,1]    M6: [4,9]    M1: [9,10]   M2: [15,21]  M6: [23,26]  M2: [27,28]
J2   M4: [1,3]    M6: [9,12]   M5: [13,15]  M3: [16,19]  M3: [19,21]  M1: [21,24]
J3   M1: [2,3]    M1: [5,7]    M2: [14,15]  M5: [15,17]  M4: [21,22]  M5: [24,27]
J4   M5: [0,2]    M6: [3,4]    M2: [8,14]   M1: [14,15]  M4: [15,20]  M4: [20,21]
J5   M1: [0,2]    M2: [2,5]    M4: [5,11]   M1: [11,13]  M4: [13,15]  M3: [24,27]
J6   M5: [2,5]    M3: [8,12]   M6: [12,14]  M1: [15,17]  M2: [21,27]  M6: [27,28]
J7   M4: [0,1]    M5: [5,7]    M1: [7,9]    M5: [10,13]  M6: [20,23]  M4: [24,27]
J8   M3: [1,4]    M2: [5,8]    M1: [10,12]  M5: [17,20]  M5: [20,24]  M1: [24,27]
J9   M3: [4,8]    M3: [12,16]  M6: [16,17]  M1: [17,21]  M3: [21,24]  M4: [27,28]
J10  M6: [1,3]    M1: [3,5]    M5: [7,10]   M4: [11,12]  M6: [17,20]  M4: [22,24]

5 Validation and Comparison

All ant systems and tabu search optimisation results presented are for 1000 iterations with 10 ants, and each run was performed 10 times. The algorithms were coded in VB and tested on a P4 Pentium 2.4 GHz processor under Windows XP.
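The makespan of a schedule such as the one in Table 8 can be checked mechanically. In this sketch (ours, not the authors' VB code) each operation is stored as a (machine, start, end) triple, and C_max is simply the latest completion time:

```python
# The tabu-improved schedule of Table 8, one list of operations per job.
TABLE_8 = {
    "J1": [("M6", 0, 1), ("M1", 1, 5), ("M2", 5, 10), ("M6", 10, 13)],
    "J2": [("M4", 0, 6), ("M5", 6, 10), ("M5", 11, 18)],
    "J3": [("M3", 0, 5), ("M6", 5, 8), ("M1", 8, 14)],
}

def makespan(schedule):
    """C_max = the maximum end time over all operations of all jobs."""
    return max(end for ops in schedule.values() for _, _, end in ops)

print(makespan(TABLE_8))  # -> 18, the improved C_max reported in Section 4
```

The same one-liner applied to the schedules of Tables 4 to 7 returns 19, confirming the one-unit improvement obtained by the tabu search.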
To illustrate the effectiveness and performance of the algorithm proposed in this paper, six representative benchmark FJSP instances (denoted "problem n × m") based on practical data were selected. These benchmark instances are all taken from Brandimarte [3] and from Kacem [11], together with those used by GENACE [14]. The results obtained by the proposed approach are presented and compared with the other methods in Table 9. For these FJSP instances, the results show that the solutions obtained are generally acceptable and satisfactory, and the values of the objective functions show the efficiency of the suggested approach (Table 9). Moreover, the proposed method obtains good results in polynomial computation time. The efficiency of this approach can be explained by the quality of the ant system algorithms combined with the tabu search heuristic for the optimisation of solutions.

6 Conclusion

In this paper, a new approach based on the combination of the ant system with a tabu search algorithm for solving flexible job shop scheduling problems is presented. The results for the reformulated problems show that the ant system with local search meta-heuristic can find optimal solutions for different problems and can be adapted to deal with the FJSP. The performance of the new approach is evaluated and compared with the results obtained from other methods, and the obtained results show the effectiveness of the proposed method. The ant system algorithms and the tabu search techniques described are very effective and can outperform all the alternative techniques.

References

[1] A. C. F. Alvim, F. Glover, C. C. Ribeiro and D. J. Aloise, A hybrid improvement heuristic for the bin packing problem, 2002. Available from: http://citeseer.nj.nec.com/557429.html.
[2] T. D. Braun, H. J. Siegel, N. Beck, L. L. Bölöni, M. Maheswaran, A. I. Reuther, J. P. Robertson, M. D. Theys, B. Yao, D. Hensgen and R. F.
Freund, A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems, Journal of Parallel and Distributed Computing, 61(6):810–837, 2001.
[3] P. Brandimarte, Routing and scheduling in a flexible job-shop by tabu search, Annals of Operations Research, Vol. 2, pp. 158–183, 1993.
[4] G. Bilchev and I. C. Parmee, The ant colony metaphor for searching continuous design spaces, Lecture Notes in Computer Science, Vol. 993, pp. 25–39, 1995.
[5] A. L. Corcoran and R. L. Wainwright, A parallel island model genetic algorithm for the multiprocessor scheduling problem, in Selected Areas in Cryptography, pp. 483–487, 1994.
[6] M. Dorigo, Optimization, Learning and Natural Algorithms, PhD thesis, DEI, Politecnico di Milano, Milan, 1992.
[7] M. Dorigo and T. Stützle, The ant colony optimization metaheuristic: algorithms, applications, and advances, in F. Glover and G. Kochenberger (eds.), Handbook of Metaheuristics, Vol. 57 of International Series in Operations Research and Management Science, pp. 251–285, Kluwer Academic Publishers, 2002.
[8] M. Garey, D. Johnson and R. Sethi, The complexity of flow shop and job-shop schedules, Mathematics of Operations Research, Vol. 1(2), pp. 117–129, 1976.
[9] M. Garey and D. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, Freeman and Company, San Francisco, 1979.
[10] I. Kacem, S. Hammadi and P. Borne, Approach by localization and multiobjective evolutionary optimization for flexible job-shop scheduling problems, IEEE Transactions on Systems, Man and Cybernetics, Vol. 32(1), pp. 1–13, 2002.
[11] I. Kacem, S. Hammadi and P. Borne, Pareto-optimality approach for flexible job-shop scheduling problems: hybridization of evolutionary algorithms and fuzzy logic, Mathematics and Computers in Simulation, Vol. 60, pp. 245–276, 2002.
[12] N. Liouane, S. Hammadi and P. Borne,
robust methodology for scheduling production in uncertain environment. imacs multi-conference on computational engineering in systems applications, cesa’98, hammamet, 1998. [13] k. mesghouni. application des algorithmes évolutionnistes dans les problèmes d’optimisation en ordonnancement de production. thèse de doctorat, université de lille1, lille, 1998. [14] j. c. tay and n. b. ho. genace: an efficient cultural algorithm for solving the flexible jobshop problem. proceedings of the ieee congress of evolutionary computation, pp. 1759-1766, 2004. [15] a. thiesen,. design and evaluation of tabu search algorithms for multiprocessor scheduling. journal of heuristics, vol. 4, pp.141-160, 1998. [16] s. van der zwaan and c. marques. ant colony optimisation for job shop scheduling. in proceedings of the third workshop on genetic algorithms and artificial life, gaal 99,1999. noureddine liouane1, ihsen saad2,3, slim hammadi2, pierre borne2 1atsi : ecole nationale d’ingénieurs de monastir, rue ibn el jazzar, 5019 monastir, tunisie ant systems & local search optimization for flexible job shop scheduling production 183 2ecole centrale de lille, cité scientifique laboratoire d’automatique, genie informatique et signal bp 48, 59651 villeneuve d’ascq cedex, france 3ecole nationale d’ingénieurs de tunis unité de recherche lara-automatique bp 37, le belvédère, 1002 tunis, tunisie e-mail: noureddine.liouane@enim.rnu.tn, ihsen.saad@enit.rnu.tn, slim.hammadi@ec-lille.fr, pierre.borne@ec-lille.fr noureddine liouane was born in kairouan, tunisia, in 1963. he received the master degree of science electrical genius in 1988 from the “ecole normale supérieur de l’enseignement technique de tunis”. he received the ph.d. degree from ecole centrale de lille, france, in 1998. he is currently “maitre assistant” in the “ecole nationale d’ingénieurs de monastir” and director of the “institut supérieure des sciences appliquées et de la technologie de kairouan”. 
His research is related to evolutionary optimization methods for discrete event systems, computer science and operational research.

Ihsen Saad was born in Monastir, Tunisia, in 1977. He received the engineer diploma degree in electrical automatic control engineering from the Ecole Nationale d'Ingénieurs de Gabès, Tunisia, in 2002. He obtained the master degree in automatic control and signal processing in 2004 at the Ecole Nationale d'Ingénieurs de Tunis. He is currently preparing the Ph.D. degree in automatic control and computer science within the framework of the LAGIS-EC-Lille and LARA-ENIT cooperation. His research is related to evolutionary optimization methods for discrete event systems, computer science and operational research.

Slim Hammadi is a full professor of production planning and control at the Ecole Centrale de Lille (French "Grande Ecole"). Born in Gafsa (Tunisia) in 1962, he obtained in 1988 the master degree in computer science from the University of Lille (France). Pr. Hammadi obtained a Ph.D. degree in job-shop scheduling and control in 1991 at Ecole Centrale de Lille. He is a senior member of IEEE/SMC and has served as a referee for numerous journals including the IEEE Transactions on SMC. Pr. S. Hammadi was co-organizer of a symposium (IMS) of the IMACS/IEEE SMC multiconference CESA'98 held in Hammamet (Tunisia) in April 1998. He has organized several invited sessions in different SMC conferences where he was session chairman. He was chairman of the international congresses on "Logistics and Transport" LT'2004, MHOSI'2005 and LT'2006. His teaching and research interests focus on the areas of production control, production planning, computer science, discrete and dynamic programming and computer integrated manufacturing.

Pierre Borne was born in Corbeil, France, in 1944. He received the master degree of physics in 1967, and the masters of electronics, of mechanics and of applied mathematics in 1968.
The same year he obtained the diploma of "Ingénieur IDN" (French "Grande Ecole"). He obtained the Ph.D. in automatic control from the University of Lille in 1970 and the DSc from the same university in 1976. He became doctor honoris causa of the Moscow Institute of Electronics and Mathematics (Russia) in 1999 and of the University of Waterloo (Canada) in 2006. He is author or coauthor of about 200 journal articles and book chapters, 30 plenary lectures and more than 250 communications in international conferences. He has been the supervisor of 64 Ph.D. theses and is author of 20 books. He was president of the IEEE/SMC Society in 2000 and 2001. He is presently professor "de classe exceptionnelle" at the Ecole Centrale de Lille and director of the French pluriformations national group of research in automatic control.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 4, pp. 101-109

Computation on the Optimal Control of Networked Control Systems with Multiple Switching Modes over High Speed Local Area Networks

Gang Zheng, Wenli Zeng, Fanjiang Xu

Abstract: The optimal control problem for networked control systems with multiple switching modes over high speed local area networks is addressed, where the initial state is a parametric vector. Because in the general case the time delay is much less than the sampling period and the possibility of packet collision is much lower, it can be assumed that the influence of the time delay and of packet loss on the optimal controller design can be ignored. On the basis of this assumption, the networked control systems with multiple switching modes are modeled as a hybrid system.
Moreover, based on a Bellman-type inequality for hybrid systems, a dynamic program to solve the optimal control problem with a parameter vector is proposed; in every step of the technique, the feasible region is divided into evenly distributed grid points, and the optimal control law is then obtained by maximizing the lower bound of the cost-to-go function over the grid points. Finally, an experiment setup of the networked control system with multiple switching modes is constructed and a simulation example is given to illustrate the optimal control computation results.

Keywords: networked control systems, hybrid systems, optimal control, dynamic program, multiparametric program

1 Introduction

Networked control systems (NCS) are feedback control systems wherein the control loops are closed through a real-time network [1]. The defining feature of an NCS is that information (reference input, plant output, control input, etc.) is exchanged using a network among control system components (sensors, controller, actuators, etc.). The use of a communication network offers advantages in terms of reliability, enhanced resource utilization, reduced wiring, and reconfigurability. Networked control systems typically involve switching between several different modes, depending on the range of operation. In order to improve the performance of the system and save operation cost, it is important to design an optimal control law for systems with different switching modes. At present, methodologies for designing optimal controllers for such networked control systems are few. [2] assumes that the controller-plant communication in an NCS is periodic, formulates the optimal controller design problem as one for sampled-data feedback systems with periodic discrete-time components, gives a necessary and sufficient condition for the existence of a discrete-time periodic controller in terms of LMIs, and derives a controller construction algorithm.
[3] discusses the optimal H∞ control problem for networked systems with a limited communication constraint, which is formulated as a periodic control problem, and proposes a heuristic search approach in conjunction with convex optimization of the controller to solve the optimization problem. The switching process in [2] and [3] is time-controlled. However, in practice the switching may be completely determined by the system state. For such networked control systems, the whole state space is divided into cells, each of which corresponds to a particular control mode, so that each cell has its own continuous dynamics associated with it. This type of system is state-controlled and can be regarded as a hybrid system. Hybrid systems have attracted considerable attention in recent years [4]. Also, in practice the initial state of a networked control system may not be determined, and for those with multiple switching modes the unpredicted initial states may make the optimization process more complex. This paper models the class of networked control systems with multiple switching modes as a hybrid system; based on the model, a general cost function is defined and the optimal control problem for this cost function is addressed, where the initial state is a parametric vector and is not predicted. Furthermore, a solution for the optimal feedback control law is proposed, which combines dynamic programming and multiparametric programming. Finally, an example is given to illustrate the optimal computation results.

Copyright © 2006 by CCC Publications

2 Problem Formulation

2.1 Modeling networked control systems with multiple switching modes

Consider the class of networked control systems shown in Figure 1. The plant, the sensors, the controller and the actuators are spatially distributed and connected together through a control network.
In general, there are multiple control modes in the controller of this class of systems, and every mode corresponds to a range of the continuous state of the physical plant. We assume that the control network is a high speed local area network, e.g. Ethernet, and that the transmitting data policy is a scheduled releasing policy; in practice the time delay is then much less than the sampling period and the possibility of message collision is much lower. Therefore, in this paper it is reasonable to ignore the influence of the time delay and of packet loss on the optimal controller design in networked control systems with multiple switching modes.

Figure 1: The networked control system with multiple control modes

On the basis of this assumption, the class of networked control systems with multiple modes can be modeled as a hybrid system. Hybrid systems contain continuous dynamics and discrete events [4]. The controller in a networked control system can calculate the state of the plant and determine the control mode, and the switching between different modes corresponds to the occurrence of a discrete event in a hybrid system. We assume that clock-driven sensors sample the plant outputs periodically at the sampling instants; then the networked control system can be modeled as a hybrid system as follows:

x(k+1) = f_{q(k)}(x(k), u(k)),  q(k+1) = v(x(k), q(k))    (1)

where the continuous state variable x ∈ X ⊆ R^n, the continuous input variable u ∈ U ⊆ R^m, k denotes the k-th sampling period, k = 0, 1, 2, ..., and q ∈ Q, where Q is a finite countable set whose elements correspond to the modes of the networked control system. f_q : X × U → X denotes the continuous dynamics in discrete mode q, and v : X × Q → Q denotes the discrete dynamics, i.e. the transition process between operation modes. Without loss of generality, we assume that the transition between two modes occurs at a sampling time instant.
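As a concrete reading of model (1), the following sketch steps a two-mode hybrid system one sampling period at a time. The particular dynamics f_q and the sign-based mode map v are illustrative assumptions made here, not taken from the paper.

```python
import numpy as np

def f(q, x, u):
    """Continuous dynamics f_q for each discrete mode q (hypothetical choices)."""
    if q == 0:
        return 0.9 * x + np.array([0.0, 1.0]) * u
    return 0.5 * x + np.array([0.0, 1.0]) * u

def v(x, q):
    """Discrete dynamics: here the next mode depends only on the state region."""
    return 0 if x[0] < 0 else 1

def step(x, q, u):
    """One sampling period of (1); mode transitions occur at sample instants."""
    return f(q, x, u), v(x, q)

x, q = np.array([-1.0, 1.0]), 0
for k in range(3):
    x, q = step(x, q, u=0.1)
```

Because v is evaluated on the current state, a trajectory that stays in the region x_1 < 0 never leaves mode 0, which is exactly the state-controlled switching described above.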
The hybrid system model (1) of the networked control system with multiple modes can also be described as a hybrid automaton [5]. In addition, if the continuous dynamics can be described by linear time-invariant systems, the model can be transformed into a piecewise affine system [6], where different modes correspond to different ranges of the state variable x.

2.2 Optimal control formulation

Based on the model (1), the optimal control formulation over a finite time horizon is discussed as follows. Firstly, define the cost function of the networked control system

J(U_N, x(0)) = F_{q(N)}(x_N) + Σ_{k=0}^{N−1} [ ℓ_{q(k)}(x_k, u_k) + g(x_k, q(k), q(k+1)) ]    (2)

where N is the time horizon, starting from the state x_0 = x(0); x_k denotes the continuous state vector at time k and q(k) denotes the discrete state (operation mode) at time k; the column vector U_N ≜ [u_0^T, ..., u_{N−1}^T]^T ∈ R^{mN}; F_{q(N)}(x_N) > 0 is the cost at the end time; ℓ_{q(k)}(x_k, u_k) > 0 is the cost for continuous evolution in an operation mode; and g(x_k, q(k), q(k+1)) ≥ 0 is the cost for switching from one mode to another, with g(x_k, q(k), q(k+1)) = 0 if there is no switching, i.e. q(k) = q(k+1). Note that we do not consider the problem of infinitely many jumps at transition time instants. Also, it should be noted that q(0) does not appear on the left-hand side of the cost function (2): because the operation modes correspond to ranges of the state variable, if the initial condition x(0) is given then the discrete mode q(0) is fixed, too. The optimal control problem of the networked control system is

J*(x(0)) ≜ min_{U_N} J(U_N, x(0))    (3)

s.t.  x_{k+1} = f_{q(k)}(x_k, u_k),  q(k+1) = v(x_k, q(k)),  x_N ∈ X_f,  x_0 = x(0)    (4)

where X_f is the terminal region; in addition, X_j denotes the set of states x_j at time j for which (3) is feasible.
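To make the bookkeeping in a cost function of the form (2) concrete, the sketch below evaluates the cost of a candidate input sequence by simulating the hybrid model and charging a constant switching penalty whenever the mode changes. The dynamics, stage costs and penalty value are all illustrative assumptions, not the paper's data.

```python
import numpy as np

def f(q, x, u):            # hypothetical mode dynamics
    a = [0.9, 0.5][q]
    return a * x + np.array([0.0, 1.0]) * u

def v(x, q):               # state-dependent mode transition
    return 0 if x[0] < 0 else 1

def cost(U, x0, q0, F, ell, g_switch):
    """J = F(x_N) + sum_k [ stage cost + switching cost ], as in (2)."""
    x, q, J = x0, q0, 0.0
    for u in U:
        x_next, q_next = f(q, x, u), v(x, q)
        J += ell(q, x, u) + (g_switch if q_next != q else 0.0)
        x, q = x_next, q_next
    return J + F(q, x)     # after the loop, q is the terminal mode q(N)

U = [0.1, -0.2, 0.0]
J = cost(U, np.array([-1.0, 1.0]), 0,
         F=lambda q, x: float(x @ x),
         ell=lambda q, x, u: float(x @ x + u * u),
         g_switch=5.0)
```

With this trajectory the mode never changes, so the switching term contributes nothing and J is purely the quadratic running and terminal costs.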
In general, the optimal control problem (3)-(4) may not have a minimum for some feasible x(0). This is caused by the discontinuity of the hybrid system in the input space. We will consider problem (3)-(4) assuming that a minimizer U*_N(x(0)) exists for all feasible x(0). Note that in this paper we distinguish between the current state x(k) of the system (1) at time k and the variable x_k in the optimization problem (3)-(4), which is the predicted state of the system (1) at time k obtained by starting from the state x_0 = x(0) and applying the input sequence u_0, ..., u_{k−1} to system (1). Analogously, u(k) is the input applied to system (1) at time k, while u_k is the k-th optimization variable of the optimal control problem (3)-(4). In the optimal control problem (3)-(4), the initial state x(0) is a vector of parameters; the goal in this paper is to solve (3)-(4) for all values of x(0) of interest, and to make this dependence explicit. Therefore, the optimal control problem of the networked control system with multiple modes is a multiparametric program problem.

3 Solution to the Optimal Control of the Networked Control Systems

For networked control systems with multiple modes, the optimal control problem can be viewed as a parametric programming problem. Parametric programming is one of the important optimization techniques in mathematical programming, in which the optimality is affected by uncertainty of the parameters in the cost function or in the constraint conditions. Because the parameters are either unknown or will be decided later, parametric programming needs to subdivide the space of parameters into characteristic regions, which depict the feasibility and corresponding performance as a function of the uncertain parameters, and hence provide the decision maker with a complete map of the various outcomes [6].
In the networked control system, different operation modes correspond to different ranges of the state variable, and the switching sequence between modes is related to the initial condition of the system, which is not fixed. If the switching sequence is fixed, the optimal control problem is equivalent to a finite-time optimal control problem for a hybrid system, which can be solved by multiparametric programming; in this case, the optimal solution is an optimal control candidate for the networked control system. If all the switching sequences of the networked control system are given, the solution to the optimal control problem can be obtained by comparing the values of the cost function corresponding to the optimal control candidates of the different switching sequences. However, as the number of control modes increases, the computing process gets more complex and the computing efficiency gets much lower, because we must enumerate all possible switching sequences of the networked control system, which resembles the famous traveling salesman problem (TSP) in optimization theory. In order to decrease the computational complexity and improve the computational efficiency, a dynamic programming technique is proposed to solve the optimal control problem of the networked control system with multiple control modes. The dynamic programming solution of the optimal control problem is written as follows:

J*_n(x(n)) ≜ min_{u_n} { [ ℓ_{q(n)}(x(n), u(n)) + g(x(n), q(n), q(n+1)) ] + J*_{n+1}(x(n+1)) }    (5)

s.t.  x(n+1) ∈ X_{n+1},  n = N−1, ..., 0    (6)

with terminal conditions:

X_N = X_f    (7)

J*_N(x(N)) = F_{q(N)}(x_N)    (8)

where X_j is the set of all initial states for which the dynamic program (5)-(8) is feasible, X_j = {x ∈ R^n | ∃u, f_{q(j)}(x, u) ∈ X_{j+1}}. The first step of the dynamic program is as follows:

J*_{N−1}(x(N−1)) ≜ min_{u_{N−1}} { [ ℓ_{q(N−1)}(x(N−1), u(N−1)) + g(x(N−1), q(N−1), q(N)) ] + J*_N(x(N)) }    (9)

s.t.  f_{q(N−1)}(x(N−1), u_{N−1}) ∈ X_f,  J*_N(x(N)) = F_{q(N)}(x_N) = F_{q(N)}(f_{q(N−1)}(x(N−1), u_{N−1}))    (10)

It is clear that the optimal control problem (9)-(10) is a multiparametric program problem, where x(N−1) is a parametric vector. In terms of the terminal region X_f and the system equation (1), the feasible region X_{N−1} at time N−1 is obtained. [7] discusses the discretized computation of the optimal control of hybrid systems, where the initial state and the terminal state are determined and the key point is to search for the maximum lower bound of the value function. The parameter vector increases the complexity of computing the optimal feedback control law. Based on the algorithm in [7], the value function V_q(x) is introduced: when x = x_0, q = q_0, V_{q_0}(x_0) is a lower bound on the cost for optimally bringing the system from the determined initial state (x_0, q_0) to the terminal state (x_{tf}, q_{tf}). It should be noted that, without loss of generality, we discuss the case where the continuous state space is two-dimensional.

Let e_1 = [1 0]^T, e_2 = [0 1]^T. For all x ∈ X_{N−1}, let x^{jk}_{N−1} = x_{N−1} + j h e_1 + k h e_2, with x^{jk}_{N−1} ∈ X_{N−1} and j, k ∈ Z, and define

Y^{jk}_{N−1} ≜ { x : x = x^{jk}_{N−1} + θ_1 h e_1 + θ_2 h e_2, −1 ≤ θ_1 ≤ 1, −1 ≤ θ_2 ≤ 1 },

(f^{jk}_{q(N−1)})_i = min_{x ∈ Y^{jk}_{N−1}, u ∈ U} { e_i^T f_{q(N−1)}(x, u) },  i = 1, 2,

ℓ^{jk}_{q(N−1)} = min_{x ∈ Y^{jk}_{N−1}, u ∈ U} { ℓ_{q(N−1)}(x, u) },

V^{jk}_{q(N−1)} = V_{q(N−1)}(x^{jk}_{N−1}),

Δ_i V^{jk}_{q(N−1)} = ( V_{q(N−1)}(x^{jk}_{N−1} + h e_i) − V_{q(N−1)}(x^{jk}_{N−1}) ) / h,

Δ_{−i} V^{jk}_{q(N−1)} = ( V_{q(N−1)}(x^{jk}_{N−1}) − V_{q(N−1)}(x^{jk}_{N−1} − h e_i) ) / h,  i = 1, 2.
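The grid construction can be sketched directly: evenly spaced points x^{jk} over a box-shaped feasible region, and finite differences approximating Δ_i V. The box region, the mesh size h and the value function used below are illustrative assumptions.

```python
import numpy as np

h = 0.5

def grid_points(xlo, xhi, h):
    # Evenly distributed grid points x^{jk} = xlo + j*h*e1 + k*h*e2 inside a box.
    xs = np.arange(xlo[0], xhi[0] + 1e-9, h)
    ys = np.arange(xlo[1], xhi[1] + 1e-9, h)
    return [np.array([a, b]) for a in xs for b in ys]

def delta(V, x, i, h):
    # Forward difference Delta_i V = (V(x + h e_i) - V(x)) / h;
    # passing -h gives the backward difference Delta_{-i} V.
    e = np.eye(2)[i]
    return (V(x + h * e) - V(x)) / h

pts = grid_points(np.array([-1.0, -1.0]), np.array([1.0, 1.0]), h)
d1 = delta(lambda x: float(x @ x), np.array([0.5, 0.0]), 0, h)
```

For V(x) = x^T x at x = (0.5, 0) the forward difference along e_1 is (1 − 0.25)/0.5 = 1.5, which is the kind of slope bound the Δ_i V quantities feed into the inequalities of the next step.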
Introduce new vector variables λ^{jk}_{q(N−1)} ∈ R^2 for each (j, k) at time N−1, and construct the following inequalities:

0 ≤ (λ^{jk}_{q(N−1)})_1 + (λ^{jk}_{q(N−1)})_2 + ℓ^{jk}_{q(N−1)}

(λ^{jk}_{q(N−1)})_{|i|} ≤ (f^{jk}_{q(N−1)})_{|i|} Δ_i V^{jk}_{q(N−1)},  i = −2, −1, 1, 2

0 ≤ V^{jk}_{q(N)} − V^{jk}_{q(N−1)} + g(x^{jk}_{N−1}, q(N−1), q(N)),  x^{jk}_{N−1} ∈ S_{q(N−1),q(N)}

0 ≥ V_{q(N)}(x_N),  x_N ∈ X_f    (11)

These are Bellman-type inequalities. If there exists V^{jk}_{q(N−1)} such that the inequalities (11) hold, it can be proved that V^{jk}_{q(N−1)} is a lower bound of the cost-to-go function J_{N−1}(x_{N−1}); the proof is given in [7]. However, any function that meets the constraints is a lower bound on the cost-to-go function; thus, to yield useful bounds, it is necessary to find the maximum over all grid points in the feasible region X_{N−1}:

J*(x(N−1)) ≜ max_{x^{jk}_{N−1} ∈ X_{N−1}, j,k ∈ Z} { V^{jk}_{q(N−1)} }    (12)

Assume that

V^{j_0 k_0}_{q(N−1)} = max_{x^{jk}_{N−1} ∈ X_{N−1}, j,k ∈ Z} { V^{jk}_{q(N−1)} }    (13)

When j = j_0, k = k_0, for x = x^{j_0 k_0}_{N−1} + θ_1 h e_1 + θ_2 h e_2 ∈ Y^{j_0 k_0}_{N−1}, define the interpolating function:

V_{q(N−1)}(x) = (1−θ_1)(1−θ_2) V^{j_0 k_0}_{q(N−1)} + θ_1 (1−θ_2) V^{(j_0+1) k_0}_{q(N−1)} + (1−θ_1) θ_2 V^{j_0 (k_0+1)}_{q(N−1)} + θ_1 θ_2 V^{(j_0+1)(k_0+1)}_{q(N−1)}    (14)

From (14), the optimal feedback control law can be calculated as

u_{N−1} = arg min_{u ∈ U} { (∂V_{q(N−1)}/∂x) f_{q(N−1)}(x, u) + ℓ_{q(N−1)}(x, u) }    (15)

This completes the optimal control law at the first step.
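The interpolation (14) and the pointwise minimization in (15) admit a short sketch. Here the gradient of V, the dynamics f, the stage cost ℓ and the input grid are illustrative stand-ins, and the continuous argmin over U is replaced by a search over a finite set of candidate inputs.

```python
import numpy as np

def interp_cell(v00, v10, v01, v11, t1, t2):
    # Bilinear interpolation (14) over one grid cell, 0 <= t1, t2 <= 1:
    # corners (j0,k0), (j0+1,k0), (j0,k0+1), (j0+1,k0+1).
    return ((1 - t1) * (1 - t2) * v00 + t1 * (1 - t2) * v10
            + (1 - t1) * t2 * v01 + t1 * t2 * v11)

def feedback(gradV, f, ell, x, U):
    # Discrete-search version of (15): u = argmin_u gradV . f(x,u) + ell(x,u).
    return min(U, key=lambda u: float(gradV @ f(x, u) + ell(x, u)))

u_star = feedback(np.array([1.0, 0.0]),
                  f=lambda x, u: np.array([x[1], u]),   # toy double-integrator-like dynamics
                  ell=lambda x, u: u * u,               # toy stage cost
                  x=np.array([0.0, -0.5]),
                  U=np.linspace(-1.0, 1.0, 21))
```

For this toy choice the drift term is independent of u, so the minimization reduces to minimizing u², and the selected input is (numerically) zero.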
From the second step n = N−2 to the last one n = 0, the cost-to-go function is defined on the feasible region X_{n+1} and we solve problem (5), which is still a multiparametric program problem. The computing process is analogous to the first step and proceeds as follows:

(i) Divide the feasible region X_n into evenly distributed grid points, and introduce x^{jk}_n, Y^{jk}_n, (f^{jk}_{q(n)})_i, ℓ^{jk}_{q(n)}, V^{jk}_{q(n)}, Δ_i V^{jk}_{q(n)}, Δ_{−i} V^{jk}_{q(n)}, i = 1, 2;

(ii) Solve the linear program problem:

J*(x(n)) ≜ max_{x^{jk}_n ∈ X_n, j,k ∈ Z} { V^{jk}_{q(n)} }

s.t.
0 ≤ (λ^{jk}_{q(n)})_1 + (λ^{jk}_{q(n)})_2 + ℓ^{jk}_{q(n)}
(λ^{jk}_{q(n)})_{|i|} ≤ (f^{jk}_{q(n)})_{|i|} Δ_i V^{jk}_{q(n)},  i = −2, −1, 1, 2
0 ≤ V^{jk}_{q(n+1)} − V^{jk}_{q(n)} + g(x^{jk}_n, q(n), q(n+1)),  x^{jk}_n ∈ S_{q(n),q(n+1)}    (16)

(iii) After finding the maximum value of V^{jk}_{q(n)}, define the interpolating function V_{q(n)}(x), and then calculate the optimal feedback input

u_n = arg min_{u ∈ U} { (∂V_{q(n)}/∂x) f_{q(n)}(x, u) + ℓ_{q(n)}(x, u) }    (17)

It should be noted that the dynamic programming methodology used in this paper to compute the optimal feedback input is different from [6]. In [6], the cost function is quadratic and the optimal controller design is for piecewise affine (PWA) systems; if the hybrid system cannot be modeled as a PWA system, the methodology for computing the optimal controller needs to be discussed further. Also, in [7], the initial state and the terminal state are determined, so there are no parameters in the optimal control problems. In this paper, the initial state is a parametric vector, and the solution of the optimal control problem must perform a dynamic enumeration to search for the maximum lower bound in every sampling period; the problem is therefore different from [7].
4 Simulation Example

We use the local area network in the laboratory to construct a networked control system, where the network is Ethernet and the communication between nodes is done using TCP/IP sockets. The experiment setup is shown in Figure 2.

Figure 2: The networked control system experiment setup

In our experiment setup, one computer (computer 1) works as the plant, and another computer (computer 2) works as the controller; these two computers and the other computers are connected over the local area network in the lab. Computer 1 simulates the dynamics of the plant: it obtains the control signal from the network, computes the output, and sends the computation results to the controller computer. Computer 2 obtains the plant output, calculates the control signal, and then sends the control signal to the plant computer. Note that networked control systems with multiple switching modes can be realized in this experiment setup: based on the hybrid system model of the networked control system, the different dynamics of the plant can be simulated on the plant computer, while the controller computer calculates the control results and sends them to the plant computer, controlling the dynamics of the plant in its different modes. Because TCP/IP is used over the Ethernet in the lab, the end-to-end data transmission is reliable, so packet loss between the plant computer and the controller computer need not be considered. However, the network-induced delay needs to be examined when the plant computer and the controller computer exchange data across the network. We tested the delays between two computers in the network and found that the time delay generally maintains a fixed value; in our experiment setup, this fixed value is much less than the sampling period.
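The two-computer setup can be mimicked on a single machine: the sketch below runs a "controller" thread and a "plant" loop that exchange doubles over a TCP socket, in the spirit of the experiment setup. The scalar plant, the gain 0.5 and the three-step horizon are illustrative assumptions; messages are single 8-byte doubles, so one recv per message suffices on the loopback interface.

```python
import socket
import struct
import threading

# Create the listening socket first so the plant knows the OS-assigned port.
srv = socket.socket()
srv.bind(("127.0.0.1", 0))
srv.listen(1)
port = srv.getsockname()[1]

def controller():
    # Controller node: receive the scalar plant state, reply with u = -0.5 * x.
    conn, _ = srv.accept()
    with conn:
        for _ in range(3):
            x, = struct.unpack("d", conn.recv(8))
            conn.sendall(struct.pack("d", -0.5 * x))

t = threading.Thread(target=controller)
t.start()

# Plant node: simulate x(k+1) = a*x(k) + u(k), with u fetched over the network,
# mirroring computer 1 (plant) and computer 2 (controller).
x, a = 4.0, 1.0
cli = socket.create_connection(("127.0.0.1", port))
with cli:
    for k in range(3):
        cli.sendall(struct.pack("d", x))
        u, = struct.unpack("d", cli.recv(8))
        x = a * x + u
t.join()
srv.close()
```

Each round trip halves the state (4 → 2 → 1 → 0.5), and the blocking recv calls make the network delay directly visible to the plant loop, which is the quantity the authors measure against the sampling period.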
Therefore, it is reasonable to neglect the network-induced delay in this paper, and the model and the solution to the optimal control problem can be validated in this simulation. The networked control system simulated in the experiment setup is given as follows:

x(k+1) = A_{q_i} x(k) + B u(k),  i = 1, 2    (18)

where

A_{q_1} = [ √2/2  −√2/2 ; √2/2  √2/2 ],  A_{q_2} = [ √2/2  √2/2 ; −√2/2  √2/2 ],  B = [ 0 ; 1 ],

S_{q_1,q_2} = { x : [1 0] x(k) < 0 },  S_{q_2,q_1} = { x : [1 0] x(k) ≥ 0 },

x(k) ∈ [−10, 10] × [−10, 10],  u(k) ∈ [−1, 1].    (19)

Define the cost function

J(U_N, x(0)) ≜ x_N^T P x_N + Σ_{k=0}^{N−1} ( x_k^T Q x_k + u_k^T R u_k )    (20)

where P = Q = [ 700 0 ; 0 700 ], R = 1, the terminal region X_f = [−0.01, 0.01] × [−0.01, 0.01], and N = 3.

Figure 3: The optimal trajectory of the state variable x

Figure 4: The optimal control input when x(0) = [−1 1]^T

The optimal control problem is to minimize the cost function and solve for the optimal feedback control input. In terms of the steps presented in Section 3 and toolboxes related to multiparametric programming [8] and optimal control [9], the solving program was designed, and the optimal control can then be obtained. When x_0 = [−1 1]^T, the optimal trajectory of the state variable x is shown in Fig. 3, and the control input is shown in Fig. 4.

5 Conclusion

In networked control systems with multiple switching modes, it is necessary to calculate the optimal feedback control input, which can improve the system performance and save cost. This paper models the class of networked control systems as a hybrid system and addresses an optimal control problem where the initial state is a parametric vector.
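The switched plant (18)-(19) and the cost (20) can be simulated directly; note that A_{q_1} and A_{q_2} are rotations by ±45 degrees. The reading of the switching sets as "mode q_1 when x_1 < 0, mode q_2 otherwise" is an interpretation made here for illustration, and the input sequence below is a plain candidate, not the paper's optimal input.

```python
import numpy as np

s = np.sqrt(2) / 2
Aq1 = np.array([[s, -s], [s, s]])     # mode q1: rotation by +45 degrees
Aq2 = np.array([[s, s], [-s, s]])     # mode q2: rotation by -45 degrees
B = np.array([0.0, 1.0])
P = Q = 700.0 * np.eye(2)
R = 1.0

def step(x, u):
    # State-controlled mode: q1 when x1 < 0, q2 when x1 >= 0 (our reading of S).
    A = Aq1 if x[0] < 0 else Aq2
    return A @ x + B * u

def cost(x0, U):
    # J = x_N' P x_N + sum_k (x_k' Q x_k + u_k' R u_k), as in (20), with N = len(U).
    x, J = x0, 0.0
    for u in U:
        J += float(x @ Q @ x + R * u * u)
        x = step(x, u)
    return J + float(x @ P @ x), x

J, xN = cost(np.array([-1.0, 1.0]), [0.0, 0.0, 0.0])
```

With zero input the rotations preserve the norm, so every stage contributes 700·2 and the three-step cost is 4·1400 = 5600; any useful controller must beat this baseline by steering toward X_f.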
Considering the switching modes and the unknown initial state, dynamic programming techniques are proposed to solve the optimal control problem, where each step of the calculation needs to enumerate the grid points in the feasible region to find a maximum lower bound of the cost function and to solve for the optimal feedback input. Finally, an example is given to illustrate the techniques. It should be noted that the switching modes in networked control systems make the calculation of the optimal control input more complex; dynamic programming reduces the calculation complexity compared with heuristic enumeration. In future research, it is an important task to improve the computational efficiency and reduce the complexity. In addition, time delays are common in networked control systems, and it is an interesting area to work the delay into the optimal control problem.

References

[1] W. Zhang, M. S. Branicky and S. M. Phillips. Stability of networked control systems. IEEE Control Systems Magazine, Vol. 21, pp. 84-99, 2001.
[2] H. Fujioka and K. Ito. Performance optimization for a class of networked control systems with communication constraints. Proceedings of the American Control Conference, pp. 248-253, 2003.
[3] L. Lu, L. Xie and M. Fu. Optimal control of networked systems with limited communication: a combined heuristic and convex optimization approach. Proc. of 42nd IEEE Int. Conf. on CDC, pp. 1194-1199, 2003.
[4] A. van der Schaft and H. Schumacher. An Introduction to Hybrid Dynamical Systems. Berlin: Springer-Verlag, 2000.
[5] R. Alur, C. Courcoubetis and T. A. Henzinger. Hybrid automata: an algorithmic approach to the specification and verification of hybrid systems. In: R. L. Grossman, A. Nerode, A. P. Ravn, et al., eds. Hybrid Systems, Lecture Notes in Computer Science, Berlin: Springer-Verlag, Vol. 736, pp. 209-229, 1993.
[6] F. Borrelli. Constrained Optimal Control of Linear and Hybrid Systems. Berlin: Springer-Verlag, 2003.
[7] S. Hedlund and A. Rantzer. Optimal control of hybrid systems. Proc. 38th IEEE Int. Conf. on CDC, pp. 3972-3977, 1999.
[8] M. Kvasnica, P. Grieder, M. Baotic, et al. Multi-Parametric Toolbox. ETH, Swiss Federal Institute of Technology, 2004.
[9] S. Hedlund and A. Rantzer. CDP Tool: a Matlab tool for optimal control of hybrid systems. Department of Automatic Control, Lund Institute of Technology, 1999.

Authors: Gang Zheng, Wenli Zeng, Fanjiang Xu
Laboratory of Integrated Information Systems Technology
Institute of Software, Chinese Academy of Sciences
No. 4 South Fourth Street, Zhongguancun, Beijing, China, 100080
E-mail: gangzhengcn@yahoo.com.cn

Received: November 6, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 3, pp. 314-324

Image Segmentation Using Euler Graphs

T.N. Janakiraman, P.V.S.S.R. Chandra Mouli

T.N. Janakiraman, Department of Mathematics, National Institute of Technology, Trichy, India. E-mail: janaki@nitt.edu
P.V.S.S.R. Chandra Mouli, School of Computing Science and Engineering, V.I.T. University, Vellore, India. E-mail: mouli.chand@gmail.com

Abstract: This paper presents a new algorithm for the image segmentation problem using the concepts of Euler graphs in graph theory. By treating the image as an undirected weighted non-planar finite graph (G), image segmentation is handled as a graph partitioning problem. The proposed method locates region boundaries or clusters and runs in polynomial time. Subjective comparison and objective evaluation show the efficacy of the proposed approach in different image domains.

Keywords: image segmentation, graph theory, Euler graphs, cycles.

1 Introduction

Image segmentation can be treated as a graph partitioning problem which is solved by making use of cuts in a weighted graph based on a certain criterion. The proposed method handles the image segmentation problem in a distinctive manner. An excellent review of image segmentation is available in [8], [9], [15].
Earlier approaches to image segmentation fall into three groups: (1) clustering of low-level features, such as histogram thresholding [14], k-means / k-centroid [12], [27] and mixture of Gaussians (MoG) [2]; (2) edge linking, such as dynamic programming [26], the relaxation approach [16] and the saliency network [25]; and (3) region operations, such as region splitting and merging [24], [23], region growing methods [3], [13], [17], [11] and region competition [20]. Applications of segmentation are abundant; it is heavily used in medical imaging, for example in the segmentation of internal brain nuclei in MRI images as discussed in [38]. This work aims to achieve robust image segmentation using graph-theoretic concepts such as Euler graphs and cycles. The proposed method finds the cycles of a given graph so that the image regions are formed by connecting all relevant pixels together. The relevancy of pixels is determined based on two parameters, namely edge weight similarity and node label similarity, which are described in the subsequent sections. The algorithm may stop at a particular stage when no refinement is possible due to the constraints imposed on cycle formation. Such paths are tried for further refinement; if refinement is not possible, those paths are treated as open paths and may be treated as cuts. All the procedures of the proposed method run in polynomial time. The rest of the paper is organized as follows. In Section 2, a brief review of graph-based segmentation is given. The basic definitions related to Euler graphs and some of their properties are presented in Section 3. In Section 4, the proposed algorithm and the experimental results are presented. Section 5 concludes the work.

2 Graph Approaches

Recently, graph-based image segmentation has attracted growing interest, and graph theory and its concepts have come to dominate image processing research.
Copyright © 2006-2010 by CCC Publications

Concepts of graph theory such as maximum flow, maximum clique, shortest path and minimum spanning tree have been used for image processing problems. [21] discusses the various types of graph algorithms in computer vision, and a special issue on graph-based image processing was published in [32]. Early graph-based methods include [4], [18], with more recent formulations in terms of graph cuts [22], [31] and spectral methods [30]. The notion of a connectivity graph was introduced by [19] to allow for image processing on a foveal sensor; this notion was introduced specifically to model the sampling of the macaque retina in [5]. The work of Zahn (1971) presents a segmentation method based on the minimum spanning tree (MST) of the graph; the segmentation criterion in Zahn's method is to break MST edges with large weights. The algorithm proposed by Urquhart (1982) normalizes the weight of an edge using the smallest weight incident on the vertices touching that edge. Work by Wu and Leahy (1993) introduced such a cut criterion, but it was biased toward finding small components. This bias was addressed by the normalized cut criterion developed by Shi and Malik (2000), which takes into account the self-similarity of regions. These cut-based approaches to segmentation capture non-local properties of the image, in contrast with the early graph-based methods. Weiss (1999) has shown how the eigenvector-based approximations developed by Shi and Malik relate to more standard spectral partitioning methods on graphs. However, all such methods are too slow for many practical applications. An alternative to the graph cut approach is to look for cycles in a graph embedded in the image plane; in [10], the quality of each cycle is normalized in a way that is closely related to the normalized cuts approach.
[7] described an efficient graph-based segmentation in which they defined a predicate for measuring the evidence for a boundary between two regions. using that predicate, an algorithm is developed which makes greedy decisions to produce segmentations that satisfy global properties. more recent literature reveals many improvements over these existing methods, but for comparison and evaluation the methods by shi and malik, felzenszwalb and huttenlocher etc. are treated as benchmark works. a method to build a hierarchy of partitions of an image is introduced by [29], comparing in a pairwise manner the difference along the boundary of two components relative to the components' internal differences. they stated as a drawback of this method that the maximum and minimum criteria introduced are very sensitive to noise, although in practice this has a small impact. an mst pyramid based segmentation is carried out by [28] using dual graph contraction. for evaluating the segmentation results of the proposed method against other existing methods, precision, recall and f-measure have been used, since the berkeley segmentation images [34] are evaluated using these three measures. the methods considered for comparison are [35], [36] and [37]. 3 background leonhard euler [6] discussed graphs for the first time while solving the famous seven bridges of königsberg problem. the following are some of the terms and their definitions used in this work. these definitions are taken as they are defined in [1]. 3.1 basic definitions let g(v, e) be the given graph with v and e representing the vertex set and edge set respectively. definition 1. a trail that traverses every edge of g is called an euler trail. it is so named because euler was the first to investigate the existence of such trails in graphs. definition 2. an euler tour is a tour which covers all the edges of g. definition 3.
a graph is an euler graph or eulerian if it contains an euler tour. euler proved the following theorem and corollary, through which one can determine whether a graph has an euler tour. the following characterizations are taken as they are defined and proved in bondy and murty (1982). theorem 4. a non-empty connected graph is eulerian if and only if it has no vertices of odd degree. corollary: a connected graph has an euler trail if and only if it has at most two vertices of odd degree. 3.2 extraction / development of euler graphs from non-euler graphs if a graph does not have an euler circuit, one might still be interested in knowing how it could be traversed with as few retraced edges as possible (starting and ending at the same vertex). an eulerian graph can be obtained in two ways: (i) by adding one spurious multiple edge which joins two adjacent odd degree vertices, and (ii) by deleting the edges joining two adjacent odd degree vertices. 4 proposed method the euler graph and its properties are exploited in this work for solving the image segmentation problem. the basic idea is that an euler graph can be decomposed into edge disjoint cycles. the steps of the proposed method are given below:
step-1: representation of the image as a grid graph
step-2: conversion of the grid graph into eulerian
step-3: segmentation procedure
step-4: refinement of segments
these stages are discussed in detail in the following sub-sections. 4.1 representation of image as a grid graph the image to be segmented is represented as a graph g(v, e). to do so, each pixel is treated as a vertex of the graph. edges are defined based on 8–connectivity of the neighborhood vertices. an edge (vi, vj) ∈ e corresponds to a pair of neighboring vertices. the graph g thus obtained is an undirected weighted non-planar graph. clearly, an image of size n × n contains n² vertices, (n − 1)n vertical edges, n(n − 1) horizontal edges and 2(n − 1)² diagonal edges.
thus, in total there are (n − 1)n + n(n − 1) + 2(n − 1)² = 4n² − 6n + 2 edges. let m = 4n² − 6n + 2. the graph thus formed is visualized as a grid and hence called a grid graph. a sample grid graph is shown in figure 1. the weights are assigned to the edges by using the absolute intensity difference between the adjacent pixels. figure 1: grid graph of an image 4.2 conversion of grid graph into eulerian the grid graph thus obtained is connected but non-eulerian because some of the vertices have odd degree. these are the border vertices, i.e., the vertices on the first row, last row, first column and last column. for this reason, the grid graph is converted to eulerian so that all vertices have even degree; the conversion then guarantees the formation of cycles covering all edges, since all the vertices are of even degree. this can be achieved in two ways. in the first case, one extra multiple edge is added for each disjoint pair of adjacent odd degree vertices. the same weight is allocated to both the duplicated and the original edge to avoid ambiguity. the process is repeated until no such pair exists. in figure 2, (a) and (d) show two grid graphs; figure 2(b) and (c) represent the two possible euler graphs of (a), and (e) and (f) represent the euler graphs of (d). figure 2: grid graph and its corresponding euler graphs in the second case, instead of adding duplicate edges to the pairs of adjacent vertices of odd degree, alternate edges are removed at the boundary to maintain even degree. it is found that there is no loss of information from images by removing such edges because all the edges removed are incident on border vertices. in practice, there is not much information available at the border vertices, and experimentally it is found that there is no variation in the segments formed in either way.
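The construction described in sections 4.1 and 4.2 can be sketched as follows. This is a minimal illustration, not the authors' implementation; the function names and the list-of-tuples edge representation are assumptions of this sketch.

```python
import itertools
from collections import Counter

def grid_graph(img):
    """8-connected grid graph of a grayscale image (given as a list of rows).
    Each pixel is a vertex; each edge weight is the absolute intensity
    difference between the two neighbouring pixels."""
    n, m = len(img), len(img[0])
    edges = []
    for i, j in itertools.product(range(n), range(m)):
        # enumerating half of the 8-neighbourhood avoids duplicate edges
        for di, dj in ((0, 1), (1, 0), (1, 1), (1, -1)):
            ni, nj = i + di, j + dj
            if 0 <= ni < n and 0 <= nj < m:
                edges.append(((i, j), (ni, nj), abs(img[i][j] - img[ni][nj])))
    return edges

def make_eulerian(edges):
    """First conversion strategy of section 4.2: repeatedly duplicate an
    edge joining two adjacent odd-degree vertices, keeping its weight,
    until every vertex has even degree.  (On these grid graphs the odd
    vertices are exactly the border vertices, so such a pair exists.)"""
    edges = list(edges)
    while True:
        deg = Counter()
        for u, v, _ in edges:
            deg[u] += 1
            deg[v] += 1
        odd = {x for x, d in deg.items() if d % 2}
        if not odd:
            return edges
        u, v, w = next(e for e in edges if e[0] in odd and e[1] in odd)
        edges.append((u, v, w))  # spurious multiple edge, same weight

g = grid_graph([[0, 1, 2], [3, 4, 5], [6, 7, 8]])
assert len(g) == 4 * 3**2 - 6 * 3 + 2   # 20 edges for n = 3, matching 4n² − 6n + 2
```

After `make_eulerian`, every vertex degree is even, which by theorem 4 is exactly the condition for an euler tour to exist.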
4.3 segmentation procedure once the given image is represented as eulerian, the segmentation procedure is carried out over the eulerian graph. the algorithms for image segmentation and segments_formed are given below.
image segmentation:
1. color all the edges white.
2. call the segments_formed procedure.
3. call the regions_refinement procedure.
segments_formed:
1. select arbitrarily a white colored edge.
2. the selected edge is included in the temporary_growing_vector if it satisfies the threshold.
3. if the temporary_growing_vector forms a cycle then the closed path is stored in the cycles_formed vector.
3a. the cycle formed is treated as a region. in the region formed, the edges in the closed path represent the boundary edges of the region. the edges present inside the region are internal edges of that region. the corresponding vertices are called boundary vertices and internal vertices respectively.
3b. the boundary edges of the region are colored black and internal edges are colored gray.
4. if the temporary_growing_vector has no cycle then choose the next minimum weighted white colored adjacent edge satisfying the threshold and goto step-2.
4a. if there is no edge available which satisfies the threshold then backtrack to its parent and search for the next white colored adjacent edge satisfying the threshold.
4b. if no parent exists then the temporary_growing_vector is stored in the open_paths vector. color all the edges in open_paths black.
4c. else
4d. remove the last included edge in the temporary_growing_vector.
4e. choose the next white colored adjacent edge and goto step-2.
5. if all the vertices are not covered or all edges are not colored gray or black then goto step-1.
6. if all the vertices are covered in the regions either as boundary or as internal vertices then this induces the initial segmentation for the given threshold.
7.
if all the edges are colored either gray or black, representing the internal or boundary edges, then the segmentation is subjected to refinement. the algorithm uses a color structure which labels the edges as given below:
• initially all edges are in white color
• a visited edge is in gray color
• an edge in black color indicates that it is a part of the boundary of a region.
the black colored edges are marked permanently so that they are not considered for refinement. only white and gray colored edges are subjected to refinement. the criterion that is imposed on every edge to form a segment is defined in equation (1). equation (1) refers to the difference of the maximum and minimum vertex labels in the cycle formed; in this case, it is used as the difference of the maximum and minimum vertex labels in the temporary_growing_vector. t = maxv − minv (1) the algorithm starts by randomly choosing a white colored edge. at the first execution, the edge chosen is included directly in the temporary_growing_vector. since a cycle cannot be formed with one edge, line 4 is executed, where the algorithm tries to choose a white colored edge adjacent to the previously chosen edge. the edge is selected based on the threshold criterion. if no minimum weighted white colored adjacent edge is available, then the algorithm backtracks to its parent and searches for another minimum weighted white colored adjacent edge. if it finds one, then the last included edge in the temporary_growing_vector is removed, since the algorithm could not traverse from that edge, and the newly selected edge is added into the temporary_growing_vector. line 3 of the algorithm checks for any cycle in the temporary_growing_vector. to check this, the bfs algorithm is used. each cycle is treated as one region. if a cycle is formed, then the closed path is stored in the cycles_formed vector. the edges of the closed path are colored black, indicating that they are boundary edges.
these edges are not chosen for forming any other cycles. the edges present inside the region are colored gray. these edges may be used for forming cycles once the white colored edges are exhausted. this helps in avoiding self overlapping region formation: a traversal that starts from an internal edge of a region and goes outside that region is termed self overlapping. self overlapping is avoided at the initial stage in order to get the maximum number of non–overlapping regions, but it is carried out in the region refinement stage, if necessary. during the execution of the algorithm, if it chooses a white colored edge outside any region and, on its traversal, overlaps an existing region, then this is allowed, because the internal edges of one region act like boundary edges of another region. another possibility during the traversal is that the temporary_growing_vector may not grow further because no further edge satisfies the criterion at any level (neither at the current edge nor at any of its parent edges); then the temporary_growing_vector stops growing. by nature, an eulerian graph guarantees cycle formation, but due to the threshold criterion it may not form cycles in all cases. in such a case, the temporary_growing_vector contains an open path, and such paths are stored separately in the open_paths vector. in this way, the algorithm tries to traverse until it covers all vertices. this completes the first stage, which induces an initial segmentation of the image. refinement of regions formed: at this stage, all the edges are labeled either gray or black. refinement of black colored edges is not possible because they represent the boundaries of the regions already formed. the gray colored edges are subjected to refinement. the same procedure is used to form regions, choosing a randomly selected gray colored edge and proceeding with further traversals. in this way, the algorithm tries to refine the segmentation.
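Two ingredients of the segments_formed procedure can be sketched in isolation: the label-spread criterion of equation (1) and the BFS-based cycle check on the growing vector. This is a minimal hypothetical version; the names `within_threshold` and `has_cycle` are mine, not the paper's.

```python
from collections import defaultdict, deque

def within_threshold(growing_vector, labels, t):
    """Criterion (1): the difference between the maximum and minimum
    vertex labels collected so far must not exceed the threshold t.
    growing_vector is a list of edges (u, v); labels maps each vertex
    to its label (here, its intensity)."""
    verts = {x for e in growing_vector for x in e}
    return max(labels[x] for x in verts) - min(labels[x] for x in verts) <= t

def has_cycle(growing_vector):
    """BFS check for a cycle in the undirected subgraph formed by the
    edges of the growing vector (parallel edges count as a cycle)."""
    adj = defaultdict(list)
    for i, (u, v) in enumerate(growing_vector):
        adj[u].append((v, i))
        adj[v].append((u, i))
    seen = set()
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        queue = deque([(start, -1)])   # (vertex, index of the arriving edge)
        while queue:
            x, via = queue.popleft()
            for y, i in adj[x]:
                if i == via:
                    continue           # do not walk back over the same edge
                if y in seen:
                    return True        # reached an already-seen vertex
                seen.add(y)
                queue.append((y, i))
    return False

labels = {0: 10, 1: 12, 2: 13}
assert within_threshold([(0, 1), (1, 2)], labels, t=5)   # spread 3 <= 5
assert not has_cycle([(0, 1), (1, 2)])                   # open path
assert has_cycle([(0, 1), (1, 2), (2, 0)])               # closed path
```

In the actual procedure the growing vector also carries edge weights and colors; they are omitted here to keep the two checks visible.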
too much refinement leads to over segmentation and no refinement leads to under segmentation; a moderate level of refinement is necessary. this is controlled by threshold selection. 5 experimental results the proposed method is tested on the standard berkeley image database. two trivial synthetic images were created and the algorithm was tested on them. the two synthetic images and the corresponding results are shown in figure 3. the results presented in figure 3 are the induced segmentations obtained before the refinement process. figure 3: segmentation results-i of synthetic images in figure 3, (a) and (c) are the two synthetic images created and the corresponding segmentations are shown in (b) and (d). these two synthetic images were created in such a way as to study the behavior of the algorithm in the open_paths case. as mentioned in the algorithm, the temporary_growing_vector stops growing when there is no suitable edge satisfying the criterion. in such a case the path is not closed and hence it is stored in the open_paths vector. in figure 3b, the segmentation result shows two open_paths (cross lines). the two different ends of the two open_paths are adjacent to one region formed. thus, the segmentation output gives a visualization that there are two closed regions labeled 1 and 2, and three open regions labeled 3, 4 and 5. in figure 3d, the segmentation output shows two open_paths for which no end is adjacent to any other region. the four open regions formed by the two open_paths are labeled 1, 2, 3 and 4 in figure 3d. after applying the refinement procedure, the segments obtained are shown in figure 4 (figure 4: segmentation results-ii of synthetic images). in figure 4b, the segmentation output shows five labeled regions. similarly, in figure 4d, there are 4 closed regions. the refinement process, in these cases, tried to get closed regions and in that process led to over segmentation.
this may be true in real images also. hence, the refinement procedure is executed depending on the user's choice. the results of some real images taken from the berkeley image database are shown in figure 5. in figure 5, the first and third columns represent the original images and the second and fourth columns represent the segmentation results obtained. figure 5: segmentation results of some sample images in the berkeley image database the proposed algorithm was executed on 100 images in the database. the results of 50 images have been tabulated in table 1. from the table, the following observations are made: images having a uniform background or an average intensity range obtained the best results; for images having overlapping objects or complex structures, the statistical results are almost equivalent to those of the other existing methods chosen for comparison; and for images with highly overlapping objects, or very dark images which cannot be visualized perfectly by the human eye, the proposed method could not segment the images, and the statistical results reveal that the f-measure of the existing methods is better for such images compared to the proposed method. the graphical representation of the results is shown in figure 6.
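For reference, the three evaluation measures used above reduce to the following arithmetic once matched boundary pixels have been counted. This is a simplified sketch; the Berkeley benchmark's matching of detected to ground-truth boundary pixels is more involved, and here the match counts are assumed to be given.

```python
def precision_recall_f(tp, fp, fn):
    """Compute precision, recall and F-measure from match counts.

    tp: boundary pixels correctly detected
    fp: detected pixels with no ground-truth match
    fn: ground-truth pixels with no detected match
    """
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f = (2 * precision * recall / (precision + recall)
         if precision + recall else 0.0)
    return precision, recall, f

p, r, f = precision_recall_f(tp=80, fp=20, fn=20)
# here precision = recall = 0.8, so the F-measure is also 0.8
```

The f-measure is the harmonic mean of precision and recall, so a method must do well on both to score well in table 1.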
figure 6: comparison of segmentation evaluation results
table 1: comparison of segmentation evaluation results with other existing methods
image name  bel   gpb   xren  eg     image name  bel   gpb   xren  eg
119082      0.70  0.74  0.80  0.75   89072       0.68  0.71  0.71  0.69
42049       0.92  0.91  0.85  0.92   126007      0.72  0.78  0.76  0.75
167062      0.67  0.76  0.75  0.75   296007      0.66  0.69  0.69  0.65
24077       0.74  0.76  0.76  0.76   175032      0.49  0.62  0.63  0.60
38092       0.78  0.78  0.73  0.74   103070      0.68  0.68  0.62  0.65
101085      0.74  0.83  0.78  0.76   285079      0.71  0.72  0.71  0.69
41033       0.62  0.68  0.68  0.66   167083      0.61  0.75  0.75  0.70
291600      0.57  0.61  0.59  0.60   271035      0.73  0.73  0.71  0.71
130026      0.52  0.51  0.47  0.52   12084       0.48  0.52  0.49  0.50
241004      0.85  0.81  0.81  0.85   69015       0.79  0.82  0.79  0.75
147091      0.71  0.77  0.75  0.75   58060       0.50  0.55  0.58  0.49
189080      0.78  0.80  0.77  0.79   163085      0.49  0.50  0.60  0.49
14037       0.65  0.70  0.65  0.71   220075      0.62  0.64  0.62  0.59
62096       0.79  0.79  0.78  0.78   45096       0.76  0.79  0.78  0.73
227092      0.75  0.88  0.85  0.88   16077       0.57  0.61  0.54  0.58
253027      0.63  0.65  0.69  0.68   219090      0.71  0.74  0.74  0.72
229036      0.67  0.76  0.72  0.75   300091      0.57  0.79  0.76  0.65
3096        0.90  0.89  0.88  0.85   156065      0.66  0.67  0.64  0.63
170057      0.66  0.68  0.70  0.66   76053       0.61  0.61  0.62  0.59
157055      0.73  0.76  0.79  0.74   304034      0.47  0.49  0.47  0.41
295087      0.71  0.78  0.78  0.75   86016       0.39  0.52  0.42  0.48
78004       0.79  0.80  0.80  0.77   8023        0.41  0.50  0.42  0.40
43074       0.66  0.67  0.78  0.65   108082      0.43  0.46  0.47  0.43
86000       0.62  0.70  0.70  0.67   69040       0.50  0.55  0.57  0.52
6 conclusion in this paper, a novel algorithm for segmenting an image into different regions using euler graphs has been proposed. the algorithm starts by randomly choosing an edge and tries to form closed regions; in some cases, open paths are formed. a color lookup table is used for edges to trace their transitions.
a white color indicates an unvisited edge, a gray color indicates a visited edge that may go for refinement, and a black color indicates an edge visited and marked permanently for no refinement, since it is already a part of a region boundary. the procedures discussed run in polynomial time. the mst and cycles based methods perform better compared to the euler graph method in terms of precision, recall and f-measure. bibliography [1] j.a. bondy and u.s.r. murty. graph theory with applications, fifth printing. elsevier science publishing co., new york, 1982. [2] c. carson, s. belongie, h. greenspan and j. malik. blobworld: image segmentation using expectation-maximization and its application to image querying. ieee transactions on pattern analysis and machine intelligence, 24(8):1026–1038, 2002. [3] c.r. brice and c. fennema. scene analysis using regions. artificial intelligence, 1(3–4):205–226, 1970. [4] c.t. zahn. graph theoretical methods for detecting and describing gestalt clusters. ieee transactions on computers, 20:68–86, 1971. [5] e.l. schwartz. spatial mapping in the primate sensory projection: analytic structure and relevance to perception. biological cybernetics, 25(4):181–194, 1977. [6] l. euler. solutio problematis ad geometriam situs pertinentis. commentarii academiae scientiarum petropolitanae, 8:128–140, 1736. [7] p.f. felzenszwalb and d.p. huttenlocher. efficient graph-based image segmentation. international journal of computer vision, 59(2):167–181, 2004. [8] k.s. fu and j.k. mui. a survey of image segmentation. pattern recognition, 13:3–16, 1981. [9] r.m. haralick and l.g. shapiro. survey, image segmentation techniques. computer vision, graphics and image processing, 29:100–132, 1985. [10] i. jermyn and h. ishikawa. globally optimal regions and boundaries as minimum ratio weight cycles. ieee transactions on pattern analysis and machine intelligence, 23(4):1075–1088, 2001. [11] l. vincent and p. soille.
watersheds in digital spaces: an efficient algorithm based on immersion simulations. ieee transactions on pattern analysis and machine intelligence, 13(6):583–598, 1991. [12] m. clark, l. hall, d. goldgof, r. velthuizen, l. clarke and m. silbiger. mri segmentation using fuzzy clustering techniques. ieee engineering in medicine and biology magazine, 13(5):730–742, 1994. [13] o. monga. an optimal region growing algorithm for image segmentation. prai, 1(4):351–375, 1987. [14] n. otsu. a threshold selection method from grey level histograms. ieee transactions on systems, man and cybernetics, 9:62–66, 1979. [15] n.r. pal and s.k. pal. a review on image segmentation techniques. pattern recognition, 26:1277–1294, 1993. [16] p. parent and s.w. zucker. trace inference, curvature consistency, and curve detection. ieee transactions on pattern analysis and machine intelligence, 11(8):823–839, 1989. [17] r. adams and l. bischof. seeded region growing. ieee transactions on pattern analysis and machine intelligence, 16(6):641–647, 1994. [18] r. urquhart. graph theoretical clustering based on limited neighborhood sets. pattern recognition, 15(3):173–187, 1982. [19] r. wallace, p.w. ong and e. schwartz. space variant image processing. international journal of computer vision, 13(1):71–90, 1994. [20] s.c. zhu and a.l. yuille. region competition: unifying snakes, region growing, and bayes/mdl for multiband image segmentation. ieee transactions on pattern analysis and machine intelligence, 18(9):884–900, 1996. [21] s. dickinson, m. pelillo and r. zabih. introduction to the special section on graph algorithms in computer vision. ieee transactions on pattern analysis and machine intelligence, 23(10):1049–1052, 2001. [22] j. shi and j. malik. normalized cuts and image segmentation. ieee transactions on pattern analysis and machine intelligence, 22(8):888–905, 2000. [23] s.l. horowitz and t. pavlidis. a graph-theoretic approach to picture processing.
jacm, 7(2):282–291, 1976. [24] s.l. horowitz and t. pavlidis. picture segmentation by a tree traversal algorithm. jacm, 23(2):368–388, 1976. [25] s. ullman and a. shaashua. structural saliency: the detection of globally salient structures using a locally connected network. technical report, cambridge, ma, usa, 1988. [26] u. montanari. on the optimal detection of curves in noisy pictures. communications of the acm, 14(5):335–345, 1971. [27] a.r. weeks and g.e. hague. color segmentation in the hsi color space using the k-means algorithm. in nonlinear image processing viii (e.r. dougherty and j.t. astola, eds.), proc. spie, volume 3026, pages 143–154, 1997. [28] y. haxhimusa and w. kropatsch. hierarchy of partitions with dual graph contraction. lecture notes in computer science, 2781:338–345, 2003. [29] y. haxhimusa and w. kropatsch. segmentation graph hierarchies. in proceedings of structural, syntactic, and statistical pattern recognition, lncs volume 3138, pages 343–351, 2004. [30] y. weiss. segmentation using eigenvectors: a unifying view. in proceedings of the international conference on computer vision, volume 2, pages 975–982, 1999. [31] z. wu and r. leahy. an optimal graph theoretic approach to data clustering: theory and its application to image segmentation. ieee transactions on pattern analysis and machine intelligence, 15(11):1101–1113, 1993. [32] s. dickinson, m. pelillo and r. zabih. introduction to the special section on graph algorithms in computer vision. ieee transactions on pattern analysis and machine intelligence, 23(10):1049–1052, 2001. [33] d. martin, c. fowlkes, d. tal and j. malik. a database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. in proc. 8th int'l conf. computer vision, volume 2, pages 416–423, 2001.
[34] http://www.eecs.berkeley.edu/research/projects/cs/vision/bsds/ [35] p. dollar, z. tu and s. belongie. supervised learning of edges and object boundaries. in proc. ieee computer vision and pattern recognition (cvpr), june 2006. [36] m. maire, p. arbelaez, c. fowlkes and j. malik. using contours to detect and localize junctions in natural images. in proc. ieee computer vision and pattern recognition (cvpr), 2008. [37] x. ren. multi-scale improves boundary detection in natural images. in proc. eccv, 2008. [38] m.g. linguraru, m.á. gonzález ballester and n. ayache. deformable atlases for the segmentation of internal brain nuclei in magnetic resonance imaging. international journal of computers, communications & control, 2(1):26–36, 2007. t.n. janakiraman (born on february 14, 1953) received his ph.d. in graph theory and its applications from madras university, india, in 1991. he completed his postdoctoral fellowship at the same university in graph theory and its applications in 1994. he is currently associated with the department of mathematics, national institute of technology, trichy, india. he has two sponsored research projects to his credit and has published around 40 papers in refereed international journals. his main research interests are in graph theory and its applications in digital image processing, wireless ad hoc networks and digital video processing. p.v.s.s.r. chandra mouli (born on may 21, 1976) received his ph.d. from the national institute of technology, trichy, in 2010. he is currently working as associate professor in the school of computing science and engineering, vit university, vellore. his research interests include image segmentation, pattern classification and wireless ad–hoc networks. he has published 10 refereed research papers in various international journals and conferences.
he was co-investigator on a research project sponsored by the defence research and development organisation (drdo), new delhi, and also worked as a research fellow in another research project sponsored by drdo, new delhi, india. he is a life member of iste, and also a member of csi. he is a reviewer for many international journals.
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 852-861 agent technology in monitoring systems b. pătruţ, c. tomozei bogdan pătruţ, cosmin tomozei "vasile alecsandri" university of bacău, romania, 600115, calea mărăşeşti, 157 e-mail: bogdan@edusoft.ro, cosmin.tomozei@ub.ro abstract: the aim of this paper is to make a brief presentation of the results obtained by the authors regarding the application of agent technology in the development of distributed monitoring systems. consequently, we present magelan and conttest as monitoring systems based on intelligent agent technology. formal aspects regarding intelligent agents will be mentioned. quantitative issues concerning efficiency, maintenance and reengineering are also taken into account. keywords: intelligent agents, distributed systems, monitoring systems, hyper-encyclopedia, reengineering. 1 intelligent agents in distributed computing software applications nowadays need to solve some problematic situations on their own, in order to save time and reduce costs. the integration of intelligent agents in distributed computing has proved to be an important way of achieving this objective in recent years. furthermore, the union of distributed computing with intelligent agents offers, from a theoretical perspective, a good practical basis for distributed artificial intelligence [2]. intelligent agents have to be reliable and robust in order to offer accurate results in dissimilar, open or unpredictable environments [7], [8].
software agents are situated in particular environments and are capable of autonomous actions in order to fulfill their objectives. the concept of autonomy, and the fact that intelligent agents are endowed with it, presumes that human action is minimized. in [8] it is affirmed that an agent is an entity which can perceive the environment by sensors and acts by effectors in order to realize its objectives. on the other hand, the distribution of hardware, software and data offers the possibility for the agents to be replicated on diverse nodes of the computer network. 2 developing students' metacognitive competences metacognitive skills development is an important formative intellectual objective in the education of students, as reaching this level involves a route through effective education, appropriate to each student in particular [7]. metacognitive skills suppose that students are aware of their own cognitive activity, i.e. learning activity, and of self-adjustment mechanisms consisting in cognitive controls (rules, procedures, strategies). next we will introduce the notions of environment, agent and s-agent, necessary to develop intelligent systems such as those described in chapter 5 of this article. the following theoretical concepts are different from the fipa standard [9] ones and from the concepts defined by authors like wooldridge in [8], and are necessary for the implementation of the systems magelan and conttest. the following statements introduce theoretical concepts regarding intelligent agents and distributed technology. definition 1. we use the name of environment for a set of elements e = {e0, e1, e2, e3, ..., en} among which there is a relation of partial order marked with "<". we use the notation e ≤ f for the fact that e < f or e = f, and respectively f ≥ e if e ≤ f. the environment can be, at a certain point, in a certain state e, which we will express by st(e) = e.
at first, the environment is in an initial state e0, for which e0 ≤ ei, ∀i ∈ {1, 2, ..., n}. the state en is called the final state, for which it is considered that ei ≤ en, ∀i ∈ {1, 2, ..., n}. definition 2. we use the name of agent for a triple of the type a = (s, s0, r), where s is a finite set of states, s0 ∈ s is called the agent's initial state, and r is a set of evolution rules. if agent a is in state s, then we express this by st(a) = s. among the states of s there is a special state marked with λ. at first st(a) = s0, and when st(a) = λ we say that the agent is inactive; otherwise, the agent is active. the rules in r are of the type (2.1) or (2.2): r1 = (a, s, e) → (a, t, f) (2.1) r2 = (a, s, e) → (b, t, e) (2.2) rule (2.1) states that if st(a) = s and st(e) = e, then st(a) becomes t and st(e) becomes f, where e ≤ f. the second rule (2.2) states that agent a ceases its execution (st(a) becomes λ), transferring the control to agent b, for which st(b) becomes t, while the environment remains in the same state. if st(e) = e and there are two agents a and b with st(a) = s ̸= λ and st(b) = t ̸= λ, and (a, s, e) → (c, z, f) and (b, t, e) → (d, z′, f′), then we will consider st(e) = max(f, f′) if f and f′ are comparable (one of them, respectively, if f and f′ are not comparable), and st(a) = λ and st(c) = z if f′ < f, respectively st(b) = λ and st(d) = z′ if f < f′. this can be generalized for more active agents. definition 3. we use the name of s-agent for an n-uple of the type s = (c, a1, a2, ..., an) (2.3) where c is an agent called the coordinating agent, and a1, a2, ..., an are agents corresponding to the definition above, called effectors or atomic agents. the coordinating agent interacts directly with the user, and its architecture and functionality depend on the concrete implementation (examples will be offered in the following chapters).
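A minimal executable reading of the two rule types above can be sketched as follows. This is my own illustration, not the authors' implementation: the function name is hypothetical, `None` stands in for the inactive state λ, and states and environment values are plain strings.

```python
def apply_rule(rule, states, env):
    """Apply one evolution rule to the agents' states and the environment.
    A rule ((src, s, e), (dst, t, f)) fires when agent src is in state s
    and the environment is in state e; then dst moves to state t, the
    environment moves to f, and src becomes inactive (None, i.e. lambda)
    when control is transferred to a different agent."""
    (src, s, e), (dst, t, f) = rule
    if states[src] != s or env[0] != e:
        return False                # rule does not fire
    if dst != src:
        states[src] = None          # type (2.2) rule: control transfer
    states[dst] = t
    env[0] = f
    return True

states = {"a": "s0", "b": None}     # agent b starts inactive
env = ["e0"]
apply_rule((("a", "s0", "e0"), ("a", "s1", "e1")), states, env)  # type (2.1)
apply_rule((("a", "s1", "e1"), ("b", "t0", "e1")), states, env)  # type (2.2)
assert states == {"a": None, "b": "t0"} and env == ["e1"]
# exactly one agent is active after each step, as lemma 1 below asserts
assert sum(v is not None for v in states.values()) == 1
```

A type (2.1) rule lets an agent evolve in place while advancing the environment, while a type (2.2) rule hands control to another agent and leaves the environment unchanged, matching the prose description above.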
because inside an s-agent the agents transfer the control from one to another, an s-agent behaves like a communicative multi-agent system. definition 4. let there be s = (c, a1, a2, ..., an) an s-agent. if s1, s2, ..., sn are the states of the atomic agents that make up s (without the coordinator c), we then say that (s1, s2, ..., sn) is the state of s (at a given moment). definition 5. let there be s = (c, a1, a2, ..., an) an s-agent in the environment e, that cannot be modified by the user. if the initial state of s is of the type (s01, λ, λ, ..., λ), where s01 marks the initial state of the agent a1, we say that s is a normal s-agent. directly, we obtain the following lemma: lemma 1. in a normal s-agent, {s1, s2, ..., sn} = {s, λ, λ, ..., λ} holds, with s ̸= λ, at any point of time t. proof. the s-agent being normal, it follows that its initial state (at the moment t0) is (s01, λ, λ, ..., λ), therefore {s01, s02, ..., s0n} = {s01, λ, λ, ..., λ}, and s01 ̸= λ. let us suppose that the state at the moment ti is {s, λ, λ, ..., λ}, with s ̸= λ. this means that an active agent ai exists and that it is unique, with st(ai) = s ̸= λ and st(aj) = λ, ∀j ̸= i. if there is a relation of evolution of the type (ai, s, e) → (aj, t, f), then by applying this relation of evolution we will obtain at the moment ti+1: st(ai) = λ and st(aj) = t, with t ̸= λ. therefore the state of the s-agent will become (no matter the situation) (λ, λ, ..., λ, t, λ, ..., λ), with t ̸= λ on position j. if there is no relation of evolution with (ai, s, e) on the left side, then the s-agent will remain in the state {s, λ, λ, ..., λ}, with s ̸= λ. the fact that the agent will remain in that certain state will be called blockage, a notion that we will formally define later. definition 6.
We use the name of multi-agent monitoring system (MAMS) of the environment e, based on s-agents (or on groups), for a triple of the type

mams = (sa, l, e)  (2.4)

where sa is a set of s-agents having the same structure, e is the environment within which they exist and are implemented, and l is a set of communication (linkage) rules of the type ci → cj, where ci and cj are coordinating agents of some s-agents from sa. The communication relations among the s-agents form an oriented graph depending on the architecture and the concrete implementation of the multi-agent system, as we will see in the following chapters. Our interest lies in the evolution rules of the environment within an s-agent and in the structure and graphic representation of an s-agent. A MAMS containing only normal s-agents is called a normal MAMS.

Definition 7. The purpose of an s-agent is to take the environment e to a state as close as possible to its final state en. If relation (2.5) can be obtained, we say that the aim of the s-agent can be carried out.

st(e) = e ∧ ¬∃f ∈ e, f ≠ en : e < f  (2.5)

By extension, we say that the aim of the MAMS is reached if all the targets of the component s-agents are achieved.

Definition 8. Two evolution rules of the type r1 = a → b and r2 = b → c, where a, b, and c are triples of the type of those in (2.1) or (2.2), are called adjacent.

Definition 9. Let r1, r2, ..., rk be a sequence of pairwise adjacent evolution rules. We will use the notation (2.6) and call this relation the derivation relation.

r1 ⇒ rk  (2.6)

Within an s-agent, the following results are obvious:

Proposition 1. If st(e) = e, there is an agent a with st(a) = s ≠ λ, (a, s, e) ⇒ (a′, s′, e′), and there is no f ≠ en such that e′ < f, then the aim of the s-agent can be reached (with the environment in state e′). (From the very beginning, we have denoted the final state of the environment by en.) Most of the time, the set e is not totally ordered.
If, however, the relation "<" is a total order, then the following proposition holds.

Proposition 2. If the relation "<" is a total order, then the purpose of the s-agent can be reached with the environment in the final state en if there is a derivation of the type (a, sa0, e0) ⇒ (b, t, en).

Proposition 2 states that the purpose of the s-agent can be reached if there is a derivation that leads the environment to the state en, starting from the environment's initial state e0 and the initial state of any agent a, without the user's intervention (in the case of a total order "<"). If |{a : st(a) ≠ λ}| = 1 for any e = st(e) (therefore a single agent is active at a given moment, as in Lemma 1), then the MAMS operates sequentially. This is what happens most of the time. We consider that the environment e modifies its current state in two cases:

• as a result of applying an evolution rule, by one of the system's agents;
• as a result of the direct intervention of a human user.

We can also consider, in some exceptional cases, that the state a certain agent is in can be modified both by the evolution rules and by the user. Therefore, we cannot control what is going to happen, or how the state of the environment will evolve within a certain interval of time. If, ideally, the human user can modify neither the environment e nor the current state of the agents at random, then the system is entirely deterministic.

3 Blockages and infinite cycles

Our interest lies not simply in building absolutely sequential or deterministic systems, but in specifying the evolution rules in the best possible way, so that blockages cannot occur.

Definition 10. If relation (3.1) holds, then we say that the s-agent is under blockage.
∃a ∈ s : st(a) = s, st(e) = e, e ≠ en ∧ ¬∃(a, s, e′) → (b, s′, f), f, e′ ≠ en  (3.1)

Therefore, we say that an s-agent is under blockage when an atomic agent of the system gets into a state s, the environment is in a non-final state e, and there is no evolution relation that allows the agent to leave the state s, even if the environment is suddenly modified by the user into another non-final state e′. In (3.1), f and e′ are certain non-final states of the environment (e′ and f may even be e), and s′ is a certain state of a certain agent b from the s-agent, where b may even be a. In other words, there is no evolution rule with s on the left side which can lead to another state of a or to another agent b, no matter how the environment e evolves.

Definition 11. In an s-agent, a derivation of the type (a, s, e) ⇒ (a, s, e) is called an infinite cycle.

This occurs when, if the user does not intervene by modifying the environment, the execution of the s-agent's agents cycles infinitely, without the environment reaching the final state. In this case, the user may intervene to take the MAMS out of the cycle. Blockages and infinite cycles can be identified more easily if we represent the s-agents and the MAMS graphically. A multi-agent monitoring system (MAMS) can be represented as an oriented graph as follows (Figure 1):

• optionally, several s-agents are represented as polygons or other geometrical figures, the internal structure being drawn only for one of them;
• optionally, the links among the s-agents are drawn as curves; graphically, only one s-agent is represented, because all s-agents are considered to have the same internal structure;
• the generic s-agent is represented by a geometrical figure in which all atomic agents are drawn;
• the atomic agents are represented by rectangles labeled with their names; inside those rectangles there are circles representing the different states of the respective agent;
• each state is a vertex of an oriented graph whose edges are the evolution relations, labeled with a pair of clues for the states of the environment: the starting state and the state that is finally reached.

Figure 1: A simple s-agent containing a blockage in a1 and an infinite cycle in a2
Figure 2: A normal s-agent and its reduced s-agent

This is an ideal example of MAMS functioning, illustrated by the bold arcs in Figure 2. Obviously, the arcs other than the bold ones are useless in this graph, because they will never be passed through. If, however, the user interferes (for example, the moment the MAMS is in state 2 of the atomic agent a1, modifying the state of the environment from e2 to e3), then the evolution (a1, 2, e3) → (a2, 1, e4) will occur, expressed in Figure 3 through the dotted arc. We can obviously "endow" an s-agent with many evolution rules. Inside a cooperative working environment, different users will introduce different evolution rules. The question is whether, in the case of normal agents, some of those rules will ever be applicable, will ever come into action. It is as if we had a program with functions or pieces of code that are never called from anywhere, or an expert system with production rules part of whose premises will never be fulfilled. Thus, we will show that normal agents can be reduced to other normal agents, on the basis of an algorithm, so that certain evolution rules become useless and can be eliminated from the system without affecting its later behavior. On the contrary, the systems become simpler.
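The three possible fates of a deterministic run (goal, blockage per Definition 10, infinite cycle per Definition 11) can be sketched as a small walk over the rule graph. The function name and sample rules below are illustrative assumptions, not the paper's code.

```python
def run(rules, start, final_env):
    """Follow the (deterministic) evolution of an s-agent from `start`,
    a configuration (agent, state, env). `rules` maps a left side
    (agent, state, env) to a right side (agent', state', env').
    Returns "goal" when the environment reaches `final_env`,
    "blockage" when no rule applies (Definition 10),
    "cycle" when a configuration repeats (Definition 11)."""
    seen, cfg = set(), start
    while True:
        if cfg[2] == final_env:
            return "goal"
        if cfg in seen:
            return "cycle"
        seen.add(cfg)
        if cfg not in rules:
            return "blockage"
        cfg = rules[cfg]

# An infinite cycle: (a1, 1, e1) => ... => (a1, 1, e1)
cyc = {("a1", 1, "e1"): ("a2", 1, "e1"), ("a2", 1, "e1"): ("a1", 1, "e1")}
```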
By an s-agent's behavior we mean the applicability of the rules of its component agents. Therefore, if a certain evolution rule can ever be applied, it is part of the s-agent's behavior; if not, it is not part of this behavior.

Definition 12. Let s1 and s2 be two normal s-agents functioning simultaneously within the same environment e. If the behavior of s1 is identical to that of s2 at all moments, meaning that the same evolution rule is applied both in s1 and in s2, we say that the two agents are equivalent. Equivalence also covers the situations of blockages and infinite cycles.

4 Reducing the normal s-agents

We will further consider a normal s-agent and show that the following theorem holds.

Theorem 1. A normal s-agent s can be reduced to a normal s-agent t that is equivalent to s, by eliminating some evolution relations, according to the following algorithm.

Reduction algorithm(s-agent s, s-agent t);
{ local variables: a, s, e, ai, blockage, obj_realized, m;
  t = ∅;
  Step 1:
    for each ai from s:
      if st(ai) ≠ λ then (a, s, e) ← (ai, st(ai), st(e));
    end for;
  Step 2:
    m = ∅; cycle = false; blockage = false; obj_realized = false;
    while (not cycle) and (not obj_realized) and (not blockage):
    { m = m ∪ {(a, s, e)};
      if in s there exists an edge (a, s) → (b, t) labeled with (e, f) then
      { add vertex s (in agent a) and vertex t (in agent b) to t;
        add edge (a, s) → (b, t) to t;
        label edge (a, s) → (b, t) with (e, f);
        if f = en then obj_realized = true
        else if there is no f′ > f with f′ ≠ en then obj_realized = true
        else if (b, t, f) ∈ m then cycle = true
        else (a, s, e) ← (b, t, f) }
      else blockage = true } }.

Proof. According to Lemma 1, the s-agent s always has a state made up of n−1 inactive states (λ) and a single active state belonging to one single agent. This means that the first part of the algorithm determines a unique agent a with a single active state s, and e will be the current state of the environment.
The "while" cycle in the second part ends because inside it all possible cases are covered by the "if-then-else" branches. Building the s-agent t is done using the instructions in the framed part. We have used the graph representation of s and t. The instructions inside the dotted frame realize the inclusion in t of the agents a and b (unless they are already there), of their states s and t (unless they are already there), and of the evolution relation (a, s, e) → (b, t, f). The algorithm extracts from the graph of s a chain t, until the final state, the "largest" final state, is reached, a blockage occurs, or an infinite cycle is entered. t will thus be a normal s-agent because, if at a given moment, according to Lemma 1, {s1, s2, ..., sn} = {t1, t2, ..., tn} and {s1, s2, ..., sn} = {s, λ, λ, ..., λ} hold, it also results that {t1, t2, ..., tn} = {s, λ, λ, ..., λ}.

Observation 1. The previous result is equivalent, in the theory of formal languages and automata, to the elimination of inaccessible and unusable states from a deterministic finite automaton [1].

Corollary 1. An s-agent s gets blocked if and only if its reduced agent t gets blocked.

Corollary 2. An s-agent s has an infinite cycle if and only if its reduced agent t has an infinite cycle.

According to the algorithm in Theorem 1, the s-agent t is equivalent to the s-agent s, therefore the behaviors of s and t are identical. If s gets blocked, then t gets blocked, and reciprocally. If t has an infinite cycle, then s has the same cycle too (though it may have some others as well). The algorithm in Theorem 1 can be completed with a sequence of pseudo-code in order to display the configuration in which both s and t get blocked or enter an infinite cycle.

5 Practical implementations

A first example of a MAMS is one where the environment is given by an intelligent hyper-encyclopedia.
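The reduction of Theorem 1 can be sketched as follows, under the simplifying assumption of a deterministic rule map (names such as `reduce_rules` and the sample rules are illustrative): only the rules actually applied along the single active chain are copied into the reduced agent; the never-applicable ones are dropped.

```python
def reduce_rules(rules, start, final_env):
    """Keep only the evolution rules the s-agent actually applies when run
    from `start` (in the spirit of the reduction algorithm of Theorem 1).
    Returns the reduced rule set and the reason the walk stopped."""
    kept, seen, cfg = {}, set(), start
    while True:
        if cfg[2] == final_env:
            return kept, "goal"
        if cfg in seen:
            return kept, "cycle"
        seen.add(cfg)
        if cfg not in rules:
            return kept, "blockage"
        kept[cfg] = rules[cfg]   # this rule is part of the behavior
        cfg = rules[cfg]

rules = {
    ("a1", 1, "e0"): ("a1", 2, "e1"),
    ("a1", 2, "e1"): ("a2", 1, "en"),
    ("a2", 7, "e1"): ("a1", 1, "e1"),  # never applicable: eliminated
}
reduced, status = reduce_rules(rules, ("a1", 1, "e0"), "en")
```

Note how the reduced agent preserves the outcome of the run (Corollaries 1 and 2): the walk stops for the same reason whether it uses `rules` or `reduced`.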
We described in [4] the concept of intelligent hyper-encyclopedia, which, intuitively, means an encyclopedia developed in a distributed way by several users and which uses artificial intelligence tools. For the time being, we only mention that the environment e is made up of the content of the hyper-encyclopedia. By the content of the hyper-encyclopedia we understand the totality of its entries. Each entry is defined by a word, a definition of it, and references to the other related entries. The states of the environment are states of the content of this hyper-encyclopedia at a certain moment, that is, the totality of the entries introduced by the respective moment. We certainly have an initial state e0, when the hyper-encyclopedia contains nothing, but also a final state en, when the hyper-encyclopedia is considered by the users to be definitive and complete. The order relation "<" between the states of the hyper-encyclopedia is a partial order, because we can say that one state is smaller than another only if the contents of the encyclopedia at a certain moment are included in the contents of the encyclopedia at another (later) moment. However, we cannot compare two states in which the hyper-encyclopedia has completely different contents, or only a common part. The aim of the MAMS is to develop the hyper-encyclopedia as much as possible. The s-agents will be local multi-agent systems. Each user who contributes to the enlargement of the hyper-encyclopedia has his/her own s-agent. In [3], [4] we presented a real architecture of such an s-agent. Within each s-agent there is a series of atomic agents bearing different tasks. We presented the agents responsible for monitoring the Microsoft Word editing activity on the content of the hyper-encyclopedia, and we dealt with the agents that monitor web search using the Internet Explorer browser.
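The partial order on encyclopedia states described above is just content inclusion, which a short sketch makes concrete (the entry names are illustrative; a state is modelled here as the set of its entries):

```python
def less_or_equal(state_a, state_b):
    """Partial order on hyper-encyclopedia states: a <= b iff every entry
    of a is also an entry of b (set inclusion)."""
    return state_a <= state_b

e1 = {"agent", "environment"}
e2 = {"agent", "environment", "monitoring"}
e3 = {"markov chain"}
assert less_or_equal(e1, e2)                                  # e1 < e2
assert not (less_or_equal(e2, e3) or less_or_equal(e3, e2))   # incomparable
```

States with disjoint or only partially overlapping contents are incomparable, which is exactly why "<" is partial rather than total here.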
The encyclopedia behaves intelligently in these directions: it raises certain questions in order to generate new information, based on the existing one; it answers certain questions from the users in order to provide them with the information necessary to generate new knowledge; it adapts itself to the users' language and to their way of working (consulting and modifying the databases or interacting with the others); it provides an intelligent mediation in the communication among users, so that they may learn from each other, and the encyclopedia from them. The intelligent hyper-encyclopedia, having so many "human" traits even in its limited version, will have a network structure where each node holds a local database, a human user, and a system of intelligent agents, thus: an agent which observes the user's behavior while consulting the encyclopedia and adapts itself to it; an agent which observes the user's behavior while updating the local database (modifying, deleting, adding data, or generating rules) and acts accordingly; an agent which modifies the local database on its own in order to adapt it to the user of that node (in order to ease his/her work). All the above-mentioned agents communicate among themselves and cooperate to learn from each other and to modify their own local databases, but also the way in which they interact with the users of the nodes. Communication among agents and human users will often be achieved in natural language, through certain conversational agents. They are also capable of "metamorphosis" in order to adapt themselves to the users, or, in case this cannot be done, agents can move from one node to another. The MAGELAN system [9] solves tasks characteristic of artificial trainers which assist (monitor, help, and manage) several students (users) in developing an intelligent hyper-encyclopedia.
One evolution rule of the type (a, s, e) → (a, t, f) can thus be interpreted: if the hyper-encyclopedia is in state e and the agent a notices that the user searches the web for the term s, then the hyper-encyclopedia passes to state f (its content is improved by creating new links between the articles or by other means) and the user is advised to search for the term t. Obviously, the user may or may not follow the system's advice. A second example is given by an instructive system, ConTTest [5]. The environment is represented by the knowledge of the learner. In this case, we first assume that the student knows nothing, therefore the environment lies in state e0. When the student has achieved what we had in mind, the environment is in its final state. The aim of the MAMS is to make the student know everything, or at least as much as possible. If we assume that the s-agents are multi-agent systems, each of them responsible for one lesson or one learning unit, then each s-agent will contain atomic agents with specific pedagogical tasks. For example, one agent will be responsible for teaching some content, another for offering examples, another for generating tests. The states of an s-agent are stages in teaching, illustration, testing, and so on. Let us consider the agents as didactic actions. For example, agent a can mark the teaching, and agent b the testing. An evolution rule of the type (a, s, e) → (a, t, f) tells us something like this: if we have taught (a) the notion s and the student's level is e, then we are to teach the notion t next, and the student will reach level f. An evolution rule of the type (a, s, e) → (b, t, f) tells us that if we have taught the notion s and the student's level is e, we can then pass on to testing (agent b) the student through exercise t, and we expect the student to reach the (superior) level f.
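The pedagogical reading of the rules can be sketched by unfolding a lesson plan from them. The encoding below is a hypothetical illustration ("teach" plays the role of agent a, "test" of agent b; the notions and levels are made up), not the ConTTest implementation:

```python
# (agent, notion/exercise, level) -> next didactic action
rules = {
    ("teach", "s", "e"): ("teach", "t", "f"),  # taught s at level e: teach t next
    ("teach", "t", "f"): ("test", "t1", "g"),  # then test via exercise t1
}

def lesson_plan(rules, start):
    """Unfold the sequence of didactic actions prescribed by the rules."""
    plan, cfg = [start], start
    while cfg in rules:
        cfg = rules[cfg]
        plan.append(cfg)
    return plan

plan = lesson_plan(rules, ("teach", "s", "e"))
```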
6 Reengineering MAMS systems

Multi-agent monitoring systems are submitted to a process of reengineering in order to achieve new objectives. Software reengineering proves to be valuable [6] in large-scale application development. By applying this procedure, we keep a valuable software system, such as the MAGELAN hyper-encyclopedia, in working condition for a longer period of time. The cost is significantly reduced and the new tasks are efficiently accomplished. If we take into consideration the application's functionalities, formula (6.1) describes the elements regarding the connection of the modules that remain unchanged. Elements which are to be modified or deleted are identified as well, so that the entire application becomes a strongly connected system.

IR = ∪_{i=1}^{n} Funct_i^0 + ∪_{i=1}^{n} Funct_i^1 − ∪_{i=1}^{n} DelFunct_i  (6.1)

IR represents the indicator of reengineering, Funct_i^0 is the functionality at the initial moment, Funct_i^1 is the functionality added at the present moment due to reengineering, DelFunct_i is the functionality eliminated by reengineering, and n represents the number of functionalities. ConTTest (Figure 3) evolved significantly due to reengineering, the agents being enriched with new teaching and learning methods and procedures. Therefore, evolution in objectives brought about evolution in the functionalities of agents. The pedagogical tasks of the agents have been considerably improved. Consequently, higher educational objectives and better results in the evaluation process have been obtained.

Figure 3: Validation of answers

We may state that each software project has been enhanced with a higher level of quality, evolving correspondingly with the evolution in demands, by integrating reengineering.

7 Conclusions

Our aim was to summarize the results obtained in the development and maintenance of distributed applications and multi-agent monitoring systems.
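Reading the sums and differences in formula (6.1) as set operations, the reengineering indicator can be sketched as follows (the functionality names are illustrative assumptions):

```python
def reengineering_indicator(funct0, funct1, delfunct):
    """Set-theoretic reading of formula (6.1): initial plus added
    functionalities, minus the ones reengineering removed.
    Each argument is a list of per-module functionality sets."""
    kept_and_added = set().union(*funct0) | set().union(*funct1)
    return kept_and_added - set().union(*delfunct)

ir = reengineering_indicator(
    funct0=[{"teach", "test"}, {"report"}],   # functionalities before
    funct1=[{"adaptive_tests"}, set()],       # added by reengineering
    delfunct=[{"report"}],                    # eliminated by reengineering
)
```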
Practical examples, such as MAGELAN and ConTTest, have been given in order to implement and test the theoretical approaches. Multi-agent monitoring systems bring quality and efficiency to almost every area of activity, including scientific research, education, healthcare, and defense. By implementing the solid theoretical concepts of artificial intelligence and uniting them with the vast resources of distributed systems and the internet, we gain the possibility to increase the quality of the results and to reduce the time needed to reach the objectives. Reengineering maintains the multi-agent systems and expands their utility in time and functionality. Consequently, software which proved to be precious in the activity continues to be efficient in service for a longer period of time.

Bibliography

[1] Atanasiu, A., 2007, Formal Languages and Automata, Infodata Publishing House, Cluj-Napoca, Romania
[2] Dzitac, I., Bărbat, B.E., Artificial Intelligence + Distributed Systems = Agents, Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, 4(1):17-26, 2009
[3] Pătruţ, B., Pandele, I., 2008, How to Compute the References Emergencies in a Hyperencyclopedya, in "Recent Advances in Systems Engineering and Applied Mathematics. Selected Papers from the WSEAS Conferences in Istanbul, May 27-30, 2008", ISBN 978-9606766-91-6, ISSN 1790-2769, Istanbul, Turkey, pp. 72-75
[4] Pătruţ, B., Socaciu, T., 2008, Constructing a Hyper-Enciclopedya: How to Transfer the Emergencies between the Nodes, in Proceedings of the Fourth International Bulgarian-Greek Conference (Computer Science' 2008), 18-19 September 2008, Kavala, Greece, pp. 468-473
[5] Pătruţ, B., Vârlan, S.
E., Socaciu, T., 2008, ConTTest: A Multiagent System for Accounting Education, in Proceedings of the Third International Multi-Conference on Computing in the Global Information Technology, ICCGI 2008, July 27 - August 1, 2008, Athens, Greece
[6] Tomozei, C., 2009, Security Engineering and Reengineering on Windows 2008 Server Based Distributed Systems, Journal of Information Technology & Communication Security, SECITC 2009, Bucharest, pp. 63-73
[7] Vlassis, N., 2007, A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence, Synthesis Lectures on Artificial Intelligence and Machine Learning, Department of Production Engineering and Management, Technical University of Crete, Morgan and Claypool Publishers
[8] Weiss, G. (ed.), 1999, Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence, MIT Press, ISBN 978-0-262-23203-6
[9] FIPA Standards Specifications: http://www.fipa.org/specifications/index.html

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 3, pp. 291-300

P Systems Computing the Period of Irreducible Markov Chains

Mónica Cardona-Roca, M. Ángels Colomer-Cugat, Agustín Riscos-Núñez, Miquel Rius-Font

Mónica Cardona-Roca, M. Ángels Colomer-Cugat
Dpt. of Mathematics, University of Lleida
Av. Alcalde Rovira Roure, 191, 25198 Lleida, Spain
E-mail: {mcardona, colomer}@matematica.udl.es

Agustín Riscos-Núñez
Research Group on Natural Computing
Dpt. of Computer Science and Artificial Intelligence, University of Sevilla
Avda. Reina Mercedes s/n, 41012 Sevilla, Spain
E-mail: ariscosn@us.es

Miquel Rius-Font
Department of Applied Mathematics IV, Universitat Politécnica de Catalunya
Edifici C3, Despatx 016, Av.
del Canal Olímpic, s/n, 08860 Castelldefels, Spain
E-mail: mrius@ma4.upc.edu

Received: April 5, 2009   Accepted: May 30, 2009

Abstract: It is well known that any irreducible and aperiodic Markov chain has exactly one stationary distribution, and for any arbitrary initial distribution, the sequence of distributions at time n converges to the stationary distribution; that is, the Markov chain approaches equilibrium as n → ∞. In this paper, a characterization of aperiodicity in existential terms of some state is given. At the same time, a P system with external output is associated with any irreducible Markov chain. The designed system provides the aperiodicity of that Markov chain and spends a polynomial amount of resources with respect to the size of the input. A comparative analysis with respect to another known solution is described.

Keywords: Markov chain, P systems, membrane computing

1 Introduction

A discrete-time Markov chain is a stochastic process such that the past is irrelevant for predicting the future, given knowledge of the present. That is, given the present, the future does not depend on the past: the result of each event depends only on the result of the previous event. In order to study the evolution in time of a Markov chain, as well as the existence of the stationary distribution, it is suitable to classify its states. This classification depends on the path structure of the chain. One of the central issues in Markov theory is the study of the asymptotic behavior of Markov chains.
It is well known that for any irreducible and aperiodic Markov chain: (a) there exists at least one stationary distribution (that is, a probability distribution on the state space which is invariant for the transition matrix associated with the chain), and (b) for any initial distribution μ^(0) and for any stationary distribution π of the Markov chain, the sequence (μ^(n))_{n∈ℕ} converges to π in total variation as n → ∞ (that is, the Markov chain approaches equilibrium as n → ∞). In paper [2], a classification of the states of a finite and homogeneous Markov chain is provided by using P systems. Moreover, the period is calculated for recurrent classes. The design of the P systems was inspired by properties used in classic algorithms that deal with the classification problem. In particular, this solution allows us to decide whether an irreducible Markov chain is aperiodic or not. The main goal of this paper is to design a P system associated with an irreducible Markov chain which provides an answer to the aperiodicity of the chain. If the answer is negative, then the system provides the period of the chain. The solution presented is based on a characterization of aperiodicity in existential terms of some state and a natural number, and it is semi-uniform, in the sense that for each Markov chain a P system associated with it is constructed. Besides, the solution spends a polynomial amount of resources in the sense of computational complexity theory in membrane computing. The solution presented in this paper improves the solution obtained in [2], because fewer computational resources are used. The paper is organized as follows. In the following section, we recall some basic notions and results used in the paper.
In Section 3, a P system associated with an irreducible Markov chain is constructed in order to study the periodicity of that class. In Section 4, the presented solution is compared with another solution given in [2]. Finally, some conclusions are presented.

2 Preliminaries

A discrete Markov chain is a sequence {X_t | t ∈ ℕ} of random variables, whose values are called states, that verifies the following property:

P(X_{t+1} = j | X_0 = i_0, X_1 = i_1, ..., X_t = i_t) = P(X_{t+1} = j | X_t = i_t).

Without loss of generality, we can suppose that the state space is a set of nonnegative integers. The value of the variable X_t is interpreted as the state of the process at instant t. In this paper we work with Markov chains having a finite state space S = {s_1, ..., s_k}. A discrete Markov chain is characterized by the transition probabilities

p_ij(t) = P(X_t = s_j | X_{t-1} = s_i), ∀t ≥ 1,

where p_ij(t) provides the probability of the transition from state s_i at time t-1 to state s_j at time t. The matrix of transition probabilities P(t) = (p_ij(t))_{1≤i,j≤k} is a stochastic matrix, that is, it is nonnegative for all t and the sum of each row is equal to 1: Σ_{j=1}^{k} p_ij(t) = 1. We say that the chain is time-homogeneous or stationary if p_ij(t) = p_ij for each t, and it verifies the Kolmogorov-Chapman equation:

p_ij^(1) = p_ij,  p_ij^(2) = Σ_{l=1}^{k} p_il p_lj,  ...,  p_ij^(n) = Σ_{l=1}^{k} p_il p_lj^(n-1),

where p_ij^(n) is the transition probability from state s_i to state s_j in n steps. We denote the initial distribution by the vector

μ^(0) = (μ_1^(0), ..., μ_k^(0)) = (P(X_0 = s_1), P(X_0 = s_2), ..., P(X_0 = s_k)),

and the distribution of the Markov chain at time n is

μ^(n) = (μ_1^(n), ..., μ_k^(n)) = (P(X_n = s_1), P(X_n = s_2), ..., P(X_n = s_k)).

Then μ^(n) = μ^(0) · P^(n), where P = (p_ij) is the transition matrix of the homogeneous Markov chain. Next, we introduce some concepts and results related to the states of a homogeneous Markov chain.
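The relation μ^(n) = μ^(0) · P^n can be sketched with plain Python lists (the two-state matrix below is an illustrative example):

```python
def evolve(mu0, P, n):
    """Compute mu^(n) = mu^(0) * P^n by n successive vector-matrix products."""
    mu = list(mu0)
    for _ in range(n):
        mu = [sum(mu[i] * P[i][j] for i in range(len(P))) for j in range(len(P))]
    return mu

# Two-state homogeneous chain: from s1 the chain always moves to s2.
P = [[0.0, 1.0], [0.5, 0.5]]
mu2 = evolve([1.0, 0.0], P, 2)   # distribution after two steps
```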
We say that a state s_j communicates with another state s_i (denoted s_i → s_j) if there exists a natural number n > 0 such that p_ij^(n) > 0 (that is, if the chain has a positive probability of ever reaching s_j when starting from s_i). We say that the states s_i and s_j intercommunicate (denoted s_i ↔ s_j) if s_i → s_j and s_j → s_i. On the finite state space S = {s_1, ..., s_k} of a Markov chain, the relation ↔ is an equivalence relation, and we can consider the corresponding quotient set {s_1, ..., s_k}/↔, whose elements are the equivalence classes of ↔. A Markov chain with state space S = {s_1, ..., s_k} is said to be irreducible if there exists only one equivalence class with respect to ↔, that is, if for all s_i, s_j ∈ S we have s_i ↔ s_j. Otherwise, the chain is said to be reducible. We say that a state s_i is recurrent or essential if for each natural number m and for each state s_j verifying p_ij^(m) > 0 there exists a natural number n such that p_ji^(n) > 0. Otherwise, the state is said to be transient. A recurrent class is the equivalence class determined by a recurrent state. It is easy to prove that from a recurrent state only recurrent states belonging to the same class are reachable. A recurrence time of s_i is a natural number n > 0 such that p_ii^(n) > 0. The period of a state s_i is defined as d(i) = g.c.d. {n ≥ 1 | p_ii^(n) > 0}, that is, it is the greatest common divisor of the recurrence times associated with it. All states belonging to the same class have the same period. Then, we can define the period of a class of a given Markov chain in a natural manner: it is the period of any state of the class (see [3] and [4] for more details).

Definition 1. A Markov chain is said to be aperiodic if all its states are aperiodic, that is, their periods are equal to 1. Otherwise, the chain is said to be periodic.

Next, we provide a method to compute the period of a recurrent class and a characterization of the periodicity of a class.
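The definition d(i) = gcd{n ≥ 1 | p_ii^(n) > 0} can be sketched on the boolean adjacency matrix of the chain's graph; only whether p_ii^(n) is positive matters, not its value. This is an illustrative sketch (assumed names, scanning return times only up to a bound `max_n`):

```python
from math import gcd

def state_period(adj, i, max_n):
    """d(i) = gcd of the return times n with (A^n)_ii > 0, where A is the
    boolean adjacency matrix; here we scan n = 1 .. max_n."""
    k = len(adj)
    d, M = 0, [row[:] for row in adj]   # M holds the boolean power A^n
    for n in range(1, max_n + 1):
        if n > 1:
            M = [[int(any(M[a][l] and adj[l][b] for l in range(k)))
                  for b in range(k)] for a in range(k)]
        if M[i][i]:
            d = gcd(d, n)
    return d

# A two-state cycle returns only at even times: period 2.
# Adding a self-loop creates a return time of 1: period 1.
```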
Theorem 2. Let A = {s_1, ..., s_r} be a recurrent class. The period of A is d = g.c.d. {n | p_ii^(n) > 0; 1 ≤ i, n ≤ r}.

Proof. As all states have the same period d, we have d = d(1) = d(2) = ... = d(r) = g.c.d. {n ≥ 1 | p_ii^(n) > 0; 1 ≤ i ≤ r}. Let d′ = g.c.d. {n | p_ii^(n) > 0; 1 ≤ i, n ≤ r}. Let us see that d = d′. For that, let n > r be a recurrence time associated with a state s_{i_0} ∈ A, that is, p_{i_0 i_0}^(n) > 0. There exists a state s_{i_1} such that p_{i_0 i_0}^(n) ≥ p_{i_0 i_1}^(n′) · p_{i_1 i_1}^(n_1) · p_{i_1 i_0}^(n″) > 0, where n = n′ + n_1 + n″. Thus, n_1 and n′ + n″ are also recurrence times. If n_1 > r or n′ + n″ > r, then we repeat the process until we obtain a decomposition p_{i_0 i_0}^(n) ≥ p_{i_0 i_1}^(n′) · p_{i_1 i_1}^(n_1) · p_{i_2 i_2}^(n_2) ... p_{i_t i_t}^(n_t) · p_{i_t i_0}^(n″) > 0, with 1 ≤ i_1, ..., i_t ≤ r and n = n′ + n_1 + ... + n_t + n″, verifying n′ + n″ ≤ r and n_1, ..., n_t ≤ r. Finally, let us notice that, substituting p_{i_0 i_0}^(n), with n > r, by a suitable sequence of the above factors with exponents at most r, the g.c.d. is the same.

Lemma 3. Let A = {a_1, ..., a_r} be a set of natural numbers with g.c.d. {a_1, ..., a_r} = 1. Let us denote by A+ the set of all positive linear combinations λ_1 a_1 + ... + λ_r a_r, with λ_i ∈ ℤ+, 1 ≤ i ≤ r. Then there exists a natural number N such that n ∈ A+ for all n ≥ N.

Proof. See, e.g., the appendix of [1].

Next, we characterize the aperiodicity of a recurrent class of a finite Markov chain through the existence of a state s_j reachable from each state s_i.

Theorem 4. Let {X_t | t ∈ ℕ} be a Markov chain with state space S = {s_1, ..., s_k} and transition matrix P = (p_ij). (1) If {X_t | t ∈ ℕ} is aperiodic, then there exists a natural number N such that p_ii^(n) > 0 for all i (1 ≤ i ≤ k) and all n ≥ N. (2) If {X_t | t ∈ ℕ} is irreducible and aperiodic, then there exists a natural number M such that p_ij^(n) > 0 for all i, j (1 ≤ i, j ≤ k) and all n ≥ M.

Proof.
See, e.g., Chapter 4 of [3].

Theorem 5. Let A = {s_1, . . . , s_r} be a recurrent class of a finite Markov chain. The following are equivalent:
(1) The class A is aperiodic.
(2) There exist a state s_j ∈ A and a natural number m ∈ N such that p^(m)_ij > 0 for every state s_i ∈ A.

Proof. Let us suppose that the class A is aperiodic. Then all the states in A have the same period d = 1. From Theorem 4 there exists a natural number N such that p^(n)_ii > 0 for all i (1 ≤ i ≤ r) and all n ≥ N. Given j (1 ≤ j ≤ r), we define n_i(j) = min { n | p^(n)_ij > 0 } for each s_i ∈ A, n(j) = max { n_1(j), . . . , n_r(j) }, and m = N + n(j). Let us see that p^(m)_ij > 0 for each i (1 ≤ i ≤ r). We have p^(m)_ij ≥ p^(n_i(j))_ij · p^(m − n_i(j))_jj > 0, because p^(n_i(j))_ij > 0 by the definition of n_i(j), and p^(m − n_i(j))_jj > 0 by Theorem 4, since m − n_i(j) = N + n(j) − n_i(j) ≥ N.

Conversely, let us suppose that there exist m ≥ 1 and a state s_j ∈ A such that for all s_i ∈ A we have p^(m)_ij > 0. In particular, p^(m)_jj > 0, so m is a recurrence time. On the one hand, if d is the period of the class, then m is a multiple of d. On the other hand, if s_i ∈ A is a state such that p_ji > 0, then 0 < p^(m)_ij · p_ji ≤ p^(m+1)_ii, so m + 1 is also a multiple of d. Hence, d = 1.

3 A P system associated with an irreducible Markov chain

The goal of this paper is to study the aperiodicity of an irreducible Markov chain with state space S = {s_1, . . . , s_k} by using P systems. In the affirmative case, the answer of the system is Yes; on the contrary, the system sends an object encoding the period of the class to the environment.

3.1 The design of the P system

Let P_k = (p_ij), 1 ≤ i, j ≤ k, be the Boolean matrix associated with a recurrent class of a finite and homogeneous Markov chain of order k, such that p_ij = 1 if the transition from s_i to s_j is possible, and p_ij = 0 otherwise; that is, P_k is the adjacency matrix of the directed graph associated with the recurrent class.
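Condition (2) of Theorem 5 can be checked mechanically on such an adjacency matrix: the class is aperiodic if and only if some Boolean power of the matrix has a column with all entries positive. A small sketch (the search cap k·k is our own choice for this illustration, safe by Wielandt's primitivity bound (k−1)² + 1; it is not a bound taken from the paper):

```python
def is_aperiodic(adj):
    """Test condition (2) of Theorem 5: the class is aperiodic iff some
    column j of a Boolean power P^m is entirely positive, i.e. state s_j
    is reachable from every state in exactly m steps. The cap k*k on m
    is a safe bound for primitive matrices (Wielandt: (k-1)^2 + 1)."""
    k = len(adj)
    power = [row[:] for row in adj]     # Boolean power P^1
    for m in range(1, k * k + 1):
        # a column of all-true entries witnesses condition (2)
        if any(all(power[i][j] for i in range(k)) for j in range(k)):
            return True
        power = [[any(power[i][l] and adj[l][j] for l in range(k))
                  for j in range(k)] for i in range(k)]
    return False
```

A pure cycle never produces an all-positive column (its powers are permutation matrices), while a chain with a self-loop does.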
The solution presented in this paper is a semi-uniform one in the following sense: we give a family Π = { Π(P_k) | k ∈ N }, associating with P_k a P system with external output, such that (a) there exists a deterministic Turing machine working in polynomial time which constructs the system Π(P_k) from P_k; and (b) the output of the P system Π(P_k) provides the classification of the recurrent class of the Markov chain as well as the period of its states.

We associate with the matrix P_k the P system of degree 4 with external output

P Systems Computing the Period of Irreducible Markov Chains 295

Π(P_k) = (Γ(P_k), µ(P_k), M_1, M_2, M_3, M_4, R), defined as follows:

• Working alphabet: Γ(P_k) = { s_ij, t_ij, τ_ij | 1 ≤ i, j ≤ k } ∪ { s_ijr | 1 ≤ i, j ≤ k, 1 ≤ r ≤ α } ∪ { t_r | 1 ≤ r ≤ k } ∪ { β_l | 1 ≤ l ≤ k − 1 } ∪ { b_i, p_i | 1 ≤ i ≤ k } ∪ { c_i, d_i | 1 ≤ i ≤ α } ∪ { yes, Yes, σ }, where α = k + ⌈k/2⌉.

• Membrane structure: µ(P_k) = [_1 [_2 [_3 [_4 ]_4 ]_3 ]_2 ]_1.

• Initial multisets: M_1 = { t_ij^{p_ij} | 1 ≤ i, j ≤ k } ∪ { β_1 }; M_2 = { s_ii | 1 ≤ i ≤ k }; M_3 = { b_i | 1 ≤ i ≤ k } ∪ { d_1 }; M_4 = ∅.

• The set R of evolution rules consists of the following rules:
r_1 ≡ [ t_ij → τ_ij t_ij^k ], 1 ≤ i, j ≤ k
r_2 ≡ [ β_i → β_{i+1} ], 1 ≤ i ≤ k − 2
r_3 ≡ [ β_{k−1} ] → c_1^k
r_4 ≡ [ c_r s_ij τ_j1^{p_j1} · · · τ_jk^{p_jk} ] → [ s_i1^{p_j1} · · · s_ik^{p_jk} c_{r+1}^{γ_j} ] s_{i1(r+1)}^{p_j1} · · · s_{ik(r+1)}^{p_jk} t_{r+1}^{p_ji}, 1 ≤ i, j ≤ k, 1 ≤ r ≤ α − 1, where γ_j = Σ_{l=1}^{k} p_jl
r_5 ≡ [ σ ] → σ
r_6 ≡ [ s_{1jr} · · · s_{kjr} ] → [ σ ] yes, 1 ≤ j ≤ k, 1 ≤ r ≤ α
r_7 ≡ [ t_r b_r → p_r ], 1 ≤ r ≤ k
r_8 ≡ [ p_i p_{i+l} → p_i p_l ], 1 ≤ i ≤ k, 1 ≤ l ≤ k − i
r_9 ≡ [ p_i^2 → p_i ], 1 ≤ i ≤ k
r_10 ≡ [ d_i → d_{i+1} ], 1 ≤ i ≤ α − 1
r_11 ≡ [ d_α p_r ] → p_r [ ], 2 ≤ r ≤ k
r_12 ≡ [ d_α p_1 ] → yes [ ]
r_13 ≡ [ yes ] → Yes [ ]
r_14 ≡ [ p_r ] → p_r [ ], 2 ≤ r ≤ k

3.2 An overview of the computations

Initially, membrane 1 contains the objects t_ij that codify the entries p_ij of the Boolean matrix associated with the transition matrix of the Markov chain, together with the counter β_1.
This counter allows us to dissolve membrane 1 at a certain instant. Membrane 2 initially contains the objects s_ii that codify the states s_i of the chain. Membrane 3 contains the objects b_i, which will be used to avoid producing repeated recurrence times smaller than or equal to k. The counter d in membrane 2 will be used to trigger the answer at the suitable instant.

The design of the P system Π(P_k) implements a process structured in stages. The first one consists of k steps, which allow the production of sufficiently many new copies τ_ij of the objects t_ij. This is done by applying the rules t_ij → τ_ij t_ij^k and β_i → β_{i+1} in membrane 1 during the first k − 1 steps, and applying at step k the rule that dissolves membrane 1.

At the second stage, all paths between states of length at most k, as well as the recurrence times smaller than or equal to k, are generated. This stage starts at step k + 1 and spends at most k steps. First, the rules consuming the objects c_r are applied, producing objects s_ijr in membrane 3 that codify the existence of a path of length r from state s_i to state s_j, as well as objects t_r codifying the existence of a recurrence time equal to r. Simultaneously, it is checked whether there exist a state s_j and a natural number m such that p^(m)_ij > 0 for all states s_i. In that case, an object σ is produced in membrane 2 and the system expels an object Yes to the environment.

The third stage is applied only if an object Yes has not been expelled to the environment. At this stage the period of the class is computed, which takes k + ⌈k/2⌉ steps. By applying the rules t_r b_r → p_r, objects p_r encoding the recurrence times smaller than or equal to k are obtained; these recurrence times are pairwise different. By applying the rules p_i p_{i+l} → p_i p_l and p_i^2 → p_i, the greatest common divisor of these times is computed.
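The two rule schemes of this last stage, p_i p_{i+l} → p_i p_l (replace the larger time of a pair by the difference) and p_i^2 → p_i (merge duplicates), implement a subtractive Euclidean algorithm on the set of recurrence times. A sequential sketch of the same idea (pairing the minimum with the maximum is just one admissible order of rule application; the function name is ours):

```python
def gcd_by_rewriting(times):
    """Subtractive g.c.d. on a set of recurrence times, mimicking the
    rule schemes p_i p_{i+l} -> p_i p_l (replace the larger member of a
    pair by the difference) and p_i p_i -> p_i (merge duplicates)."""
    s = set(times)                 # the stored recurrence times are distinct
    while len(s) > 1:
        a, b = min(s), max(s)
        s.remove(b)
        s.add(b - a)               # duplicates merge automatically (p_i^2 -> p_i)
    return s.pop()
```

Each step strictly decreases the sum of the set, so the loop terminates, and the g.c.d. of the set is invariant under both rewriting rules.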
If the period of the class is equal to 1, then the system sends an object Yes to the environment; otherwise, the system expels to the environment an object p_n that encodes the period of the class.

4 Results and discussion

In [2] a P system was constructed which allows us to classify the states of a Markov chain; thus, that P system can be adapted to characterize the aperiodicity of such a chain. Specifically, if P_k = (p_ij), 1 ≤ i, j ≤ k, is the Boolean matrix associated with the states of a recurrent class of a finite and homogeneous Markov chain of order k, then we define the system Π′(P_k) = (Γ′(P_k), µ′(P_k), M′_1, M′_2, M′_3, M′_4, R′, ρ′) as follows:

• Working alphabet: Γ′(P_k) = { a, r_i, t_ij | 1 ≤ i, j ≤ k } ∪ { c_r | 1 ≤ r ≤ k + 1 } ∪ { t_ijur | 1 ≤ i, j, u ≤ k, 1 ≤ r ≤ k } ∪ { β_i | 1 ≤ i ≤ γ + 1 } ∪ { s_ijr | 1 ≤ i, j ≤ k, 1 ≤ r ≤ k } ∪ { d_i | 1 ≤ i ≤ k − 1 }, where γ = k + 1 + ⌈lg k⌉ + (k − 1)(k + 2)/2.

• Membrane structure: µ′(P_k) = [_1 [_2 [_3 [_4 ]_4 ]_3 ]_2 ]_1.

• Initial multisets: M′_1 = ∅; M′_2 = { β_1 }; M′_3 = { c_1 }; M′_4 = { s_ii1 t_ij^{(k−1) p_ij} | 1 ≤ i, j ≤ k }.

• The set R′ of evolution rules consists of the following rules:
– Rules in the skin membrane, labeled by 1:
{ d_p → (r_p, out) | 1 < p ≤ k },
{ d_1 → (a, out) }.
– Rules in the membrane labeled by 2:
{ β_i → β_{i+1} | 1 ≤ i ≤ γ } ∪ { β_{γ+1} → λ },
{ d_j^2 → d_j | 1 ≤ j ≤ k },
{ d_j d_{j+l} → d_j d_l | 1 ≤ j ≤ k, 1 ≤ l ≤ k − j }.
– Rules in the membrane labeled by 3:
{ t_ijur → (t_ij s_{uj(r+1)}, in) | p_ij = 1, u ≠ j, 1 ≤ i, j, u ≤ k, 1 ≤ r < k − 1 },
{ t_iju(k−1) → (t_ij, in) | p_ij = 1, u ≠ j, 1 ≤ i, j, u ≤ k },
{ t_ijjr → (t_ij, in) d_{r+1} | p_ij = 1, 1 ≤ i, j ≤ k, 1 ≤ r < k − 1 },
{ t_ijj(k−1) → (t_ij, in) | p_ij = 1, 1 ≤ i, j ≤ k },
{ c_r → c_{r+1} | 1 ≤ r ≤ (k − 1) + 1 } ∪ { c_{(k−1)+2} → λ }.
– Rules in the membrane labeled by 4:
{ s_uir t_i1^{p_i1} · · · t_ik^{p_ik} → (t_i1ur^{p_i1} · · · t_ikur^{p_ik}, out) | 1 ≤ u, i ≤ k, 1 ≤ r ≤ k − 1 }.

• There is a single priority relation, in the membrane labeled by 2, between the two types of rules that rewrite the objects d_j: the rule d_j^2 → d_j is applied with priority over d_j d_{j+l} → d_j d_l.

In order to study the efficiency of the P system Π(P_k) constructed in this work, we compare the results with those obtained by the P system Π′(P_k) described above. For that purpose, a comparative analysis of the computational resources required by the two P systems is given first; secondly, an analysis of the execution times obtained with simulators designed for both P systems on some case studies is presented.

4.1 Computational resources required

The resources required initially to construct the systems Π(P_k) and Π′(P_k), and the number of steps taken by the systems, are the following:

                                        Π(P_k)    Π′(P_k)
Size of the alphabet                    Θ(k^3)    Θ(k^4)
Initial number of membranes             4         4
Sum of the sizes of initial multisets   Θ(k^2)    Θ(k^3)
Number of rules                         Θ(k^3)    Θ(k^4)
Maximal length of a rule                Θ(k)      Θ(k)
Number of priority relations            0         1
Number of steps                         Θ(k)      Θ(k)

Let us notice in the previous table that the amount of resources required by Π(P_k) is smaller than the amount required by Π′(P_k): indeed, the size of the alphabet and the number of rules pass from power 3 to power 4, and the system Π(P_k) has no priority relation. The number of steps is of the same asymptotic order.

4.2 Case studies

We have implemented a simulator for each of the systems Π(P_k) and Π′(P_k). These simulators are written in C++ and have been executed on a Pentium 4 computer with 512 MB of RAM running at 3.20 GHz. In both simulators, the objects t_ij are represented by two-dimensional arrays, the objects s_ij by two-dimensional vectors, and the recurrence times by one-dimensional vectors. The simulator of the system Π(P_k) generates the trajectories of length at most k + ⌈k/2⌉ in a sequential way, keeping the recurrence times smaller than or equal to k.
if assertion () in theorem 5 is fulfilled, the simulator halts displaying the time of execution and the aperiodicity of the markov chain. otherwise the simulator computes the g.c.d. of the recurrence times obtained where all of them are different. 298 mónica cardona-roca, m. ángels colomer-cugat, agustín riscos-núñez, miquel rius-font similarly, a simulator for the system π ′(pk) has been implemented. the main difference with respect to the previously mentioned one is that it can keep more than a copy of the times of recurrence. all trajectories of the markov chain with a length smaller than or equal to (k − ) and their recurrence time are computed. then the g.c.d. of these times is obtained. when the markov chain is aperiodic, the p system π (pk) can finish before all trajectories with a length k + ⌈k/⌉ are computed. in case it is necessary to calculate the period, bearing in mind that all recurrence times are different, system π (pk) is faster than π ′(pk) in computing the g.c.d. of these times. when the markov chain is periodic the length of the trajectories computed by π (pk) are longer than those computed by π ′(pk). nonetheless, in order to compute the period, recurrence times used in π (pk) are all different. the simulators designed have been executed on eight recurrent markov chains with 100 states. four of these markov chains are periodic and the others are aperiodic. table 1 shows the values equal to 1 of the adjacency matrix of the graph associated with the recurrent markov chains. the execution times are described in table 2. 
example 1 pi,i+ =   ≤ i ≤  p, =  2 pi,i+ =   ≤ i ≤  pi, =   ≤ i ≤  3 p j+i, j+i+ =   ≤ i ≤   ≤ j ≤  p j, j− =   ≤ j ≤  p j+, j+ =   ≤ j ≤  p, =  4 p j+i, j+i+ =   ≤ i ≤   ≤ j ≤  p j, j− =   ≤ j ≤  p j+, j+ =   ≤ j ≤  p, =  p, =  5 p j+i, j+i+ =   ≤ i ≤   ≤ j ≤  p j, j− =   ≤ j ≤  p j+, j+ =   ≤ j ≤  p, =  p, =  6 p j+i, j+i+ =   ≤ i ≤   ≤ j ≤  p j, j− =   ≤ j ≤  p j+, j+ =   ≤ j ≤  p, =  7 pi,i+ =   ≤ i ≤  pi+,i =   ≤ i ≤  p+i,+i =   ≤ i ≤  8 pi,i+ =   ≤ i ≤  pi+,i =   ≤ i ≤  p+i,+i =   ≤ i ≤  p, =  table 1. adjacency values of the examples 5 conclusions markov chains have applications in different fields such as physics, economics, biology, statistics, social sciences, etc. in these applications it is important to know whether the markov chain associated with the process is convergent or not. when the markov chain is aperiodic, the transition matrix converges and the process becomes stable. in other cases, the process does not reach an equilibrium. in this work, a characterization of the aperiodicity of a markov chain has been given in terms of the p systems computing the period of irreducible markov chains 299 example period π ′(pk) π (pk) 1 100 0 0 2 1 146 0 3 10 0 0 4 1 122 35 5 1 1 2 6 5 11 20 7 2 381 169 8 1 1101 104 table 2. observed run times existence of a state reachable from any other state. based on this property, a computational p system has been constructed that allows us to know whether the markov chain is aperiodic and calculate its period if not. in [2], every finite and homogeneous markov chain has associated a p system that provides a classification of its recurrent classes. that p system can be adapted to study the aperiodicity of a markov chain and then its period can be calculated. 
The solution presented in this work improves the solution derived from the P system described in [2]. To show this, simulators have been implemented for both P systems, and their respective execution times on eight examples have been analyzed. For the computational study of the aperiodicity of a Markov chain, it would be interesting to design new P systems that incorporate additional features, such as electrical charges or active membranes, and that quantitatively improve the amount of computational resources used.

Acknowledgement

The authors wish to thank Mario J. Pérez-Jiménez for his advice, suggestions and constant help. Among the numerous virtues of this excellent professor, we would like to highlight his enviable capacity for work and his great human quality. As is usually said, behind a great man there is a great woman, so we cannot end these lines without expressing our gratitude to Queta for her immense patience with all of us. The third author acknowledges the support of project TIN2006-13425 of the Ministerio de Educación y Ciencia of Spain, cofinanced by FEDER funds, and the support of the Project of Excellence with Investigador de Reconocida Valía of the Junta de Andalucía, grant P08-TIC-04200.

Bibliography

[1] P. Brémaud: Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues. Springer, New York, 1998.

[2] M. Cardona, M.A. Colomer, M.J. Pérez-Jiménez, A. Zaragoza: Classifying states of a finite Markov chain with membrane computing. Lecture Notes in Computer Science, Vol. 4361, Springer, pp. 266-278, 2006.

[3] O. Häggström: Finite Markov Chains and Algorithmic Applications. London Mathematical Society, Cambridge University Press, Cambridge, 2003.

[4] R. Nelson: Probability, Stochastic Processes, and Queueing Theory. Springer, New York, 1995.

Mónica Cardona-Roca received her degree in Mathematics in 2000 from the University of Barcelona.
She is an associate professor at the Department of Mathematics of the University of Lleida, where she has been teaching for eight years. Her main research areas are natural computing and membrane computing. She has co-authored two books in mathematics, 5 papers in international journals on natural computing, and 3 papers in statistics published in the proceedings of Spanish conferences.

M. Ángels Colomer-Cugat received her degree in Agronomic Engineering from the UPC (Polytechnic University of Catalonia) and her doctoral degree in Mathematics in 1996, also from the UPC. Currently, she is a titular professor of Statistics and Operations Research, where she has been teaching for more than twenty-five years. At present she is the head of an emergent research group on models of membrane computing applied to ecosystems. She has published seven books on statistics and quality control, and twelve scientific papers in international journals. Her main research area is models of computation with membranes applied to ecological and biological processes. Her first research centred on stochastic models, concretely Markov chains applied to natural processes, on which she has several works.

Agustín Riscos-Núñez obtained his master's degree in Mathematics in 2000, and his PhD degree in 2004. Currently he is an associate professor at the Department of Computer Science and Artificial Intelligence of the University of Sevilla (Spain). He is a member of the Research Group on Natural Computing at the same university, and his main research interests within the membrane computing area are complexity theory, models for biological processes, and computer simulation. He has co-authored about 30 papers in international journals in recent years, and he has also participated in 15 conferences and workshops.
Miquel Rius-Font received his bachelor's degree in Mathematics from the Universitat Autònoma de Barcelona and his doctoral degree in Mathematics from the University of Barcelona. He currently teaches mathematics at the Technical University of Catalonia, and his research areas are graph theory and natural computing. His interests involve isoperimetric problems as well as graph labeling problems.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 3, pp. 295-303

Tissue P Systems with Cell Division

Gheorghe Păun, Mario J. Pérez-Jiménez, Agustín Riscos-Núñez

Abstract: In tissue P systems several cells (elementary membranes) communicate through symport/antiport rules, thus carrying out a computation. We add to such systems the basic feature of (cell-like) P systems with active membranes: the possibility to divide cells. As expected (as is the case for P systems with active membranes), in this way we get the possibility to solve computationally hard problems in polynomial time; we illustrate this possibility with the SAT problem.

Keywords: tissue-like P systems, cell division rule, SAT problem, NP-complete problem

1 Introduction

In membrane computing, there are two main classes of P systems: those with the membranes arranged hierarchically, inspired by the structure of the cell, and those with the membranes placed in the nodes of a graph, all of them at the same level, inspired by the cell inter-communication in tissues. A particularly interesting subclass of the first class are the systems with active membranes, where membrane division can be used in order to solve hard problems, e.g., NP-complete problems, in polynomial or even linear time, by a space-time trade-off. In tissue P systems, the communication among cells is performed by means of symport/antiport rules, well known in biology. Details can be found in [8], [10], as well as in the comprehensive page at the web address http://ppage.psystems.eu.
In this paper we combine the two definitions and consider tissue P systems (with the communication done through symport/antiport rules) with cell division rules of the same form as in P systems with active membranes, but without using polarizations. The rules are used in the non-deterministic maximally parallel way, with the restriction that if a division rule is used for dividing a cell, then this cell does not participate in any other rule, for division or communication (the intuition is that, when dividing, the interaction of the cell with other cells or with the environment is blocked); the cells obtained by division have the same label as the mother cell, hence the rules to be used for evolving them or their objects are inherited (the label precisely identifies the available rules). This natural extension of tissue P systems provides the possibility of solving the SAT problem in polynomial time (with respect to the number of variables and of clauses), in a confluent way: at precise times, one of the objects yes or no is sent to the environment, giving the answer to the question of whether the input propositional formula is satisfiable. The construction is uniform: in polynomial time, a family of recognizing tissue P systems with cell division is constructed which, receiving as inputs encodings of instances of SAT, tells us whether or not these instances are satisfiable.

2 Tissue P systems with cell division

We assume the reader to be familiar with the basic elements of membrane computing, and we directly define the class of P systems which is investigated in this paper. A tissue P system with cell division of degree m ≥ 1 is a construct Π = (O, E, w_1, . . . , w_m, R, i_o), where:

Copyright © 2006-2008 by CCC Publications

1. m ≥ 1 is the initial degree of the system; the system contains m cells, labeled 1, 2, . . . , m; we will use 0 to refer to the environment;
2. O is the alphabet of objects;
3. w_1, . . . , w_m are strings over O, describing the multisets of objects placed in the m cells of the system at the beginning of the computation;
4. E ⊆ O is the set of objects present in the environment, in arbitrarily many copies each;
5. R is a finite set of evolution rules, of the following forms:
(a) (i, x/y, j), for i, j ∈ {0, 1, 2, . . . , m}, i ≠ j, and x, y ∈ O*: communication rules; 1, 2, . . . , m identify the cells of the system, 0 is the environment; when applying a rule (i, x/y, j), the objects of the multiset represented by x are sent from region i to region j and, simultaneously, the objects of the multiset y are sent from region j to region i;
(b) [a]_i → [b]_i [c]_i, where i ∈ {1, 2, . . . , m} and a, b, c ∈ O: division rules; under the influence of object a, the cell with label i is divided into two cells with the same label; in the first copy the object a is replaced by b, in the second copy the object a is replaced by c; all other objects are replicated, and copies of them are placed in the two new cells;
6. i_o ∈ {1, 2, . . . , m} is the output cell.

Therefore, we use antiport rules for communication (for a rule (i, x/y, j) we say that the maximum of the lengths of x and y is the weight of the rule) and division rules as in P systems with active membranes. The rules of a system as above are used in the non-deterministic maximally parallel manner customary in membrane computing. In each step, all cells which can evolve must evolve in a maximally parallel way (in each step we apply a multiset of rules which is maximal: no further rule can be added), with the following important mention: if a cell is divided, then the division rule is the only one applied to that cell in that step; its objects do not evolve by means of communication rules.
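The effect of a division rule [a]_i → [b]_i [c]_i on the multiset of a cell can be sketched as follows (a hypothetical helper of our own, with multisets represented as Counters):

```python
from collections import Counter

def divide_cell(cell, a, b, c):
    """Apply the division rule [a]_i -> [b]_i [c]_i to a cell.
    The trigger object a is consumed; one daughter cell receives b, the
    other receives c; every other object is replicated into both."""
    if cell[a] == 0:
        raise ValueError("rule not applicable: no object %r in the cell" % a)
    rest = cell.copy()
    rest[a] -= 1                   # one copy of the trigger is consumed
    if rest[a] == 0:
        del rest[a]
    d1, d2 = rest.copy(), rest.copy()
    d1[b] += 1                     # first daughter: a replaced by b
    d2[c] += 1                     # second daughter: a replaced by c
    return d1, d2
```

Both daughters carry the same label as the mother cell, so in a full simulator they would keep the mother's rule set.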
This is like saying that a cell which divides first cuts all its communication channels with the other cells and with the environment; the daughter cells will participate in the interaction with other cells or with the environment only from the next step, providing that they are not divided once again. Their label precisely identifies the rules which can be applied to them. The computation starts from the initial configuration and proceeds as defined above; only halting computations give a result, and the result is the number of objects present in the halting configuration in cell i_o; the set of numbers computed in this way by the various halting computations of Π is denoted by N(Π). In the present paper we are not interested in the computing power of systems as above (already systems without membrane division are known to be Turing complete, see [8], [6], etc.), but in their computing efficiency. That is why we introduce a variant of tissue P systems with membrane division, namely recognizing systems with input, following the definitions of complexity classes in terms of membrane computing (see [9]). Such a system has the form Π = (O, Σ, E, w_1, . . . , w_m, R, i_in), where:
• (O, E, w_1, . . . , w_m, R, 0) is a tissue P system with cell division of initial degree m ≥ 1 (as defined in the previous section, but with the environment, indicated by taking i_o = 0, used for reading the output of a computation), and w_1, . . . , w_m are strings over O − Σ.
• The working alphabet O has two distinguished objects yes and no, present in at least one copy in some initial multisets w_1, . . . , w_m, but not present in E.
• Σ is an (input) alphabet strictly contained in O.
• i_in ∈ {1, . . . , m} is the input cell.
• All computations halt.
• If C is a computation of Π, then either the object yes or the object no (but not both) must have been released into the environment, and only in the last step of the computation.
The computations of the system Π with input w ∈ Σ* start from a configuration of the form (w_1, w_2, . . . , w_{i_in} w, . . . , w_m; E), that is, after adding the multiset w to the contents of the input cell i_in. We say that the multiset w is recognized by Π if and only if the object yes is sent to the environment in the last step of the corresponding computation. We say that C is an accepting computation (respectively, rejecting computation) if the object yes (respectively, no) appears in the environment associated with the corresponding halting configuration of C.

Definition 1. We say that a decision problem X = (I_X, θ_X) is solvable in polynomial time by a family Π = { Π(n) | n ∈ N } of recognizer tissue-like P systems with cell division if the following holds:
• The family Π is polynomially uniform by Turing machines; that is, there exists a deterministic Turing machine working in polynomial time which constructs the system Π(n) from n ∈ N.
• There exists a pair (cod, s) of polynomial-time computable functions over I_X (called a polynomial encoding of I_X in Π) such that:
− for each instance u ∈ I_X, s(u) is a natural number and cod(u) is an input multiset of the system Π(s(u));
− the family Π is polynomially bounded with regard to (X, cod, s); that is, there exists a polynomial function p such that for each u ∈ I_X every computation of Π(s(u)) with input cod(u) is halting and, moreover, it performs at most p(|u|) steps;
− the family Π is sound with regard to (X, cod, s); that is, for each u ∈ I_X, if there exists an accepting computation of Π(s(u)) with input cod(u), then θ_X(u) = 1;
− the family Π is complete with regard to (X, cod, s); that is, for each u ∈ I_X, if θ_X(u) = 1, then every computation of Π(s(u)) with input cod(u) is an accepting one.

We denote by PMC_TD the set of all decision problems which can be solved by means of recognizer tissue-like P systems with cell division in polynomial time.
This class is closed under polynomial-time reduction and under complement. We close this section with an important remark about the previous way of solving decision problems. Specifically, we have said nothing about the way the computations proceed; in particular, they can be non-deterministic, as standard in membrane computing. It is important, however, that the systems always stop and always send out an object which is the correct answer to the input problem. From the soundness and completeness conditions above we deduce that every P system Π(n) is confluent, in the following sense: every computation of a system with the same input multiset must always give the same answer.

3 Solving SAT in polynomial time

As expected, the possibility to divide cells means the possibility to create an exponential workspace in linear time, and this space can be used in order to obtain fast solutions to computationally hard problems.

Theorem 1. Tissue P systems with cell division can solve SAT in polynomial time. (Otherwise stated, SAT ∈ PMC_TD.)

Proof. Let us consider a propositional formula γ = C_1 ∧ · · · ∧ C_m, consisting of m clauses C_j = y_{j,1} ∨ · · · ∨ y_{j,k_j}, 1 ≤ j ≤ m, where y_{j,i} ∈ { x_l, ¬x_l | 1 ≤ l ≤ n }, 1 ≤ i ≤ k_j (n variables are used). Without loss of generality, we may assume that no clause contains two occurrences of some x_i or two occurrences of some ¬x_i (the formula is not redundant at the level of clauses), or both x_i and ¬x_i (otherwise such a clause is trivially satisfiable, hence it can be removed). We codify γ, which is an instance of SAT with size parameters n and m, by the multiset

cod(γ) = { s_{i,j} | y_{j,r} = x_i, 1 ≤ i ≤ n, 1 ≤ j ≤ m, 1 ≤ r ≤ k_j } ∪ { s′_{i,j} | y_{j,r} = ¬x_i, 1 ≤ i ≤ n, 1 ≤ j ≤ m, 1 ≤ r ≤ k_j }.

(We replace each variable x_i from each clause C_j with s_{i,j} and each negated variable ¬x_i from each clause C_j with s′_{i,j}, then we remove all parentheses and connectives.
In this way we pass from γ to cod(γ) in a number of steps which is linear with respect to n · m.) The instance γ will be processed by the tissue P system Π(s(γ)) with input cod(γ), where s(γ) = ⟨n, m⟩ = ((n + m)(n + m + 1))/2 + n. We construct the recognizing tissue P system (of degree 2) with input Π(⟨n, m⟩) = (O, Σ, E, w_1, w_2, R, 2), with the following components:

O = Σ ∪ { a_i, t_i, f_i | 1 ≤ i ≤ n } ∪ { r_i | 1 ≤ i ≤ m } ∪ { t_{i,j}, f_{i,j} | 1 ≤ i ≤ n, 1 ≤ j ≤ m + 1 } ∪ { b_i | 1 ≤ i ≤ 3n + m + 1 } ∪ { c_i | 1 ≤ i ≤ n + 1 } ∪ { d_i | 1 ≤ i ≤ 3n + nm + 2m + 2 } ∪ { e_i | 1 ≤ i ≤ 3n + nm + 2m + 4 } ∪ { f, g, yes, no },
Σ = { s_{i,j}, s′_{i,j} | 1 ≤ i ≤ n, 1 ≤ j ≤ m },
E = O − { yes, no },
w_1 = yes no b_1 c_1 d_1 e_1,
w_2 = f g a_1 a_2 . . . a_n,

and the following rules.

1. Division rules:
[a_i]_2 → [t_i]_2 [f_i]_2, for all i = 1, 2, . . . , n.
(Membrane 2 is repeatedly divided, each time expanding one object a_i, corresponding to a variable x_i, into t_i and f_i, corresponding to the values true and false which this variable may assume. In this way, in n steps, we get 2^n cells with label 2, each one containing one of the 2^n possible truth-assignments for the n variables. The objects f, g are duplicated, hence a copy of each of them will appear in each cell.)

2. Communication rules:
(1, b_i / b_{i+1}^2, 0), for all i = 1, 2, . . . , n + 1,
(1, c_i / c_{i+1}^2, 0), for all i = 1, 2, . . . , n + 1,
(1, d_i / d_{i+1}^2, 0), for all i = 1, 2, . . . , n + 1,
(1, e_i / e_{i+1}, 0), for all i = 1, 2, . . . , 3n + nm + 2m + 3.
(In parallel with the operation of dividing cell 2, the counters b_i, c_i, d_i, e_i from cell 1 grow their subscripts. In each step, the number of copies of the objects of the first three types is doubled, hence after n steps we get 2^n copies of b_{n+1}, c_{n+1}, and d_{n+1}.
Objects b_i will check which clauses are satisfied by a given truth-assignment; objects c_i are used in order to multiply the number of copies of t_i, f_i, as we will see immediately; objects d_i are used to check whether there is at least one truth-assignment which satisfies all clauses; and e_i will be used in order to produce the object no, if this is the case, at the end of the computation.)

(1, b_{n+1} c_{n+1} / f, 2),
(1, d_{n+1} / g, 2).
(In step n + 1, the counters b_{n+1}, c_{n+1}, d_{n+1} are brought into the cells with label 2, in exchange of f and g. Because we have 2^n copies of each object of these types and 2^n cells with label 2, each one containing exactly one copy of f and one of g, due to the maximality of the parallelism in using the rules, each cell 2 gets precisely one copy of each of b_{n+1}, c_{n+1}, d_{n+1}. Note that the cells 2 cannot divide any more, because the objects a_i were exhausted.)

(2, c_{n+1} t_i / c_{n+1} t_{i,1}, 0), (2, c_{n+1} f_i / c_{n+1} f_{i,1}, 0), for each i = 1, 2, . . . , n,
(2, t_{i,j} / t_i t_{i,j+1}, 0), (2, f_{i,j} / f_i f_{i,j+1}, 0), for each i = 1, 2, . . . , n and j = 1, 2, . . . , m.
(In the presence of c_{n+1}, the objects t_i, f_i introduce the objects t_{i,1} and f_{i,1}, respectively, which initiates the possibility of introducing m copies of each t_i and f_i in each cell 2. The idea is that, because we have m clauses, in order to check their values for a given truth-assignment of the variables, it may be necessary to have one copy of the value of each variable for each clause. Note that this phase needs 2n steps for introducing the double-subscripted objects t_{i,1}, f_{i,1} (for each one we need one step, because we have only one copy of c_{n+1} available), and then further m steps are necessary for each t_{i,1}, f_{i,1} to grow its second subscript; all these steps are done in parallel, but for the last introduced t_{i,1}, f_{i,1} we have to continue m steps after the 2n necessary for priming. In total, we perform 2n + m steps.)

(2, b_i / b_{i+1}, 0), (2, d_i / d_{i+1}, 0), for all i = n + 1, . . . , (n + 1) + (2n + m) − 1.
(In parallel with the previous operations, the counters b_i and d_i increase their subscripts, until reaching the value 3n+m+1. This is done in all cells 2 at the same time. Simultaneously, e_i increases its subscript in cell 1.)

(2, b_{3n+m+1} t_i s_{i,j}/b_{3n+m+1} r_j, 0), (2, b_{3n+m+1} f_i s'_{i,j}/b_{3n+m+1} r_j, 0), for all 1 ≤ i ≤ n and 1 ≤ j ≤ m,
(2, d_i/d_{i+1}, 0), for all i = 3n+m+1, …, (3n+m+1)+nm−1.

(In the presence of b_{3n+m+1} – and not before – we check the values assumed by the clauses under the truth-assignment of each cell 2. We have only one copy of b_{3n+m+1} in each cell, hence we need at most nm steps for this: each clause contains at most n literals, and we have m clauses. In parallel, d increases its subscript, until reaching the value 3n+nm+m+1.)

(2, d_{3n+nm+m+i} r_i/d_{3n+nm+m+i+1}, 0), for all i = 1, 2, …, m.

(In each cell with label 2 we check whether or not all clauses are satisfied by the corresponding truth-assignment. For each clause which is satisfied, we increase by one the subscript of d, hence the subscript reaches the value 3n+nm+2m+1 if and only if all clauses are satisfied.)

(2, d_{3n+nm+2m+1}/f yes, 1).

(If one of the truth-assignments from a cell 2 has satisfied all clauses, then we reach d_{3n+nm+2m+1}, which is sent to cell 1 in exchange of the objects yes and f.)

(2, yes/λ, 0).

(In the next step, the object yes leaves the system, signaling the fact that the formula is satisfiable. In cell 1, the counter e will increase its subscript one more step, but after that it will remain unchanged – it can leave cell 1 only in the presence of f, but this object was already moved to cell 2.)

(1, e_{3n+nm+2m+2} f no/λ, 2), (2, no/λ, 0).

(If the counter e reaches the subscript 3n+nm+2m+2 and the object f is still in cell 1, then the object no can be moved to a cell 2, randomly chosen, and from there it exits the system, signaling that the formula is not satisfiable.)
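(As an illustrative aid only – not part of the original construction – the decision that the 2^n cells with label 2 compute in parallel can be sketched sequentially. The integer-literal clause encoding below is our own convention.)

```python
from itertools import product

def satisfiable(n, clauses):
    """Brute-force SAT check: the search that the 2**n cells with label 2
    perform in parallel. A clause is a set of literals: +i stands for x_i,
    -i for the negation of x_i (illustrative encoding)."""
    for assignment in product([True, False], repeat=n):
        # a clause is satisfied if at least one of its literals is true
        if all(any(assignment[abs(lit) - 1] == (lit > 0) for lit in clause)
               for clause in clauses):
            return True
    return False

# gamma = (x1 OR NOT x2) AND (NOT x1 OR x2): satisfiable (take x1 = x2)
print(satisfiable(2, [{1, -2}, {-1, 2}]))  # True
```

Of course, this sequential check takes 2^n iterations in the worst case; the point of the construction above is that membrane division generates the same search space in only n steps.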
In order to show that the family Π = {Π(⟨n, m⟩) | n, m ∈ N} is polynomially uniform by deterministic Turing machines, we first note that the sets of rules associated with the system Π(⟨n, m⟩) are recursive. Hence, it is enough to note that the amount of resources necessary for defining each system is quadratic in max{n, m}, and this is indeed the case, since those resources are the following:

1. Size of the alphabet: 6nm + 17n + 4m + 12 ∈ Θ(nm).
2. Initial number of cells: 2 ∈ Θ(1).
3. Initial number of objects: n + 8 ∈ Θ(n).
4. Number of rules: 4nm + 10n + 3m + 16 ∈ Θ(nm).
5. Upper bound for the length of the rules: 3 ∈ Θ(1).

From the previous explanations, one can see that, starting with the multiset cod(γ) added to cell 2, which is the input cell, the system correctly answers the question whether or not γ is satisfiable. The duration of the computation is polynomial in terms of n and m: the answer yes is sent out in step 3n + nm + 2m + 2, while the answer no is sent out in step 3n + nm + 2m + 4. This concludes the proof. □

The antiport rules from the previous construction are of weight at most 3, but the weight can be reduced to two, at the expense of some slowdown of the system. For instance, instead of the rule (1, e_{3n+nm+2m+2} f no/λ, 2) we can consider the rules (1, e_{3n+nm+2m+2} f/h, 0), (1, h no/λ, 2), where h is a new object. We can proceed in the same way with the rules (2, b_{3n+m+1} t_i s_{i,j}/b_{3n+m+1} r_j, 0), (2, b_{3n+m+1} f_i s'_{i,j}/b_{3n+m+1} r_j, 0), for 1 ≤ i ≤ n and 1 ≤ j ≤ m, but in this way, instead of at most nm steps for finding the satisfied clauses, we will need at most 2nm steps. The details are left to the reader.

Taking into account that SAT is an NP-complete problem and the class PMC_TD is closed under polynomial-time reduction and under complement, we have:

Corollary 2.
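(For quick reference, the pairing function s(γ) = ⟨n, m⟩ and the answer steps from the proof can be computed directly; this is a sketch with our own function names.)

```python
def pair(n, m):
    """Cantor pairing function <n, m> = (n + m)(n + m + 1)/2 + n,
    used to select the member Pi(<n, m>) of the uniform family."""
    return (n + m) * (n + m + 1) // 2 + n

def answer_step(n, m):
    """Steps at which the answers are sent out, as stated in the proof."""
    return {"yes": 3*n + n*m + 2*m + 2, "no": 3*n + n*m + 2*m + 4}

print(pair(2, 3))         # 17
print(answer_step(2, 3))  # {'yes': 20, 'no': 22}
```

Both quantities are plainly polynomial in n and m, which is the point of the uniformity argument.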
NP ∪ co-NP ⊆ PMC_TD.

4 Final Remarks

We have proven that by adding the membrane division feature to tissue P systems (with the communication done by antiport rules of a small weight) we can solve NP-complete problems in polynomial time. We exemplify this possibility with the SAT problem. It remains as a research topic to consider the same extension for other types of systems, for instance, for cell P systems with symport/antiport rules, or for neural P systems (with states associated with cells and multiset rewriting rules for processing the objects). The difficulty in the case of cell P systems with symport/antiport comes from the fact that only the skin membrane can communicate with the environment; on the other hand, the skin membrane cannot be divided, hence we need exponentially many objects for communication with inner membranes, and such objects should be brought in from the environment. In turn, neural P systems with the maximal use of rules and replicated communication are already known to be able to solve NP-complete problems in polynomial time; the challenge now is not to use replication. In spite of these difficulties, we expect results similar to the above one also in these cases. Another problem which remains open is to consider tissue P systems with the communication using only symport rules.

A previous version of the present paper was circulated in the volume of the Second Brainstorming Week on Membrane Computing, held in Sevilla in February 2004, and in the meantime several papers have considered tissue-like P systems with cell division as a framework for devising polynomial solutions to NP-complete problems. For instance, [2] deals with the Subset Sum problem, [3] deals with the Partition problem, [5] deals with the Vertex Cover problem, and [4] considers the 3-coloring problem.
What is not yet investigated is the possibility to also solve PSPACE problems, as is the case, for instance, for cell-like P systems with division of non-elementary membranes (see [11]) or with membrane creation (see [7]). This last possibility for producing working space, cell creation rules, has only recently been considered for tissue P systems [1]. Let us recall that this kind of rules does not perform replication of objects, as happens with cell-division rules, and it is an open question whether tissue P systems with communication and membrane creation rules can efficiently solve computationally hard problems.

Acknowledgements. The support of this research through the project TIN2006-13425 of the Ministerio de Educación y Ciencia of Spain, cofinanced by FEDER funds, and the support of the project of excellence TIC-581 of the Junta de Andalucía, are gratefully acknowledged.

Bibliography

[1] D. Díaz-Pernil: Sistemas celulares de tejidos: Formalización y eficiencia computacional, PhD thesis, University of Sevilla, 2008.

[2] D. Díaz-Pernil, M.A. Gutiérrez-Naranjo, M.J. Pérez-Jiménez, A. Riscos-Núñez: Solving Subset Sum in linear time by using tissue P systems with cell division. In J. Mira, J.R. Alvarez, eds., Bio-inspired Modeling of Cognitive Tasks, Second International Work-Conference on the Interplay Between Natural and Artificial Computation, IWINAC 2007, La Manga del Mar Menor, Spain, June 2007, Part I, LNCS 4527, Springer, 2007, 170–179.

[3] D. Díaz-Pernil, M.A. Gutiérrez-Naranjo, M.J. Pérez-Jiménez, A. Riscos-Núñez: Solving the Partition problem by using tissue-like P systems with cell division. In D. Díaz, C. Graciani, M.A. Gutiérrez, Gh. Păun, I. Pérez-Hurtado, A. Riscos, eds., Proceedings of the Sixth Brainstorming Week on Membrane Computing, Report RGNC 01/08, Fénix Editora, 2008, 123–134.

[4] D. Díaz-Pernil, M.A. Gutiérrez-Naranjo, M.J. Pérez-Jiménez, A.
Riscos-Núñez: A linear-time tissue P system based solution for the 3-coloring problem, Electronic Notes in Theoretical Computer Science, 171 (2007), 81–93.

[5] D. Díaz-Pernil, M.J. Pérez-Jiménez, A. Riscos-Núñez, A. Romero-Jiménez: Computational efficiency of cellular division in tissue-like membrane systems, submitted, 2008.

[6] P. Frisco, H.J. Hoogeboom: Simulating counter automata by P systems with symport/antiport. In Gh. Păun, G. Rozenberg, A. Salomaa, C. Zandron, eds., Membrane Computing. International Workshop WMC 2002, Curtea de Argeş, Romania, Revised Papers, LNCS 2597, Springer, 2003, 288–301.

[7] M.A. Gutiérrez-Naranjo, M.J. Pérez-Jiménez, F.J. Romero-Campero: A linear time solution for QSAT with membrane creation. In R. Freund, Gh. Păun, G. Rozenberg, A. Salomaa, eds., Membrane Computing, 6th International Workshop, WMC 2005, Vienna, Austria, July 18-21, 2005, Revised Selected and Invited Papers, LNCS 3850, Springer, 2006, 241–252.

[8] Gh. Păun: Membrane Computing: An Introduction, Springer, Berlin, 2002.

[9] M.J. Pérez-Jiménez: An approach to computational complexity in membrane computing. In G. Mauri, Gh. Păun, M.J. Pérez-Jiménez, G. Rozenberg, A. Salomaa, eds., Membrane Computing, 5th International Workshop, WMC5, Revised Selected and Invited Papers, LNCS 3365, Springer, 2005, 85–109.

[10] M.J. Pérez-Jiménez, A. Romero-Jiménez, F. Sancho-Caparrini: Teoría de la complejidad en modelos de computación celular con membranas, Editorial Kronos, Sevilla, 2002.

[11] P. Sosík, A. Rodríguez-Patón: Membrane computing and complexity theory: A characterization of PSPACE, Journal of Computer and System Sciences, 73, 1 (2007), 137–152.

Gheorghe Păun^{1,2}, Mario J.
Pérez-Jiménez^2, Agustín Riscos-Núñez^2

^1 Institute of Mathematics of the Romanian Academy, PO Box 1-764, 014700 Bucureşti, Romania

^2 Research Group on Natural Computing, Department of Computer Science and Artificial Intelligence, Technical Higher School of Computer Science Engineering, University of Sevilla, Avda. Reina Mercedes s/n, 41012 Sevilla, Spain

E-mail: {gpaun, marper, ariscosn}@us.es

Mario J. Pérez-Jiménez received his degree in mathematics from Barcelona University and his doctoral degree in mathematics from Sevilla University. In the past, he was an associate professor at the University of Barcelona. Currently, he is a titular professor of computer science and artificial intelligence at the University of Sevilla, where he is the head of the Research Group on Natural Computing. His main research fields are computational complexity theory, unconventional models of computation, natural computing, membrane computing, bioinformatics, and computational modelling for systems biology. He has published twelve books in computer science and mathematics, and over 100 scientific papers in international journals (collaborating with many researchers worldwide). He is the main researcher in various European, Spanish and Andalusian research grants. He has been an independent expert for the evaluation of NEST (New and Emergent Science and Technology) proposals under the Sixth Framework Programme of the European Community, and since May 2006 he has been a European Science Foundation peer reviewer.

Gheorghe Păun graduated from the Faculty of Mathematics, University of Bucharest, in 1974 and received his Ph.D. in mathematics (with specialization in computer science) from the same university in 1977. He held a research position at the University of Bucharest, and since 1990 he has been at the Institute of Mathematics of the Romanian Academy, where he is currently a senior researcher.
He has visited numerous universities in Europe, Asia, and North America, with frequent and/or longer stays in Turku (Finland), Leiden (The Netherlands), Magdeburg (Germany, including an Alexander von Humboldt fellowship in 1992-93), Tarragona, Madrid, and Sevilla (Spain, including a Ramón y Cajal scholarship in 2001–2006), London-Ontario (Canada), Rome, Milano, Pisa (Italy), Tokyo (Japan), Warsaw (Poland), Vienna (Austria), Budapest (Hungary), etc. His main research areas are formal language theory and its applications, computational linguistics, DNA computing, and membrane computing; this last research area was initiated by him in 1998, and the respective models are now called P systems (see http://ppage.psystems.eu). He has published a large number of research papers (collaborating with many researchers worldwide), has lectured at over 100 universities, and has given numerous invited talks at recognized international conferences. He has published eleven monographs in mathematics and computer science (some of them translated into Japanese, Chinese, Russian), has (co)edited over seventy collective volumes and special issues of journals, and has also published many popular science books, books on recreational mathematics (games), and fiction books. He is a member of the editorial board of more than a dozen international journals and was/is involved in the program/steering/organizing committees of many recognized conferences and workshops. In 1997 he was elected a member of the Romanian Academy, and since 2006 he has been a member of Academia Europaea. He has also received other honors, in Romania or abroad (professional and literary prizes, honorary citizenship titles, doctor honoris causa of the Silesian University in Opava, Czech Republic, etc.).

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 4, pp. 427-438

Fuzzy Controller Based QoS Routing Algorithm with a Multiclass Scheme for MANET

B. Sun, C. Gui, Q. Zhang, H.
Chen

Baolin Sun, Chao Gui, Qifei Zhang
School of Computing, Hubei University of Economics, Wuhan 430205, China
E-mail: blsun@163.com, guichao@126.com, cheffly@gmail.com

Hua Chen
College of Science, Wuhan University of Science and Engineering, Wuhan 430073, China
E-mail: qiuchen_1022@163.com

Abstract: A mobile ad hoc network (MANET) consists of a set of mobile hosts that can communicate with each other without the assistance of base stations. Due to the dynamic nature of the network topology and restricted resources, quality of service (QoS) and multicast routing in MANET are challenging tasks which attract the interest of many researchers. In this paper, we present a fuzzy controller based QoS routing algorithm with a multiclass scheme (FQRA) in mobile ad hoc networks. The performance of this scheduler is studied using NS2 (Network Simulator version 2) and evaluated in terms of quantitative measures such as packet delivery ratio, path success ratio and average end-to-end delay. Simulations show that the approach is efficient, promising and applicable in ad hoc networks.

Keywords: mobile ad hoc networks, fuzzy controllers, routing algorithm, QoS

1 Introduction

Mobile ad hoc networks (MANET) consist of mobile nodes that autonomously establish connectivity via multihop wireless communications. There is no static network infrastructure, such as base stations, nor any centralized administration in a MANET. In an ad hoc network, if two nodes are not within radio range, all message communication between them must pass through one or more intermediate nodes. All the nodes are free to move around randomly, thus changing the network topology dynamically [1-10]. These types of networks have many advantages, such as self-reconfiguration and adaptability to highly variable mobile characteristics like the transmission conditions, propagation channel distribution characteristics and power level.
They are useful in many situations such as military applications, conferences, lectures, and emergency search and rescue operations. However, such benefits come with new challenges, which mainly reside in the unpredictability of the network topology due to the mobility of nodes and the limited available bandwidth of the wireless channel. These characteristics demand a new way of designing and operating this type of network. For such networks, an effective routing protocol is critical for adapting to node mobility as well as possible channel errors to provide a feasible path for data transmission [1-10]. AODV is an on-demand distance vector routing protocol [2]. The protocol is well known for its use in ad hoc networks. The use of multicasting within the network has many benefits. Multicasting reduces the communication cost for applications that send the same data to many recipients [3, 5-8]. Instead of sending via multiple unicasts, multicast reduces the channel bandwidth, sender and router processing, and delivery delay. In addition, multicast gives robust communication whereby the receiver address is unknown or modifiable without the knowledge of the source within the wireless environment.

Quality of service (QoS) support for multimedia applications is closely related to resource allocation, the objective of which is to decide how to reserve resources such that the QoS requirements of all the applications can be satisfied [3, 6-11, 16, 21]. The goals of QoS routing are twofold: finding a suitable route through the network between the source and the destination that will have the necessary resources available to meet the QoS constraints, and achieving global efficiency in the utilization of resources.
However, it is a significant technical challenge to provide reliable high-speed end-to-end communications in these networks, due to their dynamic topology, distributed management, and multi-hop connections. The provision of QoS requirements is of utmost importance for the development of future networks. For supporting QoS-aware applications, QoS based routing algorithms such as ticket-based routing (TBR) [3] and core extraction dynamic source routing (CEDAR) [4] have been proposed. In CEDAR, each host compares its degree with those of its neighbors. The host with the largest degree among its neighbors is selected as the core node. The core node is responsible for recording all information for its members. When a source host requests to establish a QoS routing path to some destination host, its core node must construct a QoS routing path. The advantage of core-based management is that it reduces the number of control packets during path construction. Chang proposes a two-level management approach for efficiently constructing and maintaining a QoS routing path in ad hoc wireless networks, significantly reducing the quantity of control packets [5]. In the first phase, the mobile hosts are partitioned into a number of complete graphs, each represented by a supernode managed by an agent. In the second phase, some agents with a larger degree than neighboring agents are selected as core nodes. Lorenz and Orda demonstrate in the literature [16] that this uncertainty places additional constraints on QoS provisioning. Xiao et al. proposed a dynamic backup routes routing protocol (DBR2P), a backup node mechanism for quick reconnection during link failures [20]. DBR2P is an on-demand routing protocol and it can set up many routes to reach a destination node in a given period. These algorithms determine a path that satisfies the required QoS.
Fuzzy logic exploits the pervasive imprecision, uncertainty and partial truth of the real world using simple linguistic statements, and thereby achieves tractability, robustness, and low solution cost [11, 13-19, 23]. A fuzzy logic based decision algorithm influences caching decisions among multiple paths uncovered during route discovery and avoids low quality paths [11, 14-19]. Differentiated resource allocation considering message type and network queue status is evaluated using a fuzzy logic scheme [11]. Fernandez, Hu, Kazemian, Zhang et al. propose the use of fuzzy logic controllers for the dynamic reconfiguration of edge and core routers [13-15, 23]. This reconfiguration allows adjusting the network provisioning according to the incoming traffic and the QoS level achieved. Hu and Peter utilize a self-tuning fuzzy controller to apply an end-to-end rate-based feedback flow control algorithm for the available bit rate (ABR) service in ATM [14]. Sheng et al. propose an adaptive routing algorithm in which the link cost is dynamically assigned using a fuzzy system [18]. Their results show that traffic in the network is rerouted toward paths with less congestion or spare capacity. Sheu and Chen propose a fuzzy bandwidth allocation controller (FBAC) to support services, including restricted time-bounded services such as voice and video, in wireless networks [19]. In those studies, the parameters and rules were further calibrated to obtain a more efficient evaluation. Because ad hoc network traffic exhibits self-similarity, a common statistical method has been widely used to verify the self-similarity of time series, so that the characteristics of ad hoc wireless networks can be grasped. In this paper, we present a fuzzy controller based QoS routing algorithm with a multiclass scheme (FQRA) in mobile ad hoc networks.
FQRA deals with route table management, maintaining the lifetimes of the active routes. FQRA applies a fuzzy logic system to dynamically evaluate the route expiry time. Fuzzy logic is chosen because there are uncertainties associated with node mobility and the estimation of link failure, and no precise mathematical model is available for estimating node mobility. In addition, FQRA is able to take several controlling factors into consideration. Therefore, FQRA is a multiclass-scheme fuzzy evaluation for a QoS routing protocol. The performance of FQRA is studied using NS2 [24] and evaluated in terms of quantitative measures such as improved path success ratio, reduced average end-to-end delay and increased packet delivery ratio.

The rest of the paper is organized as follows. Section 2 introduces the ad hoc network model and routing issues. Section 3 presents the fuzzy QoS controller. In Section 4, we evaluate the performance of the algorithm and present the simulation results. Finally, Section 5 concludes with a summary of the work and future challenges.

2 Network Model and Routing Issues

A network is usually represented as a weighted digraph G = (N, E), where N denotes the set of nodes and E denotes the set of communication links connecting the nodes. |N| and |E| denote the number of nodes and links in the network, respectively [3-11, 16]. In G(N, E), consider a QoS constrained multicast routing problem from a source node to multiple destination nodes: given a non-empty set M = {s, u_1, u_2, …, u_m}, M ⊆ N, s is the source node and U = {u_1, u_2, …, u_m} is the set of destination nodes. For a multicast tree T = (N_T, E_T), where N_T ⊆ N and E_T ⊆ E, C(T) is the cost of T, P_T(s, u) is the path from the source node s to a destination u ∈ U in T, and D_T(s, u) and B_T(s, u) are the delay and usable bandwidth of P_T(s, u).

Definition 1.
The cost of the multicast tree T is: C(T) = Σ_{e∈E_T} c(e).

Definition 2. The bandwidth and delay of the multicast tree T are the link bandwidth and delay values on the path from the source node s to each destination node u ∈ U, i.e.,

B_T(s, u) = min(b(e), e ∈ E_T),
D_T(s, u) = max(Σ_{e∈P_T(s,u)} d(e), u ∈ U).

Definition 3. Assume the minimum bandwidth constraint of the multicast tree is B and the maximum delay constraint is D, given a multicast demand R; then the problem of bandwidth and delay constrained multicast routing is to find a multicast tree T satisfying:

(1) bandwidth constraint: B_T(s, u) ≥ B, u ∈ U;
(2) delay constraint: D_T(s, u) ≤ D, u ∈ U.

Suppose S(R) is the set of trees satisfying the conditions above; then the multicast tree T which we seek is: C(T) = min(C(T_s), T_s ∈ S(R)).

3 Fuzzy QoS Controller

3.1 Fuzzy Logic Controller

Fuzzy logic was introduced by Zadeh as a generalization of Boolean logic [22]. The difference between these logics is that fuzzy set theory provides a way to represent uncertainties; that is, it accepts conditions that are partially true or partially false. Fuzzy logic is well suited to treating random uncertainty, i.e., when the prediction of a sequence of events is not possible. A fuzzy logic control system is a rule-based system in which a set of so-called fuzzy rules represents a control decision mechanism to adjust the effects of certain causes that come from the system. The aim of the fuzzy control system is normally to substitute for or replace a skilled human operator with a fuzzy rule-based system. Specifically, based on the current state of a system, an inference engine equipped with a fuzzy rule base determines an on-line decision to adjust the system behavior in order to guarantee that it is optimal in certain senses. There are generally two kinds of fuzzy logic controllers.
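(Definitions 1-3 translate directly into code. The sketch below assumes per-edge attributes c(e), b(e), d(e) stored in a dictionary; the data layout and function names are ours, for illustration only.)

```python
def tree_cost(edge_attrs):
    """Definition 1: C(T) is the sum of c(e) over the edges of T."""
    return sum(c for (c, b, d) in edge_attrs.values())

def path_bandwidth(path, edge_attrs):
    """Bottleneck bandwidth along a path: min of b(e)."""
    return min(edge_attrs[e][1] for e in path)

def path_delay(path, edge_attrs):
    """End-to-end delay along a path: sum of d(e)."""
    return sum(edge_attrs[e][2] for e in path)

def feasible(paths_to_dests, edge_attrs, B, D):
    """Definition 3: every destination path meets the bandwidth
    and delay constraints."""
    return all(path_bandwidth(p, edge_attrs) >= B and
               path_delay(p, edge_attrs) <= D
               for p in paths_to_dests.values())

# edge -> (cost, bandwidth, delay); a tiny tree s -> a -> {u1, u2}
edges = {("s", "a"): (1, 8, 10), ("a", "u1"): (2, 6, 15), ("a", "u2"): (1, 9, 5)}
paths = {"u1": [("s", "a"), ("a", "u1")], "u2": [("s", "a"), ("a", "u2")]}
print(tree_cost(edges))               # 4
print(feasible(paths, edges, 5, 30))  # True
```

Finding the cheapest feasible tree over S(R) is the hard part, of course; the sketch only checks feasibility and cost of a candidate tree.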
One is the feedback controller, which is not suitable for high performance communication networks. The other, which is used in this paper, is shown in Figure 1. The output of the fuzzy logic controller in Figure 1 is used to tune the controlled system's parameters based on the state of the system. This control mechanism is different from conventional feedback control and is considered a form of adaptive control [11, 14, 19, 23].

Figure 1: The fuzzy logic controller

The specific features of the fuzzy controller depend on the model under control and the performance measurement. However, in principle, in the fuzzy controller we explore the implicit and explicit relationships within the system and subsequently develop the optimal fuzzy control rules as well as a knowledge base.

In the route discovery process, new routes are created. The creation of new routes makes use of a forward packet (Table 1) and a backward packet (Table 2). A forward packet is broadcast by the sender and relayed by the neighbors of the sender. When the forward packet reaches the destination node, the destination extracts the information collected by the forward packet, destroys it, and subsequently creates a backward packet which follows the track of the forward packet, but in the reverse direction.

Table 1: Forward packet — source address, destination address, sequence number, number of hops, QoS metric, intermediate node.

Table 2: Backward packet — source address, destination address, number of hops, QoS metric, intermediate node.

The flow of packets through a router is presented in Figure 2. The aim was to decrease TCP and UDP traffic settling, rise and fall times; to decrease overshoots and undershoots; and to stabilize throughput in the router's output connections, letting the model adapt to load and link capacity variations.
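(Section 3.2 below uses a WRR, i.e. weighted round robin, packet scheduler; a minimal sketch of that discipline, with illustrative names of our own, shows the mechanism the fuzzy controller tunes: each class queue is served up to its weight per round, so changing a weight changes the class's share of the bandwidth.)

```python
from collections import deque

def wrr(queues, weights, rounds):
    """Serve each class queue up to weights[cls] packets per round."""
    served = []
    for _ in range(rounds):
        for cls, q in queues.items():
            for _ in range(weights[cls]):
                if q:  # skip empty queues
                    served.append(q.popleft())
    return served

# Expedited-forwarding class weighted twice as heavily as best-effort.
queues = {"ef": deque(["e1", "e2", "e3"]), "be": deque(["b1", "b2", "b3"])}
print(wrr(queues, {"ef": 2, "be": 1}, rounds=2))  # ['e1', 'e2', 'b1', 'e3', 'b2']
```

With weights {"ef": 2, "be": 1}, the expedited-forwarding queue receives two service opportunities for every best-effort one, which is exactly the knob the fuzzy output membership function adjusts.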
Figure 2: Schematic presentation of packet flow through a router

3.2 Scheduler Controller

The packet scheduler used in our architecture is WRR (weighted round robin). In this scheduler, queues are served according to a configurable weight that can be changed during network operation. This allows control of the bandwidth assigned to each service class. The packet delay and discard rate for each queue (class) can be controlled by changing this weight. An example of a membership function of the schedule controller is shown in Figure 3. The fuzzy scheduler proposed here calculates the priority index of each packet. Here we consider all the inputs which decide the priority associated with the packet, unlike previous scheduling schemes. Other membership functions are: packet delay in the expedited forwarding queue, and discard rate due to queue overflow in the best-effort class. The output membership functions are also defined as trapezoid functions, for the same reasons as before. We make use of the center of gravity defuzzification method, since it gives better results. The output membership function gives the weights assigned to each class in the WRR scheduler.

Figure 3: Scheduler membership functions

3.3 Fuzzy Rule Base (Fuzzy Routing)

Fuzzy systems reason with multi-valued fuzzy sets instead of crisp sets. The fuzzy logic controller (FLC) (Figure 1) has two inputs, residual bandwidth and traffic class, and one output, the fuzzy routing decision [12, 22]. Mamdani fuzzy-rule based systems consist of a linguistic description in both the antecedent parts and the consequent parts. Each rule (Table 3) is a description of a condition-action statement that may be clearly interpreted by the users. The rule base is a group of if-then rules with fuzzy sets that represents the desired behavior of a fuzzy system. It can be defined in agreement with the administrative policy. If x_1 is A_{i1} and …
and x_n is A_{in}, then y is C_i, i = 1, 2, …, L,

where L is the number of fuzzy rules, x_j ∈ U_j, j = 1, 2, …, n, are the input variables, y is the output variable, A_{ij} are the fuzzy sets of the input linguistic variable x_j, and C_i is the set of the output linguistic variable y. A_{ij} and C_i are characterized by membership functions. Inputs are of the form: x_1 is A'_1, x_2 is A'_2, …, x_n is A'_n, where A'_1, A'_2, …, A'_n are fuzzy subsets of U_1, U_2, …, U_n, which are the universes of discourse of the inputs.

Table 3: Fuzzy rule base QoS classes and application types — class 1: video conference; class 2: SDTV-quality video; class 3: CD-quality audio; class 4: high-quality voice.

4 Simulation

4.1 Random Graph Generation

In generating random graphs, we have adopted the method used by Kazemian, where vertices are placed randomly in a rectangular coordinate grid by generating uniformly distributed values for their x and y coordinates [15]. The remaining edges of the graph are chosen by examining each possible edge (u, v) and generating a random number 0 ≤ r < 1. If r is less than a probability function p(u, v) based on the edge distance between u and v, then the edge is included in the graph. The distance for each edge is the Euclidean distance, denoted d(u, v), between the nodes that form the end-points of the edge. We use the probability p(u, v) = β exp(−d(u, v)/(α L)), where d(u, v) is the geometric distance from node u to node v and L is the maximum distance between two nodes. The parameters α and β are in the range (0, 1) and can be used to obtain certain desirable characteristics in the topology: parameter α can be used to control the ratio of short edges to long edges in the random graph, and parameter β can be used to control the average degree of the random graph.

4.2 Simulation Model

To conduct the simulation studies, we used randomly generated networks on which the algorithms were executed.
This ensures that the simulation results are independent of the characteristics of any particular network topology. Using randomly generated network topologies also provides the necessary flexibility to tune various network parameters, such as average degree, number of nodes, and number of edges, and to study the effect of these parameters on the performance of the algorithms. The platform used was NS2 (Network Simulator version 2) [24]. NS2 is a discrete event simulator targeted at networking research. NS2 provides substantial support for simulation of TCP, routing, and multicast protocols over wired and wireless networks. The fuzzy routing decision was implemented using the Fuzzy Logic Toolbox in MATLAB [25]. The simulator ran with various input configuration settings, and the statistics collected were analyzed in comparison with the other well-known on-demand routing protocol, AODV [2].

Our simulation modeled a network of mobile nodes placed randomly within a 1000 × 1000 meter area. Each node had a radio propagation range of 250 meters and a channel capacity of 10 Mbps. The two-ray propagation model was used. The IEEE 802.11 distributed coordination function was used as the medium access control protocol. A random waypoint mobility model was used: each node randomly selected a position and moved toward that location with a speed ranging from just above 0 m/s to 10 m/s. When the node reached that position, it became stationary for a programmable pause time; then it selected another position and repeated the process. The simulation was repeated with different seed values. A traffic generator was developed to simulate CBR (constant bit rate) sources. The size of the data payload was 512 bytes. Data sessions with randomly selected sources and destinations were simulated. Each source transmitted data packets at a minimum rate of 4 packets/sec and a maximum rate of 10 packets/sec.
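(The topology generation procedure of Section 4.1 can be sketched as follows; the function name and grid size are our own, and only the Waxman-style probability p(u, v) = β exp(−d(u, v)/(α L)) comes from the text.)

```python
import math
import random

def random_topology(n, alpha, beta, grid=100.0, seed=None):
    """Place n vertices uniformly on a grid; keep edge (u, v) with
    probability beta * exp(-d(u, v) / (alpha * L)), where L is the
    maximum distance between any two nodes."""
    rng = random.Random(seed)
    pos = [(rng.uniform(0, grid), rng.uniform(0, grid)) for _ in range(n)]
    L = max(math.dist(pos[u], pos[v])
            for u in range(n) for v in range(u + 1, n))
    edges = [(u, v)
             for u in range(n) for v in range(u + 1, n)
             if rng.random() < beta * math.exp(-math.dist(pos[u], pos[v]) / (alpha * L))]
    return pos, edges

pos, edges = random_topology(20, alpha=0.4, beta=0.6, seed=1)
print(len(pos), len(edges))
```

Raising β increases the average degree uniformly, while lowering α penalizes long edges more heavily, matching the roles the text assigns to the two parameters.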
Traffic classes were randomly assigned, and the simulation was carried out with different bandwidth requirements. There were no network partitions throughout the simulation. Each simulation was executed for 600 seconds of simulation time. Multiple runs with different seed values were conducted for each scenario, and the collected data was averaged over those runs. Table 4 lists the simulation parameters, which are used as default values unless otherwise specified.

Table 4: Simulation parameters.

  Number of nodes        100
  Terrain range          1000 m × 1000 m
  Transmission range     250 m
  Average node degree    3-5
  Node mobility speed    0-10 m/s
  Mobility model         random waypoint
  Propagation model      free space
  Channel bandwidth      5 Mbps
  Link delay             20-200 ms
  Traffic type           CBR
  Data payload           512 bytes/packet
  Node pause time        0-10 seconds

4.3 Performance Measures

The following measures are used in computing the scheduler performance. These measures were derived from those suggested by the MANET working group for routing protocol evaluation.

Packet delivery ratio: the ratio of the number of data packets actually delivered to the destinations to the number of data packets that were supposed to be received. This estimate gives us an idea of how successful the protocol is in delivering packets to the application layer. A high packet delivery ratio indicates that most packets are being delivered to the higher layers and is a good indicator of algorithm performance.

Path success ratio: the ratio of the number of connection requests for which a route to the destination was successfully discovered to the total number of routed connection requests. A routing request is considered successful if the path computed and established by the algorithm satisfies the delay and bandwidth constraints. This number indicates the effectiveness of the algorithm.

Average end-to-end delay: this indicates the end-to-end delay experienced by packets from source to destination.
The average end-to-end packet delay is computed as the ratio of the total end-to-end delay to the aggregate number of packets successfully delivered to the destination nodes during a simulation run. A higher end-to-end delay means that the network is congested and hence the routing algorithm does not perform well.

4.4 Simulation Results

In this performance evaluation, the following performance measures were evaluated: packet delivery ratio, path success ratio, and average end-to-end delay. For each evaluation, we used CBR traffic. All simulations started with an initial scheduler configuration with 60% of the bandwidth for each class. To eliminate simulation results with an empty network, we started collecting results 30 seconds after the beginning of the simulation. After the optimization procedure was executed, we could verify the result by comparing packet delivery ratio, path success ratio, and average end-to-end delay. Figure 4 shows the performance analysis of the packet delivery ratio vs. network size for FQRA, AODV, and non-QoS (no quality-of-service metric constraints) in an ad hoc network. Using the non-QoS algorithm as a baseline for comparison, the results show that both FQRA and AODV are much better than the non-QoS algorithm, and that FQRA is considerably better than AODV. This is because AODV needs to rediscover the route in order to retransmit data packets that are lost due to node mobility or unreal route paths during the communication. The advantage of FQRA results from choosing the right routing path, or updating unreal route paths just in time, by virtue of suitable route lifetime estimation.

Figure 4: Packet delivery ratio vs. network size.

Figure 5 shows the comparison of the three algorithms with respect to average end-to-end delay vs. number of nodes. It shows that the end-to-end delay generally increases with the number of nodes.
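The three measures defined above reduce to simple ratios over the collected trace statistics. The following minimal sketch states them precisely; the function names and the counters they take are our own, assumed to be tallied by the simulator.

```python
def packet_delivery_ratio(delivered, expected):
    """Fraction of data packets actually delivered to destinations,
    out of the packets that were supposed to be received."""
    return delivered / expected if expected else 0.0

def path_success_ratio(satisfied_requests, total_requests):
    """Fraction of routed connection requests for which the computed
    path satisfied the delay and bandwidth constraints."""
    return satisfied_requests / total_requests if total_requests else 0.0

def average_end_to_end_delay(delays):
    """Total end-to-end delay divided by the number of packets
    successfully delivered during a simulation run (seconds)."""
    return sum(delays) / len(delays) if delays else 0.0
```

For example, 90 delivered packets out of 100 sent gives a packet delivery ratio of 0.9, and per-packet delays of 0.1, 0.2 and 0.3 s average to 0.2 s.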
From Figure 5, we can see that as the number of nodes increases, the average end-to-end delay of the FQRA algorithm is lower than that of the AODV and non-QoS algorithms. This is because the fuzzy controller scheduler maintains more stable routes and gives higher precedence to the packets. Both the AODV and non-QoS algorithms need more time and more control overhead than FQRA does to recover unreal paths (broken paths) and to discover new paths. As the number of nodes increases, more packets are received and thus the end-to-end delay increases. The end-to-end delay is important, as many real-time applications require a small latency to deliver usable data within a stipulated period of time. Figure 6 compares the path success rates of FQRA, AODV and non-QoS in an ad hoc network. With the relaxation of bandwidth constraints, the path success rate becomes higher. FQRA's success rate remains higher than that of non-QoS, which makes it more suitable for route selection in timely data transmission applications and under dynamic network structures. The average end-to-end delay performance, shown in Figure 7, confirms that the end-to-end delay improves when the scheduler is included. As the mobility varies from 0 to 10 m/s, the fuzzy controller scheduler reduces the end-to-end delay by around 0.01 to 0.05 seconds. The figure shows that the end-to-end delay generally increases with increasing speed. It also shows that FQRA is much better than the other two: both non-QoS and AODV need more time and more control overhead than FQRA does to recover unreal paths and to discover new paths.

Figure 5: Average end-to-end delay vs. network size.
Figure 6: Path success ratio vs. bandwidth.
Figure 7: Average end-to-end delay vs. node mobility speed.
5 Conclusion and Future Work

Our QoS routing algorithm has produced significant improvements in throughput, average end-to-end delay and path success ratio. The fuzzy controller scheduler algorithm attaches a QoS class to each packet in the queue of the node. Unlike the normal sorting procedure for scheduling packets, the crisp QoS class is calculated by the fuzzy scheduler based on the above inputs, which are derived from the network. The membership functions and rule bases of the fuzzy scheduler are carefully designed. The use of fuzzy logic improves the handling of inaccuracy and uncertainty in the traffic entering the domain. In this paper, we have presented a fuzzy controller based QoS routing algorithm with a multiclass scheme for mobile ad hoc networks. The performance of this scheduler was studied using ns2 and evaluated in terms of quantitative measures such as path success ratio, average end-to-end delay and throughput. Simulation shows that the approach is efficient, promising and applicable to ad hoc networks. Future work includes comparison with "crisp" versions of the fuzzy algorithm to isolate the contributions of fuzzy logic, as well as applications of fuzzy control to power consumption and directional antennas in MANETs. We also intend to compare FQRA with other QoS routing algorithms.

Acknowledgement

The authors thank the editors and the anonymous reviewers for their valuable comments, which helped to improve the paper. This work is supported by the China Postdoctoral Science Foundation (No. 20070410955), the Young and Middle-Aged Elitists' Scientific and Technological Innovation Team Project of the Institutions of Higher Education in Hubei Province (No. T200902), and the Key Scientific Research Project of the Hubei Education Department (No. D20081904, Q20091903, B20091904).

Bibliography

[1] M. A. Rajan, M. G. Chandra, L. C. Reddy and P. Hiremath, Concepts of graph theory relevant to ad-hoc networks,
International Journal of Computers, Communications & Control, Vol. 3, Suppl. issue, pp. 465-469, 2008.

[2] C. Perkins, E. Belding-Royer and S. Das, Ad hoc On-Demand Distance Vector (AODV) Routing, IETF RFC 3561, July 2003.

[3] S. Chen and K. Nahrstedt, Distributed quality of service routing in ad hoc networks, IEEE Journal on Selected Areas in Communications, Vol. 17, No. 8, pp. 1488-1504, 1999.

[4] R. Sivakumar, P. Sinha and V. Bharghavan, CEDAR: core extraction distributed ad hoc routing, IEEE Journal on Selected Areas in Communications, Vol. 17, No. 8, pp. 1454-1465, 1999.

[5] C. Y. Chang, C. T. Chang, T. S. Chen and H. R. Chang, Hierarchical management protocol for constructing a QoS communication path in wireless ad hoc networks, Information Sciences, Vol. 177, No. 13, pp. 2621-2641, 2007.

[6] B. L. Sun and L. Y. Li, A QoS multicast routing optimization algorithm based on genetic algorithm, Journal of Communications and Networks, Vol. 8, No. 1, pp. 116-122, 2006.

[7] B. L. Sun, L. Y. Li, Q. Yang and Y. Xiang, An entropy-based stability QoS multicast routing protocol in ad hoc network, Lecture Notes in Computer Science, Vol. 3947, Springer-Verlag, pp. 217-226, 2006.

[8] B. L. Sun, S. C. Pi, C. Gui, et al., Multiple constraints QoS multicast routing optimization algorithm in MANET based on GA, Progress in Natural Science, Vol. 18, No. 3, pp. 331-336, 2008.

[9] B. L. Sun, H. Li, Y. Zeng and Q. Q. Qin, Fuzzy QoS controllers in mobile ad hoc networks using genetic algorithms, Journal of Computational Information Systems, Vol. 3, No. 6, pp. 2255-2260, 2007.

[10] B. L. Sun and L. Y. Li, A distributed QoS multicast routing protocol in ad hoc networks, Journal of Systems Engineering and Electronics, Vol. 17, No. 3, pp. 692-698, 2006.

[11] A. Thomas, C. Chellappan and C. Jayakumar, AntHoc QoS: quality of service routing in mobile ad hoc networks using swarm intelligence,
In: The Second Asia Pacific Conference on Mobile Technology, Applications and Systems, Guangzhou, China, 15-17 November 2005.

[12] Y. H. Wang and C. F. Chao, Dynamic backup routes routing protocol for mobile ad hoc networks, Information Sciences, Vol. 176, No. 2, pp. 161-185, 2006.

[13] M. P. Fernandez, A. de Castro, P. Pedroza and J. F. de Rezende, QoS provisioning across a DiffServ domain using policy-based management, In: GLOBECOM 2001, San Antonio, USA, November 2001.

[14] R. Q. Hu and D. Peter, A predictive self-tuning fuzzy logic feedback rate controller, IEEE/ACM Transactions on Networking, Vol. 8, No. 6, pp. 697-709, 2000.

[15] H. B. Kazemian and L. Meng, A fuzzy control scheme for video transmission in Bluetooth wireless, Information Sciences, Vol. 176, No. 9, pp. 1266-1289, 2006.

[16] D. H. Lorenz and A. Orda, QoS routing in networks with uncertain parameters, IEEE/ACM Transactions on Networking, Vol. 6, No. 6, pp. 768-778, 1998.

[17] S. Rea and D. Pesch, Multi-metric routing decisions for ad hoc networks using fuzzy logic, In: Proc. of the 1st International Symposium on Wireless Communication Systems, Mauritius, 20-22 September 2004, pp. 403-407.

[18] H. M. Sheng, J. C. Wang, H. H. Huang and D. C. Yen, Fuzzy measure on vehicle routing problem of hospital materials, Expert Systems with Applications, Vol. 30, No. 2, pp. 367-377, 2006.

[19] S. T. Sheu and M. H. Chen, A fuzzy bandwidth allocation controller to support real-time traffic over wireless network, In: Proc. of IEEE Wireless Communications and Networking Conference, New Orleans, LA, Vol. 1, pp. 1348-1352, 1999.

[20] Y. Xiao, C. L. P. Chen and Y. Wang, Optimal admission control for multi-class of wireless adaptive multimedia services, IEICE Transactions on Communications, Special Issue on Mobile Multimedia Communications, Vol. E84-B, No. 4, pp. 795-804, 2001.

[21] B. L. Ye, M. Y. Guo, D. X. Chen and S. L. Lu, A degree-constrained QoS-aware routing algorithm for application layer multicast, Information Sciences, Vol.
177, No. 17, pp. 3613-3626, 2007.

[22] L. A. Zadeh, Fuzzy sets, Information and Control, Vol. 8, No. 3, pp. 338-353, 1965.

[23] R. Zhang and Y. Phillis, Fuzzy control of queueing systems with heterogeneous servers, IEEE Transactions on Fuzzy Systems, Vol. 7, No. 1, pp. 17-26, 1999.

[24] The Network Simulator ns-2, http://www.isi.edu/nsnam/ns/.

[25] Fuzzy Logic Toolbox (for use with MATLAB) User's Guide, Version 2, The MathWorks Inc., http://www.mathtools.net/matlab/fuzzy_logic/.

Baolin Sun (born in 1963) graduated in mathematics (Huazhong Normal University, Wuhan, China) and received a Ph.D. in computer science (Wuhan University of Technology, Wuhan, China). He is currently a professor, a guest member of the editorial board of World SciTech R&D, a paper reviewer for the Chinese Journal of Electronics, the Journal of Electronics & Information Technology, and High Technology Letters, a member of the KGCM and ICCIT program committees, and a member of the international standard draft organizing group of ISO/IEC JTC1/SC6. His research interests include QoS multicast routing, distributed computing, network optimization and ad hoc networks. Professor Sun has published over 100 research papers and is the author of four books. He was awarded the Province Special Prize by the Hubei Province government in 2007.

Chao Gui (born in 1964) graduated in physics (Wuhan University, Wuhan, China) and received an M.S. degree in electrical engineering (Wuhan University of Technology, Wuhan, China). He is currently an associate professor in computer science. His research interests include wireless communication, performance analysis and analytical modeling. He has published over 30 research papers.

Qifei Zhang (born in 1977) graduated in computer science (Jianghan University, Wuhan, China) and received a Ph.D. in information and communication engineering (Huazhong University of Science and Technology, Wuhan, China). He is currently a lecturer in computer science.
His research interests include wireless local area networks, mesh networks, network protocols and congestion control. He has published over 30 research papers.

Hua Chen (born in 1977) graduated in mathematics (Huazhong Normal University, Wuhan, China) and received an M.S. degree in mathematics (Huazhong University of Science and Technology, Wuhan, China). She is currently a lecturer in mathematics. Her research interests include wireless communication, performance analysis, network models and algorithms. She has published over 20 research papers.

International Journal of Computers, Communications & Control Vol. I (2006), No. 1, pp. 73-79

Grigore C. Moisil: A Life Becoming a Myth

Solomon Marcus

"All that is correct thinking is either mathematics or feasible to be transposed into a mathematical model." Grigore C. Moisil (1906-1973)

Grigore C. Moisil, Romanian mathematician, was a professor at the University of Bucharest, Faculty of Mathematics and Computer Science, a member of the Romanian Academy, a member of the Academy of Sciences in Bologna and of the International Institute of Philosophy, and a recipient of the Computer Pioneer Award of the IEEE Computer Society (1996). Grigore C. Moisil belongs to the fifth generation of Romanian mathematicians. The first generation includes the founders Spiru Haret and David Emmanuel, both with doctorates from the Sorbonne (Paris). They were born in the middle of the 19th century and have the merit of initiating high-level mathematical research in Romania. (We leave aside Transylvania, with Janos Bolyai, who already in the first half of the 19th century discovered non-Euclidean geometry.) The second generation includes the first Romanian mathematicians with long careers of scientific research: Gh. Ţiţeica and D. Pompeiu (both with doctorates from the Sorbonne), and Al. Myller and Vera Myller (both with doctorates from Göttingen, Germany). They already had an important impact on Moisil's formation as a mathematician.
The third generation includes Victor Valcovici, Traian Lalescu and Simion Stoilow (considered the most important Romanian mathematician until the sixties of the 20th century), born in the eighties of the 19th century. Lalescu died very young (in 1929), but Valcovici and Stoilow became great professors not only before, but also after the Second World War. We then have a fourth generation of mathematicians born in the last decade of the 19th century: Octav Onicescu (the initiator of the Romanian school of probability theory), Petre Sergescu (the organizer of the first two congresses of Romanian mathematicians, before the Second World War), Dan Barbilian (the same person as the poet Ion Barbu), Alexandru Froda and Gheorghe Vranceanu (the greatest Romanian geometer after Ţiţeica). Directly or indirectly, all these mathematicians had an impact on Moisil's personality. Born in Tulcea (Dobrogea), with a long genealogical tree of high intellectuals in the north-western part (Bistriţa-Năsăud) of Romania, with school training partly in Bucharest and partly in Iasi (Moldova), Moisil's childhood and adolescence are well known from his own diary, published in recent years by Moisil's widow Viorica Moisil, who took great care of the whole scientific and human heritage of her husband. The main teachers of the child Moisil were his parents; his father was an important historian, while his mother was an educator for elementary school. The child Grigore enjoyed looking around, giving free expression to his curiosity and wonder, asking questions, and reading books of great diversity, from science to literature and from practical jobs to philosophy. He always felt the need to react to what he was seeing, hearing and reading. His diary is an excellent mirror of this fact. His main pleasure was always of an intellectual nature; he was not attracted to playing with other children or to practicing various sports.
His mother taught him to count and to make calculations, and only afterwards to read and to write. The most important part of the learning process took place at home, with his parents and sometimes with his brothers and his sister, all of whom became intellectuals. Moisil's diary should be known by the children of the new generations. The attitude of the child Moisil towards learning remained his attitude towards life. He enjoyed interacting with people, reading, writing and learning. He did all these things with pleasure and with humor; he was able to infect people with his capacity to discover something new where most people see nothing new, and to invent questions where most people believe that everything has already been answered. His attraction to mathematics did not diminish his interest in the other school disciplines. His curiosity was total and remained total throughout his whole life. But his passion for mathematics and his way of understanding mathematics led him to see the world through the glasses of mathematics; mathematics was not for him only a profession, a job, it was a way of looking at life and at the universe. When Moisil had to become a university student, choosing mathematics was equivalent, for most people, to choosing engineering, so his parents pressed him to choose the Polytechnical School, where he became a student in 1924; but one year earlier he had also become a student of the Faculty of Science, mathematical section. He did not like engineering, so he never finished the Polytechnical School, but the irony of life decided that Moisil would return to engineering much later, when he discovered that logic and engineering share some very important features. On the other hand, the university student Moisil attended many classes in philosophy, history and art.
His teachers were Ţiţeica, Lalescu, Davidoglu and others, but the university teacher who impressed him most and became his spiritual model was Dimitrie Pompeiu. His Ph.D. thesis (1929) was about the analytic mechanics of continuous systems. As a characteristic feature of his works published in the twenties and thirties of the past century, we observe that most of them lay at the intersection of differential equations (with partial derivatives), differential geometry, function theory and mechanics. The main reason for this situation was that Moisil was devoted to the topics most considered by his predecessors and colleagues (Pompeiu, with his areolar derivative; Vrânceanu, with his non-holonomic varieties; Nicolae Teodorescu, with his papers on Finsler spaces, etc.), but he also paid attention to the work of some great western mathematicians of his time: Élie Cartan, W. Blaschke, and mainly Vito Volterra and Jacques Hadamard. Having the opportunity to work with Volterra (1931-1932) in Rome, Moisil became a pioneer of functional methods in differential geometry and mechanics (let us recall that Volterra was one of the initiators of functional analysis). But before this, Moisil was in Paris, working with great French mathematicians such as Hadamard (1930-1931); he spent a second period in Paris after returning from Rome. In this atmosphere, Moisil obtained a generalization of Volterra's conjugate functions, and he also generalized Hadamard's totally geodesic varieties. From the same period, let us recall the joint paper by Moisil and Teodorescu on holomorphic functions in space (1931), the first joint work in Romanian mathematics. Joint works remained a rare phenomenon until the middle of the past century. On November 1, 1932, Moisil was named provisional associate professor of algebra at the University of Iasi. At that moment, Moisil was already the author of an important number of scientific papers, but none of them was in the field of algebra.
His first algebraic paper (in the field of non-associative algebra) was published only in 1934, so the appointment came as a surprise. But for a good observer, there was nothing surprising in this fact. Behind the diversity of topics, Moisil was, in most of his mathematical papers, mainly an algebraist; his genuine thinking was almost always of an algebraic nature. Into everything he did, he projected an algebraic spirit. Within the framework of his traditional preoccupations with analysis, geometry and mechanics (monogeneity, function theory, geodesics in some Riemann spaces, partial differential equations, etc.), he worked with algebraic tools such as functional groups, parametric groups, monogenic quaternions, hypercomplex numbers, ideals of polynomials, areolar polynomials, polynomials associated with some bilinear differential expressions with constant coefficients, the interpretation of the fundamental group of a differential variety, etc. To the above we have to add an event that played an important role in Moisil's life: the publication, in the first part of the fourth decade of the past century, of van der Waerden's book "Moderne Algebra". It was an important sign of the move of algebra from the quantitative to the qualitative, from the algorithmic to the structural phase of its evolution. It was a turning point in Moisil's mathematical life, an event with a huge impact on his further evolution. In this order of ideas, we should observe that Moisil was the first to introduce Bourbaki's ideas in Romania, at a moment when Bourbaki's mathematical structuralism, strongly influenced by the German school of structural algebra, was only a project. In his course of general analysis (Analiza generală), published towards the end of the thirties of the past century, Moisil gives an account of Henri Cartan's theory of filters, and uniform spaces were also presented there.
But Moisil was not the first to teach modern structural algebra in Iaşi; he was preceded in this respect by Vera Myller. On the other hand, Emmanuel and Lalescu had also, at some moments, the opportunity to teach some ideas on groups and fields. However, it seems that Moisil was the first to give this structural style its whole amplitude. Only Barbilian would go further in this respect, but that happened in the second half of the forties of the past century. Moisil became full associate professor of algebra on January 1, 1935, but on November 1, 1936 he obtained a position of professor of differential and integral calculus, and then full professor of calculus in November 1939, all of them at Iaşi University. Moisil spent ten years at Iaşi University (1931-1941). It was a period in which he alternated his old interests in continuous mathematics, with applications to mechanics and physics, with his new interests in discrete mathematics, mainly in algebra and logic. Moisil was very impressed by an article by the Polish logician Jan Lukasiewicz on three-valued logic (the interest later moved to several values) and by the analogy proposed by Lukasiewicz between non-classical logics, on the one hand, and non-Euclidean geometries, on the other. As the initiator of three-valued logic, Lukasiewicz was considered the Lobachevsky of logic. There was one more reason for Moisil to be attracted to logic: his philosophical interests led him to pay great attention to the philosophical consequences of quantum mechanics, where Aristotelian logic is no longer valid. Indeed, the principle of the excluded middle is here replaced by a principle of the included middle. In the thirties of the past century, important authors, such as John von Neumann, paid attention to the logic of quantum mechanics. The principle of universal determinism was also called into question; certainty is sometimes replaced by probability.
Moisil and some of his colleagues (Onicescu, Procopiu, Barbilian, etc.) organized debates on this topic, and reading his writings in this respect one can understand how this interest motivated him to turn his attention towards mathematical logic, a field in which he published his first paper in 1936, in a volume devoted to the 75th birthday of Vito Volterra. He would never leave this field, but concomitantly he also remained faithful to his old interests. Only in the last decade of his life was he devoted exclusively to discrete mathematics. But, being first of all an algebraist, Moisil projected into his studies in logic the same algebraic spirit. His main project was to build for Lukasiewicz's many-valued logic an algebraic framework, in a way similar to the way George Boole had proposed, in the 19th century, an algebraic model for Aristotle's logic (based on the principles of identity, non-contradiction and the excluded middle). Moisil called this framework "Lukasiewicz algebras", but ultimately these algebras received the more appropriate name "Lukasiewicz-Moisil algebras"; this name became the title of a monograph published by the North-Holland publishing house by a team led by Professor Sergiu Rudeanu. His move from Iaşi to Bucharest University, at the end of the year 1941, was a dramatic one. Not only Moisil, but also Vrânceanu, Barbilian and Miron Nicolescu were competing for the same position of professor. Among them, Moisil was the youngest and had the smallest chance to win. The winner was Vrânceanu. Then Moisil had the happy idea of convincing the Minister of Education, Ion Petrovici, to create three different chairs, and so all of them became professors; it was the great chance of the next generation (that of the author of these lines) to benefit from such great professors. After the Second World War, Moisil had many interesting initiatives.
One of them was related to an idea proposed by Shannon in his master's thesis, and independently by some Russian engineers, to associate electric circuits with binary logic, because each of them works with two values: yes or no in logic, open or closed for the circuit. Moisil succeeded in developing this idea in many variants, stimulating a whole team of researchers to articulate engineering, classical and non-classical logics, and various types of algebraic structures with some ideas from number theory. Another fruitful idea was to associate matrices with some systems of linear partial differential equations. But perhaps more important was the way Moisil understood, in the early fifties, that the emergence of the new paradigm of information, communication and computation could change to a large extent the social, cultural and scientific life of the next decades. In 1949, he initiated a whole school in the field of the algebraic theory of automatic mechanisms; as a professor at Bucharest University, he was the first to teach mathematical logic there. Articulating logic and automata, Moisil was well prepared to organize the Romanian development of the emerging field of computer science. He monitored the building of the first Romanian computer, by Victor Toma, at the Institute of Atomic Physics, in 1957, and, on the other hand, he organized courses in the field of computation at the Faculty of Mathematics, University of Bucharest. He also directed the first cohort of students in mathematics to work with the team of Victor Toma at the Institute of Atomic Physics; they were trained to learn programming on the new CIFA computers (Calculatorul Institutului de Fizica Atomica). The first Romanian team of mathematicians included Dragos Vaida, I. Moldovanu, Gh. Zamfirescu and G. Klarsfeld. So we can say that 1957 is the date of birth of Romanian computer science, under the guidance of Professor Moisil and through the collaboration between engineers and mathematicians.
In 1962, Moisil initiated a new section, "computing machines", at the Faculty of Mathematics and Physics of the University of Bucharest and, associated with this section, the Computing Center of the University of Bucharest (CCUB); as a matter of fact, CCUB was under the guidance of the chair of algebra, whose head was Moisil. In 1963, CCUB was endowed with the computer CIFA 3, the third version of the first Romanian electronic computer, and with an analog computer of MEDA type. Moisil was very active in preparing the corresponding mathematical background: the learning of ALGOL 60, the organization of a seminar on the algebraic theory of automatic mechanisms (started in 1954), and the organization of a course on logic applied to electric circuits and of a seminar on mathematical logic (started in 1966); numerical analysis and combinatorics were also stimulated. In a further step came mathematical biology and mathematical linguistics, as well as perspectives of computation in various fields of the humanities: history, archeology, musical composition, etc. In 1968, CCUB was endowed with a third-generation IBM 360/30 computer, and the teaching of FORTRAN and COBOL was introduced. CCUB became a place where people from all cultural horizons came to learn from Professor Moisil how they could profit from using mathematical and computational thinking in their own fields. Lawyers and musicians (among them, Aurel Stroe), engineers and economists, linguists and philosophers, biologists and medical doctors, painters and writers visited CCUB, and the main reason for this was the presence there of Professor Moisil, who had the gift of leaving aside the mathematical jargon and addressing non-mathematicians in the simplest possible language; and in this simple language he was able to explain the mathematical and computational way of thinking. Moisil's capacity to seduce and fascinate his audience very quickly became an element of attraction for the mass media.
Newspapers, radio and television began to invite him, and Moisil became a star whose fame competed with that of the most popular singers and actors. His unique voice, his way of transforming speech into song, his spontaneity, his humor and his permanent state of joy made Moisil so popular that even today, 33 years after his death in May 1973, his statements are still in the public's attention. "New ideas appear first as paradoxes, then they become common truths, and ultimately they die as prejudices"; "Is logic a practical science? Yes, because you learn from it how to take decisions"; "You lose a lot of time when you believe that you know what in fact you don't know". Many of his jokes have a mathematical structure. Here is a joke illustrating recursive thinking: "Every man has a right to a glass of wine; but when you drink a glass of wine, you become another man" (the corollary: every man has a right to infinitely many glasses of wine). A joke illustrating self-reference: a child asked him, "Professor Moisil, do you like dreams?" "Yes, I once had a dream in which I was sleeping during a meeting, and when I woke up I really was in a meeting." Jokes having the same pattern, and thus able to be produced algorithmically: "The water is bad, even in the shoes"; "Everything can be proved, even the truth"; "You can fall in love with any woman, even with your wife"; "Every joke can put you in the best mood, even the above ones". All these jokes have as a common denominator the confusion between normality and exceptionality: the water in the shoes, the proof of a truth, being in love with your wife. As soon as you understand the pattern, you can produce infinitely many jokes of a similar type. On 11 February 1971, Moisil sent a letter to the rector of Bucharest University, proposing a whole program for the organization of education in the field of computers and their mathematics.
In January 1973, he sent another letter, to the dean of the Faculty of Mathematics, in which he explained that this faculty bears a great responsibility for the formation of teachers of computer science with a solid mathematical background. Moisil explained that the main job in this respect is to assure the computational literacy of the coming generations, because in the emerging era of information and computation all professions will need, in some way, familiarity with computing and programming. Very few people were aware at that moment of this truth, which today is obvious. In this respect, we can consider Moisil a kind of Spiru Haret of the second half of the past century: Haret was an important fighter against illiteracy, while Moisil was an important fighter against computational illiteracy. Owing to his multiple interests, Moisil succeeded in forming many disciples in various directions: in the mechanics of solids (Nicolae Cristescu, P. P. Teodorescu, M. Predeleanu, George Dinca), in logic (Sergiu Rudeanu, George Georgescu, Afrodita Iorgulescu), in computer science (Dragoş Vaida, Constantin Popovici, Paul Constantinescu), in the logic of electric circuits (L. Livovschi), in algebra, in analysis, in differential geometry. If the child Moisil displayed a total curiosity, the same totality characterizes the creative work of the adult Moisil. Within mathematics, he interrelated all its domains; beyond mathematics, he looked for the ways mathematics may have an impact on the natural and social sciences; beyond science, he questioned the relation between mathematics and philosophy, between mathematics and art; beyond culture, he was interested in the impact of mathematics on everyday life. Scholars are of two types: the ant type, looking at what happens in a specific area of knowledge and trying to deepen more and more the respective segment of investigation; and the bee type, going from flower to flower and frequently changing the area of investigation.
Obviously, Moisil was of the second type. But, looking more attentively at his behavior, we realize that he was sometimes of a mixed type, because he liked to go back to flowers already visited. I remember his renewed interest, in the sixties, in the problem of mechanics he had discussed long before in his PhD thesis. After the Second World War, his growing interest in discrete mathematics was concomitant with the continuation of his work in continuous mathematics. His general strategy was to trust the unity of mathematics and the potential solidarity between its different parts, including the case when these parts seem completely remote from each other. For this reason, he used to oblige his PhD students to pass examinations on chapters of mathematics apparently very far from the respective student's object of direct interest. This is why most of his papers combine different branches of mathematics. Another interesting feature of Moisil's works is their strong link with the works of his colleagues and of his professors. A typical example is his papers on geometry. Arriving in Iaşi at the end of the year 1931, when he was only 26, but with his thesis already published by Gauthier-Villars (Paris, 1929), Moisil found an adequate atmosphere in the seminar led by Al. Myller, predominantly concerned with differential geometry. At that moment he was considered, in view of his already published works, the founder of the theory of infinite-dimensional Riemann spaces. To this he added the study of infinite Lie groups and of mechanical systems with infinitely many degrees of freedom. But the geometric methods used by Moisil were for him a tool to investigate the mechanics of systems of material points with infinitely many degrees of freedom. For him, the respective geometric model consists of infinite-dimensional subvarieties in an adequate Hilbert space.
In other situations, he is oriented towards the geometrization of systems of partial differential equations. So analysis, geometry and mechanics form an organic mathematical entity. See, for more in this respect, the article by Acad. Radu Miron in "Academica" (forthcoming, 2006), from which we have borrowed some elements. In order to illustrate the style of Moisil's work in the field of mathematics and the humanities, we will indicate some ideas he developed in what he called "the mechanical grammar of Romanian". He proposed some new classifications of Romanian nouns and verbs. Two ideas deserve to be mentioned. The first concerns the possibility of using, in the declension of nouns and the conjugation of verbs, what he calls the method of variable letters, by means of which he copes, in a very elegant way, with the phenomenon called in linguistics "morphological alternances". The variable letters are like functions defined by means of two or several analytic expressions, each of them valid on a specific part of the domain of definition of the function. Another idea proposed by Moisil concerns the conjugation of verbs, where a classification is made according to the behavior of the pluperfect (mai mult ca perfectul). Another paper offers a comparative analysis between the linguistic conjunction "and" and the mathematical conjunction "and", in the case of the Romanian language, though things remain valid, to a large extent, for other languages too. For instance, it shows the contrast between the possibility of iterating indefinitely the use of "and" in logic and the impossibility of doing the same in linguistics. To his own works in this respect, Moisil added his capacity to stimulate and to guide the first steps in the development of mathematical and computational linguistics in Romania.
He guided the first algorithm of automatic translation (English–Romanian), by Erica Nistor, the similar work by Minerva Bocsa in Timisoara, the work done by the team led by P. Schveiger in Cluj-Napoca, and the work done by Eliza Roman in the field of automatic abstracts and automatic documentation. The author of these lines remains indebted to Moisil for his major help, jointly with the linguist Alexandru Rosetti, concerning the first steps of mathematical and computational linguistics in Romania. Moisil and Rosetti made possible the organization of the first university courses with this profile; they founded "Cahiers de linguistique théorique et appliquée", a journal of an interdisciplinary nature, devoted to the interferences among linguistics, mathematics, computer science and poetics. The remarkable fact, in these articles, is Moisil's capacity to develop the mathematical way of thinking in the absence of the usual mathematical jargon of formulas, equations and calculations; in these texts he never leaves natural language. Another aspect of Moisil's personality can be seen in his philosophical writings. Mathematics and philosophy were for him two faces of the same coin, each requiring the other. Already during his childhood and adolescence, his interrogative nature and his readings prepared the way towards his philosophical personality. He read Poincaré and selected in his diary statements such as: "Science deserves to be studied for the glory of the human spirit" and "for the enormous pleasure offered by the knowledge of truth", more than "for its practical utility". Somewhere he notes: "Life is a work of art. It is a pleasure to think of what happened sometimes in the past and will never happen again!" In another place he notes: "Pleasure is more attractive when you are looking for it than when you feel it". There are also many remarks related to his readings in the fields of history, literature, the natural sciences, religion, etc.
All of them were written when he was between 7 and 14 years old. Here is a comment about the cause of wars: "If we cancel the fights having no rational motivation, only a few wars remain in the memory of history". A word of wisdom: "You don't have to exaggerate in love, because you risk ending by exaggerating in hate". The articles in the field of philosophy deserve special attention. The first one, chronologically, was published in 1937 and concerns the successive steps in the development of mathematical knowledge. Moisil was at that time under the strong influence of the ideas of Vito Volterra, who paid attention to the way mathematical knowledge is born from pre-mathematical knowledge. In a first step, qualitative descriptions are converted into quantitative ones, by means of measurement and counting. Then the mathematics of quantity is developed, until the moment when qualitative aspects come again to attention. The developments of physics and of mathematical analysis are followed concomitantly. The mechanical stage, the energy stage and the Einsteinian stage of the generalized relativity theory are analyzed. Then he directs his attention towards the development of qualitative mathematics, the main example here being the notion of a group, observed in the particular cases of the group of rotations and the group of permutations. We are led in this way to what Moisil will call structural mathematics, and the way is prepared to connect it to the general emergence of structuralism (in psychology, in linguistics, in anthropology, and before them in chemistry, with the idea of isomerism). Moisil was among the first to observe the big change brought by the new fields of topology, functional analysis, combinatorics, graph theory, mathematical logic and abstract algebra. In the same way, Moisil stresses the importance of the theory of complex numbers and of their structural aspects.
Moisil analyzes the way classical infinitesimal calculus led to general topology. The idea of a differential is both quantitative and structural (having the structure of a polynomial). Logistics is a clear example of the reduction of quantity to structure, and this is the task of Russell and Whitehead's "Principia Mathematica". Starting from the contrast observed by other authors between the quantitative aspect of mathematics and the qualitative aspect of the acts of thinking, Moisil observes that the acts of thinking have a structural rather than a qualitative aspect, so the mathematician's job is to investigate the algebraic structure of the acts of thinking. So, concludes Moisil, "ce n'est pas à l'ancienne logique de la qualité qu'on devra s'adresser, mais à la nouvelle algèbre de la structure. Ce n'est pas en effet trop tôt si on essaie de construire une théorie cohérente de la vie spirituelle" (it is not to the old logic of quality that one should turn, but to the new algebra of structure; indeed, it is not too soon to try to construct a coherent theory of spiritual life). In his "La logique formelle et son problème actuel" (1939), Moisil investigates the principles of classical logic (identity, contradiction, excluded middle) and their modifications in Brouwer–Heyting intuitionistic logic, in Kolmogorov's intuitionistic logic and in Lukasiewicz's ternary logic; he also discusses the very nature of axiomatic deductive systems, with special attention to Hilbert, Russell and Poincaré. This investigation is continued in "Sur l'autonomie des mathématiques" (1941), where he uses the term "panmathématisme", understood as the process of approaching mathematically the natural sciences and the sciences of the human spirit (sciences de l'esprit). In this order of ideas, he characterizes mathematical activity as irreducible, i.e., independent of empirical investigations and of any previous rational development. This is for him the autonomy of mathematics. Moisil meets, in this respect, the way the old Greeks (Pythagoras, Plato), then Kant and Goethe, conceived mathematics (for Kant, mathematics is what is called in German a Geisteswissenschaft).
The utilitarian function of mathematics is in most cases a consequence of its cognitive function, but the temporal distance between the cognitive moment and the utilitarian one is usually unpredictable. We stress this fact because, in his writings after 1950, in view of the ideological constraints, he no longer stated explicitly the autonomy of mathematics, but he strongly defended the need to develop pure mathematics, mathematics for its own sake, as a condition for reaching applied mathematics. Moisil considers that the human spirit can reach the center of a deductive discipline; mathematics can bridge the self and the non-self. He is consistent with the idea, emergent much later, according to which the subject–object distinction will no longer be considered as sharp as in classical science; consequently, the distinction between the natural and the human sciences is called into question. As a matter of fact, mathematics is for Moisil par excellence a human discipline. The last philosophical article he published before the communist regime was his closing lecture at the University of Iaşi (16 January 1942): "The Perspectives of Axiomatic Philosophy". We learn from this lecture that Romanian scientists paid great attention to the philosophical problems of their science, mainly those related to non-Euclidean geometries, to relativity and to quantum mechanics. Philosophers like Ion Petrovici, physicists like St. Procopiu, mathematicians like Victor Valcovici, Simion Stoilow, Octav Onicescu, Dan Barbilian and Gr. C. Moisil were involved in hot discussions on these topics. Ultimately, Moisil stresses the human side of mathematics. "The axiomatic freedom of mathematics does not fit with something similar in real life", observes Moisil, and this premonitory statement is followed by other similar ones: "Human creative work is par excellence one of expression. The deep human desire is to be understood by other people.
That expression which succeeds in being communicated is just what we call culture." Will this communication remain possible? Such thoughts, expressed in the hot year 1942, when the war knew dramatic changes and the future became increasingly unsure, are the thoughts of Moisil at the moment when his Iaşi period ends and a new university life begins in Bucharest. At some moment, Moisil makes reference to the "Gödel paradox". Clearly, he has in view the famous 1931 Gödel incompleteness theorem, but it is also clear that this theorem had to wait before being understood in its deep meaning and its huge consequences for the whole of mathematics. This was not only Moisil's shortcoming, but a shortcoming of the quasi-totality of the mathematical community. During the summer of 1942, Stoilow and Barbilian carried on a daily correspondence about the foundations of mathematics, yet a reference to Gödel's 1931 theorem did not appear in their dialogue. Sometimes Moisil comes very near to ideas that emerged much later. For instance, in "Determinism si inlantuire" (1940), he refers to Georges Bouligand in connection with the fact that small changes in the initial data may have a big impact on the further development of a phenomenon. "Throwing a small stone may have some influence on the movement of the sun", observes Moisil. Does this not anticipate the modern "butterfly effect"? In another place, he refers to the involvement of continuous nowhere-differentiable functions in the study of Brownian motion. So chaotic systems and fractals are suggested to a contemporary reader. Let us recall that several decades later Brownian motion would be recognized as a strange fractal (one whose Hausdorff dimension is an integer). In one respect only, Moisil failed: he did not succeed in organizing his life so as to make it more efficient. Moreover, he did not try to. He died at 67, when his head was still full of ideas and projects.
He did not know how to alternate work and rest, how to pay attention to his health. I remember my last meeting with him. It was on an evening in the spring of 1973. We left the university together and walked in the direction of his home, Armeneasca Street, 14. When we arrived at his house, he told me: "Do you know that Sudan is the real author of the first example of a recursive function that is not primitive recursive?" (All treatises of mathematical logic claim that the author of such an example is W. Ackermann.) Very interesting, I said; where did you find this? Then Moisil said: "It is too late now; I will tell you this on a next occasion." This "next occasion" never arrived. A few days later Moisil left for Canada, where he died on 21 May 1973. Today is May 18; in three days we can say that Moisil died just 33 years ago. But my curiosity to find out what was behind the mysterious message left to me by Moisil obliged me to carry out the respective research. Together with two young students, Cristian Calude and Ionel Tevy, we went through, piece by piece, all the published papers of Gabriel Sudan. None of them had in its title or in its introduction anything suggesting the presence of such an example. It is perhaps a rule of nature that the more interesting something is, the more hidden it is and the more effort we need to discover it. This was the reality: the respective example was hidden in the last part of an article explicitly concerned with a problem having nothing to do with a recursive function that is not primitive recursive. Clearly, Sudan was not aware of the fact, like Molière's hero Jourdain, who remains a symbol of such situations. I had the happy opportunity to edit a part of the scientific work of Moisil, in three volumes published by the Publishing House of the Romanian Academy. Other writings of Moisil were also published after his death, some of them under the care of Viorica Moisil.
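For the curious reader, the mathematical content of the Sudan episode can be made explicit. Below is a short Python sketch of the Sudan function as it is usually reproduced in the literature on this story (the definition is the standard modern one, not a quotation from Sudan's original paper):

```python
from functools import lru_cache

# Sudan's function: total and computable (recursive), but growing too fast
# to be primitive recursive, like Ackermann's better-known example.
#   F_0(x, y)     = x + y
#   F_{n+1}(x, 0) = x
#   F_{n+1}(x, y) = F_n(s, s + y)  where s = F_{n+1}(x, y - 1)
@lru_cache(maxsize=None)
def sudan(n: int, x: int, y: int) -> int:
    if n == 0:
        return x + y
    if y == 0:
        return x
    s = sudan(n, x, y - 1)
    return sudan(n - 1, s, s + y)

# Already at level 1 the growth is exponential:
# F_1(x, y) = 2**y * (x + 2) - y - 2, e.g. F_1(0, 3) = 11.
```

Even small arguments at level n = 2 or 3 produce astronomically large values, which is exactly the behavior that separates such functions from the primitive recursive ones.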
I edited his "Lectii despre logica rationamentului nuantat" and his articles in the newspaper "Contemporanul", under the label "Stiinta si umanism". The articles published in "Viata economica" were collected in a small book, "Indoieli si certitudini". Now we have in front of us the duty to publish his papers on the "mechanical grammar of Romanian" and his philosophical papers. Moisil's heritage belongs to Romanian culture, and the new generations deserve to know this unusual personality.

Solomon Marcus
Romanian Academy
E-mail: solomon.marcus@imar.ro

Editor's note about the author: Solomon Marcus, member of the Romanian Academy. Born March 1, 1925, Bacău, Romania; elementary and high school in Bacau. Diploma of Merit in mathematics, University of Bucharest. Assistant professor (1950), lecturer, associate professor, then professor (1966) at the Faculty of Mathematics and Computer Science, University of Bucharest. PhD in mathematics 1956, Doctor in Science 1967, corresponding member of the Romanian Academy 1993, full member of the Romanian Academy 2001. Research and teaching in mathematical analysis, theoretical computer science, linguistics, semiotics, poetics, and the history and philosophy of science, fields in which he has published about 50 books in Romanian, English, French, German, Italian, Spanish, Russian, Greek, Hungarian, Czech and Serbo-Croatian, and about 400 research articles in specialized journals in almost all European countries, the USA, Canada, South America, Japan, India, New Zealand, etc. More than 1000 authors have quoted his works. He is recognized as one of the initiators of mathematical linguistics and of mathematical poetics. He has given hundreds of invited lectures at international scientific meetings and serves on the editorial boards of several tens of international scientific journals.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 953-960

Using QSPS in Developing and Realization of a Production Line in Automotive Industry

N.
Tudor, V. C. Kifor, C. Oprean

Nicolae Tudor, Continental Automotive Systems, Test Department, e-mail: nicolae.tudor@continental-corporation.com
Vasile Claudiu Kifor, "Lucian Blaga" University of Sibiu, director of the research department
Constantin Oprean, "Lucian Blaga" University of Sibiu, rector

Abstract: Using the QSPS (Quality System for the Production Software) for industrial projects has led to the accurate running of the production line from the SOP (start of production) onwards. This paper presents how QSPS was applied at one of the strongest European automotive companies. By using this system, significant cost savings and quality improvements can be observed. The paper shows, step by step, how to use QSPS for the integration of a production line into the traceability system of a big company in the automotive industry. The production line involved contains 56 production stations, through which the product has to pass before being packed and delivered to the customer. The line is controlled by this traceability system, so the impact of the system on the quality of the product is very high. The structure of the system contains 7 steps, all of which are followed and executed in each environment (test, pilot and production).
Keywords: quality improvement, control, savings, efficiency, capability.

1 Introduction

Traceability software for production lines becomes more and more important and is a very significant process when running complex production lines with a high degree of difficulty. Customer claim handling, statistics, process control, Cpk (process capability index) studies, FPY (first pass yield) and PPM (defective parts per million) reports are all produced very simply when based on a strong traceability system.
Therefore, many techniques for implementing traceability software have been tested and checked, some of them successful and some less so, depending on the kind of project. The paper below describes the practical usage of the QSPS system in a real situation: the implementation of the traceability software of a complex line containing 56 stations. The QSPS system should improve the quality of work and save cost and time on the one side; on the other side, the system should be very simple and easy to use, in order to simplify work activities rather than complicate them. The system should also cover and prevent all the risks which can appear in all 3 QSPS phases (test, pilot and production) because of non-conforming or improper software. (Copyright © 2006-2010 by CCC Publications.)

Figure 1: QSPS framework

The next chapters describe, step by step, the implementation strategy of the traceability software for the specified production line, based on QSPS. All of the steps are described in the test (model) system, which has to be implemented and built up as close as possible (or even identical) to the characteristics of the production system. The steps should be followed in all 3 environment systems of QSPS (test, pilot and production).

2 Step I: time and cost evaluation

This phase of the project should help to understand the depth of the project, its difficulty level and its cost, to make the needed resources (time and headcount) clear and transparent, and to define a due date very close to reality. Therefore the project should be split into work packages, which can be better followed up, controlled and evaluated. Table 1 shows how the project was divided into work packages and subprojects.
Table 1: project segmentation (work package, with difficulty level in parentheses)

• process definition (2)
• process description (2)
• process review (2)
• technical requirements specification (3)
• target specification (3)
• software implementation concept (3)
• programming (3)
• documentation of the software (2)
• installation and configuration of the 56 modules (2)
• testing the 56 modules (2)
• advising and user support (2)
• project evaluation, i.e. the time for the evaluation itself (2)
• internal software release: compliance to technical requirements (3), compliance to the software ISO norms (2), calibration and adjustments / bug solving (3), capability measurements (2), usage work instruction (2), user manual (2), ergonomics and design (2)
• customer software release: efficiency analysis (3), defect analysis (3), corrective actions (2), analysis after corrective actions (2), customer release reports (2)
• validation and verification of the implemented software package: lifetime test (3), test procedure for modification issues (2), approval procedure (2), final releases (2)
• functional follow-up of the software during production: software audit (2), statistical rate of the occurred errors (2)
• customer claims: problem recording (1), analysis and solving procedure (2), analysis report (1)

With this information the time evaluation can be done more accurately using the PERT algorithm. The results of the resource, time and cost evaluation can be observed in Figure 2.
Figure 2: project time and cost evaluation ("traceability project of production line", 24.11.2009). For every work package of Table 1, the optimistic (Vo), most likely (Vn) and pessimistic (Vp) durations in hours are combined into the PERT expected duration Va = (Vo + 4·Vn + Vp)/6; the deviation column matches the PERT variance ((Vp − Vo)/6)², and the cost in EUR follows from the expected duration and the difficulty level. Totals: Vo = 626 h, Vn = 705 h, Vp = 717 h, Va = 693.8 h, deviation 21.2, overall cost 93,549 EUR.

3 Step II: internal software release

This step is the first evaluation of the software package from the quality point of view. At the same time, it is the first control tool used to signal whether the software was done in the right way, according to the requirements and the quality norms, and, not least, to assure the production of qualitative products. Therefore each of the points below has to be verified.

3.1 Compliance to customer specification

This check has to be done based on the target specification; each step from that document must be checked. As an aid, a check table can be used and the characteristics from the target specification should be proved. For the line in our example the following characteristics were verified:

• program flow;
• special cases in real life (like emergency stop, current interruption);
• fail / pass / scrap situations;
• handshake protocol between the industrial equipment and the traceability software;
• communication syntax;
• repair scenarios;
• limitation of the number of repair processes;
• check-in / check-out process;
• process parameters;
• process results, Cpk measurements;
• statistics of the process;
• statistics of the failures;
• product logistics on the line (process flow);
• line flow control;
• back-flushing to the main database, in order to administrate material stocks;
• label scanning;
• label syntax;
• packing: check that the product passed all the processes in the line before being packed;
• quality alert;
• alert notification.
3.2 Check the implemented software package for software techniques and standards

For this point ISO/IEC 9899, the standard for the C programming language, was used; the source code was reviewed based on its Technical Corrigendum 1:2001. The following criteria were also inspected from the very beginning:

• is it the right software?
• is the target reached?
• is the interface clear and unambiguous to the users?
• is at least one diagnostic message produced?
• was the programming editor used correctly (tabulators, empty spaces, etc.)?
• requirements resulting from the coded character set;
• requirements resulting from binding techniques;
• comments in the source code;
• documentation;
• usage documentation.

3.3 Software calibration and adjustments (bug solving)

After the two checks above were carried out and all of the situations were verified, all bugs could be resolved — bugs which can occur if the programmer does not consider a state of the program logic.

3.4 Capability measurements (using the Pareto distribution, Cpk, ISO 15504)

This measurement was done already in the test system. Production units can be created virtually for a good simulation. Knowing the handshake protocol between the traceability system and the industrial equipment, a simulation can be run very close to the real situation, and its results can be compared with the real running of production. The upper, respectively lower, limits are in this case the cycle times of the process admitted by the customer while processing a production unit. Cpk was computed as described in Table 2; the Pareto distribution for the analysis of the software package in the case of rejected software is given in Table 3 and Figure 3.
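The Cp and Cpk values of Table 2 below follow the usual process-capability formulas; a minimal Python sketch, taking the limits LSL = 31 and USL = 36 from the table header:

```python
import statistics

def capability(samples, lsl, usl):
    """Process capability indices from measured cycle times."""
    mu = statistics.mean(samples)
    sigma = statistics.stdev(samples)            # sample standard deviation
    cp = (usl - lsl) / (6 * sigma)               # potential capability
    cpk = min(usl - mu, mu - lsl) / (3 * sigma)  # capability incl. centering
    return cp, cpk

# Measured cycle times of the 30 production units from table 2.
times = [33.5, 33.2, 34.1, 33.9, 33.0, 34.3, 33.2, 32.9, 33.8, 33.6,
         33.7, 33.6, 32.9, 33.1, 33.8, 34.3, 33.9, 33.6, 33.2, 33.3,
         32.7, 33.5, 33.8, 34.0, 33.6, 33.0, 33.8, 32.9, 33.4, 33.9]

cp, cpk = capability(times, lsl=31, usl=36)
# cp ≈ 1.91 and cpk ≈ 1.90, reproducing the bottom rows of table 2
```

Values above roughly 1.33 are conventionally taken as a capable process, so the measured cycle times sit comfortably within the customer limits.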
Unit ID  Time (s)   Unit ID  Time (s)   Unit ID  Time (s)
1        33.5       11       33.7       21       32.7
2        33.2       12       33.6       22       33.5
3        34.1       13       32.9       23       33.8
4        33.9       14       33.1       24       34.0
5        33.0       15       33.8       25       33.6
6        34.3       16       34.3       26       33.0
7        33.2       17       33.9       27       33.8
8        32.9       18       33.6       28       32.9
9        33.8       19       33.2       29       33.4
10       33.6       20       33.3       30       33.9

Lower limit: 31; upper limit: 36 (identical for all units). Average: 33.5167; deviation: 0.4356; Cp: 1.9132; Cpk: 1.9005.

Table 2: Cpk calculation

3.5 Software ergonomics and design

Here we relied on ISO 9241, a standard for the ergonomics of human-system interaction. The following rules were verified:
• suitability for the task (the software interface should be suitable for the user's task and skill level);
• self-descriptiveness (the interface should make it clear what the user should do next);
• controllability (the user should be able to control the pace and sequence of the interaction);
• conformity with user expectations (it should be consistent);
• error tolerance (the software should be forgiving);
• suitability for individualization (the software should be customizable to suit the user); and
• suitability for learning (the software should support learning).

4 Step III: Customer software release (Run@Rate)

For this step we run the line with 100 production units (in the test system we simulate the running of the line with virtual units) and observe the behaviour of the software and of the users (operators) while using it in different situations; extreme situations (such as an emergency stop of the equipment, a power interruption, etc.) are caused intentionally for a better software rating. A statistic is kept for each piece of equipment, including the packing station; it contains the number of parts processed on each station during one hour. A Pareto analysis similar to Figure 3 was also produced.

Software return reason            Count   Percent of total   Cumulative percent
Not compatible                      21         21.6               21.6
Does not perform as expected        18         18.6               40.2
Missing hardware resources          15         15.5               55.7
Not suitable for self-learning      11         11.3               67.0
Missing user manual                 10         10.3               77.3
Bad user interface                   8          8.2               85.6
Too complicated                      7          7.2               92.8
Bad cycle time                       4          4.1               96.9
Bad backup of recorded data          3          3.1              100.0
Total                               97        100.0

Table 3: Pareto analysis of the software return reasons

Figure 3: Pareto distribution of the software return reasons (percent of total and cumulative percent)

The following analyses were also carried out:
• efficiency analysis (how many parts per hour per equipment);
• defect (bug) analysis;
• causes and corrective actions;
• re-evaluation procedure after corrective actions;
• creating reports (such as an 8D report).

5 Step IV: Validation and verification of the implemented software package after release

After the customer has agreed to the implemented software, the final release is made by the project manager. This means that the software is officially registered as production software.
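The percentage columns of a Pareto analysis such as Table 3 follow mechanically from the raw counts; a minimal sketch, using the return-reason counts from Table 3, could look like this.

```python
# Return-reason counts from Table 3 (total 97), already sorted in
# descending order as required for a Pareto analysis.
reasons = [("not compatible", 21), ("does not perform as expected", 18),
           ("missing hardware resources", 15), ("not suitable for self learning", 11),
           ("missing user manual", 10), ("bad user interface", 8),
           ("too complicated", 7), ("bad cycle time", 4),
           ("bad backup of recorded data", 3)]

total = sum(n for _, n in reasons)

# Build (reason, count, percent of total, cumulative percent) rows.
cumulative = 0.0
rows = []
for name, n in reasons:
    pct = 100.0 * n / total
    cumulative += pct
    rows.append((name, n, round(pct, 2), round(cumulative, 2)))

for row in rows:
    print(row)
```

The cumulative column necessarily ends at 100 percent, and the first few reasons carrying most of the total is exactly what the Pareto chart in Figure 3 visualizes.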
At this level the project manager analyses the robustness of the software package by investigating the following properties:
• Lifetime (how long the software will run without any modification or maintenance), following ISO 12207:
– acquisition analysis (how the software was required);
– supply analysis (how the software was provided);
– how it was developed;
– what the operating mode is;
– how much maintenance is needed.
• Modification handling (how modifications will be made and tested);
• Documents such as the PPAP (Production Part Approval Process) and PSW (Part Submission Warrant) are prepared and signed by the project manager.

6 Step V: Functional follow-up of the software during production

After the software was accepted and installed in the production line, periodic software audits (every 2 weeks) were performed to ensure that the software runs in conformity with the requirements and that the production criteria are fulfilled. In these audits the main criteria (see Section 3.1) are checked again. Statistics are kept for the cycle time and the errors that occurred, in order to distinguish software errors from process errors. The main aim of this step is to find improvements that make the software more robust and efficient.

7 Step VI: Customer claims

A customer-claim escalation plan is also created, as in Figure 4, in order to obtain the best reaction time in case a software error appears and disturbs the normal running of the production. This leads to very short solving and reaction times for any line downtime caused by software problems.

8 Conclusions

Using the QSPS system, the downtime of the line (containing the 56 production equipments) caused by production software is decreased to almost 0%. The target of 0% is only reachable when the technical requirements from the customer are accurate and 100% complete.
Even with incomplete requirements, the target of 0% can still be approached using the QSPS system. The implementation should then run for a much longer period in the pilot environment, to assure a safe launch into production. In most cases this is not possible, because under time pressure production must start even if some problems remain unsolved. QSPS is meant to eliminate the inconveniences provoked by time pressure and other factors, thanks to a simpler structure that is easier and faster to use than other systems met so far in other studies. The advantage of using QSPS is that, across the three environments (test, pilot and production) and using the rules and ISO norms defined for each of them, almost all of the troubles that occur are filtered out before the real production runs. This means that after SOP (start of production), interruptions of the production line caused by the software are almost nonexistent.

Figure 4: Escalation plan
International Journal of Computers, Communications & Control, Vol. II (2007), No. 2, pp. 111-120

Variable Selection and Grouping in a Paper Machine Application

Timo Ahola, Esko Juuso, Kauko Leiviskä

Abstract: This paper describes the possibilities of variable selection in large-scale industrial systems. It introduces knowledge-based, data-based and model-based methods for this purpose. As an example, a case-based reasoning application for the evaluation of the web break sensitivity in a paper machine is introduced. The application uses the linguistic equations approach and basic fuzzy logic. The indicator combines the information of on-line measurements with expert knowledge and provides a continuous indication of the break sensitivity. The web break sensitivity characterises the current operating situation at the paper mill and gives new information to the operators. Together with information on the most important variables, this prediction gives operators enough time to react to a changing operating situation.

Keywords: variable selection, grouping, paper machine, web breaks

1 Introduction

Data-driven modelling always requires variable selection or grouping. In small systems, expert knowledge gives a clear basis for variable selection, since the possible interactions and causal effects are known fairly well. In these cases, a few modelling alternatives can be compared interactively. Variable selection becomes important when the number of variables increases, especially when normal process data is used. As a model should include a reasonable number of variables, a modular approach based on variable grouping provides better process insight, which makes model assessment easier.
In practical cases, variable selection is necessary either because it is computationally infeasible to use all available variables, or because of estimation problems when limited data samples with a large number of variables are present. Variable grouping means finding feasible groups and combinations of variables for modelling. It is closely connected to data clustering, since the interactions can depend on the operating area. In large-scale systems, the number of possible variable combinations easily becomes very large; e.g. the case models of the web break indicator originally included 24 variables, which means 2,024 alternative three-variable combinations. The newest version has 72 variables, leading to 59,640 three-variable groups, 1,028,790 four-variable groups and 13,991,544 five-variable groups. Most of these alternatives are useless, and therefore methods for selecting reasonable variables for modelling are crucial. There is a lot of recent literature on variable selection, and both model- and data-based techniques are in use. Spectroscopic data, multi-sensor systems, multivariate analysis and modelling of large-scale systems all seem to require efficient methods for variable selection. Four different methods for variable selection – genetic algorithm, iterative PLS, uninformative variable elimination by PLS and interactive variable selection for PLS in partial least squares (PLS) regression – are studied and compared to a calibration made with manually selected wavelengths in [1]. The application is NIR analysis of pharmaceutical tablets. It has been found that multiresolution analysis (Haar wavelets) pre-processing before variable selection leads to simpler models with lower errors than single-wavelength selection in NIR data [2]. Wavelength selection for process monitoring has also been done using genetic algorithms (GA) coupled with a curve resolution method (OPA) [3].
Variable selection is also an important topic in using multiway methods for modelling NIR spectra from a pharmaceutical batch process [4]. NIR analysis of sugar cane juice has utilized partial least squares (PLS) pruning for variable selection [5]. UV-VIS and NIR spectrometry of oils takes advantage of the successive projections algorithm (SPA) in large-scale variable selection [6].

Copyright © 2006-2007 by CCC Publications

Quantitative structure–activity relationship (QSAR) studies also require sophisticated methods for variable selection. There is a report on applying multi-objective genetic programming (GP) to the HEPT data and constructing a nonlinear QSAR model using a counter-propagation (CP) neural network with the selected variables [7]. A particle swarm optimizer (PSO) has been applied to the same purpose, and a comparison with GP is given in [8]. Norinder [9] also reports on the use of support vector machines (SVM) in QSAR. Statistical parametric mapping (SPM), relying on the general linear model and classical hypothesis testing, is a benchmark tool for assessing human brain activity using data from fMRI experiments [10]. Prediction-based variable selection has been reported to give an 82% success rate in quantitative structure–property relationship (QSPR) models based on in vivo blood–brain permeation data [11]. In multi-sensor systems, the variable selection problem originates from two sources: the high dimensionality of the data is due to a high number of sensors, to many extracted features, or to both. A fuzzy ARTMAP classifier analyses the results from a 12-element gas sensor array [12, 13]. The fast wavelet transform is useful in feature selection before calibration in stripping voltammetry [14]. Principal component analysis (PCA) is a well-known method for variable selection. Testing of loadings and their estimated standard uncertainties is used to calculate the significance of each variable for each component [15].
Variable selection can also mean identifying a k-subset of the original variables that is optimal for a given criterion and adequately approximates the whole data set [16]. The application of principal component regression to the trajectories of the process variables (block-wise PCR) has given straightforward results without requiring deep knowledge of the process [17]. In this case, variable selection methods and technical information about the process have allowed the process variables most correlated with the final quality to be revealed. Genetic algorithms (GAs) have recently been proposed for many applications, including variable selection for multivariate calibration, molecular modelling, regression analysis, model identification, curve fitting and classification. GAs have also been combined with Fisher discriminant analysis (FDA) for key variable identification in trouble-shooting problems of the Tennessee Eastman process [18]. GA and simulated annealing have also been combined to reduce the number of variables in neural network models [19]. Two other approaches for the selection of variables in neural networks are given in [20] and [21]. This paper is organised as follows: Section 2 concerns knowledge-based variable selection and grouping, Section 3 variable grouping with data analysis, and Section 4 model-based variable selection. The case-based reasoning system for evaluating paper machine web breaks is briefly revisited in Section 5.

2 Knowledge-based variable selection

Knowledge can be used to decrease the number of variables. For example, if we have a case with 10 process variables and group them into all possible groups of three, four and five variables, we end up with 582 groups. If, based on process knowledge, we can include variable 1 in all groups with three variables, variable 10 in all groups with four variables, and variables 5 and 6 in all groups with five variables, we have 176 groups to analyse.
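The group counts quoted above can be checked directly with binomial coefficients; the sketch below reproduces the unconstrained 582 groups, the 176 groups that remain after fixing variables as described, and the counts quoted earlier for the web break indicator.

```python
from math import comb

n = 10
# All groups of three, four and five variables out of ten candidates.
unconstrained = comb(n, 3) + comb(n, 4) + comb(n, 5)   # 120 + 210 + 252 = 582

# Fix variable 1 in every 3-variable group, variable 10 in every 4-variable
# group, and variables 5 and 6 in every 5-variable group; only the remaining
# slots of each group are free to vary.
constrained = comb(9, 2) + comb(9, 3) + comb(8, 3)     # 36 + 84 + 56 = 176

# The web break indicator counts mentioned in the introduction.
groups_24 = comb(24, 3)                         # three-variable groups from 24 variables
counts_72 = [comb(72, k) for k in (3, 4, 5)]    # groups from 72 variables

print(unconstrained, constrained, groups_24, counts_72)
```

The constrained count is about 30% of the unconstrained one, matching the roughly 70-percent reduction claimed in the text.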
This means that using process knowledge has decreased the number of alternatives by 70 percent. Some variable combinations should be avoided; e.g. calculated variables should not be used together with the variables from which they are calculated. A group containing a controlled variable and its setpoint is usually not appropriate either. These problems are avoided by defining the inappropriate groups as non-groups, i.e. variable groups that should not be part of any acceptable variable group.

3 Variable grouping with data analysis

Correlation is a statistical technique which can show whether, and how strongly, pairs of variables are related. Binary correlations and their combinations are used to prune the set of acceptable groups defined by domain expertise. For forecasting models, input variables should have a high correlation with the output variables, but a low one with each other. For case detection, causality is not always as clear: there is not necessarily any definite output variable, i.e. groups where several variables have a high correlation with each other are also acceptable. This sets new requirements for model assessment. In practical cases, the results of correlation analysis are improved by appropriate filtering and by using correct time delays between the variables. Calculation of moving averages, medians and value ranges already includes a time delay, which depends on the calculation window and the applied methodology. Nonlinear scaling is the essential feature in using the linguistic equations method [22]. It improves the correlation analysis of curvilinear relationships, since correlation analysis is a linear method. Finding patterns in high-dimensional data is difficult. However, in data sets with many variables, groups of variables often move together, as they measure the same phenomena. A host of clustering approaches helps in digging out these interactions.
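A minimal sketch of the correlation-based pruning for forecasting models described above: a candidate input group is kept only when every input correlates strongly with the output but weakly with the other inputs. The data set, the 0.6 thresholds and all helper names are illustrative assumptions, not the paper's implementation.

```python
from itertools import combinations

def pearson(x, y):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def acceptable_groups(data, output, size=2, hi=0.6, lo=0.6):
    """Keep groups whose inputs all correlate >= hi with the output
    and < lo with each other (in absolute value)."""
    names = [k for k in data if k != output]
    kept = []
    for group in combinations(names, size):
        strong = all(abs(pearson(data[v], data[output])) >= hi for v in group)
        redundant = any(abs(pearson(data[a], data[b])) >= lo
                        for a, b in combinations(group, 2))
        if strong and not redundant:
            kept.append(group)
    return kept

data = {
    "x1": [1, 2, 3, 4, 5, 6],    # tracks the output
    "x2": [6, 5, 4, 3, 2, 1],    # exact mirror of x1: redundant with it
    "x3": [1, 3, 3, 1, 1, 3],    # also informative, nearly independent of x1
    "y":  [3, 8, 9, 6, 7, 12],
}
print(acceptable_groups(data, "y"))
```

Here the pair (x1, x2) is rejected because the two inputs duplicate each other, while pairs combining x3 with either of them survive the pruning.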
As shown in the introduction, principal component analysis (PCA) is a conventional method to decrease the dimensionality of data without losing the information stored in the correlated variables. It searches for a few new linear combinations of the original variables that explain most of the variance of the original data. These linear combinations can be viewed as a linear transformation to the hyperplane defined by the principal components, or as a rotation and a stretch that transform the original data into a new basis. Principal components are calculated by finding the eigenvectors of the covariance matrix or by singular value decomposition. Usually only the first few principal components (2 or 3) are used; they are enough to explain most of the variance in the data set. There are also extensions of the basic methods that apply to analysing time trajectories.

4 Model-based variable selection

Isokangas and Ruusunen [23] describe an automated procedure for finding interactions between variables in large datasets. It works systematically by constructing simple dynamic model candidates with complete input combinations for data segments of varying and sliding window size. The final analysis proceeds according to the structural properties of the best candidate models. Model candidate construction, validation and testing proceed in the following way: half of all available data is used for training and validation, so that model candidates are constructed systematically from the beginning of the data with a selected data window size. After a data window has been used for training, a window of the same size is taken for validation. The procedure uses partly overlapping data windows. For example, if the data window is 400 minutes, the first models are constructed using training data from minutes 1–400, with data from minutes 401–800 used for validation of the model candidate under evaluation.
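The sliding-window scheme just described can be sketched as pairs of index ranges; the function name and parameters are illustrative, not from [23].

```python
# Train on one window, validate on the next window of the same size,
# then slide both forward by a fixed step (half a window in the example).
def windows(n_minutes, window, step):
    pairs = []
    start = 1
    while start + 2 * window - 1 <= n_minutes:
        train = (start, start + window - 1)
        val = (start + window, start + 2 * window - 1)
        pairs.append((train, val))
        start += step
    return pairs

# With 1600 minutes of data, a 400-minute window and a 200-minute step,
# the first pair trains on minutes 1-400 and validates on 401-800, the
# next trains on 201-600 and validates on 601-1000, and so on.
print(windows(1600, 400, 200))
```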
Next, all model candidates are constructed using training data from minutes 201–600 and validation data from minutes 601–1000. To define the right size of the training data, different window sizes are tested systematically at this stage. Models are evaluated with the correlation coefficient and an RMS-error measure on the validation data. The best models are further tested with independent testing data, which is the other half of the available data.

5 Paper mill example

Paper web breaks commonly account for 2-7 percent of the total production loss, depending on the paper machine type and its operation. This can mean 1.5 million euros lost annually at a single paper machine. According to statistics, only 10-15 percent of web breaks have a distinct reason. Most of the indistinct breaks are due to dynamic changes in the chemical process conditions. The main area of interest in the indicator development is the paper-making process before the actual paper machine. This includes the short circulation and the wet end of the paper machine. In this area, the paper-making process is typically non-linear with many long delays that change in time and with process conditions; there are process recycles at several levels; there are closed control loops; there exist factors that cannot be measured; and there are interactions between physical and chemical factors. Several different paper grades are also produced, with different production conditions and operating parameters. This section shows how to combine on-line measurements and expert knowledge in paper machine modelling when developing the sensitivity indicator for paper web breaks [24]. The indicator gives the process operators a continuous indication of the web break sensitivity in an easily understandable way. Being able to indicate the break risk gives a possibility to react to changes in the break sensitivity in time and therefore avoid breaks.
5.1 Experimental data

Actual measurements from a paper machine were used. The main interest was in the paper machine variables and the variables just before the paper machine. The final selection of variables used expert knowledge; altogether 73 variables (72 variables + information on break occurrence) were studied. These variables were assumed to influence paper web breaks. The measurements were collected from the mill automation system during normal operation; no special test runs were made. The measurements were used as such to retain their information content and, on the other hand, to keep the application as simple as possible. Only simple filtering was added to the indicator software, to slow down rapid changes and to cut outliers from the data. The measurement data was divided into periods of 24 hours. The data sets were further classified into five categories, depending on how many breaks there were in one day: no breaks (0), a few breaks (1−2), normal (3−4), many breaks (5−6) and a lot of breaks (> 6).

5.2 Reasons for web breaks

Different statistical methods were used, but reliable correlations between single variables and web breaks did not exist. Therefore, the only way to proceed was classification and modelling of break situations, to find differences between the operating situations leading to breaks. Case-based reasoning was used for the identification of different operating situations instead of trying to predict a single break occurrence. Identified operating situations contain information about how many breaks there will be in the near future, and this information is given to the process operators as the web break sensitivity. The identification is performed using the linguistic equation approach and fuzzy logic [24].

5.3 Correlation analysis

Before modelling, correlation analysis was used to find binary interactions between the different process variables.
The basic tool used for these analyses was a Microsoft Excel spreadsheet. A correlation exceeding 0.6 was considered worth mentioning. According to this analysis, correlation varies quite a lot with time. The variation in correlation rates is due to the use of normal on-line measurements, which include the effects of different control operations. The most important result of this analysis was that interactions vary in different operating situations, and the number of breaks also varies with time; this was the basis for the different case models. Due to the different interactions, different variables also became important in different operating situations.

5.4 Model-based variable grouping

The web break sensitivity indicator was developed as a case-based reasoning type of application with the linguistic equations approach and fuzzy logic [24]. The case base contains case models with different numbers of breaks. A new case is presented to the system as a collection of on-line measurements. The indicator compares the new case to the examples in the case base and uses the information of the best fitting case to calculate the predicted break sensitivity. As output, the system gives a numerical value for the predicted number of breaks [24, 25]. Figure 1 shows the principal structure of the case base, and Figures 2 and 3 show the different stages of case-based reasoning.

Figure 1: Structure of the case base [25].

Figure 2: The structure of the retrieve and reuse stages [25].

The case base of this application contains modelled example cases classified according to the number of related breaks. The models consist of equations that are stored as simple numerical matrices, indexed with the break class information and the number of examples in the class. The equations themselves describe the interactions between 3-5 variables. The variables in the equations are found using a partly knowledge-based, partly model-based grouping technique.
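A toy sketch of the retrieve step described above: a new measurement vector is matched against the stored example cases and the break information of the best fitting case is returned. The real indicator fits linguistic equations and fuzzy logic per case; plain Euclidean distance and the hypothetical case dictionaries below are simplifying assumptions for illustration only.

```python
# Hypothetical case base: each case stores a (scaled) measurement vector
# and the number of breaks associated with that operating situation.
case_base = [
    {"measurements": [0.2, 0.8, 0.5], "breaks": 0},   # calm operating situation
    {"measurements": [0.7, 0.3, 0.9], "breaks": 4},   # normal situation
    {"measurements": [0.9, 0.1, 1.0], "breaks": 7},   # break-prone situation
]

def retrieve(cases, new_case):
    """Return the break count of the case closest to the new measurements."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5
    best = min(cases, key=lambda c: dist(c["measurements"], new_case))
    return best["breaks"]

# A new on-line measurement vector resembling the break-prone case.
print(retrieve(case_base, [0.85, 0.15, 0.97]))
```

The returned value would then be presented to the operators as the predicted break sensitivity, as described in the text.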
For complex systems, a set of alternative variable groups is generated and models are created with these groups. Process knowledge can be used in defining the groups. Another approach is to generate and model all possible groups containing three, four or five variables. Groups can also contain different numbers of variables. Correlation analysis is also useful in grouping. It should be noted that for prediction the input variables should have a high correlation with the output variable, but a low one with each other. For state detection, causality is not clear, and groups where all variables correlate with each other are acceptable. Here, however, the limitations given in Section 2 must be taken into account. Groups with three, four or five variables can be generated automatically with the FuzzEqu toolbox [22]. The generation of the alternatives is based on the groups with three variables: all groups with four variables have one variable in common, and all groups with five variables have two variables in common. The subsets of the variables and the common variables in the groups with four or five variables can be based on process knowledge.

Figure 3: The structure of the revise and retain stages [25].

5.5 Some feedback – importance of parameters

After modelling, the importance of the variables was analysed based on the occurrence of the variables in the case models. This was also considered a useful tool to reveal less important variables not included in the models. For the operating personnel, the list of importance might give some new information about which variables are responsible for different operating problems. There is also some difference in the collection of important variables between cases with different numbers of breaks. The user interface continuously presents the six variables that best describe the current operating situation. These are simply the variables of the two best fitting equations of the best fitting case.
In addition to these, the most important variable of the best fitting case model is presented with an 8-hour trend. The variables are presented with their current measurement values, marked with colours as normal, low or high, and very low or very high. This gives the process operators useful information about the current process conditions, together with the reason for the current break sensitivity level. The follow-up of the variable importance can lead to the need to update the whole system. The system update is a straightforward, although time-consuming, task when the whole case base is changed. A single model with 73 variables takes only a few minutes to build. The same time is required for validation and tuning, which makes 15 to 20 minutes altogether. Updating a system with 40 cases could take two working days. However, automation of these tasks makes this time shorter.

6 Summary and conclusions

This paper has considered the possibilities of variable selection in large-scale industrial systems. It introduced knowledge-based, data-based and model-based methods for this purpose. As an example, a case-based reasoning application for the evaluation of the web break sensitivity in a paper machine was introduced. The application was built with the linguistic equations approach and basic fuzzy logic. The case base of the system contains models of example cases with different numbers of breaks. A new case is presented to the system as a collection of on-line measurements. The indicator compares the new case to the examples in the case base and uses the information of the best fitting case to evaluate the break sensitivity. The latest version of the indicator operates with a case base of 40 example models. Although this case base is rather small, the results have been considerably good compared to the real break sensitivity.
The indicator combines the information of on-line measurements with expert knowledge and provides a continuous indication of the break sensitivity. The web break sensitivity characterises the current operating situation at the paper mill and gives new information to the operators. The web break sensitivity is presented as a continuous signal, with information on the actual web breaks, as an 8-hour trend. The trend shows how the situation has developed, and the current value gives the prediction for the next 24 hours if the situation stays as it is now. Together with information on the most important variables, this prediction gives operators enough time to react to a changing operating situation. The variable selection and grouping utilize knowledge-based and model-based approaches. Automatic group and model generation also makes interactive variable selection possible.

References

[1] C. Abrahamsson, J. Johansson, A. Sparén, and F. Lindgren, Comparison of different variable selection methods conducted on NIR transmission measurements on intact tablets, Chemometrics and Intelligent Laboratory Systems, Vol. 69, pp. 3–12, 2003.
[2] C.E.W. Gributs and D.H. Burns, Parsimonious calibration models for near-infrared spectroscopy using wavelets and scaling functions, Chemometrics and Intelligent Laboratory Systems, Vol. 83, pp. 44–53, 2006.
[3] S. Gourvénec, X. Capron, and D.L. Massart, Genetic algorithms (GA) applied to the orthogonal projection approach (OPA) for variable selection, Analytica Chimica Acta, Vol. 519, pp. 11–21, 2004.
[4] L. Stordrange, T. Rajalahti, and F.O. Libnau, Multiway methods to explore and model NIR data from a batch process, Chemometrics and Intelligent Laboratory Systems, Vol. 70, pp. 137–145, 2004.
[5] S.L.T. Lima, C. Mello, and R.J. Poppi, PLS pruning: a new approach to variable selection for multivariate calibration based on Hessian matrix of errors, Chemometrics and Intelligent Laboratory Systems, Vol. 76, pp. 73–78, 2005.
[6] M.J.C. Pontes, R.
Kawakami, H. Galvão, M.C. Ugulino Araújo, P.N. Teles Moreira, O.D. Pessoa Neto, G.E. José, and T.C. Bezerra Saldanha, The successive projections algorithm for spectral variable selection in classification problems, Chemometrics and Intelligent Laboratory Systems, Vol. 78, pp. 11–18, 2005.
[7] M. Arakawa, K. Hasegawa, and K. Funatsu, QSAR study of anti-HIV HEPT analogues based on multi-objective genetic programming and counter-propagation neural network, Chemometrics and Intelligent Laboratory Systems, Vol. 83, pp. 91–98, 2006.
[8] Q. Shen, J.-H. Jiang, C.-X. Jiao, G. Shen, and R.-Q. Yu, Modified particle swarm optimization algorithm for variable selection in MLR and PLS modeling: QSAR studies of antagonism of angiotensin II antagonists, European Journal of Pharmaceutical Sciences, Vol. 22, pp. 145–152, 2004.
[9] U. Norinder, Support vector machine models in drug design: applications to drug transport processes and QSAR using simplex optimisations and variable selection, Neurocomputing, Vol. 55, pp. 337–346, 2003.
[10] M. Smith, B. Pütz, D. Auer, and L. Fahrmeir, Assessing brain activity through spatial Bayesian variable selection, NeuroImage, Vol. 20, pp. 802–815, 2003.
[11] R. Narayanan and S.B. Gunturi, In silico ADME modelling: prediction models for blood–brain barrier permeation using a systematic variable selection method, Bioorganic & Medicinal Chemistry, Vol. 13, pp. 3017–3028, 2005.
[12] E. Llobet, J. Brezmes, O. Gualdrón, X. Vilanova, and X. Correig, Building parsimonious fuzzy ARTMAP models by variable selection with a cascaded genetic algorithm: application to multisensor systems for gas analysis, Sensors and Actuators B: Chemical, Vol. 99, pp. 267–272, 2004.
[13] O. Gualdrón, E. Llobet, J. Brezmes, X. Vilanova, and X. Correig, Coupling fast variable selection methods to neural network-based classifiers: application to multisensor systems, Sensors and Actuators B: Chemical, Vol. 114, pp. 522–529, 2006.
[14] M.
Cocchi, J.L. Hidalgo-Hidalgo de Cisneros, I. Naranjo-Rodríguez, J.M. Palacios-Santander, R. Seeber, and A. Ulrici, Multicomponent analysis of electrochemical signals in the wavelet domain, Talanta, Vol. 59, pp. 735–749, 2003.
[15] F. Westad, M. Hersleth, P. Lea, and H. Martens, Variable selection in PCA in sensory descriptive and consumer data, Food Quality and Preference, Vol. 14, pp. 463–472, 2003.
[16] J. Cadima, J. Orestes Cerdeira, and M. Minhoto, Computational aspects of algorithms for variable selection in the context of principal components, Computational Statistics & Data Analysis, Vol. 47, No. 2, pp. 225–236, 2004.
[17] M. Zarzo and A. Ferrer, Batch process diagnosis: PLS with variable selection versus block-wise PCR, Chemometrics and Intelligent Laboratory Systems, Vol. 73, pp. 15–27, 2004.
[18] L.H. Chiang and R.J. Pell, Genetic algorithms combined with discriminant analysis for key variable identification, Journal of Process Control, Vol. 14, pp. 143–155, 2004.
[19] A. Alexandridis, P. Patrinos, H. Sarimveis, and G. Tsekouras, A two-stage evolutionary algorithm for variable selection in the development of RBF neural network models, Chemometrics and Intelligent Laboratory Systems, Vol. 75, pp. 149–162, 2005.
[20] F. Dieterle, S. Busche, and G. Gauglitz, Growing neural networks for a multivariate calibration and variable selection of time-resolved measurements, Analytica Chimica Acta, Vol. 490, pp. 71–83, 2003.
[21] I. Drezga and S. Rahman, Input variable selection for ANN-based short-term load forecasting, IEEE Transactions on Power Systems, Vol. 13, pp. 1238–1244, 1998.
[22] E.K. Juuso, Integration of intelligent systems in development of smart adaptive systems, International Journal of Approximate Reasoning, Vol. 35, pp. 307–337, 2004.
[23] A. Isokangas and M.
ruusunen, systematic approach for data survey, in proceedings of the international conference on informatics in control, automation and robotics. september 14 17, 2005, barcelona, spain, pp. 60–65, 2005. [24] t. ahola, intelligent estimation of web break sensitivity in paper machines. doctoral dissertation. university of oulu, department of process and environmental engineering. acta universitatis ouluensis, technica c 232, 92 p., oulu, 2005. [25] t. ahola and k. leiviskä, case-based reasoning in web break sensitivity evaluation in a paper machine, journal of advanced computational intelligence and intelligence informatics, vol. 9, pp. 555–561, 2005. timo ahola, esko juuso, kauko leiviskä university of oulu, control engineering laboratory p.o. box 4300, fi-90014 university of oulu, finland e-mail: esko.juuso@oulu.fi received: march 21, 2007 timo ahola, born in hämeenkyrö, finland on december 18, 1965, recieved his m.sc. (eng.) in process engineering in 1992 and lic. tech. in control engineering in 2001 from the university of oulu. he recieved d. tech. in control engineering from the university of oulu in 2006 with the thesis on intelligent estimation of web break sensitivity in paper machines. he worked as a research scientist in the control engineering laboratory at the university of oulu 1993-2006. since 2007 he has been belonging to the outokumpu stainless oy, tornio research centre, finland. presently he is a research engineer in process development and he is working with predictive maintenance issues. esko juuso, born in ylitornio, finland on december 12, 1951, received m.sc. (eng.) in technical physics in 1979 from university of oulu. he has worked as research engineer in outokumpu metallurgical research centre and computer analyst in outokumpu electronics. since 1986, he has been belonging to university of oulu, oulu, finland. presently he is senior assistant in control engineering. 
he has been active in the finnish simulation forum (finsim), the scandinavian simulation society (sims) and eurosim; currently he is chairman of finsim. he has been a member of the steering committee and co-chairman of the technical committee on production industry in the eunite network of excellence, 2000-2004. his main research fields are intelligent systems and simulation in industrial applications, including control and fault diagnosis. in 1991 he introduced the linguistic equation (le) methodology, and he has authored more than 200 publications in his research field. kauko leiviskä, born in pyhäntä, finland, 1950, received his m.sc. (eng.) in process engineering in 1975 and lic. tech. in control engineering in 1976 from the university of oulu. he received his d. tech. in control engineering from the university of oulu in 1982 with a thesis on short-term production scheduling of the pulp mill. since 1975, he has been with the university of oulu, oulu, finland. he has been professor of control engineering and head of the control engineering laboratory in the same university since 1988. he has been active in ifac since 1988; currently he is a member of the ifac tc on large scale complex systems, the tc on cognition and control, and the tc on mining, mineral and metal processing. he was a member of the steering committee and chairman of the technical committee on primary and process industries in the erudit network of excellence, 1997-2000, and the scientific director of eunite, the european network of excellence, 2000-2004. he is participating in the eu/ca project nisis (nature-inspired smart information systems). a list of more than 200 publications of which he is (co)author is available. recently his work has concentrated on modelling and control of industrial processes, intelligent control methods, production scheduling and millwide control. he has also consulted for industry on control engineering and millwide control applications. int. j.
of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. v (2010), no. 5, pp. 625-633 generic multimodal ontologies for human-agent interaction a. braşoveanu, a. manolescu, m.n. spînu adrian braşoveanu lucian blaga university of sibiu, romania e-mail: adrian.brasoveanu@gmail.com adriana manolescu agora university, oradea and r&d agora ltd. cercetare dezvoltare agora oradea, romania e-mail: adrianamanolescu@gmail.com marian nicu spînu aurel vlaicu university of arad, faculty of exact sciences, department of mathematics-informatics, romania, 310330 arad, 2 elena drăgoi abstract: watching the evolution of the semantic web (sw) from its inception to the present day, we can easily observe that the main task developers face while building it is to encode human knowledge into ontologies and human reasoning into dedicated reasoning engines. the sw now needs efficient mechanisms through which both humans and artificial agents can access information. the most important tools in this context are ontologies. recent years have been dedicated to solving the infrastructure problems related to ontologies: ontology management, ontology matching, ontology adoption; but as these problems become better understood, research interest in this area will surely shift towards the way in which agents will use ontologies to communicate among themselves and with humans. although interface agents could be bilingual, it would be more efficient, safe and swift for them to use the same language to communicate with humans and with their peers. since anthropocentric systems nowadays entail multimodal interfaces, it seems suitable to build multimodal ontologies. generic ontologies are needed when dealing with uncertainty. multimodal ontologies should be designed taking into account our way of thinking (mind maps, visual thinking, feedback, logic, emotions, etc.)
and also the processes in which they would be involved (multimodal fusion and integration, error reduction, natural language processing, multimodal fission, etc.). by doing this it would be easier (and also more fun) for us to use ontologies, while at the same time the communication with agents (and agent-to-agent talk) would be enhanced. this is just one of our conclusions about why building generic multimodal ontologies is very important for future semantic web applications. keywords: multimodal ontology, ontology matching, interface agents, semantic web, human-agent interaction 1 introduction the knowledge society (ks) is a society where information is the primary resource, one which can be consumed by both humans and machines. if we want to build such a society properly, we need different kinds of infrastructure: hardware, software, organizational, etc. sw and agents represent only a small part of the large infrastructure needed in order to build the true ks. sw ([1], [2], [3], and [4]) is one of those disruptive technologies which tend to be talked about years before their coming of age. one of the visions presented in [1] was that of agents replacing humans for simple everyday tasks like buying tickets for a concert or making appointments with the doctor. the main reason why this vision hasn't yet come to life is one that is now well understood and explained in the article's revision [2]: encoding human knowledge into ontologies and human reasoning into dedicated reasoning engines is not an easy task. this process requires trans-disciplinary knowledge, dedicated tools and repositories, and advanced techniques from mathematics, logic and software. it is in fact an extremely difficult procedure which relies entirely on cooperation between hundreds or thousands of organizations and different standards.
since standardization processes take a long time even these days, and the adoption time for new technologies is often at least 2-3 years, we should not be surprised that it will take a while until the sw reaches critical mass. ontologies represent the key to successful communication between humans and agents, if they are done right. we are only beginning to understand the implications of using ontologies for the great tasks we have assigned to them, but some problems like ontology management (versioning, change, tools and standards), ontology matching (finding correspondences between different ontologies) and the large-scale adoption of ontologies by developers and users have proved quite challenging. ontology dynamics is definitely a field we should keep an eye on. according to [30] there is still no clear winner in the process of ontology matching (in other words: no standard or methodology with clear rules to match almost everything automatically or semi-automatically, since sometimes humans will need to check the results). therefore we should not be surprised, when reading a journal or conference proceedings, that most articles address these tasks rather than the desired use of ontologies, which is to give agents a way of understanding our world and reasoning about it. this is the way things should be: in order to build a functional system we always need to have its parts figured out. we should however not lose sight of the system we need to build, and this is one of the purposes of this paper: to look at the current state of the art in several fields of study and see whether we are heading in the right direction. in this context we will especially examine some problems related to multimodal communication between human and agent and try to see how they are solved by using ontologies. 2 rationale and approach: why complicate things and use generic multimodal ontologies?
first we need to settle one question: what is an ontology? some answers to this (and also some examples of how to use ontologies) can be found in [12], [15], [16], [17], [22], [23] and [31]. the classic definition proposed by gruber tells us that an ontology is an "explicit specification of a conceptualization" [12]. this definition is examined and extended by many papers, most recently by guarino, oberle and staab in [16], which also focuses on the importance of "shared explicit specifications", because without committing to ontologies every agent would understand something else (they also take the opportunity to revise the semiotic triangle). ontologies are us, mika's thesis [23], is a simple yet powerful statement. it tells us that since we are the ones who design ontologies, they will only express what we want them to express and will sometimes be useless without the context in which they were created. the main problem when designing ontologies is to carefully choose the concepts within a domain and the relationships between them in such a way that the ontology is well founded, because "any ontology will always be less complete and less formal than it would be desirable in theory" [16]. in the light of this statement it should become quite clear why we sometimes need to use generic ontologies: there is simply no other way to address the problem of uncertainty when developing ontologies than genericity. figure 1: one of the most popular programs for ontology matching: coma++, developed at the university of leipzig. in this screenshot we can see how we can establish correspondences between two ontologies representing a computer science department. nowadays there are probably thousands of ontologies in use, but if the sw ever looks like berners-lee's visions, then ontologies will be commonplace for every designer, developer or user.
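the kind of correspondence shown in the coma++ screenshot can be illustrated with a deliberately naive sketch: matching two toy "department" ontologies purely by label similarity. the concept lists, threshold and scoring below are invented for illustration; real matchers such as coma++ combine many more strategies (structure, instances, reuse of earlier matches):

```python
# Illustrative label-based ontology matching (hypothetical mini-ontologies).
# This is NOT the COMA++ algorithm, only the simplest lexical matcher it generalizes.
from difflib import SequenceMatcher

# Two toy "computer science department" ontologies, as flat concept-label lists.
ONTOLOGY_A = ["Professor", "Lecture", "Student", "Department"]
ONTOLOGY_B = ["Prof", "Course", "Student", "Dept"]

def label_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between two labels, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def match_ontologies(src, dst, threshold=0.5):
    """Return {source_concept: (best_target, score)} for matches above threshold."""
    correspondences = {}
    for s in src:
        best = max(dst, key=lambda d: label_similarity(s, d))
        score = label_similarity(s, best)
        if score >= threshold:
            correspondences[s] = (best, round(score, 2))
    return correspondences

matches = match_ontologies(ONTOLOGY_A, ONTOLOGY_B)
```

even this toy version shows why human checking remains necessary: lexically similar labels can denote different concepts, and lexically distant labels (like "lecture" and "course") can denote the same one.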
usually an ontology only addresses the problems of a narrow field of knowledge (a domain ontology), so it is not uncommon for applications to use many ontologies for different purposes. in some of these cases it is useful to also use upper-level ontologies, which are general ontologies representing concepts that are the same across all domains. a unique upper-level ontology encompassing all human knowledge is not feasible and will never be built, for practical reasons (each society has its concepts, every field of knowledge has a certain language to protect itself, etc.), but upper-level ontologies are used for mediation, mainly in the idea that universal agreement between different ontologies is or will be possible. in other cases, in order to use different ontologies, applications will use ontology matching schemes like those discussed in [10]. since ontologies are the building blocks of the sw, any application in this area must use them, even if that means adding layers of complexity because of the matching process, apis and uncertainty. for everybody working in the it industry these days it should be clear that the medium in which we work is becoming more and more like an ohdue (open heterogeneous dynamic uncertain environment) [8], and ontologies are part of this medium. these issues are addressed in articles and books like [10], [19], [30] (ontology matching), [26] (automatic generation of ontology apis) and [8] (ohdue, agents). because the field of ontology engineering is becoming more popular, we should not be surprised to also hear a lot about ontology-driven software engineering. ontology-driven information systems (odis) [36] is just one recent example which falls into this category. given all these complications that appear when designing and working with ontologies, it is interesting to ask a new question: why would we want to complicate our lives even more by using multimodal ontologies?
is it not enough that the ontology management and ontology matching problems still pose so many challenges? is this new breed of ontologies even feasible? certainly from a user's perspective, multiple modalities for entering input into a system (touch, voice, mouse, pen, etc.) can only mean increased usability (need we remember how touch screens became the norm in the mobile phone industry after the iphone was launched?), while from a developer's perspective this means that software gets even more complicated than it is now. figure 2: the multimodal communication dream: to use all five senses (smell, sight, touch, taste, sound) during the process of communication. this is the right moment for such a development, since for the multiple streams of data that come with multimodal communication we need distributed systems. since multi-core processors are now luckily the norm in desktop computing, we should have no problem (at least not in hardware) dealing with the huge flux of data. in the past 40 years scientists have developed different mechanisms for getting audio, video and touch input, but the integration of all five senses in the communication between man and machine remains a dream. it is enough, however, to use one sense in different ways (for example, for seeing we have images, text and video) to be able to speak about multimodal communication. in this respect different research groups (most notably [29]) have also started to develop multimodal ontologies, but most of them took the approach of developing different ontologies for text, images, video or voice and then using ontology alignment to match them (multimodal integration through ontology matching [29]). a single multimodal ontology gets us all the benefits of having such different ontologies. like all things in life, multimodal ontologies do not come without bad parts (they are even harder to design, maintain and match), but they are definitely closer to our way of thinking.
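the difference between aligning separate per-modality ontologies and keeping one multimodal ontology can be sketched with a toy data structure: a single concept node that carries representations for several modalities at once. the class, field names and sample items below are invented for illustration and do not come from any real multimodal-ontology vocabulary:

```python
# Toy "multimodal concept": one ontology node holding per-modality representations,
# instead of three separate text/image/audio ontologies that must later be aligned.
# All names and sample media paths here are hypothetical.
from dataclasses import dataclass, field

@dataclass
class MultimodalConcept:
    name: str
    # per-modality representations, e.g. {"text": [...], "image": [...]}
    representations: dict = field(default_factory=dict)

    def add(self, modality: str, item: str):
        """Attach one more representation of this concept in the given modality."""
        self.representations.setdefault(modality, []).append(item)

    def modalities(self):
        """Modalities for which this concept has at least one representation."""
        return sorted(self.representations)

greeting = MultimodalConcept("Greeting")
greeting.add("text", "hello")
greeting.add("text", "good morning")
greeting.add("image", "img/handshake.png")
greeting.add("audio", "snd/hello.wav")
```

the design point is that cross-modality links ("this image and this phrase denote the same concept") are stored once, at design time, instead of being rediscovered by an alignment step.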
is this a sufficient reason to try it? it might not be, but it is not the only one. the usage of multimodal ontologies will allow us to give a more natural, even realistic, feeling to communication between agents and humans, enhanced usability, and the possibility to model mechanisms that are closer to the way we understand the world (diagrams, mind maps, feedback, brainstorming, slides, visual thinking, and others). it should be clear that it is not just art for art's sake, but rather art for a better life in the future. 3 generic multimodal ontologies for human-agent interaction the process of multimodal ontology modeling is still open to exploratory research because ontologies are not everywhere. without ontologies for all possible fields, and tools to match these ontologies, it is debatable whether we will achieve an efficient semantic web, rather than the illusion of a semantic web maintained by a few successful applications in certain areas (like social networking, language translation or medicine). since multimodal communication is difficult to process, it is clear that in the first phase of any research on this subject, the communication between agents and humans will not be efficient. the question we need to ask ourselves in this situation is: if it is not efficient, why should we bother to try something like this at all? the answer is simple and typical for exploratory research: it takes time to find the best way to integrate multiple streams of data in an efficient manner, and it also takes time to develop efficient ontology matching processes for such tasks. the role of exploratory research is to discover niches. the task of creating efficient mechanisms is one best suited to incremental research. since this area of research is relatively new, there is enough room for exploratory research and for breakthroughs. generic ontologies are rarely used by developers.
most articles present different ontologies and clearly state that they do not use generic ontologies, because the problem's domain was well understood. generic ontologies are best suited for modelling, as we can see from [17] and [13]. when modelling, it is easier to say you have an ontology with a few concepts and not define all of them. the task of defining all the concepts and the relationships between them is one that remains for the ontology engineer or the developer. when dealing with models that are related to multimodal communication, it makes sense to use generic multimodal ontologies. it also makes sense to use a generic ontology whenever dealing with uncertainty, as suggested by [8], [28]. the agents of tomorrow will be built taking into account recent findings like requirements-driven self-reconfiguration [6], multi-party, multi-issue, multi-strategy negotiation [35], natural language [18], and controlled natural language [32]. if we are to follow berners-lee's vision from [1], we absolutely need to integrate such findings into our work. in fact, according to [18], ontologies are the "common ground for virtual humans". their architecture suggests using multimodal communication, but this is not clearly stated in the article, since the ontology is not multimodal. if we look at [6] and [35] we can envision agents that dynamically change their strategies according to the environment and the context of conversations. this requires designing flexible ontologies, another reason to make them generic. agents must use ontologies if they are to understand anything of this world. they also need to share them and commit to them if we want them to be able to talk among themselves. the multimodal ontology helps in several phases of multimodal communication: fusion and integration (getting the input from different channels), natural language processing, disambiguation, error reduction and fission (preparing the output).
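the fusion phase mentioned above can be illustrated with a minimal late-fusion sketch: each input channel proposes interpretation hypotheses with confidences, and the system combines them into one ranking. the channel names, weights and intents below are invented for illustration; real systems derive hypotheses from recognizers and ground the intents in an ontology of input events:

```python
# Hedged sketch of confidence-weighted late fusion across input modalities.
# All channel names, weights and intent labels are hypothetical.
from collections import defaultdict

def fuse(channel_hypotheses, channel_weights):
    """Combine per-channel (intent, confidence) hypotheses into one ranking.

    channel_hypotheses: {channel: [(intent, confidence), ...]}
    channel_weights:    {channel: trust placed in that recognizer}
    Returns (intent, fused_score) pairs sorted best first.
    """
    scores = defaultdict(float)
    for channel, hypotheses in channel_hypotheses.items():
        for intent, confidence in hypotheses:
            scores[intent] += channel_weights[channel] * confidence
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Speech is ambiguous on its own; the gesture channel disambiguates it.
ranked = fuse(
    {"speech":  [("open_map", 0.7), ("open_mail", 0.3)],
     "gesture": [("open_map", 0.9)]},
    {"speech": 0.6, "gesture": 0.4},
)
```

this also hints at where the ontology helps: the intents coming from different channels must be drawn from a shared vocabulary, otherwise their scores cannot be summed at all.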
when designing a multimodal ontology one must also take into account the problems related to designing multimodal systems, as described in [25], and the medium in which these agents will evolve, because an agent that needs to evolve in an urban computing environment [34] will have different needs than an agent that just surfs the web. the focus of research is usually on multimodal fusion, but a recent survey [9] shows that interest in multimodal fission is increasing. designing a multimodal ontology thus requires taking all these findings into account, because the agent must be able not only to understand our requirements but also to give us a response. probably one of the big challenges ahead is to annotate multimodal content in real time. this is particularly hard to do for video content, but not impossible, as [27] suggests. m3o (the multimedia metadata ontology) allows us to annotate the multimedia content of a page so as to retrieve it more easily. if such ontologies are improved, then the road to the visions from [1] will be shorter. 4 related work the current state of the art in multimodal hci is presented in [7] and [20]. one of the conclusions from [7] leaves further space for improvements: "most researchers process each channel (visual, audio) independently, and multimodal fusion is still in its infancy". the same can be said of multimodal ontologies. since [7] is more recent, we will use it as a basis for further investigation in this field. since only a few interesting articles related to multimodal ontologies appear every year, we have selected a few of them to be used as a basis for future research. when searching for definitions related to ontologies and trends in the field of ontology development and matching, some of the best research groups in the world are the ones from trento (loa and the university of trento) and koblenz-landau.
many of the articles cited in this paper come from members of the trento group: [6], [10], [16], [17], [30]. these are related to definitions of ontology, ontology matching, and modelling with ontologies. we have also used articles from the koblenz-landau group: [6], [26], [27], related to definitions, automatic generation of ontology apis and m3o. one interesting idea is that of multimodal context-aware interaction, presented by cearreta and his team in [5]. if we have to model emotions, there might be no other solution than to use multimodal ontologies combined with special reasoners. another article related to our subject is [29]. their approach of using different ontologies for text and images and then using ontology matching can definitely be improved in the long term. they clearly state that for the moment multimodal ontologies do not offer fast communication, but that speed might improve in time. also, [24], [32] and [33] study the relationships between natural language processing (nlp) and the sw. the work of these research groups must be studied. one of them [32] is from southampton, one of the workplaces of timothy berners-lee. when it comes to generic ontologies and tools for working with ontologies, one of the best research groups to follow is stanford's [11], [28]. their work on biomedicine ontologies and protégé is fundamental. 5 conclusions and future work sw tools are now an important part of the it industry, with the main clients coming from the fields of biomedicine, aeronautics, automotive, government and local administration, and media. this sudden interest might be related to the success of social media [14], [21] and means that developers are starting to tap into the potential promises of the field. even so, there is a lot of work to be done regarding multimodal ontologies. the reason is one that was mentioned several times during this paper: the task of designing such ontologies is still difficult.
as we do not yet have universal methods for ontology matching, we do not have a clear methodology for designing multimodal ontologies (whether generic or not). the main advantages of using generic multimodal ontologies should be better understood now: they offer us a way to design the process of communication with agents as close to our way of thinking as possible, and they also play a very important role in several phases of multimodal communication (multimodal fusion and integration, disambiguation, nlp, error reduction, multimodal fission, etc.). the main disadvantage will probably be efficiency for the next few years, but given the exploratory nature of the research this is normal. the future work of our group will consider implementing new mechanisms for linking generic multimodal ontologies and affective interfaces with recent research in the semantic web and hci over a three-year interval (during the phd studies of the first author). the objectives are to be fulfilled involving european teams of researchers interested in this kind of project. acknowledgements this work was partially supported by the strategic grant posdru/88/1.5/s/60370 (2009) on "doctoral scholarships" of the ministry of labour, family and social protection, romania, co-financed by the european social fund - investing in people. bibliography [1] t. berners-lee, j. hendler, o. lassila. the semantic web. scientific american, may 2001, 34-43. [2] n. shadbolt, w. hall, t. berners-lee. the semantic web revisited. ieee intelligent systems, pages 96-101, may/june 2006. [3] t. berners-lee, w. hall, j.a. hendler, k. o'hara, n. shadbolt, d.j. weitzner. a framework for web science. foundations and trends in web science, 1 (1), pages 1-130, 2006. [4] c. bizer, t. heath, t. berners-lee. linked data - the story so far. international journal on semantic web and information systems, volume 5, issue 3. [5] i. cearreta, j. m.
lopez, n. garay-vitoria. modelling multimodal context-aware affective interaction. proceedings of the doctoral consortium of the second international conference on acii'07. lisbon, portugal, 57-64, 2007. [6] f. dalpiaz, p. giorgini, j. mylopoulos. an architecture for requirements-driven self-reconfiguration. proc. of the 21st int. conf. on advanced information systems engineering, lncs 5565, springer, 246-260, http://www.disi.unitn.it/ pgiorgio/papers/caise09-b.pdf, 2009. [7] b. dumas, d. lalanne, s. oviatt. multimodal interfaces: a survey of principles, models and frameworks. in d. lalanne, j. kohlas, editors, human machine interaction - research results of the mmi program, springer, 3-27, 2009. [8] i. dzitac, b.e. barbat. artificial intelligence + distributed systems = agents. international journal of computers, communications & control, issn 1841-9836, 4(1):17-26, 2009. [9] d.w. embley, a. zitzelberger. theoretical foundations for enabling a web of knowledge. retrieved from: http://dithers.cs.byu.edu/tango/papers/formalwok.pdf, 2009. [10] j. euzenat, p. shvaiko. ontology matching. springer, 2007. [11] a. ghazvinian, n. f. noy, c. jonquet, n. h. shah, m. a. musen. what four million mappings can tell you about two hundred ontologies. international semantic web conference 2009: 229-242. [12] t. r. gruber. a translation approach to portable ontologies. knowledge acquisition, 5(2):199-220, 1993. [13] m. gruninger. designing and evaluating generic ontologies. in ecai96's workshop on ontological engineering. [14] t. gruber. collective knowledge systems: where the social web meets the semantic web. journal of web semantics, 6(1):4-13, 2008. [15] n. guarino. the ontological level: revisiting 30 years of knowledge representation. in a. borgida, v. chaudhri, p. giorgini, e. yu (eds.), conceptual modelling: foundations and applications, springer verlag 2009: 52-67. [16] n. guarino, d. oberle, s. staab. what is an ontology? in s. staab and r.
studer (eds.), handbook on ontologies, second edition. international handbooks on information systems. springer verlag: 1-17, 2009. [17] g. guizzardi, t. halpin. ontological foundations for conceptual modeling. applied ontology 3, 1-12, 2008. [18] a. hartholt, t. russ, d. traum, e. hovy, s. robinson. a common ground for virtual humans: using an ontology in a natural language oriented virtual human architecture. in: language resources and evaluation conference (lrec), may 2008. [19] w. hu, y. qu. falcon-ao: a practical ontology matching system. web semantics: science, services and agents on the world wide web 6 (2008) 237-239. [20] a. jaimes, n. sebe. multimodal human-computer interaction: a survey. computer vision and image understanding, volume 108, issues 1-2, october-november 2007, 116-134, special issue on vision for human-computer interaction, 2007. [21] f. limpens, f. gandon, and m. buffa. linking folksonomies and ontologies for supporting knowledge sharing: a state of the art. technical report, eu project isicil, 2009. [22] d. lonsdale, d. w. embley, y. ding, l. xu, m. hepp. reusing ontologies and language components for ontology generation, accepted for publication in data and knowledge engineering. retrieved from: http://www.heppnetz.de/files/dke2008.pdf, 2010. [23] p. mika. social networks and the semantic web, springer, 2007. [24] j. niekrasz and m. purver. a multimodal discourse ontology for meeting understanding. in the 2nd joint workshop on multimodal interaction and related, 2005. [25] l. nigay, j. coutaz. a design space for multimodal systems: concurrent processing and data fusion. acm conf. human factors in computing systems (chi), 1993. [26] f. s. parreiras, c. saathoff, t. walter, t. franz, s. staab. apis a gogo: automatic generation of ontology apis. icsc 2009, 342-348, ieee international conference on semantic computing, 2009. [27] c. saathoff, a. scherp. m3o: the multimedia metadata ontology.
proceedings of the workshop on semantic multimedia database technologies, 10th international workshop of the multimedia metadata community (semudate 2009), graz, austria, 2009. [28] a. sebastian, n. f. noy, t. tudorache, and m. a. musen. a generic ontology for collaborative ontology-development workflows. in a. gangemi and j. euzenat, editors, ekaw, volume 5268 of lecture notes in computer science, 318-328. springer, 2008. [29] a.a.a. shareha, m. rajeswari, d. ramachandram. multimodal integration (image and text) using ontology alignment. american journal of applied sciences 6 (6): 1217-1224, 2009. [30] p. shvaiko, j. euzenat. ten challenges for ontology matching. in proceedings of the 7th international conference on ontologies, databases, and applications of semantics (odbase), pages 1164-1182, monterrey (mx), 2008. [31] w. v. siricharoen. ontology modeling and object modeling in software engineering. international journal of software engineering and its applications, vol. 3, no. 1, january 2009, 43-59, 2009. [32] p. smart, j. bao, d. braines, n. shadbolt. development of a controlled natural language interface for semantic mediawiki. in: proceedings of the workshop on controlled natural language, springer verlag, heidelberg, germany. [33] d. sonntag, m. romanelli. a multimodal result ontology for integrated semantic web dialogue applications. in proceedings of the 5th conference on language resources and evaluation (lrec 2006), genova, italy, may 24-26. [34] a. tenschert, m. assel, a. cheptsov, g. gallizo, e. della valle, i. celino. parallelization and distribution techniques for ontology matching in urban computing environments. om 2009. [35] d. traum, s. marsella, j. gratch, j. lee, and a. hartholt. multi-party, multi-issue, multi-strategy negotiation for multi-modal virtual agents. in proc. of the intelligent virtual agents conference iva-2008. [36] m. uschold.
ontology-driven information systems: past, present and future. in proceedings of the 5th international conference on formal ontology in information systems (fois 2008), saarbrücken, germany, (oct 31 - nov 3), 2008. international journal of computers, communications & control vol. ii (2007), no. 1, pp. 39-47 virtual communities and their importance for informal learning antonios andreatos abstract: this paper deals with the concept of informal learning in virtual communities on the internet. initially we discuss the need for continuing education and its relation to informal learning. virtual communities are then defined and compared to real communities. case studies are employed, focusing on some specific kinds of virtual communities. we examine how they operate, how their members interact, what values they share and what kind of knowledge they gather. the learning process within virtual communities is then examined. we look at the kind of information and knowledge available in some particular virtual communities, and comment on its organisation. next, the learning process of virtual communities is compared to that of open universities. finally, we claim that participation in virtual communities is not only a form of continuing education but also a contribution towards the multiliteracies needed for working as well as living in the 21st century. keywords: virtual communities, informal learning, multiliteracies. 1 introduction 1.1 defining informal learning learning is a natural, spontaneous and lifelong process of human nature. education, on the other hand, is a formal, structured, organised process with specific goals. the terms 'learning' and 'education' are often confused, because education is based on the learning process [1]. learning may be formal, non-formal or informal [2]. 1. formal learning (what we usually call education) is offered by elementary schools, high schools, colleges and universities; it is based on the teacher-student model. 2.
non-formal learning is still organised learning, but outside the formal learning system; it is offered by official organisations such as governmental services, youth organisations, training services, scientific unions, enterprises, voluntary and non-profit organisations, etc.
3. informal learning, on the other hand, is not organised but casual; even travelling or watching tv may lead to informal learning [1]. it is what we learn from everyday life [2].

1.2 continuing education and informal learning

social changes and the evolution of human knowledge in the digital era are so fast that they make further education imperative for many professionals. like regular education, continuing education may be formal, non-formal or informal. in a recent survey among the engineer members of the technical chamber of greece, it was found that [3]:
• the great majority (92.5%) believes that continuing professional education is necessary for finding a (good) job. a percentage of 50.6% believes that this education should take place every 3 years, while another 22% puts this interval at every 5 years.
• a majority of 56.4% believes that the most important needs for continuing professional education are related to computers and new technologies.
• a percentage of 49% has attended (or currently attends) a professional continuing education program, while most of those who have not attended such a program (60.1%) cite as the most important reason the lack of time due to work overload.
copyright c© 2006-2007 by ccc publications; selected paper from icvl 2006
from the above we conclude that a continuing professional education program is necessary for many professionals. according to the institute for research on learning, located in menlo park (2000), at least 80% of the professional knowledge, skills and practices needed for many jobs is informal [4].
since a lot of professionals lack the time needed to attend a non-formal professional continuing education program, they have to learn new things and acquire new skills informally. since informal education is so important, it is worth examining further. the most important characteristics of informal learning are [1], [4]:
• it does not take place in special educational establishments standing apart from normal life and professional practice;
• it has no curriculum and is not professionally organised; it rather stems accidentally, sporadically, in association with certain occasions, from changing practical requirements;
• it is not planned pedagogically, nor systematically organised in subjects;
• it is not qualification-oriented, nor officially recognised;
• it is not formally organised and financed by institutions;
• it is practical rather than theoretical;
• it is rather unconscious, incidental, problem-related and therefore well-focused;
• it is not instructed by a teacher or a course designer but rather self-directed;
• it is closely related to professional practice;
• it is a tool for living and survival.
the ability of informal, continuous self-education and training is a vital skill for today's professionals. knowledge becomes outdated quickly, so professionals have to continually update their experience and knowledge profile if they want to remain competitive. in this paper we are interested in studying informal learning in relation to virtual communities (vcs). it is a process which lies between the non-formal education defined above and casual learning; the actual point varies from person to person [5]. it is not casual learning, because it aims at a goal, and the goal has to do with the common interest of the vc members. a user participates in a vc which deals with his/her interest(s), either for professional reasons or as a hobby.
2 defining virtual communities

2.1 the era of new media

the internet has dramatically changed the way people get informed, interact and communicate in the 21st century. distribution of information and knowledge is nowadays carried out more and more via the internet [5]. it is characteristic that new terms such as blogs, bots, wikis and podcasting [6], [7] were unknown a few years ago, and are still not registered in most (paper) dictionaries. herein lies the internet's advantage: it is the only medium that instantaneously follows today's social evolutions. not only that, but it is actually driving the evolution. in 1998 there were about 30,000 blogs available; today, there are more than 35 million [8] and it is estimated that every second a new blog is created (www.technorati.com). on the other hand, philip meyer, author of the book "the vanishing newspaper: saving journalism in the information age", estimates that, with current trends, the last newspaper reader will recycle the last newspaper in april 2040 [8]! some types of new media, along with representative examples, are given below [9]:
• bbs: the well, genie
• blog: livejournal, xanga, myspace, facebook, blogspot, blogger, myciab
• webcomic: userfriendly, penny arcade, sluggy freelance
• habitat: lucasfilm's habitat, vzones
• im: icq, yahoo! messenger, msn messenger, aim
• irc/efnet
• mmorpg: everquest, ultima online, world of warcraft, silk road online
• moo: lambdamoo
• mud/mush: tinymud
• p2p: napster, kazaa, gnutella, morpheus
• usenet
• wiki: wikipedia, wikiwikiweb, pbwiki, wetpaint
• www: ebay, geocities, slashdot.

2.2 towards a definition

the birth of virtual communities is placed in the early years of the arpanet, back in the seventies, when the world wide web had not yet been invented. today they are well-established forums, i.e. virtual places for communicating and exchanging information.
however, the term virtual community appeared in 1993 and is attributed to the book of the same name by howard rheingold [10]. the book discusses a variety of information and communication technology (ict)-based communication and social groups. the technologies included usenet, internet relay chat (irc), chat rooms, electronic mailing lists and gaming communities such as multi-user dungeon (mud) and its clones (e.g., mush and moo). rheingold pointed out that belonging to such a group has some potential benefits for personal psychological health, as well as for society in general [9]. according to rheingold, virtual communities are formed "when people carry on public discussions long enough, with sufficient human feeling, to form webs of personal relationships" [10]. the explosive diffusion of the internet in certain countries was also accompanied by the proliferation of virtual communities. the nature of those communities and communications is rather diverse [9]. today, virtual communities or online communities are used by a variety of social groups interacting via the internet. different virtual communities, like real communities, have different levels of interaction and participation among their members. an important characteristic of a community is the interaction among its members. thus, an email distribution list with hundreds of recipients but zero or low interaction among members may not be called a virtual community. similarly, placing comments or tags on a blog or message board may not constitute a community. the highest degree of interaction is achieved in video gaming communities, where users compete online against other users. like traditional social groups or clubs, virtual communities often divide themselves into cliques or even split off to form new communities. also, membership turnover rate varies greatly from vc to vc [9].
each community shares its own interests, values, jargon [6], titles, leaders, and ways of communicating and exchanging information and knowledge.

2.3 comparison of vcs to real communities

there is of course no substitute for interpersonal communication, but it may be limited by distance; in vcs, on the other hand, the distance factor does not apply. the ability to interact with like-minded individuals instantaneously from anywhere on the globe has considerable benefits. perhaps the greatest advantage is that common interests are guaranteed in vcs, whereas this is not the case in real communities based on proximity. the use of multimedia technologies greatly facilitates long-distance communication today. the evolution of technology will eventually bring multimedia (image, video, etc.) dimensions to digital communication, which will enrich it further. of course, participation in a vc presupposes some familiarisation with ict and the relevant equipment (pc, internet connection, etc.). in real-life friendships, age is often a critical factor. usually, one's friends are around the same age; the generation gap constitutes a strong unwritten law in many societies. yet, in virtual communities there is no age barrier. this is very important in many countries (including greece) where the majority of internet users are young people and higher age groups are minorities [11]. since the personal characteristics of live communication are absent in vcs, user personalities are denoted by other symbols, such as nickname, personal information (email, website, blog, irc number, skype username, etc.), image / personal mark / signature, equipment related to the community interests (e.g. car, pc, cameras), user achievements related to the community interests, etc. vcs should be seen as supplementary to real communities and not as alternatives or substitutes.

3 case studies

the examination of some case studies will further clarify the above discussion.
1.
scientific union of adult education (of greece, www.adulteduc.gr). the common interest here is professional. the union organises conferences, seminars and meetings all around greece; it also issues an online bimonthly bulletin for briefing and member communication. this also contains information on newly-edited books and scientific journals and the corresponding links on adult learning, information on instructor certification, etc. a similar example is 'the hellenic network of open and distance education' (www.opennet.gr). these communities have a professional character, are a bit more formal (e.g., no nicknames) and have a hierarchy (president and members). they have a continuing-education as well as a self-education character.
2. hellenic linux club (www.hellug.gr). this club is an official greek non-profit association of people working with, using, or positively predisposed towards linux. its aim is the union of such individuals and the communication among them, in order to strengthen their ties, as well as the further proliferation of this operating system. means for achieving the above goals are: meetings; problem-solving support; translation of documents and articles into greek; improvement of greek language support in linux; development of free software; presence in meetings, conferences and exhibitions; collaboration with peer clubs with common goals; diffusion of know-how; follow-up and intervention whenever the interests of linux are threatened. a similar site is www.linux.gr. it contains news, documentation, articles, download material, links, guidelines for various linux distributions, guidelines for beginners, indexing and an electronic magazine. linux blogs (for instance: http://linuxhelp.blogspot.com, http://www.computerworld.com/blogs/software/os/linux, http://linux-blogger.com, and http://www.suseblog.com) are also available.
3.
www.overclockers.com: perhaps what is most admired here is the extra mhz a user can get out of a new pc, or the exotic water-cooling system one has constructed. vcs such as the second and the third one listed above may be characterised as hobbyist or amateur communities rather than professional ones. such vcs are freer, more informal, more casual; they share different values from the formal ones. nicknames are used instead of real names. a couple of examples (with pseudonyms) are: "john smith aka shroomer in the forums", "my name is valentino jones, a.k.a cr@zyvj on the net, and friends simply call me vj". also, formal education titles are not so important; the most important virtues are expertise, participation and volunteering to help other users.

4 learning in virtual communities

4.1 organisation of knowledge in vcs

in a formal distance-learning environment the educational material is well organised: (i) the courses are structured in prerequisite order, from the fundamental to the most complicated. (ii) the educational material is composed of learning objects (los) [2]; many los form a course and many courses form a curriculum. among the various courses there is no (or minimal) overlap. (iii) the educational material is usually managed by a learning management system (lms) [2], [13]. let us assume that the information / knowledge resources of a vc are the contents of its node (such as a website or a blog). in this case, the material is rather chaotically organised, with high overlap, no particular structure and no particular management. homepages link to several sub-pages and other related nodes. the various similar vc nodes (e.g., linux communities) are loosely connected. finding specific information sometimes requires specific searching and data-mining skills.
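the contrast between the structured lms material (los forming courses, courses forming a curriculum, in prerequisite order) and a loosely connected vc node can be sketched as follows. this is a minimal python illustration only; all class, course and page names are hypothetical, not taken from any real lms.

```python
from dataclasses import dataclass, field

# hypothetical names; an illustrative sketch, not a real lms schema
@dataclass
class LearningObject:
    title: str

@dataclass
class Course:
    name: str
    prerequisites: list = field(default_factory=list)    # courses that come first
    learning_objects: list = field(default_factory=list)

@dataclass
class Curriculum:
    courses: list = field(default_factory=list)

# formal distance-learning material: prerequisite order, minimal overlap
basics = Course("programming basics",
                learning_objects=[LearningObject("variables"),
                                  LearningObject("loops")])
advanced = Course("data structures", prerequisites=[basics])
curriculum = Curriculum(courses=[basics, advanced])

# a vc node, by contrast, is just a loose graph of pages linking to
# sub-pages and related nodes, with overlap, cycles and no prerequisite order
vc_node = {
    "homepage": ["howtos", "forum", "related-node"],
    "howtos": ["forum"],      # overlapping content
    "forum": ["homepage"],    # cycles are common
}
```

the point of the sketch is structural: in the curriculum every course declares what must come before it, while in the vc graph a reader can enter anywhere and follow links in any order.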
however, there is still a hidden hierarchy: the first level is the knowledge present in the node, which may be downloaded; the second level is the knowledge and experience of the community members, which is not seen.

4.2 looking for information in vcs

the most common ways of getting access to specific information from vcs are:
1. download articles from their nodes
2. participate in fora and pose questions
3. read faqs and search for keywords
4. use the site search engine (if available)
5. contact sage members ('gurus', 'masters', etc.) directly.
however, since the material is not organised, a user may have to search for several minutes in order to find what he/she is looking for.

4.3 comparison to open university practices

there exist strong similarities but also differences between the ways learning is achieved online in virtual communities and in open universities (ous) using the web. as an example of an open university we shall consider the hellenic open university (hou, www.eap.gr). hou students frequently interact with their instructor as well as with each other over the internet, in order to ask questions and get answers about the educational material and particularly the assignments they have to carry out. mostly the interaction is done by email and forums. the students are all provided with the same books and are supposed to follow a specific syllabus. the students meet regularly five times throughout the course of a year; attendance is not required. at the end of the academic year they also take a live exam, which is mandatory and counts for 70% of the final grade. none of these practices occurs in vcs, where learning is informal. but there is a strong similarity in that the students study and learn on their own. this practice is fundamental for the institution and operation of all ous worldwide [14]. similarities between ous and vcs are listed in table 1 below. table 2 lists some differences.
table 1: similarities between ous and vcs
- students / members study on their own
- they depend a lot on the educational material
- they help each other gain specific knowledge or skills
- they may be assessed by knowledge or skills (titles or grades or expertise)
- they may be anywhere in the world
- they are adults and therefore self-motivated
- they are moderated by an instructor or a list moderator or owner of the site

table 2: differences between ous and vcs
open universities | virtual communities
students are directed by the instructors | members of vcs are self-directed
students are provided with the same education material | members study different material and practice a lot
students are supposed to follow a specific syllabus | there is no syllabus
students have seen each other at least once | real-life interaction may never take place
focus primarily on knowledge | focus primarily on expertise
provide a title | do not provide a title
knowledge is more theoretical | knowledge is more practical and empirical

4.4 professionals and continuing education

today the internet is used by millions of people as an interminable pool of knowledge, a huge online encyclopaedia. a user seeking particular information on a subject may find a lot of it, not only in online encyclopaedias or dictionaries but also in specialised vc nodes. for this reason, the ability of informal self-education and training is a vital skill for today's professionals. based on personal experience, we believe that the information and knowledge gathered in some specific vc-related nodes is superior to that available in traditional, even academic, sources such as books, electronic or conventional, in terms of practicality and of in-depth, up-to-date coverage. since vcs continually update the (practical) skills of their members, we can claim that they offer some kind of informal education [15].
the user groups of these professional sites may be regarded as loose professional communities with no or limited interaction amongst users. many profit and non-profit organisations offer (often for free) seminars via the web (also known as 'webinars') to their customers or community members. the main purpose of most such webinars is to demonstrate the use of the companies' products (such as software tools, integrated circuits, e-learning platforms, etc.). let's look at some examples. the mathworks inc. offers free, live and interactive monthly webinars concerning the use of matlab toolboxes. saba, an e-learning systems company, also offers live online webinars (www.saba.com). national semiconductor (www.national.com) offers online seminars for design engineers. microsoft maintains a large 'knowledge base' with articles for computer and network professionals. teacher unions and communities do not lag behind. let us examine two case studies from the greek national internet domain:
• 'eeep', the greek primary teachers association for the valorisation of ict in education, is a non-profit open community. they issue a journal, organise conferences and maintain a lively site (eeep.gr). users can read news and download their electronic magazine (sometimes called a 'webzine').
• the aforementioned scientific union of adult education (of greece, www.adulteduc.gr) is another example. it organises conferences, seminars and meetings all around greece and issues an online bimonthly bulletin for briefing and member communication.

4.5 vcs and 'multiliteracies'

in a pioneering and important article published in 1996, the 'new london group' argues that today's world is characterised by increasing cultural and linguistic diversity and a variety of new communication modes and channels, due to the evolution of ict.
according to the authors, traditional language-based pedagogical approaches do not provide adequate skills for working and living in general in today's multi-cultural societies, and a new approach to literacy pedagogy, which they have called 'multiliteracies', is needed instead. multiliteracies are based on the assumption that the multiple linguistic and cultural differences in our society are essential to the working and private lives of students. the use of multiliteracies approaches to pedagogy will enable students to achieve the following two goals: a) create access to the evolving language of work and community; and b) foster the critical engagement necessary for them to design their social futures and succeed through satisfying employment [16], [17].

5 discussion and conclusion

in this paper we have examined virtual communities (vcs); more specifically, we have dealt with three types of vcs: video game vcs, professional vcs and amateur vcs. we have identified some differences among them, as well as some similarities and differences between vcs and real-life communities. next we have examined informal learning in vcs and we have compared the organisation of knowledge in vcs to that of distance-learning courses. learning gained by participation in vcs was briefly compared to the methods followed by open universities. furthermore, it was claimed that new 'digital' skills are needed by 21st-century citizens. from the discussion above we may conclude that for a professional, participation in professional vcs may be akin to continuing education, whereas for a non-professional it may merely serve as entertainment. of course, professionals may also benefit from non-professional vcs. in any case, however, free-will participation in vcs is very important, because it fosters the necessary 'digital behaviour' and cultivates 'digital communication' skills.
based on personal experience, we believe that the information and knowledge gathered in some community-related nodes concerning practical subjects is superior to that available in traditional, even academic, sources such as books, electronic or conventional. nowadays, when multiliteracy education is needed for living and working in the digital era, digital communication skills are necessary. 'digital behaviour' and 'digital communication' rules and ethics are being developed; therefore, all contemporary people should be 'digitally literate', in order to be able to survive in a changing and competitive environment. real communication skills are not enough; 'digital communication' skills are also needed. the ability to use the internet and the new media is vital for surviving in the 21st century. vcs will continue to play an important role in 21st-century society, due to social evolution, the globalisation of economy and knowledge, competition and new media technologies.

6 acknowledgement

the author wishes to thank mr. m. vidalis for reviewing the manuscript.

references
[1] a. rogers, teaching adults, open university press, 1996.
[2] http://en.wikipedia.org/wiki/learning, retrieved on dec. 11, 2006.
[3] continuous further education is necessary - changes in syllabus are imperative, news bulletin of the technical chamber of greece, no. 2423, jan. 15, 2007, pp. 6-8.
[4] http://en.wikipedia.org/wiki/informal_learning, retrieved on dec. 11, 2006.
[5] a. vardamaskou and p. antoniou, informal learning: evaluation of an internet-based physical activity educational program for adults, proceedings of the 3rd international conference on open and distance learning, patra, greece, vol. a, pp. 405-417 (in greek), 2005.
[6] s. ververidis, the glossary of new media, kathimerini (newspaper) special edition: new media: the alternative choice, 28, pp. 88-89 (in greek), 2006.
[7] d. doulgeridis, electronic diaries in common view, tachydromos magazine, no. 266, pp.
44-49 (in greek), 2005.
[8] c. angelopoulos, blogs change the landscape of communication, special edition of kathimerini (newspaper): new media: the alternative choice, 28, pp. 78-79 (in greek), 2006.
[9] http://en.wikipedia.org/wiki/virtual_communities, retrieved on may 15, 2006.
[10] h. rheingold, the virtual community: homesteading on the electronic frontier, harper perennial, san francisco, 1993; also available online at: www.rheingold.com/vc/book, retrieved on may 29, 2006.
[11] vprc national research for new technologies and information society, available online at: http://www.vprc.gr/2/1232/21_gr.html (in greek), 2005, retrieved on sept. 29, 2006.
[12] f. pantano-rokou, educational design for e-learning: models, meaning and impact on learning, open education, 1, pp. 45-68 (in greek), 2005.
[13] g. dimauro et al., an lms to support e-learning activities in the university environment, wseas transactions on advances in engineering education, vol. 3(5), pp. 367-374, 2006.
[14] d. vergidis, a. lionarakis, a. lykourgiotis, b. makrakis and ch. matralis, open and distance learning, vol. 1, institution and operation, hellenic open university, patra (in greek; title of book translated by paper author), 1998.
[15] a. margetousaki and p. michaelides, communities of practice as a place of learning and development, proceedings of the 3rd pan-hellenic conference on the didactics of information science, corinth (in greek), oct. 2005.
[16] the new london group, a pedagogy of multiliteracies: designing social futures, harvard educational review, vol. 66, no. 1, pp. 60-92, 1996.
[17] j. salpeter, 21st century skills: will our students be prepared?, available online at: www.techlearning.com/story/showarticle.jhtml?articleid=15202090, 2003, retrieved on march 22, 2006.

antonios andreatos
dept. of aeronautical sciences, div.
of computer engineering and informatics, hellenic air force academy, dekeleia, attica, tga-1010, greece. e-mail: andreatos@hafa.gr
received: november 17, 2006

editor's note about the author: antonios andreatos is a professor at the computer engineering division of the hellenic air force academy. he was born in 1960 in athens, greece. he received the diploma in electrical engineering from the university of patras in 1983, the m.s. degree from the university of massachusetts (amherst) in 1985 and the ph.d. from the national technical university of athens (ntua) in 1989. he was a research scholar at the european joint research centre in ispra, italy. he has published various papers in journals and conferences, and authored a book on the design of microcomputer systems in 2001. he has also taught at the hellenic open university. his main technical interests lie in the areas of microprocessors, computer architecture, computer networks, e-learning and adult education.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 1, pp. 106-124

gaze, posture and gesture recognition to minimize focus shifts for intelligent operating rooms in a collaborative support system

juan p. wachs
purdue university, school of industrial engineering, 315 n. grant st., w. lafayette, in 47907-2023, usa
e-mail: jpwachs@purdue.edu

abstract: this paper describes the design of intelligent, collaborative operating rooms based on highly intuitive, natural and multimodal interaction. intelligent operating rooms minimize the surgeon's focus shifts by minimizing both the focus spatial offset (the distance moved by the surgeon's head or gaze to the new target) and the movement spatial offset (the distance the surgeon covers physically). these spatio-temporal measures have an impact on the surgeon's performance in the operating room.
i describe how machine vision techniques are used to extract spatio-temporal measures and to interact with the system, and how computer graphics techniques can be used to display visual medical information effectively and rapidly. design considerations are discussed and examples showing the feasibility of the different approaches are presented.
keywords: posture recognition, behavior analysis, intelligent rooms.
copyright c© 2006-2010 by ccc publications

1 introduction

intelligent systems can assist in improving safety and performance during surgery in many ways. intelligent operating rooms assist the surgeon in time- and safety-critical situations. multimodal cues from humans and the environment can be used to extract behavior, which in turn can be used to derive contextual and event information. some of these behaviors include, for example, whether there are distractions, whether the subject is busy with certain tasks or how frequently the doctor switches his focus of attention.

1.1 previous research

hansen [1] studied focus shift in the operating room, using the terms focus spatial offset and movement spatial offset. focus spatial offset is the change in the doctor's gaze as a result of focusing on a new spot. conversely, movement spatial offset results from the doctor's change in position. it is possible to perform basic behavior analysis of body language in order to determine which type of offset of attention occurs at any point in time.

1.2 hypothesis

with behavioral information the following questions could be answered:
1. how can a surgeon's behavioral cues be leveraged to improve the operating room (or) layout dynamically?
2. how can innovative channels of interaction in this enhanced layout minimize unnecessary focus shifts?
3.
how can the doctor and the operating room collaborate to display time-sensitive visual information effectively?
our hypothesis is that a reconfigurable smart operating room can organize the displayed information in such a way that unnecessary focus shifts are minimized. this has an impact on the surgery performance time. moreover, with the adoption of natural modalities of interaction, unnecessary instructions to the nurses will be eliminated. this also has an impact on the task completion time.

1.3 previous works

previous research has demonstrated the negative effects of attention shift on the performance of cognitive and motor tasks [2], [3]. a study conducted by goodell et al. [4] looked at virtual reality laparoscopic surgical simulation tasks designed to replicate the levels of cognitive and motor demands in surgical procedures, and found that there was a 30-40% increase in task completion time in the distracted vs. undistracted condition. recent advances have been proposed to counteract unnecessary distracting behavior through the integration of doctor behavior analysis and context awareness into the operating room [5], [6]. the analysis of body language is critical in determining when the surgeon is operating, analyzing medical imagery or just chatting with the staff. providing the intelligent operating room with the ability to understand the doctor's behavior and the context of the situation allows the projection of patient imagery in the area that allows the least shift of focus and the most suitable interaction space. thus, focus shift is reduced and task performance is improved. for example, when a surgeon interacts with a particular display to obtain information which must be cross-checked with magnetic resonance imaging (mri) images obtained previously, in the current operating room's layout she will need to move to the main control (see figure 1). this causes focus and spatial shifting and hence distraction and unintended delay.
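to make the two offset measures concrete, here is a minimal sketch of how they could be computed from tracked gaze targets and body positions, together with a crude rule for labelling a shift as goal-oriented. the coordinates, the tolerance value and the rule itself are hypothetical illustrations, not the system described in this paper.

```python
import math

# focus spatial offset: distance the gaze target moves to the new spot;
# movement spatial offset: distance the surgeon's body actually covers.
# coordinates are in arbitrary room units.
def focus_spatial_offset(old_gaze_target, new_gaze_target):
    return math.dist(old_gaze_target, new_gaze_target)

def movement_spatial_offset(old_position, new_position):
    return math.dist(old_position, new_position)

# hypothetical rule of thumb: a shift counts as goal-oriented when a
# pointing gesture and the gaze are directed at roughly the same target
def goal_oriented(gaze_target, gesture_target, tolerance=0.5):
    return math.dist(gaze_target, gesture_target) <= tolerance

print(focus_spatial_offset((0.0, 0.0), (3.0, 4.0)))   # 5.0
print(goal_oriented((2.0, 1.0), (2.2, 1.1)))          # True
```

in a real system the gaze and gesture targets would come from head-pose and hand-tracking estimators, and the tolerance would be calibrated to the room geometry.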
an intelligent operating room can use the surgeon's body language to determine that she is currently interacting with a control and can then determine the best orientation and position at which to project the patient's mri imagery. a doctor's assistance system mitigates shift of focus and distractions because it senses the center of the surgeon's attention through context and uses this knowledge to display relevant patient imagery at this attention point. the system also provides a sterile interface for interaction with the visual information in the spot where the surgeon is already standing, thereby avoiding spatial shift. in this paper, i discuss how torso and head orientation can be used to find the focus of attention and the context of the event, and how gestures and voice can be combined to interact with medical imagery. this concept is depicted in figure 1. also, by extracting the dynamics of head pose (eye gaze) and hand gestures, it may be possible to identify those attention shifts which are premeditated or task-oriented. for example, a hand gesture oriented towards the display, combined with gaze directed towards it, may indicate that the attention shift was goal-oriented. if the surgeon is bored or distracted he would likely not point to the display. intelligent environments could use this semantic information to assist doctors and improve the safety and comfort of their interactions and surroundings. examples are given from two different real-time working systems that support collaborative work: real-time pose, posture and head orientation recognition, and a sterile hand gesture recognition system for displaying medical imagery. the remainder of the paper is organized as follows. section two covers previous work, and a description of the methodology proposed for displaying imagery and interacting with the intelligent room is presented in section three. a concluding discussion appears in section four.
wachs 2 related work a large amount of research has recently been conducted on intelligent rooms based on the detection of human behaviors and supporting collaborative work with the room. many of these examples use machine vision techniques to extract important clues, such as head-gaze orientation and posture patterns. they also use computer graphics techniques to display the visual information with high detail and accuracy using special displays or projectors. to interact with the room, voice, body and hand gestures are the prominent channels of natural communication.below, an overview of related research pertaining to the health-care scenario is presented. 2.1 intelligent rooms traditional intelligent rooms are multi-modal environments where audible and visible clues are captured and analyzed exclusively from people’s actions. these systems typically run in real-time, and utilize embedded sensors small enough as to be unnoticeable by people. intelligent rooms in collaborative environments, however, allow human-machine interaction. recent research has supported the incorporation of sensors that look inside a living or working space (the room) to observe inhabitant behavior and potentially infer his intent. one of the pioneering works is the digitaldesk [7], [8] where a computer display is projected onto the desk, and video cameras pointed down at the desk feed an image-processing system that can sense what the user is doing. this application allows users to construct a mixed paper and electronic drawing device. not surprisingly, some of the ideas first presented in the digitaldesk system can now be found in more sophisticated applications. one example is the multi-touch screen interaction on the current microsoft surface [9] device. 
Current approaches used in intelligent rooms combine robotics and vision technology with speech understanding algorithms and distributed agent-based architectures to provide information services in real time for users engaged in everyday activities [10]. Real-time three-dimensional people-tracking by a network of video cameras, using a rectilinear video array tracker and an omnidirectional video array tracker (O-VAT) to track the head and face of a person in an intelligent room, is described in [11]. In the GlobeAll system [12], a modular four-component prototype for a vision-based intelligent room is developed; the prototype takes video input from an electronic pan-tilt-zoom camera array, which performs tracking and interpretation. An intelligent room for welfare applications called the "Percept-Room" was developed in [13]. The system is capable of detecting human faces and classifying hand postures of Japanese Sign Language in color images. In the Computers in the Human Interaction Loop (CHIL) smart-room project [14], perceptual and situation-modeling components are used to provide context-aware services to the users of the smart room. Research on anthropocentric interfaces based on intuitive and natural interaction is under way and is expected to improve the usability of current systems based on multimodal interaction [15]. The intelligent room reverses the traditional model of teaching the user to interact with computers: instead, it supports natural interaction, enabling the user to feel as if the computer weren't there. The smart-room research is particularly encouraging: it has enabled people to use home appliances and perform everyday tasks that would be more awkward to do in other ways. Our goal is to bring these achievements into the operating room in the context of a collaborative support system. The steps involved in this transition are described in the next sections.
2.2 Smart Displays and Projectors

Smart projectors enable applications such as augmented reality, immersive displays for three-dimensional video-conferencing, real-time annotation and simulation, and complex, high-volume imagery display. Increasing computing power, smaller projectors and fast graphics cards make projector-based interaction an attractive feature for intelligent rooms. For example, a patient's basic information and condition could be displayed in his field of view, or an MRI image projected over a patient's head could help a surgeon determine a brain tumor's location more accurately. Projection onto side walls is the method adopted in this work. Wide-screen projection using multiple projectors is a challenging problem, since it requires undistorting the image, calibrating the projectors, setting a position and orientation for each projector, and eliminating shadows, and because the image surface is of unknown geometry. If the surface to be projected over is uneven, the problem becomes even more complex. [16] presents an image projection method that allows arbitrary observer-projector-screen geometries, relying on a robust structured-light approach that can easily be adapted to multi-projector configurations. An interactive automatic calibration process for multi-projector-camera environments is demonstrated in [17]. This method does not require calibration rigs, nor does it assume fixed display positions; however, it requires the cameras to be pre-calibrated. This is done by detecting self-identifying tags projected onto freely moving targets. Overlapping tiled projected images are obtained from multiple steerable projectors in [18]. The projectors operate simultaneously, and misalignments in the projected image are corrected through a preliminary process.
In addition, the authors present a virtual synchronization method for the projectors based on the dynamic memory architecture. A high-resolution video display system using multiple projectors to build a surround video display is presented in [19]. Bhasker et al. [20] suggested a registration technique that allows severely distorted projectors to be used, which opens up the possibility of mounting inexpensive lenses on projectors. This technique can handle imperfect and uncalibrated devices in planar multi-projector displays; more specifically, it can correct for geometrically imperfect projectors and photometrically uncalibrated cameras. Approaches to solving occlusion and the resulting blinding light in front projection are reported in [21]. An algorithm based on a distributed calibration framework for multi-projector displays, where the projectors cooperatively re-estimate the poses of all projectors during actual display use, is discussed in [22].

2.3 Hand Gesture Recognition in Healthcare

Natural hand poses and gestures are used to control, teach, treat and manipulate systems in diverse areas of the healthcare environment. Gestures can be used to control the distribution of resources in a hospital, to interact with medical instrumentation and visualization displays, to help handicapped users as an alternative interface to computers, and as part of rehabilitation therapy. When the hands are attached to sensors that provide haptic (tactile and force) feedback, a surgeon's gestures can be used to perform long-distance surgeries with the help of telerobots. Additional systems use voice, gaze and gestures together, profiting from the combined advantages of these modalities to convey richer and redundant information. Some gesture concepts have been exploited for improving medical procedures and systems.
the "come as you are" requirement is addressed in face mouse [23], where a surgeon can control the motion of the laparoscope by simply making the appropriate face gesture, without hand or foot switches or voice input. current research to incorporate hand gestures into doctor-computer interfaces has appeared in graetzel et al.[24]. they developed a computer vision system that enables surgeons to perform standard mouse functions (pointer movement and button presses) with hand gestures while addressing the "intuitiveness" requirement. a hand gesture tracking device for browsing mri images in the operating room (or), called "gestix" was developed in [25] and it was validated in a real brain biopsy (see figure 1). "gestix" addressed both the "come as you are" and "intuitiveness" requirements by providing a natural effective interface. the "comfort" requirement is fulfilled in "wearit@work" [26], a rfid reader is used to identify the patient and to interact with the hospital information system (his) using gestures to fill out exams and prescriptions. this project ensures sterility. however, since this is an encumbered interface, the "come as you are" requirement is violated. from the patient side, the most prominent requirements in a hand gesture interface system are "user 110 j. p. wachs figure 1: a surgeon using gestix to browse medical images adaptability and feedback" and "come as you are" because impaired users may be limited in the classes of gestures that they can learn and the devices that they can wear. in this context, wheelchairs as mobility aids have been enhanced with robotic/intelligent vehicles able to recognize the user’s commands indicated by hand gestures [27]. the gesture pendant [28] is a wearable gesture recognition system that can be used to control home devices and provides additional functionality as a medical diagnostic tool. 
the "user adaptability and feedback" requirement is addressed in staying alive[29], which is a virtual reality imagery and relaxation tool which allows cancer patients to navigate through a virtual scene. a haptic glove attached to the hand was used to rehabilitate post-stroke patients in the chronic phase in [30]. these reviewed systems indicate that hand gesture interfaces in medical domains still represent a novel area of research and that few systems are currently in use in healthcare environments. nevertheless, current works highlight the potential of gestures as a natural modality for assisting in the advancement of medical research and surgery, and indicate the need for additional research and evaluation procedures so that such systems can be widely adopted. 3 methods in this section, the philosophy behind the design of our system is presented, and how to develop an "intelligent" operating room based on off-the shelf hardware (a servo controlled projector, four pan-tiltzoom cameras, four microphones, and a controlled connected to a dedicated computer) is described. the system consists of four sub-systems: (a) steerable projectors, (b) focus of attention determination, (c) hand gesture interaction and (d) speech interaction. first the surgeon’s posture, pose and orientation are tracked and detected. this information is sent to the steerable projector, which controls a servo-mirror where the projector ray is deflected. the head orientation is used to determine the closest wall in front of the surgeon where the medical imaging can be reflected. the projection system is activated only when the surgeon is standing straight and staring at a wall, and also evokes a command by saying "computerproject-now". once the image is projected, the surgeon can interact with the images using hand gestures gaze, posture and gesture recognition to minimize focus shifts for intelligent operating rooms in a collaborative support system 111 and voice commands. 
The system is turned off when the surgeon performs a specific gesture command. These functionalities are described in the schema presented in Figure 2.

Figure 2: Architecture for the intelligent operating room

In the following sub-sections the subsystems are described in greater detail. Most emphasis is given to focus of attention determination and hand gesture interaction, because previous research has shown these to be the most difficult challenges of this environment. Steerable projectors and speech recognition are the focus of future work.

3.1 Steerable Projectors for Non-Uniform Projection Surfaces

A steerable projector allows us to move the projected image in real time such that it stays in front of the surgeon, perpendicular to his torso. The implementation of the steerable projector proposed here is similar to [31], where a standard digital projector is combined with a mirror attached to a pan-tilt unit. This approach is more compact and much cheaper than standard steerable projectors. The model includes the internal parameters of the projector, the projector pose, the pose of the pan-tilt mirror system, and the internal details of the mirror system. By changing the pan and tilt angles programmatically, the mirror rotates about the pan and tilt axes respectively, and the projected image changes in turn. The transformation between the projector pose and the mirror pose can be calculated through a calibration process. Since the pan-tilt angles affect the registration between the projected image and the camera model of the 3D view, the use of a fixed set of projector poses is suggested, one for each wall, reflecting the surgeon's possible orientations: upper abdominal procedures are usually performed with the surgeon standing at the midline, while central and pelvic operations are generally performed with the surgeon on the lateral side of the patient [32].
Therefore, a set of four reflected projector poses is calculated, one for each mirror pose. Each pair of θ and ϕ values results in a different mirror pose and hence a different set of projected corner points on the surface. The rigid transform from the reflected projector position for each of those mirror poses is calculated using the procedure detailed in [33]. This requires the use of a camera to capture the projected image on the display surface, and assumes that the display surface is uneven. Since in our case the display surface could potentially be any wall around the surgeon, a pan/tilt/zoom camera dedicated to the projector is allocated, in addition to those used to capture the surgeon's postures.

Figure 3: Concept image from the intelligent operating room

Let the display surface be represented by a 3D mesh. I try to find the transform that relates a given 2D pixel in the projector image plane (z) to a 3D point on the display surface (k), given that the camera is placed so that the same point appears in the camera image as z′; see Figure 4.

Figure 4: Camera and projector 3D view geometry representation

Each point X = (x, y, w)^T on the uneven display surface is the result of a ray originating at the center of the projector Op and traversing the projector plane at point z = (û_p, v̂_p), which in turn appears in the image captured by the pan/tilt camera as point z′ = (û_c, v̂_c). The goal is to find the static parameter f for every point relating the projector plane and the display surface.
Knowing the internal parameters of the projector, the internal parameters of the camera and the homography matrix, [33] show that each sample point k in the 3D mesh can be found using (1):

[x, y, z]^T = P^{-1} (−p̂ + f [û_p, v̂_p, 1]^T)    (1)

where P is the 3×3 rotation matrix, p̂ is the 3×1 translation vector, and f is the parametric scalar value. It is possible to show that the parameter f can be estimated using traditional correspondence approaches, or using predictive methods such as particle filters or Kalman filters. In [33] a bank of Kalman filters was used to find the correspondences between the projector pixel z and the projected point k, one filter for each point z. To reduce the complexity of the process, in every frame only one point z is selected for matching. Once a point is selected by some method (pseudo-randomly, distance-based, or with a feature detector), a patch around the neighboring pixels is selected. This patch is cross-correlated with the predicted location on the camera's image, close to the epipolar line. Once the new coordinate is found in the camera's image, the parameters of the 3D mesh are updated. This process corrects the projected image over the uneven surface so that curves and discontinuities are not perceived by the viewer; however, errors (maximum 6.78 mm and mean 2.41 mm) similar to [33] are expected. Sensitivity studies based on these errors are the focus of future work.

3.2 Focus of Attention Determination

In this section, it is shown that by extracting the surgeon's head orientation (which is used as a proxy for eye gaze) and torso posture and orientation (which are indicators of the action being performed), it may be possible to identify the focus spatial offset and movement spatial offset associated with attention shifts.
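Under the parametric model of Eq. (1), recovering a surface point from a projector pixel is a one-line computation. A minimal Python sketch, assuming the rotation P, translation p̂ and scalar f are already known from the calibration described above (function and variable names are ours):

```python
import numpy as np

def surface_point(R, t, uv, f):
    """Back-project projector pixel z = (u, v) onto the 3D display surface.

    Implements Eq. (1): X = P^{-1}(-p_hat + f * [u, v, 1]^T), where R is the
    3x3 rotation (P), t the 3x1 translation (p_hat) and f the scalar depth
    parameter estimated by the Kalman-filter bank.
    """
    z = np.array([uv[0], uv[1], 1.0])       # homogeneous projector pixel
    return np.linalg.inv(R) @ (f * z - t)   # point on the display surface
```

With identity rotation and zero translation the surface point simply lies at depth f along the pixel ray.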
In the first case, gaze orientation tells us where to project the medical imagery (which wall to use as the display surface). In the second case, torso posture (standing straight or bent) indicates the surgeon's possible intention: interaction with the patient, interaction with the medical imaging display, or another activity. Our method for extracting head and torso position, posture and orientation is to treat each body configuration as a different class; for example, the following notation describes the current state of a surgeon: s1 = standing, torso 90 degrees, facing 0 degrees. Thus, in this section, a multi-class classifier based on parts-based models is described to find each of these configurations. The approach is described for object detection using single- and multi-class detectors in the context of US Marine detection, as a case study only, but the same approach can clearly be applied to any person. First, the feature extraction process from patches (or parts) is described, and then the basic and shared classifiers are discussed.

3.2.1 Dictionary Creation

Initially, a dictionary is created from square sub-region patches extracted from a set of images per class, similar to [34]; these are also called "features". Each image is convolved with a filter from a bank of filters, grayscale-normalized and re-scaled to a standard scale of 128×48 for standing Marines and 64×48 for kneeling Marines. Patches are then selected at x, y locations found using an interest-point detector. At those locations, patches are extracted from all the filtered images. Each patch is associated with the place from which it was extracted, relative to the center of the object. This location information is stored in two vectors containing the x and y offset distances, lx and ly respectively, after applying a blurred delta function to them. Hence, each entry i in the dictionary has the form vi = {filter, patch, lx, ly, image no.}.
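The dictionary-creation step above can be sketched as follows. This is an illustrative simplification: patches are sampled at random positions rather than with an interest-point detector, FFT-based convolution stands in for the filter bank, and the blurred delta function is omitted, storing the raw (lx, ly) offsets instead:

```python
import numpy as np

def build_dictionary(images, filters, n_patches=20, patch=9, rng=None):
    """Sketch of Sec. 3.2.1: build dictionary entries of the form
    {filter, patch, lx, ly, image no.} from a set of class images."""
    if rng is None:
        rng = np.random.default_rng(0)
    entries = []
    for img_no, img in enumerate(images):
        cy, cx = img.shape[0] // 2, img.shape[1] // 2   # object centre
        for f_idx, f in enumerate(filters):
            # convolve image with the filter (circular FFT convolution)
            filtered = np.real(np.fft.ifft2(
                np.fft.fft2(img.astype(float)) * np.fft.fft2(f, s=img.shape)))
            for _ in range(n_patches):
                y = rng.integers(0, img.shape[0] - patch)
                x = rng.integers(0, img.shape[1] - patch)
                entries.append({
                    "filter": f_idx,
                    "patch": filtered[y:y + patch, x:x + patch].copy(),
                    "lx": x - cx,          # x offset from the object centre
                    "ly": y - cy,          # y offset from the object centre
                    "image": img_no,
                })
    return entries
```

With 8 images, 20 patches per image and a bank of 4 filters, this yields the 640 entries quoted in the text.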
If 8 images per class are used to create the dictionary, 20 patches are extracted per image, and a bank of 4 filters is used, classifying into 8 classes, a total of 640 entries is obtained. The procedure is shown in Figure 5.

Figure 5: Dictionary entries: patches selected randomly (in the left image) are convolved with a bank of filters. The position of the patches is represented by the location matrix (right). Since the red patch is at almost the same horizontal position and at the top, relative to the center, the position matrix has a bright spot.

3.2.2 The Feature Vectors

The training set is created from a sample set of images excluding those used for the dictionary creation. For each of the eight object classes, I find all the images that include that type of object. In each image, feature vectors are obtained using the following method:

1. Scale all the images in the training set so the object of interest is bounded by a rectangle (region of interest, ROI) of size 128×48 or 64×48 for standing and kneeling respectively, and the images are no larger than 200×200.

2. For each image j normalized in scale, apply each entry i of the dictionary to it: the image is convolved with the filter in entry i, and convolved again with a Gaussian to smooth the response. Next, it is cross-correlated with the patch in entry i, yielding a strong response where this patch appears in the filtered image. Finally, the 1D filters lx and ly are applied to the cross-correlated image, effectively "voting" for the object center. This is summarized in (2):

v_i(x, y) = [(I ∗ f_i) ⊗ p_i] ∗ l_x^T l_y    (2)

where ∗ is the convolution operator, ⊗ is the normalized cross-correlation operator, v_i(x, y) is feature vector entry i, f is a filter, p is a patch, and l_x and l_y are the x and y location vectors with respect to the center of the image, respectively. Each training feature vector is coupled with a class label (1 to 8), or -1 for negative samples.
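The final step of Eq. (2) — applying the location filters so each patch response "votes" for the object centre — can be illustrated as a plain integer shift of the response map (a simplification of convolving with the blurred delta functions l_x, l_y):

```python
import numpy as np

def vote_for_centre(response, lx, ly):
    """Voting step of Eq. (2): a patch response at (x, y) votes for the
    object centre at (x - lx, y - ly). Here the blurred delta functions
    are simplified to a hard circular shift of the response map."""
    return np.roll(np.roll(response, -ly, axis=0), -lx, axis=1)
```

A strong response at the patch location thus turns into a strong response at the hypothesised object centre, and summing the shifted maps of many patches accumulates the votes.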
For a sample set of 25 images per class, 4000 negative and 200 positive samples are obtained, with 640 features; see Figure 6.

3.2.3 Sharing the Features Effectively

In this section I briefly describe the joint boosting algorithm used for multi-class, multi-view object detection. For a more detailed discussion, refer to [35]. A boosting algorithm is an additive model where weak learners are sequentially added to form a strong classifier. For the multiclass case, the strong learner is defined as:

H(v, c) = Σ_{m=1}^{M} h_m(v, c)    (3)

where v is the input feature vector, M is the number of boosting iterations and c is a specific class.

Figure 6: Positive and negative vector set creation using the dictionary entries and sampling the center out of the silhouette points. Each sampled point is a vector, where entry j in the vector represents the number of votes assigned by patch p_j.

H(v, c) = log P(z^c = 1 | v) / P(z^c = −1 | v) is the logistic function, where z^c is the membership label (±1). When the expectation is replaced by an average over the training data, the cost function can be written as:

J_wse = Σ_{c=1}^{C} Σ_{i=1}^{N} w_i^c (z_i^c − h_m(v_i, c))²    (4)

where N is the number of training vectors, w_i^c is the weight of sample i for class c, and z_i^c is the membership label of sample i for class c (±1). The weak shared learner, also called a regression "stump", is defined for the multiclass case in (5):

h_m(v, c) = a_S if v^f > θ and c ∈ S(n); b_S if v^f ≤ θ and c ∈ S(n); k^c_S if c ∉ S(n)    (5)

where v^f is the f-th component of the vector v, θ is a threshold, a_S and b_S are regression parameters, and S(n) is a subset of the class labels. Each round of boosting consists of selecting the shared "stump" and the shared feature f that minimize (4) over the subset of classes S(n), by the following procedure: pick a subset of classes S(n).
Search all components f of the feature vector v; for each component, search over all discrete values of θ, and for each pair {f, θ} find the optimal regression parameters a_S and b_S using (6)-(8). Finally, select the {f, θ, a_S, b_S} that minimizes (4):

a_S(f, θ) = [Σ_{c∈S(n)} Σ_i w_i^c z_i^c δ(v_i^f > θ)] / [Σ_{c∈S(n)} Σ_i w_i^c δ(v_i^f > θ)]    (6)

b_S(f, θ) = [Σ_{c∈S(n)} Σ_i w_i^c z_i^c δ(v_i^f ≤ θ)] / [Σ_{c∈S(n)} Σ_i w_i^c δ(v_i^f ≤ θ)]    (7)

k^c = [Σ_i w_i^c z_i^c] / [Σ_i w_i^c]    (8)

where δ is the indicator function. A shared weak learner is therefore associated with a set of six parameters {f, θ, a_S, b_S, k^c, S(n)} for the selected subset of classes. It is more efficient to keep a pointer to the dictionary entry from which f was obtained rather than keeping the whole feature vector (Figure 7 displays all the entries in the dictionary). This also provides us with the patch, filter and location-vector entries in the dictionary, which are used in the detection stage. The new weak learner is added to the previous accumulated learner for each training example: H(v_i, c) = H(v_i, c) + h_m(v_i, c), where h_m is computed for the optimal subset of classes. The optimal subset of classes is the one that minimizes the misclassification error by selecting a feature shared by those classes. Finally, the chain of weak learners is stored in the accumulated learner.

Figure 7: Dictionary entries selected by the multiclass AdaBoost

3.2.4 Detection

To detect an object of class c in a test image, I need to compute the score for every pixel in the image, provided by the strong classifier H(v, c) evaluated at all pixels. If the score exceeds some threshold, the object is detected. To calculate H(v, c) I use the following procedure. I find all the shared weak learners that share class c, and for each sharing weak learner:

1. Obtain the 4-tuple {f, θ, a_S, b_S} from the weak learner.
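The stump fit of Eqs. (6)-(7) reduces to a pair of weighted averages of the labels on either side of the threshold. A minimal sketch for the classes sharing a feature, with the samples flattened over (class, sample) and names of our choosing:

```python
import numpy as np

def fit_shared_stump(v_f, z, w, theta):
    """Eqs. (6)-(7): weighted-average regression parameters a_S, b_S for
    one feature component v^f and threshold theta. z holds the +/-1
    membership labels, w the boosting weights, flattened over the
    classes in S(n)."""
    above = v_f > theta
    a = np.sum(w[above] * z[above]) / np.sum(w[above])      # Eq. (6)
    b = np.sum(w[~above] * z[~above]) / np.sum(w[~above])   # Eq. (7)
    return a, b

def stump_predict(v_f, theta, a, b):
    """Weak shared learner h_m of Eq. (5) for classes in S(n)."""
    return np.where(v_f > theta, a, b)
```

Each boosting round would call `fit_shared_stump` for every candidate (f, θ) and keep the pair minimizing the weighted squared error of Eq. (4).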
Since f is associated with an entry in the dictionary, I retrieve the corresponding filter, patch and vectors l_x, l_y from the dictionary and apply them to the test image using (2).

2. Calculate h_m(v) = a_S δ(v^f > θ) + b_S δ(v^f ≤ θ), where v^f is the image obtained in the previous step.

Finally, I add up all the weak learners. Each weak learner votes for the center of the sought object, expressed as the grayscale image obtained in step 2. The accumulated image has bright pixels where the weak learners "agreed" about the center of the object in the "voting space". A maximum in the accumulated image indicates the probability of finding the object at that location. Each strong detector of a different class outputs an accumulated image, so it is possible that more than one strong detector will vote for the same (or very close) pixel coordinates. This situation is not rare, since some postures are very similar. To resolve this conflict, peaks that are closer than a given radius are clustered together, and the resulting class of the detection is the class with the highest maximum.

3.2.4.1 Torso and head orientation

The result of the maximum-voting schema over the strong detector outputs is one class out of the four possible classes. Each class is associated with a torso orientation: classes 1 to 4 correspond to 0, 90, 180 and 270 degrees of rotation of the torso about the azimuth axis, respectively. Once the class is selected, a bounding box is placed whose size equals the average size of the annotated dictionary boxes for that class and whose center corresponds to the peak of the votes. A colored bounding box with an arrow in the direction of the torso is plotted on the tested image to represent the direction towards which the torso is rotated.
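The last stage — keeping per-class vote maxima and merging peaks that fall within a given radius, with the highest-scoring class winning — can be sketched as follows (the radius and threshold values are illustrative assumptions):

```python
import numpy as np

def detect(strong_maps, radius=5, thresh=0.0):
    """Peak clustering of the detection stage: each class's accumulated
    vote image proposes its maximum as an object centre; peaks closer
    than `radius` are merged, keeping the class with the highest score."""
    peaks = []
    for c, m in enumerate(strong_maps):
        y, x = np.unravel_index(np.argmax(m), m.shape)
        if m[y, x] > thresh:
            peaks.append((float(m[y, x]), c, int(y), int(x)))
    detections = []
    for score, c, y, x in sorted(peaks, reverse=True):   # strongest first
        # keep only peaks farther than `radius` from an accepted detection
        if all((y - dy) ** 2 + (x - dx) ** 2 > radius ** 2
               for _, _, dy, dx in detections):
            detections.append((score, c, y, x))
    return detections
```

Two nearby posture classes voting for almost the same pixel thus collapse into a single detection of the stronger class.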
For head orientation, which is our proxy for gaze direction (this assumption holds when the object visualized is far enough from the observer), a different multiclass detector was trained around the head of the subject. This additional multi-class classifier was trained to detect the head in four orientations. The torso detection is performed first and determines a limited search region for the head. Experiments yielded the top 1/7th of the body detection area, with added margins above the top, to be a sufficient head search region. Should multiple body postures be detected, a heuristic increases the search region, taking nearby body detections into account.

Figure 8: Examples of the head and torso recognition procedure in different scenarios

3.3 Hand Gesture Interaction

A hand gesture interaction system used in the OR by doctors/surgeons should meet the following specifications [36]: (1) real-time feedback and operation; (2) low fatigue; (3) high intuitiveness; (4) robustness to unintended action; (5) robust recognition; (6) ease of use and learning; (7) unencumbered operation (no wires attached). Following these considerations, our approach is described next.

3.3.1 Approach

Four pan/tilt/zoom network cameras placed at the vertices of the operating room's ceiling capture a sequence of images of the hand. The hand is tracked by a tracking module which segments the hand from the background using color and motion information. To clean the main object, morphological image-processing operations are used. The location of the hand is represented by the 2D coordinates of its centroid, which is sampled continuously, resulting in a trajectory. These trajectories are mapped to a set of commands. For example, a flick gesture is the rapid movement of the hand from a neutral position in a specific direction and back to the original position. 'Flick' gestures are used to navigate through the image projected on the walls of the OR.
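The head-search heuristic above (the top 1/7th of the torso box plus a margin above it) is easy to express directly; the margin fraction below is an assumed value, since the text does not give one:

```python
def head_search_region(body_box, margin=0.05):
    """Limit the head search to the top 1/7th of the detected torso box,
    with a small margin added above (the `margin` fraction of the box
    height is our assumption). Boxes are (x, y, width, height) with the
    origin at the top-left corner."""
    x, y, w, h = body_box
    extra = int(margin * h)                      # extra rows above the box
    return (x, max(0, y - extra), w, h // 7 + extra)
```

Restricting the head detector to this region keeps the second multiclass classifier cheap and reduces false positives elsewhere in the frame.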
The doctor's/surgeon's intended actions/commands are recognized by extracting features from the spatio-temporal data of the gestures. Using the commands to which the gestures are mapped, doctors can bring up X-ray images, select a patient record from the database or annotate a region on the image. A two-layer architecture is used: the lower level provides tracking and recognition functions, while the higher level manages the user interface. Field conditions, as opposed to laboratory settings, may raise challenges related to limited lighting and unfixed environments. We plan to address this problem using simulation under operating room conditions to obtain a better assessment of the system's performance.

3.3.2 Hand Segmentation and Tracking

A 2D histogram is generated in real time during 'calibration' from the doctor's hand. The calibration process is initiated when the user slowly places his hand into a boundary without touching the screen. This, in turn, is used to build a hand color distribution model. A pixel at location (x, y) in any frame is converted to the probability that the pixel belongs to a hand (or gloved hand) using the 2D histogram lookup table created earlier (Figure 9).

Figure 9: User hand skin color calibration

To avoid false motion clues originating from non-hand motion in the calibration phase, a background maintenance procedure was developed. First, an initial image of the background is created. Changes are detected by background differencing; when this difference is coherently significant, I assume that the present object is a hand. The stored background image is updated using a running smoothed average (9):

B_k(i, j) = λ F(i, j) + (1 − λ) B_{k−1}(i, j)    (9)

where B_k is the updated stored background image at frame k, B_{k−1} is the stored background image at frame k−1, λ is the smoothing coefficient (regulating the update speed), and F(i, j) is the current image at frame k.
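Eq. (9) is a standard exponential running average; a direct transcription:

```python
import numpy as np

def update_background(bg, frame, lam=0.05):
    """Running smoothed average of Eq. (9):
    B_k = lambda * F + (1 - lambda) * B_{k-1}.
    `lam` regulates how quickly the stored background adapts; the
    default value here is an illustrative assumption."""
    return lam * frame + (1.0 - lam) * bg
```

A small λ makes the background adapt slowly, so a briefly present hand does not get absorbed into the background model.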
To track the hand, CamShift is used [37]. It uses a probability distribution image comprised of pixels representing hand colors. This hand image is created from a 2D hue-saturation skin color histogram [38]. The histogram is used as a look-up table to convert the acquired camera images of the hand into corresponding hand pixels, a process known as back-projection. In the original CamShift algorithm, the probability of a pixel belonging to the hand is determined by the grayscale value of the pixel only. In lieu of using color probability alone, I combine it with motion information to represent the hand-pixel probability. The relative weights of color and motion are shifted according to the amount of motion in the scene, resulting in an adaptive fusion system. Using the centroid and size of the hand pixel blob, an iterative procedure based on a generalization of the mean shift algorithm [39] is used to update the tracking window at each frame. Thresholding to black and white, followed by morphological operations, is used to obtain a single component for further processing to classify the gestures.

3.3.3 Operation

The gesture interface can be used to browse medical databases and manipulate the projected imagery, such as X-rays and MRIs. The finite state machine in Figure 10 illustrates the operational modes of the gesture interface. After the calibration procedure, dynamic gestures are used to browse images, and hand poses are used to switch between modes of operation. The central area in the middle of the frame is called the "neutral area"; see Figure 11. When a doctor decides to perform a specific operation on a medical image, he/she places the hand in the 'neutral area' momentarily, followed by the zoom or rotation pose gesture.
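The adaptive colour-motion fusion used by the modified CamShift tracker of Section 3.3.2 can be sketched as below. The text does not give the exact weighting rule, so the mean-motion weighting here is an assumption of ours:

```python
import numpy as np

def fused_probability(color_prob, motion_prob):
    """Illustrative fusion of the colour back-projection map and a motion
    probability map (both in [0, 1]): the more motion in the scene, the
    more weight the motion channel receives. The weighting rule (mean
    motion energy as alpha) is our assumption, not the paper's formula."""
    alpha = np.clip(motion_prob.mean(), 0.0, 1.0)
    return alpha * motion_prob + (1.0 - alpha) * color_prob
```

In a static scene (alpha near 0) the fused map reduces to the plain colour back-projection, matching the original CamShift behaviour.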
gaze, posture and gesture recognition to minimize focus shifts for intelligent operating rooms in a collaborative support system
figure 10: state machine for the gesture-based medical browser
figure 11: four quadrants mapped to cursor movements
navigation gestures are designed to browse through a medical data browser projected on the wall. a sense of immersion is created by representing 3d objects, where each image is a side of a cube and the cubes are arranged in numerous levels. the cube can be rotated cw and ccw, and moved up and down in the vertical direction to exhibit the various levels on the screen. hence, any image on the screen can be accessed directly by four navigation commands. a 'flick' gesture is performed when a doctor/surgeon moves the hand out from the 'neutral area' toward any of four directions, and then back to the neutral area. interaction is designed in such a way that the gesture commands are carried out only when the doctor's body posture is standing still, rather than bending towards the patient (which may indicate that the surgeon is operating). zoom is used to change the size of an image. when the zoom mode is triggered, the size of the image changes according to the proximity of the palm to the camera. to go back to the normal mode, the hand is moved out from the neutral area in any of the 4 directions. the "zoom mode" is activated when the hand, while in the neutral area, suddenly rotates ccw by 90°. rotation is achieved through the rotation of a sterilized straight instrument held in the doctor's hand. in rotation mode, the angle to which the medical image is rotated is determined by the angle between the instrument and the horizontal axis, in increments of 90°. the instrument's orientation can be found using the probabilistic hough transform (pht). only lines in the near proximity of the hand are considered (2.5 times around the interaction window).
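recovering the rotation command from a line segment (such as the probabilistic hough transform would return for the instrument) reduces to an angle computation; the quantization to 90° steps follows the text, while the rounding rule is an assumption:

```python
import numpy as np

def instrument_angle(x1, y1, x2, y2):
    """angle in degrees between a detected line segment and the
    horizontal axis (image coordinates), folded into [0, 180)."""
    return float(np.degrees(np.arctan2(y2 - y1, x2 - x1))) % 180.0

def rotation_command(angle_deg):
    """quantize the instrument angle to the 90-degree increments used to
    rotate the medical image. rounding to the nearest increment is an
    assumed rule, not one stated in the text."""
    return int(round(angle_deg / 90.0)) * 90 % 180
```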
to avoid the recognition of unintentional gestures, when the doctor wants to temporarily stop the operation of the hand gesture recognition system, he can either gaze down towards the patient, or move the hand to the lowest part of the screen, keeping it within the screen boundaries. to return to the "normal mode" a wake-up gesture is used, whereby the user waves the hand over the small rectangle in the upper left corner of the screen. this is useful when the doctor wishes to discuss details on the projected image without being "tracked" by the system. 3.4 speech interaction the main goal regarding the intelligent operating room is twofold: (a) to identify the user (id tagging) automatically and (b) to recognize spoken utterances. the first goal, on the one hand, grants the user (either a nurse or the main surgeon) access to digital patient records and medical imaging tools according to the privileges that have been assigned to him, and on the other hand allows the room to keep track of the subject when ambiguity occurs due to lighting changes, occlusions and merging of several subjects. if i want to allow only the surgeon to manipulate and annotate the patient's medical images during surgery, the operation will be activated only when spoken utterances are recognized by the room as belonging to the surgeon's voice. to achieve this goal, the users will be required to say the word "login"; the features extracted from this utterance are compared to prototype feature vectors using some distance measure, such as maximum likelihood [38]. for every participant in the or, a profile is created and matched against the existing profiles in order to assess the level of permission that needs to be granted. the second goal involves the recognition of words and sentences using off-the-shelf voice recognition software, called "dragondictate", which can explicitly build continuous speech and context-free recognition grammars [39]. to transmit the speech to the recognition module, multiple microphones are used.
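the prototype-matching step can be sketched as nearest-prototype identification; euclidean distance stands in here for the unspecified distance measure, and the profile names and feature vectors are hypothetical:

```python
import numpy as np

def identify_speaker(utterance_features, profiles):
    """match an utterance's feature vector against the stored prototype
    feature vectors and return the closest profile. euclidean distance
    is an assumed stand-in for the paper's distance measure."""
    names = list(profiles)
    dists = [np.linalg.norm(np.asarray(utterance_features, float) -
                            np.asarray(profiles[n], float)) for n in names]
    return names[int(np.argmin(dists))]
```

the returned identity would then be mapped to the permission level granted to that participant.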
to keep the sterility requirements, the microphones are not attached to the doctors. instead, the microphones are set up in key locations on the or's ceiling. voice commands are used to evoke functions that are very difficult to map to hand gestures since there is no natural association between them. for example, to retrieve medical images of the patient being operated on, the surgeon can say the patient's name. to discriminate commands from environmental noise, which can mistakenly be recognized as a command, the user has to start the command by saying "system", shortly followed by the command to be carried out. this approach was suggested in [39] in the context of smart rooms, where the vision recognition and audio recognition modules are independent, and hence it fits the requirements of the operating room. environmental noise can still be too high and interfere with the interaction. we plan to test these scenarios in further work. 4 conclusions and future works this work presents one possible application for smart rooms: the intelligent, collaborative operating room. by monitoring the surgeon's activities while performing surgery, the system can collaborate with the surgeon by displaying relevant medical imaging information in a convenient location in the or. the or depends on the surgeon's body language as the essential key to understanding his focus of attention. one goal of the suggested approach is to use computer vision techniques to detect body postures and gaze to determine the focus of attention. smart projectors combined with computer graphics techniques are used to project medical imaging in front of the surgeon, based on the knowledge provided by the posture recognizer.
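the "system" prefix convention amounts to a trivial transcript filter; this sketch assumes the recognizer delivers plain-text transcripts:

```python
def parse_command(transcript):
    """accept an utterance only if it begins with the wake word "system";
    return the remainder as the command to carry out, or None if the
    utterance should be treated as background noise."""
    words = transcript.strip().split()
    if words and words[0].lower() == "system":
        return " ".join(words[1:])
    return None
```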
person detection with simultaneous human body posture recognition is achieved using parts-based models and a multiclass boosting approach: each body part is matched with the image captured by the camera, and each part votes for the center of the person. focus of attention is assessed by simultaneous detection of the surgeon's posture and orientation. detection and classification are possible since features of different human postures share subspaces, as opposed to the non-person class. one challenge here is the focus on postures that cannot be easily distinguished by their aspect ratio or silhouette, but rather require a bottom-up approach. parts-based detection does not require explicit models, nor the labeling of individual body parts. the detection and posture classifications are performed in a single pass over the image, and the strength of the recognition is proportional to the ensemble of votes from part patches. a vision-based system is used for interaction with the medical images. it recognizes the surgeon's gestures in real-time, which are used to browse, zoom and rotate the images projected on the wall in front of him. the system is user independent since it is calibrated using a multi-modal two-step procedure: first the user's id is recognized using a voice identification system, then cameras are used to model the gamut of colors of the surgeon's hand. camshift is used to track the hand, which allows dynamic hand gesture navigation control. the decision to use hand gestures as the main modality of interaction is based on previous work done by the author [25], where it was shown that hand gesture interaction is the modality preferred by the doctors in the or, due to their proficiency at using the hand as their main tool of work.
hand gestures offer the following benefits: (i) ease of use: surgeons are already quite proficient in their use of hands as a primary work tool, (ii) rapid reaction: hand gesture commands are intuitive and fast, (iii) unencumbered: the interface does not require the surgeon to be wired to any device, and (iv) sterility: non-contact interaction. issues related to the performance of the image processing algorithms under unfixed environments require further analysis and more robust vision algorithms. for example, in [25] we used only one frontal camera, while in the current research i implement four. it is not likely that the surgeon will be directly in front of any of these cameras. to correct for this, the image needs to be re-projected using the homography matrix found in the calibration process. still, some areas may remain occluded. in future work these challenges will be addressed, and a simulation framework is going to be adopted in order to quantitatively validate the hypotheses suggested in this work. surgeons must command a high level of cognitive and motor skills in order to complete their tasks successfully. previous research has shown that during the performance of a specific task, most gaze changes are related to the task-oriented visual search. distractions may have an adverse effect on task performance since they compete with the mental resources allocated for that task. i specifically address the problem of the distraction to the surgeon (or shift of focus) required for closer examination of medical imaging. i alleviate this situation by displaying the medical imaging in a convenient size right in front of the surgeon, automatically detecting his current orientation. in more general terms, i believe that hand gesture, body posture and voice recognition can be used to help an intelligent system understand the context of the work being performed in an operating room. intelligent rooms equipped with hand gesture, body posture and voice recognition capabilities can assist surgeons in the execution of time- or safety-critical tasks, while providing them with natural, intuitive and easy-to-use interaction modalities.
bibliography
[1] t.r. hansen, focus shift analysis in the operating theatre, 2004. source: http://www.pervasive-interaction.org/publications/focus_shift.pdf
[2] j. graydon, m. eysenck, distraction and cognitive performance, eur j cogn psychol, 1(2):161-179, 1989.
[3] v. weerdesteyn, a. schillings, g. van galen, d. duysens, distraction affects the performance of obstacle avoidance during walking, j motor behavior, 35(1):53-63, 2003.
[4] k. h. goodell, c. g. cao, s. d. schwaitzberg, effects of cognitive distraction on performance of laparoscopic surgical tasks, j laparoendosc adv surg tech a, 16(2):94, 2006.
[5] r. m. satava, disruptive visions: the operating room of the future, surgical endoscopy, vol. 17, no. 1, pp. 104-107, 2003.
[6] j. v. sutherland, w. j. van den heuvel, t. ganous, m. m. burton, a. kumar, towards an intelligent hospital environment: adaptive workflow in the or of the future, stud health technol inform, 118:278-312, 2005.
[7] p. wellner, the digitaldesk calculator: tangible manipulation on a desk top display, proceedings of uist'91, pp. 27-33, 1991.
[8] w. newman, p. wellner, a desk supporting computer based interaction with paper, proceedings of the conference on human factors in computing systems (chi'92), pp. 587-592, 1992.
[9] microsoft surface. available online: www.microsoft.com/surface/
[10] r. a. brooks, the intelligent room project, proceedings of the second international cognitive technology conference (ct'97), aizu, japan, august 1997.
[11] k. s. huang and m. m. trivedi, video arrays for real-time tracking of persons, head and face in an intelligent room, machine vision and applications, special issue, jun. 2003.
[12] m. nicolescu and g.
medioni, electronic pan-tilt-zoom: a solution for intelligent room systems, proc. icme'2000, 2000.
[13] j. c. terrillon, a. piplr, y. niwa, k. yamamoto, robust face detection and japanese sign language hand posture recognition for human-computer interaction in an "intelligent" room, in proc. int'l conf. vision interface, 2002.
[14] n. dimakis, j. k. soldatos, l. polymenakos, p. fleury, j. curin, j. kleindienst, integrated development of context-aware applications in smart spaces, ieee pervasive computing, 7(4), 71-79, 2008.
[15] t. skordas, g. metakides, major challenges in ambient intelligence, studies in informatics and control, vol. 12, no. 2, june 2003.
[16] j. p. tardif, s. roy, m. trudeau, multi-projectors for arbitrary surfaces without explicit calibration nor reconstruction, in proceedings of the fourth international conference on 3-d digital imaging and modeling (3dim 2003), pp. 217-224, 2003.
[17] a. griesser and l. v. gool, automatic interactive calibration of multi-projector-camera systems, proceedings of the 2006 conference on computer vision and pattern recognition workshop, 8, 2006.
[18] i. mitsugami, n. ukita, m. kidode, displaying a moving image by multiple steerable projectors, ieee international workshop on projector-camera systems (procams2007), 2007.
[19] z. jiang, y. mao, b. qin and b. zang, a high resolution video display system by seamlessly tiling multiple projectors, ieee international conference on multimedia and expo, 2007.
[20] e. bhasker, r. juang, a. majumder, registration techniques for using imperfect and partially calibrated devices in planar multi-projector displays, ieee transactions on visualization and computer graphics, 13:6, 1368-1375, 2007.
[21] j. summet, m. flagg, tj. cham, jm. rehg and r.
sukthankar, shadow elimination and blinding light suppression for interactive projected displays, ieee transactions on visualization and computer graphics, 13:3, 508-517, 2007.
[22] t. johnson, g. welch, h. fuchs, e. la force, h. towles, a distributed cooperative framework for continuous multi-projector pose estimation, in proceedings of ieee vr 2009.
[23] a. nishikawa, t. hosoi, k. koara, d. negoro, a. hikita, s. asano, h. kakutani, f. miyazaki, m. sekimoto, m. yasui, y. miyake, s. takiguchi, and m. monden, face mouse: a novel human-machine interface for controlling the position of a laparoscope, ieee trans. on robotics and automation, 19:5, 825-841, 2003.
[24] c. graetzel, t.w. fong, s. grange, and c. baur, a non-contact mouse for surgeon-computer interaction, technology and health care, 12:3, 245-257, 2004.
[25] j. wachs, h. stern, y. edan, m. gillam, c. feied, m. smith, and j. handler, a hand gesture sterile tool for browsing mri images in the or, journal of the american medical informatics association, 15(3), 2008.
[26] p. lukowicz, a. timm-giel, h. lawo, o. herzog, wearitwork: toward real-world industrial wearable computing, ieee pervasive computing, 6:4, 8-13, 2007.
[27] y. kuno, t. murashima, n. shimada, and y. shirai, intelligent wheelchair remotely controlled by interactive gestures, in proceedings of the 15th intl. conf. on pattern recognition, 4, 672-675, 2000.
[28] t. starner, j. auxier, d. ashbrook, and m. gandy, the gesture pendant: a self-illuminating, wearable, infrared computer vision system for home automation control and medical monitoring, in 4th intl. symposium on wearable computers, 87-94, 2000.
[29] d. a. becker and a. pentland, staying alive: a virtual reality visualization tool for cancer patients, proc. of the aaai'96 workshop on entertainment and alife/ai, 1996.
[30] r. boian, r. sharma, c. han, a. merians, g. burdea, s. adamovich, m. recce, m. tremaine, h.
poizner, virtual reality-based post-stroke hand rehabilitation, studies in health technology and informatics, 85, 64-70, 2002.
[31] m. ashdown and y. sato, steerable projector calibration, proceedings of the 2005 ieee computer society conference on computer vision and pattern recognition (cvpr'05), 2005.
[32] e. moschos, r. coleman, acquiring laparoscopic skill proficiency: does orientation matter?, american journal of obstetrics and gynecology, 191:5, 1782-1787, 2004.
[33] r. yang and g. welch, automatic and continuous projector display surface estimation using every-day imagery, 9th international conference in central europe on computer graphics, visualization and computer vision, 2001.
[34] k. murphy, a. torralba, d. eaton, w. t. freeman, object detection and localization using local and global features, lecture notes in computer science, sicily workshop on object recognition, 2005.
[35] j. wachs, d. goshorn and m. kölsch, recognizing human postures and poses in monocular still images, in proceedings of the international conference on image processing, computer vision, and signal processing (ipcv09), las vegas, nevada, july 2009.
[36] j. wachs, h. stern, y. edan, m. gillam, c. feied, m. smith, and j. handler, a real-time hand gesture interface for a medical image guided system, international journal of intelligent computing in medical sciences and image processing, 1:3:1, 175-185, 2008.
[37] g. r. bradski, computer vision face tracking for use in a perceptual user interface, intel technical journal, 1-15, 1998.
[38] s. t. shivappa, m. m. trivedi, and b. d. rao, hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms, ieee cvpr 2009, 2009.
[39] m. coen, design principles for intelligent environments, in proc. of aaai, madison, wisconsin, 1998.
juan wachs is an assistant professor at purdue university in the school of industrial engineering.
he completed postdoctoral training at the naval postgraduate school's moves institute in the area of computer vision. his research interests are machine and computer vision, robotics, tele-operations, human factors, assistive technologies and health support systems. juan wachs is a member of ieee and the operations research society of israel (orsis). he has published in journals including ieee trans. systems, man, and cybernetics, journal of the american medical informatics association, journal of image and graphics, and the international journal of semantic computing. he has been a visiting professor at the buenos aires university, argentina, in the area of image processing. he has taught digital electronics, multimedia and web development at ort and intel colleges, israel. he received his b.ed.tech in electrical education from the ort academic college in jerusalem, and his m.sc and ph.d in industrial engineering and management, information systems and intelligent systems tracks, respectively, from the ben-gurion university of the negev.
international journal of computers, communications & control vol. i (2006), no. 1, pp. 17-24
novel features for off-line signature verification
banshider majhi, y santhosh reddy, d prasanna babu
abstract: in this paper a novel feature extraction scheme is suggested for off-line signature verification. the proposed method uses geometric centers for feature extraction, and a euclidean distance model is used for classification. this classifier is well suited to the extracted features and fast in computation. the method proposed in this paper leads to better results than existing off-line signature verification methods. threshold selection is based on statistical parameters such as the average and the standard deviation (σ). keywords: feature extraction, geometric center, euclidean distance model, standard deviation and off-line signature verification. 1 introduction signature verification is an important research area in the field of person authentication.
we can generally distinguish between two different categories of verification systems: online, for which the signature signal is captured during the writing process, thus making the dynamic information available, and offline, for which the signature is captured once the writing process is over and, thus, only a static image is available[8]. the objective of a signature verification system is to discriminate between two classes: the original and the forgery, which are related to intra- and inter-personal variability. the variation among signatures of the same person is called intra-personal variation; the variation between originals and forgeries is called inter-personal variation[7]. in this paper we concentrate on an offline verification system. up to now, many signature verification methods have been proposed based on different strategies, but no verification system has classified the near forgeries that are classified by this method. the main advantages of this algorithm are its efficiency and low computational complexity; for general-purpose applications like smart cards, a quick and efficient verification system is needed[2]. this method is based on the geometric center and the distribution of the signature strokes. section 2 discusses the feature extraction from the signature; the method is recursive and can be applied to the signature repeatedly. a lot of work has been done in the field of automatic off-line signature verification. while a large portion of the work is focused on random forgery detection, more efforts are still needed to address the problem of skilled forgery detection[6]. our method is, to our knowledge, the first verification system which separates some skilled forgeries from originals. this paper is organized in the following sections: section 1.1 describes the different types of forgeries, section 2 introduces the new feature extraction method, section 3 discusses classification based on the euclidean distance model, section 4 discusses threshold selection,
section 5 shows training, testing and results, and section 6 gives conclusions and further research directions. 1.1 types of forgeries there are three different types of forgeries to take into account. the first, known as random forgery, is written by a person who does not know the shape of the original signature. the second, called simple forgery, is represented by a signature sample written by a person who knows the shape of the original signature but has not practiced it much. the last type is skilled forgery, represented by a suitable imitation of the genuine signature model[3]. each type of forgery requires a different type of verification approach[4]. hybrid systems have also been developed[9]. fig. 1 shows the different types of forgeries and how much they vary from the original signature[5]. figure 1: (a) random forgery (b) simple forgery (c) skilled forgery (d) original signature. by using this method we can easily eliminate random and simple forgeries; some of the skilled forgeries are also eliminated. copyright © 2006 by ccc publications. figure 2: (a) before adjustment of signature (b) after adjustment of signature. 2 feature extraction the geometric features proposed in this paper are based on two sets of points in the two-dimensional plane. each set has six feature points which represent the stroke distribution of the signature pixels in the image. these twelve feature points are calculated as geometric centers[1]. vertical splitting and horizontal splitting are the two main steps to retrieve these feature points. vertical splitting is discussed in section 2.2 and horizontal splitting is discussed in section 2.3. before finding the feature points we have to make some adjustments to the signature image, that is, move the signature strokes to the center of the image, as discussed in section 2.1. 2.1 moving signature to the center of image in this step signatures are moved to the center of the image.
because of this we can reduce intra-personal variations. first we find the geometric center of the image and move the signature pixels such that the geometric center resides at the center of the image. fig. 2 shows the signature images before and after moving. 2.2 feature points based on vertical splitting six feature points are retrieved based on vertical splitting. here the feature points are nothing but geometric centers. the procedure for finding feature points by vertical splitting is given in the algorithm below. algorithm: this is the procedure for generating feature points based on vertical splitting. input: static signature image after moving the signature to the center of the image. output: v1, v2, v3, v4, v5, v6 (feature points). (a) split the image with a vertical line at the center of the image; this gives the left and right parts of the image. (b) find the geometric centers v1 and v2 of the left and right parts, correspondingly. (c) split the left part with a horizontal line at v1 and find the geometric centers v3 and v4 of the top and bottom parts of the left part, correspondingly. (d) split the right part with a horizontal line at v2 and find the geometric centers v5 and v6 of the top and bottom parts of the right part, correspondingly. fig. 3 shows the feature points retrieved from a signature image, where o is the center of the image. these features have to be calculated for every signature image in both training and testing. 2.3 feature points based on horizontal splitting six feature points are retrieved based on horizontal splitting. here the feature points are nothing but geometric centers. the procedure for finding feature points by horizontal splitting is given in the algorithm below. figure 3: feature points based on vertical splitting. figure 4: feature points based on horizontal splitting. algorithm: this is the procedure for generating feature points based on horizontal splitting.
input: static signature image after moving the signature to the center of the image. output: h1, h2, h3, h4, h5, h6 (feature points). (a) split the image with a horizontal line at the center of the image; this gives the top and bottom parts of the image. (b) find the geometric centers h1 and h2 of the top and bottom parts, correspondingly. (c) split the top part with a vertical line at h1 and find the geometric centers h3 and h4 of the left and right parts of the top part, correspondingly. (d) split the bottom part with a vertical line at h2 and find the geometric centers h5 and h6 of the left and right parts of the bottom part, correspondingly. fig. 4 shows the feature points retrieved from a signature image, where o is the center of the image. these features have to be calculated for every signature image in both training and testing. now, in total, twelve feature points (v1, ..., v6 and h1, ..., h6) are calculated by the vertical and horizontal splittings. in section 4 we will see how each feature point is used for classification. 3 classification in this paper the features are based on geometric properties, so we use the euclidean distance model for classification. this is the simple distance between a pair of vectors of size n; here the vectors are feature points, so the size of each vector is 2. how to calculate the distance using the euclidean distance model is described in section 3.1. these distances are used in the threshold calculation. 3.1 euclidean distance model let a(a1, a2, ..., an) and b(b1, b2, ..., bn) be two vectors of size n. we can calculate the distance d using equation 1. d = √( ∑_{i=1}^{n} (a_i − b_i)² ) (1) in our application, the vectors are points on a plane, so d is the simple distance between two points. 4 threshold there are individual thresholds for vertical splitting and horizontal splitting. here we propose a method for threshold selection which is used in section 5.1. fig. 5 shows the variations in single corresponding feature points of the training signatures.
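the recursive splitting procedures of sections 2.2 and 2.3 can be sketched with numpy; coordinates are kept local to each sub-image for simplicity, which is an assumption, as the paper does not state its coordinate convention:

```python
import numpy as np

def geometric_center(img):
    """geometric center (column, row) of the signature, taken as the
    mean coordinate of the non-zero (ink) pixels."""
    rows, cols = np.nonzero(img)
    return int(cols.mean()), int(rows.mean())

def split_features(img, vertical=True):
    """feature points v1..v6 (vertical=True) or h1..h6 (vertical=False):
    split the image at its midline, take the geometric center of each
    half, then split each half again at its own center."""
    h, w = img.shape
    if vertical:
        first, second = img[:, : w // 2], img[:, w // 2:]
    else:
        first, second = img[: h // 2, :], img[h // 2:, :]
    p1, p2 = geometric_center(first), geometric_center(second)
    feats = [p1, p2]
    for half, (cx, cy) in ((first, p1), (second, p2)):
        if vertical:   # split each half horizontally at its center's row
            feats += [geometric_center(half[:cy, :]),
                      geometric_center(half[cy:, :])]
        else:          # split each half vertically at its center's column
            feats += [geometric_center(half[:, :cx]),
                      geometric_center(half[:, cx:])]
    return feats
```

calling `split_features` twice (once per orientation) yields the twelve feature points used below.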
let n be the number of training signatures and x1, x2, ..., xn the corresponding single feature points of the training signatures (taking one corresponding feature point from each signature). x_median is the median of the n features from the n signatures. let d1, ..., dn be the distances defined as d1 = distance(x_median, x1), d2 = distance(x_median, x2), ..., dn = distance(x_median, xn) (2). the two main parameters used in the threshold calculation are d_avg and σ. equations 3 and 4 show the calculation of these two parameters. figure 5: d_avg (average distance) and σ (standard deviation) derived from the distances. d_avg = average(d1, d2, ..., dn) (3) σ = sd(d1, d2, ..., dn) (4) in this way, an average distance (d_avg) and a standard deviation (σ) are obtained for each of the six feature points of both the vertical and horizontal splittings. equation 5 shows the main formula for the threshold. threshold t = √( ∑_{i=1}^{6} (d_avg,i + σ_i)² ) (5) 5 experiments & results for the experiment we took 30 original signatures from each person and selected 9 for training. these original signatures were collected on different days. forgeries were produced by three persons, 10 from each. in total, 21 originals and 30 forgeries per person were tested. there are two thresholds (one based on vertical splitting and another based on horizontal splitting) for each person's signature. 5.1 training let n signatures be taken for training from each person. there are 12 feature points from each original signature: 6 are taken by vertical splitting (section 2.2) and 6 by horizontal splitting (section 2.3). individual thresholds and patterns are calculated for vertical splitting and horizontal splitting. the pattern points based on vertical splitting are shown below.
vpattern,1 = median(v1,1, v2,1, ..., vn,1) vpattern,2 = median(v1,2, v2,2, ..., vn,2) vpattern,3 = median(v1,3, v2,3, ..., vn,3) vpattern,4 = median(v1,4, v2,4, ..., vn,4) vpattern,5 = median(v1,5, v2,5, ..., vn,5) vpattern,6 = median(v1,6, v2,6, ..., vn,6) (6) where vi,1, vi,2, ..., vi,6 are the vertical splitting features of the i-th training signature sample. the threshold based on vertical splitting is shown below. vthreshold = √( ∑_{i=1}^{6} (vd_avg,i + σ_v,i)² ) (7) in equation 7, vd_avg,i is the average distance and σ_v,i is the standard deviation as defined in section 4. the pattern points based on horizontal splitting are shown below. hpattern,1 = median(h1,1, h2,1, ..., hn,1) hpattern,2 = median(h1,2, h2,2, ..., hn,2) hpattern,3 = median(h1,3, h2,3, ..., hn,3) hpattern,4 = median(h1,4, h2,4, ..., hn,4) hpattern,5 = median(h1,5, h2,5, ..., hn,5) hpattern,6 = median(h1,6, h2,6, ..., hn,6) (8) where hi,1, hi,2, ..., hi,6 are the horizontal splitting features of the i-th training signature sample. the threshold based on horizontal splitting is shown below. hthreshold = √( ∑_{i=1}^{6} (hd_avg,i + σ_h,i)² ) (9) we store the pattern points and thresholds of both the horizontal splitting and the vertical splitting; these values are used in testing. 5.2 testing when a new signature comes for testing we have to calculate its vertical splitting and horizontal splitting features. the feature points based on vertical splitting are vnew,1, vnew,2, vnew,3, vnew,4, vnew,5, vnew,6. the distances between the new signature features and the pattern feature points based on vertical splitting are shown below. vdnew,1 = distance(vpattern,1, vnew,1) vdnew,2 = distance(vpattern,2, vnew,2) vdnew,3 = distance(vpattern,3, vnew,3) vdnew,4 = distance(vpattern,4, vnew,4) vdnew,5 = distance(vpattern,5, vnew,5) vdnew,6 = distance(vpattern,6, vnew,6) (10) for classification of the new signature we calculate vdistance and compare it with vthreshold.
if vdistance is less than or equal to vthreshold then the new signature is acceptable by vertical splitting. vdistance = √( ∑_{i=1}^{6} vd²_{new,i} ) (11) the feature points based on horizontal splitting are hnew,1, hnew,2, hnew,3, hnew,4, hnew,5, hnew,6. the distances between the new signature features and the pattern feature points based on horizontal splitting are shown below. hdnew,1 = distance(hpattern,1, hnew,1) hdnew,2 = distance(hpattern,2, hnew,2) hdnew,3 = distance(hpattern,3, hnew,3) hdnew,4 = distance(hpattern,4, hnew,4) hdnew,5 = distance(hpattern,5, hnew,5) hdnew,6 = distance(hpattern,6, hnew,6) (12) for classification of the new signature we calculate hdistance and compare it with hthreshold. if hdistance is less than or equal to hthreshold then the new signature is acceptable by horizontal splitting. hdistance = √( ∑_{i=1}^{6} hd²_{new,i} ) (13) a new signature has to satisfy both the vertical splitting and the horizontal splitting thresholds. 5.3 results false acceptance rate (far) and false rejection rate (frr) are the two parameters used for measuring the performance of any signature verification method. far is calculated by equation 14 and frr by equation 15. far = (number of forgeries accepted / number of forgeries tested) × 100 (14) frr = (number of originals rejected / number of originals tested) × 100 (15) table 1 shows the false acceptance rate of our method for the different types of forgeries. table 2 shows the false rejection rate for original signatures.
table 1: false acceptance rate (far)
forgery type        far(%)
random forgeries     2.08
simple forgeries     9.75
skilled forgeries   16.36
in general there are different thresholds for the detection of different types of forgery, but here the threshold is the same for random, simple and skilled forgeries, because this method mainly eliminates random and simple forgeries.
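putting sections 3-5 together, training (eqs. 6-9), the acceptance test (eqs. 10-13) and the far/frr metrics (eqs. 14-15) can be sketched as follows, with feature extraction left out; function names are illustrative:

```python
import numpy as np

def distance(a, b):
    """euclidean distance between two feature points, eq. (1)."""
    return float(np.hypot(a[0] - b[0], a[1] - b[1]))

def train(feature_sets):
    """from n training samples (each a list of 6 feature points) compute
    the pattern points (coordinate-wise medians, eqs. 6/8) and the
    threshold sqrt(sum_i (d_avg,i + sigma_i)^2), eqs. (7)/(9)."""
    arr = np.asarray(feature_sets, dtype=float)      # shape (n, 6, 2)
    pattern = np.median(arr, axis=0)                 # shape (6, 2)
    terms = []
    for i in range(6):
        d = [distance(pattern[i], s) for s in arr[:, i]]
        terms.append(np.mean(d) + np.std(d))
    return pattern, float(np.sqrt(np.sum(np.square(terms))))

def accept(new_points, pattern, threshold):
    """eqs. (10)-(13): accept if the combined distance between the test
    signature's 6 feature points and the pattern points is <= threshold."""
    d = np.sqrt(sum(distance(p, q) ** 2 for p, q in zip(new_points, pattern)))
    return d <= threshold

def verify(v_new, v_train, h_new, h_train):
    """a test signature must satisfy both the vertical- and the
    horizontal-splitting thresholds; each *_train is (pattern, threshold)."""
    return accept(v_new, *v_train) and accept(h_new, *h_train)

def far(forgeries_accepted, forgeries_tested):
    """false acceptance rate, eq. (14)."""
    return 100.0 * forgeries_accepted / forgeries_tested

def frr(originals_rejected, originals_tested):
    """false rejection rate, eq. (15)."""
    return 100.0 * originals_rejected / originals_tested
```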
Table 2: False rejection rate (FRR)

Signature            FRR (%)
Original signatures  14.58

6 Conclusion

This method performs much better than other off-line signature verification methods. A future direction is to classify skilled forgeries correctly as well; for this, a novel classification method must be devised. For better classification we can again split the sub-parts of Fig. 3 using vertical splitting and those of Fig. 4 using horizontal splitting. Then, instead of six feature points, we obtain 24 feature points for each of the vertical and horizontal splittings. Fig. 6 shows the vertical splitting of depth 2, and Fig. 7 shows the horizontal splitting of depth 2.

Figure 6: Feature points based on vertical splitting of depth 2

Figure 7: Feature points based on horizontal splitting of depth 2

References

[1] J.J. Brault and R. Plamondon, "Segmenting handwritten signatures at their perceptually important points", IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 15, No. 9, pp. 953-957, Sept. 1993.
[2] E.J.R. Justino, F. Bortolozzi and R. Sabourin, "A comparison of SVM and HMM classifiers in the off-line signature verification", Pattern Recognition Letters 26, pp. 1377-1385, 2005.
[3] E.J.R. Justino, F. Bortolozzi and R. Sabourin, "An off-line signature verification using HMM for random, simple and skilled forgeries", Sixth International Conference on Document Analysis and Recognition, pp. 1031-1034, Sept. 2001.
[4] E.J.R. Justino, F. Bortolozzi and R. Sabourin, "The interpersonal and intrapersonal variability influences on off-line signature verification using HMM", Proc. XV Brazilian Symp. Computer Graphics and Image Processing, pp. 197-202, Oct. 2002.
[5] E.J.R. Justino, A. El Yacoubi, F. Bortolozzi and R. Sabourin, "An off-line signature verification system using HMM and graphometric features", DAS 2000, pp. 211-222, Dec. 2000.
[6] B. Fang, C.H. Leung, Y.Y.
Tang, K.W. Tse, P.C.K. Kwok and Y.K. Wong, "Off-line signature verification by the tracking of feature and stroke positions", Pattern Recognition 36, pp. 91-101, 2003.
[7] Miguel A. Ferrer, Jesus B. Alonso and Carlos M. Travieso, "Off-line geometric parameters for automatic signature verification using fixed-point arithmetic", IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 27, No. 6, June 2005.
[8] R. Plamondon and S.N. Srihari, "Online and off-line handwriting recognition: a comprehensive survey", IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, pp. 63-84, Jan. 2000.
[9] A. Zimmer and L.L. Ling, "A hybrid on/off line handwritten signature verification system", Seventh International Conference on Document Analysis and Recognition, Vol. 1, pp. 424-428, Aug. 2003.

Banshider Majhi, Y Santhosh Reddy, D Prasanna Babu
Department of CSE, NIT Rourkela, India 769008
E-mail: ysantosh@rediffmail.com

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 792-798

The Development of Students' Metacognitive Competences. A Case Study

Daniel Mara
"Lucian Blaga" University of Sibiu
E-mail: danielmara11@yahoo.com

Abstract: In the information society metacognitive competences are essential. Based on activities from the Instrumental Enrichment Program elaborated by Professor Reuven Feuerstein, we have designed a program for developing students' capacities of self-control, self-knowledge and intellectual learning strategies. The case study presents the formation of students' metacognitive competences at the "Lucian Blaga" University of Sibiu, "Hermann Oberth" Faculty of Engineering, Department of Computer Science. A web-based application has been developed in order to enable students to self-evaluate their metacognitive competences and to acquire self-regulatory abilities.
Keywords: metacognitive competences, Instrumental Enrichment Program, computer science, higher education, relational competences, motivation for didactic career, web-based application.

1 Introduction

Metacognitive skills are a must for students preparing for a career in the information/knowledge society, and essential for those who want to embrace a didactic career. Therefore the Department for Teaching Staff Training [8] has started a special training program consisting of two modules. The education plan for the first module consists of the following courses: Psychology of Education; Pedagogy 1 (Foundations of Pedagogy, Curriculum Theory and Methodology); Pedagogy 2 (Instruction Theory and Methodology, Evaluation Theory and Methodology); Specialty Didactics; Teaching Practice; optional courses; final evaluation (didactic portfolio) [9]. The second module includes the following courses: Curriculum Area Didactics; Class Management; Counseling and Vocational Guidance; Computer-Assisted Instruction; Psychology of Education; Optional I (one of four: Intercultural Education, Educational Politics, Contemporary Pedagogical Doctrines, Management of School Organization); Optional II (one of four: Psycho-pedagogy of Adults, Foundations of Special Psycho-pedagogy, Sociology of Education, Research Methodology in the Sciences of Education); final evaluation (project); and teaching probation (42 hours for those who did not teach between the attendance of the first module and enrollment in the second).

2 Developing Students' Metacognitive Competences

Metacognitive skills development is an important formative objective in the intellectual education of students, since reaching this level involves a route through effective education appropriate to each student in particular [6]. Metacognitive skills suppose that students are aware of their own cognitive activity, i.e.
their learning activity, and of self-adjustment mechanisms consisting in cognitive controls (rules, procedures, strategies).

2.1 The Background

The development of metacognitive skills goes in the same direction as the strategies used in developing cognition. The main steps in the formation and affirmation of metacognitive awareness are:

• affirmation of trust and intuition (A.H. Schoenfeld's model) in solving problems or tasks, based on knowledge and previous experience; in this step the trainer looks to identify in the student a sense of grasping tasks, intuition, a way of understanding and finding solutions that takes into account all possibilities;

• personal reflection on the knowledge involved; the student must become aware of the solutions found, the instruments used, her/his capacity for analysis and comparison, and the way to analyse the difficulties of methods previously used;

• self-awareness, or awareness of effective solutions, addressing the solving style and based on self-observation, analysis of results and of the ways of solving, progress and cognitive acting.

The development stages show that metacognitive skills are associated with the management and construction of knowledge, and with the conditions in which knowledge appears. The managerial approach to cognition reveals that metacognition includes: awareness of how to understand the problem and how to solve it, planning the processes and finding the necessary pathways, and monitoring the application of solutions, the resources used, the constraints, the necessary instruments, the decisions and the analysis of results [4]. F.P. Büchel considers that the training of metacognitive competences is more efficient when working in groups, in a climate of cooperation and confrontation, because there is the possibility of mutual evaluation.
In self-training, the individual student is more concerned with solving the problem itself and with the acquisition of knowledge, and less concerned with understanding how knowledge is acquired, how solutions were found or how decisions were taken. Researchers have built a hierarchical model of criteria-assessment questions on classroom climate as a function of different elements: diversity of awareness, respect for others' style, commitment, encouragement, the student-teacher relationship, the student-group-class relationship, learning with pleasure, sense of humour, comfortable participation and freedom of expression. Studies that explore the effects of attitudes and emotions on learning indicate that stress and constant fear, at any age, can circumvent the brain's normal circuits. A person's physical and emotional well-being is closely linked to the ability to think and to learn effectively [1].

2.2 The Instrumental Enrichment Program

The Instrumental Enrichment Program is composed of a set of exercises divided into 14 instruments that are used as means for developing mental capacities. The exercises do not concern the acquisition of specific knowledge, but the acquisition of mental skills and of ways to use concepts in different situations [2]. Each instrument is focused on specific cognitive functions and provides means for developing the cognitive capacities necessary for solving tasks that require a high level of abstraction. The Instrumental Enrichment Program components are:

• organization of points;
• spatial orientation I;
• comparisons;
• analytical perception;
• pictures;
• spatial orientation II;
• classification;
• temporal relations;
• instructions;
• family relations;
• numerical progression;
• syllogism;
• transferable relations;
• outlines.

The exercises have images and temporal relations that are organized differently and provide a gradual increase in difficulty.
In this way the student is encouraged towards a progressive acquisition of the skills necessary to solve problems or tasks, thus strengthening intrinsic motivation, the feeling of competence and autonomy in organizing work. Subjects become aware of the importance of, and need for, discussion about the work done and for making transfers on the basis of principles/rules/patterns formulated during the activity. The development of principles/rules/patterns and the implementation of transfers are very important elements. All the details of the page of an exercise must be caught and analyzed, and a synthesizing valid principle must be identified and expressed in a concise sentence. A principle is important because it can highlight a complex problem, newly learned information, or an element necessary to solve the exercise. The transfer is created as a link between the principles/patterns/rules resulting from the reflection necessary for understanding and addressing new events [3]. During an activity, two or more instruments are used in order to avoid the monotony of using the same type of exercise for a long period, or the feeling of failure resulting from difficulties in solving an exercise. Students are led to use different instruments and learn to choose the right ones. An activity is made up of elements called "pages". A page contains a story, illustrated by images. Each instrument begins with a picture page (cover or homepage), which is used for situating the instrument, creating a horizon for motivation and development through the following pages. Any learning (instructional) form may be tackled from the point of view of general systems theory, distance learning forms included. A system is defined by a set of elements that interact and work together in order to achieve an objective [7, 8]. Cover pages have certain features that remain unchanged from instrument to instrument, to highlight the continuity of the work, but each instrument is different from the others.
The mediator/trainer orients subjects to consider the symbol on the cover, in order to deduce the exercises they will solve and the issues they will discuss.

2.3 Organizing an Activity of the Instrumental Enrichment Program

An activity of the Instrumental Enrichment Program is organized respecting some rules and some key moments: the introduction, individual work, discussion and conclusions [5]. In the following I will briefly present the key moments and the rules to follow.

Introduction. In this phase the mediator wants to awaken the group's interest in the work that will be developed and to define the problems they will have to solve. The introduction begins with revision, i.e. data from previous lessons. The mediator shall ensure that requirements and concepts were well understood, and that the vocabulary necessary to solve the task is assimilated. Students will learn to analyze the page autonomously. The trainer guides students in observing and identifying objectives [10].

Individual work. In this stage students are asked to solve an individual task, after which they are involved in a discussion aimed at highlighting possible strategies for solving the exercises on the page. Students must understand that it is not important to finish the exercises of a page quickly; it is important to understand how to solve a task and how certain abilities are formed and developed. An activity based on reflection, even if not fully effective, may be more useful and more fruitful than one done in a hurry, because it is based on a deeper analysis of the processes involved.

Discussion. When most students have completed the individual task the trainer may start the discussion stage.
Being particularly interested in the mental processes that led to finding the solution, it is appropriate to insist on correct answers and to explain the wrong ones, in order to understand the mental processes through which solutions were found [12]. At first it is recommended that the mediator identifies the link between the work and other applied situations; then the students will gradually create these connections between the instruments and the surrounding reality themselves. Each transfer is built on a solid and appropriate explanation of the type of connection between the examples and the proposed developments.

Conclusion. At the end of each lesson there should be a revision of the whole activity. Even if it is short, it should highlight the steps taken to achieve the objective, the new words acquired, and the targets and strategies set out for achieving the aim of the lesson. It is possible to encourage valorisation activities to determine, individually or in small groups, the utility obtained by applying different instruments.

3 The Case Study

The activities for developing the students' metacognitive skills were conducted by applying the instruments of the Instrumental Enrichment Program developed by Reuven Feuerstein to a group of 75 students from the "Hermann Oberth" Faculty of Engineering, Computer Science section, of the "Lucian Blaga" University. The guiding principles were: one event cannot be observed by itself; it has to be seen in the whole context, before and after. We have to make a difference between open-eyed dreams and reality, between what is possible and impossible. We have to be aware of our goals, of their importance and of the risks they imply. The 75 students were enrolled in a training program aimed at developing their metacognitive skills. One group (40) worked only in the classroom, and one group (35) also used the web-based application designed to support them in developing metacognitive competences.
The web-based application prepares the student for the training program, making students familiar with the kind of exercises used in the Feuerstein program. At the end of the training program, the overall scores of the students who had also used the web-based application were significantly higher than the scores of the group that worked only in the classroom. Gender did not have a significant effect on students' perception of their metacognitive skills. A rather uncomfortable conclusion is that more than 75% of the students (with no significant gender differences) have difficulties in expressing their thoughts and experience in words. The group that had pre-training with the Cogitino web-based application was significantly faster in solving different tasks, but had the same difficulties in expressing their thoughts and experience in words as the control group.

4 The Web Application

In order to help students better understand their thoughts and experiences, a support software, Cogitino, has been designed and implemented. Cogitino is a web-based application (Fig. 1) that offers a set of resources concerning metacognitive skills and acts as an adviser for the student enrolled in the metacognitive skills development training program. Cogitino is a multi-agent system (Fig. 2) that, through its profiler agent, determines the student's level of metacognitive competences and then recommends different training paths (Fig. 3). The system rates students' metacognitive competences based on the answers to several questionnaires and on problem results. Before applying the Instrumental Enrichment Program, students are asked to solve several problems similar to those that they will have to solve during the class. Principles of e-learning applications have been observed [1, 7].

Figure 1: The home page of the Cogitino application

Figure 3: Cogitino training resources
Figure 2: Cogitino general structure

5 Conclusions

Metacognitive skills are mandatory for today's students. They must be aware of and know their mental processes, and they must be able to self-monitor, regulate, and direct their actions towards their global aim. Metacognitive training is also becoming an important and basic tool for efficiency, skill and competences in business and management. The research carried out provides evidence of at least three essential aspects in the development of students' metacognitive competences. First, students balance their attention between the preparation, implementation and evaluation of the educational and training process itself, but their qualitative analysis is poor; they have a reduced vocabulary and therefore difficulty in explaining their experience and performance. Secondly, being enrolled in a technical program, they feel, at least at the beginning of the training program, that they do not need to express themselves in words. And last but not least, the lack of general culture is an obstacle in understanding some of the tasks and problems they have been asked to solve. Metacognition components are usually observed only in the final stage of evaluation. Another conclusion is that the web-based application has been appreciated by students as very helpful. Considering this aspect, and that in educational practice metacognition principles can be developed and applied efficiently by students following a training program that includes theoretical aspects and practical applications, my future work will consist in enriching Cogitino (with the help of colleagues from the computer science department of our university) with two new modules: one module that will automatically generate explanations after a task has been solved, showing the student how s/he proceeded, and another that will serve as vocabulary training for the student.
Acknowledgment

This work was supported by CNCSIS-UEFISCSU, project number 882/19/01/2009 PNII IDEI, code 471/2008.

Bibliography

[1] Chen, Z., Learning about learners: system learning in virtual learning environment, International Journal of Computers, Communications & Control, Vol. III (2008), No. 1, pp. 33-40.
[2] Feuerstein, R., Rand, J., Rynders, J.E., Non accettarmi come sono, Sansoni, Milano, Italy, 1995.
[3] Feuerstein, R., Rand, Y., Hoffman, M.B., Miller, R., Instrumental Enrichment: An Intervention Program for Cognitive Modifiability, University Park Press, Baltimore, USA, 1980.
[4] Joiţa, E., Educaţia cognitivă. Fundamente. Metodologie, Editura Polirom, Iaşi, Romania, 2002.
[5] Kopciowski Camerini, J., L'apprendimento mediato. Orientamenti teorici ed esperienze pratiche del metodo Feuerstein, La Scuola, Brescia, Italy, 2002.
[6] Miclea, M., Psihologie cognitivă. Modele teoretico-experimentale, Editura Polirom, Iaşi, Romania, 1999.
[7] Moise, G., A formal description of the systemic theory based e-learning, International Journal of Computers, Communications & Control, Vol. III (2008), No. 1, pp. 90-102.
[8] Moisil, I., A model of the student behavior in a virtual educational environment, International Journal of Computers, Communications & Control, Vol. III (2008), Suppl. Issue: Proceedings of ICCCC 2008, pp. 108-115.
[9] Neculau, A. (coord.), Psihologie socială. Aspecte contemporane, Editura Polirom, Iaşi, 1996.
[10] Neculau, A. (coord.), Universitatea: valorile şi actorii săi, in "Câmpul universitar şi actorii săi", A. Neculau (coord.), Editura Polirom, Iaşi, 1997.
[11] Vanini, P., Potenziare la mente? Una scommessa possibile: l'apprendimento mediato secondo il metodo Feuerstein, Vannini Editrice, Gussago (Brescia), Italy, 2003.
[12] Vanini, P., Il metodo Feuerstein: una strada per lo sviluppo del pensiero, IRRSAE Emilia Romagna, Bologna, Italy, 2001.

Int. J.
of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 2, pp. 158-166

Task Resource Allocation in Grid Using Swift Scheduler

K. Somasundaram, S. Radhakrishnan
Arulmigu Kalasalingam College of Engineering
Department of Computer Science and Engineering
Krishnankoil-626190, Tamilnadu, India
E-mail: soms72@yahoo.com

Abstract: Grid computing is, in nature, the combination of parallel and distributed computing, where running computationally intensive applications such as sequence alignment, weather forecasting, etc. needs a proficient scheduler to solve the problems very fast. Most grid tasks are scheduled based on First Come First Served (FCFS), FCFS with advance reservation, Shortest Job First (SJF), etc. But these traditional algorithms consume more computational time due to the soaring waiting time of jobs in the job queue. In grid scheduling, resource selection is NP-complete. To overcome this problem, we propose a new dynamic scheduling algorithm, called Swift Scheduler, which combines a heuristic search algorithm with the traditional SJF algorithm. The proposed algorithm takes care of the job's memory and CPU requirements along with the priority of jobs and resources. Our experimental results show that our scheduler reduces the average waiting time in the job queue and reduces the overall computational time.

Keywords: grid computing, Swift Scheduler, dynamic scheduling algorithm, First Come First Served, Shortest Job First.

1 Introduction

A computational grid is a hardware and software infrastructure that provides dependable, consistent, pervasive, and inexpensive access to high-end computational capabilities. According to function, grids are classified into three types: computing grids, data grids, and service grids. A computing grid is used to connect varied computing resources on the network to construct a virtual high-performance computer [1].
Traditional computing grid systems involve many technologies such as certification, task scheduling, communication protocols, fault tolerance and so on. The task of a grid resource broker and scheduler is to dynamically identify and characterize the available resources, and to select and allocate the most appropriate resources for a given job [2]. The resources are typically heterogeneous, locally administered, and accessible under different local policies. Advance reservation [3] is currently being added to the Portable Batch System (PBS). In a grid scheduler, mapping grid resources to an independent job in an optimized manner is hard, since we cannot predict the optimal mapping. So the combination of uninformed search and informed search can provide a good optimal solution for mapping resources and jobs, providing minimal turnaround time with minimal cost and minimizing the average waiting time of the jobs in the queue. A heuristic algorithm is an algorithm that ignores whether the solution to the problem can be proven to be correct, but which usually produces a good solution. Heuristics are typically used when there is no known way to find an optimal solution, or when it is desirable to give up finding the optimal solution for an improvement in run time.

The primary objective of this research is to investigate effective resource allocation techniques based on computational economy through simulation. We would like to simulate millions of resources and thousands of users with varied requirements, and to study the scalability of systems and algorithms, the efficiency of resource allocation policies, and the satisfaction of users. In our simulation we model applications in the areas of biotechnology, astrophysics, network design, and high-energy physics in order to study the usefulness of our resource allocation techniques.
The results of our work will have a significant impact on the way resource allocation is performed for solving problems on grid computing systems. The organization of this paper is as follows. In Section 2, related work is discussed. In Section 3, we introduce our scheduling algorithm model. In Section 4 we present and discuss the experimental results. We conclude this study in Section 5.

2 Related Work

Job scheduling in parallel systems has been extensively researched in the past. Typically this research has focused on allocating a single resource type (e.g., CPU usage) to jobs in the ready queue. The use of many of these scheduling algorithms has been limited due to restrictions in application designs, the runtime system, or the job management system itself. Therefore simple allocation schemes such as First Come First Served (FCFS) or FCFS with First Fit backfill (FCFS/FF) are used in practice [4]. Current job scheduling practices typically support variable resource allocation to a job, and run-to-completion scheduling. Scheduling policies are also heavily based on first-come-first-served (FCFS) methods [5]. An FCFS scheduling algorithm allocates resources to jobs in the order in which they arrive. The FCFS algorithm schedules the next job in the ready queue as soon as sufficient system resources become available to meet all of the job's requirements. The advantage is that this provides a level of determinism on the waiting time of each job [6]. The disadvantage of FCFS shows up when the jobs at the head of the ready queue cannot be scheduled immediately due to insufficient system resources, while jobs further down the queue would be able to execute given the currently available system resources. These latter jobs are essentially blocked from executing while system resources remain idle. Fidanova [7] compared the simulated annealing approach with the ant algorithm for scheduling jobs in grids.
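The waiting-time penalty of FCFS that motivates SJF-style scheduling can be seen in a minimal single-resource sketch (hypothetical service times; this is not the paper's simulation):

```python
def avg_waiting_time(service_times):
    """Non-preemptive, run-to-completion schedule on one resource:
    each job waits for the total service time of the jobs before it."""
    waited, elapsed = 0.0, 0.0
    for s in service_times:
        waited += elapsed
        elapsed += s
    return waited / len(service_times)

arrival_order = [8.0, 4.0, 1.0]                  # FCFS serves in arrival order
fcfs = avg_waiting_time(arrival_order)           # waits 0, 8, 12 -> avg 6.67
sjf = avg_waiting_time(sorted(arrival_order))    # waits 0, 1, 5  -> avg 2.0
```

A long job at the head of the queue inflates every later job's wait under FCFS; serving shortest jobs first removes that effect, at the price of possible starvation of long jobs.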
David Beasley, Marek Mika and Grzegorz Waligora [8] formulated the scheduling problem as a linear programming problem and proposed a local search meta-heuristic to schedule workflow jobs on a grid. Fair share scheduling [12] is compared with simple fair task order scheduling; adjusted fair task order scheduling and max-min fair share scheduling algorithms have been developed and tested against existing scheduling algorithms. Rafael A. Moreno [9] addresses the issues that the resource broker has to tackle, such as resource discovery and selection, job scheduling, job monitoring and migration, etc. Resource management systems [10, 11] were discussed, together with models of grid RMS availability that consider both the failures of resource management (RM) servers and the length limitation of request queues. The resource management system (RMS) can divide service tasks into execution blocks (EB) and send these blocks to different resources. To provide a desired level of service reliability, the RMS assigns the same EB to several independent resources for parallel (redundant) execution.

3 Swift Scheduler (SS) Model

Let $n$ be the number of jobs in the job queue $JQ$, which is denoted

$JQ = \{J_1, J_2, J_3, \ldots, J_n\}$ (1)

Jobs are allotted to $m$ resources in the resource queue $RQ$, which is denoted

$RQ = \{R_1, R_2, R_3, \ldots, R_m\}$ (2)

Let $f(J_i, R_j)$ be the overall completion time of the $i$-th job on the $j$-th resource, calculated as

$\sum_{i=1}^{n}\sum_{j=1}^{m} f(J_i, R_j) = \sum_{i=1}^{n}\sum_{j=1}^{m} g(J_i, R_j) + \sum_{i=1}^{n}\sum_{j=1}^{m} h(J_i, R_j)$ (3)

Let $g(J_i, R_j)$ be the expected completion time of the $i$-th job on the $j$-th resource, calculated as

$\sum_{i=1}^{n}\sum_{j=1}^{m} g(J_i, R_j) = \sum_{i=1}^{n}\sum_{j=1}^{m} (JL_i / RC_j)$ (4)

where $JL_i$ is the length of the $i$-th job and $RC_j$ is the capacity of the $j$-th resource.
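The expected completion time $g$ of Eq. (4) directly yields a greedy mapping rule: send each job to the resource with the smallest estimate. A minimal sketch under assumed names (the full Swift Scheduler additionally folds in the heuristic term $h$ with its communication overhead, defined next):

```python
def g(job_len, capacity):
    """Expected completion time of a job on a resource, Eq. (4):
    job length divided by resource capacity."""
    return job_len / capacity

def map_job(job_len, resources):
    """Greedy mapping: pick the resource with the smallest estimated
    completion time. `resources` maps resource name -> capacity."""
    return min(resources, key=lambda r: g(job_len, resources[r]))
```

For example, with `resources = {'R1': 2.0, 'R2': 4.0}` and a job of length 8, `map_job` returns `'R2'` (estimated completion time 2.0 versus 4.0 on R1).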
Let $h(J_i, R_j)$ be the heuristic function of the $i$-th job on the $j$-th resource, calculated as

$\sum_{i=1}^{n}\sum_{j=1}^{m} h(J_i, R_j) = \sum_{i=1}^{n}\sum_{j=1}^{m} (JL_i / RC_j) + \text{communication overhead}$ (5)

3.1 Working Principle of Swift Scheduler (SS)

Figure 1: Architecture of SS

Figure 1 shows the working principle and architecture of our proposed algorithm SS. Incoming jobs from different users are collected and stored in the job list, and available resources are stored in the resource list. Jobs arrive randomly in the job queue, and resources are selected based on availability. The Swift Scheduler in GridSim [13] maps jobs from the job queue to resources from the resource queue, where the resources are selected using the heuristic function. The function selects the optimal resource for a particular job, so that the job completes in minimum time.

3.2 Pseudo Code for Swift Scheduler (SS)

4 Performance Analysis

In this section, we analyze the performance of the Swift Scheduler against the existing Simple Fair Task Order (SFTO) scheduling on a large set of independent jobs of varying size and a large number of heterogeneous resources. We assume that the arrival rates of jobs follow a Poisson distribution. Figures 2 and 3 show the job allocations produced by SFTO and SS respectively, and Tables 1 and 2 show the arrival order of the jobs, the time at which each job starts execution on a particular resource, and its residing time on that resource, where the selection of resources is based on the respective algorithm. For example, in SFTO, jobs J0 and J3 are allotted to resource R1. The residing time (Tr) of job J0 is the sum of J0's waiting time (Tw) in the queue and its service time (Ts), i.e. Tr = Tw + Ts = 2820.03 ms + 61.35 ms = 2881.38 ms. Similarly, the residing time of job J3 is 1490 ms.
Job J2 is allotted to resource R2, where its residing time is 1257.19 ms, and jobs J4 and J1 are chosen by resource R3, where the residing time of J4 is 1886.86 ms and that of J1 is 3391.73 ms. The average waiting time of all jobs in the grid system is 2181.56 ms.

Table 1: Job, resource, start and residing time for SFTO

Job ID   Resource   Start time (ms)   Residing time (ms)
4        R3         1851.12           1886.86
1        R3         3337.99           3391.73
2        R2         1214.79           1257.19
3        R1         1330.66           1490.66
0        R1         2820.03           2881.38

Table 2: Job, resource, start and residing time for SS

Job ID   Resource   Start time (ms)   Residing time (ms)
2        R1         1210.02           1316.02
0        R1         2805.39           2980.39
3        R2         1093.04           1178.27
1        R2         2688.17           2772.17
4        R3         1453.51           1489.25

Figure 2: Job and resource allocation for SFTO

In SS, resource selection and job allocation are based on the heuristic search over SJF, which reduces the average waiting time of the jobs in the queue; thus the overall turnaround time is reduced and resource utilization is increased. For example, in SS, jobs J2 and J0 are allotted to resource R1, where their residing times are 1316.02 ms and 2980.39 ms respectively. Similarly, jobs J3, J1 and J4 are chosen by resources R2, R2 and R3 respectively, where the residing times of J3, J1 and J4 are 1178.27 ms, 2772.17 ms and 1489.25 ms respectively. The average waiting time of all jobs in the grid system is 1947.22 ms, which is less than for SFTO. The statistical data presented here are acquired by averaging the scheduling performance over different runs. The following figures show the cost-based, total-processing-time and resource-utilization comparison results of FCFS, SJF, SFTO and SS for varying numbers of jobs and resources. Our proposed Swift Scheduler (SS) is compared with FCFS, SJF and the Simple Fair Task Order (SFTO) scheduler. We have tested SS in GridSim by varying the number of resources and the no.
of jobs against total processing time, cost and resource utilization. The number of resources can be varied as 5, 10, 15, 20, and so on. For experimental purposes, two sample simulation results are shown in Figures 4-9. Analyzing the results obtained from the simulator, the Swift Scheduler completed all jobs with minimum time and cost while utilizing the maximum amount of resources, compared with the other schedulers FCFS, SJF and SFTO.

Figure 3: Job and resource allocation for SS

Total processing time analysis
Figure 4: No. of jobs vs. total processing time
Figure 5: No. of jobs vs. total processing time

Cost analysis
Figure 6: No. of jobs vs. total cost
Figure 7: No. of jobs vs. total cost

Resource utilization
Figure 8: No. of jobs vs. resource utilization
Figure 9: No. of jobs vs. resource utilization

5 Conclusion and Future Work

In this paper, we have presented the design and analysis of a new scheduling algorithm, the Swift Scheduler. Our proposed Swift Scheduler completes a task using highly utilized, low-cost resources with minimum computational time. The scheduling algorithm uses a heuristic function to select the best resources, achieving higher throughput while maintaining the desired success rate of job completion. The algorithm performs well for real-time job parameters and is suitable for different job sizes in a real environment. In all tested conditions, the proposed algorithm outperforms the traditional ones: the SS policy is more effective than FCFS, SJF and SFTO in terms of computational complexity, with lower cost and higher resource utilization. In future work, the Swift Scheduler can be hybridized with an evolutionary scheduling algorithm such as a genetic algorithm or particle swarm optimization to achieve even higher throughput and resource utilization.
Bibliography

[1] K. Ranganathan and I. Foster, "Decoupling Computation and Data Scheduling in Data Intensive Applications", 11th International Symposium on High Performance Distributed Computing, Edinburgh, Scotland, Condor Project, Condor-G, 2002.
[2] I. Mitrani and J. Palmer, "Dynamic Server Allocation in Heterogeneous Clusters", 1st International Working Conference on Heterogeneous Networks, Ilkley, UK, 2003.
[3] I. Foster et al., "The Grid2003 Production Grid: Principles and Practice", 13th International Symposium on High Performance Distributed Computing, 2004.
[4] V. Subramanian, R. Kettimuthu et al., "Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests", Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing (HPDC-11), pp. 359-366, 2002.
[5] C. Bitten, J. Gehring et al., "The NRW Metacomputer: Building Blocks for a Worldwide Computational Grid", Proceedings of the 9th Heterogeneous Computing Workshop, pp. 31-40, 2000.
[6] C. Ernemann, V. Hamscher et al., "On Advantages of Grid Computing for Parallel Job Scheduling", Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2002), Berlin, IEEE Press, 2002.
[7] S. Fidanova, "Simulated Annealing for Grid Scheduling Problem", IEEE John Vincent Atanasoff International Symposium on Modern Computing (JVA'06), pp. 41-45, 3-6 Oct. 2006.
[8] M. Mika, G. Waligora and J. Weglarz, "A Meta-heuristic Approach to Scheduling Workflow Jobs on a Grid", in Grid Resource Management: State of the Art and Future Trends, ISBN 1-4020-7575-8, Kluwer Academic Publishers, Norwell, MA, USA, pp. 295-318, 2004.
[9] R. Moreno and A.B. Alonso-Conde, "Job Scheduling and Resource Management Techniques in Dynamic Grid Environments", in F.F.R. et al. (eds.): Across Grids 2003, Lecture Notes in Computer Science, vol. 2970, Springer, pp. 25-32, 2004.
[10] Y.-S. Dai, M. Xie and K.-L. Poh, "Availability Modeling and Cost Optimization for the Grid Resource Management System", IEEE Transactions on Systems, Man and Cybernetics, Part A, vol. 38, no. 1, pp. 170-179, Jan. 2008.
[11] Y.-S. Dai and G. Levitin, "Optimal Resource Allocation for Maximizing Performance and Reliability in Tree-Structured Grid Services", IEEE Transactions on Reliability, vol. 56, no. 3, pp. 444-453, Sept. 2007.
[12] N.D. Doulamis, A.D. Doulamis, E.A. Varvarigos and T.A. Varvarigou, "Fair Scheduling Algorithms in Grids", IEEE Transactions on Parallel and Distributed Systems, vol. 18, no. 11, pp. 1630-1648, Nov. 2007.
[13] A. Sulistio, U. Cibej, S. Venugopal, B. Robic and R. Buyya, "A Toolkit for Modelling and Simulating Data Grids: An Extension to GridSim", Concurrency and Computation: Practice and Experience (CCPE), Online ISSN 1532-0634, Printed ISSN 1532-0626, 20(13): 1591-1609, Wiley Press, New York, USA, Sep. 2008.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 2, pp. 53-60

Importance of Flexibility in Manufacturing Systems

Héctor Kaschel C., Luis Manuel Sánchez y Bernal

Abstract: Flexibility refers to the ability of a manufacturing system to respond cost-effectively and rapidly to changing production needs and requirements. This capability is becoming increasingly important for the design and operation of manufacturing systems, as these systems operate in highly variable and unpredictable environments. It is important for system designers and managers to know the different levels of flexibility and/or to determine the amount of flexibility required to achieve a certain level of performance. This paper presents several representation schemes for product flexibility and discusses the usefulness and limitations of each.
Keywords: flexibility, fuzzy logic, disjunctive graph, temporal constraints network, complexity in manufacturing, nonlinear mathematical programming

1 Introduction

Present market demands require that manufacturing systems develop their activities in a dynamic and uncertain production environment (Calinescu et al., 2003). To understand the problems affecting flexible manufacturing systems (FMS), it is necessary to incorporate the concept of flexibility into scheduling (Deshmukh et al., 2002). Flexibility constitutes a strategic topic in decision making for giving quick and efficient answers to the demands of national and international markets (Pelaez and Ruiz, 2004). The scientific literature reports a wide variety of concepts for defining flexibility in FMS. For example, for Sethi and Sethi (1992), flexibility represents the capacity of an FMS to modify manufacturing resources to produce different products efficiently while maintaining acceptable quality. For Gupta (2004), it is necessary to distinguish between the flexibility inherent in the FMS and the flexibility required during the production cycle. The first is defined as the capacity of resources and human operators to implement the decisions of the factory company, while the required flexibility is defined by the changes in the environment surrounding the FMS and by the implementation of new manufacturing strategies. Starting from these differences, it is possible to determine the flexibility type and to measure the flexibility of the FMS quantitatively. For Benjaafar and Ramakrishman (1996), it is important to differentiate the types of flexibility in the FMS. They divide the flexibility of the FMS into product flexibility and process flexibility. The first refers to the variety of factory options for a certain product, while process flexibility is defined as the characteristic of an industrial process that allows it to operate under diverse dynamic operating conditions.
Both general flexibilities can be grouped hierarchically, as shown in Figure 1. The classification proposed by Benjaafar and Ramakrishman (1996) presents flexibility as a consequence of the physical and logical characteristics of the manufacturing system. To quantify the operative flexibility of job $i$, a function is defined starting from the sequence-dependent configuration time between operations:

$$\Phi(P_i) = -\frac{1}{n_i}\sum_{j=1}^{n_i}\sum_{l=1}^{k_{ij}} \left(\frac{1/s_{ijl}}{\sum_{l=1}^{k_{ij}} 1/s_{ijl}}\right) \log\left(\frac{1/s_{ijl}}{\sum_{l=1}^{k_{ij}} 1/s_{ijl}}\right) \qquad (1)$$

where:

Copyright © 2006 by CCC Publications

Figure 1: Classification of the FMS flexibility

• $s_{ijl}$ represents the configuration time of job $i$ when switching from operation $j$ to operation $l$.
• $n_i$ represents the total number of operations associated with job $i$.

In the present paper, different models described in several studies dealing with product flexibility in FMS are analyzed.

2 Product Flexibility

In every FMS, each job or group of jobs is associated with a certain scheduling, consisting of an ordered sequence of operations to be executed on a group of machines. An example of an FMS scheduling is illustrated in Figure 2(a). If we incorporate flexibility conditions into the scheduling (operational flexibility), an operation can be carried out on other machines; these, in general, can be different or can have different operational levels (Figure 2(b)). The ordering of the operations can also be varied (sequence flexibility), where some operations must be carried out in sequence while others have no such restriction. An example of this type of flexibility is shown in Figure 2(c), where operations 2 and 3 can be carried out in any order after executing operation 1. Finally, in a highly flexible FMS, a job can have a group of schedulings (processing flexibility).
This means that the group of operations required to manufacture a job can be variable, depending on the operating conditions of the FMS. For example, in Figure 2(d), operations 1 and 2 can be replaced by operation 5, while operation 3 can be divided into operations 6 and 7.

3 Analysis Models for Product Flexibility

Product flexibility constitutes an important aspect of the functionality of the FMS, where the variability in the scheduling and the operability of the machines allow several alternatives for processing the jobs. In what follows, some models for the analysis of product flexibility are presented.

Figure 2: Types of product flexibility

3.1 Disjunctive Graph

This is a directed-graph model whose nodes have a certain weight assigned. Mathematically, it is expressed as a 3-tuple (Benjaafar and Ramakrishman, 1996):

$$G = (N, A, E) \qquad (2)$$

where:

• $N$ is the group of nodes representing the operations. Two particular nodes, called initial and final, are defined in this model. The positive weight assigned to each node represents the processing time of the corresponding operation. The initial node is connected to the first operation of each job and, in the same manner, the last operation of each job is connected to the final node.
• $A$ is the group of conjunctive directed arcs that indicate the precedence restrictions among the operations of each job.
• $E$ is the group of disjunctive directed arcs that indicate the restrictions of machine operative capacity.

To illustrate the application of the disjunctive graph, an example of a scheduling of 4 different jobs (J1, J2, J3, J4) on three types of machines (M1, M2, M3) is presented.
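A disjunctive graph $G = (N, A, E)$ for this kind of example can be assembled programmatically. The sketch below builds the node set with processing-time weights, the conjunctive arcs (job precedence plus the initial/final nodes) and the disjunctive arcs (every pair of operations competing for one machine); the operation-to-machine data follows the 4-job, 3-machine example tabulated in the next passage.

```python
from itertools import combinations

# Per job: list of (machine, processing_time) per operation, taken from the
# example's machine assignment and processing-time table.
jobs = {
    "J1": [("M1", 5), ("M2", 8), ("M3", 2)],
    "J2": [("M3", 3), ("M1", 9), ("M2", 7)],
    "J3": [("M1", 1), ("M3", 10), ("M2", 7)],
    "J4": [("M2", 7), ("M3", 4), ("M1", 11)],
}

def disjunctive_graph(jobs):
    """Build G = (N, A, E) of Eq. (2) as plain dict/set structures."""
    N = {"start": 0, "end": 0}            # node -> processing-time weight
    A = set()                              # conjunctive arcs (precedence)
    machine_ops = {}                       # machine -> operations using it
    for job, ops in jobs.items():
        names = [f"{job}-op{k + 1}" for k in range(len(ops))]
        for name, (machine, ptime) in zip(names, ops):
            N[name] = ptime
            machine_ops.setdefault(machine, []).append(name)
        A.add(("start", names[0]))         # initial node -> first operation
        A.update(zip(names, names[1:]))    # in-job precedence chain
        A.add((names[-1], "end"))          # last operation -> final node
    # disjunctive arcs: pairs of operations sharing a machine
    E = {pair for ops in machine_ops.values() for pair in combinations(ops, 2)}
    return N, A, E

N, A, E = disjunctive_graph(jobs)
# 4 jobs x 3 operations + start/end = 14 nodes; each machine is shared by
# 4 operations, giving C(4,2) = 6 disjunctive pairs per machine (18 total).
```

This is a sketch of the data structure only; scheduling then amounts to orienting the disjunctive arcs so the resulting graph stays acyclic.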
The distribution of processing times and the assignment of machines is shown in Table 1. Starting from this information, we generate the disjunctive graph of Figure 3, which shows the precedence restrictions between the operations (conjunctive arcs) and the flexibility in the machine operating sequence (disjunctive arcs).

Table 1: Machine assignment and processing times

         Processing time         Machine assignment
         M1    M2    M3          1     2     3
  J1     5     8     2           M1    M2    M3
  J2     3     9     7           M3    M1    M2
  J3     1     10    7           M1    M3    M2
  J4     7     4     11          M2    M3    M1

Figure 3: Disjunctive graph generated from Table 1

3.2 Temporal Constraints Network

This is a directed-graph model that allows the representation of the temporal constraints assigned to the precedence relationships between the operations of an FMS scheduling. The nodes of the graph represent the operations, and the arcs have associated temporal intervals denoting the temporal constraints that exist between two nodes. All times are calculated starting from the initial state of the scheduling. The development of the network is based mathematically on disjunctive metric temporal algebra (Alfonso, 2001). This methodology is used to solve scheduling problems characterized by a group of temporal constraints; consequently, the techniques developed in areas such as temporal reasoning and constraint satisfaction can be applied. In general, two techniques are used: closure and the constraint satisfaction problem (CSP) (Barber, 2000). Closure is a deductive process by means of which new constraints are inferred from existing constraints. It also allows the detection of possible inconsistencies (values of the variables that do not lead to any solution), which can then be eliminated. The main advantage of the closure techniques is the reduction of the search space; therefore, closure is used fundamentally as a preliminary step to a search for solutions.
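The closure step can be illustrated on the simplest case: a network with a single interval per arc (no disjunctions). Composition along a path adds interval bounds, intersection tightens the direct constraint, and an empty interval signals inconsistency. The sketch below is a minimal Floyd-Warshall-style closure under those simplifying assumptions; the node names and intervals are illustrative.

```python
# Minimal closure sketch for a *simple* temporal network: one interval
# (lo, hi) per arc bounding time(b) - time(a). The paper's networks are
# disjunctive; this shows only the non-disjunctive core of closure.

INF = float("inf")

def closure(nodes, intervals):
    """Tighten all pairwise bounds; return None if inconsistent."""
    lo = {(a, b): (0, 0) if a == b else (-INF, INF)
          for a in nodes for b in nodes}
    lo.update(intervals)
    for k in nodes:
        for i in nodes:
            for j in nodes:
                l1, h1 = lo[(i, k)]
                l2, h2 = lo[(k, j)]
                l, h = lo[(i, j)]
                # compose the path i->k->j, then intersect with i->j
                l, h = max(l, l1 + l2), min(h, h1 + h2)
                if l > h:
                    return None            # empty interval: inconsistent network
                lo[(i, j)] = (l, h)
    return lo

tightened = closure(["O0", "O1", "O2"],
                    {("O0", "O1"): (10, 20),
                     ("O1", "O2"): (30, 40),
                     ("O0", "O2"): (0, 55)})
# O0->O2 is tightened from (0, 55) to (40, 55) by the path through O1,
# exactly the kind of inferred constraint that prunes the search space.
```

With disjunctive intervals, the same composition/intersection logic is applied per combination of disjuncts, which is where the CSP search of the next paragraph comes in.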
Constraint satisfaction techniques constitute a process of searching for solutions by means of the successive assignment of values to the variables that appear in the constraints of the problem. In a constraint satisfaction process, it is important to use different heuristics to make the search more efficient (Beck and Fox, 2000). To illustrate the use of the temporal constraints network, a distribution of temporal constraints among four operations of an FMS is presented (see Table 2).

Table 2: Precedence temporal constraints between operations

                        O1         O2                O3                O4
  O0 (initial state)    (10,20)    (10,20)           (20,50)           (60,70)
  O1                               (30,40),(60,60)   (10,30),(40,40)   (40,60)
  O2                                                                   (0,30)
  O3                               (10,20)                             (20,30),(40,50)

Starting from the information in Table 2, we establish the group of operations (O), the group of precedence relationships between operations (P) and the group of temporal relationships between operations (R). Finally, starting from the definitions of these groups, we generate the corresponding temporal constraints graph (see Figure 4):

$$G = (O, P) \qquad (3)$$

$$O = \{O_0, O_1, O_2, O_3, O_4\} \qquad (4)$$

$$P = \left\{\begin{array}{l} (O_0,O_1), (O_0,O_2), (O_0,O_3), (O_0,O_4), \\ (O_1,O_2), (O_1,O_3), (O_1,O_4), (O_2,O_4), \\ (O_3,O_2), (O_3,O_4) \end{array}\right\} \qquad (5)$$

$$R = \left\{\begin{array}{l} (O_0(10,20)O_1), (O_0(10,20)O_2), (O_0(20,50)O_3), (O_0(60,70)O_4), \\ (O_1(30,40)(60,60)O_2), (O_1(10,30)(40,40)O_3), (O_1(40,60)O_4), \\ (O_2(0,30)O_4), (O_3(10,20)O_2), (O_3(20,30)(40,50)O_4) \end{array}\right\} \qquad (6)$$

Figure 4: Temporal constraints network generated from Table 2

3.3 Model Based on Mathematical Programming

Chandra and Tombak (1993) propose an operations-sequence flexibility model based on the theory of linear programming. The model incorporates the concepts of machine reliability and machine processing capacity. Machine reliability is defined as the probability that an operation will be executed by a machine in a certain interval of time.
Machine processing capacity represents the total number of jobs that a machine can execute in an interval of time. The FMS on which the model is developed consists of a finite group of machines that can present independent fault events. The model is formulated considering the flow of jobs on a fixed scheduling, and it is expressed mathematically as:

$$RF = \max \sum_i \sum_h c_{ih}\, x_{ih} \qquad (7)$$

subject to the following constraints:

$$\sum_i \sum_{h \mid b_{ihk}=1} \frac{t_{ik}}{p_k}\, x_{ih} \le T_k, \quad \forall k \qquad (8)$$

$$\sum_h x_{ih} \ge d_i, \quad \forall i \qquad (9)$$

$$x_{ih} \ge 0, \quad \forall i, h \qquad (10)$$

where:

• $t_{ik}$ represents the processing time of job $i$ on machine $k$;
• $T_k$ represents the total processing time of machine $k$;
• $p_k$ represents the probability that machine $k$ is operative at a certain time;
• $b_{ihk}$ represents the possibility that job $i$ is processed on machine $k$ along route $h$;
• $c_{ih}$ represents the cost factor of job $i$ when processed along route $h$; this factor depends on the reliability of the machines located along route $h$;
• $d_i$ represents the minimum demand of job $i$; and
• $x_{ih}$ represents the flow of job $i$ on route $h$.

The model represents the processing capacity of each machine considering its reliability, where the factor $t_{ik}/p_k$ represents the time required to process job $i$ on machine $k$. The model involves $n + m$ constraints and $m\left(1 + \sum_{k=1}^{n-2} k!\, C_k^{n-2}\right)$ variables in the case of fully connected machines.

3.4 Model Based on Fuzzy Logic

Tsourveloudis and Phillis (1998) and Fortemps (2000) analyze flexibility starting from a methodology based on the knowledge that the human expert has of the structural and dynamic behavior of the FMS. For Tsourveloudis and Phillis (1998), flexibility is represented by means of a group of fuzzy linguistic rules of the type:

$$\text{if } \langle \text{fuzzy antecedent} \rangle \text{ then } \langle \text{fuzzy consequent} \rangle \qquad (11)$$

This proposal is based on the fuzzy behavior that the FMS presents in cases of concurrence and synchronization of operations or machine fault events.
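A rule of the form (11) can be sketched with the usual Mamdani-style convention of realizing the fuzzy AND as a minimum over membership degrees. The triangular membership shape, the normalized scale and the particular linguistic term used below are illustrative choices, not taken from the paper.

```python
# Sketch of one fuzzy rule of the form (11): the degree of firing is the
# minimum of the antecedent membership degrees. Membership shapes and the
# normalized [0, 1.5] support are hypothetical.

def tri(x, a, b, c):
    """Triangular membership function rising on [a, b], falling on [b, c]."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def rule_firing(c0, sb):
    """IF C0 is high AND SB is high THEN FR is high: degree of firing."""
    high = lambda x: tri(x, 0.5, 1.0, 1.5)
    return min(high(c0), high(sb))

fr = rule_firing(0.9, 0.75)   # min(~0.8, 0.5) = 0.5
```

A full assessment would aggregate the firing degrees of all such rules and defuzzify; this fragment only shows how a single linguistic rule is evaluated.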
For Tsourveloudis and Phillis (1998), an FMS presents seven different types of flexibility:

1. Materials handling system flexibility: measures the capacity of the system to transfer several job types efficiently between two neighboring points.
2. Product flexibility: measures the capacity of the system to produce mixed jobs.
3. Operative flexibility: measures the capacity to modify the sequence of operations used to produce a job.
4. Process flexibility: measures the capacity of the system to produce different jobs without modifying the FMS structure.
5. Volume flexibility: measures the operative capacity of the FMS at different levels of productivity.
6. Scalability flexibility: measures the cost or time required to enlarge an FMS.
7. Labor flexibility: measures the operators' capacity to carry out different operative tasks of the FMS.

Each of these types of flexibility is expressed by means of a fuzzy rule that represents the expert's knowledge about that flexibility. The fuzzy rule is expressed mathematically as:

$$\text{if } F_1 \text{ is } A_1 \text{ and } \ldots \text{ and } F_n \text{ is } A_n \text{ then } F_{MF} \text{ is } MF \qquad (12)$$

where $A_i = \{\text{low, about\_low, average, about\_high, high}\}$ represents the group of linguistic values for the flexibility parameter. For example, the following fuzzy rule expresses the operative flexibility of the FMS:

$$\text{if } C_0 \text{ is } T_{C_0} \text{ and } S_B \text{ is } T_{S_B} \text{ then } F_R \text{ is } T_{F_R} \qquad (13)$$

In this rule, $C_0$ represents the number of operations common to a group of machines and $S_B$ represents the capacity of the FMS to reroute a group of jobs under machine fault conditions.

4 Conclusions

The models described in this paper define a set of basic principles that all parameters associated with FMS flexibility should satisfy:

• They should be specified and derived according to the methodology proposed for the analysis of flexibility.
• They should incorporate structural and operational aspects of the FMS.
• They should incorporate and accumulate the human expert's knowledge.

From the models, we can conclude that the problem of flexibility is analyzed starting from one of the three big current problems affecting FMS (Calinescu et al., 2003):

• the decision-making problem, analyzed by the methods based on mathematical programming and fuzzy logic;
• the structural problem, analyzed by the disjunctive-graph method; and
• the behavior problem, analyzed by the temporal-constraints-network method.

References

[1] Alfonso, M., An Integration Model of Temporal Constraints Based on Closure Methodology and CSP: Application to Scheduling Problems, doctoral thesis, University of Alicante, Spain, 2001.
[2] Barber, F., "Reasoning on Interval and Point-based Disjunctive Metric Constraints in Temporal Contexts", Journal of Artificial Intelligence Research, pp. 35-86, 2000.
[3] Beck, J. and Fox, M., "Constraint Directed Techniques for Scheduling Alternative Activities", Journal of Artificial Intelligence, pp. 211-250, 2000.
[4] Benjaafar, S. and Ramakrishman, R., "Modeling, Measurement and Evaluation of Sequencing Flexibility in Manufacturing Systems", International Journal of Production Research, vol. 34, pp. 1195-1220, 1996.
[5] Chandra, P. and Tombak, M., Models for the Evaluation of Routing and Machine Flexibility, technical report of Decision Craft Analytics, www.decisioncraft.com, 1993.
[6] Deshmukh, A., Talavage, J. and Barash, M., "Complexity in Manufacturing Systems", IIE Transactions on Manufacturing Systems, vol. 30, pp. 645-655, 2002.
[7] Fortemps, P., "Introducing Flexibility in Scheduling: The Preference Approach", in Advances in Scheduling and Sequencing under Fuzziness, Springer Verlag, pp. 61-79, 2000.
[8] Gupta, A., "Approach to Characterize Manufacturing Flexibility", Second World Conference on Production & Operations Management Society, Cancun, Mexico, pp. 100-120, 2004.
[9] Calinescu, A., Sivadasan, S., Schirn, J. and Huaccho, L., Complexity in Manufacturing: An Information Theoretic Approach, technical report, Manufacturing Systems Research Group, Department of Engineering Science, University of Oxford, England, 2003.
[10] Pelaez, J. and Ruiz, J., Measuring Operational Flexibility, technical report, University of Murcia, Faculty of Economy and Enterprise, Spain, 2004.
[11] Sethi, A. and Sethi, S., "Flexibility in Manufacturing: A Survey", International Journal of Operations and Production Management, pp. 35-45, 1992.
[12] Tsourveloudis, N. and Phillis, Y., "Fuzzy Assessment of Machine Flexibility", IEEE Transactions on Engineering Management, vol. 45, pp. 78-87, 1998.

Héctor Kaschel C. and Luis Manuel Sánchez y Bernal
Depto. de Ingeniería Eléctrica and Depto. de Matemática y Ciencia de la Computación
Universidad de Santiago de Chile
Avda. Ecuador 3519, Estación Central, Santiago, Chile
E-mail: hkaschel@lauca.usach.cl or msanchez@lauca.usach.cl

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 2, pp. 125-134

Design of a MIMO PID Robust Controller Using a Moments-Based Approach

Abdelmadjid Bentayeb, Nezha Maamri, Jean-Claude Trigeassou

Abstract: In this paper we present a new technique for robust MIMO controller synthesis and reduction, based on a reference model and the moments approach, intended to control a MIMO thermal system. The reference model allows the performance requirements for the closed loop to be specified and improves the controller robustness, while the moments tool (frequency and time moments) is used to reduce the controller structure through a nonlinear optimization. The implementation on the real system associates this MIMO PID controller synthesis methodology with Broïda's identification technique in order to carry out an auto-tuning procedure [2][11].
Keywords: PID control, reference model, moments, optimization, robustness, Broïda's identification technique.

1 Introduction

The method of moments was introduced in a previous contribution [1] to show how single-input single-output (SISO) control systems can be treated. We saw that the moments are a good tool for obtaining a reduced robust controller that approximates the closed-loop behaviour to that of a reference model. The reference model gathers all the performance requirements, such as settling time and overshoot. An extension of this method to multivariable (multi-input multi-output) systems is presented in this paper; in this case, the fundamental idea is to choose a diagonal reference model to make the input-output pairing [10], where the diagonal reference transfer functions are chosen using the same method as in the SISO case. In this paper we present our control methodology for MIMO systems with an application to a thermal system. The control of the MIMO thermal system is realized with the help of a PID controller using the moments-based approach and a reference model. The aim of this technique is to synthesize a reduced robust controller (a PID, for example) for implementation. The identification was realized using Broïda's method, an elementary technique widely used in industry, in order to develop an auto-tuning procedure requiring minimum intervention from the user. The controller achieving the performances for the worst-case model is called the ideal controller [10]; it will be reduced using the moments-based approach to obtain a PID structure for implementation [4]. The reduced controller must preserve the same performances as the ideal one; this follows from the frequency and time moments, which are a good tool for synthesis and analysis [7]. Concretely, the reduction procedure is based on a nonlinear optimization whose initialization is given by the least-squares algorithm [8].
The paper is organized as follows: in Section 2 we give a complete description of the application; in Section 3 we develop our synthesis methodology; in Sections 4 and 5 we describe the moments theory and the reduction method; and we finish this communication by presenting the different results obtained and a conclusion.

2 Description

The general diagram of the temperature control device is illustrated by Fig. 1.

Copyright © 2006-2008 by CCC Publications

Figure 1: The general diagram of the system (transistors T1, T2 driven by u1, u2; temperatures θ1, θ2 measured and amplified into y1, y2)

It is composed of a radiator on which two power transistors T1 and T2 are placed, controlled by the tensions u1 and u2, which vary between 0 and 10 volts. Near these power transistors, which represent the heat source, two temperature sensors (Pt 100) are laid out. After adaptation and amplification, we obtain the tensions y1 and y2 (which also vary between 0 and 10 volts), proportional to the temperatures θ1 and θ2. The tension-temperature ratio is 0.02 V/°C.

2.1 Modelling

The aim is to define the model connecting the tensions delivered by the temperature sensors, y1 and y2, to the control tensions u1 and u2 applied to the transistors. The conduction of heat in the radiator, from the power transistors towards the temperature sensors, is governed by the following diffusion equation:

$$\mu \frac{\partial^2 \theta}{\partial x^2} = \frac{\partial \theta}{\partial t} \qquad (1)$$

where $\mu$ is the thermal diffusion coefficient of the process. Consequently, the resulting physical model is a distributed-parameter one, and we preferred to use a black-box model adapted to the synthesis of a control law, obtained from identification. Let:

$$\begin{pmatrix} Y_1(s) \\ Y_2(s) \end{pmatrix} = \begin{pmatrix} G_{11}(s) & G_{12}(s) \\ G_{21}(s) & G_{22}(s) \end{pmatrix} \begin{pmatrix} U_1(s) \\ U_2(s) \end{pmatrix} \qquad (2)$$

where $G(s)$ is the transfer matrix of the system. Notice that the system is symmetrical because of the placement of the heat sources and the position of the sensors.
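The distributed-parameter behaviour described by the diffusion equation (1) can be illustrated with a small explicit finite-difference simulation of a bar heated at one end. This is only a sketch of Eq. (1); the grid size, diffusion coefficient and boundary temperatures are illustrative values, not parameters of the real radiator.

```python
# Explicit (FTCS) finite-difference sketch of the 1-D diffusion equation (1),
# mu * d2theta/dx2 = dtheta/dt, with fixed temperatures at both ends.
# All numerical values are hypothetical.

def diffuse(theta, mu, dx, dt, steps):
    """FTCS update; stable when r = mu*dt/dx**2 <= 0.5."""
    r = mu * dt / dx ** 2
    assert r <= 0.5, "unstable step size"
    for _ in range(steps):
        inner = [theta[i] + r * (theta[i + 1] - 2 * theta[i] + theta[i - 1])
                 for i in range(1, len(theta) - 1)]
        theta = [theta[0]] + inner + [theta[-1]]   # fixed-temperature ends
    return theta

# bar with a hot "transistor" end at 60 C and the far end held at 20 C
theta0 = [60.0] + [20.0] * 9
profile = diffuse(theta0, mu=1e-4, dx=0.01, dt=0.4, steps=200)
# heat spreads from the source: the profile decreases monotonically along x
```

Because the scheme's coefficients (r, 1-2r, r) are non-negative for r ≤ 0.5, each update is a convex combination of neighbouring temperatures, which is why the monotone profile is preserved as the heat diffuses.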
Indeed, we should obtain:

$$\begin{cases} G_{11}(s) = G_{22}(s) \\ G_{12}(s) = G_{21}(s) \end{cases} \qquad (3)$$

In addition, since the sensors θ1 and θ2 are very close to the sources T1 and T2, the transfers $G_{11}$ and $G_{22}$ can be reasonably modelled by a first-order system $\left(\frac{K_{11}}{1+\tau_{11}s}\right)$. On the other hand, θ2 is less sensitive to the effect of T1 than to that of T2 (so $K_{21} < K_{11}$), with a definitely higher settling time. We can thus approximate this transfer by an aperiodic $n$th-order system with $n \gg 1$, or more simply by a first-order system with a time delay $\left(\frac{K_{12}e^{-\alpha_{12}s}}{1+\tau_{12}s}\right)$.

2.2 Identification

Several sophisticated identification techniques (minimization of a quadratic cost by least squares or nonlinear optimization; Ljung [5] and Walter [12]) can be used to estimate the parameters of $G(s)$. Since the aim is to synthesize PID controllers (with an auto-tuning objective), we preferred a basic identification method requiring only step tests. Our choice is thus Broïda's method, which delivers an approximate model of the form $\frac{Ke^{-\alpha s}}{1+\tau s}$, well adapted to the selected modelling [2][11]. Let us recall that the coefficients $\{K, \alpha, \tau\}$ are obtained with the help of the following formulas:

$$\begin{cases} K_{ij} = y_\infty \\ \alpha_{ij} = 2.8\, t_1 - 1.8\, t_2 \\ \tau_{ij} = 5.5\, (t_2 - t_1) \end{cases} \qquad (4)$$

where $y_\infty$ is the final value of the system step response and $t_1$ (respectively $t_2$) is the time at which the output reaches 28% (respectively 40%) of its final value. We made 5 tests, which provided:

          Test 1    Test 2    Test 3    Test 4    Test 5
  K11     0.1221    0.1221    0.1172    0.1172    0.1318
  τ11     577       414       449       451       457
  K12     0.0634    0.0636    0.0586    0.0586    0.0684
  τ12     910.5     862.95    684.2     683.1     863
  α12     7         26.2      59.6      58.9      6
  K21     0.083     0.0781    0.0684    0.0684    0.083
  τ21     715       654.5     808.5     808.5     715
  α21     113       99.8      73.4      73.4      113
  K22     0.0977    0.0977    0.0977    0.0928    0.1074
  τ22     450       445       447       447       600

The settling times of $G_{11}(s)$ and $G_{22}(s)$ are close and equal to about 33 minutes.
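Broïda's formulas (4) can be sketched numerically: from a recorded unit-step response, find the times $t_1$ and $t_2$ at which the output crosses 28% and 40% of its final value, then apply the three formulas. The synthetic response below (parameter values loosely inspired by the identified model, but illustrative) shows that the method recovers a first-order-plus-delay model closely.

```python
import math

def broida(t, y):
    """Estimate (K, alpha, tau) of K e^{-alpha s}/(1 + tau s) from a
    unit-step response y sampled at times t, using Eq. (4)."""
    y_inf = y[-1]
    t1 = next(ti for ti, yi in zip(t, y) if yi >= 0.28 * y_inf)
    t2 = next(ti for ti, yi in zip(t, y) if yi >= 0.40 * y_inf)
    return y_inf, 2.8 * t1 - 1.8 * t2, 5.5 * (t2 - t1)

# Synthetic unit-step response of a known first-order-plus-delay system
K, alpha, tau = 0.12, 30.0, 470.0          # illustrative "true" parameters
t = [0.5 * i for i in range(12000)]
y = [0.0 if ti < alpha else K * (1 - math.exp(-(ti - alpha) / tau)) for ti in t]
Ke, ae, te = broida(t, y)
# Ke, ae, te come out close to 0.12, 30 and 470 respectively
```

Because only two threshold crossings are needed, the procedure is robust enough for the step-test-only, auto-tuning setting the paper targets.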
2.3 The Nominal Model

The nominal model was obtained by averaging the 5 tests:

$$G_{nom}(s) = \begin{pmatrix} \dfrac{0.122}{470s+1} & \dfrac{0.0625\, e^{-31.5s}}{801s+1} \\[2mm] \dfrac{0.0762\, e^{-94.5s}}{740s+1} & \dfrac{0.0987}{478s+1} \end{pmatrix} \qquad (5)$$

From the values of the preceding table, we note that the theoretical symmetry is not verified in practice, although $G_{11}(s)$ and $G_{22}(s)$ are close ($G_{12}(s)$ and $G_{21}(s)$ respectively); these differences are the consequence of the noise disturbing the measurements and the imperfect reproducibility of the assembly.

2.4 Taking Uncertainties into Account

Identification uncertainties will deteriorate the performances of the control device if they are not taken into account during the synthesis. The table recapitulating the 5 tests shows that two types of uncertainty are present:

• uncertainties on the d.c. gains (low-frequency type);
• uncertainties on the time constants and time delays, which cause modelling errors (high-frequency type).

Since the transfer matrix $G(s)$ is theoretically symmetrical, we decided to base the synthesis on a symmetrical worst-case model $G_p(s)$. Using $G_p(s)$, we took all the uncertainties into account by bounding:

• low-frequency uncertainties, by taking $K_{max}$;
• high-frequency uncertainties, by introducing a time delay $e^{-\delta s}$ (see [1]).

Remark 1. The worst-case model has been chosen to take into account all the identification uncertainties: low-frequency uncertainties, by choosing the maximum of the d.c. gains; high-frequency uncertainties, by introducing time delays in all input-output transfers.

Thus, we obtain the following worst-case model:

$$G_p(s) = \begin{pmatrix} \dfrac{0.132\, e^{-5s}}{500s+1} & \dfrac{0.085\, e^{-125s}}{800s+1} \\[2mm] \dfrac{0.085\, e^{-125s}}{800s+1} & \dfrac{0.132\, e^{-5s}}{500s+1} \end{pmatrix} \qquad (6)$$

3 Synthesis

The control configuration is illustrated by Fig.
2.

Figure 2: Unitary feedback configuration (controller C(s) in series with plant G(s), with input and output disturbances pi and po)

$C(s)$ is the controller to implement, $G(s)$ is the plant, $r$ represents the reference inputs, $y$ the outputs to be controlled, $u$ the control inputs, $p_i$ and $p_o$ are the input and output disturbances, and $e$ is the tracking error. The synthesis of $C(s)$ is based on internal model control [6][10], so:

$$C(s) = Q(s)\,(I - G(s)\,Q(s))^{-1} \qquad (7)$$

where $Q(s)$ is any stable transfer matrix [3].

3.1 The Reference Model

$Q(s)$ is calculated using $G_p(s)$ in order to make the closed loop $T_{yr}(s)$ approach a reference model $T_{ref}(s)$:

$$T_{yr}(s) = G_p(s)\,Q(s) = T_{ref}(s) \qquad (8)$$

The reference model gathers all the objectives of the synthesis in terms of stability and performance (i.e. input-output decoupling, settling time, etc.). In our case, $T_{ref}(s)$ is given by:

$$T_{ref}(s) = \begin{pmatrix} \dfrac{T_2(s)\, e^{-\delta s}}{D(s)} & 0 \\[2mm] 0 & \dfrac{T_2(s)\, e^{-\delta s}}{D(s)} \end{pmatrix} \qquad (9)$$

where $T_2(s)$ contains the dominant poles fixing the closed-loop dynamics; $D(s)$ contains the auxiliary poles, which reduce the control input energy, confer robustness on the controller by providing a sufficient roll-off of $T_{yr}(s)$, and ensure its causality; and $e^{-\delta s}$ represents the singularity of $G_p(s)$, which is integrated in $T_{ref}(s)$ so that $Q(s)$ is stable and realizable. So $Q(s)$ is given by:

$$Q(s) = G_p(s)^{-1}\, T_{ref}(s) \qquad (10)$$

Since the singularities of $G_p(s)$ (i.e. time delays and RHP zeros) are integrated in the reference model $T_{ref}(s)$, the stability of $Q(s)$ is guaranteed [3]. $T_2(s)$ and $D(s)$ are given by:

$$T_2(s) = \frac{\omega_n^2}{s^2 + 2\xi\omega_n s + \omega_n^2}, \qquad D(s) = (1 + \eta s)^n \qquad (11)$$

where $\omega_n$ and $\xi$ fix the settling time and the overshoot of the outputs. We can summarize our reference model by saying that it imposes input-output decoupling and the same dynamics for the two outputs of $G(s)$.
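Equations (8)-(11) can be checked pointwise in the frequency domain: evaluating $G_p(j\omega)$ from (6) and $T_{ref}(j\omega)$ from (9)/(11) at a given frequency, $Q = G_p^{-1} T_{ref}$ makes the loop $G_p Q$ exactly diagonal and equal to the reference model there. The reference-model parameters ($\omega_n$, $\xi$, $\eta$, $n$, $\delta$) below are illustrative choices, not the ones tuned in the paper.

```python
import cmath

def gp(s):
    """Worst-case model of Eq. (6), evaluated at a complex frequency s."""
    g11 = 0.132 * cmath.exp(-5 * s) / (500 * s + 1)
    g12 = 0.085 * cmath.exp(-125 * s) / (800 * s + 1)
    return [[g11, g12], [g12, g11]]            # symmetric by construction

def tref(s, wn=0.01, xi=1.0, eta=20.0, n=3, delta=125.0):
    """Diagonal reference model of Eqs. (9) and (11); parameters illustrative."""
    t2 = wn ** 2 / (s ** 2 + 2 * xi * wn * s + wn ** 2)
    diag = t2 * cmath.exp(-delta * s) / (1 + eta * s) ** n
    return [[diag, 0.0], [0.0, diag]]

def inv2(m):
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]

def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

s = 1j * 0.005                                  # one test frequency (rad/s)
q = matmul2(inv2(gp(s)), tref(s))               # Q(s) of Eq. (10), pointwise
loop = matmul2(gp(s), q)                        # equals Tref(s): decoupled
```

The off-diagonal entries of `loop` vanish (to rounding), which is the decoupling that the diagonal reference model imposes.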
3.2 Robustness and the reference model

It is well known that relative modelling errors Δr(s) modify the dynamics of the system, with a possibility of instability, because:

Sypo(s) = I − G(s)Q(s)(I + Δ(s)Q(s))^(-1) = Snom(s)(I + Δr(s)Tref(s))^(-1)   (12)

Robustness of stability and performance is therefore guaranteed if:

‖Δr(s)Tref(s)‖∞ ≤ δsr^(-1)   (13)

where δsr represents the stability margin. Notice that, from (13), we can act on the auxiliary poles of Tref(s) to ensure robustness [3].

3.3 The ideal controller

From (10), we deduce the ideal controller C(s):

C(s) = Q(s)(I − Gp(s)Q(s))^(-1)   (14)

Clearly, the implementation of C(s) is very hard because of the time delays; we therefore reduce its structure to an implementable one, which must preserve the performances ensured by the ideal controller. The moments tool, the particularity of our approach, is used for this reduction.

4 The moments

Consider a linear SISO system characterized by its transfer function G(s), analytic in the RHP (i.e. Re(s) > 0), and let g(t) be its impulse response:

G(s) = ∫₀^∞ g(t) e^(-st) dt   (15)

The transfer function is given by the following (not necessarily minimal) state-space realization:

G(s) = C(sI − A)^(-1)B + D   (16)

where A ∈ R^(n×n), B ∈ R^(n×1), C ∈ R^(1×n) and D ∈ R^(1×1).

4.1 Computing the moments using the state-space realization

Using the equality:

(sI − A)(−A^(-1) − sA^(-2) − s²A^(-3) − ···) = I  ⇒  (sI − A)^(-1) = −Σ_{n=0}^{∞} s^n A^(-(n+1))   (17)

Frequency moments. With the change of variable μ = jω − jω0, equation (15) becomes:

G(μ) = Σ_{n=0}^{∞} (−1)^n μ^n a_{n,ω0}(G)   (18)

and (16):

G(μ) = C(μI − (−jω0 I + A))^(-1)B + D   (19)

so we get:

a_{0,ω0}(G) = −C(−jω0 I + A)^(-1)B + D   (20)

a_{n,ω0}(G) = (−1)^(n+1) C(−jω0 I + A)^(-(n+1))B,   n = 1, …, ∞   (21)

Remark 2. The time moments are given by setting ω0 = 0.
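For a one-state realization, equations (20)-(21) reduce to simple divisions, which makes them easy to check numerically. The Python sketch below (illustrative function names, not from the paper) computes the moments this way; for G(s) = 1/(s+1), i.e. A = −1, B = C = 1, D = 0, all the time moments (ω0 = 0) equal 1, since 1/(s+1) = Σ (−1)^n s^n.

```python
def moments_scalar(A, B, C, D, w0, N):
    """Frequency moments a_{n,w0} of a one-state realization (A, B, C, D),
    following equations (20)-(21); the matrix inverse becomes a scalar division."""
    m = -1j * w0 + A                       # (-j*w0*I + A), here a scalar
    out = [-C * B / m + D]                 # a_0 from equation (20)
    for n in range(1, N + 1):
        out.append((-1) ** (n + 1) * C * B / m ** (n + 1))  # equation (21)
    return out
```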
Both time and frequency moments can easily be calculated with the previous relations. Thanks to the state-space computation of the moments, moments of MIMO (multi-input multi-output) systems can also be computed.

5 The controller reduction

For more details on our model reduction procedure, please refer to [1]. It is desirable for the implemented controller to have a reduced structure (a PID, for example) [6]. Let:

θ = [ θ11(s) ··· θ1m(s)
      ...          ...
      θm1(s) ··· θmm(s) ]   (22)

be the matrix representing the controller parameters to be calculated. θij represents the parameter vector of the numerator and denominator of Cr,ij(s), i.e. the reduced controller between the jth input and the ith output. We define the cost function J as the squared 2-norm of the errors between the moments of the ideal controller and those of the reduced one:

J = Σ_{n=0}^{N} ‖εn‖₂² = Σ_{n=0}^{N} ‖a_{n,ω0}(C) − a_{n,ω0}(Cr)‖₂²   (23)

where a_{n,ω0}(Cr) represents the nth-order moment matrix, which is a function of the parameters θ:

a_{n,ω0}(Cr) = fn(θ)   (24)

so that:

J = Σ_{n=0}^{N} ‖a_{n,ω0}(C) − fn(θ)‖₂²   (25)

The objective is to determine the estimated parameters θ̂ minimizing J around ω0. This frequency ω0 is chosen so as to preserve the stability of the system (i.e. the critical frequency); see the generalized Nyquist criterion [10].

5.1 Linear optimization

The first step consists in imposing the common denominator dr(s) of the reduced controller (for example, we can take the auxiliary poles of the reference model: dr(s) = d(s)). Only the zeros then have to be determined; the function fn(θ) is linear, and the minimization of J is obtained by least squares. Let:

Cr(s) = CLS(s)   (26)

be the reduced controller used to initialize the nonlinear programming algorithm.

Remark 3. By imposing the poles of the reduced controller, for example a pole with an integral action, we define a PID structure.
For example, we can take dr(s) = s(1 + ηs).

5.2 Nonlinear optimization

Imposing the poles of the reduced controller limits its performance and, consequently, that of the closed loop. It is therefore preferable to optimize the whole structure (i.e. poles and zeros); the function fn(θ) is then nonlinear, and the estimation of θ is obtained by nonlinear programming [7]. We use Marquardt's algorithm, which is a good compromise between speed and convergence [8]. The parameters are updated with the following algorithm:

θ̂(i+1) = θ̂(i) − { [J″ + λi I]^(-1) J′ } evaluated at θ̂ = θ̂(i)   (27)

where:

J′ = ∂J/∂θ : gradient vector   (28)
J″ = ∂²J/∂θ² : Hessian matrix   (29)
λi : coefficient to adjust   (30)

The initialization is given by the least-squares solution:

θ̂0 = θ̂LS   (31)

6 Results and comments

The controller synthesis and reduction technique was applied to the thermal system with 2 inputs and 2 outputs. Recall that this thermal system is characterized by a long open-loop settling time (approximately 33 minutes); moreover, the elementary modelling used in a disturbed context involves large uncertainties. The objective of the temperature control, in an auto-tuning context, is twofold:

• reduce the closed-loop settling time considerably by using a controller with a simplified structure: a PID controller;
• guarantee robustness in spite of the simplicity of the controller, the level of uncertainty, and the reduction of the settling time.

The PID controllers were implemented with the xPC Target software of Matlab, with a sample time Te = 1 s.
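A scalar illustration of the update (27), with finite-difference approximations standing in for J′ and J″ (the paper computes these from the moment sensitivities; the finite differences here are only for demonstration):

```python
def marquardt_step(J, theta, lam, h=1e-5):
    """One Marquardt update theta - (J'' + lam)^(-1) * J' for a scalar parameter,
    with central finite-difference gradient and Hessian (illustrative only)."""
    grad = (J(theta + h) - J(theta - h)) / (2 * h)                    # J'
    hess = (J(theta + h) - 2 * J(theta) + J(theta - h)) / h ** 2      # J''
    return theta - grad / (hess + lam)
```

For the quadratic J(θ) = (θ − 2)², a single step with λ = 0 lands on the minimizer θ = 2, while a large λ shortens the step: this is how the algorithm trades Gauss-Newton speed against gradient-descent robustness.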
Our objective is to accelerate the system reasonably, by a factor of about 6, thus passing from 33 min in open loop to 5 min in closed loop, while maintaining the relative overshoot around 5 %; for that, we fixed:

ωn = 0.02 rad/s,   ξ = 0.7

The auxiliary poles ensuring the robustness condition (13) are:

d(s) = (1 + 3s)³

The PID controller obtained using the synthesis and reduction procedures described above is:

Cr(s) = [ C11(s) C12(s)
          C21(s) C22(s) ]   (32)

with:

C11(s) = C22(s) = (−3.162s² + 52.81s + 0.14) / (s(1 + 34.87s))
C12(s) = C21(s) = (−2019e−7 s² + 4403e−6 s − 7193e−5) / (s(1 + 15.88s))   (33)

The implementation provides the results illustrated by Fig. 3 and Fig. 4.

Figure 3: Step responses (outputs y1 and y2 versus the reference tref; time in seconds, output in volts)

Figure 4: The control inputs (u1 and u2; time in seconds, control input in volts)

Remark 4. We can check that the system's responses correspond perfectly to those of the reference model. We can also check the perfect decoupling of the system outputs: y2 is insensitive to the reference input applied to y1, and reciprocally. Finally, the control input obtained is completely reasonable, both with respect to its initial magnitude and in its insensitivity to the output noise (of considerable level).

7 Conclusion

We presented in this communication a synthesis and reduction technique of robust controllers for the multivariable control of a thermal system. This methodology is based on a reference model explicitly integrating the desired performances. The experimental results show that the closed-loop system satisfies the performances described by the reference model.
The moments approach, characteristic of this methodology, allows the reduction of the ideal controller to a PID structure, while guaranteeing the dynamic performances and especially the robustness, as the experimental results testify. Let us finally recall that this synthesis and reduction methodology was associated with Broïda's identification technique to carry out the auto-tuning of multivariable PID controllers.

Bibliography

[1] A. Bentayeb, N. Maamri and J.-C. Trigeassou, "The moments in control: a tool for analysis, reduction and design", International Journal of Computers, Communications and Control, Vol. II, 2007.
[2] Anon, "Special edition on PID tuning methods", Computing and Control Engineering Journal, 10(2), 1999.
[3] J.C. Doyle, B.A. Francis and A. Tannenbaum, "Feedback Control Theory", Macmillan, New York, 1991.
[4] C.E. Garcia and M. Morari, "Internal model control: a unifying review and some new results", Ind. Eng. Chem. Proc. Des. Dev., Vol. 21(2), 1982.
[5] L. Ljung, "System Identification: Theory for the User", Prentice Hall, 1987.
[6] M. Morari and E. Zafiriou, "Robust Process Control", Prentice Hall, 1989.
[7] N. Maamri, A. Bentayeb and J.-C. Trigeassou, "Design and iterative optimization of reduced robust controllers with equality constraints", ROCOND, Milan, 2003.
[8] D.W. Marquardt, "An algorithm for least-squares estimation of nonlinear parameters", Journal of the Society for Industrial and Applied Mathematics, Vol. 11(2), 1963.
[9] Q.G. Wang, "Decoupling Control", Lecture Notes in Control and Information Sciences, Springer-Verlag, 2003.
[10] S. Skogestad and I. Postlethwaite, "Multivariable Feedback Control", John Wiley, 1996.
[11] R. Toscano, "A simple robust PI/PID controller design via numerical optimization approach", Journal of Process Control, 15, 2005.
[12] E. Walter and L. Pronzato, "Identification of Parametric Models from Experimental Data", Communications and Control Engineering Series, Springer, 1997.

A. Bentayeb, N.
Maamri and J.-C. Trigeassou
University of Poitiers
Laboratoire d'Automatique et d'Informatique Industrielle
40 Avenue du Recteur Pineau, 86022 Poitiers
E-mail: abdelmadjid.bentayeb@gmail.com

Received: January 16, 2007

Abdelmadjid Bentayeb was born in 1977; he obtained his PhD in automatic control at the University of Poitiers in 2006. His current research interests include robust control and model reduction. Nezha Maamri is a lecturer at the University of Poitiers; her current research interests include identification and robust control. Jean-Claude Trigeassou is a professor at the University of Poitiers; his current research interests include identification and control.

Int. J. Comput. Commun., ISSN 1841-9836, Vol. 7 (2012), No. 4 (November), pp. 759-766

Transmission Control for Future Internet Including Error-Prone Wireless Region

I. Ryoo, S. Kim

Intae Ryoo, Seokhoon Kim
Dept. of Computer Engineering & RU-IPTV Research Center
Kyunghee Univ., 1, Seocheon-dong, Giheung-gu, Yongin-si, Gyeonggi-do, Korea
E-mail: itryoo@khu.ac.kr, kimsh@khu.ac.kr

Abstract: This paper introduces a transmission control scheme aimed at enhancing the overall transmission capability of internetworks including error-prone wireless regions. The proposed scheme can accurately adapt to wireless communication environments by integrating new approaches to bandwidth estimation, loss detection, and error recovery, while differentiating data losses due to the physical bit error characteristics of error-prone wireless regions from those due to network congestion. Simulations have verified that the proposed scheme shows better throughput performance than the existing major transmission control schemes such as New Reno, Vegas, and Westwood+, while achieving satisfactory levels of fairness and friendliness with them.
Keywords: transmission control, error-prone wireless region, bandwidth estimation, loss detection, error recovery

1 Introduction

Wireless and mobile communication environments are already pervasively deployed, and the ever-growing communication demand in error-prone wireless regions of the upcoming future Internet makes it difficult to maintain the performance level of legacy transmission controls. This is mainly due to the inability of existing transmission control schemes to differentiate data losses resulting from inevitable bit errors in error-prone wireless regions [1] from those resulting from network congestion. As bad channel conditions and aperiodic disconnections are typically transient phenomena, legacy TCP congestion control responses may be inappropriate and undesirable for networks with error-prone wireless regions. In order to achieve a higher level of transmission efficiency, we have proposed a new transmission control method that is well suited for error-prone wireless regions as well as wired regions. Many solutions have been proposed to improve TCP performance in wireless networks; they can be categorized into end-to-end (E2E), split-connection, and link-layer solutions [2][3]. The proposed transmission control protocol for error-prone wireless regions (TCP EWR) falls into the E2E category, as it measures packet arrival rates and acts according to the current network status. It differs, however, from general E2E solutions in that it aims to improve performance in the case of random or sporadic data losses by performing a time-stamp based bandwidth estimation at the receiving TCP rather than at the sending TCP, calculating the expected and actual bandwidths, and reacting accordingly to random wireless packet loss events. It measures bandwidth utilization by using packet arrival information at the receiver, rather than acknowledgement (ACK) information at the sender as TCP New Jersey does [4].
As a result, sending feedback information to the receiver is not necessary in our scheme. Moreover, in order to keep TCP EWR operating in a stable state, we have introduced two thresholds, α and β, which correspond to having too little and too much data en route to the destination, respectively. In addition, TCP EWR performs an enhanced error recovery in case the network suffers from timeout expiration due to wireless packet losses. That is, if there is any sign of data loss, either by retransmission timeout or by three duplicate ACKs, TCP EWR sets the slow start threshold (ssthresh) value to the optimal congestion window (ocwnd). With these features, the overall performance of TCP EWR can be increased for networks including error-prone wireless regions. In Sect. 2, we discuss the bandwidth estimation, loss detection, enhanced error recovery, and congestion control procedures of TCP EWR. We also evaluate our scheme through performance tests in Sect. 3, and finally bring the paper to a conclusion in Sect. 4.

Copyright © 2006-2012 by CCC Publications

2 Transmission control for future Internet including error-prone wireless regions

The proposed transmission control scheme is composed of bandwidth estimation at the receiver, loss detection, enhanced error recovery, and congestion control. The first step of bandwidth estimation is to calculate a sample bandwidth bwsample(n) when the nth packet arrives at the receiver at time tn:

bwsample(n) = dn / (tn − tn−1)   (1)

where dn is the amount of data currently received by the receiver and tn−1 is the previous, (n−1)th, packet arrival time. That is, differently from existing schemes such as Westwood+ and New Jersey [5], our scheme uses packet arrival information at the receiver instead of returning ACK information at the sender.
Note that Westwood+ and New Jersey are considered in this work because they are representative transmission control schemes that can distinguish wireless packet losses from congestion packet losses and react accordingly. By incorporating the actual arrival rate information in the bandwidth estimation and, at the same time, including the congestion-related packet loss probability in the packet arrival rate calculation [6], the proposed scheme can probe the available bandwidth more accurately. The next step is to estimate the available bandwidth with a time-varying exponentially weighted moving average filter, which yields the following equation:

bwestimated(n) = bwestimated(n−1) × δn + bwsample(n) × (1 − δn)   (2)

where δn is a constant filter gain. In our simulation, we set δn to 0.7 for n ≥ 2 and to zero for n = 1, for simplicity. The optimal congestion window size ocwndn is then computed as:

ocwndn = RTTmin × bwestimated(n) / seg_size   (3)

where RTTmin is the minimum measured round-trip time (RTT) and seg_size is the length of the TCP segment. Note that we do not consider the available buffer space at the receiver in this work, and ocwndn is used as a receiver-advertised congestion window size. Simulation results in Sect. 3 show that this yields good bandwidth estimates for random packet loss scenarios with an error-prone wireless region. In order to proactively detect the incipient stage of congestion and to efficiently figure out the reason for packet losses, the proposed scheme calculates the maximum expected throughput and measures the actual throughput, similarly to the Vegas and Veno approaches [7]:

expected throughput = windowsize / RTTmin   (4)

actual throughput = windowsize / RTT   (5)

where RTT is the smoothed round-trip time measured.
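Equations (1)-(3) can be sketched as follows (Python; the variable names are ours, and the first sample initializes the filter, as the paper's δ1 = 0 prescribes):

```python
def estimate_bandwidth(arrivals, delta=0.7):
    """Receiver-side bandwidth estimate of eq. (1)-(2): arrivals is a list of
    (arrival_time, bytes_received) pairs; delta is the EWMA filter gain."""
    bw = None
    prev_t = arrivals[0][0]
    for t, d in arrivals[1:]:
        sample = d / (t - prev_t)                                          # eq. (1)
        bw = sample if bw is None else bw * delta + sample * (1 - delta)   # eq. (2)
        prev_t = t
    return bw

def ocwnd(rtt_min, bw_estimated, seg_size):
    """Optimal congestion window of eq. (3), in segments."""
    return rtt_min * bw_estimated / seg_size
```

For example, packets of 1000 bytes arriving one second apart give an estimate of 1000 bytes/s; with RTTmin = 0.1 s and 100-byte segments, the optimal window is one segment.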
The difference d between these two throughputs indicates the amount of data currently residing on the corresponding path and/or being lost due to network congestion or error-prone wireless regions. By using the backlog variable x = d × RTTmin and the two thresholds α and β, which correspond to having too little and too much data on the route respectively, we can decide how to manage the sender's congestion window (cwnd) size based on the current network situation. In our simulations the values α = 1 and β = 9 are used, but they can be set to better adapt to the future Internet environment, which is left for further study. With these indicators of the current network status, TCP EWR performs an enhanced error recovery (EER) mechanism. If there is any sign of data loss, either by retransmission timeout (RTO) at the sender or by three duplicate ACKs (DUPACKs) from the receiver, the proposed scheme sets the ssthresh value to ocwndn. Note that, if the connection is restored after physical bit errors, the window size should grow quickly to make up for the previous data loss event. In the proposed scheme, the sender immediately adjusts its congestion window size to ocwndn, because the receiver has continuously monitored the available bandwidth of the corresponding path as given in Eqs. (1) and (2). There is no need to employ any slow start, additive increase, or fast recovery algorithm when the connection is recovered. With regard to congestion window updates when a packet loss is detected, there are three cases depending on the values of the backlog variable x and the two thresholds α and β. When x < α, the proposed scheme updates the congestion window size cwnd to ocwndn or follows an additive-increase paradigm (linear increase state), depending on the comparison of the current cwnd with the new ssthresh value.
The reason the proposed scheme updates cwnd in this way is that the network condition is not too bad, as the backlog x < α implies, even though packets have been lost during transmission. When α ≤ x ≤ β, it updates the congestion window size to ocwndn or keeps the previous cwnd value unchanged, depending on the comparison of the current cwnd with the new ssthresh value. The backlog α ≤ x ≤ β implies a high probability that packets will be lost due to either network congestion or wireless bit errors, so it is desirable to retain the previous cwnd (wait-and-see state) rather than follow an additive-increase/adaptive-decrease paradigm. When x > β, it decreases the congestion window size adaptively; the backlog x > β means that the network has fallen into a congestion state. By adopting this EER approach, we can use the available bandwidth efficiently. The pseudo code of the proposed TCP EWR transmission control algorithm is given below:

a) When packets successfully arrive at the receiver:
     bwsample(n), bwestimated(n), and ocwndn are computed;
b) On ACK reception at the sender:
     cwnd = ocwndn;
c) When 3 DUPACKs are received or the RTO expires:
     if (x < α)
         ssthresh = ocwndn;
         if (cwnd > ssthresh)
             cwnd = ocwndn;
         else                      /* linear increase state */
             cwnd = cwnd + 2;
     else if (α ≤ x ≤ β)
         ssthresh = ocwndn;
         if (cwnd > ssthresh)
             cwnd = ocwndn;
         else                      /* wait-and-see state */
             cwnd is retained;
     else                          /* congestion state */
         ssthresh = ocwndn;
         cwnd = 1;
     end if

Note that cwnd is increased by two in the linear increase state because it is updated after one RTO rather than after each RTT.

3 Simulation results

The proposed transmission control scheme has been simulated using ns-2, and its performance is compared with several representative schemes: New Reno, Vegas, Westwood+, and New Jersey. The performance metrics used in the simulations are throughput, fairness, and friendliness.
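The loss-event branch of the pseudo code above maps directly onto a small function (a sketch; cwnd and ocwnd are in segments, and the names are ours):

```python
def ewr_on_loss(cwnd, backlog_x, ocwnd_n, alpha=1.0, beta=9.0):
    """TCP EWR window update on 3 DUPACKs or RTO expiry (sect. 2 pseudo code).
    Returns the new (ssthresh, cwnd) pair."""
    ssthresh = ocwnd_n
    if backlog_x < alpha:                                  # network in good shape
        cwnd = ocwnd_n if cwnd > ssthresh else cwnd + 2    # linear increase state
    elif backlog_x <= beta:                                # uncertain: wait and see
        cwnd = ocwnd_n if cwnd > ssthresh else cwnd        # cwnd retained
    else:                                                  # congestion state
        cwnd = 1
    return ssthresh, cwnd
```

Note that, unlike New Reno's multiplicative decrease, every branch re-anchors ssthresh to the receiver-estimated optimal window, so a loss caused by a transient bit error does not permanently halve the sending rate.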
Although TCP EWR is proposed to adapt to a future Internet including error-prone wireless regions, it must also operate well in any topology with lossy or congested links. Figure 1 shows a single bottleneck scenario used for comparing the performance metrics of TCP EWR with those of well-known schemes under various link error rates and background traffic.

Figure 1: Single bottleneck scenario

Figure 2 shows the throughput comparison results. TCP EWR shows almost similar performance when the link error rate is low. However, as the link error rate increases, especially beyond a link error rate of 8 %, TCP EWR outperforms the other schemes. In addition, the fairness index [8] has been compared by considering 10 identical TCP flows that share a 10 Mbps bottleneck link. The different TCP schemes have been simulated individually, and the corresponding results are summarized in Table 1. A perfectly fair bandwidth allocation gives a fairness index of 1. It has been verified that, except at a 0 % link error rate, TCP EWR's fairness index achieves a satisfactory margin compared to the other schemes. The reason TCP EWR shows an inferior result under the very low link error condition is that it does not use the bandwidth aggressively but maintains the optimal congestion window size ocwndn. Figure 3 shows a mixed wired and error-prone wireless scenario with multiple TCP connections.

Figure 2: Throughput comparison for single bottleneck scenario

Table 1: Fairness comparison for single bottleneck scenario

    Error rate (%)   New Reno   Westwood+   EWR
    0                0.76       0.80        0.64
    0.1              0.31       0.93        0.98
    1                0.25       0.74        0.76
    10               0.57       0.23        0.62

In this scenario, bidirectional FTP background traffic flows between n1 and n3 and between n2 and n4. The queue sizes of the wired links and the wireless link are set to 100 and 10, respectively.
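The fairness index of [8] used in Table 1 is Jain's index: for throughputs x1, …, xn it is (Σxi)² / (n Σxi²), which equals 1 for a perfectly fair allocation and 1/n when one flow takes everything. A one-function sketch:

```python
def jain_fairness(throughputs):
    """Jain's fairness index [8]; 1.0 means perfectly fair sharing."""
    n = len(throughputs)
    total = sum(throughputs)
    return total * total / (n * sum(x * x for x in throughputs))
```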
Figure 3: Mixed wired/error-prone wireless scenario

With these simulation conditions, we have compared TCP EWR with New Reno, Vegas, and Westwood+. New Reno is considered as the leading Internet congestion control scheme. Vegas is considered because, like Westwood+, it proposes a new mechanism to throttle the congestion window based on measuring the network congestion status. Westwood+ is considered because it remarkably improves the utilization of wireless links affected by losses not due to congestion. Figure 4 shows the corresponding throughput comparison results. New Reno, Westwood+, and EWR show almost similar throughput results when the link error rate is less than 1 %. When the link error rate is greater than 1 % and less than 10 %, EWR outperforms Westwood+. Vegas shows poor performance throughout the whole link error range. From the results shown in Figure 2 and Figure 4, we can conclude that EWR shows better throughput performance than the other schemes for the wireless scenario as well as for the wired scenario. The reason EWR again shows results similar to New Reno and Westwood+ is that a link with more than a 10 % error rate makes all the above control schemes fall into the congestion state, where they can no longer be logically managed.

Figure 4: Throughput comparison for mixed wired/error-prone wireless scenario

For the same topology shown in Figure 3, the fairness indices of New Reno, Westwood+, and EWR are compared. There are seven TCP flows of the same version sharing a bottleneck link whose bandwidth is 5 Mbps. The results are shown in Table 2. From these results, we can verify that EWR also achieves a satisfactory level of fairness, as New Reno and Westwood+ do.
Table 2: Fairness comparison for mixed wired/error-prone wireless scenario

    Error rate (%)   New Reno   Westwood+   EWR
    0                0.95       0.94        0.96
    0.1              0.94       0.94        0.89
    1                0.95       1.00        0.95
    10               0.96       0.95        0.96

Finally, the friendliness of the proposed scheme with New Reno and Westwood+ has been tested using the simple network topology shown in Figure 5. In the simulations there are 10 pairs of connections, where m is the number of hosts that communicate based on the proposed scheme and n is the number of hosts that use New Reno or Westwood+. The wireless link error rate has been set to vary between 0.1 and 10 %, and the corresponding results are summarized in Table 3 and Table 4.

Figure 5: A network topology for verifying friendliness

Table 3: Friendliness comparison of the EWR with New Reno (throughputs in kbps)

    Connections (New Reno)   Connections (EWR)   Mean throughput (New Reno)   Mean throughput (EWR)
    3                        7                   69.24                        59.97
    5                        5                   77.91                        77.68
    7                        3                   71.55                        71.03

Table 4: Friendliness comparison of the EWR with Westwood+ (throughputs in kbps)

    Connections (Westwood+)   Connections (EWR)   Mean throughput (Westwood+)   Mean throughput (EWR)
    3                         7                   73.71                         81.95
    5                         5                   78.03                         97.03
    7                         3                   63.25                         70.62

From the results, it has been shown that the proposed TCP EWR has good, controllable friendliness compared with the other two major TCP control schemes in error-prone wireless networking environments.

4 Conclusion

This paper introduces a new transmission control scheme for a future internetworking environment with error-prone wireless regions. The proposed scheme is designed to adjust its congestion window optimally based on the current network situation while estimating the available bandwidth at the receiver.
As a result, accurate window updates and transmission control are possible, obtaining high throughput while achieving satisfactory levels of fairness and friendliness, which are important indices for the proposed TCP EWR to be a feasible scheme.

Acknowledgement. This research was supported by Kyung Hee University, Korea (20090724).

Bibliography

[1] Sumit Rangwala, Apoorva Jindal, Ki-Young Jang, and Konstantinos Psounis, "Understanding congestion control in multi-hop wireless mesh networks", Proceedings of the 14th ACM International Conference on Mobile Computing and Networking, San Francisco, California, USA, pp. 291-302, 2008.
[2] Dhiman Barman, Ibrahim Matta, Eitan Altman, and Rachid El Azouzi, "TCP optimization through FEC, ARQ and transmission power tradeoffs", Lecture Notes in Computer Science, Vol. 2957, pp. 87-98, 2004.
[3] Rajashree Paul and Ljiljana Trajković, "Selective-TCP for wired/wireless networks", Proceedings of SPECTS 2006, Calgary, AL, Canada, pp. 339-346, 2006.
[4] Kai Xu, Ye Tian, and Nirwan Ansari, "TCP-Jersey for wireless IP communications", IEEE Journal on Selected Areas in Communications, Vol. 22, No. 4, pp. 747-756, 2004.
[5] Luigi A. Grieco and Saverio Mascolo, "Performance evaluation and comparison of Westwood+, New Reno and Vegas TCP congestion control", ACM SIGCOMM Computer Communication Review, Vol. 34, No. 2, pp. 25-38, 2004.
[6] Jitendra Padhye, Victor Firoiu, Don Towsley, and Jim Kurose, "Modeling TCP throughput: a simple model and its empirical validation", ACM SIGCOMM Computer Communication Review, Vol. 28, Issue 4, pp. 303-314, 1998.
[7] Cheng Peng Fu and Soung C. Liew, "TCP Veno: TCP enhancement for transmission over wireless access networks", IEEE Journal on Selected Areas in Communications, Vol. 21, No. 2, pp. 216-228, 2003.
[8] Rajendra K. Jain, Dah-Ming W. Chiu, and William R.
Hawe, "A quantitative measure of fairness and discrimination for resource allocation in shared computer systems", DEC-TR-301, Eastern Research Lab., http://www.cs.wustl.edu/~jain/papers/ftp/fairness.pdf

International Journal of Computers, Communications & Control, Vol. II (2007), No. 4, pp. 367-374

An Efficient Numerical Integration Algorithm for Cellular Neural Network Based Hole-Filler Template Design

V. Murugesh, Krishnan Batri

Abstract: This paper presents a design method for the template of the hole-filler used to improve the performance of character recognition using numerical integration algorithms. This is done by analyzing the features of the hole-filler template and the dynamic process of the CNN, and by using popular numerical algorithms to obtain a set of inequalities satisfying its output characteristics as well as the parameter range of the hole-filler template. Some simulation results and comparisons are also presented.

Keywords: cellular neural networks; Euler algorithm; RK-Gill algorithm; RK-Butcher algorithm; ordinary differential equations; hole-filler.

1 Introduction

Cellular neural networks (CNNs) are analog, time-continuous, nonlinear dynamical systems and formally belong to the class of recurrent neural networks. Since their introduction in 1988 by Chua and Yang [1, 2], they have been the subject of intense research. Initial applications include image processing, signal processing, pattern recognition, and the solution of partial differential equations. Runge-Kutta (RK) methods have become very popular, both as computational techniques and as a subject of research, as discussed by Butcher [3, 4]. The method was derived by Runge around the year 1894 and extended by Kutta a few years later. They developed algorithms to solve differential equations efficiently; these are the equivalent of approximating the exact solutions by matching "n" terms of the Taylor series expansion.
Butcher [3] derived the best RK pair along with an error estimate, and by all statistical measures it appeared as the RK-Butcher algorithm. This RK-Butcher algorithm is nominally considered sixth order, since it requires six function evaluations, but in actual practice the "working order" is close to five (fifth order). Bader [4, 5] introduced the RK-Butcher algorithm for finding truncation error estimates and intrinsic accuracies and for the early detection of stiffness in coupled differential equations that arise in theoretical chemistry problems. Recently, Devarajan et al. [7] used the RK-Butcher algorithm for finding the numerical solution of an industrial robot arm control problem. Oliveira [8] introduced the popular RK-Gill algorithm for the evaluation of the 'effectiveness factor' of immobilized enzymes. In this paper, we describe the dynamic behavior of the CNN in Section 2, the hole-filler template design ideas in Section 3, the numerical integration algorithms in Section 4, and simulation results in Section 5.

Copyright © 2006-2007 by CCC Publications

2 Dynamic analysis of the CNN

The dynamic equation of cell C(i, j) in an M × N cellular neural network is given by Chua and Yang [1, 2]:

C dxij(t)/dt = −(1/Rx) xij(t) + Σ_{C(k,l)∈Nr(i,j)} A(i, j; k, l) ykl(t) + Σ_{C(k,l)∈Nr(i,j)} B(i, j; k, l) ukl + I   (1)

yij(t) = ½ (|xij(t) + 1| − |xij(t) − 1|),   1 ≤ i ≤ M, 1 ≤ j ≤ N   (2)

Figure 1: CNN cell

where xij, yij and uij are the state voltage, output voltage and input voltage respectively, and they are functions of time t. Rx is a linear resistance, C is a linear capacitor, and A(i, j; k, l) and B(i, j; k, l) are the transconductances of the output and input voltages of C(k, l) with respect to C(i, j), called the cloning templates of the CNN. Nr(i, j) denotes the rth neighborhood of C(i, j), and I is an independent current source.
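The output characteristic (2) is the standard piecewise-linear saturation, which can be sketched in one line:

```python
def cnn_output(x):
    """Piecewise-linear CNN output of eq. (2): y = (|x + 1| - |x - 1|) / 2.
    Saturates at -1 and +1, and acts as the identity in between."""
    return (abs(x + 1) - abs(x - 1)) / 2
```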
From equation (2) one can see that the output voltage is nonlinear. We can rewrite the cell equation (1) as follows:

C dxij(t)/dt = −f[xij(t)] + g(t)   (3)

where

f[xij(t)] = (1/Rx) xij(t)   (4)

g(t) = Σ_{C(k,l)∈Nr(i,j), C(k,l)≠C(i,j)} A(i, j; k, l) ykl(t) + Σ_{C(k,l)} B(i, j; k, l) ukl + I   (5)

3 Hole-filler template design

The hole-filler is a cellular neural network discussed by Yin et al. [9], which fills up all the holes and leaves the image unaltered outside the holes in a bipolar image. Let Rx = 1, C = 1, and let +1 stand for a black pixel and −1 for a white one. We shall discuss images having holes enclosed by black pixels, where the bipolar image is the input u = {uij} of the CNN. The initial state values are set to xij(0) = 1; from equation (2) the output values are yij(0) = 1, 1 ≤ i ≤ M, 1 ≤ j ≤ N. Suppose the templates A and B and the independent current source I are given by:

A = [ 0 a 0
      a b a
      0 a 0 ],  a > 0, b > 0,    B = [ 0 0 0
                                       0 4 0
                                       0 0 0 ],    I = −1   (6)

where the template parameters a and b are to be determined. In order to make the outer edge cells behave like inner ones, auxiliary cells are normally added along the outer boundary of the image; their state values are set to zero by the circuit realization, resulting in zero output values. The state equation (1) can then be rewritten as:

dxij(t)/dt = −xij(t) + Σ_{C(k,l)∈Nr(i,j)} A(i, j; k, l) ykl(t) + 4uij(t) + I   (7)

For the cell C(i, j), we call the cells C(i+1, j), C(i−1, j), C(i, j+1) and C(i, j−1) its non-diagonal cells. Several cases are to be considered.

Case 1: The input value is uij = +1 for cell C(i, j), signaling a black pixel. Because the initial state value of the cell C(i, j) has been set to 1, xij(0) = 1, from equation (2) its initial output value is also yij(0) = 1.
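Since the paper's subject is the numerical integration of these dynamics (Euler, RK-Gill, RK-Butcher), a forward-Euler step of the hole-filler state equation (7) can be sketched as below. The template values a = 1, b = 3 are illustrative choices consistent with the inequalities derived in the sequel, not values fixed by the paper; the step size h is likewise arbitrary.

```python
def holefiller_euler_step(x, y_neighbors, y_self, u, h=0.05, a=1.0, b=3.0, I=-1.0):
    """One forward-Euler step of eq. (7) for a single cell:
    dx/dt = -x + a*(sum of the four non-diagonal neighbor outputs) + b*y_self + 4*u + I."""
    dxdt = -x + a * sum(y_neighbors) + b * y_self + 4 * u + I
    return x + h * dxdt
```

For a black input pixel (u = +1) whose cell and non-diagonal neighbors are all black, dx/dt = −1 + 4a + b + 3 > 0, so the state (and hence the black output) is preserved, as Case 1 requires; for a white input pixel inside a hole surrounded by black, dx/dt = −1 + 4a + b − 5, which is positive for the chosen a, b, so the hole fills.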
According to the hole-filler demands, its eventual output should be y_ij(∞) = 1. To obtain this result we set

dx_ij(t)/dt ≥ 0.   (8)

Substituting the input u_ij = 1 and equation (6) into equation (7), we obtain

dx_ij(t)/dt = -x_ij(t) + a [ y_(i-1)j(t) + y_(i+1)j(t) + y_i(j-1)(t) + y_i(j+1)(t) ] + b y_ij(t) + 3.   (9)

Combining equations (8) and (9) and considering the minimum value x_ij(t) = 1, this case yields

a [ y_(i-1)j(t) + y_(i+1)j(t) + y_i(j-1)(t) + y_i(j+1)(t) ] + b y_ij(t) + 2 ≥ 0.   (10)

To facilitate our discussion, two sub-cases are distinguished.

Sub-case 1: The cell C(i, j) is inside the holes. Since x_ij(0) = 1, from equation (2) its initial output value is y_ij(0) = 1, and considering equations (8) and (2), y_ij(t) = 1. According to the hole-filler demands, the initial output of the non-diagonal black pixels inside the holes should not be changed, so the weights of a and b in equation (10) are equal to +4 and +1, respectively. Since A(i, j; k, l) > 1/R_x, the parameter b is found to be b > 1, so

4a + b + 2 ≥ 0,  b > 1.   (11a)

Sub-case 2: The cell C(i, j) is outside the holes. To satisfy equation (10), we need to check only the minimum value of its left-hand side. This occurs when there are four non-diagonal white pixels around the cell C(i, j), where the weight of a in equation (10) is -4. Since y_ij(t) = 1, the weight of b is equal to +1. Combining this with b > 1 gives

-4a + b + 2 ≥ 0,  b > 1.   (11b)

Case 2: The input value of cell C(i, j) is u_ij = -1, signalling a white pixel. Substituting this input value into equation (7) gives

dx_ij(t)/dt = -x_ij(t) + a [ y_(i-1)j(t) + y_(i+1)j(t) + y_i(j-1)(t) + y_i(j+1)(t) ] + b y_ij(t) - 5.   (12)

Sub-case 1: The cell C(i, j) is inside the holes. Since x_ij(0) = 1, from equation (2) its initial output value is y_ij(0) = 1.
According to the hole-filler demands, the holes should be filled by black pixels, whereas the initial black pixels remain unaltered:

dx_ij(t)/dt ≥ 0.   (13)

Combining equations (12) and (13) and considering x_ij(t) ≥ 1 yields

a [ y_(i-1)j(t) + y_(i+1)j(t) + y_i(j-1)(t) + y_i(j+1)(t) ] + b y_ij(t) - 6 ≥ 0,   (14)

where we use the minimum value of x_ij(t) in equation (12). Since the cell is inside the holes, the initial output of its non-diagonal black pixels remains unchanged, and the weights of a and b are equal to +4 and +1, respectively. Combining this with b > 1 gives

4a + b - 6 ≥ 0,  b > 1.   (15)

Sub-case 2: The cell C(i, j) is outside the holes. Since x_ij(0) = 1, from equation (2) its initial output value is y_ij(0) = 1. According to the hole-filler demands, the final output of this cell should be white, in this case y_ij(∞) = -1, so

dx_ij(t)/dt < 0.   (16)

Combining equations (12) and (16) and considering x_ij(t) ≤ 1, we get

a [ y_(i-1)j(t) + y_(i+1)j(t) + y_i(j-1)(t) + y_i(j+1)(t) ] + b y_ij(t) - 6 < 0,   (17)

where we use the maximum value of x_ij(t) in equation (12). Initially, y_ij(0) = 1. How can the output of cell C(i, j) be changed to -1? Where does this change begin? First we consider the situation where the change begins from the inside of the bipolar image. If the maximum value of the left-hand side of equation (17) is less than zero, equation (17) holds. Inside the image and outside the holes, the maximum weights of a and b are +4 and +1, respectively; this case was described by equation (15). In fact, the change of the output of the cell C(i, j) is like a wave propagating from the edges to the inside of the image, as verified by the simulated results. Therefore, we should first consider an edge cell C(i, j), i = 1 or M, j = 1 or N, for which the maximum weight of a in equation (17) is +3, which is also the maximum weight of a outside the holes.
The maximum weight of b is +1, occurring at the initial time:

3a + b - 6 < 0,  b > 1.   (18)

Combining cases 1 and 2, we obtain

3a + b - 6 < 0,
4a + b - 6 ≥ 0,   (19)
-4a + b + 2 ≥ 0.

4 Numerical integration algorithms

Realizing the CNN dynamics on a digital architecture requires discretization in time and suitable numerical integration algorithms. Three of the most widely used numerical integration algorithms are employed in the raster CNN simulation described here: the Euler algorithm, the RK-Gill algorithm discussed by Oliveira [8], and the RK-Butcher algorithm discussed by Bader [5, 6] and Murugesh and Murugesan [10, 11, 12].

4.1 Euler algorithm

Euler's method is the simplest of all algorithms for solving ODEs. It is an explicit formula which uses the Taylor-series expansion to calculate the approximation:

x_ij((n+1)τ) = x_ij(nτ) + τ f'(x(nτ)).   (20)

4.2 RK-Gill algorithm

The RK-Gill algorithm discussed by Oliveira [8] is an explicit method requiring the computation of four derivatives per time step. The increment of the state variable x_ij is stored in the constant k1^ij; this result is used in the next iteration for evaluating k2^ij, and the same is done for k3^ij and k4^ij:

k1^ij = f'(x_ij(nτ)),
k2^ij = f'(x_ij(nτ) + (1/2) k1^ij),
k3^ij = f'(x_ij(nτ) + (1/√2 - 1/2) k1^ij + (1 - 1/√2) k2^ij),   (21)
k4^ij = f'(x_ij(nτ) - (1/√2) k2^ij + (1 + 1/√2) k3^ij).

The final integration is a weighted sum of the four calculated derivatives:

x_ij((n+1)τ) = x_ij(nτ) + (1/6) [ k1^ij + (2 - √2) k2^ij + (2 + √2) k3^ij + k4^ij ].   (22)

4.3 RK-Butcher algorithm

The RK-Butcher algorithm discussed by Bader [5, 6] and Murugesh and Murugesan [10, 11, 12] is an explicit method. It starts with a simple Euler step. The increment of the state variable x_ij is stored in the constant k1^ij; this result is used in the next iteration for evaluating k2^ij.
The same must be done for k3^ij, k4^ij, k5^ij and k6^ij:

k1^ij = τ f'(x_ij(nτ)),
k2^ij = τ f'(x_ij(nτ) + (1/4) k1^ij),
k3^ij = τ f'(x_ij(nτ) + (1/8) k1^ij + (1/8) k2^ij),
k4^ij = τ f'(x_ij(nτ) - (1/2) k2^ij + k3^ij),   (23)
k5^ij = τ f'(x_ij(nτ) + (3/16) k1^ij + (9/16) k4^ij),
k6^ij = τ f'(x_ij(nτ) - (3/7) k1^ij + (2/7) k2^ij + (12/7) k3^ij - (12/7) k4^ij + (8/7) k5^ij).

The final integration is a weighted sum of five of the six calculated increments:

x_ij((n+1)τ) = x_ij(nτ) + (1/90) ( 7k1^ij + 32k3^ij + 12k4^ij + 32k5^ij + 7k6^ij ).   (24)

5 Simulated results

The hole-filler template has been simulated on a Pentium IV machine with 3.0 GHz speed, using the different numerical integration algorithms. The settling time t and the integration time ts obtained with various step sizes are displayed in Table 1. The settling time describes the time from the start of the integration until the last cell leaves the interval [-1.0, 1.0], based on a certain limit (e.g., |dx/dt| < 0.01). The simulation shows the desired output for every cell. We use +1 and -1 to indicate the black and white pixels, respectively.
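The three update rules of Section 4 can be written as one-step integrators. The sketch below is Python, not the authors' C++ simulator; note that equations (21) and (22) leave the step size τ implicit in the k's, so we place it explicitly where the standard RK-Gill method requires it:

```python
import math

def euler_step(f, x, tau):
    """One Euler step, eq. (20): x_{n+1} = x_n + tau * f(x_n)."""
    return x + tau * f(x)

def rk_gill_step(f, x, tau):
    """One RK-Gill step, eqs. (21)-(22): four derivative evaluations
    combined with the Gill weights (2 - sqrt(2)) and (2 + sqrt(2))."""
    s = 1.0 / math.sqrt(2.0)
    k1 = f(x)
    k2 = f(x + tau * 0.5 * k1)
    k3 = f(x + tau * ((s - 0.5) * k1 + (1.0 - s) * k2))
    k4 = f(x + tau * (-s * k2 + (1.0 + s) * k3))
    return x + (tau / 6.0) * (k1 + (2.0 - math.sqrt(2.0)) * k2
                              + (2.0 + math.sqrt(2.0)) * k3 + k4)

def rk_butcher_step(f, x, tau):
    """One RK-Butcher step, eqs. (23)-(24): six increments k1..k6,
    combined as x + (1/90)(7k1 + 32k3 + 12k4 + 32k5 + 7k6)."""
    k1 = tau * f(x)
    k2 = tau * f(x + k1 / 4.0)
    k3 = tau * f(x + k1 / 8.0 + k2 / 8.0)
    k4 = tau * f(x - k2 / 2.0 + k3)
    k5 = tau * f(x + 3.0 * k1 / 16.0 + 9.0 * k4 / 16.0)
    k6 = tau * f(x - 3.0 * k1 / 7.0 + 2.0 * k2 / 7.0
                 + 12.0 * k3 / 7.0 - 12.0 * k4 / 7.0 + 8.0 * k5 / 7.0)
    return x + (7.0 * k1 + 32.0 * k3 + 12.0 * k4
                + 32.0 * k5 + 7.0 * k6) / 90.0
```

On the test problem dx/dt = -x with x(0) = 1 and τ = 0.1, the three methods approximate exp(-0.1) with first-, fourth- and fifth-order accuracy respectively, in line with the accuracy ordering observed in Table 1.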
Figure 2: Image before (a) and after (b) hole filling

Table 1: Simulated results of the hole-filler template design example

            Euler algorithm        RK-Gill algorithm      RK-Butcher algorithm
Step size   Settling  Integration  Settling  Integration  Settling  Integration
            time (t)  time (ts)    time (t)  time (ts)    time (t)  time (ts)
0.5         6.5       2.5          6.8       2.4          5.8       2.4
0.6         15.5      12.7         16.4      13.7         11.4      12.5
0.7         32.5      28.3         32.0      27.4         30.0      27.2
0.8         35.0      30.7         34.6      30.0         32.4      29.6
0.9         36.8      32.6         36.6      32.0         34.2      31.6
1.0         37.9      33.6         37.6      33.0         36.0      32.8
1.5         44.8      36.8         45.7      36.9         41.1      36.0
2.0         47.4      43.2         48.2      43.6         46.2      42.8
2.5         50.6      45.6         52.6      44.5         48.3      44.7
3.0         53.5      49.3         54.8      50.2         52.3      49.2

The templates A, B and I are given as follows:

A = [0 1.0 0; 1.0 3.0 1.0; 0 1.0 0],   B = [0 0 0; 0 4 0; 0 0 0],   I = -1.0.

Using the simulation program developed in C++, the input image is shown in Figure 2(a) and the output image in Figure 2(b). The obtained results are represented in Table 1. From Table 1, we find that the RK-Butcher algorithm yields less settling time and integration time compared to the Euler and RK-Gill algorithms.

6 Conclusion

It has been shown that the cellular neural network based hole-filler template can be designed from its dynamic behaviour using different numerical algorithms, and that templates for other cellular neural networks can be designed similarly. The hole is filled and the image outside the holes remains the same. The templates of a cellular neural network are not unique, and this is important for its implementation.

Bibliography

[1] L. O. Chua, L. Yang, Cellular neural networks: theory, IEEE Transactions on Circuits and Systems, vol. 35, pp. 1257-1272, 1988.

[2] L. O. Chua, L. Yang, Cellular neural networks: applications, IEEE Transactions on Circuits and Systems, vol. 35, pp. 1273-1290, 1988.

[3] J. C.
Butcher, The numerical analysis of ordinary differential equations: Runge-Kutta and general linear methods, Chichester: John Wiley, 1987.

[4] J. C. Butcher, Numerical methods for ordinary differential equations, Chichester: John Wiley, 2003.

[5] M. Bader, A comparative study of new truncation error estimates and intrinsic accuracies of some higher order Runge-Kutta algorithms, Computers & Chemistry, vol. 11, pp. 121-124, 1987.

[6] M. Bader, A new technique for the early detection of stiffness in coupled differential equations and application to standard Runge-Kutta algorithms, Theoretical Chemistry Accounts, vol. 99, pp. 215-219, 1988.

[7] G. Devarajan, V. Murugesh, K. Murugesan, Numerical solution of second-order robot arm control problem using Runge-Kutta Butcher algorithm, International Journal of Computer Mathematics, vol. 83, pp. 345-356, 2006.

[8] S. C. Oliveira, Evaluation of effectiveness factor of immobilized enzymes using Runge-Kutta-Gill method: how to solve mathematical undetermination at particle center point?, Bioprocess Engineering, vol. 20, pp. 85-187, 1999.

[9] C. L. Yin, J. L. Wan, H. Lin, W. K. Chen, The cloning template design of a cellular neural network, Journal of the Franklin Institute, vol. 336, pp. 903-909, 1999.

[10] V. Murugesh, K. Murugesan, Comparison of numerical integration algorithms in raster CNN simulation, Lecture Notes in Computer Science, vol. 3285, pp. 115-122, 2004.

[11] V. Murugesh, K. Murugesan, Simulation of cellular neural networks using the RK-Butcher algorithm, International Journal of Management and Systems, vol. 21, pp. 65-78, 2005.

[12] V. Murugesh, K. Murugesan, Simulation of time-multiplexing cellular neural networks with numerical integration algorithms, Lecture Notes in Computer Science, vol. 3991, pp. 115-122, 2005.

V. Murugesh
Department of Information and Communication Engineering
Hannam University
133 Ojung-dong, Daeduk-gu, Daejeon 306-791, Republic of Korea
E-mail: murugesh72@gmail.com

K.
Batri
Department of Computer Science and Engineering
Muthayammal Engineering College
Rasipuram 637 408, India
E-mail: krishnan.batri@gmail.com

Received: July 25, 2007

Dr. V. Murugesh obtained his Bachelor of Science in Computer Science and Master of Computer Applications degrees from Bharathiar University, Coimbatore, India, in 1992 and 1995 respectively. He completed his PhD in Computer Science at Bharathidasan University, Tiruchirappalli, India, in 2006. He has held various positions at the National Institute of Technology, Tiruchirappalli, India and at Sona College of Technology, Salem, India. Currently, he is working as Assistant Professor in the Department of Information and Communication Engineering at Hannam University, Daejeon, Republic of Korea. His fields of interest are neural network based image processing and scientific computing. He has published more than 30 technical papers in international and national journals and conferences.

Krishnan Batri received the M.E. from Madurai Kamaraj University in 2003. He is a research scholar with the Department of Computer Science and Engineering at the National Institute of Technology, Tiruchirappalli, Tamil Nadu, India. Currently he is working as an Assistant Professor with the Department of Computer Science and Engineering at Muthayammal Engineering College, Rasipuram, Tamil Nadu, India. His research interests include information retrieval, data fusion and genetic algorithms.

Int J Comput Commun, ISSN 1841-9836, 9(1):71-78, February, 2014.

Energy Efficient Key Management Scheme for Wireless Sensor Networks

N. Suganthi, V.
Sumathy

N. Suganthi*
Dept. of Information Technology
Kumaraguru College of Technology, Coimbatore-49
*Corresponding author: suganthiduraisamy@yahoo.co.in

V. Sumathy
ECE Department
Government College of Technology, Coimbatore-13
sumi_gct2001@yahoo.co.in

Abstract: Designing an efficient key establishment scheme is of great importance to data security in wireless sensor networks. Traditional cryptographic techniques are impractical in wireless sensor networks because of the associated high energy and computational overheads. The proposed algorithm supports the establishment of three types of keys for each sensor node: an individual key shared with the base station, a pairwise key shared with each neighbour sensor node, and a group key that is shared by all the nodes in the network. The algorithms used for establishing and updating these keys are energy efficient and minimize the involvement of the base station. A polynomial function is used to calculate the keys during initialization, membership change and key compromise, and the keys are updated periodically. To overcome the problems of energy insufficiency and limited memory storage, and to provide adequate security, this energy efficient scheme is proposed. It works well in undefined deployment environments, and unauthorized nodes are not allowed to establish communication with network nodes. This scheme, when compared with other existing schemes, has a very low overhead in computation, communication and storage.
Keywords: key management, sensor nodes, polynomial function

1 Introduction

Tiny sensor nodes, which consist of sensing, data processing and communicating components, leverage the idea of sensor networks based on the collaborative effort of a large number of nodes. Sensor nodes are deployed in hostile environments or over large geographical areas. The nodes can either have a fixed location or be randomly deployed to monitor the environment.
The nodes then sense environmental changes and report them to other nodes over a flexible network architecture. They have thus found application domains in battlefield communication, homeland security, pollution sensing and traffic monitoring. The limiting factors of sensor nodes are their limited battery power and small memory capacity. To control information access in a sensor environment, only authorized nodes must know the key used to disseminate information, a key that is unknown to the compromised nodes. The communication keys may be pairwise [5] [7] [8] or group-wise [1], and these keys have to be updated to maintain security and resilience to attacks. Some of the proposed work is based on static schemes [7] [9] and some on dynamic schemes [1] [4]. Though many protocols have been designed for the purpose of security in sensor environments, node compromise is rarely or insufficiently investigated, and most of these protocols have a weak resilience to attacks [13]. In this paper, we propose a key management scheme for WSNs in which the pairwise keys and the group key are set up through the broadcast information during the network initialization phase, with no further message exchange needed afterwards. Consequently, the communication overhead is very low, and the compromise of some sensor nodes does not affect any non-compromised pairwise keys. For the establishment of keys for new nodes, we propose a composite mechanism based on an algorithm in which resource consumption, as well as the transmission of information, can be kept very low: only the polynomial identifier needs to be communicated to the nodes for establishing the group key and the pairwise keys. The rest of this paper is organized as follows. In Section 2, some related work and its drawbacks are discussed.

Copyright © 2006-2014 by CCC Publications
In Section 3, the energy-efficient key management scheme is presented; in Sections 4 and 5, the security and performance of the energy efficient key management scheme are analyzed. Finally, Section 6 deals with the conclusion.

2 Related works

Many pairwise key distribution schemes [5] [7] [8] [9] have been developed for peer-to-peer wireless sensor networks and heterogeneous networks [6] [12] [14]. In one of the hierarchical schemes [1], the base node calculates the group key using partial keys in a bottom-up fashion. The partial key of a child node is generated using a random number and is passed to its parent to calculate the parent's partial key; this proceeds bottom-up until the group key is finally calculated. The partial keys are calculated using a function, expressed as

f(k1, k2) = α^(k1+k2) mod p,   (1)

where p is a prime number and k1, k2 are the partial keys. The decision on the number of partial keys is based on the key size demanded by the security requirements and the corresponding energy consumption. To guarantee that all the nodes in a group have received the information, they send a reply (REP) message; if the cluster head does not get a REP from every node, it re-broadcasts. When a new node joins the group, the group key is recalculated and the cluster head again broadcasts the newly created group key to all the nodes in the group. The same is repeated when a node leaves the group, so that the deleted node does not know the newly created key. Communication also takes place between cluster heads. Owing to the poor memory capacity and low power of sensor nodes, it is difficult to store all the partial keys, and the communication becomes costly as the group key needs to be broadcast each time it is created or changed. In our energy efficient key management scheme the group key need not be broadcast at all. In a tree-based key management protocol [2], each sensor node is pre-deployed with three keys.
One of the keys is used for initial communication, i.e., for key exchange and tree spanning. After the tree is spanned, this key is deleted from the memory of the sensor node. For further communication, the remaining two keys are used: the second key is symmetric and is used to encrypt (or decrypt) information sent from the child to the parent; the third key is also symmetric and is used to encrypt (or decrypt) information sent from the parent to the child. The two keys are used to make the task of a cryptanalysis attacker difficult. The disadvantage of this scheme is that an attacker gaining (physical) access to a sensor node can obtain the information. Xing Zhang et al. [3] proposed an energy-efficient distributed deterministic key management protocol (EDDK). Though this scheme provides higher security than the two schemes above, namely the hierarchical and tree-based protocols, it requires a large memory to store data. Sencun Zhu et al. [10] proposed LEAP: efficient security mechanisms for large scale distributed sensor networks, and Du et al. [11] proposed a scheme using deployment knowledge.

3 Proposed scheme: energy-efficient key management scheme

To fix the flaws present in the existing key management schemes, we propose an energy efficient key management scheme for WSNs. This scheme mainly focuses on the establishment and maintenance of the pairwise keys as well as the group key. Unlike the hierarchical and tree-based key management schemes, this scheme does not require additional memory for storing the keys after deployment. In this scheme, no key is ever broadcast. Each and every node generates the group key and the pairwise keys using one of the polynomial functions, identified with the help of its ID. To enhance message security in the network initialization phase, each sensor node makes use of its own individual key.
The base station computes the individual keys of all the nodes using the unique keys and the IDs that it has stored. This method also enhances security in data transmission through periodic key updates. To avoid replay attacks, a sequence number is used.

3.1 Overall system description

The diagram (Figure 1) shows the overall system description. Keys are established after node initialization. If any node joins the system or a node is compromised, a key update is performed; after detecting a compromised node, the keys are removed from the node's memory. The overall system is designed to reduce the computation overhead, and it requires little communication between nodes.

Figure 1: System model

3.2 Spanning of tree

Before deployment, every sensor node is pre-distributed with a network-wide shared pseudorandom function (RF) and an initial key. It is assumed that each node is tamper proof, so it will not be affected by a capture attack. The pseudorandom function and the initial key are used by every node in the network to compute its own individual key, which is used for initial communication with the base station. For example, node A's individual key can be computed as per equation (2):

K_A = RF(K_I, ID_A),   (2)

where K_I is the initial key, ID_A is the individual node identifier, K_A is the individual key shared with the base station, and RF is the shared pseudorandom function. The hello message which is used to span the tree is also encrypted using this individual key. The hello message contains the ID of the sender and the hello keyword. The base station broadcasts the hello message; the nodes that reply to it become the children of the base station. Here the acknowledgment of the child node is essential to accept the node as a child. These nodes then broadcast the hello message to other nodes, and the nodes that reply become their children. The spanning of the tree stops when the nodes no longer get replies to the hello message.
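Equation (2) can be sketched as follows. The paper leaves RF unspecified, so HMAC-SHA256 stands in here as a hypothetical choice of network-wide pseudorandom function; the key and ID values are illustrative only:

```python
import hmac
import hashlib

def individual_key(initial_key: bytes, node_id: bytes) -> bytes:
    """Individual key K_A = RF(K_I, ID_A), eq. (2).  HMAC-SHA256 stands
    in for the unspecified pseudorandom function RF; any node (and the
    base station, which stores K_I and all IDs) can recompute the key."""
    return hmac.new(initial_key, node_id, hashlib.sha256).digest()

k_i = b"network-initial-key"        # hypothetical pre-distributed K_I
k_a = individual_key(k_i, b"node-A")
```

Because the base station stores K_I and every node ID, it can recompute any node's individual key on demand; two distinct node IDs yield distinct keys with overwhelming probability.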
3.3 Key establishment phase

The base station transmits the function identifier and a random number, encrypted with the individual key, to the individual nodes, as shown in equation (3):

E_{K_A}(PF_id, RN),   (3)

where PF_id is the function identifier and RN is a random number.

Pairwise key

After the sensor nodes are placed in the sensor field, each sensor node communicates with its neighbours via pairwise keys. Such a key is used to make sure that a message to the intended neighbour node is not known by the other neighbour nodes. Nodes A and B are used to show the calculation of the pairwise key using equation (4). Let us consider that node A wants to communicate with node B:

K_AB = PF_id(RN, ID_A),   (4)

where K_AB is the pairwise key, RN is the random number, and ID_A is the identifier of node A. The polynomial function takes the random number and the ID of the node which initiates the communication as inputs to calculate the pairwise key. The ID of the polynomial function is used to identify one function from the set of polynomial functions. The base node communicates the random number and the ID of the polynomial function to all the nodes. After calculating the pairwise key, node A transmits the message encrypted with this key to node B, along with its own ID in plain text. Node B then calculates the pairwise key, as it has all the information needed for the calculation, and uses the key to decrypt the message transmitted by node A. Each node thus calculates the pairwise key knowing only the initiator's ID. As no transmission of ID or key information takes place, communication overhead is avoided.

Group key

A group key is a key shared by all the nodes in the network. It is needed when the base station is distributing a secure message (e.g., a query on some event of interest or a confidential instruction) to all the sensor nodes in the network. In a conventional method the parent encrypts the message m with its cluster key and then broadcasts it.
Each neighbour receiving the message decrypts it to get m, re-encrypts it with its own cluster key, and then transmits it; this process is repeated until all the nodes receive the message. However, this method has a drawback: every node has to encrypt and decrypt the message, thus consuming a large amount of energy on computation. So encryption using a group key is the most desirable approach from the performance point of view. The simple way to provide the group key is to preload every node with it; an important problem that arises immediately is the secure updating of this key when a compromised node is detected. In our proposed scheme, a group key is used to enable base station-to-node communication. The group key generator that is present in all the nodes is used for generating the group key using equation (5). The random number that is transmitted to each node is also involved in the key calculation. The group key is calculated as follows:

K_G = PF_id(RN, GK),   (5)

where K_G is the group key, PF_id is the polynomial function, RN is the random number, and GK is the group key generator. A timer is set for the re-establishment of the key; when the timer reaches the assigned threshold value, re-keying is done by changing the coefficients of the polynomial function. The previously calculated keys are deleted periodically. Therefore, even if an adversary could compromise some legitimate nodes, it still could not compute the pairwise keys and the group key. Note that each sensor node only needs to broadcast one communication message during the key establishment phase, with no further message exchange required for key calculation; thus, the communication overhead can be very low. During the data transfer phase a sequence number is used to index the message transfers between the nodes.
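Equations (4) and (5), together with the coefficient-shift re-keying of Section 3.4, can be sketched as follows. The polynomial family, its coefficients, the prime modulus and the way RN is combined with the ID are all hypothetical stand-ins, since the paper does not fix them:

```python
P = 2_147_483_647                     # hypothetical shared prime modulus
POLYNOMIALS = {                       # hypothetical pre-distributed family,
    0: [17, 3, 29],                   # indexed by PF_id; f0(z) = 17 + 3z + 29z^2 mod P
    1: [5, 11, 7],
}

def poly_eval(coeffs, z, p=P):
    """Horner evaluation of a polynomial modulo p."""
    acc = 0
    for c in reversed(coeffs):
        acc = (acc * z + c) % p
    return acc

def pairwise_key(pf_id, rn, initiator_id):
    """K_AB = PF_id(RN, ID_A), eq. (4): both endpoints compute the key
    locally once they know PF_id, RN and the initiator's ID."""
    return poly_eval(POLYNOMIALS[pf_id], rn + initiator_id)

def group_key(pf_id, rn, gk):
    """K_G = PF_id(RN, GK), eq. (5), with GK the group key generator."""
    return poly_eval(POLYNOMIALS[pf_id], rn + gk)

def update_coefficients(coeffs, constant, p=P):
    """Re-keying step of Section 3.4: each coefficient is shifted by a
    shared constant and reduced mod p, so every node derives the new
    polynomial without any key material being transmitted."""
    return [(c + constant) % p for c in coeffs]
```

Only PF_id and RN ever travel over the air; the polynomials themselves, and hence the keys, never do.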
Once the sequence number reaches the threshold value that has been set, it is reset to 1 again; this prevents replay attacks. The sequence number makes it possible to know the number of messages sent and received, and it is also used for receiving acknowledgements.

3.4 Key update phase

The pairwise keys and the group key should be updated to avoid cryptanalysis and to prevent attacks from adversaries after one or more sensor nodes are compromised. Also, after the threshold time, nodes need to update the keys using the same formulas as in equations (4) and (5). The coefficients of the polynomial function are changed by adding a constant to the previous values and taking the modulus to obtain the new coefficients. This makes the key update easy and avoids communication overhead.

4 System analysis

For system analysis, we have implemented the key management algorithm in MATLAB and compared it with EDDK, the tree-based protocol and the hierarchical scheme.

4.1 Computation costs

Computation costs are measured in terms of the number of encryptions required to change the keys in the event of node compromise and node addition. When a node is added, only the new random number and the new ID of the polynomial function are transmitted by the base station. The individual nodes receive them and compute the group key and pairwise keys using the formulas. Calculation using the polynomial function consumes little energy, whereas in the other schemes a lot of encryption and decryption is involved in the key update, consuming a lot of battery energy. The comparison between the existing schemes and the proposed scheme is made in terms of time. The graph (Figure 2) shows the computation time differences between the various schemes and the proposed scheme, depending on the number of nodes, during the initial key calculation.

4.2 Memory requirement for key storage

Let x represent the number of neighbouring nodes around a sensor and n be the number of polynomial functions.
Figure 2: Time required to calculate the initial key

Each sensor node has x storage units for the pairwise keys, nt + n storage units for the t-degree polynomial functions, two storage units for the random number and the pseudorandom function, and a single storage unit for the group key. In terms of the memory required to store keys, the proposed scheme needs less memory than the other schemes and hence provides scalability: even when the number of nodes increases, the memory required to store the keys remains the same. On the contrary, the tree-based protocol requires more memory as the number of nodes increases. The graph (Figure 3) shows the comparison of the memory requirement for all schemes; the proposed scheme needs less memory even when the number of nodes increases.

Figure 3: Memory requirement for key storage

4.3 Communication overhead for key exchange

Communication cost is measured in terms of the number of messages that need to be exchanged in order to update the existing keys as a result of events such as the addition of a new node, node compromise, and key refreshing at regular intervals. The communication overhead of the existing schemes is higher than that of the proposed scheme, because they need to exchange the keys to enable communication. The proposed scheme only requires transmitting the IDs of the polynomial functions and the random numbers, so the overhead for key exchange is minimal. As shown in Figure 4, as the number of nodes increases, the communication overhead increases for the tree-based protocol, whereas EDDK and the proposed scheme require less communication overhead.

Figure 4: Communication overhead for key exchange

5 Security analysis

Adversaries with a single compromised node and adversaries with n compromised nodes are chosen selectively.
In all cases we study their impact on the desired network properties, assuming that the adversary acts maliciously at different layers of the communication protocols. Insiders are adversaries that can compromise nodes or otherwise hold a valid identity in the network with appropriate key material. Insiders therefore have the same capabilities as outsiders, plus the ability to participate in the network protocols and deviate from their normal behaviour. Stronger security considerations have to be taken into account for insiders, and a minimum level of fault tolerance against inside attackers has to be designed into the network. In our proposed algorithm, every node is loaded with the set of polynomials, and each time one function is used to calculate the key. Adversary nodes cannot generate these polynomial functions, nor do they know which function will be used to calculate the key at a given time. The random number is communicated to the individual nodes by the base station after encryption with the secret key shared by the base station and the individual node.

6 Conclusion

The key exchange problem for sensor networks has been introduced, and it is believed that information can be secured by not exchanging the keys directly. A mechanism that makes use of pre-deployed functions has been proposed to fulfil this idea. By using this mechanism, the impact of many attacks on wireless sensor networks can be limited. The scheme incorporates a mechanism that allows for scalability of memory in the sensor nodes. By comparing the proposed scheme with the existing schemes, it becomes clear that the memory required to store the key information is lower; similarly, the communication overhead for exchanging the keys is also low. Because of the increased complexity of the algorithm, the immunity of the sensor network towards various attacks has been greatly increased. Thus the proposed mechanism shields the network from various attacks.
Bibliography

[1] Biswajit Panja; Sanjay Madria; Bharat Bhargava; Energy-efficient group key management protocols for hierarchical sensor networks, Int. J. of Distributed Sensor Networks, Taylor & Francis Group, 201-223, doi:10.1080/15501320701205225, 2007.

[2] Messai, L.; Aliouat, M.; Seba, H.; Tree based protocol for key management in wireless sensor networks, EURASIP J. on Wireless Communications and Networking, Article ID 910695, doi:10.1155/2010/910695, 2010.

[3] Xing Zhang; Jingsha He; Qian Wei; EDDK: Energy-efficient distributed deterministic key management for wireless sensor networks, EURASIP J. on Wireless Communications and Networking, Article ID 765143, doi:10.1155/2011/765143, 2011.

[4] Eltoweissy, M.; Moharrum, M.; Mukkamala, R.; Dynamic key management in sensor networks, IEEE Communications Magazine, 44(4):122-130, 2006.

[5] Du, W.; Deng, J.; Han, Y.S.; Varshney, P.K.; Katz, J.; Khalili, A.; A pairwise key predistribution scheme for wireless sensor networks, ACM Trans. on Information and System Security, 8(2):228-258, 2005.

[6] Jen-Yan Huang; I-En Liao; Hao-Wen Tang; A forward authentication key management scheme for heterogeneous sensor networks, EURASIP J. on Wireless Communications and Networking, Article ID 296704, doi:10.1155/2011/296704, 2011.

[7] Eschenauer, L.; Gligor, V.D.; A key-management scheme for distributed sensor networks, Proc. of the 9th ACM Conference on Computer and Communications Security, Washington, DC, USA, 41-47, 2002.

[8] Chan, H.; Perrig, A.; Song, D.; Random key predistribution schemes for sensor networks, Proc. of IEEE Symposium on Security and Privacy, 197-213, 2003.

[9] Liu, D.; Ning, P.; Establishing pairwise keys in distributed sensor networks, Proc. of the 10th ACM Conference on Computer and Communications Security (CCS '03), Washington, DC, USA, 52-61, 2003.

[10] Sencun Zhu; Sanjeev Setia; Sushil Jajodia; LEAP: Efficient security mechanisms for large scale distributed sensor networks, Proc.
of the 10th acm conference on computer and communications security, pp.62-72. [11] du,w.; deng,j.; han,y.s.; chen,s.; varshney,p.; a key management scheme for wireless sensor networks using deployment knowledge, proc. ieee infocom04, 586-597, 2004. [12] kausar,f.; hussain,s.; yang,l.t.; masood,a.; scalable and efficient key management for heterogeneous sensor networks, j. of supercomputing, 45(1):44-65, 2008. [13] xiao,y.; rayi,v.k.; sun,b.; du,x.; hu,f.; galloway,m.; a survey of key management schemes in wireless sensor networks,j. of computers communications, 30(11-12):2314-2341, 2007. [14] du,x.; xiao, y.; guizani, m.; chen,h.h.; an effective key management scheme for heterogeneous sensor networks, j. of ad hoc networks, 5(1):24-34, 2007. international journal of computers, communications & control vol. i (2006), no. 3, pp. 33-39 descriptive timed membrane petri nets for modelling of parallel computing emilian guţuleac abstract: in order to capture the compartmentation and behaviour of membrane systems for modelling of parallel computing, we introduce the descriptive dynamic rewriting descriptive membrane timed petri nets (dm-nets) that can at in run-time modify their own structure by rewriting some of their descriptive expression components. furthermore, this descriptive approach facilitates the understanding of complex models and their component-based construction as well as the application of modern computer engineering concepts. keywords: descriptive petri nets, membrane systems, modelling, parallel computing. 1 introduction recent technological achievements require advances beyond the existing computational models in order to be used effectively. pragmatic aspects of current and future computer systems will be modelled so that realistic estimates of efficiency can be given for algorithms in these new settings. 
petri nets (pn) are a very popular formalism for the analysis and representation of parallel and distributed computing in concurrent systems, one that has drawn much attention for the modelling and verification of such systems [1]. p systems, also referred to as membrane systems, are a class of parallel and distributed computing models [6]. the interest in relating p systems to the pn model of computation has led to several important results on simulation and decidability issues. efforts have been made to simulate p systems with petri nets [2, 5, 7] in order to verify many useful behavioural properties such as reachability, boundedness, liveness, termination, etc. in this paper we propose a new approach that expresses the components of continuous-time p systems [6] through components of descriptive petri nets (pn) using descriptive expressions (de) [3] for the modelling of parallel computing. the de are used for the analytical representation and compositional construction of pn models. to model the specific rules of p systems within the framework of descriptive rewriting timed pn (rtn) [4], we introduce a new extension, the descriptive membrane rtn, called dm-nets, that can modify their own structures dynamically by applying rewriting rules to some of their components.
2 labeled extended petri nets in this section we define a variant of pn called labeled extended pn. let l = l_p ⊎ l_t be a set of labels. each place p_i carries a local-state label l(p_i) ∈ l_p and each transition t_j carries an action label l(t_j) ∈ l_t. a labeled extended pn is a structure γ = ⟨p, t, pre, post, test, inh, g, pri, kp, l⟩, where: p is the finite set of places and t is a finite set of transitions such that p ∩ t = ∅.
in the graphical representation, a place is drawn as a circle and a transition as a black bar. pre, test, inh : p × t × n^{|p|} → n are respectively the forward flow, test and inhibition functions, and post : t × p × n^{|p|} → n is the backward flow function, over multisets of p; together they define the set of arcs a and describe the marking-dependent cardinality of the arcs connecting transitions and places. the set a is partitioned into three subsets: a_d, a_h and a_t. the subset a_d contains the directed arcs, which can be seen as a_d : ((p × t) ∪ (t × p)) × n^{|p|} → n and are drawn as single arrows. the inhibitor arcs a_h : (p × t) × n^{|p|} → n are drawn with a small circle at the end. the test arcs a_t : (p × t) × n^{|p|} → n are directed from a place to a transition and are drawn as dotted single arrows; a test arc does not consume the content of its source place. an arc is drawn only if its cardinality is not identically zero; the cardinality is written next to the arc, with a default value of 1. g : t × n^{|p|} → {true, false} is the guard function of transitions: for t ∈ t, the guard g(t, m) is evaluated in each marking, and if it evaluates to true the transition t may be enabled, otherwise t is disabled (the default value is true). pri : t → n defines the firing priority of each transition, mapping transitions onto natural numbers that represent their priority level; the enabling of a transition with higher priority disables all transitions of lower priority. kp : p → n is the capacity of places, infinite by default. l : t ∪ p → l is a labeling function that assigns a label to each transition and place; it may map distinct transitions to the same action name, so that l(t_j) = l(t_k) = α with t_j ≠ t_k, and likewise l(p_i) = l(p_n) = β with p_i ≠ p_n. a marked labeled extended pn is a pair n = ⟨γ, m_0⟩, where γ is a labeled pn structure and m_0 is the initial marking of the net. (copyright © 2006 by ccc publications; selected paper from icccc 2006.)
m : p → n is the current marking of the net, described by a symbolic vector-column m = (m_i p_i), m_i ≥ 0, ∀p_i ∈ p, where (m_i p_i) denotes the number m_i of tokens in place p_i. the marking m is the state of the net, assigning to each place its tokens, represented by black dots. the details concerning the enabling and firing rules and the evolution of n = ⟨γ, m_0⟩ can be found in [3], as they require a great deal of space.
3 descriptive expressions of petri nets due to space restrictions we give only a brief overview of this topic and refer the reader to [3, 4] and the references therein. in the following, by abuse of notation, labels and names of transitions/places are identified. we use the concept of a basic descriptive element (bde) for a basic pn (bpn), introduced in [2], as follows: bde = |^{α_j}_{t_j} m^0_i p_i [w^+_i, w^-_i] |^{α_k}_{t_k}. the translation of this bpn is shown in figure 1a, where t_j = •p_i is the input transition (of action type α_j) and t_k = p_i• is the output transition (of action type α_k) of the place p_i ∈ p with initial marking m^0_i, and the flow-relation functions w^+_i = post(t_j, p_i) and w^-_i = pre(p_i, t_k) return the multiplicity of the input and output arcs of the place p_i ∈ p, respectively. the derivative elements of bde are: for p_i• = ∅, w^-_i = 0, the element |^{α_j}_{t_j} m^0_i p_i[w_i] with final place p_i of t_j; and for •p_i = ∅, w^+_i = 0, the element m^0_i p_i[w_i] |^{α_k}_{t_k} with entry place p_i of t_k. if the initial marking m^0_i of a place is zero tokens, we may omit m^0_i in the bde. by default, if the action type α is not mentioned, it matches the name of the corresponding transition t. from a bde we can build more complex de of pn components by using composition operations. also by default, if w^+_i = w^-_i = 1, we write a bde and its derivatives as |^{α_j}_{t_j} m^0_i p_i |^{α_k}_{t_k}, |^{α_j}_{t_j} m^0_i p_i, or m^0_i p_i |^{α_k}_{t_k}. a descriptive expression (de) of a labeled pn is either a bde or a composition of de of a net n: de ::= bde | de ∗ de | ◦de, where ∗ represents any binary composition operation and ◦ any unary operation.
descriptive compositional operations. in the following, by default, the labels of n are encoded in the names of the transitions and places. the composition operations are reflected at the level of the de components of n models by fusion of places, by fusion of transitions with the same type and same name (label), or by sharing of subnets. place-sequential operation. this binary operation, denoted by the sequential operator "|", determines the logic of an interaction between two local states p_i (pre-condition) and p_k (post-condition) through an action t_j, which stand in a precedence/succession (causality-consequence) relation relative to this action. the sequential operator is the basic mechanism for building de of n models. this operation is associative, reflexive and transitive, but not commutative. the fact that de_1 = m^0_i p_i[w_i] |^{α_j}_{t_j} m^0_k p_k[w_k] ≠ m^0_k p_k[w_k] |^{α_j}_{t_j} m^0_i p_i[w_i] means that the conditions (local state) associated with the place-symbol p_i are always fulfilled before the occurrence of the conditions associated with the place-symbol p_k by means of the action t_j. also, the pn modelling of the iteration operation is obtained by fusing the head (entry) place with the tail (final) place bearing the same name (closing operation) in the de which describes this net. the self-loop of the net n_2 is described by de_2 = m^0_i p_i[w_i] |^{α_j}_{t_j} p_i[w_i] = m^0_i p̃_i[w_i] |^{α_j}_{t_j}; here "p̃" is the test operator, i.e. it represents a test arc. the translation of de_2 into n_2 is shown in figure 2b. inhibition operation. this unary operation is represented by the inhibitor operator (place-symbol with overbar); de_3 = m^0_i p̄_i[w_i] |^{α_j}_{t_j} describes an inhibitor arc with weight w_i = inh(p_i, t_j). synchronization operation.
this binary operation is represented by the "•" or "∧" join operator and describes the rendez-vous synchronization (by a transition t_j) of two or more conditions represented by place-symbols p_i ∈ •t_j, i = 1..n, i.e. it indicates that all preceding conditions must have been completed before the action occurs. this operation is commutative, associative and reflexive. split operation. this binary operation is represented by the "♦" split operator and describes the causal relation between an activity t_j and its post-conditions: after completion of the preceding action t_j, several post-conditions can occur concomitantly in parallel ("message sending"). the split operation is likewise commutative, associative and reflexive. competing parallelism operation. this compositional binary operation is represented by the "∨" competing parallelism operator, and it can be applied over two nets n_a with de_a = a and n_b with de_b = b, or internally, in a resulting net n_r with de_r = r, between the places of the single n_r, in which case the symbol-places with the same name are fused. we can represent the resulting de_r = a ∨ b as a set of ordered pairs of places with the same name to be fused, the first element belonging to a and the second to b. the fused places inherit the arcs of the corresponding places in a and b. this compositional binary operation is also commutative, associative and reflexive. precedence relations between the operations. we introduce the following precedence relations between the compositional operations in a de: a) the operations in a de are evaluated left-to-right; b) a unary operation binds stronger than a binary one; c) the "•" operation is superior to "|" and "♦", which in turn are superior to the "∨" operation. further details on the definitions, enabling and firing rules, and evolution of n can be found in [3], as they require a great deal of space.
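as a rough illustration (not the author's notation or tool), the composition operations above can be modelled as an expression tree, with python operators standing in for the paper's "|", "•", "♦" and "∨"; all names here are chosen for the sketch.

```python
# Illustrative sketch: descriptive expressions as an AST, with Python
# operators standing in for the paper's composition operators:
#   a | b -> place-sequential, a & b -> synchronization ("•"),
#   a ^ b -> split ("♦"),      a + b -> competing parallelism ("∨").
from dataclasses import dataclass

@dataclass(frozen=True)
class DE:
    op: str        # 'place', 'trans', 'seq', 'sync', 'split', 'par'
    args: tuple = ()

    def __or__(self, other):  return DE('seq',   (self, other))
    def __and__(self, other): return DE('sync',  (self, other))
    def __xor__(self, other): return DE('split', (self, other))
    def __add__(self, other): return DE('par',   (self, other))

def place(name, tokens=0): return DE('place', (name, tokens))
def trans(name):           return DE('trans', (name,))

def places(de):
    """Collect the place names occurring in a descriptive expression.
    Places with the same name model the fusion performed by composition."""
    if de.op == 'place':
        return {de.args[0]}
    if de.op == 'trans':
        return set()
    return set().union(*(places(a) for a in de.args))

# de_2 from the text: p_i |t_j p_i, a self-loop, i.e. the test operator
de2 = place('p_i', 1) | trans('t_j') | place('p_i')
assert places(de2) == {'p_i'}
```

the point of the sketch is only that a de is a term over places and transitions, so composition and later rewriting (replacing a subterm) are ordinary tree operations.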
4 dynamic rewriting petri nets in this section we introduce the model of descriptive dynamic net rewriting pn systems. let ρ ⊆ x × y be a binary relation, with domain dom(ρ) and codomain cod(ρ). let a = ⟨pre, post, test, inh⟩ be the set of arcs belonging to a net γ. a descriptive dynamic rewriting pn system is a structure rn = ⟨γ, r, φ, g_tr, g_r, m⟩, where: γ = ⟨p, t, pre, post, test, inh, g, pri, kp, l⟩; r = {r_1, ..., r_k} is a finite set of rewriting rules concerning the run-time structural modification of the net, such that p ∩ t ∩ r = ∅. in the graphical representation, a rewriting rule is drawn as two embedded empty rectangles. we let e = t ∪ r denote the set of events of the net; φ : e → {t, r} is a function indicating, for every event, whether it is a transition or a rewriting rule; g_tr : r × n^{|p|} → {true, false} and g_r : r × n^{|p|} → {true, false} are, respectively, the transition-rule guard function and the rewriting-rule guard function, defined for each rule r ∈ r. for every r ∈ r, g_tr and g_r are evaluated in each marking; if they evaluate to true, the rewriting rule r may be enabled, otherwise it is disabled. the default value of g_tr is true and that of g_r is false. let rn = ⟨rγ, m⟩ and rγ = ⟨γ, r, φ, g_tr, g_r⟩ be described by the descriptive expressions de_rγ and de_rn, respectively. a dynamic rewriting structure-modifying rule r ∈ r of rn is a map r : de_l ¤ de_w, where the codomain of the rewriting operator ¤ is a fixed descriptive expression de_l of a subnet rn_l of the current net rn, with rn_l ⊆ rn, p_l ⊆ p, e_l ⊆ e and set of arcs a_l ⊆ a, and the domain of ¤ is a descriptive expression de_w of a new subnet rn_w with p_w ⊆ p, e_w ⊆ e and set of arcs a_w.
the rewriting operator ¤ represents a binary operation which produces a structural change in de_rn and in the net rn by replacing (rewriting) the fixed current de_l of the subnet rn_l (de_l and rn_l are dissolved) with the new de_w of the subnet rn_w, which then belongs to the modified resulting de_rn′ of the net rn′ = (rn \ rn_l) ∪ rn_w, with p′ = (p \ p_l) ∪ p_w, e′ = (e \ e_l) ∪ e_w and a′ = (a \ a_l) ∪ a_w, where the meaning of \ (respectively ∪) is the operation of removing rn_l from (adding rn_w to) the net rn. in the new net rn′, obtained by the execution (firing) of an enabled rewriting rule r ∈ r, the places and events with the same attributes belonging to rn′ are fused, respectively. by default, the rewriting rules r : de_l ¤ ∅ and r : ∅ ¤ de_w describe rules for which rn′ = (rn \ rn_l) and rn′ = (rn ∪ rn_w) hold, respectively. a state of a net rn is a pair (rγ, m), where rγ is the configuration of the net together with a current marking m. the pair (rγ_0, m_0), with p_0 ⊆ p, e_0 ⊆ e and marking m_0, is called the initial state of the net. enabling and firing of events. the enabling of events depends on the marking of all places. we say that a transition t_j of an event e_j is enabled in the current marking m if the following enabling condition ec(t_j, m) is verified: ec(t_j, m) = (∧_{p_i ∈ •t_j} (m_i ≥ pre(p_i, t_j))) ∧ (∧_{p_k ∈ ◦t_j} (m_k < inh(p_k, t_j))) ∧ (∧_{p_l ∈ ∗t_j} (m_l ≥ test(p_l, t_j))) ∧ (∧_{p_n ∈ t_j•} ((kp_n − m_n) ≥ post(p_n, t_j))) ∧ g(t_j, m). similarly, a rewriting rule r_j ∈ r is enabled in the current marking m if the following enabling condition ec_tr(r_j, m) is verified: ec_tr(r_j, m) = (∧_{p_i ∈ •r_j} (m_i ≥ pre(p_i, r_j))) ∧ (∧_{p_k ∈ ◦r_j} (m_k < inh(p_k, r_j))) ∧ (∧_{p_l ∈ ∗r_j} (m_l ≥ test(p_l, r_j))) ∧ (∧_{p_n ∈ r_j•} ((kp_n − m_n) ≥ post(p_n, r_j))) ∧ g(r_j, m). let t(m) and r(m) be, respectively, the sets of transitions and rewriting rules enabled in the current marking m, and let e(m) = t(m) ⊎ r(m) be the set of events enabled in m. an event e_j ∈ e(m) may fire if no other event e_k ∈ e(m) with higher priority is enabled.
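the enabling condition ec(t_j, m) above can be transcribed directly; the following sketch assumes a simple dictionary encoding of the net, with marking-independent arc weights and names chosen for illustration.

```python
# Sketch of the enabling check ec(t_j, m): arc functions are dictionaries
# keyed by (place, transition) or (transition, place); this encoding is an
# illustrative assumption, not the paper's data structure.
def enabled(t, m, pre, post, test, inh, cap, guard):
    """True iff transition t is enabled in marking m (dict: place -> tokens)."""
    for p, tokens in m.items():
        if pre.get((p, t), 0) > tokens:              # input places: m_i >= pre
            return False
        if (p, t) in inh and tokens >= inh[(p, t)]:  # inhibitor arcs: m_k < inh
            return False
        if test.get((p, t), 0) > tokens:             # test arcs: m_l >= test
            return False
        w = post.get((t, p), 0)                      # output places: kp_n - m_n >= post
        if w and cap.get(p, float('inf')) - tokens < w:
            return False
    return guard(t, m)                               # guard g(t, m)

# a net fragment: t1 consumes one token from p1 and produces one in p2
m = {'p1': 2, 'p2': 0}
pre = {('p1', 't1'): 1}
post = {('t1', 'p2'): 1}
assert enabled('t1', m, pre, post, {}, {}, {}, lambda t, mk: True)
assert not enabled('t1', {'p1': 0, 'p2': 0}, pre, post, {}, {}, {}, lambda t, mk: True)
```

the same predicate, applied to a rewriting rule r_j in place of t, gives ec_tr(r_j, m), since the two conditions are structurally identical.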
hence, for an event e_j: if (φ(e_j) = t) ∨ ((φ(e_j) = r) ∧ (g_r(r_j, m) = false)), then the firing of the transition t_j ∈ t(m) or of the rewriting rule r_j ∈ r(m) changes only the current marking: (rγ, m) →^{e_j} (rγ′, m′) ⇔ (rγ = rγ′ and m[e_j > m′). also, if (φ(e_j) = r) ∧ (g_r(r_j, m) = true), then the event e_j fires the rewriting rule r_j and its occurrence changes both the configuration and the marking of the current net: (rγ, m) →^{r_j} (rγ′, m′), m[r_j > m′. the accessible state graph of a net rn = ⟨rγ, m⟩ is the labeled directed graph whose nodes are the states and whose arcs, labeled with events of rn, are of two kinds: a) firing of an enabled event e_j ∈ e(m): arcs from state (rγ, m) to state (rγ, m′) labeled with the event e_j, meaning that this event can fire in the net configuration rγ at marking m and leads to the new marking m′: (rγ, m) →^{e_j} (rγ′, m′) ⇔ (rγ = rγ′ and m[e_j > m′ in rγ); b) change of configuration: arcs from state (rγ, m) to state (rγ′, m′) labeled with a rewriting rule r_j : (rγ_l, m_l) ¤ (rγ_w, m_w), which represents the change of configuration of the current net rn: (rγ, m) →^{r_j} (rγ′, m′) and m[r_j > m′.
figure 1: translation of (a) de_rγ1 into rn_1 and (b) de_rγ2 into rn_2. as an example, consider the net rn_1 given by the following descriptive expression: de_rγ1 = p_1 |_{r_1} p_2 ∨ de′_rγ1, de′_rγ1 = (p_2 · p_5) |_{t_1} p_3 |_{t_2} p_4 |_{t_3} (p_1 ♦ p_5), m_0 = (5p_1, 1p_5), g_r(r_1, m) = (m_1 = 3) & (m_5 = 0) and r_1 : de_rγ1 ¤ de_rγ2. for r_j it is also required to identify whether rn_l belongs to rγ. upon firing, the enabled events or rewriting rules modify the current marking and/or modify the structure and current marking of the net, turning rn_1 into rn_2, given by: de_rγ2 = p_1 |_{t_1} p_2 ∨ de′_rγ2, de′_rγ2 = (p_2 · p_6) |_{t_2} p_3 (|_{t_3} p_4 |_{t_4} p_5 ∨ |_{t_5} p_5 |_{r_2} (p_1 ♦ p_6)), m = (1p_1, 3p_2, 1p_3), g_r(r_2, m) = (m_1 = 4) & (m_5 = 1), r_2 = r_1^{−1} : de_rγ2 ¤ de_rγ1. figure 1 shows the translation of de_rγ1 into rn_1 and of de_rγ2 into rn_2, respectively.
5 dynamic rewriting timed petri nets systems are described in timed pn (tpn) as interactions of components that can perform a set of activities associated with events. an event is a pair e = (α, θ), where α ∈ e is the type of the activity (action name) and θ is the firing delay. a descriptive dynamic rewriting tpn is a structure rtn = ⟨rn, θ⟩, where rn = ⟨γ, r, φ, g_tr, g_r, m⟩ and γ = ⟨p, t, pre, post, test, inh, g, pri, kp, l⟩ (see definitions 2 and 3), with the set of events e partitioned into a set e_0 of immediate events and a set e_τ of timed events, e = e_0 ⊎ e_τ. an immediate event is drawn as a thin bar; a timed event is drawn as a black rectangle for a transition or as two embedded empty rectangles for a rewriting rule, and pri(e_0) > pri(e_τ). θ : e × n^{|p|} → r+ is the weight function that maps events onto real numbers (delays or weight speeds); it can be marking dependent. the delays θ(e_k, m) = d_k(m) define the firing parameters governing the duration of each timed event of e_τ. if several timed events e_j ∈ e(m), with e_j ∈ •p_i = {e_j ∈ e : pre(p_i, e_j) > 0}, are enabled concurrently, either in competition or independently, we assume that a race condition exists between them; the evolution of the model determines whether the other timed events are aborted or simply interrupted by the resulting state change. θ(e_j, m) = w_j(m) is the weight speed of an immediate event e_j ∈ e_0. if several enabled immediate events are scheduled to fire at the same time in a vanishing marking m with these weight speeds, the probability that the enabled immediate event e_j fires is: q_j(m) = w(e_j, m) / ∑_{e_l ∈ (e(m) ∩ •p_i)} w(e_l, m), where e(m) is the set of events enabled in m. an immediate event e_j ∈ e_0 has zero firing time.
6 p systems and descriptive timed membrane petri nets here we give a brief review of p systems and of their encoding with dm-nets.
the main components of p systems are membrane structures, consisting of membranes hierarchically embedded in the outermost skin membrane. a full guide to p systems can be found in [6]. in general, a basic evolution-communication p system with active membranes (of degree n ≥ 0) is π = (o, h, µ, ω, (ρ, π)), where: o is the alphabet of objects; h is a finite set of labels for membranes; µ is a membrane structure consisting of n membranes labeled with elements of h; ω is the configuration, that is, a mapping from the membranes of π (nodes in µ) to multisets of objects ω_k ∈ ω, k = 1, ..., |ω|, over o; ρ and π are, respectively, the set of developmental rules ρ_h and their priorities π_h, h = 0, 1, ..., n − 1. the rules can be of two forms: a) object rules (or), i.e. evolving and communication rules concerning the objects; b) membrane rules (mr), i.e. rules about the structural modification of membranes. here we define dm-nets for encoding the p systems mentioned above into a descriptive dynamic rewriting tpn, i.e. an rtn. the basis of dm-nets is a membrane rtn, a de net structure comprising: places; transitions; weighted directed arcs from places to transitions and vice versa; a capacity for each place; weighted inhibitor and test arcs; and priority and guard functions of transitions. a dm-net of degree n ≥ 0 is a construct dm = ∨_{h=0}^{n−1} [_h de_h ]_h, where de_h is the descriptive expression of the rtn_h that represents the configuration of membrane [_h ]_h in a p system π. consider a p system π. the encoding of π into rtn_π is decomposed into two separate steps. first, to every membrane [_h ]_h we associate: for each object ω_i ∈ ω, one place p_{h,i} = [_h m^0_i p_i ]_h labeled ω_i with initial marking m^0_i, and for each rule ρ_{h,j} ∈ ρ, one event e_{h,j} = [_h e_j ]_h labeled ρ_{h,j} that acts on this membrane. second, for every membrane [_h ]_h we define the de_h of rtn_h that corresponds to the initial configuration of the p system π as [_h de_h ]_h.
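the first encoding step (one place per object of each membrane, initialized with the object's multiplicity) can be sketched as follows; the nested-dictionary encoding of the configuration and all names are illustrative assumptions.

```python
# Sketch of the object-to-place encoding step: each (membrane, object) pair
# becomes a place whose initial marking is the object's multiplicity.
def encode_membranes(config):
    """config: dict membrane-label -> multiset of objects (dict obj -> count).
    Returns the initial marking of the dm-net: (membrane, object) -> tokens."""
    marking = {}
    for h, multiset in config.items():
        for obj, count in multiset.items():
            marking[(h, obj)] = count
    return marking

# a skin membrane 0 containing membrane 1, with multisets {a^2 b} and {c^3}
m0 = encode_membranes({0: {'a': 2, 'b': 1}, 1: {'c': 3}})
assert m0[(0, 'a')] == 2 and m0[(1, 'c')] == 3
```

the rules ρ_{h,j} would then become the events acting on these places, which is the second step of the encoding described above.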
let u, v and u′, v′ be multisets of objects. an evolving object rule ρ_{h′,j} : [_h [_{h′} u → v ]_{h′} ]_h, whose multisets u, v are kept in membrane [_{h′}]_{h′}, is encoded as [_h [_{h′} p_u |_{t_j} p_v ]_{h′} ]_h. the antiport rule ρ_{h′,j} : [_h u [_{h′} v ]_{h′} ]_h → [_h v′ [_{h′} u′ ]_{h′} ]_h, which realizes an exchange of objects synchronized with an object c, is encoded as [_h [_{h′} (p_u · p_v · p̃_c) |_{t_j} (p_{u′} ♦ p_{v′}) ]_{h′} ]_h. also, the symport rule ρ_{h′,k} : [_h u [_{h′} ]_{h′} ]_h → [_h [_{h′} u′ ]_{h′} ]_h, which moves objects from inside to outside a membrane, or vice versa, is encoded as [_h [_{h′} (p_u · p̃_c) |_{t_k} p_{u′} ]_{h′} ]_h. because a configuration means both a membrane structure and the associated multisets, we need rules for processing membranes and multisets of objects: mr = {change, dissolve, create, divide, merge, separate, move}. the above membrane rewriting rules (realized by rewriting events in the de) are defined as follows: change rewriting rule [_h [_{h′} (de_{h′}, m_{h′}) ]_{h′} ]_h ¤ [_h [_{h′} (de′_{h′}, m′_{h′}) ]_{h′} ]_h: at run-time the current structure and multisets of objects of membrane h′, encoded by the descriptive expression de_{h′} and marking m_{h′}, are changed into a new structure de′_{h′} with new marking m′_{h′}; dissolve rewriting rule [_h (de_h, m_h) [_{h′} (de_{h′}, m_{h′}) ]_{h′} ]_h ¤ [_h (de_h, m′_h) ]_h: the objects and sub-membranes of membrane h′ now belong to its parent membrane h; the skin membrane cannot be dissolved; create rewriting rule [_h (de_h, m_h) ]_h ¤ [_h (de′_h, m′_h) [_{h′} (de′′_{h′}, m′′_{h′}) ]_{h′} ]_h with m_h = m′_h + m′′_{h′}: a new membrane h′ is created and m′′_{h′} is added to membrane h′, while the rest remains in the parent membrane h; divide rewriting rule [_h (de_h, m_h) ]_h ¤ [_h [_{h′} (de_h, m_h) ]_{h′} [_{h′′} (de_h, m_h) ]_{h′′} ]_h: the objects and sub-membranes are reproduced and added to membrane h′ and membrane h′′, respectively; merge rewriting rule, by which the objects of membranes h′ and h′′ are added to a new membrane h: [_h [_{h′} (de′_{h′}, m_{h′}) ]_{h′} [_{h′′} (de′′_{h′′}, m′′_{h′′}) ]_{h′′} ]_h ¤ [_h (de′_{h′} ∨ de′′_{h′′}, m_{h′} + m′′_{h′′}) ]_h; separate rewriting rule, the counterpart of merge, done by a rewriting rule of the form [_h (de′_{h′} ∨ de′′_{h′′}, m_{h′} + m′′_{h′′}) ]_h ¤ [_h [_{h′} (de′_{h′}, m_{h′}) ]_{h′} [_{h′′} (de′′_{h′′}, m′′_{h′′}) ]_{h′′} ]_h, with the meaning that the content of membrane h is split into two membranes, with labels h′ and h′′; move rewriting rule, by which a membrane h′′ can be moved out of or into a membrane h′ as a whole: [_h [_{h′} (de_{h′}, m_{h′}) [_{h′′} (de′′_{h′′}, m′′_{h′′}) ]_{h′′} ]_{h′} ]_h ¤ [_h [_{h′} (de_{h′}, m_{h′}) ]_{h′} [_{h′′} (de′′_{h′′}, m′′_{h′′}) ]_{h′′} ]_h, or [_h [_{h′} (de_{h′}, m_{h′}) ]_{h′} [_{h′′} (de′′_{h′′}, m′′_{h′′}) ]_{h′′} ]_h ¤ [_h [_{h′} (de_{h′}, m_{h′}) [_{h′′} (de′′_{h′′}, m′′_{h′′}) ]_{h′′} ]_{h′} ]_h. thus, using dm-nets facilitates a compact and flexible specification for the visual simulation of p systems with dynamic rewriting tpn nets, permitting the verification of many useful behavioural properties such as reachability, boundedness, liveness, termination, etc., as well as the performance evaluation of parallel computing models.
7 summary and conclusions in this paper we have proposed an approach to the performance modelling of the behaviour of p systems through a class of petri nets called descriptive membrane timed pn (dm-nets). based on the introduction of a set of descriptive composition operations and of rewriting rules attached to transitions for the creation of dynamic rewriting tpn, the membrane structure can be successfully encoded as descriptive membrane rewriting timed petri net models, which permit the description of the behavioural, state-based, run-time structural changes of p systems. we are currently developing a software visual simulator with a friendly interface for the verification and performance evaluation of descriptive rewriting tpn models and dm-nets.
references [1] m. ajmone-marsan, g. balbo, g. conte, s. donatelli, g. francheschinis, "modeling with generalized stochastic petri nets," wiley series in parallel computing, new york: wiley, 1995. [2] s.
dal zilio, e. formenti, "on the dynamics of pb systems: a petri net view," in proceedings of wmc 2003, lecture notes in computer science 2933, springer-verlag, pp. 153-167, 2004. [3] e. gutuleac, "descriptive compositional construction of gspn models for performance evaluation of computer systems," in proceedings of the 8th international symposium on automatic control and computer science, saccs 2004, 22-23 october, iasi, romania, cd, 2004. [4] e. gutuleac, "descriptive dynamic rewriting gspn-based performance modeling of computer systems," in proceedings of the 15th international conference on control systems and computer science, cscs15, 25-27 may 2005, bucuresti, romania, pp. 656-661, 2005. [5] j. kleijn, m. koutny, g. rozenberg, "towards a petri net semantics for membrane systems," in proceedings of wmc6 2005, july 18-21, wien, austria, pp. 439-459, 2005. [6] gh. paun, "membrane computing. an introduction," natural computing series, eds. g. rozenberg, th. back, a.e. eiben, j.n. kok, h.p. spaink, leiden center for natural computing, springer-verlag, berlin, p. 420, 2002. [7] z. qi, j. you, h. mao, "p systems and petri nets," in proceedings of wmc 2003, lecture notes in computer science, vol. 2933, springer-verlag, berlin, pp. 387-403, 2003. emilian guţuleac, technical university of moldova, computer science department, address: 168, bd. stefan cel mare, md-2004, chişinău, republic of moldova, e-mail: egutuleac@mail.utm.md
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 862-870 a novel qos framework based on admission control and self-adaptive bandwidth reconfiguration a. peculea, b. iancu, v. dadarlat, i.
ignat adrian peculea, bogdan iancu, vasile dadarlat, iosif ignat, technical university of cluj-napoca, romania, 400020 cluj-napoca, 15 constantin daicoviciu, e-mail: {adrian.peculea,bogdan.iancu,vasile.dadarlat,iosif.ignat}@cs.utcluj.ro abstract: this paper proposes a novel end-to-end qos framework, called the self-adaptive bandwidth reconfiguration qos framework (sar). sar provides end-to-end qos guarantees on a per-flow basis through admission control and end-to-end bandwidth reservation. in order to adapt to short- and long-term changes in traffic load, sar performs dynamic bandwidth reconfiguration. due to a new organization of the network's physical lines, sar allows for a better utilization of the links' capacity and a smaller number of rejected flows, increasing the network's availability. keywords: end-to-end qos, admission control, bandwidth reconfiguration.
1 introduction computer networks transport several flows simultaneously, which makes a multiplexing mechanism necessary. transport procedures affect the traffic flows, which is why the traffic has to be characterized and quality of service (qos) requirements need to be established. traffic types and their qos requirements impose the implementation of qos methods and architectures. this paper presents the design and implementation of a new end-to-end qos framework with self-adaptive bandwidth reconfiguration. integrated services (intserv) [1] provides end-to-end quality of service (qos) guarantees for individual flows by maintaining state and reserving bandwidth for each flow at the routers on the path between source and destination. the additional load introduced by the per-flow bandwidth reservation processing and by the per-flow state maintenance at each router is significant and increases with the size of the network. for this reason, integrated services presents scalability problems. differentiated services (diffserv) [1] groups the flows into traffic classes at the edge of the network.
interior routers forward each packet as a function of the per-hop behavior associated with the packet's traffic class. because of the flow aggregation and the lack of admission control, differentiated services does not provide end-to-end qos guarantees to individual flows. on-demand qos path (odp) [2] provides end-to-end qos guarantees to individual flows while introducing a much lower additional load than integrated services and maintaining a scalability similar to that of differentiated services. odp exercises per-flow admission control and end-to-end bandwidth reservation at the edge of the network; inside the network, odp differentiates the traffic classes as in differentiated services. the main disadvantage of odp is that bandwidth adjustment happens only inside a traffic class and does not allow for bandwidth redistribution between classes. the free bandwidth of provisioned links that are unused or present a low utilization cannot be made available to other provisioned links, so that free bandwidth remains unused. another disadvantage of this framework is that it does not include a module for determining the bandwidth necessary for each input flow. in order to eliminate the disadvantages mentioned above, we elaborated, implemented and propose a framework for end-to-end quality of service guarantees through admission control and self-adaptive bandwidth reconfiguration, which allows for bandwidth redistribution between classes. in this approach, the physical line is divided into two main sections: one part is the guaranteed link (gl), necessary for guaranteeing a minimum bandwidth (where applicable) for the traffic classes (tcs), and a common part named the common link (cl), which can be used by any tc. (copyright © 2006-2010 by ccc publications.)
having two separate sections, the framework guarantees a minimum bandwidth for any trunk and offers a common bandwidth which can be used by every trunk, irrespective of its tc. this allows for better bandwidth utilization and a decrease in the number of rejected flows. this paper is organized in the following manner: section ii presents related work; section iii describes the architecture and functioning of the proposed framework; sections iv and v present the admission control method and the self-adaptive reconfiguration technique of the proposed framework, respectively; finally, section vi presents the experimental results and the concluding remarks.
2 related work the integrated services (intserv) framework uses the resource reservation protocol (rsvp) to reserve bandwidth for each flow at every router along the path of the flow. using per-flow hop-by-hop signaling, consisting of path and resv messages, integrated services provides end-to-end guarantees. these guarantees come with the overhead of processing per-flow bandwidth reservations and maintaining per-flow state at each router along the flow's path. because this overhead is significant and increases with the network size, intserv presents scalability problems. the differentiated services (diffserv) framework classifies packets into traffic classes at the boundary of the network. during the classification process each packet is marked according to its traffic class. the routers inside the network recognize the traffic class of the packets and, using a scheduling mechanism, forward each packet as a function of the per-hop behavior associated with the packet's traffic class. in this framework, service is provided on a per-class basis instead of the per-flow basis of the intserv framework. this approach removes the overhead specific to the intserv framework, which is why the diffserv framework is much more scalable.
however, the diffserv framework does not exercise admission control at the edge of the network, so the network can be overloaded, which is why this framework does not provide end-to-end guarantees. on-demand qos path (odp) provides end-to-end qos guarantees to individual flows with less overhead than intserv, while maintaining a scalability similar to that of diffserv. two types of routers are defined in this framework: edge and core. odp exercises per-flow admission control and end-to-end bandwidth reservation at the edge of the network. inside the network, odp differentiates the traffic classes as in diffserv. odp organizes link bandwidth hierarchically: each physical link is statically divided into several provisioned links (pls), each pl being dedicated to a traffic class, and each pl is divided into several trunks, each trunk being dedicated to an edge router. an edge router keeps track of the available bandwidth of its trunks and performs admission control locally, without hop-by-hop signaling through the network. the main disadvantage of odp is that bandwidth adjustment takes place only inside a traffic class and does not allow for bandwidth redistribution between classes. the free bandwidth of provisioned links that are unused or lightly utilized cannot be made available to other provisioned links, so it remains unused. another disadvantage of this framework is that it does not include a module for determining the necessary bandwidth for each input flow.
864 a. peculea, b. iancu, v. dadarlat, i. ignat
3 the architecture of the framework
the proposed framework serves the user networks and defines two types of routers, edge and core, plus entities for common bandwidth control.
figure 1: bandwidth organization in the proposed framework
edge routers (ers), connected to the served networks, determine the necessary bandwidth for each input flow, take the admission or rejection decision for each input flow, dynamically reconfigure the bandwidth assigned to trunks, map flows to their corresponding tcs and transmit the packets belonging to the admitted flows into the network. core routers (crs), connected to edge or core routers, recognize tcs and provide class-based service differentiation. entities for common bandwidth control monitor and update the utilization of the common bandwidths and accept or reject the requests for additional trunk bandwidth received from ers. the bandwidth is hierarchically organized. each physical line is divided into two sections, as presented in figure 1. the first section guarantees the minimum bandwidth, which can also be 0, for each class and each trunk. the second section, the cl, offers a common bandwidth which can be used by every trunk according to its bandwidth requirements, irrespective of the tc or er to which it belongs. thus, trunks can acquire additional bandwidth without being constrained by the available bandwidth of their class. the first section is statically divided into several guaranteed class links (gcls). each gcl is reserved for a tc, there being a one-to-one mapping between the tcs supported by the physical line and the gcls. each gcl is divided into several trunks, each trunk being dedicated to an er. a trunk belonging to a gcl supports the flows belonging to the tc that corresponds to the considered gcl, originating from the er to which the trunk is dedicated, irrespective of their destination. an er keeps track of the available bandwidth of its assigned trunks and performs admission control locally, without hop-by-hop signaling through the network. a virtual ip path (vip) is a path from a source er to a destination er for a tc, being a concatenation of trunks belonging to the source er over a source-destination path.
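the hierarchical bandwidth organization above (gcls per traffic class, trunks per edge router, plus a shared cl) can be sketched as a small data model. this is an illustrative sketch only: the class names, field names and bandwidth figures below are our own assumptions, not definitions from the paper.

```python
from dataclasses import dataclass, field

@dataclass
class Trunk:
    # bandwidth figures in mbps; names are illustrative, not from the paper
    min_reserved_bdw: float      # guaranteed minimum (may be 0)
    reserved_bdw: float          # current reservation (minimum + bandwidth acquired from the cl)
    bdw_being_used: float = 0.0  # bandwidth consumed by admitted flows

@dataclass
class GuaranteedClassLink:
    traffic_class: str
    trunks: dict = field(default_factory=dict)   # edge-router id -> Trunk

@dataclass
class PhysicalLine:
    capacity: float
    cl_capacity: float                           # common link, usable by any trunk
    gcls: dict = field(default_factory=dict)     # traffic class -> GuaranteedClassLink

    def guaranteed_total(self) -> float:
        """sum of the guaranteed minima across all gcls and trunks."""
        return sum(t.min_reserved_bdw
                   for g in self.gcls.values() for t in g.trunks.values())

line = PhysicalLine(capacity=100.0, cl_capacity=40.0)
line.gcls["EF"] = GuaranteedClassLink("EF", {"ER1": Trunk(10.0, 10.0),
                                             "ER2": Trunk(10.0, 10.0)})
line.gcls["AF"] = GuaranteedClassLink("AF", {"ER1": Trunk(20.0, 20.0)})
# invariant of the two-section split: guaranteed part + common part fits the line
assert line.guaranteed_total() + line.cl_capacity <= line.capacity
```

a vip would then simply be a list of such trunks, one per hop of the source-destination path.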
the bandwidth assigned to trunks has a minimum guaranteed value, which can also be 0, and, by using the cl, is dynamically adjusted according to network traffic changes. depending on the entities for common bandwidth control, there are three possible approaches: central control (cc), router-aided (ra) and edge-to-edge (ee). the architecture of the framework is presented in figure 2 and is composed of two entities: the edge router and the entity for common bandwidth control. the edge router determines the necessary bandwidth for each input flow, takes the admission or rejection decision for each input flow, reserves the necessary bandwidth for each admitted flow, dynamically reconfigures the bandwidth assigned to trunks and classifies the packets belonging to the admitted flows. the entity for common bandwidth control monitors and updates the utilization of the common bandwidths and accepts or rejects the additional bandwidth requests for trunks received from edge routers. the communication between the two entities is realized through a predefined message set.
figure 2: the architecture of the proposed framework
the edge router is composed of two planes: the local resources monitoring plane and the flow management and local resources control plane.
the local resources monitoring plane is composed of the following tables: the classification and reservation table, which maps flow types and the elements that identify them to the corresponding traffic class, their necessary bandwidth and the maximum necessary bandwidth for any flow from the respective traffic class; the flow table, which stores the admitted flows and the time of the last packet from each flow; the routing to vip correspondence table, which allows for vip determination; the vip table, which stores the vips; and the trunk table, which stores the reserved bandwidth, the bandwidth being used and the minimum reserved bandwidth for every trunk belonging to the er. the flow management and local resources control plane takes the packets from the traffic policy module and delivers them to the routing process, being composed of the following blocks: packet reception time storage, which reads the receiving time of each packet; flow identification, which determines whether packets belong to admitted flows; flow table update, which updates the reception time of the last packet of each flow in the flow table; admission control and additional resources acquiring, which admits the flows for which there are enough resources and rejects the flows for which there is not enough bandwidth, acquires additional bandwidth for trunks, reserves the necessary bandwidth for the admitted flows and inserts the admitted flows into the flow table; and packet classification, which identifies the packets according to the classification and reservation table criteria and marks them accordingly. the second task of this plane is to determine finished admitted flows and release the acquired resources used for these flows. the following blocks realize this task: the clock, which generates the time period at which acquired resources are released, and acquired resources release, which determines finished admitted flows and releases the reserved and acquired resources for these flows.
the entity for common bandwidth control is composed of two planes: the common resources monitoring plane and the common resources control plane. the common resources monitoring plane contains the common bandwidth table, which stores the reservation and utilization of the common bandwidths of the cls. the common resources control plane contains the common resources control block, which updates the common bandwidth table and decides whether additional bandwidth requests for trunks received from ers can be accepted.
4 admission control
admission control is performed by the source er at the arrival of the first packet of a new flow. the admission control and additional resources acquiring module stores the packet in the not admitted flows memory and determines whether other packets belonging to this flow are stored in the memory. if there are no such packets, it determines from the classification and reservation table the necessary bandwidth and tc for the flow, determines from the routing to vip correspondence table the flow's corresponding vip and extracts from the vip table the trunks that belong to the determined vip. then, for the trunks which have enough available bandwidth, it reserves the flow's necessary bandwidth by updating the bdw_being_used field. for a trunk, the condition to have enough available bandwidth is:
reserved_bdw ≥ bdw_being_used + necessary_bdw (4.1)
where reserved_bdw and bdw_being_used are the amounts of reserved and utilized bandwidth for the trunk and necessary_bdw is the flow's necessary bandwidth. the bdw_being_used field is updated in the following manner:
bdw_being_used = bdw_being_used + necessary_bdw (4.2)
if the vip has enough bandwidth to support the input flow, the admission control accepts the flow.
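conditions (4.1) and (4.2) can be captured in a short sketch. the function names and the dictionary-based trunk representation are ours, not the paper's; the checks themselves mirror the two formulas (assuming bandwidth in consistent units).

```python
def trunk_has_bandwidth(trunk, necessary_bdw):
    # condition (4.1): reserved_bdw >= bdw_being_used + necessary_bdw
    return trunk["reserved_bdw"] >= trunk["bdw_being_used"] + necessary_bdw

def try_reserve(trunks, necessary_bdw):
    """attempt to reserve necessary_bdw on every trunk of the flow's vip.
    returns the trunks lacking bandwidth (candidates for a cl request),
    applying update (4.2) only when all trunks can host the flow."""
    lacking = [t for t in trunks if not trunk_has_bandwidth(t, necessary_bdw)]
    if not lacking:
        for t in trunks:
            t["bdw_being_used"] += necessary_bdw   # update (4.2)
    return lacking
```

if try_reserve returns a non-empty list, the er would then ask the entity for common bandwidth control to enlarge reserved_bdw on those trunks before retrying, as described next.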
if there are trunks which do not have enough available bandwidth, the admission control and additional resources acquiring module tries to increase the reserved bandwidth of those trunks by sending a request to the entities for common bandwidth control. if the request is admitted, the reserved bandwidth of the trunks is increased by updating the reserved_bdw field, so that these trunks too will have enough available bandwidth to support the input flow. for these trunks, the admission control and additional resources acquiring module reserves the flow's necessary bandwidth by updating the bdw_being_used field. in this case too, the admission control accepts the flow. after a flow is accepted, it is inserted into the flow table and the packets belonging to this flow, stored in the not admitted flows memory, are transmitted to the flow table update module for the rest of the processing and transmission. if the request is rejected, the flow is rejected, the reservations made on the trunks which had enough available bandwidth are canceled by updating the bdw_being_used field, and the packets belonging to the flow, stored in the not admitted flows memory, are discarded. a flow is considered finished after an inactivity period that exceeds a predefined value. each er, using the acquired resources release module, periodically inspects its own flow table in order to identify the finished flows and, upon identifying a finished flow, releases the corresponding bandwidth. if there are finished flows, the reserved bandwidth and tc for these flows are determined from the classification and reservation table and the flows are removed from the flow table. then, the acquired resources release module determines from the routing to vip correspondence table the corresponding vips and extracts from the vip table the trunks belonging to the determined vips.
after this, it releases the bandwidth reserved for the flows by updating the bdw_being_used field in the trunk table for each trunk belonging to the vips. the bdw_being_used field is updated in the following manner:
bdw_being_used = bdw_being_used − necessary_bdw (4.3)
also, it extracts from the classification and reservation table the maximum amount of bandwidth for the corresponding tcs and verifies whether the trunk utilization is under the predetermined lower threshold. for a trunk, the condition to have its utilization under the predetermined lower threshold is:
reserved_bdw > bdw_being_used + n ∗ tc_maximum_necessary_bdw (4.4)
where tc_maximum_necessary_bdw is the maximum amount of bandwidth for the corresponding tc and n is a predefined parameter with a value larger than or equal to 1. it also extracts from the trunk table the minimum reserved bandwidth for the trunks and verifies whether the trunks have additional bandwidth acquired from the cls. for a trunk, the condition to have additional bandwidth acquired from the cl is:
reserved_bdw > trunk_minimum_reserved_bdw (4.5)
where trunk_minimum_reserved_bdw is the minimum reserved bandwidth for the trunk. if there are trunks whose bandwidth being used is under the predetermined lower threshold and which have additional bandwidth acquired from the cls, the acquired resources release module computes, within the limit of the acquired bandwidth, the bandwidth that will be released from the reserved bandwidth of the trunks. the reduction of the reserved bandwidths is accompanied by the appropriate release of resources to the common bandwidths.
5 self-adaptive bandwidth reconfiguration
the proposed framework dynamically adjusts the bandwidth assigned to the trunks, in order to adapt to changes in network traffic.
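the release-side checks (4.3)-(4.5) can be sketched in the same style; again the names are illustrative, and n is the paper's threshold parameter (n ≥ 1).

```python
def release_flow(trunks, necessary_bdw):
    # update (4.3): free the bandwidth of a finished flow on its vip trunks
    for t in trunks:
        t["bdw_being_used"] -= necessary_bdw

def can_shrink(trunk, tc_maximum_necessary_bdw, n=2):
    """true when the trunk is both under the lower utilization threshold (4.4)
    and holding bandwidth acquired from the cl (4.5), i.e. a release candidate."""
    under_threshold = (trunk["reserved_bdw"] >
                       trunk["bdw_being_used"] + n * tc_maximum_necessary_bdw)
    acquired_from_cl = trunk["reserved_bdw"] > trunk["trunk_minimum_reserved_bdw"]
    return under_threshold and acquired_from_cl
```

a trunk for which can_shrink holds can give back, at most, reserved_bdw − trunk_minimum_reserved_bdw to the common link.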
a source edge router can request additional bandwidth for its trunks or release bandwidth not used by its trunks, depending on their bandwidth usage. bandwidth adjustment is done using the cl's bandwidth. this adjustment allows all trunks, regardless of the traffic class or the edge router to which they belong, to share the bandwidth provided by the cl. the trunk reconfiguration process of the proposed framework involves three main actions: (1) the control of the common bandwidth table, (2) the release of bandwidth not used by the trunks, and (3) the acquisition of additional bandwidth for trunks. a common bandwidth table stores the common bandwidth utilization of the network cls. as shown in figure 3, an entry in this table contains: the cl identifier, the reserved amount of shared bandwidth and the total amount of shared bandwidth of the cl.
figure 3: common bandwidth table
depending on the entities controlling the shared bandwidth, three approaches are proposed: central control (cc), router-aided (ra) and edge-to-edge (ee). in the central control approach, the common bandwidth table is managed by a network management server (nms) and stores the bandwidth utilization of all the cls in the network. in the router-aided approach, each core router manages a common bandwidth table, and each of these tables stores the bandwidth utilization of the cls belonging to all physical links directly connected to that core router. in the edge-to-edge approach, each edge router manages a common bandwidth table, which stores the bandwidth utilization of all the cls in the network.
figure 4: reserved bandwidth update algorithm
each edge router periodically examines its own flow table and determines which flows are finished. if there are any finished flows, the flow table and the trunk table are updated.
the next step for the edge router is to examine the trunk table and obtain the bandwidth utilization of its own trunks. if the bandwidth utilization of any trunk is under a predetermined lower threshold and that trunk has additional bandwidth acquired from the common link, the source edge router computes the amount of bandwidth to be released from the reserved bandwidth, adjusts the released bandwidth of the trunks within the limit of the additionally acquired bandwidth, updates its own trunk table and sends a control message to the entities for common bandwidth control in order to release the used shared bandwidth. adjusting a trunk's bandwidth is done only within the limit of the additionally acquired bandwidth. the algorithm that describes the reserved bandwidth update process for the trunks is presented in figure 4. the trunk reconfiguration process is always initiated by a source edge router, using a mechanism driven by thresholds and computed values.
6 experimental results and conclusions
for the development and testing of the proposed qos framework (the sar framework), and also for the development of new ones, an experimental methodology was used rather than simulation techniques; thus an integrated solution, a development tool, was created [3]. a benchmarking system for qos parameters [4] was also developed in order to allow the testing of the proposed sar qos framework. the benchmarking system generates traffic for the defined testbed and measures the following parameters: delay, ip delay variation (ipdv, or jitter) and bandwidth, both for tcp and udp. the benchmarking system allows a user to define and store complex traffic patterns that can be reloaded for further measurements, in order to test various qos techniques on the same traffic characteristics.
for evaluation purposes, the self-adaptive bandwidth reconfiguration qos framework (sar) described in the previous sections and the odp framework were tested in a comparative manner, using the development tool and the benchmarking system. the final testbed is a network of programmable routers and consisted of three edge routers and three served networks. the tests were intended as a performance comparison between the odp and sar frameworks. traffic classes and traffic patterns were defined identically in both tested frameworks. four traffic classes were considered and two test traffic patterns were defined. in the first traffic pattern, flows from classes 2 and 3 are injected; in the second traffic pattern, flows belonging to class 2 are injected. for both traffic patterns, a balanced distribution of traffic from and to the served networks is ensured. after testing and analyzing the results (figure 5), it was found that the number of flows admitted by the sar framework is higher than in the case of the odp framework, for both tested traffic patterns, which demonstrates a more efficient use of network resources.
figure 5: test results
also, the equal number of control messages transmitted by the two frameworks shows that sar is a scalable framework. finally, the tests confirmed that admission control eliminated network congestion. this paper presents a new end-to-end qos framework, called the self-adaptive bandwidth reconfiguration qos framework (sar). the proposed dynamic allocation method guarantees a minimum bandwidth for each traffic class and trunk, and provides a common bandwidth section which can be used by every trunk according to its bandwidth requirements, irrespective of the tc or er to which it belongs. thus, trunks can acquire additional bandwidth without being constrained by the available bandwidth of their class.
the new framework, sar, uses the proposed bandwidth organization, allowing an increase in the traffic volume it handles while guaranteeing end-to-end quality of service through network resources monitoring, admission control and resource reservation for new flows. the end-to-end qos framework with self-adaptive bandwidth reconfiguration overcomes the disadvantages of odp by providing minimum service guarantees and bandwidth redistribution between classes.
acknowledgments
this work was supported by the pnii-idei 328/2007 qaf (quality of service aware frameworks for networks and middleware) research project within the framework of the national research, development and innovation programme initiated by the national university research council of romania (cncsis uefiscsu).
bibliography
[1] z. wang, internet qos: architectures and mechanisms for quality of service, morgan kaufmann, san francisco, 2001.
[2] m. yang, y. huang, j. kim, m. lee, t. suda, m. daisuke, an end-to-end qos framework with on-demand bandwidth reconfiguration, twenty-third annual joint conference of the ieee computer and communications societies (infocom), hong kong, vol. 3, pp. 2072-2083, 2004.
[3] a. peculea, v. dadarlat, i. ignat, b. iancu, l. cobarzan, on developing a qos framework with self-adaptive bandwidth reconfiguration, pollack periodica - an international journal for engineering and information sciences, vol. 4, no. 1, pp. 121-129, 2009.
[4] a. peculea, b. iancu, v. dadarlat, i. ignat, e. cebuc, z. baruch, benchmarking system for qos parameters, proceedings of the ieee 3rd international conference on intelligent computer communication and processing 2007 (iccp 2007), cluj-napoca, romania, pp. 255-258, 2007.
international journal of computers communications & control, issn 1841-9836, 12(3), 323-329, june 2017.
learning speed enhancement of iterative learning control with advanced output data based on parameter estimation
g.-m. jeong, s.-h.
ji
gu-min jeong, school of electrical engineering, kookmin university, korea, gm1004@kookmin.ac.kr
sang-hoon ji*, robot r&bd group, kitech, korea, robot91@kitech.re.kr (*corresponding author)
abstract: learning speed enhancement is one of the most important issues in learning control. if we can improve both learning speed and tracking performance, this will improve the applicability of learning control. considering these facts, in this paper we propose a learning speed enhancement scheme for iterative learning control with advanced output data (adilc) based on parameter estimation. we consider linear discrete-time non-minimum phase (nmp) systems whose model is unknown, except for the relative degree and the number of nmp zeros. in each iteration, estimates of the impulse response are obtained from the input-output relationship. then, the learning gain matrix is calculated from the estimates, and by using the new learning gain matrix the learning speed can be enhanced. simulation results show that the learning speed is enhanced by applying the proposed method.
keywords: iterative learning control, speed enhancement, parameter estimation, learning gain estimation.
1 introduction
by using iterative learning control (ilc), the tracking performance can be enhanced when the same task is performed iteratively [7]-[9]. among various ilc schemes, iterative learning control with advanced output data (adilc) [4] [5] has been proposed for learning control in discrete-time non-minimum phase (nmp) systems. adilc stabilizes the inverse mapping by directly using an output-to-input mapping with time-advanced output data. its learning structure is simple, since it consists of an input update law that depends on the relative degree and the number of nmp zeros. on the other hand, due to the complexity of computation, learning speed enhancement is one of the most important issues in iterative learning control (ilc).
considering this, various approaches for direct learning control (dlc) [9] have been proposed. in [5], an adilc scheme based on the estimation of the impulse response is proposed for linear discrete-time nmp systems whose model is unknown, except for the relative degree and the number of nmp zeros. instead of using an approximate model of the system, the first part of the impulse response is estimated and used for the adilc. however, considering the computational cost of this method, a novel scheme is needed to enhance the learning speed. in this paper, we propose a new speed enhancement scheme for discrete-time nmp systems, extending the results in [5]. by using the estimates of the learning matrix, an estimate of the desired input is derived and the learning speed can be significantly enhanced. further, an illustrative example is provided to demonstrate the applicability of the proposed method.
copyright © 2006-2017 by ccc publications
2 adilc for discrete-time nmp systems
in this section, some preliminary results on the adilc from [4] are briefly summarized. let us consider a linear time-invariant (lti) system described by
x(i + 1) = a x(i) + b u(i), y(i) = c x(i) (1)
where u ∈ r^1, x = [x_1, ..., x_n]^t ∈ r^n and y ∈ r^1 are the input, the state and the output of the system, respectively, and a, b and c are matrices of appropriate dimensions. let x^d(i), y^d(i) and u^d(i) represent the state, the output and the input corresponding to the desired trajectory, respectively. further, let the desired output y^d(i), i ∈ [σ, n + σ − 1], be given, and let u_[i,j] := [u(i), ..., u(j)]^t and y_[i,j] := [y(i), ..., y(j)]^t. the transfer function of the system is represented by
g(z) = (β_1 z^{n−1} + ... + β_n) / (z^n + α_1 z^{n−1} + ... + α_n).
here, it is assumed that the number of nmp zeros, d_0, and the relative degree, σ, are known a priori (i.e., β_1 = ... = β_{σ−1} = 0). in the adilc, the following input-output mapping is used to stabilize the inverse mapping.
y_[σ+d_0, n+σ+d_0−1] = h x(0) + j u_[0,n−1], (2)
h = [(h_{d_0+1})^t, ..., (h_{n+d_0})^t]^t,
j =
[ j_{d_0+1}  j_{d_0}      ...  0
  j_{d_0+2}  j_{d_0+1}    ...  0
  ...        ...          ...  ...
  j_{n+d_0}  j_{n+d_0−1}  ...  j_{d_0+1} ],
where h_l = c a^{σ+l−1} and j_l = c a^{σ+l−2} b. the time interval for the output of interest is [σ + d_0, n + σ + d_0 − 1] in (2), whereas it is [σ, n + σ − 1] for minimum phase systems (i.e., d_0 = 0). for adilc, we set the input horizon to [0, n + d_0 − 1] with u_[n, n+d_0−1] = 0 and the output horizon to [0, n + σ + d_0 − 1]. the desired trajectory, y^d, is given on [σ, n + σ − 1]. we set y^d_[n+σ, n+σ+d_0−1] to some appropriate constants. further, at every iteration, we set x^k(0) = x^d(0) and u^k(i) = u^d(i) = 0, n ≤ i ≤ n − 1 + d_0. to analyze the stability of the inverse mapping, we need the following assumptions:
• (a1) the system is stable, controllable and observable.
• (a2) the matrix a is invertible.
• (a3) β_n ≠ 0 in g(z).
• (a4) the matrix j is nonsingular.
with these assumptions, lemma 2.1 shows that the inverse mapping (2) is stable using the time advancing of the output data, even though the system is nmp.
lemma 2.1. (stable inversion using time advancing) the inverse mapping from y^d_[σ+d_0, n+σ+d_0−1] to u^d_[0,n−1] is stable.
the input update law is derived from lemma 2.1 as follows:
u^{k+1}_[0,n−1] = u^k_[0,n−1] + s^k e^k_[σ+d_0, n+σ+d_0−1], (3)
where e^k_[l,m] = y^d_[l,m] − y^k_[l,m] and s^k ∈ r^{n×n} is the learning gain matrix. the next lemma shows that the input u^k_[0,n−1] converges to u^d_[0,n−1] as k → ∞ under the input update law (3). it should be noted that this inverse mapping is stable.
lemma 2.2. suppose the uncertain system (1) satisfies (a1)-(a4). if the condition
‖i − s^k j‖ ≤ ρ < 1 (4)
holds, the input u^k_[0,n−1] converges to u^d_[0,n−1] as k → ∞.
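the update law (3) and condition (4) can be exercised on a toy system. the sketch below uses a first-order, minimum-phase plant (σ = 1, d_0 = 0) and a fixed gain s^k = αi; the plant, horizon and α are our own illustrative choices, not the paper's example.

```python
import numpy as np

# toy first-order plant: x(i+1) = a*x(i) + b*u(i), y(i) = c*x(i)
a, b, c = 0.5, 1.0, 1.0
n = 20
yd = np.sin(2 * np.pi * np.arange(n) / n)   # desired output over the horizon

def simulate(u):
    """returns the time-advanced outputs y(1), ..., y(n) for x(0) = 0."""
    x, y = 0.0, np.zeros(n)
    for i in range(n):
        x = a * x + b * u[i]
        y[i] = c * x
    return y

alpha = 0.5
s = alpha * np.eye(n)                       # fixed learning gain matrix s^k
u = np.zeros(n)
errors = []
for k in range(100):
    e = yd - simulate(u)                    # e^k = y^d - y^k
    errors.append(np.linalg.norm(e))
    u = u + s @ e                           # input update law (3)
```

for this plant, j is lower triangular with unit diagonal and geometrically decaying entries, so ‖i − αj‖ < 1 holds for α = 0.5 and the tracking error shrinks at every iteration, illustrating lemma 2.2.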
3 adilc with the estimation of the impulse response
in this section, the impulse response estimation scheme in [5] is slightly modified for the learning speed enhancement detailed in the next section. after estimating the first p impulse responses, we select the first l ≤ p responses to obtain j̄, the estimate of j in (2). with j̄, which consists of the estimates of the first l impulse responses (j_1, ..., j_l), the adilc scheme can be applied to unknown nmp systems. since we set x(0) = 0 for the learning scheme, from (2) we can obtain
y_[σ+d_0, n+σ+d_0−1] = j u_[0,n−1]. (5)
by exchanging the roles of j and u_[0,n−1], (5) can be rewritten as
y_[σ+d_0, n+σ+d_0−1] = u_max j_[1, n+d_0+1], (6)
where
u_max =
[ u(d_0)          ...  u(0)      ...  0
  ...             ...  ...       ...  ...
  u(d_0 + n − 1)  ...  u(n − 1)  ...  u(0) ],
j_[1, n+d_0+1] = [j_1, ..., j_{n+d_0+1}]^t. (7)
to estimate the first p impulse responses, we make an approximation for j_[1,p]. as i becomes larger, the impulse response j_i approaches 0. by selecting a sufficiently large p and discarding the impulse responses from p + 1 onward, the approximation is made as
y_[σ+d_0, n+σ+d_0−1] ≈ u_p j_[1,p], (8)
where
u_p =
[ u(d_0)          u(d_0 − 1)      ...  0
  u(d_0 + 1)      u(d_0)          ...  0
  ...             ...             ...  ...
  u(d_0 + n − 1)  u(d_0 + n − 2)  ...  u(d_0 + n − p) ],
j_[1,p] = [j_1, ..., j_p]^t. (9)
using the least squares method, we can obtain j̄_[1,p], consisting of the estimates of j_[1,p], as
j̄_[1,p] = (u_p^t u_p)^{−1} u_p^t y_[σ+d_0, n+σ+d_0−1]. (10)
after estimating the impulse responses, we select the first l ≤ p impulse responses and obtain j̄, the estimate of j in (2). in [5], a learning control scheme was presented based on impulse response estimation. at step 0, u^1_[0,n−1] is determined by setting s^0 to an appropriate matrix, e.g., αi, with u^0_[0,n−1] = 0. then, we can estimate j̄^k_[1,n] similarly to (10), and learning control can be performed using (3) with s^k = α(j̄^k)^{−1} for some α, 0 < α < 1.
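the estimate (10) is an ordinary least-squares fit of the first p impulse-response taps. the sketch below builds the toeplitz matrix u_p from a random input and recovers a known impulse response; `np.linalg.lstsq` is used instead of forming (u_p^t u_p)^{−1} explicitly, and the d_0 time shift is dropped for brevity. all values are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
N, p = 60, 12                      # horizon and number of impulse-response taps kept
j_true = 0.8 ** np.arange(p)       # illustrative impulse response j_1 ... j_p
u = rng.standard_normal(N)

# y(i) = sum_m j_{m+1} u(i-m): the convolution behind mapping (8)
y = np.array([sum(j_true[m] * u[t - m] for m in range(min(t + 1, p)))
              for t in range(N)])

# toeplitz matrix u_p of (9): row i holds u(i), u(i-1), ..., u(i-p+1)
Up = np.zeros((N, p))
for t in range(N):
    for m in range(min(t + 1, p)):
        Up[t, m] = u[t - m]

# least-squares estimate (10)
j_hat, *_ = np.linalg.lstsq(Up, y, rcond=None)
assert np.allclose(j_hat, j_true, atol=1e-6)
```

because the data here are generated by an exactly linear, noiseless model, the least-squares fit recovers the taps to numerical precision; with measurement noise the estimate would only be approximate, which is why the scheme keeps just the first l ≤ p taps.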
4 learning speed enhancement using the estimation of the impulse response
in this section, a new learning speed enhancement algorithm is presented using (10) and the learning scheme for unknown nmp systems. since the estimates of the impulse responses can be used to estimate the desired input, the learning speed can be enhanced. at k = 0, since u^0_[0,n−1] = 0, y^0_[σ+d_0, n+σ+d_0−1] will be zero. thus, u^1_[0,n−1] = s^0 y^d_[σ+d_0, n+σ+d_0−1]. here, we set s^0 to be an appropriate matrix, e.g., αi. for k ≥ 1, we estimate the impulse response j̄^k_[1,p] using (10) from u^k_[0,n−1] and y^k_[σ+d_0, n+σ+d_0−1], and derive j̄^k from the estimates of the first l impulse responses. likewise, for a sufficient k, e.g., k = 1, we can obtain the estimate of the desired input ū^d_[0,n−1] as follows:
ū^d_[0,n−1] = (j̄^k)^{−1} y^d_[σ+d_0, n+σ+d_0−1]. (11)
if the estimation is successful and l is sufficiently large, ū^d_[0,n−1] will be considerably close to u^d_[0,n−1]. for k + 1, e.g., k = 2, we can set u^k_[0,n−1] = ū^d_[0,n−1] and s = α(j̄^k)^{−1}. if s satisfies the convergence condition, we can obtain the desired input with the proposed method. through this approach, we can enhance the learning speed. we can summarize the learning rule as follows:
the proposed learning algorithm
• step 0: when k = 0, set s^0 to be an appropriate matrix and obtain u^1_[0,n−1].
• step 1: for the first iteration, obtain y^1_[σ+d_0, n+σ+d_0−1]. if ‖e^k_[σ+d_0, n+σ+d_0−1]‖ ≤ ε, then stop. else, derive j̄^k using (10), calculate the estimate of the desired input ū^d_[0,n−1] from (11) and set u^2_[0,n−1] = ū^d_[0,n−1].
• step k: for the k-th iteration, if ‖e^k_[σ+d_0, n+σ+d_0−1]‖ ≤ ε, then stop. set s = α(j̄^k)^{−1}, update the input using (3), increment k, and repeat step k until termination.
theorem 1.
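the steps above can be sketched end-to-end on a toy plant. with p = n and a noiseless linear plant, the least-squares pass of step 1 recovers j essentially exactly, so the estimated desired input from (11) already tracks y^d at k = 2; on a real, noisy nmp system, convergence would require the further iterations of step k. the plant, horizon and α below are our own choices, not the paper's example.

```python
import numpy as np

# toy first-order plant (sigma = 1, d0 = 0); a, b, c, n, alpha are illustrative
a, b, c, n, alpha = 0.5, 1.0, 1.0, 20, 0.5
yd = 1.0 + 0.5 * np.sin(2 * np.pi * np.arange(n) / n)   # desired output, nonzero at i = 0

def simulate(u):
    x, y = 0.0, np.zeros(n)
    for i in range(n):
        x = a * x + b * u[i]
        y[i] = c * x
    return y

def input_toeplitz(u):
    # matrix of (9) with p = n: row i holds u(i), u(i-1), ..., u(0)
    up = np.zeros((n, n))
    for i in range(n):
        up[i, : i + 1] = u[i::-1]
    return up

u1 = alpha * yd                               # step 0: u^1 = s^0 y^d with s^0 = alpha*i
y1 = simulate(u1)                             # step 1: run once, then estimate via (10)
j_hat, *_ = np.linalg.lstsq(input_toeplitz(u1), y1, rcond=None)
j_bar = np.zeros((n, n))
for i in range(n):
    j_bar[i, : i + 1] = j_hat[i::-1]          # lower-triangular toeplitz estimate of j
u2 = np.linalg.solve(j_bar, yd)               # estimated desired input, as in (11)
err_k2 = np.linalg.norm(yd - simulate(u2))    # tracking error already tiny at k = 2
```

the contrast with the fixed-gain loop is the point of the paper: one estimation pass replaces many small gradient-like corrections.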
suppose the nmp system (1) satisfies (a1)-(a4), the relative degree and the number of nmp zeros are known, and the system dynamics may not be known completely. assume that we update the input based on the proposed learning algorithm. if condition (4) holds for all k ≥ 1, the input u^k_[0,n−1] converges to u^d_[0,n−1] as k → ∞.
proof: this can be easily shown using lemma 2.1 and theorem 1 from [5]. □
figure 1: outputs and inputs for different values of k. (a) outputs using the proposed method and y^5 in [5]; (b) inputs using the proposed method and u^5 in [5]
5 simulation results
let us consider the example of an nmp system for a positioning table in [10]:
g(z) = (0.0082 z^4 + 0.031 z^3) / ((z + 0.29)(z − 0.2)(z − 0.46)(z^2 − 1.7z + 0.73))
this system has one nmp zero (z = −3.7805) and satisfies (a1)-(a4). the desired trajectory is given as
y^d(i) = 0 for i = 0, 1, 83, 84, 85, 86, and y^d(i) = −0.2 cos(0.05π(i − 2)) for 2 ≤ i ≤ 82. (12)
here, we set n = 85, s^0 = 0.1i and u(85) = u(86) = 0. the input update law is given as u^{k+1}_[0,84] = u^k_[0,84] + s^k e^k_[2,86]. from (10), we set p = 85. the impulse response is estimated using
j̄^k_[1,85] = ((u^k)^t u^k)^{−1} (u^k)^t y^k_[2,86]. (13)
in addition, j̄^1 is obtained using l = 30. then, we set u^2 = ū^d and enhance the learning speed. in this case, the convergence condition is satisfied, as ‖i − s^k j^k‖ < 0.568 when s^k = 0.5(j̄^k)^{−1}. figures 1(a) and 1(b) show the outputs and inputs for different values of k, respectively. the root mean square (rms) error of the output is 0.0012 for k = 2, smaller than the rms error of 0.0036 for k = 10 reported in [5]. from this example, we can see that the learning speed is significantly enhanced.
6 conclusion
in this paper, we have proposed a new learning speed enhancement algorithm of adilc for discrete-time nmp systems. first, we have presented an estimation algorithm for the impulse responses based on the input-output mapping of adilc.
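the reported nmp zero can be checked numerically. note that the second numerator term is taken as 0.031z^3: the exponent appears to have been lost in extraction, and z^3 is the value consistent with exactly one nmp zero at z = −3.7805 (with a term 0.031z the plant would instead have three zeros outside the unit circle).

```python
import numpy as np

# zeros of the numerator 0.0082*z**4 + 0.031*z**3 of the example plant
zeros = np.roots([0.0082, 0.031, 0.0, 0.0, 0.0])
nmp_zeros = [z for z in zeros if abs(z) > 1.0]   # zeros outside the unit circle
print(nmp_zeros)                                 # single real zero near -3.7805
```

the remaining three zeros sit at the origin, so they do not affect stability of the inverse mapping.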
next, the learning speed enhancement algorithm has been derived from a new learning gain, which is calculated from the estimates of the impulse response. simulation results for an nmp system have demonstrated the learning speed enhancement of the proposed method. robust algorithms against disturbances for learning control can be considered with the proposed method; this remains future work.
acknowledgment
this work was supported by the national research foundation of korea (nrf) grant funded by the korean government (msip) (nrf-2016r1a5a1012966), and also by the basic science research program through the national research foundation of korea (nrf) funded by the ministry of education (nrf-2015r1d1a1a01060917).
bibliography
[1] arimoto s., kawamura s., miyazaki f. (1984); bettering operation of robots by learning, journal of robotic systems, 1(2), 123-140, 1984.
[2] bien z., xu j.-x. (1998); iterative learning control - analysis, design, integration and applications, kluwer academic publishers, 1998.
[3] jang t.-j., ahn h.-s., choi c.-h. (1994); iterative learning control for discrete-time nonlinear systems, international journal of systems science, 25(7), 1179-1189, 1994.
[4] jeong g.-m., choi c.-h. (2002); iterative learning control for linear discrete time nonminimum phase systems, automatica, 38(2), 287-291, 2002.
[5] jeong g.-m., ji s.-h. (2013); iterative learning control with advanced output data using an estimation of the impulse response, ieice transactions on fundamentals, e96-a(6), 1488-1491, 2013.
[6] ngo t., wang y., mai t.l., ge j., nguyen m.h., wei s. n. (2012); an adaptive iterative learning control for robot manipulator in task space, international journal of computers communications & control, 7(3), 518-529, 2012.
[7] uchiyama m.
(1978); Formulation of high-speed motion pattern of a mechanical arm by trial, Transactions of the Society of Instrument and Control Engineers (in Japanese), 14(6), 706–712, 1978.
[8] Xia C., Deng W., Shi T., Yan Y. (2016); Torque ripple minimization of PMSM using parameter optimization based iterative learning control, Journal of Electrical Engineering and Technology, 11(2), 709–718, 2016.
[9] Xu J.-X. (1997); Direct learning of control efforts for trajectories with different magnitude scales, Automatica, 33(12), 2191–2195, 1997.
[10] Yamada M., Riadh Z., Funahashi Y. (1999); Design of discrete-time repetitive control system for pole placement and application, IEEE/ASME Transactions on Mechatronics, 4(2), 110–118, 1999.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 1, pp. 73-81

Modeling of Errors Realized by a Human Learner in Virtual Environment for Training

Thanh-Hai Trinh, Cédric Buche, Ronan Querrec, Jacques Tisseau
Université Européenne de Bretagne, École Nationale d'Ingénieurs de Brest, Laboratoire Informatique des Systèmes Complexes, Centre Européen de Réalité Virtuelle, Technopôle Brest-Iroise, 29238 Brest Cedex 3, France
E-mail: {trinh,buche,querrec,tisseau}@enib.fr

Abstract: This study focuses on the notion of erroneous actions realized by human learners in virtual environments for training. Our principal objective is to develop an intelligent tutoring system (ITS) suggesting pedagogical assistance to the human teacher. For that, the ITS must detect and classify the erroneous actions produced by learners during the realization of procedural and collaborative work. Further, in order to better support the human teacher and facilitate his comprehension, it is necessary to show the teacher why the learner made an error. Addressing this issue, we first model the Cognitive Reliability and Error Analysis Method (CREAM).
Then, we integrate the retrospective analysis mechanism of CREAM into our existing ITS, thus enabling the system to indicate the path of probable cause-effect links explaining why errors have occurred.
Keywords: intelligent tutoring system, erroneous actions, retrospective analysis

1 Introduction

In order to simulate procedural and collaborative work, we developed the model MASCARET (Multi-Agent System for Collaborative Adaptive and Realistic Environment for Training), where human learners and agents collaborate to realize a task [1]. Learners are gathered in teams consisting of several predefined roles; every role contains a number of actions to be realized by learners with specific resources. During the realization of the tasks, it is essential to take into account that human learners may make erroneous actions compared to their predefined correct procedure. In [2], we proposed a model of an intelligent tutoring system (ITS) whose principal objective is to suggest to the teacher pedagogical assistance adapted to the simulation context and to the learner's behaviours (including erroneous actions). However, that work exclusively concerns error detection and tagging. Once erroneous actions are detected in our existing ITS, they are classified into different types (cf. Figure 1(a)) whose explanations are based on a knowledge base of classical errors. In order to better support the teacher and facilitate his comprehension, what is lacking is a model that could explain why the learner made an error.
Our approach is based on the Cognitive Reliability and Error Analysis Method (CREAM) in the human reliability analysis field [3].

Figure 1: Error types and error phenotypes. (a) Error types in the ITS [2]: team error, procedural error, action error, usage error. (b) Dimensions of error modes [3]: timing (too early, too late, omission), duration (too long, too short), sequence (reversal, repetition, commission, intrusion), object (wrong action, wrong object), force (too much, too little), direction (wrong direction), distance (too far, too short), speed (too fast, too slow).

Copyright © 2006-2009 by CCC Publications

This approach proposes a classification scheme which makes a distinction between observations of errors (phenotypes, cf. Figure 1(b)) and their causes (genotypes), classified in three categories: M(an), T(echnology) and O(rganization). For example, if the learner made a mistake about the order of actions, the erroneous action observed is in phenotype sequence, and that can be further explained by some genotypes such as inattention (Man-related genotype), communication failure (Technology-related genotype), etc. The causal links between phenotype and genotype are represented using a number of consequent-antecedent links. Finally, the scheme can be associated with both a method of retrospective analysis (the search for causes) and a performance prediction method. However, given our goal of detecting erroneous actions and then searching for their causes, we are interested in analyses of the human learner's performance, in other words, in retrospective analyses.
Implementation of CREAM was the object of the work of El-Kechaï [4][5], which first proposed a task model named METISSE in order to recognize learners' plans in virtual environments for training (VET); this model could then be used to detect erroneous actions according to Hollnagel's classification. Nevertheless, the implementation of METISSE was not complete, and the integration of CREAM into a real ITS was not performed. In this paper, we first propose an approach to model CREAM (Section 2). Next, in Section 3, we present the integration of the retrospective analysis mechanism of CREAM into our existing ITS, as well as our evaluation.

2 Implementation of CREAM

2.1 Classification scheme representation

There are several graphic tools that keep track of analysis processes, such as the CREAM Navigator developed by Serwy and Rantanen [7]. However, this navigator is completely closed, in the sense that it does not maintain an explicit representation of possible error modes and probable causes. For that, [4] proposed using a rule base to represent the consequent-antecedent links; hence the search for causes was executed by backward inference. The limitation of this method obviously lies in the performance of the inference mechanism; another problem occurs when adding or removing potential errors, which demands considerable modification of the rule base. For our development, as suggested in [3], we intend to separate the analysis method (cf. Sections 2.3 and 2.4) from the representation of error modes, using a group of four XML data files detailed below:

questionnaire.xml: represents a list of questions from which we can evaluate the common performance conditions (see Section 2.2 below).
phenotype.xml: maintains the phenotypes and their antecedents (cf. Figure 2, which lists antecedents such as inadequate plan, inattention, earlier omission, ...).
Figure 2: Representation of phenotypes

genotype.xml: contains all possible causes classified into three groups (M, T, O); each group is then detailed into several categories. The important point is that this data file also represents the relations between each consequent and its antecedents (cf. Figure 3, which lists genotypes such as distraction, excessive demand, error in goal, inadequate training, ...).

Figure 3: Representation of genotypes

repartition.xml: determines the repartition of the specific antecedents (cf. Figure 4) among the three factors (M, T, O), which serves to initialize the mass of each specific antecedent as a probable cause (cf. Section 2.4).

Figure 4: Repartition of specific antecedents in the three factors (M, T, O)

Finally, considering that CREAM is naturally a flexible method, adaptable to different analysis contexts, this strategy of classification scheme representation permits customizing the scheme without any modification of the analysis method.

2.2 Define the common performance conditions (CPCs)

In CREAM, Hollnagel highlighted that the context strongly influences human actions. It is therefore essential to take into account the description of the virtual environment in which the human learner is immersed. The objective is to determine how each factor (M, T, O) influences the training context. Here, we are inspired by the proposition presented in [5], using a predefined questionnaire which is answered by the teacher before the training session (cf. Figure 5).

Figure 5: Define the CPCs by questionnaire [5]

Next, each factor is assigned one coefficient calculated using the formula below:

coefficient_{group i} = (number of "yes" answers associated to group i) / (total number of "yes" answers)    (1)

where group i is respectively one of (Man, Technology, Organization). These values permit defining the most probable factor leading to erroneous actions.
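Formula (1) can be sketched in a few lines of code. The questionnaire answers below are hypothetical, chosen only to illustrate the normalization over "yes" answers; they are not the actual questionnaire of [5].

```python
# Sketch of the CPC coefficient computation of formula (1).
# The answers are hypothetical; only the normalization over
# "yes" answers comes from the text.

def cpc_coefficients(answers):
    """answers: list of (group, answered_yes) pairs, group in {"M", "T", "O"}.
    Returns the coefficient of each group per formula (1)."""
    yes_per_group = {"M": 0, "T": 0, "O": 0}
    for group, answered_yes in answers:
        if answered_yes:
            yes_per_group[group] += 1
    total_yes = sum(yes_per_group.values())
    if total_yes == 0:
        return {g: 0.0 for g in yes_per_group}
    return {g: n / total_yes for g, n in yes_per_group.items()}

# Hypothetical teacher answers: 2 "yes" for Man, 1 for Technology, 1 for Organization
answers = [("M", True), ("M", True), ("T", True), ("T", False), ("O", True)]
coeffs = cpc_coefficients(answers)
print(coeffs)  # {'M': 0.5, 'T': 0.25, 'O': 0.25}
```

By construction the coefficients sum to one, so they can be read as a discrete distribution over the three factors.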
2.3 Modelling of consequent-antecedent relations

One advantage of CREAM lies in its recursive analysis approach, rather than the strictly sequential one of other traditional analysis methods. This also leads to a non-hierarchical data structure to connect the direct as well as indirect links: (i) between a phenotype and its antecedents; and (ii) between a consequent and its antecedents. Figure 6 shows our model representing the consequent-antecedent connections. Here, we construct a causal graph in which we use the term node to denote either a consequent or an antecedent. Each node is described by its name, the group of error modes it is associated with, and its category within that group; a textual description helps better explain the error's semantics in a particular context. The boolean attribute terminal permits identifying whether the node is a terminal cause or not. Most importantly, each node contains two lists: one holds its antecedents, the other points to its consequents; in other words, they represent the edges into and out of the node in the causal graph.
At last, each node must also include a value of mass, which represents the certitude of choosing this node as a probable cause.

Figure 6: UML diagram for modeling consequent-antecedent links. The class Node carries the attributes _name, _group, _category, _description, _mass and _terminal, together with the lists _list_antecedent and _list_consequent, and the methods addAntecedent(), addConsequent() and calculateMass(). A Util class reads the four data files (questionnaire.xml, phenotype.xml, genotype.xml, repartition.xml) through getQuestionnaire(), getPhenotypes(), getSpecificAntecedents(), getGeneralConsequents() and getGeneralAntecedents(). The class GenotypeAnalyzer holds the _graph attribute and the methods getAntecedentFromPhenotype(), getGenotypeFromAntecedent(), findSpecificAntecedentRepartition(), createGraphFromPhenotype(), findListTerminal() and sortListTerminal().

The two methods addAntecedent() and addConsequent() serve to maintain the two lists of antecedents and consequents of a node. Note that once a node calls the method addAntecedent() to add a "parent" node as one of its antecedents, the node also adds itself to the consequents list of the "parent" node (using the parent's addConsequent() method), and its terminal attribute is then set to false.

2.4 Search for the causes

The retrospective analysis is executed by a GenotypeAnalyzer whose graph attribute is initialized to point to the input phenotype (root node); the analyzer then calls the appropriate methods to find the root causes (the nodes whose terminal attribute is true). This mechanism is presented below (cf. Algorithm 1).
Algorithm 1 Retrospective analysis
Require: phenotype of the erroneous action
1: Initialization: construct the "root" node pointing to the input phenotype
2: {Step 1: find the antecedents of the input phenotype}
3: Read file phenotype.xml and find all general antecedents of the input phenotype
4: for each antecedent do
5:   add it to the antecedents list of the "root" node
6: end for
7: {Step 2: construct the causal graph}
8: for each unvisited node in the graph do
9:   find its antecedents in file genotype.xml
10:  add them to its antecedents list
11: end for
12: repeat Step 2.

This recursive search terminates when the selected node is a specific antecedent node or a general consequent node without antecedents. With this algorithm, we finally obtain a causal network in which each node is associated with its antecedents and consequents. The "leaves" are terminal nodes (or "root" causes) whose antecedents lists are empty. In order to calculate the certitude of choosing each node as a probable cause, we inherit the proposition presented in [5] using Dempster-Shafer's evidence theory:

mass(a) = coefficient(g(a)) × Σ_{c ∈ cons(a)} [ mass(c) / Σ_{i ∈ {M,T,O}} (coefficient(i) × n_i^c) ]    (2)

where:
• mass(a): mass of antecedent a
• g(a): group of a
• cons(a): consequents list of a
• coefficient(i): coefficient of group i calculated by formula (1)
• n_i^c: number of antecedents of c classified in group i

Finally, Figure 7 illustrates our tool CREAM Explorer, developed in this phase, which permits maintaining the error scheme, answering the questionnaire to define the CPCs, and executing the retrospective analysis.

Figure 7: CREAM Explorer
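The mass propagation of formula (2) can be sketched over a toy causal graph. The node names and links below are hypothetical examples, not the actual scheme of [5]; only the formula itself comes from the text.

```python
# Sketch of the mass propagation of formula (2) over a toy causal graph:
#   mass(a) = coeff(group(a)) * sum_{c in cons(a)} mass(c) / norm(c),
#   norm(c) = sum_{i in {M,T,O}} coeff(i) * n_i^c,
# where n_i^c is the number of antecedents of c classified in group i.
# The graph (one phenotype "sequence" with two antecedents) is hypothetical.

COEFF = {"M": 0.5, "T": 0.25, "O": 0.25}   # CPC coefficients from formula (1)

# antecedents[c] lists (antecedent_name, group) pairs of consequent c
antecedents = {
    "sequence": [("inattention", "M"), ("communication failure", "T")],
}
mass = {"sequence": 1.0}                    # the observed phenotype has mass 1

def antecedent_mass(group, consequents):
    """Mass of an antecedent in the given M/T/O group over its consequents."""
    total = 0.0
    for c in consequents:
        # normalizer: antecedents of c weighted by their group coefficients
        norm = sum(COEFF[g] for _, g in antecedents[c])
        total += mass[c] / norm
    return COEFF[group] * total

m_inattention = antecedent_mass("M", ["sequence"])
m_comm_failure = antecedent_mass("T", ["sequence"])
print(m_inattention, m_comm_failure)
```

Ordering the resulting masses reproduces the "most probable cause first" ranking used later in the experiment tables.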
3 Integration of retrospective analysis into our existing ITS

3.1 Learner's plan recognition

In order to detect the erroneous actions realized by a human learner, it is indispensable to know: (i) the learner's activities in the past; (ii) his current action (in the sense that the action has just been done); (iii) the actions that the human learner intends to do according to a predefined correct procedure. Our existing ITS, as proposed in [2], is based on the model MASCARET [1], where we used a multi-agent system to simulate collaboration between human learners and agents during their realization of tasks. Learners are gathered in teams consisting of several predefined roles; every role contains a number of tasks eventually associated with accurate resources; every learner also owns an epistemic memory containing all the actions realized in the past, etc. Finally, we can retrieve from MASCARET the following information relating to the learner's plan in the VET:

• Action(s) before: the learner's action(s) in the past (note that, in MASCARET, every action is eventually associated with its accurate resource(s))
• Current action: the action that has just been done by the learner
• Action(s) correct (according to role): action(s) that must be done by the learner in his role(s)
• Action(s) correct (according to plan): action(s) that may be done by learners in the context.

Here, it is essential to make the distinction between action(s) correct according to role and action(s) correct according to plan. The first case, because the learner could play several roles, represents all correct actions that the system expects from the learners. The second concerns the cases where more than one learner in the VET realize a mission together. In this case, it is therefore possible that a learner performs a correct action according to the plan which is nevertheless not correct with respect to his role.
• Next correct action(s) in the role: the next action(s) that must be done by the learner in his role(s)
• Full correct plan: the description of all accurate actions (associated with resources) in the predetermined procedure that the learner must respect.

In the next section, we present our mechanism for mapping erroneous actions detected by our existing ITS onto Hollnagel's classification scheme of error modes.

3.2 Classification of erroneous actions according to the CREAM scheme

Erroneous actions in phenotype "sequence"

According to Hollnagel, performing an action at the wrong place in a sequence or procedure is a common erroneous action, and it is the most realistic one in our context of simulation of procedural and collaborative work. The "sequence" problem consists of several specific effects: omission (an action was not carried out); jump forward / jump backwards (actions in a sequence were skipped / carried out again); repetition (the previous action is repeated); reversal (the order of two neighbouring actions is reversed); wrong action (an extraneous or irrelevant action is carried out).
We present below our mechanism to detect erroneous actions in phenotype "sequence":

Algorithm 2 Detection of erroneous actions in phenotype sequence
1: if current action exists in actions correct according to role then
2:   this is a correct action (phenotype sequence does not occur)
3: else
4:   if current action does not exist in actions correct according to plan then
5:     specific effect = "wrong action"
6:   else
7:     if current action exists in last action before then
8:       specific effect = "repetition"
9:     end if
10:    compare the relative order of current action to the order of next correct action(s) in the role using the full correct plan
11:    if id current action < id correct action in role then
12:      specific effect = "jump backwards and/or omission"
13:    else
14:      specific effect = "jump forward and/or omission"
15:    end if
16:    if id current action = id correct action in role + 1 then
17:      specific effect = "reversal"
18:    end if
19:  end if
20: end if

Erroneous actions in phenotype "wrong object"

In [3], the author clarified that "action at wrong object" is one of the more frequent error modes, such as pressing the wrong button, looking at the wrong indicator, etc. In our context, during the realisation of collaborative work, it is possible that a learner performs a correct action but on a wrong object. Therefore, the detection of erroneous actions in phenotype "wrong object" must be implemented independently of the detection of phenotype "sequence". This phenotype is detailed into the following specific effects: neighbour/similar object (an object that is proximate/similar to the object that should have been used); unrelated object (an object that was used by mistake).
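The branching of Algorithm 2 can be sketched as follows. The representation of the plan (actions identified by ids ordered by the full correct plan) and the action names are assumptions for illustration only; the conditions mirror the algorithm's steps.

```python
# Sketch of Algorithm 2 (phenotype "sequence").  The plan representation
# (actions as ids ordered by the full correct plan) and the action names
# are hypothetical; the branching follows the algorithm in the text.

def detect_sequence(current, role_correct, plan_correct, last_action,
                    next_role_action_id, plan_order):
    """Return the specific effects of the current action, or [] if correct."""
    if current in role_correct:
        return []                                   # correct action
    if current not in plan_correct:
        return ["wrong action"]
    effects = []
    if current == last_action:
        effects.append("repetition")
    cur_id = plan_order[current]                    # relative order in the full plan
    if cur_id < next_role_action_id:
        effects.append("jump backwards and/or omission")
    else:
        effects.append("jump forward and/or omission")
    if cur_id == next_role_action_id + 1:
        effects.append("reversal")
    return effects

# Hypothetical plan a1..a4 in order; the learner's role expects a2 next,
# but he performs a3 right after a1
plan_order = {"a1": 1, "a2": 2, "a3": 3, "a4": 4}
effects = detect_sequence("a3", role_correct={"a2"},
                          plan_correct={"a1", "a2", "a3", "a4"},
                          last_action="a1", next_role_action_id=2,
                          plan_order=plan_order)
print(effects)  # ['jump forward and/or omission', 'reversal']
```

Note that, as in the algorithm, several specific effects can be reported for a single erroneous action (here both a jump forward and a reversal).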
In order to detect erroneous actions in phenotype "wrong object", we use the same principle presented in the case of phenotype "sequence", with the following information retrieved from the model MASCARET:

• Current resource: the resource associated with the current action
• Resource(s) correct (according to role): resource(s) that must be used by the learner in his role(s)
• Resource(s) correct (according to plan): the list of resource(s) associated with all action(s) in action(s) correct according to plan.

Our algorithm is detailed below:

Algorithm 3 Detection of erroneous actions in phenotype wrong object
1: if current resource exists in resource(s) correct according to role then
2:   this is a correct resource (phenotype wrong object does not occur)
3: else
4:   if current resource does not exist in resources correct according to plan then
5:     specific effect = "unrelated object"
6:   else
7:     specific effect = "neighbour and/or similar object"
8:   end if
9: end if

Erroneous actions in phenotype "time/duration"

The phenotype time/duration is divided into several specific effects: too early / too late (an action started too early / too late); omission (an action that was not done at all); too long / too short (an action that continued beyond / was stopped before the point when it should have ended). Hollnagel noted that the error modes of timing and duration refer to a single action, rather than to the temporal relation between two or more actions. In our context, the realization of tasks in the model MASCARET is sequential; therefore, an action is considered to be too early when it was realized before several actions in the plan; likewise, action(s) are considered to be omitted when they were not carried out.
Finally, in order to detect erroneous actions in phenotype time/duration, we propose that:

• an action having specific effect jump forward also has specific effect too early
• an action described by specific effect omission (in error mode sequence) will be considered as an action having specific effect omission (in error mode time/duration)

3.3 Experiment & results

In order to evaluate our integration of retrospective analysis into the ITS, we work with the GASPAR application [6], whose objective is to simulate aviation activities by virtual reality. The learners are immersed in a virtual environment simulating an aircraft carrier in order to realize tasks together. During the realization of these collaborative works, our ITS follows the learners and applies the algorithms depicted above to detect the learners' erroneous actions. Next, to interpret the causes of the errors, we use the classification scheme of error modes proposed in [5], which was particularly adapted to VET. Table 1 and Table 2 respectively illustrate the results of retrospective analysis for the phenotypes sequence and wrong object. We change the coefficients of the three factors (M, T, O) to evaluate how the CPCs influence the analysis result. For each phase in the analysis process, we select and display the most probable causes by ordering the mass values.
Table 1: Causal links of phenotype sequence

Coefficient (M, T, O) | Causal links
(0.333, 0.333, 0.333) | 1. design failure (0.125) → inadequate scenario (0.125) → sequence
                      | 2. adverse ambient condition (0.125) → inattention (0.125) → sequence
                      | 3. long time since learning (0.042) → memory failure (0.125) → sequence
(1, 0, 0)             | 1. other priority (0.2) → memory failure (0.2) → sequence
                      | 2. error in mental model (0.067) → faulty diagnosis (0.2) → sequence
                      | 3. erroneous analogy (0.067) → faulty diagnosis (0.2) → sequence
(0, 1, 0)             | 1. equipment failure (0.1) → access problems (0.5) → sequence
                      | 2. distance (0.1) → access problems (0.5) → sequence
                      | 3. localisation problem (0.1) → access problems (0.5) → sequence
(0, 0, 1)             | 1. noise (1) → communication failure (1) → sequence

Table 2: Causal links of phenotype wrong object

Coefficient (M, T, O) | Causal links
(0.333, 0.333, 0.333) | 1. access problems (0.125) → wrong object
                      | 2. design failure (0.125) → inadequate scenario (0.125) → wrong object
                      | 3. adverse ambient condition (0.042) → inattention (0.125) → wrong object
(1, 0, 0)             | 1. fatigue (0.1) → performance variability (0.2) → wrong object
                      | 2. virtual reality sickness (0.1) → performance variability (0.2) → wrong object
                      | 3. anticipation (0.05) → wrong identification (0.2) → wrong object
(0, 1, 0)             | 1. access problems (0.5) → wrong object
(0, 0, 1)             | 1. noise (1) → communication failure (1) → wrong object

4 Conclusion & future work

In this paper, we proposed an approach to modelling the Cognitive Reliability and Error Analysis Method (CREAM). We separated the representation of the classification scheme of erroneous actions from the analysis method; therefore, our description of error modes is adaptable to different training contexts without any modification of the analysis method. We started by defining the common performance conditions; then the direct and indirect consequent-antecedent relations were modelled using a non-hierarchical data structure.
Finally, the most probable cause-effect links can be found using Dempster-Shafer's theory as presented in [5]. In order to integrate the retrospective analysis described above into our existing ITS, we relied on the model MASCARET to retrieve information concerning the learners' plans and then detect erroneous actions. Finally, we presented our proposition for mapping erroneous actions onto Hollnagel's classification. The experimental results in the GASPAR project were also presented. Thus, in addition to the detection and tagging of erroneous actions, the ITS can furthermore indicate the path of probable cause-effect links explaining why the errors occurred. In future work, we will concentrate our attention on the evolution of MASCARET so that this model can describe more complex tasks, taking into account other factors such as force, distance, speed, direction, etc. Hence, other types of error modes could be detected and then explained using the retrospective analysis.

Acknowledgement

This article is an extended version of our paper [8] published in the Proceedings of the 3rd International Conference on Virtual Learning (ICVL'08). The authors would like to thank the scientific committee of ICVL'08 (chaired by Dr. Grigore Albeanu) that recommended the publishing of our extended work in IJCCC.

Bibliography

[1] R. Querrec, C. Buche, E. Maffre, and P. Chevaillier, Multiagent systems for virtual environment for training: application to fire-fighting, International Journal of Computers and Applications (IJCA), pp. 25-34, June 2004.
[2] C. Buche and R. Querrec, Intelligent tutoring system for MASCARET, in Simon Richir and Bernard Taravel, editors, 7th Virtual Reality International Conference (VRIC'05), pp. 105-108, April 2005, Laval, France.
[3] E. Hollnagel, Cognitive Reliability and Error Analysis Method, Oxford: Elsevier Science Ltd, 1998.
[4] N. El-Kechaï and C.
Després, A plan recognition process, based on a task model, for detecting learner's erroneous actions, in Intelligent Tutoring Systems (ITS 2006), pp. 329-338, June 2006, Jhongli, Taiwan.
[5] N. El-Kechaï and C. Després, Proposing the underlying causes that lead to the trainee's erroneous actions to the trainer, in EC-TEL: European Conference on Technology Enhanced Learning, pp. 41-55, September 2007, Crete, Greece.
[6] N. Marion, C. Septseault, A. Boudinot and R. Querrec, GASPAR: Aviation management on an aircraft carrier using virtual reality, in Cyberworlds, 2007.
[7] R.D. Serwy and E.M. Rantanen, CREAM Navigator, http://www.ews.uiuc.edu/ serwy/cream/v0.6beta/, version 0.6, September 2007.
[8] T.H. Trinh, C. Buche and J. Tisseau, Modeling of errors realized by a human learner in virtual environment for training, 3rd International Conference on Virtual Learning (ICVL 2008), pp. 71-80, October 31, Constanta, Romania.

Thanh-Hai Trinh received his MSc in Artificial Intelligence & Multimedia at the Francophone Institute for Computer Science. He is currently a PhD student at CERV (the Virtual Reality European Centre, Brest, France). His current research interests are in the applications of multi-agent systems and artificial intelligence in virtual environments for training.

Cédric Buche is an assistant professor in computer science and works at CERV. He works on behavioral agent modeling applied to virtual environments for human learning. He is the leader of the ITS project in MASCARET.

Ronan Querrec is an assistant professor in computer science and works at CERV. His research concerns virtual environments for training. In this theme, he works on the MASCARET project, a virtual environment meta-model.

Jacques Tisseau is a professor in computer science at the engineering school of Brest (ENIB), where he leads the Computer Science for Complex Systems Laboratory (LISyC).
His research focuses on autonomous virtual entities, interaction with these entities, and the epistemology of virtual reality.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 2, pp. 185-194

Fault Detection for Large Scale Systems Using Dynamic Principal Components Analysis with Adaptation

Jesús Mina, Cristina Verde

Abstract: Dynamic principal component analysis (DPCA) is an adequate tool for the monitoring of large scale systems based on a model of multivariate historical data under the assumption of stationarity; however, false alarms occur for non-stationary new observations during the monitoring phase. In order to reduce the false alarm rate, this paper extends DPCA-based monitoring to non-stationary data of linear dynamic systems, including an on-line means estimator to standardize new observations according to the estimated means. The effectiveness of the proposed methodology is evaluated for fault detection in an interconnected-tanks system.
Keywords: fault detection, statistical analysis, dynamic principal component analysis, time series analysis, non-stationary signals.

1 Introduction

On-line process monitoring for fault detection and isolation (FDI) is an important task to ensure plant safety and product quality. One of the most consolidated FDI techniques of the last twenty years is the analytical approach, which is based on explicit modeling, that is, models obtained from primary physical principles; some of the analytical approaches are very well reviewed, e.g. in [1]. In the case of large scale processes whose analytical models are not available or are difficult to obtain, data-driven FDI techniques can help to overcome the modeling problem; these techniques rely on implicit modeling through multivariate statistical methods, some of which are summarized in [2].
Principal component analysis (PCA) is a multivariate statistical method which models the linear correlation structure of a multivariate process from nominal historical data. PCA transforms a set of multivariate observations to a lower-dimension orthogonal space, retaining most of the variability of the original data [3]. Because of the simplification and the orthogonality obtained with PCA, it has been used with success for fault diagnosis, as in [4] and [5]. It is important to note that, like other multivariate statistical methods, PCA works under three assumptions: the data follow a multivariate normal distribution; there is no auto-correlation among observations; and the variables are stationary, that is, they keep constant mean and standard deviation over time [6], [7]. In the case of data with a non-normal distribution it is possible to carry out an appropriate transformation, like square root or logarithm [8], in order to improve the distribution of the data. In the case of dynamic systems, the auto-correlation in the variables is taken into account by incorporating time lags of the time series during the modeling stage; this extension is called dynamic principal component analysis (DPCA) [9]. Since PCA and DPCA assume stationarity during the modeling process, a high rate of false alarms is generated in the diagnosis stage if the test data are non-stationary. The non-stationarity problem has been tackled with adaptive versions of PCA, as in [10] and [11]. Although these algorithms adapt the means, the covariance, and the PCA model, they cannot be used for FDI tasks, since the adaptation is based on the variations of the actual multivariate observations without distinguishing the real causes of the changes in the variables.
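The PCA modeling described above can be sketched with a standard SVD on standardized data. The data below are synthetic (two correlated variables plus one independent one), not the tanks system of the paper.

```python
# Minimal PCA sketch: standardize nominal data, then obtain the principal
# directions via SVD.  Synthetic data; illustrative only.
import numpy as np

rng = np.random.default_rng(0)
# 200 nominal observations of 3 variables; the first two are strongly correlated
t = rng.normal(size=200)
X = np.column_stack([t,
                     2 * t + 0.1 * rng.normal(size=200),
                     rng.normal(size=200)])

# standardize with respect to means and standard deviations
Z = (X - X.mean(axis=0)) / X.std(axis=0)

# principal components from the SVD of the standardized data matrix
U, S, Vt = np.linalg.svd(Z, full_matrices=False)
explained = S**2 / np.sum(S**2)        # fraction of variance per component
print(explained)                        # the first component dominates here

# score on the first principal component: the lower-dimensional representation
scores = Z @ Vt[0]
```

Because the first two variables are nearly collinear, a single component retains most of the variability, which is exactly the dimensionality reduction PCA exploits for monitoring.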
Copyright © 2006-2007 by CCC Publications. Supported by DGAPA-UNAM-IN11403-2 and the EOLI project of the European Community INCO program, contract number ICA4-CT-2002-10012.

A non-stationary condition has many possible causes, e.g. component aging, faults, or even normal changes in the operating point of the plant; this problem motivated the development of a fault detection algorithm which is robust to changes in the operating point but sensitive to faults. The proposal is based on the fact that the correlation structure between system signals is invariant under nominal conditions, i.e. the relations between the variables remain the same despite nominal changes in the means of the signals; this is a result of the affinity property of nominal signals in linear systems [12]. Therefore this work proposes: in the modeling stage, to obtain a nominal DPCA model from nominal historical data and identify a set of nominal inputs-output relations; in the diagnosis stage, to keep the nominal DPCA model but estimate the actual means of the input variables through an exponentially weighted moving average (EWMA), estimate the means of the output variables from the input means through the identified nominal inputs-output relations, and carry out an appropriate standardization with respect to the estimated means. In the following sections the recursive mean estimation process is summarized; next, the proposed extension of DPCA-based fault detection to changes in the operating point is described. Finally, the methodology is evaluated for fault detection in a system of three interconnected tanks.

2 Background

2.1 Identification of the inputs-output relations

We propose recursive estimation of the input-signal means using EWMA, and estimation of the output means from the estimated input means through the nominal inputs-output relations.
Here it is proposed to identify the inputs-output relations with moving average models, MA(q), one for each output variable. Let us consider the case of a MIMO linear system with $r$ inputs and $s$ outputs:

$$y = A u \qquad (1)$$

$$u = u_n + \eta, \qquad y = y_n + \nu \qquad (2)$$

where $\eta$ and $\nu$ are stationary white noise vectors added to the inputs and to the outputs, with zero mean and variances $\sigma_\eta^2$ and $\sigma_\nu^2$, respectively. Each output variable can be expressed as a linear combination of the inputs with corresponding time lag orders $q_{1i}, q_{2i}, \dots, q_{ri}$:

$$y_{n_i}(t) = \sum_{k=0}^{q_{1i}} a_{k1i}\, u_1(t-k) + \cdots + \sum_{k=0}^{q_{ri}} a_{kri}\, u_r(t-k) \qquad (3)$$

for $i = 1, \dots, s$. The parameters $a_{k1i}, \dots, a_{kri}$ are obtained through correlation analysis [13]. The identified input-output relations can be expressed in compact form as

$$\hat{y}_{n_i}(t) = \hat{A}_i\, \vec{u}_i(t) \qquad (4)$$

where

$$\hat{A}_i = \left[\, \hat{a}_{01i} \cdots \hat{a}_{q_{1i}1i} \cdots \hat{a}_{0ri} \cdots \hat{a}_{q_{ri}ri} \,\right], \qquad \vec{u}_i(t) = \left[\, u_1(t) \cdots u_1(t-q_{1i}) \cdots u_r(t) \cdots u_r(t-q_{ri}) \,\right]^T$$

The orders $q_{ji}$ of the model (4) are selected through a validation procedure, taking the minimal value of the sum of squared errors as a function of $q_{ji}$. Finally, from (4) for all of the outputs, the relations in matrix notation are given by

$$\hat{y}_n(t) = \hat{A}\, \vec{u}(t) \qquad (5)$$

where

$$\hat{y}_n(t) = \left[\, \hat{y}_{n_1}(t) \cdots \hat{y}_{n_s}(t) \,\right]^T_{(s \times 1)}, \qquad \vec{u}(t) = \left[\, u_1(t) \cdots u_1(t-l) \cdots u_r(t) \cdots u_r(t-l) \,\right]^T_{(o \times 1)}$$

and $\hat{A} \in \Re^{s \times o}$ is made up of the coefficients

$$\hat{a}_{kji} = \begin{cases} \hat{a}_{kji}, & k \le q_{ji} \\ 0, & q_{ji} < k \le l \end{cases} \qquad j = 1, \dots, r, \quad i = 1, \dots, s$$

with $l = \max\{q_{11}, \dots, q_{r1}, \dots, q_{1s}, \dots, q_{rs}\}$ and $o = r(l+1)$.

2.2 Recursive mean estimation

Once the MA models (5) for the system (1) have been identified, the recursive mean estimation is carried out in the following way. The recursive estimate of the input means is computed as

$$\hat{\mu}_{\vec{u}}(t) = \beta\, \hat{\mu}_{\vec{u}}(t-1) + (1-\beta)\, \vec{u}(t) \qquad (6)$$

with $0 < \beta \le 1$ the forgetting factor of the EWMA.
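The EWMA recursion above is straightforward to implement; the following sketch (our own, with names of our choosing) computes the trajectory of the recursive mean estimate for a sequence of input observations:

```python
import numpy as np

def ewma_means(U, beta=0.95):
    """Recursive input-mean estimation per Eq. (6):
    mu_hat(t) = beta * mu_hat(t-1) + (1 - beta) * u(t).

    U is a (T x r) array of input observations; returns the (T x r)
    trajectory of the estimated means, initialized at the first sample.
    """
    U = np.asarray(U, dtype=float)
    if U.ndim == 1:                      # accept a single input signal too
        U = U[:, None]
    mu = U[0].copy()                     # initial estimate
    trail = np.empty_like(U)
    for t, u_t in enumerate(U):
        mu = beta * mu + (1.0 - beta) * u_t
        trail[t] = mu
    return trail
```

A constant input leaves the estimate at that constant, while after a step change the estimate tracks the new level with a time constant set by the forgetting factor (about $1/(1-\beta)$ samples).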
On the other hand, the output means at time instant $t$ are estimated from the input means given in (6) and through (5), according to

$$\hat{\mu}_y(t) = \hat{\mu}_{y_n}(t) = \hat{A}\, \hat{\mu}_{\vec{u}}(t) \qquad (7)$$

3 DPCA-based fault detection with mean parameter estimation

The DPCA statistical tool is used to obtain an implicit model of a dynamic system from nominal historical data, and this implicit model is then used to carry out fault detection tasks. The proposed algorithm is illustrated in Fig. 1. The idea is not only to obtain a DPCA-based statistical model but also to identify the nominal inputs-output relations. Then, by estimating the actual means of the input variables through EWMA and estimating the means of the output variables from the input means using the nominal inputs-output relations, an appropriate standardization can be carried out.

Figure 1: Proposed fault detection algorithm. (a) Modeling stage, carried out off-line; (b) detection stage, carried out on-line.

3.1 DPCA-based statistical modeling

Let the matrix $X$ be a set of historical data made up of $n_t$ observations of $r$ input variables and $s$ output variables, taken from a dynamic system working under nominal conditions and around an operating point:

$$X = \left[\, u_1 \cdots u_r \;\; y_1 \cdots y_s \,\right]_{(n_t \times p)} \qquad (8)$$

Each column of $X$ represents an auto-correlated time series. In DPCA the serial correlation is included by constructing the so-called trajectory matrix, applying $w$ time lags to each time series:

$$\vec{X} = \left[\, \vec{u}_1 \cdots \vec{u}_r \;\; \vec{y}_1 \cdots \vec{y}_s \,\right]_{(n \times m)} \qquad (9)$$

where, e.g., $\vec{u}_1 = [u_1(t)\; u_1(t-1)\; \dots\; u_1(t-w)]$, $n = n_t - w$ and $m = p(w+1)$. To avoid some particular variables dominating the modeling process, it is convenient to standardize the data in the matrix $\vec{X}$ with respect to its means and standard deviations. Thus, the means of $\vec{X}$ are given by

$$\hat{\mu}_{\vec{X}} = \left[ \tfrac{1}{n} \vec{X}^T \mathbf{1} \right]^T = \left[\, \hat{\mu}_{\vec{u}} \;\; \hat{\mu}_{\vec{y}} \,\right]_{(1 \times m)} \qquad (10)$$

where $\mathbf{1} = [1, 1, \dots, 1]^T \in \Re^n$.
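The trajectory matrix of Eq. (9) is a time-lagged embedding of each original series. A minimal sketch of its construction (our own code, not the authors'):

```python
import numpy as np

def trajectory_matrix(X, w):
    """Build the DPCA trajectory matrix of Eq. (9): each of the p columns
    of X (nt observations) is expanded with w time lags, giving an
    (nt - w) x p*(w + 1) matrix whose row at time t holds
    [x(t), x(t-1), ..., x(t-w)] for every variable."""
    X = np.asarray(X, dtype=float)
    nt, p = X.shape
    cols = []
    for j in range(p):                 # for each original time series
        for k in range(w + 1):         # lags 0 .. w
            cols.append(X[w - k : nt - k, j])
    return np.column_stack(cols)
```

For a single series $0, 1, \dots, 5$ with $w = 2$, the rows are $[2,1,0]$, $[3,2,1]$, $[4,3,2]$, $[5,4,3]$, i.e. $n = n_t - w = 4$ rows and $m = p(w+1) = 3$ columns.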
On the other hand, the covariance matrix of $\vec{X}$ is given by

$$S = \frac{1}{n-1} \left( \vec{X} - \mathbf{1}\hat{\mu}_{\vec{X}} \right)^T \left( \vec{X} - \mathbf{1}\hat{\mu}_{\vec{X}} \right)$$

from which the standard deviations can be obtained:

$$\hat{\sigma}_{\vec{X}} = \sqrt{\mathrm{diag}(S)} = \left[\, \hat{\sigma}_{\vec{u}} \;\; \hat{\sigma}_{\vec{y}} \,\right]_{(1 \times m)} \qquad (11)$$

Thus, the data standardization is computed in the following way:

$$\tilde{X}(i,j) = \frac{\vec{X}(i,j) - \hat{\mu}_{\vec{X}}(j)}{\hat{\sigma}_{\vec{X}}(j)}$$

for $i = 1, \dots, n$ and $j = 1, \dots, m$. The uncorrelated principal components $Z$, of dimension $n \times l$, are obtained through the transformation

$$Z = \tilde{X} V_T \qquad (12)$$

where the orthonormal transformation matrix $V_T \in \Re^{m \times l}$ is composed of an appropriate selection of $l$ eigenvectors, called loading vectors, of the correlation matrix $R = \frac{1}{n-1}\tilde{X}^T\tilde{X}$. The data matrix $\tilde{X}$ can be expressed as

$$\tilde{X} = \hat{\tilde{X}} + E = Z V_T^T + E$$

where $\hat{\tilde{X}}$ is the information captured by the $l$ principal components and $E$ is the information in the neglected $m - l$ components. So, for detection purposes it is possible to use the Hotelling statistic $T^2_{z_i}$ from $Z$ and/or the squared prediction error (SPE) from $E$. In this paper the Hotelling statistic is used for fault detection, because our interest is not the evaluation of these two statistical parameters but the illustration of the reduction of false alarms. For each $l$-variate observation in $Z$, the Hotelling univariate statistic $T^2_{z_i}$ is given by

$$T^2_{z_i} = z_i S_z^{-1} z_i^T \qquad (13)$$

where $S_z$ is the covariance matrix of $Z$. Finally, a threshold for the normal condition is calculated from the probability density function of the set of parameters $T^2_{z_i}$. For a beta distribution of the data set $T^2_{z_i}$, [6] proposes, among others, the threshold

$$UCL = \frac{(n-1)^2 \left( \frac{l}{n-l-1} \right) F\!\left( \frac{\alpha}{2};\, l,\, n-l-1 \right)}{n \left( 1 + \left( \frac{l}{n-l-1} \right) F\!\left( \frac{\alpha}{2};\, l,\, n-l-1 \right) \right)} \qquad (14)$$

where $n$ and $l$ are the dimensions of $Z$ and $\alpha$ is a level of significance.
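Equations (13) and (14) can be sketched directly; in the snippet below (our own illustration) we read $F(\alpha/2;\, l,\, n-l-1)$ as the upper $\alpha/2$ quantile of the F distribution, which is our interpretation of the notation:

```python
import numpy as np
from scipy import stats

def hotelling_t2(z, S_z):
    """Hotelling statistic of Eq. (13) for a single l-variate score vector z."""
    z = np.atleast_1d(np.asarray(z, dtype=float))
    return float(z @ np.linalg.inv(S_z) @ z)

def ucl(n, l, alpha=0.01):
    """Normal-condition threshold of Eq. (14), written via the F distribution.
    F(alpha/2; l, n-l-1) is taken here as the upper alpha/2 quantile."""
    c = (l / (n - l - 1)) * stats.f.ppf(1.0 - alpha / 2.0, l, n - l - 1)
    return (n - 1) ** 2 * c / (n * (1.0 + c))
```

Since $c/(1+c) < 1$, the threshold is always below $(n-1)^2/n$, consistent with the beta-distribution bound on $T^2$ for individual observations.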
In the conventional DPCA-based modeling approach, the implicit model consists of the mean and standard deviation vectors (10), (11); the loading vectors in $V_T$; the variance of the principal components, given by $S_z$; and the nominal threshold $UCL$. However, according to the present proposal only $\hat{\sigma}_{\vec{X}}$, $V_T$, $S_z$ and $UCL$ are kept, since $\hat{\mu}_{\vec{X}}$ will be estimated recursively. In addition to the statistical modeling, the identification of the inputs-output relations (5) is carried out.

3.2 Fault detection

DPCA detects a deviation of the vector of actual observations $\vec{x}_a$ from the nominal reference in terms of its mean and its standard deviation. However, it is important to note that the modeling process is based on the data set $\vec{X}$, which was obtained at a particular operating point of the system, so any change in the nominal values of the signals is interpreted by DPCA as a fault, even when the process is healthy; this misinterpretation is due to the time-variant behavior of the components in (10). For linear systems, a change in the operating point means a new assignment of the input variables with consequent variations in the output variables, that is, changes in the mean values of the input and output variables but no changes in their correlation structure. Faults in the system, on the other hand, produce changes in the mean values and in the correlation structure between variables. Thus, here it is proposed to estimate the statistical set (10) on-line during the detection stage, using the nominal linear inputs-output relations, in order to adapt the standardization procedure. So, according to the proposed extension of the DPCA-based fault detection algorithm, the procedure to evaluate and classify an actual observation $\vec{x}_a \in \Re^{1 \times m}$ is summarized as follows. Let the actual observation vector, with input and output variables, be expressed with $w$ time lags:

$$\vec{x}_a = \left[\, \vec{u}_{a_1} \cdots \vec{u}_{a_r} \;\; \vec{y}_{a_1} \cdots \vec{y}_{a_s} \,\right] \qquad (15)$$
1. Estimate, through (6), the means of the actual input data, $\hat{\mu}_{\vec{u}_a}$, and through (7) the nominal means of the output variables, $\hat{\mu}_{\vec{y}}$; next construct the vector

$$\hat{\mu}_{\vec{x}_a} = \left[\, \hat{\mu}_{\vec{u}_a} \;\; \hat{\mu}_{\vec{y}} \,\right]_{(1 \times m)} \qquad (16)$$

2. Standardize the $m$ terms in (15) using the estimated means given in (16) and the historical standard deviations (11), that is

$$\tilde{x}_a(j) = \frac{\vec{x}_a(j) - \hat{\mu}_{\vec{x}_a}(j)}{\hat{\sigma}_{\vec{X}}(j)}$$

for $j = 1, \dots, m$.

3. Transform the vector $\tilde{x}_a$ to the principal components subspace through $V_T$: $z_a = \tilde{x}_a V_T$.

4. Map $z_a$ into the behaviour symptom $T^2_{z_a}$ through $T^2_{z_a} = z_a S_z^{-1} z_a^T$.

5. If the resulting value deviates from the normal condition threshold $UCL$, then a fault is present in the system.

The key of the proposed methodology is the continuous estimation of the nominal means (16), using the nominal linear inputs-output relations (5), in order to carry out an appropriate standardization. In the following section the proposed fault detection algorithm is applied to detect faults in a three interconnected tanks system, considering simple relations for the mean estimation.

4 Three tanks system

The tanks system is composed of three cylindrical tanks, interconnected at the bottom by pipes, with a valve $V_1$ in the link between tanks 2 and 3 and a valve $V_2$ in the link between tank 2 and the outside, whose apertures can be manipulated in order to emulate faults (e.g. pipe blockage); see Fig. 2. The tank dimensions are $H_T = 0.63\,$m and $A_T = 0.01539\,$m$^2$. The system is fed by two inputs, $q_1$ to tank 1 and $q_2$ to tank 2, which are measured, as are the output variables $h_1$, $h_2$ and $h_3$, which correspond to the tank levels. The mathematical model is

$$A_T \frac{dh_1}{dt} = q_1 + q_{31} - q_{10}, \qquad A_T \frac{dh_3}{dt} = q_{23} - q_{31}, \qquad A_T \frac{dh_2}{dt} = q_2 - q_{23} - q_{20} \qquad (17)$$

with

$$q_{10} = k_1 \sqrt{h_1}, \qquad q_{31} = k_{31}\, \rho(h_3 - h_1), \qquad q_{23} = k_{23}\, \rho(h_2 - h_3), \qquad q_{20} = k_2 \sqrt{h_2}$$

where $\rho(x) \triangleq \mathrm{sgn}(x)\sqrt{|x|}$.
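The tank model (17) is easy to simulate; the sketch below (our own code, using the coefficient values reported with the simulation results; the forward-Euler solver is our simplification, not necessarily the authors' choice) integrates the levels from an arbitrary initial condition:

```python
import numpy as np

# Plant constants from the paper's three-tank benchmark
AT = 0.01539                      # tank cross-section (m^2)
K1, K2 = 1.816e-4, 9.804e-5       # outflow coefficients
K31, K23 = 1.005e-4, 7.804e-5     # interconnection coefficients

def rho(x):
    """rho(x) = sgn(x) * sqrt(|x|), as defined below Eq. (17)."""
    return np.sign(x) * np.sqrt(np.abs(x))

def tank_rates(h, q1, q2):
    """Right-hand side of the three-tank model, Eq. (17); h = (h1, h2, h3)."""
    h1, h2, h3 = h
    q10 = K1 * np.sqrt(max(h1, 0.0))
    q20 = K2 * np.sqrt(max(h2, 0.0))
    q31 = K31 * rho(h3 - h1)
    q23 = K23 * rho(h2 - h3)
    return np.array([q1 + q31 - q10,      # A_T dh1/dt
                     q2 - q23 - q20,      # A_T dh2/dt
                     q23 - q31]) / AT     # A_T dh3/dt

def simulate(h0, q1, q2, dt=1.0, steps=20000):
    """Forward-Euler integration of the tank levels (a coarse sketch)."""
    h = np.array(h0, dtype=float)
    for _ in range(steps):
        h = h + dt * tank_rates(h, q1, q2)
    return h
```

With the nominal inflows $q_1^0 = 4.75 \times 10^{-5}$ and $q_2^0 = 7.35 \times 10^{-5}$ m$^3$/s, the levels settle close to the operating point $(h_1, h_2, h_3) \approx (0.147, 0.276, 0.195)$ m reported in the next section.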
Figure 2: Three tanks system.

For the experiments the system was simulated at the following operating point: $q_1^0 = 4.75 \times 10^{-5}\,$m$^3$/s ($\sigma^2_{q_1} = 1.07 \times 10^{-10}$); $q_2^0 = 7.35 \times 10^{-5}\,$m$^3$/s ($\sigma^2_{q_2} = 1.05 \times 10^{-10}$); $h_1^0 = 0.147\,$m ($\sigma^2_{h_1} = 1.96 \times 10^{-4}$); $h_2^0 = 0.276\,$m ($\sigma^2_{h_2} = 4.81 \times 10^{-4}$); $h_3^0 = 0.195\,$m ($\sigma^2_{h_3} = 2.65 \times 10^{-4}$); $k_1 = 1.816 \times 10^{-4}$, $k_{31} = 1.005 \times 10^{-4}$, $k_2 = 9.804 \times 10^{-5}$ and $k_{23} = 7.804 \times 10^{-5}$. Taking a set of 400 nominal observations measured every 10 s, a DPCA-based principal components space of dimensions $301 \times 68$ was obtained, so for $\alpha = 0.01$ the resulting threshold is $UCL = 95.886$. On the other hand, the identified inputs-output relations were $h_1 = f(q_1, q_2, q_1)$, $h_2 = f(q_1, q_2, q_2)$ and $h_3 = f(q_1, q_2, q_3)$, with time lags of order $q_1 = 61$, $q_2 = 61$ and $q_3 = 60$, respectively.

4.1 Detection results

Using a forgetting factor $\beta = 0.95$ for the recursive input mean estimation (6), the fault detection algorithm is evaluated considering the following cases:

1. Fault condition: blockage of the pipe which links tanks 2 and 3, with the fault occurring at 8000 s.

2. Normal operation of the system during 15000 s, with changes in the mean of $u_1$ of $+20\%$ in $3000\,$s $< t < 6000\,$s and $-20\%$ in $9000\,$s $< t < 12000\,$s, and in $u_2$ of $+20\%$ in $4500\,$s $< t < 7500\,$s and $-20\%$ in $10500\,$s $< t < 13500\,$s.

3. Change in the mean of $u_1$ of $+20\%$ from 4000 s, plus a fault condition (blockage of the pipe between tanks 2 and 3) at 8000 s.

The first test compares the performance of the conventional DPCA-based fault detection and of the detection based on the proposed algorithm, under fault conditions. The monitoring results are given in Fig. 3, which shows that both algorithms are able to detect the fault. The second test evaluates the performance of both algorithms in the face of changes in the operating point; the monitoring results are given in Fig.
4, where it is clearly observed that the traditional DPCA-based fault detection (MON1) interprets the normal changes in the operating point as faults, whereas the proposed algorithm (MON2) is robust to these changes, which reduces the false alarm rate. Finally, the third test shows the capability of the proposed fault detection algorithm to distinguish between normal variations in the operating point and the presence of faults; see Fig. 5.

5 Conclusions

Here a modification of the DPCA algorithm for fault detection has been proposed, in which an appropriate standardization with respect to on-line estimated statistical parameters is carried out, provided that simple healthy relations between variables can be obtained. This idea makes it possible to deal with non-stationary signals and to reduce significantly the rate of false alarms. The effectiveness of the proposed fault detection algorithm in distinguishing between normal changes in the signals and variations due to the presence of faults was shown through a series of tests.

Figure 3: Fault condition: UCL, threshold of the normal condition; MON1, DPCA-based monitoring; MON2, DPCA with adaptation.

Figure 4: Normal condition: UCL, threshold of the normal condition; MON1, DPCA-based monitoring; MON2, DPCA with adaptation.

Figure 5: Normal and fault condition: UCL, threshold of the normal condition; MON1, DPCA-based monitoring; MON2, DPCA with adaptation.

References

[1] R. J. Patton, P. M. Frank, R. N. Clark, Issues of Fault Diagnosis for Dynamic Systems, Springer-Verlag, London, 1989.

[2] L. H. Chiang, E. L. Russell, R. D.
Braatz, Fault Detection and Diagnosis in Industrial Systems, Advanced Textbooks in Control and Signal Processing, Springer-Verlag, London, 2001.

[3] J. E. Jackson, A User's Guide to Principal Components, John Wiley, New York, 1991.

[4] J. V. Kresta, J. F. MacGregor, T. E. Marlin, Multivariate statistical monitoring of process operating performance, The Canadian Journal of Chemical Engineering, Vol. 69, pp. 35-47, February 1991.

[5] A. Raich, A. Çinar, Statistical process monitoring and disturbance diagnosis in multivariable continuous processes, AIChE Journal, Vol. 42, No. 4, pp. 995-1009, April 1996.

[6] N. D. Tracy, J. C. Young, R. L. Mason, Multivariate control charts for individual observations, Journal of Quality Technology, Vol. 24, No. 2, pp. 88-95, April 1992.

[7] A. Norvilas, A. Negiz, J. DeCicco, A. Çinar, Intelligent process monitoring by interfacing knowledge-based systems and multivariate statistical monitoring, Journal of Process Control, Vol. 10, No. 4, pp. 341-350, August 2000.

[8] D. C. Montgomery, Introduction to Statistical Quality Control, John Wiley, New York, 2001.

[9] W. Ku, R. H. Storer, Ch. Georgakis, Disturbance detection and isolation by dynamic principal component analysis, Chemometrics and Intelligent Laboratory Systems, Vol. 30, No. 1, pp. 179-196, November 1995.

[10] N. B. Gallagher, B. M. Wise, S. W. Butler, D. D. White, G. G. Barna, Development and benchmarking of multivariate statistical process control tools for a semiconductor etch process: improving robustness through model updating, ADCHEM'97, Banff, Canada, pp. 78-83, 9-11 June 1997.

[11] W. Li, H. H. Yue, S. Valle-Cervantes, S. J. Qin, Recursive PCA for adaptive process monitoring, Journal of Process Control, Vol. 10, No. 5, pp. 471-486, October 2000.

[12] T. Kailath, A. H. Sayed, B. Hassibi, Linear Estimation, Prentice Hall, New Jersey, 2000.

[13] G. E. P. Box, G. M. Jenkins, G. C. Reinsel, Time Series Analysis: Forecasting and Control, Prentice Hall, New Jersey, 1994.
Jesús Mina, Cristina Verde
Instituto de Ingeniería-UNAM, Automatización
Coyoacán, DF, 04510, México
Fax: (52)-55-56233600 ext. 8052
E-mail: jminaa@iingen.unam.mx, verde@servidor.unam.mx

Received: January 15, 2007

Jesús Mina received the BS degree in electric engineering from Tuxtla Gutiérrez Technological Institute, México, in 1999, and the MS degree in electronic engineering from the Research and Technological Development National Center, México, in 2002; he is currently a student in the PhD program in electrical engineering of the National University of México. He was a professor at the Zacatepec Technological Institute from 2002 to 2003. He has carried out research in nonlinear control for active power filters and is currently interested in fault diagnosis based on multivariate statistical analysis. Jesús Mina is a member of the International Society of Automation.

Cristina Verde received the BS degree in electronic and communication engineering from the National Polytechnic, México, in 1973, the MS degree in electrical engineering from the National Polytechnic, México, in 1974, and the PhD degree in electrical engineering from the Duisburg University, Germany, in 1983. In 1984 she joined the National University of México (UNAM) and became head of the Automatic Control Department in 1988. She has been the coordinator of the postgraduate school in computer science and engineering of the National University. She has used control theory to improve the distribution, regulation and quality of water in México, and her main research interests include automatic fault detection and diagnosis for dynamic systems and the integrity of industrial processes. She received the Sor Juana Inés de la Cruz prize, given by the National University to outstanding women in the engineering field, in 2005.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 2, pp.
135-148

Feedback Gain Design Method for the Full-Order Flux Observer in Sensorless Control of Induction Motor

Abderrahmane Bouhenna, Abdellah Mansouri, Mohammed Chenafa, Abdelkader Belaidi

Abstract: This paper deals with a feedback gain design method for the full-order flux observer with adaptive speed loop, which makes it possible to reduce the unstable operation region of this observer to a line in the torque-speed plane. The stability in regenerating mode is studied using a necessary condition of stability based on the determinant of a matrix and a linearized model. Simulation results, where the proposed observer is compared with an existing solution (in which the unstable region is not totally removed), are presented to validate the proposed observer design.

Keywords: induction motor, full-order flux observer, sensorless control, stability analysis, adaptive speed estimator, regenerating mode

1 Introduction

The speed-sensorless control of induction motor drives has developed significantly during the last few years. Speed adaptive full-order observers, introduced by [8], [15], are promising flux estimators for induction motor drives. The speed adaptive observer consists of a state variable observer augmented with a speed adaptation loop. The observer gain and the speed adaptation law determine the properties of the observer. The speed adaptation law is based on the component of the current estimation error along the estimated rotor flux. The adaptation law was originally derived using Lyapunov stability theory [8]. However, the stability of the adaptation law is not guaranteed, and stability problems exist in the regenerating mode. The derivation in [8] neglects a term including the actual rotor flux (which is not measurable), so the positive-realness condition is not satisfied, as shown in [5]. Some limits of operation were quickly highlighted [9], [13]. In particular, a well known instability region was described in regenerating mode.
Thus, drive stability cannot be guaranteed when this type of observer is associated with a field oriented control. Much work has been devoted to reducing this region of instability, which is due to inadequate observer design [1, 2, 5, 14]. In this paper, we describe the design of an adaptation law that minimizes the instability region of an adaptive speed estimator. The paper is organized as follows. The induction motor model and the speed adaptive flux observer are first defined in Sections 2 and 3, respectively. We introduce the observer gain design in Section 4, leading to a reduced instability region limited to a line. Finally, simulation results are presented and discussed in Section 5, where the proposed observer is compared with an existing solution [5, 13].

2 Induction motor model

The induction motor is described by the following state equations in the synchronously rotating reference frame, with complex notation:

$$\frac{d}{dt} x = A(\omega, \omega_s)\, x + B u_s \qquad (1)$$

$$i_s = C x \qquad (2)$$

where $x = \left[\, \psi_r \;\; i_s \,\right]^T$ and

$$A = \begin{bmatrix} -\left( \frac{1}{t_r} + j\omega_{sl} \right) & \frac{l_m}{t_r} \\ \frac{l_m}{b}\left( \frac{1}{t_r} - j\omega \right) & -(a + j\omega_s) \end{bmatrix}, \qquad B = \begin{bmatrix} 0 \\ \frac{1}{\sigma l_s} \end{bmatrix}, \qquad C = \left[\, 0_{(2\times 2)} \;\; I \,\right], \qquad I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} \qquad (3)$$

Copyright © 2006-2008 by CCC Publications

The mechanical equation is

$$\frac{d}{dt}\omega = \frac{p^2 l_m}{J l_r} \Im(i_s \psi_r^*) - \frac{p\, t_l}{J} \qquad (4)$$

where $^*$ denotes the conjugate, $j$ the complex unit and $\Im$ the imaginary part. Notation: $\psi_r$: rotor flux; $i_s$: stator current; $u_s$: stator voltage; $\omega_s$: stator angular frequency; $\omega$: motor angular speed; $\omega_{sl} = \omega_s - \omega$: slip angular frequency; $r_s$, $r_r$: stator and rotor resistance; $l_s$, $l_r$: stator and rotor self-inductance; $l_m$: mutual inductance; $t_l$: load torque; $J$: rotor inertia; $p$: number of pole pairs; $t_r = l_r/r_r$: rotor time constant; $a = (l_r^2 r_s + l_m^2 r_r)/(\sigma l_s l_r^2)$; $b = \sigma l_s l_r$; $\sigma = 1 - l_m^2/(l_s l_r)$: leakage coefficient.
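The complex-notation state matrix $A(\omega, \omega_s)$ of Eqs. (1)-(3) is easy to evaluate numerically; the sketch below (our own illustration, using the machine parameters listed in the appendix) builds it and lets one inspect its eigenvalues at a given operating point:

```python
import numpy as np

# Machine parameters from the appendix (Section 6.1)
Rs, Rr = 9.65, 4.3                    # stator / rotor resistance (ohm)
Ls, Lr, Lm = 0.472, 0.4721, 0.4475    # inductances (H)

sigma = 1.0 - Lm**2 / (Ls * Lr)       # leakage coefficient
b = sigma * Ls * Lr
a = (Lr**2 * Rs + Lm**2 * Rr) / (sigma * Ls * Lr**2)
Tr = Lr / Rr                          # rotor time constant

def A_matrix(w, ws):
    """Complex-notation state matrix A(w, ws) of Eqs. (1)-(3);
    the slip angular frequency is wsl = ws - w."""
    wsl = ws - w
    return np.array([[-(1.0 / Tr + 1j * wsl),          Lm / Tr],
                     [(Lm / b) * (1.0 / Tr - 1j * w),  -(a + 1j * ws)]])
```

For instance, at DC excitation ($\omega = \omega_s = 0$) the matrix is real and its two eigenvalues have negative real parts, reflecting the open-loop stability of the electrical subsystem at a constant speed.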
3 Adaptive observer

The conventional full-order observer, which estimates the stator current and the rotor flux together [10, 11], is written as the state equation

$$\frac{d}{dt}\hat{x} = \hat{A}(\hat{\omega}, \hat{\omega}_s)\, \hat{x} + B u_s + G(i_s - \hat{i}_s) \qquad (5)$$

$$\hat{i}_s = C\hat{x} \qquad (6)$$

where $\hat{\ }$ denotes estimated values and $G = \left[\, g_1 \;\; g_2 \,\right]^T$ is the observer gain matrix. We assume that all machine parameters are perfectly known except the motor speed. Using the assumption of constant rotor angular speed, $\dot{\omega} = 0$ (i.e. the speed variations are slow with respect to the electrical modes) [8], [5], the speed adaptation law is [8]:

$$\frac{d}{dt}\hat{\omega} = \lambda \frac{l_m}{b} \left( e_{i_d} \hat{\psi}_{r_q} - e_{i_q} \hat{\psi}_{r_d} \right) \qquad (7)$$

where $\lambda$ is a positive constant that is tuned in (7) to improve the observer dynamics. In practice, a proportional-integral action is used in order to improve the dynamic behavior of the estimator:

$$\frac{d}{dt}\hat{\omega} = k_p \frac{d}{dt}\left( e_{i_d} \hat{\psi}_{r_q} - e_{i_q} \hat{\psi}_{r_d} \right) + k_i \left( e_{i_d} \hat{\psi}_{r_q} - e_{i_q} \hat{\psi}_{r_d} \right) \qquad (8)$$

where $e_{i_d} = i_{s_d} - \hat{i}_{s_d}$ and $e_{i_q} = i_{s_q} - \hat{i}_{s_q}$; $(i_{s_d}, i_{s_q})$ are the $(d,q)$ components of the stator current and $(\psi_{r_d}, \psi_{r_q})$ are the $(d,q)$ components of the rotor flux. The speed adaptive observer scheme with the speed adaptation mechanism is presented in Fig. 1.

4 Observer gain design

4.1 Linearized model

The nonlinear and complicated dynamics of the speed adaptive observer can be studied via small-signal linearization. It is useful to proceed with a local analysis based on the principle of stability in the first approximation [12, 7]. We choose the particular form $g_1 = g_1 I_{2\times 2}$, where $I_{2\times 2}$ is the identity matrix, and $g_2 = 0_{2\times 2}$. The complete adaptive observer may be written as equation (10). Note that, according to the assumption $\dot{\omega} = 0$, the motor model (1) may be written as

$$\begin{cases} \frac{d}{dt}\psi_r = -\left( \frac{1}{t_r} + j\omega_{sl} \right)\psi_r + \frac{l_m}{t_r} i_s \\ \frac{d}{dt}i_s = \frac{l_m}{b}\left( \frac{1}{t_r} - j\omega \right)\psi_r - (a + j\omega_s) i_s + \frac{1}{\sigma l_s} u_s \\ \frac{d}{dt}\omega = 0 \end{cases} \qquad (9)$$
Figure 1: Speed adaptive observer.

$$\begin{cases} \frac{d}{dt}\hat{\psi}_r = -\left( \frac{1}{t_r} + j\hat{\omega}_{sl} \right)\hat{\psi}_r + \frac{l_m}{t_r}\hat{i}_s + g_1 e_i \\ \frac{d}{dt}\hat{i}_s = \frac{l_m}{b}\left( \frac{1}{t_r} - j\hat{\omega} \right)\hat{\psi}_r - (a + j\hat{\omega}_s)\hat{i}_s + \frac{1}{\sigma l_s} u_s \\ \frac{d}{dt}\hat{\omega} = k_p \frac{d}{dt}\left( e_{i_d}\hat{\psi}_{r_q} - e_{i_q}\hat{\psi}_{r_d} \right) + k_i\left( e_{i_d}\hat{\psi}_{r_q} - e_{i_q}\hat{\psi}_{r_d} \right) \end{cases} \qquad (10)$$

We investigate the stability of the observer by linearizing the two systems (10) and (9) around an equilibrium operating point. Define the new state vectors $x = x_o + \delta x$ with $x_o = \left[\, \psi_{r_o} \; i_{s_o} \; \omega_o \,\right]^T$, $\delta x = \left[\, \delta\psi_r \; \delta i_s \; \delta\omega \,\right]^T$, and $\hat{x} = \hat{x}_o + \delta\hat{x}$ with $\hat{x}_o = \left[\, \hat{\psi}_{r_o} \; \hat{i}_{s_o} \; \hat{\omega}_o \,\right]^T$, $\delta\hat{x} = \left[\, \delta\hat{\psi}_r \; \delta\hat{i}_s \; \delta\hat{\omega} \,\right]^T$. The reference frame is synchronized with the estimated rotor flux ($\hat{\psi}_{r_q o} = 0$), so its two components are $\hat{\psi}_{r_d} = \hat{\psi}_o + \delta\hat{\psi}_{r_d}$ and $\hat{\psi}_{r_q} = \delta\hat{\psi}_{r_q}$. In these two systems the stator frequencies are regarded as identical: $\omega_s = \hat{\omega}_s$ [5]. Keeping only the dynamic parts, the two systems (9), (10) become, after linearization:

$$\begin{cases} \frac{d}{dt}\delta\psi_r = -\left( \frac{1}{t_r} + j\omega_{sl_o} \right)\delta\psi_r + \frac{l_m}{t_r}\delta i_s - j\psi_o \delta\omega_{sl} \\ \frac{d}{dt}\delta i_s = \frac{l_m}{b}\left( \frac{1}{t_r} - j\omega_o \right)\delta\psi_r - (a + j\omega_{s_o})\delta i_s + \frac{1}{\sigma l_s}\delta u_s - j\frac{l_m}{b}\psi_o \delta\omega - j i_{s_o}\delta\omega_s \\ \frac{d}{dt}\delta\omega = 0 \end{cases} \qquad (11)$$

$$\begin{cases} \frac{d}{dt}\delta\hat{\psi}_r = -\left( \frac{1}{t_r} + j\hat{\omega}_{sl_o} \right)\delta\hat{\psi}_r + \frac{l_m}{t_r}\delta\hat{i}_s - j\hat{\psi}_o \delta\hat{\omega}_{sl} + g_1 \delta e_i \\ \frac{d}{dt}\delta\hat{i}_s = \frac{l_m}{b}\left( \frac{1}{t_r} - j\hat{\omega}_o \right)\delta\hat{\psi}_r - (a + j\hat{\omega}_{s_o})\delta\hat{i}_s + \frac{1}{\sigma l_s}\delta u_s - j\frac{l_m}{b}\hat{\psi}_o \delta\hat{\omega} - j\hat{i}_{s_o}\delta\hat{\omega}_s \\ \frac{d}{dt}\delta\hat{\omega} = -k_p\left( -\frac{l_m}{b}\omega_o\hat{\psi}_o \delta\hat{\psi}_{r_d} + \frac{l_m}{b t_r}\hat{\psi}_o \delta\hat{\psi}_{r_q} - \omega_{s_o}\hat{\psi}_o \delta\hat{i}_{s_d} - a\hat{\psi}_o \delta\hat{i}_{s_q} - \frac{l_m}{b}\hat{\psi}_o \delta\hat{\omega}_s \right) - k_i\left( -e_{i_d o}\delta\hat{\psi}_{r_q} + e_{i_q o}\delta\hat{\psi}_{r_d} + \hat{\psi}_o \delta e_{i_q} \right) \end{cases} \qquad (12)$$

Defining $\delta e = \left[\, \delta e_\psi \; \delta e_i \; \delta e_\omega \,\right]^T$, the system describing the estimation error is:

$$\begin{cases} \frac{d}{dt}\delta e_\psi = -\left( \frac{1}{t_r} + j\omega_{sl_o} \right)\delta e_\psi + \left( \frac{l_m}{t_r} - g_1 \right)\delta e_i - j e_{\psi_o}\delta\omega_{sl} + j e_{\omega_o}\delta\hat{\psi}_r + j\hat{\psi}_o \delta e_\omega \\ \frac{d}{dt}\delta e_i = \frac{l_m}{b}\left( \frac{1}{t_r} - j\omega_o \right)\delta e_\psi - (a + j\omega_{s_o})\delta e_i - j\frac{l_m}{b} e_{\psi_o}\delta\omega - j\frac{l_m}{b} e_{\omega_o}\delta\hat{\psi}_r - j e_{i_o}\delta\omega_s - j\frac{l_m}{b}\hat{\psi}_o \delta e_\omega \\ \frac{d}{dt}\delta e_\omega = k_p\left( -\frac{l_m}{b}\omega_o\hat{\psi}_o \delta e_{\psi_d} + \frac{l_m}{b t_r}\hat{\psi}_o \delta e_{\psi_q} - \omega_{s_o}\hat{\psi}_o \delta e_{i_d} - a\hat{\psi}_o \delta e_{i_q} - \frac{l_m}{b}\hat{\psi}_o \delta e_\omega \right) + k_i\left( -e_{i_d o}\delta\hat{\psi}_{r_q} + e_{i_q o}\delta\hat{\psi}_{r_d} + \hat{\psi}_o \delta e_{i_q} \right) \end{cases} \qquad (13)$$
Separating each state into $d$ and $q$ components, we obtain the corresponding state matrix $\hat{A}_1$:

$$\hat{A}_1 = \begin{bmatrix} -\frac{1}{t_r} & \omega_{sl_o} & \frac{l_m}{t_r} - g_1 & 0 & 0 \\ -\omega_{sl_o} & -\frac{1}{t_r} & 0 & \frac{l_m}{t_r} - g_1 & \hat{\psi}_o \\ \frac{l_m}{b t_r} & \frac{l_m}{b}\omega_o & -a & \omega_{s_o} & 0 \\ -\frac{l_m}{b}\omega_o & \frac{l_m}{b t_r} & -\omega_{s_o} & -a & -\frac{l_m}{b}\hat{\psi}_o \\ -\frac{l_m}{b} k_p \omega_o \hat{\psi}_o & \frac{l_m}{b t_r} k_p \hat{\psi}_o & -k_p \omega_{s_o} \hat{\psi}_o & (k_i - a k_p)\hat{\psi}_o & -\frac{l_m}{b} k_p \hat{\psi}_o \end{bmatrix} \qquad (14)$$

Note that the dynamic matrix $\hat{A}_1$ depends on the operating condition. In order to obtain analytic conditions for local stability, using the necessary condition for stability based on the determinant of (14) [4], a relevant result can be obtained, as reported in the next section.

4.2 Stability criterion

We use the following property:

$$\det(\hat{A}_1) = \prod_{i=1}^{5} \lambda_i \qquad (15)$$

where the $\lambda_i$ are the eigenvalues of the matrix $\hat{A}_1$. The determinant of $\hat{A}_1$ is

$$\det(\hat{A}_1) = -\frac{l_m \hat{\psi}_o^2 k_i \omega_{s_o} \left( (\omega_{s_o} - \omega_o)\, a b t_r + l_m^2 \omega_o - l_m \omega_o g_1 t_r + \omega_{s_o} b \right)}{b^2 t_r} \qquad (16)$$

The condition $\det(\hat{A}_1) = 0$ leads to

$$\omega_{s_o} = 0 \qquad (17a)$$

$$\omega_{s_o} = \omega_o\, \frac{g_1 l_m + r_s l_r}{r_r l_s + r_s l_r} \qquad (17b)$$

These conditions of stability may be expressed in the torque-speed plane. Let us consider the mechanical equation

$$\frac{d}{dt}\omega = \frac{p^2 l_m}{J l_r}\Im(i_s \psi_r^*) - \frac{p\, t_l}{J} \qquad (18)$$

Under RFOC conditions and in steady state ($\hat{\psi}_{r_q o} = \psi_{r_q o} = 0$) we obtain

$$0 = p\,\frac{l_m}{l_r}\,\hat{\psi}_o\, i_{s_q o} - t_{l_o} \qquad (19)$$

hence

$$i_{s_q o} = \frac{l_r}{p\, l_m \hat{\psi}_o}\, t_{l_o} \qquad (20)$$

From system (1), under the same conditions, we find

$$\omega_{sl_o} = \frac{l_m}{t_r \hat{\psi}_o}\, i_{s_q o} \qquad (21)$$

Finally, using $\omega_{s_o} = \omega_{sl_o} + \omega_o$, equations (17a) and (17b) become

$$t_{l_o} = -\frac{p\hat{\psi}_o^2}{r_r}\,\omega_o \qquad (22a)$$

$$t_{l_o} = -\frac{p\hat{\psi}_o^2}{r_r}\,\frac{\left( 1 - \frac{g_1 l_m}{r_r l_s} \right)\omega_o}{1 + \frac{t_r}{t_s}} \qquad (22b)$$

with $t_s = l_s/r_s$. The above relations describe two lines, defining the two well known instability regions in regenerating mode. A sufficient condition for instability is then

$$\det(\hat{A}_1) > 0 \qquad (23)$$

Condition (23) defines a set of which the instability region is a subset. In order to complete the study of local stability, we plot, for each eigenvalue, the locus in the torque-speed plane where the conditions $\Re(\lambda_i) > 0$, for $i = 1, \dots,$
5, are verified. On the one hand, if we choose a zero observer gain, as in [9],

$$g_1 = 0 \qquad (24)$$

we obtain the instability region limited by the lines $D_1$ and $D_2$ (Fig. 2), where $\Re(\lambda_i) > 0$, $i = 1, \dots, 5$, denotes a positive real part of the eigenvalue $\lambda_i$ of the state matrix $\hat{A}_1$; the eigenvalues correspond respectively to the state variables $\delta e_{\psi_{r_d}}$, $\delta e_{\psi_{r_q}}$, $\delta e_{i_d}$, $\delta e_{i_q}$ and $\delta e_\omega$. In this case the line $D_2$ is given by

$$t_{l_o} = -\frac{p\hat{\psi}_o^2}{r_r}\,\frac{\omega_o}{1 + \frac{t_r}{t_s}} \qquad (25)$$

On the other hand, in order to reduce (but not totally remove) the unstable region, a real-valued observer gain was considered in [13], which corresponds to the region limited by the lines $D_1$ and $D_3$ (Fig. 3). The selected value of the parameter $g_1$ is

$$g_1 = -0.25\, r_s \qquad (26)$$

It should be noted that the corresponding curves are similar to those for a zero observer gain, except that in the latter case the unstable region is larger. The line $D_3$ is given by

$$t_{l_o} = -\frac{p\hat{\psi}_o^2}{r_r}\,\frac{\left( 1 + \frac{0.25\, r_s l_m}{r_r l_s} \right)\omega_o}{1 + \frac{t_r}{t_s}} \qquad (27)$$

The principle of the instability reduction proposed here consists in calculating the feedback gain so that the unstable region is limited to the unobservability line $D_1$. We note that, whatever the structure of the matrix $G$, $D_1$ is always defined by $\omega_{s_o} = 0$.
Figure 2: Torque-speed plane, $g_1 = 0$, $\Re(\lambda_i) > 0$, $i = 1, \dots, 5$.

Figure 3: Torque-speed plane, $g_1 = -0.25\, r_s$, $\Re(\lambda_i) > 0$, $i = 1, \dots, 5$.

From equation (16), we can write the condition $\omega_{s_o} = 0$ as equation (28):

$$(\omega_{s_o} - \omega_o)\, a b t_r + l_m^2 \omega_o - l_m \omega_o g_1 t_r + \omega_{s_o} b = 0 \qquad (28)$$

which can be achieved by choosing the observer gain

$$g_1 = -\frac{l_r r_s}{l_m} \qquad (29)$$

The straight line $D_1$ corresponds to zero synchronous speed, $\omega_s = 0$. It is known in the literature as the unobservability line (normally referred to as DC excitation) [6, 3] and seems to be a generic problem for sensorless control of induction motors.

5 Simulation results

In order to validate the proposed design, the regenerating-mode low speed operation of the speed adaptive observer was investigated by means of simulations. A rotor flux oriented control (RFOC) is simulated using the Matlab/Simulink software. The block diagram of the control system is shown in Fig. 4. The flux reference is fixed to the nominal value $\psi^{ref}_o$, where ref denotes the reference value.

Figure 4: Block diagram of the sensorless RFOC induction motor simulator.

The proposed observer is compared with an existing solution [13].
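That the gain of Eq. (29) collapses the second critical line onto $\omega_{s_o} = 0$ can be checked numerically; the following snippet (our own sketch, using the appendix machine parameters) evaluates the second root of $\det(\hat{A}_1) = 0$ from Eq. (17b) for the three gains discussed above:

```python
# Numerical check that the gain of Eq. (29), g1 = -Lr*Rs/Lm, moves the
# second critical frequency of Eq. (17b) onto the unobservability line
# ws_o = 0. Machine parameters are taken from the appendix.
Rs, Rr = 9.65, 4.3
Ls, Lr, Lm = 0.472, 0.4721, 0.4475

def ws_critical(g1, w0):
    """Second root of det(A1) = 0, Eq. (17b):
    ws_o = w0 * (g1*Lm + Rs*Lr) / (Rr*Ls + Rs*Lr)."""
    return w0 * (g1 * Lm + Rs * Lr) / (Rr * Ls + Rs * Lr)

g1_proposed = -Lr * Rs / Lm           # proposed gain, Eq. (29)
for w0 in (-150.0, -25.0, 50.0):
    # zero gain (24), the gain of [13] (26), and the proposed gain (29)
    print(w0, ws_critical(0.0, w0), ws_critical(-0.25 * Rs, w0),
          ws_critical(g1_proposed, w0))
```

With $g_1 = -l_r r_s / l_m$ the numerator $g_1 l_m + r_s l_r$ vanishes identically, so the second critical frequency is zero for every rotor speed, i.e. $D_3$ merges with $D_1$.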
In order to validate the proposed design, we studied a conventional test used by industrial drive designers: a very slow and progressive load torque increase under constant speed. Fig. 5 depicts the results obtained in regenerating mode using the observer gain $g_1 = -0.25\, r_s$ [13], [5]. The speed reference was set to $-25$ rad/s (dashed line) and a rated-load torque ramp was applied at $t = 0$. After the load is applied progressively, the drive should operate in the regenerating mode. However, the actual angular speed and the actual flux of the motor collapse, and the system becomes unstable. Fig. 6 presents the results obtained using the proposed observer design; the system behaves stably. In Fig. 7, the observer gain $g_1 = -0.25\, r_s$ was used and the real speed diverges. The first subplot shows the reference (dashed line) and the actual angular speed. The second subplot shows the rated-load torque ramp. The third subplot presents the actual flux components $(\psi_{r_\alpha}, \psi_{r_\beta})$ in the stator reference frame. The fourth subplot shows the control voltages. The fifth and sixth subplots present the current and the current norm, respectively. We note that when the load torque increases, the control voltage, the current and the current norm increase too. In Fig. 8, the proposed observer design was used and the system is stable: the real rotor angular speed converges well towards the reference value in response to the same rated-load torque. Note the behavior of the actual flux at $t \approx 3.75$ s, when the real angular rotor speed crosses the line ($D_1 = D_3$); the system becomes unobservable at this instant.

Figure 5: A rated-load torque ramp is applied with the observer gain $g_1 = -0.25\, r_s$. The first subplot shows the region $(\omega_o, t_{l_o})$ with the two lines $D_1$ and $D_3$; the second subplot shows the actual angular speed.
Figure 6: The proposed observer design was used. The first subplot shows the region $(\omega_o, t_{l_o})$ with the line $D_1 = D_3$; the second subplot shows the actual angular speed.

Figure 7: Instability phenomenon with the observer gain $g_1 = -0.25\, r_s$.

Figure 8: The instability was removed by the proposed observer design, $g_1 = -l_r r_s / l_m$.

6 Appendix

6.1 Induction motor parameters

Voltage rating: 380 V; current rating: 2.2 A; number of phases: 3; rated power: 1.1 kW; frequency: 50 Hz; rated speed: 1430 rpm; $p = 2$; $l_s = 0.472$ H; $l_r = 0.4721$ H; $l_m = 0.4475$ H; $r_s = 9.65\ \Omega$; $r_r = 4.3\ \Omega$.

7 Conclusions

The feedback gain design method proposed in this paper reduces the instability region of the adaptive observer to the unobservability line $D_1$ ($\omega_{s_o} = 0$). The observer using the proposed gain does not have the unstable region, which was shown by means of the torque-speed plane and a linearized model.
the stability of the regenerating-mode operation was also confirmed by simulations. bibliography [1] a. bouhenna, c. chaigne, n. bensiali, e. etien and g. champenois, design of speed adaptation law in sensorless control of induction motor in regenerating mode, simulat. modell. pract. theory, elsevier, doi:10.1016/j.simpat.2007.04.005, vol. 15, no. 7, pp. 847-863, 2007. [2] a. bouhenna, contribution à la commande sans capteur mécanique de la machine asynchrone en mode générateur à basse vitesse, thèse de doctorat en sciences, université des sciences et de la technologie d'oran, algérie, octobre, 2007. [3] c. canudas de wit, a. youssef, j. p. barbot, ph. martin and f. malrait, observability conditions of induction motors at low frequencies, proc. cdc, sydney, pp. 1-7, 2000. [4] e. etien, n. bensiali, c. chaigne and g. champenois, adaptive speed observers for sensorless control of induction motors: a new criterion of stability, international review of electrical engineering, vol. 1, pp. 36-43, 2006. [5] m. hinkkanen, stabilization of regenerating-mode operation in sensorless induction motor drives by full-order flux observer design, ieee trans. ind. electron., vol. 51, pp. 1318-1328, 2004. [6] h. hofmann and s. sanders, speed-sensorless vector torque control of induction machines using a two-time-scale approach, ieee trans. ind. appl., vol. 34, pp. 169-177, 1998. [7] h. k. khalil, nonlinear systems, macmillan, new york, 1983. [8] h. kubota and k. matsuse, dsp-based speed adaptive flux observer of induction motor, ieee trans. ind. appl., vol. 29, pp. 344-348, 1993. [9] h. kubota and i. sato, regenerating mode low speed operation of sensorless induction motor drive with adaptive observer, ieee trans. ind. appl., vol. 38, pp. 1081-1086, 2002. [10] a. mansouri and m. chenafa and a. bouhenna and e.
etien, powerful nonlinear observer associated with field-oriented control of an induction motor, international journal of applied mathematics and computer science, vol. 14, no. 2, pp. 209-220, 2004. [11] m. chenafa, a. mansouri, a. bouhenna, e. etien, a. belaidi and m. a. denai, global stability of linearizing control with a new robust nonlinear observer of the induction motor, international journal of applied mathematics and computer science, vol. 15, no. 2, pp. 235-243, 2005. [12] m. montanari, s. peresada and a. tilli, observerless scheme for sensorless control of induction motor: stability analysis and design procedure, proc. of the 10th mediterranean conference on control and automation, med'02, lisbon, 2002. [13] s. suwankawin and s. sangwongwanich, a speed-sensorless im drive with decoupling control and stability analysis of speed estimation, ieee trans. ind. electron., vol. 49, pp. 444-455, 2002. [14] s. suwankawin and s. sangwongwanich, design strategy of an adaptive full-order observer for speed-sensorless induction-motor drives: tracking performance and stabilization, ieee trans. ind. electron., vol. 53, pp. 96-119, 2006. [15] g. yang and t. chin, adaptive-speed identification scheme for a vector-controlled speed sensorless inverter-induction motor drive, ieee trans. ind. appl., vol. 29, pp. 820-825, 1993. a. bouhenna1, a. mansouri1, m. chenafa1, and a. belaidi1, 1 e.n.s.e.t. d'oran, laboratoire d'automatique et d'analyses des systèmes (l.a.a.s.), département de génie électrique, b.p. 1523, el m'naouer, oran, algérie. e-mail: bouhenna @ enset-oran.dz (abouhenna @ yahoo.fr). received: october 10, 2007. abderrahmane bouhenna was born in 1955. he received the dipl. eng. degree in electronic engineering, the m.
s. degree and the doctorate in automatic control from u.s.t.o., oran, algeria in 1980, 1987 and 2007 respectively. he is currently a professor and researcher in the laboratory of automatics and systems analysis at enset of oran (algeria). he works on sensorless control and observers of induction motors, and has obtained results in this domain on the stabilisation of the observers and the control of the induction machine in regenerating mode at low speed. abdellah mansouri was born in oran, algeria, in 1953. he received his bs degree in electronic engineering from usto (algeria) in 1979, the ms degree in engineering control from usto in 1991, and the phd degree in engineering control from usto in 2004. he is currently professor of automatic control at enset of oran (algeria). his research interests are nonlinear control and observers applied to induction motors and manipulator robots. mohammed chenafa was born in oran, algeria, in 1954. he received his bs degree in electronic engineering from usto (algeria) in 1979, the ms degree in signal processing and robotics from usto in 1998, and the phd degree in engineering control from usto in 2005. he is currently professor of automatic control at enset of oran (algeria). his research interests are nonlinear control and observers applied to induction motors and manipulator robots. abdelkader belaidi is a professor; in 1981 he obtained a ph.d in radiation physics at the university of east anglia, norwich, england. his fields of interest are collision damage in materials, and neural networks and fuzzy logic. he is now a professor of physics and applied computing at the higher school of education, oran, algeria. he is also head of the automatic and system analysis laboratory (laas). international journal of computers, communications & control vol. iii (2008), no. 1, pp. 60-68 evaluation of the recorded state mechanism for protecting agent integrity against malicious hosts kamalrulnizam abu bakar, b. s.
doherty abstract: as agent technology is expected to become a possible base platform for an electronic services framework, especially in the area of electronic commerce, reliable security protection is a crucial aspect, since some transactions in this area might involve confidential information, such as a credit card number, bank account information or some form of digital cash, that has value and might therefore be attacked. in addition, without proper and reliable security protection, the widespread use of agent technology in real world applications could be impeded. in this paper, an evaluation of the recorded state mechanism (rsm) previously proposed by the authors is presented. the evaluation examines the rsm security protection and implementation overhead, in order to analyse the rsm security strength and implementation feasibility in real world applications. keywords: agent security, malicious host, recorded state mechanism. 1 introduction problems in agent technology arise when agents are used in an open and unsecured environment. for example, a customised agent application is sent out to visit several airline servers (in an open and unsecured environment) to find a suitable flight. in this example, the agent application is allowed to completely migrate to the remote server (the agent originating host transfers the agent's code, data and state to the remote server) and execute in the remote server environment (the remote server executes the receiving agent application), to take advantage of exploiting resources near the data source and thus reducing network traffic [20]. this opens a greater opportunity for the agent application to be abused by the executing host, because the agent application is fully under the control of the executing host [10, 19].
an example of an attack by the executing host (the malicious host) is to tamper with the agent's data or state, so that the agent will forget all the previous visits and offers held by the agent, thus forcing the agent (application) to accept an offer from the malicious host even though the malicious host's offer is not the best offer [10, 17, 19]. this kind of attack is known as a manipulation attack [11, 10]. in this attack, the owner of the agent may not know the attack has happened. this is because the malicious host may make subtle changes in the agent's code, data and state, which are difficult to detect, thus enabling the malicious host to achieve its objective. in addition, the agent (application) that returns from the malicious host does not show any different behaviour from an untampered agent, which makes the attack difficult to detect and prevent. the problem of the manipulation attack has been addressed by the authors using the recorded state mechanism (rsm) [1, 2]. the rsm uses the state of an agent, which is recorded during the agent execution process inside an execution host environment, to detect the malicious host manipulation attack. in this paper the evaluation of the recorded state mechanism is presented. the evaluation analyses the rsm's security and overhead, and its feasibility in real world applications. the paper is organized as follows: section 2 presents the evaluation of the recorded state mechanism, which includes the analysis of the security and implementation overhead of the rsm. section 3 presents a discussion and the conclusion is presented in section 4. copyright © 2006-2008 by ccc publications. 2 the evaluation of the recorded state mechanism the recorded state mechanism is an integrity protection mechanism that is able to detect manipulation attacks from a malicious host.
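the manipulation attack described above can be illustrated with a toy sketch (the host names and prices below are hypothetical, not from the paper): a shopping agent accumulates offers in its state, a malicious host erases that collected state, and the agent is left to "choose" the only offer it still remembers.

```python
# Toy illustration of the manipulation attack (all hosts and prices are
# hypothetical): a malicious host wipes the agent's collected offers so
# that its own, worse offer becomes the "best" one the agent knows.
class ShoppingAgent:
    def __init__(self):
        self.offers = {}  # part of the agent's mutable state

    def visit(self, host: str, price: int):
        self.offers[host] = price

    def best_offer(self):
        return min(self.offers.items(), key=lambda kv: kv[1])

agent = ShoppingAgent()
agent.visit("airline-a", 120)
agent.visit("airline-b", 95)

# The malicious host tampers with the agent state before adding its offer.
agent.offers.clear()
agent.visit("airline-evil", 300)

print(agent.best_offer())  # ('airline-evil', 300) despite the earlier 95
```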
the mechanism consists of three different types of container, the recordedreadonly, recordedexecuteonly and recordedcollectonly, that are used to record the agent state information. this recorded agent state information, which consists of the data of the agent (located in its variables) and the execution information (such as the program counter, the call stack and a few more items), is used for detecting any modification attacks from the malicious host, in order to protect the integrity of the agent during agent execution inside the malicious host environment. the evaluation of the recorded state mechanism is presented by examining the rsm security protection and implementation overhead, which are discussed in the next sections. 2.1 the security analysis of the recorded state mechanism to assess the strength of the recorded state mechanism, its ability to handle well-known attacks is discussed in table 1. summary of the evaluation of the recorded state mechanism: the recorded state mechanism is able to detect most of the malicious host attacks that try to tamper with the agent's data and state integrity. this mechanism, when combined with the distributed migration pattern, can prevent collaboration attacks by two or more hosts and extraction of information by the malicious host. however, attacks such as an execution host lying about input data cannot be detected or prevented by this mechanism, because the attack does not alter any state information, and so leaves no trace. 2.2 the overhead of implementing the recorded state mechanism the experiments to measure the overhead of implementing the recorded state mechanism were conducted using six 400 mhz sun ultra sparc 5 workstations with 128 mb of main memory. each of the workstations runs the solaris 8 operating system and is connected to the others using 100 mbit/s utp (unshielded twisted pair category 5e) cable. all of the workstations involved in this experiment were situated in the same room.
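the three containers are described only at a high level in the text; the sketch below shows one possible way (an assumption for illustration, not the authors' implementation) to record state snapshots into a named container together with a sha-1 digest that a verifier can later recheck:

```python
# Hedged sketch of RSM-style state containers: each recorded entry is
# paired with a SHA-1 digest so that later modification of the recorded
# state becomes detectable. The container names come from the paper;
# the data layout and digest scheme are assumptions for illustration.
import hashlib
import json

class RecordedContainer:
    def __init__(self, name: str):
        # e.g. "recordedReadOnly", "recordedExecuteOnly", "recordedCollectOnly"
        self.name = name
        self.entries = []  # list of (state, digest) pairs

    def record(self, state: dict):
        blob = json.dumps(state, sort_keys=True).encode()
        self.entries.append((state, hashlib.sha1(blob).hexdigest()))

    def verify(self) -> bool:
        # Recompute each digest; any tampered state entry fails the check.
        return all(
            hashlib.sha1(json.dumps(s, sort_keys=True).encode()).hexdigest() == d
            for s, d in self.entries
        )

c = RecordedContainer("recordedExecuteOnly")
c.record({"pc": 42, "stack_depth": 3})
print(c.verify())              # True
c.entries[0][0]["pc"] = 99     # tampering with the recorded state...
print(c.verify())              # False: detected on verification
```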
in this configuration, one workstation is chosen among the six to be the home host for the agent, and only this host has the permission to manage and dispatch the agent. the rest of the workstations are assumed to be remote hosts, having only the capability to receive the agent and dispatch it back to its home host. to evaluate the security overhead of implementing the recorded state mechanism in an agent-based application, times are measured starting from the sending of the agents to the remote hosts and ending with the receiving of the agents back from the remote hosts. the times are measured using the "system.currenttimemillis()" method of the java language, which produces a specific instant in time with millisecond precision [14]. the experiments are done using four different numbers of remote hosts (one, two, three and five) and three different types of agent: a plain agent (an agent without security mechanisms), an agent with a cryptographic security mechanism (crypto) and an agent with these security mechanisms plus the recorded state mechanism (crypto+rsm). four different experiments are used in examining the overhead of implementing the recorded state mechanism: one input and one cycle, one hundred inputs and one cycle, one input and one thousand cycles, and one hundred inputs and one thousand cycles. the input represents a character; this character is used as data that needs to be protected by the agent. the cycle, on the other hand, represents a loop that is used to simulate agent tasks. note that all of the experiments on the recorded state mechanism use the master-slave agent architecture and operate on the distributed migration pattern [1, 2]. the experiment is performed for 20 runs and the result for each run is gathered in milliseconds. from the authors' observation, all 20 runs in this experiment give very similar results and for this reason, 20 runs of the experiment are considered sufficient. the average result of all 20 runs is taken and converted into seconds. the result is then rounded and presented to two decimal places, as given and illustrated in tables 2 to 5 and figures 1 to 4 respectively.

table 1: possible attacks and solutions of the recorded state mechanism

attack: the malicious host could make subtle changes to read-only data inside the recordedreadonly container to enable it to achieve its objective, in such a way that the owner's digital signature on the recordedreadonly container still remains valid.
solution: this attack can be ruled out, because the digital signature (using sha-1), which is used in the recorded state mechanism, is secured against brute-force collision and inversion attacks; using sha-1 as the digest function makes it computationally infeasible for the attacker to find a message which corresponds to a given message digest, or to find two different messages which produce the same message digest.

attack: the malicious host could also make subtle changes to both the read-only data and the digital signature of the recordedreadonly container, in order to make both of them appear to be valid.
solution: this possible attack can also be ruled out because, in order to create a new digital signature that will be valid for other hosts, the malicious host needs the private key of the agent owner. however, only the key owner has this key and no other entity can produce this key from a modified hash value.

attack: the malicious host could also tamper with the agent state recorded in the recordedexecuteonly and recordedcollectonly containers by modifying the agent state before the state is recorded into both containers. this is due to the fact that the recording process of the recorded state mechanism is under the malicious host's control and therefore the malicious host can do anything to it.
solution: this attack can also be ruled out because the malicious host has to use its own private key, which contains its identity, to compute and recompute the digital signature on the tampered state, thus revealing itself during the verification process.

attack: the malicious host could also attack the agent by launching collaboration attacks in cooperation with two or more consecutive hosts, in order to defeat the checking process for detecting any malicious host attack from the previous visit, or to remove any agent state that records the changes made by the previous host on the agent during its execution session. in addition, the malicious host could also use the collaboration attack to extract information from the agent to win some competition over other hosts.
solution: the attack can be ruled out since the use of the master-slave agent architecture in implementing the recorded state mechanism only allows different agents to be sent to and served by different remote hosts. an agent visits only one host, thus precluding the collaboration attack. in addition, the information extracted from a single agent does not give the malicious host enough information to allow it to win any competition over other hosts, because the extracted information is not sufficient by itself.

attack: the malicious host could also lie about the input data, which is recorded in the recordedexecuteonly and recordedcollectonly containers, in order to deceive the owner of the agent.
solution: this attack cannot be ruled out by the recorded state mechanism, because the input data that was supplied by the malicious host is assumed to be correct data by the agent, since only the malicious host knows whether the input data is correct or incorrect. however, the owner of the agent knows the identity of the host which supplied the input data to the agent, because all the data and state are digitally signed by the execution host before the data and state leave the execution host. thus, the owner of the agent knows which execution host is responsible for supplying the false input data.

attack: the malicious host could alter the random sequence 3-level obfuscation algorithm to execute in many different ways (incorrect execution attack).
solution: this attack can be ruled out since the recorded state mechanism checks the results gathered by the returning slave agent by executing the same execution process that is assumed to have been done by the slave agent inside the remote host execution environment.
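the signed-container arguments in table 1 rest on the fact that a host without the signing key cannot produce a valid signature for modified data. as a minimal stand-in (the rsm uses asymmetric digital signatures; an hmac keyed with an owner secret is used here only to keep the illustration short and self-contained):

```python
# Minimal stand-in for the signed-container check in table 1: an HMAC
# keyed with the owner's secret plays the role of the owner's digital
# signature. A real RSM would use asymmetric signatures; this sketch
# only illustrates why tampering without the key is detectable.
import hashlib
import hmac

owner_key = b"agent-owner-secret"  # known only to the agent owner

def sign(data: bytes) -> str:
    return hmac.new(owner_key, data, hashlib.sha1).hexdigest()

readonly_data = b"itinerary-constraints:v1"
tag = sign(readonly_data)

# A malicious host edits the read-only data but cannot recompute a
# valid tag without owner_key, so verification fails on return.
tampered = b"itinerary-constraints:v2"
print(hmac.compare_digest(tag, sign(readonly_data)))  # True
print(hmac.compare_digest(tag, sign(tampered)))       # False
```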
remote hosts | mean (plain / cyp / cyp+rsm) | std. error (plain / cyp / cyp+rsm) | std. deviation (plain / cyp / cyp+rsm)
1 | 1.54 / 29.15 / 28.91 | 0.003 / 2.18 / 1.23 | 0.01 / 9.73 / 5.52
2 | 2.49 / 30.89 / 31.11 | 0.003 / 2.00 / 1.38 | 0.02 / 8.93 / 6.19
3 | 3.37 / 31.00 / 32.00 | 0.004 / 0.93 / 0.98 | 0.02 / 4.15 / 4.38
5 | 5.35 / 36.36 / 36.03 | 0.011 / 1.50 / 1.87 | 0.05 / 6.70 / 8.37
(plain = without cryptographic mechanism; cyp (crypto) = cryptographic mechanism; rsm = recorded state mechanism; times in seconds)
table 2: summary statistics of the recorded state mechanism overhead (1 input and 1 cycle experiment)
figure 1: security overhead of the recorded state mechanism (1 input and 1 cycle experiment)
based on the observation of the results gained through the experiments, it can be seen that the standard error and the standard deviation of the security overhead are similar with respect to the number of remote hosts but differ between agents. agents with a security mechanism give a larger standard error, since these agents have to execute many tasks such as generating the cryptography key, generating and verifying digital signatures, and executing encryption and decryption. from the results given in table 2 and illustrated in figure 1, it can be seen that the mean security overhead is almost the same for agents with the security mechanism and agents with the security mechanism plus rsm: the security overhead for the agent with the security mechanism plus rsm is just 7.82 % higher than the overhead for the agent with the security mechanism alone. however, both agents' security overheads are higher by up to 1792.86 % than the overhead of the plain agent.
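the mean, standard error and standard deviation columns of tables 2 to 5, and the percentage comparisons quoted in the text, can be reproduced from raw run times with a few lines. the 20 run times below are hypothetical; the two table cells used in the percentage check are taken from tables 4 and 2 respectively:

```python
# Sketch: reproduce the summary statistics and percentage comparisons
# used in tables 2-5. The 20 raw run times are hypothetical.
from statistics import mean, stdev

def summarize(runs_ms):
    runs_s = [t / 1000 for t in runs_ms]   # convert milliseconds to seconds
    m = mean(runs_s)
    sd = stdev(runs_s)                     # sample standard deviation
    se = sd / len(runs_s) ** 0.5           # standard error of the mean
    return round(m, 2), round(se, 3), round(sd, 2)

print(summarize([1530 + 2 * i for i in range(20)]))  # (1.55, 0.003, 0.01)

def pct_increase(a, b):
    # percentage by which a exceeds b, as used in the text
    return round((a - b) / b * 100, 2)

print(pct_increase(33.36, 30.94))  # 7.82    (table 4: crypto+rsm vs crypto)
print(pct_increase(29.15, 1.54))   # 1792.86 (table 2: crypto vs plain)
```

the 7.82 % and 1792.86 % figures quoted in the text follow from this percentage-increase formula applied to the corresponding table cells.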
from table 3 and figure 2, the security overhead for the plain agent is almost the same as with one input (table 2 and figure 1), but the security overhead for the agents with security mechanisms increases by up to 60.52 % over the security overhead with one input.
remote hosts | mean (plain / cyp / cyp+rsm) | std. error (plain / cyp / cyp+rsm) | std. deviation (plain / cyp / cyp+rsm)
1 | 1.55 / 45.43 / 45.32 | 0.003 / 1.21 / 1.19 | 0.01 / 5.42 / 5.34
2 | 2.53 / 45.99 / 48.21 | 0.005 / 1.35 / 1.83 | 0.02 / 6.03 / 8.18
3 | 3.37 / 49.76 / 49.42 | 0.002 / 1.13 / 1.26 | 0.01 / 5.03 / 5.65
5 | 5.43 / 53.09 / 53.14 | 0.005 / 1.09 / 1.76 | 0.02 / 4.86 / 7.85
table 3: summary statistics of the recorded state mechanism overhead (100 inputs and 1 cycle experiment)
figure 2: security overhead of the recorded state mechanism (100 inputs and 1 cycle experiment)
the results in table 4, table 5, figure 3 and figure 4 show that the security overhead for all the agents is similar to the security overhead of the agents with the same number of inputs but a different number of cycles, given in table 2, table 3, figure 1 and figure 2 respectively. therefore, it is worth noting that the number of cycles does not affect the security overhead of the agents.
remote hosts | mean (plain / cyp / cyp+rsm) | std. error (plain / cyp / cyp+rsm) | std. deviation (plain / cyp / cyp+rsm)
1 | 2.41 / 27.65 / 28.94 | 0.008 / 0.77 / 1.55 | 0.04 / 3.45 / 6.94
2 | 3.53 / 30.91 / 31.31 | 0.008 / 1.38 / 1.68 | 0.04 / 6.18 / 7.53
3 | 5.17 / 30.94 / 33.36 | 0.005 / 1.00 / 1.04 | 0.02 / 4.46 / 4.65
5 | 7.34 / 38.46 / 37.57 | 0.010 / 1.29 / 1.69 | 0.05 / 5.76 / 7.55
table 4: summary statistics of the recorded state mechanism overhead (1 input and 10000 cycles experiment)
figure 3: security overhead of the recorded state mechanism (1 input and 10000 cycles experiment)
summary of experimental results: it can be seen from the results shown in tables 2 to 5 and illustrated in figures 1 to 4 that the implementation of the recorded state mechanism increases the overhead by only up to an acceptable 7.82 % when compared to the agent with the security mechanism, but by up to 2830.96 % when compared to the plain agent. however, the low overhead of the plain agent is not important since the plain agent does not have any security protection.
remote hosts | mean (plain / cyp / cyp+rsm) | std. error (plain / cyp / cyp+rsm) | std. deviation (plain / cyp / cyp+rsm)
1 | 2.39 / 45.93 / 47.26 | 0.005 / 1.06 / 1.11 | 0.02 / 4.76 / 4.97
2 | 3.56 / 48.63 / 48.85 | 0.005 / 0.73 / 1.42 | 0.02 / 3.28 / 6.36
3 | 5.17 / 50.86 / 51.54 | 0.008 / 1.52 / 1.57 | 0.03 / 6.82 / 7.01
5 | 7.40 / 54.36 / 54.48 | 0.015 / 1.80 / 1.94 | 0.07 / 8.05 / 8.66
table 5: summary statistics of the recorded state mechanism overhead (100 inputs and 10000 cycles experiment)
figure 4: security overhead of the recorded state mechanism (100 inputs and 10000 cycles experiment) 3 discussion integrity protection is one of the main requirements for protecting agents against malicious host attacks. the requirement was successfully fulfilled (in the sense of detecting or preventing some attacks and making others more difficult) by the recorded state mechanism, which is able to detect most of the malicious host attacks that try to tamper with the agent's data and state integrity. the analysis of the security strength and implementation feasibility of the recorded state mechanism in real world applications has been conducted. the security strength of the recorded state mechanism has been analysed by evaluating the mechanism against well-known attack scenarios, and from the results it can be seen that the mechanism is capable of preventing or detecting some of the attacks and making other attacks more difficult. the implementation feasibility is measured by examining the overhead imposed by the mechanism in protecting agent integrity against malicious host attacks. the result shows that the rsm imposes an acceptable overhead. 4 conclusion this paper presented the evaluation of the recorded state mechanism for protecting the integrity of agents against malicious host attacks. the evaluation produced significant results on the strength of the recorded state mechanism: it is able to prevent or detect some attacks and make other attacks more difficult, with an acceptable overhead. in conclusion, the mechanism offers significant advances in the protection of agents against malicious host attacks and is therefore suitable for use in real world applications. bibliography [1] abu bakar, k. and doherty, b. s. a new model for protecting mobile agents against malicious host. proceedings of the iadis international conference www/internet. iadis press, portugal (2002) 780-784 [2] abu bakar, k. and doherty, b. s.
protecting mobile agents against a malicious host attacks using recorded state mechanism. proceedings of the 2003 international conference on informatics, cybernetics and systems. i-shou university (2003) 396-401 [3] chess, d.m. security issues in mobile code systems. g. vigna (ed.): mobile agents and security, vol. 1419. springer verlag (1998) 1-14 [4] chess, d.m. and harrison, c.g. and kershenbaum, a. mobile agents: are they a good idea?. ibm research report. ibm research division (1995). http://www.research.ibm.com/iagents/publications.html [5] corradi, a. and cremonini, m. and montanari, r. and stefanelli, c. mobile agents integrity for electronic commerce application. information systems. elsevier science (1999) 519-533 [6] diaz, j. and gutierrez, d. and lovelle, j. an implementation of a secure java2-based mobile agent system. proceedings of the second international conference on the practical application of java. practical application company (2000) 125-142 [7] farmer, w.m. and guttman, j.d. and swarup, v.: security for mobile agents: issues and requirements. proceedings of the 19th national information system security conference. baltimore (1996) 591-597 [8] ford, w. and baum, m. secure electronic commerce, 2nd ed. prentice hall (2001) [9] guan, x. and yang, y. and you, j. pom - a mobile agent security model against malicious hosts. proceedings of is & n'99. springer verlag (1999) 155-167 [10] hohl, f. a framework to protect mobile agents by using reference states. in: proceedings of the 20th international conference on distributed computing systems (icdcs 2000). ieee computer society (2000) 410-417 [11] hohl, f.: time limited blackbox security: protecting mobile agents from malicious hosts. in: g. vigna (ed.). mobile agents and security. lecture notes in computer science, vol. 1419. springer-verlag, berlin (1998) 92-113 [12] hohl, f.
a model of attacks of malicious hosts against mobile agents. in: 4th ecoop workshop on mobile object systems (mos'98): secure internet mobile computations. (1998) http://mole.informatik.uni-stuttgart.de/papers.html [13] hohl, f. an approach to solve the problem of malicious hosts. institute of parallel and distributed high-performance systems (ipvr), university of stuttgart, germany (1997) [14] sun microsystems, inc. java 2 platform std. ed. v1.3.1 http://java.sun.com/j2se/1.3/docs/api/index.html (2004) [15] kun, y. and xin, g. and dayou, l. security in mobile agent system: problems and approaches. operating systems review, vol. 34, no. 1. acm (2000) 21-28 [16] reisner, j. and donkor, e. protecting software agents from malicious hosts using quantum computing. proceedings of spie, the international society for optical engineering (2000) 50-57 [17] sander, t. and tschudin, c.: protecting mobile agents against malicious hosts. in: g. vigna (ed.). mobile agents and security. lecture notes in computer science, vol. 1419. springer-verlag, berlin (1998) 44-60 [18] schneier, b. applied cryptography, 2nd ed. wiley & sons (1996) [19] vigna, g. cryptographic traces for mobile agents. in: g. vigna (ed.). mobile agents and security. lecture notes in computer science, vol. 1419. springer verlag (1998) 137-153 [20] wang, t., guan, s. and chan, t.: integrity protection for code-on-demand mobile agents in e-commerce. the journal of systems and software. elsevier (2002) 211-221 kamalrulnizam abu bakar, faculty of computer science and information system, universiti teknologi malaysia, 81310 utm skudai, johor d. t., malaysia, e-mail: kamarul@fsksm.utm.my; b. s. doherty, school of engineering and applied science, aston university, aston triangle, birmingham b4 7et, united kingdom, e-mail: b.s.doherty@aston.ac.uk. received: july 22, 2006. kamalrulnizam abu bakar is a lecturer at universiti teknologi malaysia, malaysia.
he received the diploma and degree of computer science in 1994 and 1996 respectively from universiti teknologi malaysia, malaysia. he then received a masters degree in computer communication and networks from leeds metropolitan university, united kingdom, in 1998 and a phd in network security from aston university, united kingdom, in 2004. his current research interests include computer and network security, distributed systems and parallel processing, grid computing, and wireless and cellular networks. bernard s. doherty (born october 2nd, 1945) obtained the degrees of bachelor of engineering (electrical), bachelor of arts and master of engineering science from the university of melbourne in 1967, 1971 and 1981 respectively. he has held positions with the state electricity commission of victoria, lm ericsson pty ltd, swinburne college of technology (all in melbourne) and, since 1980, at aston university (birmingham, uk), where he is presently lecturer in computer science. his main fields of teaching and research are distributed and networked applications and information security. in addition to supervising a number of doctoral students, he has developed computer-based administration and teaching software, written a number of papers and presented papers at international conferences. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. vi (2011), no. 1 (march), pp. 113-124 eecda: energy efficient clustering and data aggregation protocol for heterogeneous wireless sensor networks d. kumar, t.c. aseri, r.b. patel dilip kumar centre for development of advanced computing, mohali, india e-mail: dilip.k78@gmail.com trilok c. aseri pec university of technology, chandigarh, india e-mail: a_trilok_chand@yahoo.com r.b. patel m.m. university, mullana, india e-mail: patel_r_b@yahoo.com abstract: in recent years, energy efficiency and data gathering have been major concerns in many applications of wireless sensor networks (wsns).
One of the important issues in WSNs is how to reduce energy consumption so as to prolong the network lifetime. For this purpose, many novel techniques are required to improve the energy efficiency and lifetime of the network. In this paper, we propose a novel Energy Efficient Clustering and Data Aggregation (EECDA) protocol for heterogeneous WSNs which combines the ideas of energy-efficient cluster-based routing and data aggregation to achieve better performance in terms of lifetime and stability. EECDA includes a novel cluster-head election technique and selects the path with the maximum sum of energy residues for data transmission, instead of the path with minimum energy consumption. Simulation results show that EECDA balances the energy consumption and prolongs the network lifetime by a factor of 51%, 35% and 10% when compared with Low-Energy Adaptive Clustering Hierarchy (LEACH), the Energy Efficient Hierarchical Clustering Algorithm (EEHCA) and the Effective Data Gathering Algorithm (EDGA), respectively.

Keywords: clustering; data aggregation; lifetime; heterogeneous wireless sensor networks.

1 Introduction

Over the past few years, wireless sensor networks (WSNs) have attracted many researchers because of their wide potential applications and many research challenges. Early studies on WSNs mainly focused on homogeneous WSNs, in which all nodes have the same system resources. However, heterogeneous WSNs are becoming more and more popular, because the benefits of deploying nodes with different capabilities in order to meet the demands of various applications have been demonstrated in recent literature [1], [2]. One of the crucial challenges in the organization of WSNs is energy efficiency and stability, because the battery capacities of sensor nodes are limited and replacing them is impractical. Sensor nodes use a large amount of energy for data transmission and aggregation.
Therefore, new energy-efficient routing protocols are required to reduce energy consumption. In this paper, we propose a novel Energy-Efficient Clustering and Data Aggregation (EECDA) protocol for heterogeneous WSNs. In this approach, a new cluster head (CH) election and data communication mechanism is presented to extend the lifetime and stability of the network.

Copyright © 2006-2011 by CCC Publications

After the CH election, a path with the maximum sum of residual energy is selected for data communication instead of the path with minimum energy consumption. Each CH first aggregates the received data and then transmits the aggregated data to the base station (BS). The main contribution of the EECDA protocol is to provide the longest stability period (until the first node dies) and an improved network lifetime in comparison with Low-Energy Adaptive Clustering Hierarchy (LEACH), the Energy-Efficient Hierarchical Clustering Algorithm (EEHCA) and the Effective Data Gathering Algorithm (EDGA). The rest of this paper is organized as follows. Section 2 presents related work. Section 3 describes the EECDA protocol. Section 4 presents simulation results, and finally the paper is concluded in Section 5.

2 Related Work

Many recent research works in the area of cluster-based WSNs have focused extensively on energy efficiency, lifetime, stability and scalability. In the past few years, numerous clustering algorithms have been proposed for a wide range of applications [3], [4], [5]. Data aggregation and hierarchical mechanisms are commonly used in many critical applications of WSNs; they reduce data redundancy and communication load [6]. LEACH [7] is the first clustering protocol based on a single-hop communication model. In LEACH, during the setup phase, each node generates a random number between 0 and 1.
If this random number is smaller than the threshold value T(s), given by equation (1), then the node becomes a CH for the current round. During each round new CHs are elected, and as a result the energy load is distributed in a balanced way among the CHs and the other nodes of the network.

    T(s) = p_opt / (1 - p_opt * (r mod (1/p_opt)))   if s ∈ G
    T(s) = 0                                          otherwise     (1)

where p_opt is the desired percentage of CHs, r is the index of the current round, and G is the set of sensor nodes that have not been CHs in the last 1/p_opt rounds. In this paper we refer to a block of 1/p_opt rounds as an epoch of the heterogeneous WSN.

Power-Efficient Gathering in Sensor Information Systems (PEGASIS) [8] is a chain-based power-efficient protocol based on LEACH. It assumes that each node knows the location of all other nodes. The chain is constructed by a greedy algorithm starting from the farthest node. The chain leader aggregates data and forwards it to the BS. In order to balance the communication overhead between the chain leader and the BS, each node in the chain takes a turn as the leader. In [9], the authors described a heuristic approach to solve the data-gathering problem with aggregation in sensor networks. In this scheme, the data is collected in an efficient manner from all the sensor nodes and transmitted to the BS so as to maximize the lifetime of the network. In [10], the authors studied the impact of heterogeneity of sensor nodes in terms of their energy and proposed a heterogeneity-aware protocol to prolong the time interval before the death of the first node. In [11], a cost-based comparative study between homogeneous and heterogeneous clustered WSNs is proposed to estimate the optimal distribution among different types of sensors, but this result is hard to use if the heterogeneity is due to the operation of the network. In [12], the authors developed an energy-efficient clustering protocol for WSNs which is well suited to periodic data-gathering applications.
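To make the election rule in equation (1) concrete, here is a minimal sketch (ours, not from any of the cited papers; the function names are assumptions) of the per-round test a node performs in LEACH:

```python
import random

def leach_threshold(p_opt, r, eligible):
    """Threshold T(s) of equation (1).

    p_opt: desired fraction of cluster heads; r: current round index;
    eligible: True iff the node has not been a CH in the last 1/p_opt rounds.
    """
    if not eligible:
        return 0.0
    epoch = int(round(1.0 / p_opt))           # rounds per epoch
    return p_opt / (1.0 - p_opt * (r % epoch))

def elects_itself(p_opt, r, eligible, rng=random.random):
    # The node draws a uniform number in [0, 1) and becomes CH if it falls
    # below T(s); in the last eligible round of an epoch T(s) = 1, so every
    # remaining eligible node is elected.
    return rng() < leach_threshold(p_opt, r, eligible)
```

With p_opt = 0.05 the epoch is 20 rounds, and T(s) grows from 0.05 at r = 0 to 1 at r = 19, which is how LEACH guarantees each node serves as CH exactly once per epoch.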
A survey of many ad hoc and mobile ad hoc network clustering schemes is presented in [13]. In this article the authors observed that new clustering schemes are required to handle topology maintenance and to manage node movement in the network. In [14], the authors proposed a new data-gathering approach for single-hop transmission wherein both the data gathering and the aggregation are performed by the same sensor in a cluster, but the report to the BS may be sent by a different sensor. In [15], the authors investigated the problem of cluster formation for data fusion by focusing on two aspects: (i) how one can estimate the number of clusters needed to efficiently exploit the data correlation of sensors in a sensor network, and (ii) how one can pick the CHs so as to cover the whole network more efficiently. In [16], the authors analyzed the strengths and weaknesses of many existing clustering algorithms and reviewed appropriate aggregation metrics that have recently been proposed in the literature. The Energy-Efficient Protocol with Static Clustering (EEPSC) partitions the network into static clusters and utilizes CHs to distribute the energy load among high-power sensor nodes, extending the network lifetime [17]. A distributed energy-saving clustering algorithm called BPEC has been proposed in [18]. In this algorithm, CHs are selected by two probabilities: the first is based on the ratio between the average residual energy of the neighbor nodes and the node's own residual energy, and the second on the node's degree. With this algorithm, the entire-network broadcasting complexity is O(n) and the computing complexity is O(1). The results show that when the network has a higher communication coverage density, analytical and experimental results are very close.
The Energy-Efficient Hierarchical Clustering Algorithm (EEHCA) [19] adopts a new method for CH election which avoids frequent re-election of CHs. A new concept of backup CHs is introduced, which improves the performance over LEACH and Hybrid Energy-Efficient Distributed clustering (HEED) in terms of network lifetime. An energy-efficient hierarchical data-gathering protocol called EDGA adopts weighted election probabilities for each heterogeneous sensor node to become a CH, which better handles heterogeneous energy circumstances [20]. The results demonstrate that EDGA significantly outperforms LEACH and HEED in terms of network lifetime. The authors in [21] discussed a new CH election problem based on a set of coverage-aware cost metrics which favor nodes deployed in densely populated network areas. The coverage-aware election of CH nodes, active sensor nodes and routers in a clustered WSN increases the lifetime as compared with traditional energy-based election methods. In [22], the authors presented a corona model to maximize the network lifetime by dividing the maximal transmission range of sensors into different levels. Sensor nodes belonging to the same corona have the same transmission range, whereas different coronas have different transmission ranges. In [23], the authors presented a short survey of the main techniques used for energy conservation in WSNs; the main focus is on duty-cycle schemes, which represent the most suitable technique for energy saving. In [24], the authors reviewed many existing definitions of network lifetime and discussed the merits and demerits of these definitions.

3 EECDA Protocol

The main goal of the EECDA protocol is to manage the energy consumption of sensor nodes efficiently by involving them in single-hop communication within a cluster. A data aggregation and fusion technique is used to reduce the number of messages transmitted to the BS, in order to save energy and prevent congestion.
To make the protocol implementable, we adopt a few reasonable assumptions: (i) n sensor nodes are uniformly dispersed within a square field; (ii) all sensor nodes and the BS are stationary after deployment; (iii) the WSN consists of nodes that are heterogeneous in terms of energy; (iv) CHs perform data aggregation; (v) the BS is not energy-limited in comparison with the other nodes in the network. We use the same radio model as defined in [7]. The amount of energy required to transmit an l-bit packet over a distance d is given by equation (2):

    E_TX(l, d) = l*E_elec + l*ε_fs*d^2   if d <= d_0
    E_TX(l, d) = l*E_elec + l*ε_mp*d^4   if d > d_0     (2)

E_elec is the energy dissipated to run the transmitter or receiver circuitry. The parameters ε_fs and ε_mp are the amounts of energy dissipated per bit in the radio-frequency amplifier below and above the crossover distance d_0, which is given by equation (3):

    d_0 = sqrt(ε_fs / ε_mp)     (3)

The amount of energy required to receive a packet is given by equation (4):

    E_RX(l) = l*E_elec     (4)

3.1 Impact of Heterogeneity on Network Performance

Placing a few heterogeneous nodes in the network brings three main benefits: (i) extended network lifetime: the average energy consumed in forwarding a packet from the heterogeneous nodes to the BS is much less than the energy consumed in a homogeneous sensor network; (ii) improved reliability of data communication: a heterogeneous sensor network can achieve a much higher end-to-end delivery rate than a homogeneous one; and (iii) decreased latency of data transmission: the heterogeneous nodes can decrease the forwarding latency by using fewer hops to reach the BS.

3.2 Optimal Number of Clusters

The optimal probability of a node becoming a CH is very important in WSNs. Clustering is optimal in the sense that the energy consumption is well distributed among all the sensor nodes and the total energy consumption is minimal.
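The first-order radio model of equations (2)-(4) can be sketched directly; a minimal rendering (ours), using the parameter values that appear later in Table 1:

```python
import math

# Parameter values from Table 1 (energies in joules).
E_ELEC = 50e-9       # electronics energy per bit
EPS_FS = 10e-12      # free-space amplifier coefficient (d^2 regime)
EPS_MP = 0.0013e-12  # multipath amplifier coefficient (d^4 regime)

# Crossover distance, equation (3).
D0 = math.sqrt(EPS_FS / EPS_MP)

def e_tx(l_bits, d):
    """Energy to transmit an l-bit packet over distance d, equation (2)."""
    if d <= D0:
        return l_bits * E_ELEC + l_bits * EPS_FS * d ** 2
    return l_bits * E_ELEC + l_bits * EPS_MP * d ** 4

def e_rx(l_bits):
    """Energy to receive an l-bit packet, equation (4)."""
    return l_bits * E_ELEC
```

The two branches of equation (2) agree at d = d_0 (since ε_fs·d_0² = ε_mp·d_0⁴ there), so the model is continuous; with the Table 1 values, d_0 ≈ 87.7 m.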
Such optimal clustering highly depends on the energy model. For EECDA, we use an energy model similar to that discussed in [7]. Let us assume an area A = M × M square meters over which n nodes are uniformly distributed. For simplicity, assume the BS is located in the center of the field and that the distance from any node to the BS or to its CH is at most d_0, so the free-space amplifier model applies. The energy dissipated by a CH node during a round is given by equation (5):

    E_CH = (n/k)*l*(E_elec + E_DA) + l*ε_fs*d_BS^2     (5)

where k is the number of clusters, E_DA is the data aggregation energy, and d_BS is the average distance between a CH and the BS, given by equation (6):

    d_BS = ∫∫ sqrt(x^2 + y^2) * (1/A) dx dy = 0.765 * (M/2)     (6)

The energy dissipated by a non-CH node is given by equation (7):

    E_nonCH = l*(E_elec + ε_fs*d_CH^2)     (7)

where d_CH is the average distance between a non-CH node and its associated CH, given by equation (8) [10]:

    d_CH^2 = ∫∫ (x^2 + y^2)*ρ(x, y) dx dy = M^2/(2*π*k)     (8)

where ρ(x, y) is the node distribution and M is the side of the monitoring field. The energy dissipated in a cluster per round is given by equation (9):

    E_cluster = E_CH + (n/k)*E_nonCH     (9)

Summing over the k clusters and substituting equations (5) and (7), the total energy dissipated per round is given by equation (10):

    E_T = l*(2*n*E_elec + n*E_DA + ε_fs*(k*d_BS^2 + n*M^2/(2*π*k)))     (10)

Setting the derivative of E_T with respect to k to zero, we obtain the optimal number of clusters, given by equation (11):

    k_opt = sqrt(n/(2*π)) * sqrt(ε_fs/ε_mp) * M/d_BS^2     (11)

Using equation (6) and equation (11), the optimal probability of a node becoming a CH, p_opt, can be computed by equation (12):

    p_opt = (1/0.765) * sqrt(2/(n*π)) * sqrt(ε_fs/ε_mp)     (12)

If the clusters are not constructed in an optimal way, the total energy dissipated per round increases sharply when the number of clusters is either greater or smaller than the optimal value.
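Equations (11) and (12), evaluated literally as printed, can be wrapped as small helpers (a minimal sketch; the function and argument names are ours):

```python
import math

def k_opt(n, m_side, eps_fs, eps_mp, d_bs):
    """Optimal number of clusters, equation (11)."""
    return (math.sqrt(n / (2 * math.pi))
            * math.sqrt(eps_fs / eps_mp) * m_side / d_bs ** 2)

def p_opt(n, eps_fs, eps_mp):
    """Optimal CH-election probability, closed form of equation (12)."""
    return ((1.0 / 0.765) * math.sqrt(2.0 / (n * math.pi))
            * math.sqrt(eps_fs / eps_mp))
```

Both expressions inherit the sqrt(n) scaling of the derivation: k_opt grows as sqrt(n), while p_opt = k_opt/n shrinks as 1/sqrt(n), so quadrupling the node count halves the per-node election probability.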
3.3 CH Election Phase

EECDA considers three types of nodes (normal, advanced and super) deployed in a harsh wireless environment where battery replacement is impossible. The nodes with higher battery energy are the advanced and super nodes; the remaining nodes are normal nodes. The main aim of EECDA is to increase the energy efficiency, lifetime and stability of the network in the presence of heterogeneous nodes. Let m be the fraction of advanced and super nodes among all nodes and m_0 the proportion of super nodes among them. Let us assume the initial energy of each normal node is E_0. The initial energies of each advanced and super node are E_0*(1+α) and E_0*(1+β) respectively, meaning that an advanced node has α times and a super node β times more energy than a normal node. Intuitively, advanced and super nodes have to become CHs more often than the normal nodes, which is equivalent to a fairness constraint on energy consumption. The new heterogeneous setting has no effect on the spatial density of the network, so the a priori setting of p_opt does not change, but the total energy of the network does. The total initial energy of the new heterogeneous network is given by equation (13):

    n*E_0*{(1−m) + m*[(1−m_0)*(1+α) + m_0*(1+β)]} = n*E_0*(1 + m*(α − m_0*(α−β)))     (13)

The first improvement to the existing protocols is to increase the epoch of the sensor network in proportion to the energy increment. In order to optimize the stable region of the system, the new epoch must become (1/p_opt)*(1 + m*(α − m_0*(α−β))), because the system has m*(α − m_0*(α−β)) times more energy due to the heterogeneous nodes.
If we set the same threshold for super, advanced and normal nodes, with the difference that each normal node in G becomes a CH once every (1/p_opt)*(1 + m*(α − m_0*(α−β))) rounds per epoch, and each advanced and super node in G becomes a CH (1+α) and (1+β) times every (1/p_opt)*(1 + m*(α − m_0*(α−β))) rounds per epoch, then there is no guarantee that the number of CHs per round per epoch will be p_opt*n. This problem can be overcome by modifying the threshold equation (1). In EECDA, we assign a weight to the optimal probability p_opt. This weight must be equal to the initial energy of each node divided by the initial energy of a normal node. Let p_n, p_a and p_s be the weighted election probabilities for normal, advanced and super nodes. Virtually, there are n*(1 + m*(α − m_0*(α−β))) nodes with energy equal to the initial energy of a normal node. In order to maintain the minimum energy consumption in each round within an epoch, the average number of CHs per round per epoch must be constant and equal to p_opt*n. In the heterogeneous scenario, the average number of CHs per round per epoch is equal to (1 + m*(α − m_0*(α−β)))*n*p_n, because each virtual node has the initial energy of a normal node. Therefore, the weighted probabilities for normal, advanced and super nodes are given by equations (14)-(16), respectively:

    p_n = p_opt / (1 + m*(α − m_0*(α−β)))     (14)

    p_a = p_opt*(1+α) / (1 + m*(α − m_0*(α−β)))     (15)

    p_s = p_opt*(1+β) / (1 + m*(α − m_0*(α−β)))     (16)

Substituting equation (14) into equation (1), a new threshold is derived for normal nodes, given by equation (17):

    T(s_n) = p_n / (1 − p_n*(r mod (1/p_n)))   if s_n ∈ G'
    T(s_n) = 0                                  otherwise     (17)

where r is the current round, G' is the set of normal nodes that have not become CHs within the last 1/p_n rounds of the epoch, and T(s_n) is the threshold applied to a population of n*(1−m) normal nodes.
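The weighted probabilities of equations (14)-(16) are straightforward to compute; the sketch below (our naming) also checks the property that motivates them: the population-weighted average of p_n, p_a and p_s equals p_opt, so the expected number of CHs per round stays p_opt*n.

```python
def weighted_probabilities(p_opt, m, m0, alpha, beta):
    """Equations (14)-(16).

    m: fraction of non-normal (advanced + super) nodes; m0: fraction of
    super nodes among them; alpha, beta: extra-energy factors.
    """
    scale = 1.0 + m * (alpha - m0 * (alpha - beta))
    p_n = p_opt / scale
    return p_n, p_n * (1.0 + alpha), p_n * (1.0 + beta)

def expected_ch_fraction(p_opt, m, m0, alpha, beta):
    # Weighted average over the three node populations; equals p_opt.
    p_n, p_a, p_s = weighted_probabilities(p_opt, m, m0, alpha, beta)
    return (1 - m) * p_n + m * (1 - m0) * p_a + m * m0 * p_s
```

The identity holds because the weighted average reproduces exactly the bracketed factor of equation (13), which cancels the normalization in equation (14).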
This guarantees that each normal node will become a CH exactly once every (1/p_opt)*(1 + m*(α − m_0*(α−β))) rounds per epoch, and that the average number of CHs among normal nodes per round per epoch is n*(1−m)*p_n. Similarly, new thresholds for advanced and super nodes can be derived by substituting equations (15) and (16) into equation (1), which gives equations (18) and (19):

    T(s_a) = p_a / (1 − p_a*(r mod (1/p_a)))   if s_a ∈ G''
    T(s_a) = 0                                  otherwise     (18)

    T(s_s) = p_s / (1 − p_s*(r mod (1/p_s)))   if s_s ∈ G'''
    T(s_s) = 0                                  otherwise     (19)

Route Selection Phase

Once all CHs are elected in a given round using the weighted election probabilities, each CH first estimates its energy residue E(CHR)_s and broadcasts this information, together with its CH role, to the neighboring nodes. The value of E(CHR)_s is calculated by equation (20):

    E(CHR)_s = E(CHrem)_s − E(BS)_s,   s ∈ G_c     (20)

where G_c is the set of elected CHs in the round, E(CHrem)_s is the remaining energy of CH s in the current round, and E(BS)_s is the communication energy dissipated from CH s to the BS. Each CH records the value of E(CHR)_s in an advertisement message and broadcasts the advertisement message to the rest of the nodes in the WSN. During the CH election phase, each non-CH node receives all advertisement messages and extracts the energy-residue data of the CHs from them. Moreover, each non-CH node also calculates its energy residue E(NCHR)_i with respect to every CH, given by equation (21):

    E(NCHR)_i = E(NCHrem)_i − E(CH)_{i,s},   i ∈ G_n     (21)

where G_n is the set of non-CH nodes, E(NCHrem)_i is the residual energy of non-CH node i in the current round, and E(CH)_{i,s} is the communication energy from non-CH node i to CH node s. Finally, each non-CH node associates with one of the existing CHs according to the maximum energy residue, given by equation (22).
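The association rule of equations (20)-(22) can be rendered as a small routine; this is our illustrative sketch (the data layout is an assumption, not the paper's), in which each non-CH node joins the CH that maximizes the sum of the two energy residues:

```python
def associate_clusters(non_ch_nodes, chs):
    """Assign each non-CH node to a CH by maximum energy residue.

    chs: {ch_id: (remaining_energy, cost_to_bs)}            -> E(CHR)_s
    non_ch_nodes: {node_id: (remaining_energy,
                             {ch_id: cost_to_ch})}          -> E(NCHR)_i
    Returns {node_id: chosen_ch_id}.
    """
    assignment = {}
    for node, (e_rem, cost_to) in non_ch_nodes.items():
        best = max(
            chs,
            key=lambda s: (chs[s][0] - chs[s][1])   # E(CHR)_s, eq. (20)
                        + (e_rem - cost_to[s]),     # E(NCHR)_i, eq. (21)
        )
        assignment[node] = best
    return assignment
```

The criterion steers traffic toward energy-rich, cheaply reachable CHs rather than always minimizing the immediate transmission cost, which is the design point EECDA makes against minimum-energy routing.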
Therefore, a path with the maximum sum of energy residues is selected for data transmission, instead of the path with minimum energy consumption:

    max{E(CHR)_s + E(NCHR)_i},   s ∈ G_c, i ∈ G_n     (22)

Data Communication Phase

In the data communication phase, each non-CH node transmits its data to the associated CH. Each CH receives all sensed data from its associated non-CH nodes and sends them to the BS.

4 Simulation Results and Discussion

To evaluate and compare the performance of EECDA with EEHCA, EDGA and LEACH in heterogeneous WSNs, we conducted simulations for two scenarios: first, a network with 100 nodes deployed over an area of 100 × 100 square meters, and second, a network with 200 nodes deployed over an area of 200 × 200 square meters. As shown in Figure 1 and Figure 2, we denote a normal node by (o), an advanced node by (+), a super node by (*) and the BS by (x). The simulation parameters are summarized in Table 1.

Figure 1: Random deployment of 100 nodes over an area of 100×100 m².
Figure 2: Random deployment of 200 nodes over an area of 200×200 m².

The performance metrics used for these protocols are: (i) network lifetime: the time interval from the start of operation until the first, respectively the last, node dies; (ii) stability period: the time interval from the start of operation until the death of the first node; (iii) instability period: the time interval from the death of the first node until the death of the last node; and (iv) number of alive nodes per round: an instantaneous measure of the total number of nodes per round that have not yet expended all of their energy. Figure 3 and Figure 4 show that both LEACH and EEHCA fail to take full advantage of heterogeneity in both scenarios: the first and last nodes die earlier than with EDGA and EECDA. Therefore, EECDA prolongs the network lifetime by 51%, 35% and 10%
when compared with the LEACH, EEHCA and EDGA protocols, respectively.

Table 1: Simulation parameters

    parameter      scenario I         scenario II
    network area   100 × 100 m²       200 × 200 m²
    BS location    (50, 50)           (100, 350)
    n              100                200
    E_DA           5 nJ/bit/report    5 nJ/bit/report
    packet size    50 bytes           50 bytes
    ε_mp           0.0013 pJ/bit/m⁴   0.0013 pJ/bit/m⁴
    ε_fs           10 pJ/bit/m²       10 pJ/bit/m²
    E_elec         50 nJ/bit          50 nJ/bit

Figures 3, 4, 5 and 6 show that the unstable region of EECDA is shorter than that of LEACH, EEHCA and EDGA, because with LEACH, EEHCA and EDGA the normal nodes die very fast in both scenarios, so the sensing field becomes sparse very quickly. On the other hand, the advanced and super nodes die very slowly, because they are not selected as CHs very often after the death of the normal nodes, which in turn affects the election process of CHs and makes the network unstable. It is quite evident that the stable region of EECDA is extended compared with LEACH, EEHCA and EDGA in both scenarios. Figure 5 and Figure 6 indicate that more nodes are alive per round with EECDA than with EDGA, EEHCA and LEACH, because in EECDA a path with the maximum sum of residual energy is selected for data transmission instead of the path with minimum energy. Figures 7 to 12 illustrate the residual energy of normal, advanced and super nodes under the heterogeneous settings of EECDA, EDGA, EEHCA and LEACH. All protocols start with the same initial energy, but it decreases faster over the rounds with EDGA, EEHCA and LEACH, so these protocols have less residual energy left after a certain number of rounds in both scenarios. The more residual energy remains, the more efficient the system is.

5 Conclusion

Most existing research considers homogeneous sensor networks. However, a homogeneous sensor network suffers from poor performance and scalability.
In this paper, we have developed a novel Energy Efficient Clustering and Data Aggregation (EECDA) protocol that improves the network performance by using some heterogeneous nodes in the network. A novel cluster-head election technique, together with the selection of a path with the maximum sum of residual energy for data transmission, maintains the balance of energy consumption in the network. Simulation results show that EECDA achieves better network lifetime, stability and energy efficiency than the EDGA, EEHCA and LEACH protocols. Future work includes more levels of hierarchy and some mobility in the network.

Figure 3: Network lifetime as a function of first and last dead nodes over an area of 100×100 m².
Figure 4: Network lifetime as a function of first and last dead nodes over an area of 200×200 m².
Figure 5: Stability as a function of the number of alive nodes per round over an area of 100×100 m².
Figure 6: Stability as a function of the number of alive nodes per round over an area of 200×200 m².
Figure 7: Residual energy of normal nodes per round over an area of 100×100 m².
Figure 8: Residual energy of normal nodes per round over an area of 200×200 m².
Figure 9: Residual energy of advanced nodes per round over an area of 100×100 m².
Figure 10: Residual energy of advanced nodes per round over an area of 200×200 m².
Figure 11: Residual energy of super nodes per round over an area of 100×100 m².
Figure 12: Residual energy of super nodes per round over an area of 200×200 m².

Bibliography

[1] R. Kumar, V. Tsiatsis, M.B. Srivastava, Computation hierarchy for in-network processing, Proceedings of the 2nd ACM International Workshop on Wireless Networks and Applications, San Diego, CA, 68-77, 2003.

[2] D. Kumar, T.C. Aseri, R.B.
Patel, HCEE: Hierarchical clustered energy efficient protocol for heterogeneous wireless sensor networks, International Journal of Electronics Engineering, 1(1): 123-126, 2009.

[3] J. Yu, P. Chong, A survey of clustering schemes for mobile ad hoc networks, IEEE Communication Surveys, 7(1): 32-48, 2005.

[4] N. Vlajic, D. Xia, Wireless sensor networks: to cluster or not to cluster?, Proceedings of the International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM'06), 259-268, 2006.

[5] W.S. Jang, W.M. Heley, M.J. Skibniewsk, Wireless sensor networks as a part of web based building environment monitoring system, Automation in Construction Journal, 17: 729-736, 2008.

[6] B. Krishnamachari, D. Estrin, S.B. Wicker, The impact of data aggregation in wireless sensor networks, Proceedings of the 22nd International Conference on Distributed Computing Systems Workshops (ICDCSW'02), 575-578, 2002.

[7] W.B. Heinzelman, A.P. Chandrakasan, H. Balakrishnan, An application-specific protocol architecture for wireless microsensor networks, IEEE Transactions on Wireless Communications, 1(4): 660-669, 2002.

[8] S. Lindsey, C. Raghavendra, PEGASIS: Power-efficient gathering in sensor information systems, Proceedings of the IEEE Aerospace Conference, 1-6, 2002.

[9] K. Dasgupta, K. Kalpakis, P. Namjoshi, An efficient clustering-based heuristic for data gathering and aggregation in sensor networks, Proceedings of Wireless Communication and Networking (WCNC'03), 3: 1948-1953, 2003.

[10] G. Smaragdakis, I. Matta, A. Bestavros, SEP: A stable election protocol for clustered heterogeneous wireless sensor networks, Proceedings of the 2nd International Workshop on Sensor and Actor Network Protocols and Applications (SANPA'04), Boston, MA, 660-670, 2004.

[11] V. Mhatre, C. Rosenberg, Homogeneous vs.
heterogeneous clustered sensor networks: a comparative study, Proceedings of the IEEE International Conference on Communications (ICC'04), 2004.

[12] M. Ye, C. Li, G. Chen, J. Wu, EECS: An energy efficient clustering scheme in wireless sensor networks, Proceedings of the 24th IEEE International Performance, Computing and Communications Conference (IPCCC'05), 535-540, 2005.

[13] D. Wei, H.A. Chan, Clustering ad hoc networks: schemes and classifications, Proceedings of the 3rd Annual IEEE Communication Society Conference on Sensor and Ad Hoc Communications and Networks (SECON'06), 3: 920-926, 2006.

[14] H. Yuning, Z. Yongbing, J. Yusheng, S. Xuemin, A new energy efficient approach by separating data collection and data report in wireless sensor networks, Proceedings of the International Conference on Communication and Mobile Computing (IWCMC'06), 1165-1170, 2006.

[15] H. Chen, S. Megerian, Cluster sizing and head selection for efficient data aggregation and routing in sensor networks, Proceedings of Wireless Communication and Networking (WCNC'06), 4: 2318-2323, 2006.

[16] S. Xun, A combinatorial algorithmic approach to energy efficient information collection in wireless sensor networks, ACM Transactions on Sensor Networks, 3(1), 2007.

[17] A.S. Zahmati, B. Abolhassani, A.A.B. Shirazi, A.S. Bakhtiari, An energy-efficient protocol with static clustering for wireless sensor networks, International Journal of Electronics, Circuits and Systems, 3(2): 135-138, 2007.

[18] X. Jianbo, H. Yong, L.I. Renfa, An energy-aware distributed clustering algorithm in wireless sensor networks, Proceedings of the International Conference on Computer Science and Software Engineering, 528-531, 2008.

[19] G. Xin, W.H. Yang, D. Degang, EEHCA: An energy-efficient clustering algorithm for wireless sensor networks, Information Technology Journal, 7(2): 245-252, 2008.

[20] Y. Mao, Z. Liu, L. Zhang, X.
Li, An effective data gathering scheme in heterogeneous energy wireless sensor networks, Proceedings of the International Conference on Computational Science and Engineering, 338-343, 2009.

[21] A. Bari, A. Jaekel, S. Bandyopadhyay, Clustering strategies for improving the lifetime of two-tiered sensor networks, Computer Communications (Elsevier), 31(14): 3451-3459, 2008.

[22] C. Song, M. Liu, J. Cao, Y. Zheng et al., Maximizing network lifetime based on transmission range adjustment in wireless sensor networks, Computer Communications (Elsevier), 32(11): 1316-1325, July 2009.

[23] H.Y. Shiue, J.X. Lieo-Hong, S. Horijuchi, Energy saving in wireless sensor networks, Journal of Communication and Computing, 6(5): 20-28, 2009.

[24] I. Dietrich, F. Dressler, On the lifetime of wireless sensor networks, ACM Transactions on Sensor Networks, 5(1): 5:1-5:39, 2009.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 4, pp. 384-394

Redistributing Fragments into a Distributed Database

Leon Ţâmbulea, Manuela Horvat-Petrescu

Abstract: The performance of a distributed database system is strongly related to the allocation of fragments to the nodes of the network. A heuristic algorithm for redistributing the fragments is proposed. The algorithm uses statistical information about the requests sent to the distributed database, and it minimizes the size of the data transferred to solve a request. Assuming that a distribution of the fragments in the nodes of the network is known, the algorithm generates a plan for transferring data fragments, a plan that will be used to evaluate a request.
Keywords: distributed database, fragment allocation, allocation algorithm, transfer cost, heuristic algorithm, redistribution algorithm.

1 Introduction

Let us consider a distributed database C, formed by n nodes (sites) S_i, 0 <= i

An LMI technique for the global stabilization of nonlinear polynomial systems

A > 0 (A ≥ 0): symmetric positive definite (semi-definite) matrix; e_k^q: the q-dimensional unit vector which has 1 in the k-th element and zeros elsewhere.

(ii) The relation between the redundant and the non-redundant Kronecker powers of the state vector x can be stated as follows:

    ∀i ∈ N, ∃! T_i ∈ R^(n^i × ñ_i): x^[i] = T_i x̃^[i]     (5)

A procedure for the determination of the matrix T_i is given in [25].

(iii) The permutation matrix denoted U_{n×m} is defined as

    U_{n×m} = Σ_{i=1}^{n} Σ_{k=1}^{m} (e_i^n · (e_k^m)^T) ⊗ (e_k^m · (e_i^n)^T)     (6)

This matrix is square (nm × nm) and has precisely a single 1 in each row and in each column. Among the main properties of this matrix presented in [17], [11], we recall the following useful ones:

    (B ⊗ A) = U_{r×p} (A ⊗ B) U_{q×s}     (7)

    (X ⊗ Y) = U_{n×m} (Y ⊗ X),   X ∈ R^n, Y ∈ R^m     (8)

    ∀i ≤ k: x^[k] = U_{n^i × n^{k−i}} x^[k]     (9)

(iv) An important vector-valued function of a matrix, denoted vec(.), was defined in [17] as follows: for A = [C_1 C_2 ... C_q] ∈ R^(p×q), where C_i ∈ R^p, ∀i ∈ {1, ..., q}, are the columns of A,

    vec(A) = [C_1^T C_2^T ... C_q^T]^T ∈ R^(pq)

We recall the following useful rules [17] of this function:

    vec(BAC) = (C^T ⊗ B) vec(A)     (10)

    vec(A^T) = U_{p×q} vec(A)     (11)

(v) A special function mat_{(n,m)}(.) can be defined as follows: if V is a vector of dimension p = n·m, then M = mat_{(n,m)}(V) is the (n × m) matrix verifying V = vec(M).

(vi) For a polynomial vectorial function

    a(x) = Σ_{i=1}^{r} A_i x^[i]     (12)

where x ∈ R^n and the A_i are (n × n^i) constant matrices, we define the (υ × υ) matrix M(a) as the block upper-bidiagonal matrix

    M(a) = [ M_{1,1}(A_1)   M_{1,2}(A_2)
                            M_{2,2}(A_3)   M_{2,3}(A_4)
                                           ...
                                           M_{s−1,s−1}(A_{2s−3})   M_{s−1,s}(A_{2s−2})
                                                                   M_{s,s}(A_{2s−1}) ]     (13)

M. Moez Belhaouane, R. Mtar, H. Belkhiria Ayadi, N.
benhadj braiek with υ = n + n + ... + ns. • for j = , ..., s m j, j(a j−) =   mat(n j−,n j ) ( at j− ) mat(n j−,n j ) ( at j− ) ... mat(n j−,n j ) ( ant j− )   , (14) • for j = , ..., s −  m j, j+(a j) =   mat(n j−,n j ) ( at j ) mat(n j−,n j ) ( at j ) ... mat(n j−,n j ) ( ant j )   , (15) where aik is the i th row of the matrix ak ak = [ atk a t k ... a nt k ]t . (16) (vii) we introduce the matrix r defined by r = τ +[] ·u·h ·τ, (17) where τ = diag ( ti,i=,...,s ) , (18) with τ + is the moore-penrose pseudo-inverse of τ and τ +[]  = τ +  ⊗τ + . τ = diag ( tj, j=,...,s ) , (19) u = diag ( uni,i=,...,s×η ) , (20) h =   iη  η×η iη η×(η+η) iη ... . . . ηs×(η+η+...+ηs−) iηs   , (21) for j = , ....., s : η j = n j · ( s∑ i= ni ) . we note γ the matrix defined by γ = (iη  + uη×η ) ( r+t rt − iη  ) , (22) an lmi technique for the global stabilization of nonlinear polynomial systems 339 with η = s∑ j= n j = s∑ j= ( n + j −  j ) and r+ is the moore-penrose pseudo-inverse of r. β = rank(γ ) (23) and ci, i=,...,β are β linearly independent columns of γ . (iix) for a (n×l) matrix φ , we define ds(φ ) the (υ ×υ ) matrix defined as ds(φ ) =   φ  φ ⊗in . . .  φ ⊗ins−   . (24) in the case where the matrix φ is square (l = n), the matrix ds(φ ) is also square (υ × υ), with υ is defined in (vi). as well, if φ is square and is symmetric positive definite, then so is ds(φ ). 3 the proposed global stabilization condition of controlled polynomial systems we consider the polynomial nonlinear systems defined by the equation (1). our purpose is to determine a polynomial feedback control law u = k(x ) = r∑ i= kix [i], (25) with ki,i=,...,r are constant gains matrices which stabilizes asymptotically and globally the equilibrium (x = ) of the considered system. 
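the control law (25) is a vector polynomial in the kronecker powers of the state. a minimal python sketch of its evaluation, with made-up gain matrices (the paper's gains are computed later by the lmi procedure):

```python
# Sketch of evaluating the polynomial state feedback (25),
# u = K1 x + K2 x^[2] + ... + Kr x^[r], where x^[i] is the i-th Kronecker
# power of the state vector. The gain matrices below are illustrative only.

def kron_vec(a, b):
    return [u * v for u in a for v in b]

def apply_gain(K, v):
    return [sum(K[r][c] * v[c] for c in range(len(v))) for r in range(len(K))]

def control(gains, x):
    power = list(x)                      # current Kronecker power x^[i]
    u = [0.0] * len(gains[0])
    for K in gains:                      # gains = [K1, K2, ..., Kr]
        u = [a + b for a, b in zip(u, apply_gain(K, power))]
        power = kron_vec(power, x)       # build x^[i+1]
    return u

K1 = [[-1.0, 0.5]]                       # 1 x n,   acts on x
K2 = [[0.0, 1.0, 0.0, -2.0]]             # 1 x n^2, acts on x^[2]
print(control([K1, K2], [2.0, 1.0]))     # [-1.5]
```

note that x^[2] = (x1^2, x1 x2, x2 x1, x2^2) carries the duplicated cross term; this redundancy is exactly what the matrices t_i of (5) remove.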
applying this control law to the open-loop system (1), one obtains the closed-loop system ẋ = a(x) = (f + gk)(x) = Σ_{i=1}^{r} a_i x^[i], (26) where a_i = f_i + g k_i. (27) using a quadratic lyapunov function v(x) and computing its derivative v̇(x) leads to the following sufficient condition for the global asymptotic stabilization of the polynomial system.
theorem 1. the nonlinear polynomial system defined by equation (1) is globally stabilized by the control law (25) if there exist
• an (n × n) symmetric positive definite matrix p;
• arbitrary parameters µ_i ∈ r, i = 1, ..., β;
• gain matrices k_i, i = 1, ..., r;
such that the (η × η) symmetric matrix q defined by
q = τ^t [ds(p)m(f) + m(f)^t ds(p)]τ + τ^t [ds(p)gm(k) + (ds(p)gm(k))^t]τ + Σ_{i=1}^{β} µ_i mat_{(η,η)}(c_i) (28)
is negative definite, where β and the c_i, i = 1, ..., β, are defined in (23).
proof. consider the quadratic lyapunov function v(x) = x^t p x. (29) differentiating v(x) along the trajectories of system (26), one obtains
v̇(x) = Σ_{k=1}^{r} (x^t p a_k x^[k] + x^[k]^t a_k^t p x) = 2 Σ_{k=1}^{r} x^t p a_k x^[k]. (30)
using rule (10) of the vec-function, relation (30) can be written as v̇(x) = 2 Σ_{k=1}^{r} v_k^t x^[k+1], (31) where v_k = vec(p a_k). (32) with r = 2s−1 we can write
Σ_{k=1}^{2s−1} v_k^t x^[k+1] = Σ_{j=1}^{s−1} v_{2j}^t x^[2j+1] + Σ_{j=1}^{s} v_{2j−1}^t x^[2j], (33)
and using the mat-function defined in section 2, one has
v̇(x) = 2 [ Σ_{j=1}^{s−1} x^[j]^t mat_{(n^j, n^{j+1})}(v_{2j}) x^[j+1] + Σ_{j=1}^{s} x^[j]^t mat_{(n^j, n^j)}(v_{2j−1}) x^[j] ]. (34)
applying the following lemma [11]:
lemma 2. consider an (n × n^k) matrix a (k ∈ N) and an (n × n) matrix p. let i and j be two integers verifying i + j = k + 1 and i ≥ 1. then mat_{(n^i, n^j)}(vec(pa)) = u_{n^{i−1}×n}(p ⊗ i_{n^{i−1}}) · m, with m the stacked matrix
m = [ mat_{(n^{i−1}, n^j)}((a^1)^t); mat_{(n^{i−1}, n^j)}((a^2)^t); ...; mat_{(n^{i−1}, n^j)}((a^n)^t) ],
where a^i denotes the i-th row of the matrix a, a = [ (a^1)^t (a^2)^t ...
ant ]t an lmi technique for the global stabilization of nonlinear polynomial systems 341 leads to the following relations mat(n j ,n j+)(v t  j) = un j−×n(p⊗in j− )m j, j+(a j), (35) mat(n j ,n j )(v t  j−) = un j−×n(p⊗in j− )m j, j(a j−), (36) where m j, j+(a j) and m j, j(a j−) are defined respectively in (15) and (14) and un j−×n is mentioned in (20). using the results (35) and (36), the equality (34) can be expressed as v̇ (x ) = [ s−∑ j= x [ j] t un j−×n(p⊗in j− )m j, j+(a j)x [ j+] + s∑ j= x [ j] t un j−×n(p⊗in j− )m j, j(a j−)x [ j]], (37) by means of the relation (9), one obtains v̇ (x ) = [ s−∑ j= x [ j] t (p⊗in j− )m j, j+(a j)x [ j+] + s∑ j= x [ j] t (p⊗in j− )m j, j(a j−)x [ j]]. (38) consequently, we obtain v̇ (x ) = xt ds(p)m(a)x, = xt (ds(p)m(a) + m(a)t ds(p))x, (39) with x = [ x t x [] t ··· x [s]t ]t (40) ds(p) and m(a) are defined respectively in (24) and (13). using the nun-redundant kronecker product power form, the vector x can be written as x = τx̃, (41) where x̃ = [ x̃ t x̃ [] t ··· x̃ [s]t ]t ∈rη , η = s∑ j= n j and τ is defined in (18). then v̇ (x ) can be written in the following form v̇ (x ) = x̃t τ t (ds(p)m(a) + m(a) t ds(p))τx̃, (42) a sufficient condition of the global asymptotic stability of the equilibrium (x = ) is that the quadratic form v̇ (x ) is negative definite. this condition can be ensured if there exists a symmetric negative definite q ∈rη×η such that x̃t τ t (ds(p)m(a) + m(a) t ds(p))τx̃ = x̃t qx̃, (43) using the vec-function, the equality (43) can be expressed as vect ( q − τ t (ds(p)m(a) + m(a) t ds(p))τ ) x̃[] = . (44) but, it can be easily checked that x̃[] can be written as x̃[] = rx̃, (45) 342 m. moez belhaouane, r. mtar, h. belkhiria ayadi, n. benhadj braiek where x̃ = [ x̃[]t . . . x̃[s+]t x̃[s+]t . . . x̃[s]t ]t , (46) and r is the matrix defined in (17). the proof of the relation (45) is given in [11]. 
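the change of variables x = τ x̃ used above rests on relation (5), x^[i] = t_i x̃^[i]. a small pure-python check for n = 2, i = 2, where the hand-built t_2 duplicates the cross term x1 x2:

```python
# Check of relation (5) for n = 2, i = 2: the redundant power
# x^[2] = (x1^2, x1*x2, x2*x1, x2^2) repeats the cross term, and a
# hand-built T_2 recovers it from the non-redundant x~^[2].

def kron_power(x, i):
    out = [1.0]
    for _ in range(i):
        out = [a * b for a in out for b in x]
    return out

T2 = [[1, 0, 0],
      [0, 1, 0],
      [0, 1, 0],
      [0, 0, 1]]

x = [3.0, -2.0]
x2 = kron_power(x, 2)                            # redundant, n^2 = 4 entries
x2_nr = [x[0] * x[0], x[0] * x[1], x[1] * x[1]]  # non-redundant, 3 entries
rebuilt = [sum(T2[i][j] * x2_nr[j] for j in range(3)) for i in range(4)]
print(rebuilt == x2)  # True
```

the dimension drop (4 → 3 here, n^i → C(n+i−1, i) in general) is what makes the non-redundant form attractive for the lmi formulation.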
therefore the equality (44) yields the following equation rt vec(s) = , (47) with: s = q − τ t ( ds(p)m(a) + m(a)t ds(p) ) τ. the η -vector vec(s) solution of (47) can be expressed as vec(s) = ( r+t rt − iη  ) y, (48) where y is an arbitrary vector of rη  . the matrix s is symmetric since q is symmetric, then we can write s =   (s + st ), (49) and using the property (11) yields vec(s) =   (iη  + uη×η )vec(s) = β∑ i= µici, (50) where • β = rank [ (iη  + uη×η ) ( r+t rt − iη  )] , • ci,i=,...,β are β linearly independent columns of (iη  + uη×η ) ( r+t rt − iη  ) , (51) • µi,i=,...,β are arbitrary values. consequently, the symmetric matrix q verifying (47) is of the following form q = τ t (ds(p)m(a) + m(a) t ds(p))τ + β∑ i= µimat(η,η)(ci). (52) according to (26) and the following lemma [10] lemma 3. let g ∈ rn×m, k(.) a polynomial vectorial function defined in (25) and g.k(.) the resultant product of g by k(.), then one has m(g.k) = gm(k), where g = ds(g) and m(.) the matrix function defined in (13). thus, we can write m(a) = m( f + gk) = m( f ) + gm(k), (53) finally, we obtain the following quadratic form of the symmetric matrix q q = τ t [ds(p)m( f ) + m( f ) t ds(p)]τ + τ t [ds(p)gm(k) + m(k) t gt ds(p)]τ + β∑ i= µimat(η,η)(ci). (54) if q is negative definite, then the derivative v̇ (x ) is negative definite. which ends the proof. an lmi technique for the global stabilization of nonlinear polynomial systems 343 4 stabilizing control synthesis using the lmi approach in this section we show how the stabilization problem stated by the theorem 1 can be formulated as an lmi feasibility problem. let recall that our main problem is to find • gain matrices ki,i=,...,r; • a (n×n) matrix p; • real parameters µi,i=,...,β ; such that p > , (55) τ t [ds(p)m( f ) + m( f ) t ds(p)]τ + τ t [ds(p)gm(k) + m(k) t gt ds(p)]τ + β∑ i= µimat(η,η)(ci) < . 
(56) note that this problem is nonlinear with respect to the unknown parameters p, k_i and µ_i, since the inequality (56) is bilinear in (p, k_i). to overcome this difficulty we make use of the well-known schur complement [18] and exploit the separation lemma [26]; in the sequel we transform the bmi problem into an lmi problem, as shown in the following development. making use of the following separation lemma [26]:
lemma 4. for any matrices a and b with appropriate dimensions and for any positive scalar ε > 0, one has a^t b + b^t a ≤ ε a^t a + ε^{-1} b^t b,
one obtains
q ≤ τ^t [ds(p)m(f) + m(f)^t ds(p)]τ + Σ_{i=1}^{β} µ_i mat_{(η,η)}(c_i) + γ τ^t ds(p)^t ds(p)τ + γ^{-1} τ^t m(k)^t g^t g m(k)τ, (57)
with γ > 0. then, to ensure that the matrix q is negative definite, it is sufficient to have
τ^t [ds(p)m(f) + m(f)^t ds(p)]τ + Σ_{i=1}^{β} µ_i mat_{(η,η)}(c_i) − τ^t ds(p)^t (−γ i) ds(p)τ − τ^t m(k)^t g^t (−γ^{-1} i) g m(k)τ < 0. (58)
using the generalized schur complement, inequality (58) is equivalent to
[ τ^t (ds(p)m(f) + m(f)^t ds(p))τ + Σ_{i=1}^{β} µ_i mat_{(η,η)}(c_i)   (ds(p)τ)^t   (g m(k)τ)^t
  ds(p)τ   −γ^{-1} i   0
  g m(k)τ   0   −γ i ] < 0. (59)
pre- and post-multiplying inequality (59) by ξ = diag(i, i, γ^{-1} i), we get
[ τ^t (ds(p)m(f) + m(f)^t ds(p))τ + Σ_{i=1}^{β} µ_i mat_{(η,η)}(c_i)   (ds(p)τ)^t   (g w(k)τ)^t
  ds(p)τ   −γ^{-1} i   0
  g w(k)τ   0   −γ^{-1} i ] < 0, (60)
with w(k) = γ^{-1} m(k). this new inequality is linear in the decision variables, and we can then state the following result.
theorem 5. the equilibrium (x = 0) of system (1) is globally asymptotically stabilizable if there exist
• an (n × n) symmetric positive definite matrix p;
• arbitrary parameters µ_i ∈ r, i = 1, ..., β;
• gain matrices k_i, i = 1, ..., r;
• a real γ > 0;
such that p > 0 (61) and
[ τ^t (ds(p)m(f) + m(f)^t ds(p))τ + Σ_{i=1}^{β} µ_i mat_{(η,η)}(c_i)   (ds(p)τ)^t   (g w(k)τ)^t
  ds(p)τ   −γ^{-1} i   0
  g w(k)τ   0   −γ^{-1} i ] < 0.
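lemma 4 can be verified numerically: the gap γ a^t a + γ^{-1} b^t b − a^t b − b^t a is the gram matrix of γ^{1/2} a − γ^{-1/2} b, hence positive semi-definite. a pure-python check with sample matrices:

```python
# Numeric check of the separation lemma (lemma 4): for gamma > 0 the gap
# gamma*A^T A + (1/gamma)*B^T B - A^T B - B^T A equals
# (sqrt(gamma)*A - B/sqrt(gamma))^T (sqrt(gamma)*A - B/sqrt(gamma)),
# i.e. a Gram matrix, hence positive semi-definite. Sample matrices only.

import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(r) for r in zip(*A)]

gamma = 2.0
A = [[1.0, 2.0], [0.0, 1.0]]
B = [[3.0, -1.0], [1.0, 4.0]]

AtA, BtB = matmul(transpose(A), A), matmul(transpose(B), B)
AtB, BtA = matmul(transpose(A), B), matmul(transpose(B), A)
gap = [[gamma * AtA[i][j] + BtB[i][j] / gamma - AtB[i][j] - BtA[i][j]
        for j in range(2)] for i in range(2)]

s = math.sqrt(gamma)
D = [[s * A[i][j] - B[i][j] / s for j in range(2)] for i in range(2)]
DtD = matmul(transpose(D), D)

print(all(abs(gap[i][j] - DtD[i][j]) < 1e-9
          for i in range(2) for j in range(2)))  # True
```

this is why the free scalar γ appears in (57)-(60): it trades off the two quadratic terms that replace the bilinear cross term.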
(62) thus, a stabilizing control law (25) for the considered polynomial system (1) can be characterized by applying the following procedure:
1. solve the lmi feasibility problem, i.e., find the matrices ds(p), w(k) and the parameters µ_i and γ such that the inequalities (61) and (62) are verified.
2. extract the gain matrices k_i from the relation m(k) = γ w(k).
this optimization problem can be carried out using matlab software. to demonstrate the effectiveness of the proposed approach, we consider the following numerical example.
5 illustrative example
our aim in this section is to apply the proposed approach to the global stabilization of the following polynomial system
{ ẋ = −x + x + x + xx − x + xx − xx + x, ẋ = −x + .x − x − .xx − x − xx + .xx − x + u. (63)
using the kronecker product, this system can be described by the compact state equation ẋ = f1 x + f2 x^[2] + f3 x^[3] + g u, (64) with f1 = [ − − . ], f2 = [ − −. ], f3 = [ − − − . − ] and g = [ ].
we are interested in the stabilization of the origin equilibrium (x = 0) of the system (64). note that the uncontrolled (u = 0) nonlinear system is unstable, since the matrix f1 has an unstable eigenvalue.
solving the optimization problem formulated by theorem 5, we obtain µ1 = −., µ2 = ., µ3 = .; p = [ . . . . ]; γ = ..
the searched gain matrices, extracted from m(k), are given by k1 = [ −. −. ], k2 = [ . . ], k3 = [ −. −. −. −. . −. . . ].
a globally stabilizing control law can thus be characterized for the studied system using the method developed above; it can be expressed as u = k1 x + k2 x^[2] + k3 x^[3].
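once the gains are extracted, the outcome of step 2 can be checked by simulating the closed loop. the scalar system and feedback below are hypothetical (the paper's numerical values are not reproduced here): the open loop ẋ = x + x³ is unstable, while u = −3x − x³ yields the closed loop ẋ = −2x, which decays to the origin.

```python
# Hedged illustration of the procedure's outcome on a made-up scalar
# system (not the paper's example): open loop x' = x + x**3 is unstable;
# the hypothetical polynomial feedback u = -3*x - x**3 gives x' = -2*x.

def simulate(x0, t_end=5.0, dt=1e-3):
    x = x0
    for _ in range(int(t_end / dt)):
        u = -3.0 * x - x ** 3        # polynomial state feedback (illustrative)
        x += dt * (x + x ** 3 + u)   # forward Euler on the closed loop
    return x

print(abs(simulate(1.5)) < 1e-3)  # True: the state converges to the origin
```

in the paper's setting the same check is done in matlab with the gains returned by the lmi solver.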
(65) figure 1 shows the behavior of the state variables x1(t) and x2(t) of the controlled system, starting from initial conditions taken sufficiently far from the origin. the state variables converge to the origin, which confirms the asymptotic stability of the controlled system.
figure 1: closed-loop responses of the system (64) with the control law (65) (time t(s) on the horizontal axis; state variables x1 and x2 on the vertical axis).
6 conclusion
in this paper, an original technique has been proposed for the global asymptotic stabilization of nonlinear polynomial systems. this stabilizing approach is based on the lyapunov direct method and on algebraic developments using the kronecker product properties. these developments allow the stabilization condition of the system to be formulated as an lmi feasibility problem, whose resolution leads to a polynomial control law ensuring quadratic stability of the considered system in the whole state space. future work will consider the extension of these results to the robust stabilization of uncertain polynomial systems.
bibliography
[1] j. c. doyle, “a review of µ for case studies in robust control,” ifac, pp. 395–402, 1987.
[2] f. lescher, j. y. zhao, and p. borne, “switching lpv controllers for a variable speed pitch regulated wind turbine,” international journal of computers, communications and control (ijccc), vol. 1, no. 4, pp. 73–84, 2006.
[3] a. isidori, nonlinear control systems. springer-verlag, 1995.
[4] p. borne, j. p. richard, and n. e. radhy, “stability, stabilization, regulation using vector norms,” nonlinear systems, vol. 2, stability and stabilization, chapter 2, pp. 45–90, 1996.
[5] p. borne, g. dauphin-tanguy, j. p. richard, f. rotella, and i.
zambettakis, commande et optimisation des processus. technip, 1998. [6] y. blanco, w. perruquetti, and p. borne, “stability and stabilization of nonlinear systems and tanaka-sugeno fuzzy models,” european control conference ecc’01, 2001. [7] n. benhadj braiek, “on the global stability of nonlinear polynomial systems,” ieee conference on decision and control,cdc’96, december 1996. [8] n. benhadj braiek and f. rotella, “design of observers for nonlinear time variant systems,” ieee syst. man and cybernetics conference, vol. 4, pp. 219–225, 1993. [9] n. benhadj braiek and f. rotella, “state observer design for analytical nonlinear systems,” ieee syst. man and cybernetics conference, vol. 3, pp. 2045–2050, october 1994. [10] n. benhadj braiek and f. rotella, “stabilization of nonlinear systems using a kronecker product approach,” european control conference ecc’95, pp. 2304–2309, september 1995. [11] n. benhadj braiek, f. rotella, and m. benrejeb, “algebraic criteria for global stability analysis of nonlinear systems,” journal of systems analysis modelling and simulation, gordon and breach science publishers, vol. 17, pp. 221–227, 1995. [12] n. benhadj braiek, “feedback stabilization and stability domain estimation of nonlinear systems,” journal of the franklin institute, vol. 332, no. 2, pp. 183–193, march 1995. [13] f. rotella and g. tanguy, “non linear systems: identification and optimal control,” int.j.control, vol. 48, no. 2, pp. 525–544, 1988. [14] h. bouzaouache and n. benhadj braiek, “on guaranteed global exponential stability of polynomial singularly perturbed control systems,” international journal of computers, communications and control (ijccc), vol. 1, no. 4, pp. 21–34, 2006. [15] h. bouzaouache and n. benhadj braiek, “on the stability analysis of nonlinear systems using polynomial lyapunov functions,” mathematics and computers in simulation, vol. 76, no. 5-6, pp. 316–329, 2008. [16] h. b. ayadi and n. 
benhadj braiek, “robust control of nonlinear polynomial systems,” ieee syst. man and cybernetics conference, smc’02, 6-9 october 2002.
[17] j. brewer, “kronecker products and matrix calculus in system theory,” ieee trans. circuits and systems, vol. cas-25, pp. 772–781, 1978.
[18] s. boyd, l. el ghaoui, e. feron, and v. balakrishnan, “linear matrix inequalities in system and control theory,” siam, 1994.
[19] j. g. vanantwerp and r. d. braatz, “a tutorial on linear and bilinear matrix inequalities,” journal of process control, vol. 10, pp. 363–385, 2000.
[20] d. rosinova and v. vesely, “robust pid decentralized controller design using lmi,” international journal of computers, communications and control (ijccc), may 2007.
[21] a. nemirovsky and p. gahinet, “the projective method for solving linear matrix inequalities,” in proc. american control conf., 1994.
[22] s. boyd and l. vandenberghe, “introduction to convex optimization with engineering applications,” lecture notes, information systems laboratory, 1995.
[23] c. scherer, p. gahinet, and m. chilali, “multiobjective output-feedback control via lmi optimization,” ieee transactions on automatic control, vol. 42, no. 7, pp. 896–910, july 1997.
[24] y. nesterov and a. nemirovsky, “interior point polynomial methods in convex programming: theory and applications,” siam, pp. 101–127, 1994.
[25] n. benhadj braiek and f. rotella, “logid: a nonlinear systems identification software,” modelling and simulation of systems, scientific publishing co., pp. 211–218, 1990.
[26] k. zhou and p. khargonekar, “robust stabilization of linear systems with norm-bounded time-varying uncertainty,” syst. contr. letters, vol. 10, pp. 17–20, 1988.
mohamed moez belhaouane was born in tunis in 1980. he received the bachelor degree in electrical engineering and the master degree in automatic control from école supérieure des sciences et techniques de tunis (esstt), in 2003 and 2005, respectively.
currently, he is a research member of the processes study and automatic control laboratory (lecap) at the polytechnic school of tunisia, and he is working toward the ph.d. degree in electrical engineering. his research interests include the analysis and control of nonlinear systems, polynomial systems, and robust control.
riadh mtar was born in tunis in 1974. he received his master degree in systems analysis and numerical treatment from école nationale d’ingénieurs de tunis (enit) in 2005. he is currently a contractual assistant at école supérieure de commerce (esc) and a research member of the processes study and automatic control laboratory (lecap) at the polytechnic school of tunisia. he is preparing his ph.d. dissertation in electrical engineering at école nationale d’ingénieurs de tunis (enit). his research interests include the analysis and control of nonlinear polynomial discrete systems.
hela belkhiria ayadi received her engineering diploma from the école nationale d’ingénieurs de monastir (enim) in 1997 and her master degree in automatic control from école supérieure des sciences et techniques de tunis (esstt) in 1998. she completed her ph.d. in electrical engineering in 2004 at école nationale d’ingénieurs de tunis (enit). she is now an assistant professor at école supérieure des sciences et techniques de tunis (esstt) and a research member of the processes study and automatic control laboratory (lecap) at the polytechnic school of tunisia. her research interests include the robust analysis and control of nonlinear uncertain systems.
naceur benhadj braiek was born in mahdia, tunisia, in 1963. he obtained the master of electrical engineering and the master of systems analysis and numerical processing, both from école nationale d’ingénieurs de tunis (enit) in 1987, the master of automatic control from institut industriel du nord (école centrale de lille) in 1988, the ph.d.
degree in automatic control from université des sciences et techniques de lille, france, in 1990, and the doctorat d’état in electrical engineering in 1995 from école nationale d’ingénieurs de tunis (enit). he is now professor of electrical engineering at the university of tunis, école supérieure des sciences et techniques de tunis (esstt). he is also director of the processes study and automatic control laboratory (lecap) at the polytechnic school of tunisia. his domain of interest is the modeling, analysis and control of nonlinear systems, with applications to electrical processes.
international journal of computers, communications & control vol. i (2006), no. 4, pp. 21-34
on guaranteed global exponential stability of polynomial singularly perturbed control systems
hajer bouzaouache, naceur benhadj braiek
abstract: the problem of global exponential stability for a class of nonlinear singularly perturbed systems is examined in this paper. the stability analysis is based on basic results on integral manifolds of nonlinear singularly perturbed systems, the composite lyapunov method, and the notation and properties of tensorial algebra. some of the derived results are presented as linear matrix inequality (lmi) feasibility tests. moreover, we point out that establishing the global exponential stability of the reduced-order subsystem is equivalent to guaranteeing the global exponential stability of the original high-order closed-loop system. an upper bound ε1 on the small parameter ε can also be determined, up to which the established lmi stability conditions remain verified. a numerical example is given to illustrate the proposed approach.
keywords: nonlinear singularly perturbed system, integral manifold, lyapunov stability, kronecker product, linear matrix inequalities (lmis).
1 introduction
stability analysis and control of nonlinear singularly perturbed systems have been widely studied in the literature [2], [6], [7], [12], [13]. in a two-time-scale framework, the stability study of controlled systems using the lyapunov stability method [15], and the integral manifold approach as a means for the control of nonlinear systems based on the singular perturbation method, have been developed in recent years [10], [11], [14], [16], [17]. the approaches proposed in this direction differ by imposing different conditions on the smoothness properties of the functions used, different assumptions, and different classes of lyapunov functions. in this paper, we are concerned with the global exponential stability of polynomial singularly perturbed systems when the chosen design manifold is an exact integral one. further extensions of some previous results [11], [17] are suggested, leading to effective global exponential stability conditions via lmis [8] which can easily be verified using the lmi toolbox of matlab. the contribution of the present paper is based, on the one hand, on the use of the lyapunov method, a powerful tool for combined controller design and stability analysis, and on the definition of appropriate lyapunov functions for the reduced systems and for the corrected system via the integral manifold approach; and, on the other hand, on the notation and properties of the tensorial product [9]. our paper is organized as follows: in section 2 we present the considered description of the studied systems, which allows important algebraic manipulations, together with some results from the literature on integral manifolds for nonlinear singularly perturbed systems. some useful notations and needed assumptions are introduced in section 3.
exploiting the stability statements about singularly perturbed systems possessing integral manifolds and using the composite lyapunov technique, we propose in section 4 an appropriate control law that ensures the existence of an attractive integral manifold and, furthermore, ensures stability of the studied systems when the dynamics are restricted to the integral manifold. the stability results proving the global exponential stability of polynomial singularly perturbed systems are also given and presented as linear matrix inequality feasibility tests. finally, an illustrative example is treated and some conclusions are drawn.
copyright c© 2006 by ccc publications
22 hajer bouzaouache, naceur benhadj braiek
2 studied systems and integral manifolds
the class of systems to be considered in this paper is described by the following state equations:
{ ẋ = f(x, z) (a)
  ε ż = g(x, z) + l(x, z)u (b) } (1)
where x ∈ r^{n1} is the state of the slow subsystem (1-a), z ∈ r^{n2} is the state of the fast subsystem, and u ∈ r^p is the input control. ε is a small positive parameter. f, g and l are analytic vector fields which are sufficiently many times continuously differentiable functions of their arguments. using the kronecker powers of vectors, these functions can be written in the polynomial form [7]:
f(x, z, ε) = Σ_{i=1}^{r} Σ_{j=1}^{i+1} f_{ij} x^[i+1−j] ⊗ z^[j−1]
g(x, z) = Σ_{i=1}^{r} Σ_{j=1}^{i+1} g_{ij} x^[i+1−j] ⊗ z^[j−1]
l(x, z) = Σ_{i=1}^{r} Σ_{j=1}^{i+1} l_{ij} (i_m ⊗ (x^[i+1−j] ⊗ z^[j−1])) (2)
in general, the stability of the reduced-order subsystems of a class of nonlinear singularly perturbed systems cannot guarantee the stability of the original full-order system, even with the additional stability of the boundary-layer subsystem; but when an attractive manifold is designed, the stability problem of the original system reduces to a stability problem of a low-dimensional system on the manifold.
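the polynomial form (2) can be evaluated mechanically by building the monomial vector x^[p] ⊗ z^[q] for each term. a short python sketch with hypothetical coefficient matrices f_ij (scalar slow and fast states, for brevity):

```python
# Sketch of evaluating the polynomial form (2) with Kronecker powers:
# f(x, z) = sum_i sum_j F_ij (x^[i+1-j] (x) z^[j-1]). The coefficient
# matrices below are hypothetical; the point is only the mechanics of
# building the monomial vector x^[p] (x) z^[q].

def kron_vec(a, b):
    return [u * v for u in a for v in b]

def kron_power(v, p):
    out = [1.0]
    for _ in range(p):
        out = kron_vec(out, v)
    return out

def eval_field(F, x, z):
    """F maps (i, j) -> coefficient matrix F_ij (list of rows)."""
    n = len(x)
    f = [0.0] * n
    for (i, j), Fij in F.items():
        mono = kron_vec(kron_power(x, i + 1 - j), kron_power(z, j - 1))
        for r in range(n):
            f[r] += sum(Fij[r][c] * mono[c] for c in range(len(mono)))
    return f

# hypothetical f(x, z) = F11 x + F12 z + F22 (x (x) z) for n1 = n2 = 1:
F = {(1, 1): [[-1.0]], (1, 2): [[0.5]], (2, 2): [[2.0]]}
print(eval_field(F, [2.0], [3.0]))  # [-2.0 + 1.5 + 12.0] = [11.5]
```

the same loop evaluates g and (with an extra i_m factor) l; only the coefficient matrices change.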
subsequently, in the context of control system design, our goal is to find an appropriate control law that insures the existence of an attractive integral manifold and furthermore insures stability of the studied systems (1) when the dynamics are restrictive to the integral manifold. the basic ideas of exploiting the integral manifold method are: – if an integral manifold σ of systems described by (1) is established, so that if the initial states start on σ, the trajectory of the system remains on σ thereafter. – when restricted to the integral manifold σ, the dynamics of the system should insure stability of the equilibrium. – the integral manifold should be attractive so that if the initial conditions are off σ, the solution trajectory asymptotically converges to σ. according to these important issues of the integral manifold method, let’s present the definition, and the properties of integral manifold of nonlinear systems. definition 1. [16] the set σ ⊂ r×rn is said to be an integral manifold (invariant manifold) for the differential equation: ẋ = n(t, x ), x , n ∈rn if for (t0, x0) ∈ σ, the solution (t, x (t)), x (t0) = x0, is in σ for t ∈r. if (t, x (t)) ∈ σ for only a finite interval of time, then σ is said to be a local integral manifold. lemma 1. [11] consider the following system: { ẋ = f (t, x, y, ε) ε ẏ = g(t, x, y, ε) (3) x, f ∈rn, y, g ∈rm,t ∈r, ε a small parameter. and suppose the following hypotheses hold: – the algebraic equation g(t, x, y, 0) = 0 has an isolated solution y = h0(t, x), ∀t ∈r, ∀x ∈ bx – the functions f , g, and h0 are twice continuously differentiable (∈c2) ∀t ∈r,∀x ∈ bx. ∀ε ∈ [0, ε0) and for ‖y−h0(t, x)‖ ≤ ϕ̄y where ε0 and ϕ̄y are positive real constants. on guaranteed global exponential stability of polynomial singularly perturbed control systems 23 – the eigenvalues λi = λi(t, x), i = 1, 2, . . . 
, m of the matrix z(t, x) := ( ∂ g ∂ y ) (t, x, h0(t, x), 0) satisfy the inequality re [λi] ≤ −2β < 0 ∀t ∈r,∀x ∈ bx (4) then there exists ε ≤ ε1 such that ∀ε ∈ [0, ε1), the singularly perturbed system (3) has an m-dimensional local integral manifold σε : y = h0(t, x) + h(t, x, ε) = h(t, x, ε) (5) where h(t, x, ε) is defined for all x ∈ bx and ε ≤ ε1 and is continuously differentiable (∈ c1) the function h(t, x, ε) ∈ c1 satisfies the so-called manifold equation : ε ∂ h ∂ t + ε ∂ h ∂ x f (t, x, h, ε)g(t, x, h, ε) (6) which is obtained by substituting y by h in equation (3). on this manifold, the flow of systems (3) is governed by the n-dimensional reduced system ẋ = f (t, x, h(t, x, ε), ε) (7) furthermore, if for x ∈ bx and p integer we have f (t, x, y, ε) ∈ c p+1, g(t, x, y, ε) ∈ c p+2 and h0(t, x) ∈ c p+2 then h ∈ c p 3 useful notations and assumptions in our study we make use of the following lemma 2 and assumptions 1-2. the lemma 2 is concerned with a kronecker transformation of vectors. more properties of the kronecker product are given in the appendix. lemma 2. [6] given x = ( x z ) ∈ rn; x ∈ rn1 , z ∈ rn2 and n = n1 + n2 there exists a matrix (i) m ∈ rn i×ni making possible a transformation which introduces the change of coordinates that forms the new following kronecker power of vector: (̂ x z )[i] =   x[i] : x[i− j] ⊗z[ j] : z[i]   ∈rni such that x [i] = ( x z )[i] = (i) m (̂ x z )[i] (8) with    (i) m = ( (i−1) m ⊗in) (i) u (i) v (1) m = in ∣∣∣∣∣∣ 1 ≤ j ≤ (i−1) ni = i ∑ j=0 n i− j 1 n j 2 (9) 24 hajer bouzaouache, naceur benhadj braiek and (i) v =   ini1 a b 0 . . . 0 ab . . . ini2   ︸ ︷︷ ︸∣∣∣∣ (i+1) blocs columns2i blocs rows (i) u =   u n (i−1) 1 ×n . . . 0 u n (i−k) 1 n (k−1) 2 ×n 0 . . . 
u n (i−1) 2 ×n   f or j ∈ {2, ...i} { a = u n2×n(i− j+1)1 ⊗i n ( j−2) 2 b = i n (i− j+1) 1 ×n ( j−1) 2 (10) the permutation matrix denoted un×m is defined in [9] un×m = n ∑ i=1 m ∑ k=1 ( ei (n) etk (m) )⊗(etk (m) ei (n) ) (11) this matrix is square (nm×nm) and has precisely a single 1 in each row and in each column. to clarify the meaning of (i) m, consider the following example: for n = 3 (n1 = 2, n2 = 1), i = 2; x [2] = ( x z )[2] = (2) m (̂ x z )[2] x [2] = ( x z )[2] =   1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1     x[2] x⊗z z[2]   assumption 1. there exists a continuously differentiable function v1(t, x) : r×rn1 →r+ such that the following inequalities hold: ∀t ∈r, x ∈rn1 α1 ‖x‖2 ≤ v1(t, x) ≤ α2 ‖x‖2 (12) ‖∇xv1(t, x)‖ ≤ α3 ‖x‖2 (13) on guaranteed global exponential stability of polynomial singularly perturbed control systems 25 ∇tv1(t, x) + (∇xv1(t, x))t f (t, x, 0, 0) ≤ −2γ1v1 (14) where α1, α2, α3, and γ1 are positive constants. assumption 2. there exists a continuously differentiable function v2(t, z) : r×rn2 →r+ such that the following inequalities hold: ∀t ∈r, z ∈rn2 β1 ‖z‖2 ≤ v2(t, z) ≤ β2 ‖z‖2 (15) dv2(t, z) dt ≤ −2 ε γ2v2(t, z) (16) (17) where β1, β2 and γ2 are positive constants. 4 main results given the system (1), (2), we have to determine an adequate feedback control u that, starting from any initial states, will attract exponentially the trajectories of the closed loop system along the chosen design manifold to the equilibrium point at the origin. in what follows, we assume that hypothesis of lemma 1 are satisfied, and hence the singularly perturbed system (1) has an n2 dimensional integral manifold: z = h0(x) (18) satisfying the equation (1-b). 
the flow of system (1), on this manifold, is governed by the n1 dimensional reduced system: ẋ = f (t, x, h0(t, x, ε), ε) (19) this result can be reached by the design of a desired control u satisfying: l(x, z)u = −g(x, z) + a(z−h0) + ε dh0(x) dx f (x, z) (20) where a is a hurwitz matrix. specifically, we choose the design manifold in this paper to be equal to z = h0(x) = 0. then, the control in equation (20) becomes: l(x, z)u = −g(x, z) + az (21) and the nonlinear singularly perturbed system (1) can be written as:    ẋ = r ∑ i=1 i+1 ∑ j=1 fi jx [i+1− j] ⊗z[ j−1] (a) ε ż = az (b) (22) it is then clear, that the fast subsystem and the fast states of the system (22) are attracted toward the manifold as quickly as desired by the choice of the hurwitz matrix a. the reduced order system of (22) is obtained by setting ε = 0 as: ẋ = f (x, 0) = r ∑ i=1 fiix [i] (23) the boundary layer system is given, in the fast time scale τ tε , by: dz∗(τ) dτ = az∗(τ) (24) 26 hajer bouzaouache, naceur benhadj braiek now to study the stability of the system (22), let’s consider that the reduced order system (23) and the boundary layer system (24) have respectively v1(x) and v2(z) as quadratic lyapunov candidate functions verifying assumptions 1-2 and defined as follows : v1(x) = x t p1x (25) v2(z) = z t p2z (26) where p1, p2 are symmetric positive definite matrices solutions of the following lyapunov equations: v̇1(x) ≤ −xt q1x (27) v̇2(z) ≤ −zt q2z (28) q1, q2 are also positive definite matrices. based on results of the stability theory [15] and others derived in previous work [1], [5], (27) and (28) are formulated as follows: τ t1 (m t 1 p1 + p1m1)τ1 ≤ −q1 (29) ( a ε )t p2 + p2 ( a ε ) ≤ −q2 (30) where p1 =   p1 0 p1 ⊗in1 . . . 0 p1 ⊗ins−11   (31) and m1 =   λ11π11 λ12π12 ··· λ1sπ1s ... . . . ... ... . . . ... λs1πs1 ··· ··· λssπss   (32) with πk− j+1, j =   mat (nk− j ,n j ) (f 1 t kk ) mat (nk− j ,n j ) (f 2 t kk ) ... 
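the attraction of the fast state toward the design manifold z = 0 and the slow reduced dynamics can be illustrated by simulating a toy two-time-scale system; the coefficients below are hypothetical, not the paper's example:

```python
# Toy two-time-scale simulation (hypothetical coefficients): slow state
# x' = -x + z and fast state eps*z' = -z with small eps. The fast state
# collapses onto the design manifold z ~ 0, after which the slow reduced
# dynamics x' = -x govern the motion, as in system (22).

def simulate(eps, x0=1.0, z0=2.0, t_end=4.0, dt=1e-4):
    x, z = x0, z0
    for _ in range(int(t_end / dt)):
        x, z = x + dt * (-x + z), z + dt * (-z / eps)  # forward Euler
    return x, z

x, z = simulate(eps=0.01)
print(abs(z) < 1e-9 and abs(x) < 0.05)  # True: both states near the origin
```

here the fast dynamics play the role of the hurwitz matrix a in (22-b): the rate 1/ε drives z to the manifold as quickly as desired.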
mat_{(n^{k−j}, n^{j})}((F_{kk}^{n})^T).  (33)

For the corrected system (22), we define the following Lyapunov function:

V(x,z,ε) = X^T E_ε P X,  (34)

where

X = [x; z] ∈ ℝ^n,  P = diag(P1, P2)  and  E_ε = diag(I_{n1}, ε I_{n2}).  (35)

The derivative of V(x,z,ε) along the trajectories of (22) is given by

V̇(x,z,ε) = X^T E_ε P Ẋ + Ẋ^T E_ε P X.  (36)

In the above equation, we need to make the derivative of the state vector X explicit. So we begin by writing the state equations (22) in the form

Ẋ = d/dt (x; z) = Σ_{i=1}^{r} Λ_i (x̂; z)^[i],  (37)

with

Λ1 = [ F11 F12 ; 0 A/ε ]  and, for i > 1,  Λ_i = [ F_{i1} ⋯ F_{ij} ⋯ F_{i(i+1)} ; 0_{n2×α_i} ].  (38)

Using the results given by Lemma 2, it follows from equation (37) that

Ẋ = Σ_{i=1}^{r} Λ_i M^(i)+ X^[i],  (39)

where M^(i)+ is the Moore-Penrose pseudo-inverse of M^(i), defined in (A.4). The derivative of the composite Lyapunov function (34) is then written as

V̇(x,z,ε) = 2 Σ_{k=1}^{r} X^T (E_ε P Λ_k M^(k)+) X^[k].  (40)

Using the property (A.11) of the vec-function, we have

V̇(x,z,ε) = 2 Σ_{k=1}^{r} V_k^T X^[k+1],  (41)

where

V_k = vec(E_ε P Λ_k M^(k)+).  (42)

It is known that every polynomial of even degree (2s) can be represented as a symmetric quadratic form. Thus, we assume in the following development that r is odd, r = 2s−1, and it comes out that

V_k^T X^[k+1] = Σ_{j=g_k}^{h_k} λ_{k−j+1,j} X^[k+1−j]T N_{k−j+1,j} X^[j],  (43)

where the λ_{k−j+1,j} are arbitrary reals verifying

Σ_{j=g_k}^{h_k} λ_{k−j+1,j} = 1,  (44)

and, for k = 1, …, 2s−1,

g_k = sup(0, k+1−s)  and  h_k = inf(s, k).  (45)

For j = g_k, …, h_k:

N_{k−j+1,j} = mat_{(n^{k−j+1}, n^{j})}(V_k).  (46)

Applying the properties of [1], one obtains

N_{k−j+1,j} = mat_{(n^{k−j+1}, n^{j})}( vec(E_ε P Λ_k M^(k)+) ) = U_{n^{k−j}×n} (E_ε P ⊗ I_{n^{k−j}}) · M_{k−j+1,j},  (47)

with M_{k−j+1,j} the block column obtained by stacking mat_{(n^{k−j}, n^{j})}((B_k^{1})^T), mat_{(n^{k−j}, n^{j})}((B_k^{2})^T), …,
mat_{(n^{k−j}, n^{j})}((B_k^{n})^T),  and  B_k = Λ_k M^(k)+,  (48)

where B_k^{i} is the i-th row of the matrix B_k:

B_k = [ B_k^{1} ; B_k^{2} ; … ; B_k^{n} ].  (49)

By (47) and from relation (48), we obtain

X^[k−j+1]T N_{k−j+1,j} X^[j] = X^[k−j+1]T U_{n^{k−j}×n} (E_ε P ⊗ I_{n^{k−j}}) M_{k−j+1,j} X^[j]
                            = X^[k−j+1]T (E_ε P ⊗ I_{n^{k−j}}) M_{k−j+1,j} X^[j].  (50)

Consequently, we have

V_k^T X^[k+1] = Σ_{j=g_k}^{h_k} λ_{k−j+1,j} X^[k−j+1]T N_{k−j+1,j} X^[j] = 𝒳^T (P_ε ℳ_k) 𝒳,  (51)

with

𝒳 = [ X ; X^[2] ; … ; X^[s] ]  (52)

and

P_ε = diag(E_ε P, E_ε P ⊗ I_n, …, E_ε P ⊗ I_{n^{s−1}}).  (53)

Let us note that P_ε is a symmetric positive matrix, and that V̇(𝒳,ε) of (41) can be written as

V̇(𝒳,ε) = 2 Σ_{k=1}^{2s−1} V_k^T X^[k+1] = 𝒳^T (P_ε M_ε + M_ε^T P_ε) 𝒳,  (54)

with

M_ε = Σ_{k=1}^{2s−1} ℳ_k = [ λ11 M11  λ12 M12  ⋯  λ1s M1s ; … ; λs1 Ms1  ⋯  λss Mss ].  (55)

When considering the non-redundant form, the vector 𝒳 can be written as

𝒳 = 𝒯 𝒳̃,  (56)

where

𝒯 = diag(T1, …, Ts)  and  𝒳̃ = [ X̃ ; … ; X̃^[s] ].  (57)

From (54) and (56) we easily obtain

V̇(𝒳,ε) = 𝒳̃^T 𝒯^T (P_ε M_ε + M_ε^T P_ε) 𝒯 𝒳̃.  (58)

Let us denote by λmax(P_ε) the largest eigenvalue of the matrix P_ε and by λmin(Q) the smallest eigenvalue of Q, where the matrix Q verifies V̇(𝒳,ε) ≤ −𝒳^T Q 𝒳. The positive definiteness of P_ε and Q implies that these scalars are strictly positive. Since matrix theory shows that

P_ε ≤ λmax(P_ε) I,  λmin(Q) I ≤ Q,  (59)

we have

𝒳^T Q 𝒳 ≥ (λmin(Q)/λmax(P_ε)) 𝒳^T [λmax(P_ε) I] 𝒳 ≥ (λmin(Q)/λmax(P_ε)) 𝒳^T [λmax(E_ε P) I] 𝒳.  (60)

Otherwise,

𝒳^T 𝒳 ≥ ‖X‖²  and  E_ε P ≤ λmax(E_ε P) I.  (61)

Hence, (54) satisfies the following condition:

V̇(𝒳,ε) ≤ −2γ V(𝒳,ε),  where  γ = (1/2) · λmin(Q)/λmax(P_ε).  (62)

It comes out that

V(𝒳,ε) ≤ V(𝒳0) e^{−2γ(t−t0)}.  (63)

Considering the previous developments, we now state our main result.

Theorem 1.
Assume that the following assumptions hold:

(i) Lemma 1 is satisfied;
(ii) Assumptions 1-2 are satisfied.

Then the system (1) is globally exponentially stable (GES) if, for all ε < ε1, ε1 > 0, there is a feasible solution to the LMI

ε > 0, ∃P1^T = P1 > 0, ∃P2^T = P2 > 0, ∃P^T = P > 0:
(A/ε)^T P2 + P2 (A/ε) ≤ −Q2,
𝒯1^T (M1^T 𝒫1 + 𝒫1 M1) 𝒯1 ≤ −Q1,
𝒯^T (M_ε^T P_ε + P_ε M_ε) 𝒯 ≤ −Q.  (64)

M1, 𝒯1 and 𝒫1 are given by (32), (57) and (31); M_ε, 𝒯 and P_ε are given by (55), (57) and (53). Moreover, the Lyapunov function that demonstrates the GES is given by V(𝒳) = 𝒳^T E_ε P_ε 𝒳.

Now, let us evaluate the convergence rate of the full-order system (22). In view of (12), (15) and (27), we have, for all t ∈ ℝ, x ∈ ℝ^{n1} and z ∈ ℝ^{n2}:

V1(x) ≤ (α2 ‖x0‖² + ε β2 ‖z0‖²) e^{−2γ(t−t0)}.  (65)

From (12) and (15) we have

‖x‖ ≤ [ (√(α2/α1) ‖x0‖)² + (√(ε β2/α1) ‖z0‖)² ]^{1/2} e^{−γ(t−t0)}
    ≤ ( √(α2/α1) ‖x0‖ + √(ε β2/α1) ‖z0‖ ) e^{−γ(t−t0)}.  (66)

Identically, using (15) and (16), we obtain

β1 ‖z‖² ≤ V2(z) ≤ β2 ‖z0‖² e^{−2(γ2/ε)(t−t0)},  (67)

and then

‖z‖ ≤ √(β2/β1) ‖z0‖ e^{−(γ2/ε)(t−t0)}.  (68)

From (66) and (68), we can write, in the case γ ≤ γ2/ε, that

‖X‖² ≤ 2η e^{−2γ(t−t0)},  where  η = max{ (√(α2/α1) ‖x0‖ + √(ε β2/α1) ‖z0‖)², (√(β2/β1) ‖z0‖)² },  (69)

which implies that the norm ‖X‖ of the state vector converges to zero exponentially, with rate γ. The convergence rates of the reduced systems can be calculated as

γ1 = (1/2) · λmin(Q1)/α2,  γ2 = λmin(Q2)/(2β2).

From the above, we state the following second result.

Theorem 2. Assume that the following assumptions hold:

(i) Lemma 1 is satisfied;
(ii) Assumptions 1-2 are satisfied;
(iii) there exists a Lyapunov function V(t,𝒳,ε) that satisfies equation (34).

Then the original nonlinear singularly perturbed system (1) is globally exponentially stable under the proposed control (21), with the convergence rate γ of (63).
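To make the exponential-convergence claim concrete, here is a hedged numerical sketch (ours, not from the paper) that applies the design equation (21), u = (−g(x,z) + Az)/B, with A = −1 and B = 10, to the third-order example system introduced in the next section, integrated with a simple Euler scheme. The values ε = 0.4 < 0.5, the step size and the initial state are illustrative assumptions.

```python
def g(x1, x2, z):
    # fast-equation nonlinearity of the example: 4x1 - 4x2 + z + 0.5x1^2 - x2^2
    return 4*x1 - 4*x2 + z + 0.5*x1**2 - x2**2

def simulate(eps=0.4, dt=1e-3, t_end=20.0):
    x1, x2, z = 1.0, 1.0, 1.0                     # illustrative initial state
    for _ in range(int(t_end / dt)):
        u = (-g(x1, x2, z) + (-1.0) * z) / 10.0   # control law (21), A = -1
        dx1 = -x1 + x2 + 0.1*x1*z
        dx2 = -x1 - 0.09*x2 + 2*z + 0.05*x1*z
        dz = (g(x1, x2, z) + 10.0*u) / eps        # closed loop: eps z' = -z
        x1, x2, z = x1 + dt*dx1, x2 + dt*dx2, z + dt*dz
    return x1, x2, z

x1, x2, z = simulate()
print(abs(x1) + abs(x2) + abs(z) < 1e-2)   # the state is driven to the origin
```

In this sketch the fast state obeys ε ż = −z exactly, so it collapses onto the design manifold z = 0 at the rate 1/ε, after which the slow states decay under the stable reduced dynamics.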
Moreover, it can be proved that the limit of γ, as ε → 0, tends to the convergence rate of the reduced-order system. This implies that, under the proposed control (21), the global exponential stability of the initially studied system is equivalent to that of the reduced-order system.

5 Illustrative example

To illustrate the previously derived results, we consider a third-order nonlinear singularly perturbed system defined by the following equations:

ẋ1 = −x1 + x2 + 0.1 x1 z,
ẋ2 = −x1 − 0.09 x2 + 2z + 0.05 x1 z,
ε ż = 4x1 − 4x2 + z + 0.5 x1² − x2² + 10u.

This system can be described by the following model, using the Kronecker product and the power of vectors, which allows important algebraic manipulations:

ẋ = F11 x + F12 z + F21 x^[2] + F22 (x⊗z) + F23 z^[2],
ε ż = G11 x + G12 z + G21 x^[2] + G22 (x⊗z) + G23 z^[2] + Bu,

where

F11 = [ −1 1 ; −1 −0.09 ],  F12 = [ 0 ; 2 ],  G11 = [ 4 −4 ],  G12 = 1,
F21 = 0_{2×4},  F22 = [ 0.1 0 ; 0.05 0 ],  F23 = 0_{2×1},
G21 = [ 0.5 0 0 −1 ],  G22 = [ 0 0 ],  G23 = 0,  B = 10.

When implemented using the LMI Toolbox of MATLAB, the proposed LMI conditions prove that the studied numerical system, which is initially unstable, can be globally exponentially stabilised by the given controller, with A = −1, for all ε < ε1 = 0.5, in view of Theorem 1:

u = −0.4 x1 + 0.4 x2 − 0.2 z − 0.05 x1² + 0.1 x2².

Figure 1: State trajectories of the controlled studied system.

Hence, for all ε < ε1 = 0.5 and from any initial state, the trajectories of the system are steered to the origin along the integral manifold, with the convergence rate 0.1, in view of (62). Indeed, it is shown in Fig. 1 that the state trajectories of the controlled system (– – –) are bounded by the function (—).

6 Conclusion

In this paper, the global exponential stabilisation of nonlinear singularly perturbed control systems is investigated.
In the stability study, the composite Lyapunov method was applied and the global exponential stability of the equilibrium of the full control system was established for all ε < ε1. The upper bound ε1, for which the stability properties are guaranteed, can be reached after a number of iterations on ε when solving the proposed LMI conditions via the LMI Toolbox of MATLAB. A numerical example has been provided to illustrate the proposed results.

7 Appendix

Notations. The dimensions of the matrices used here are the following: A(p×q), B(r×s), C(s×h), D(s×h), E(n×p), P(n×n), X(n×1) ∈ ℝ^n, Y(m×1) ∈ ℝ^m, Z(q×1) ∈ ℝ^q. Let us consider the following notations: I_n, the n×n identity matrix; 0_{n×m}, the n×m zero matrix, and 0, a zero matrix of convenient dimensions; A^T, the transpose of matrix A; A > 0 (A ≥ 0), a symmetric positive definite (semi-definite) matrix A; e_k(q), the q-dimensional unit vector which has 1 in the k-th element and zero elsewhere. The k-th row of a matrix A is denoted A_k, and the k-th column is denoted A._k. The ik element of A is denoted a_{ik}. The Kronecker product of A and B, denoted A⊗B, is a (p·r × q·s) matrix, and the i-th Kronecker power of A, denoted A^[i] = A⊗A⊗···⊗A (i factors), is a (p^i × q^i) matrix.

The non-redundant j-power X̃^[j] of the state vector X, introduced in [9], is defined as follows:

X̃^[1] = X^[1] = X, and for all j ≥ 2:
X̃^[j] = [ x1^j, x1^{j−1}x2, …, x1^{j−1}xn, x1^{j−2}x2², x1^{j−2}x2x3, …, x1^{j−2}x2xn, …, x1^{j−2}xn², …, x1^{j−3}x2³, …, xn^j ],  (A.1)

where the repeated components of the redundant j-power are omitted. We then have the following relation:

∀j ∈ ℕ, ∃! T_j ∈ ℝ^{n^j×α_j}, α_j = C(n+j−1, j):  X^[j] = T_j X̃^[j].  (A.2)

Thus, one possible solution for the inversion can be written as

X̃^[j] = T_j^+ X^[j],  (A.3)

where T_j^+ is the Moore-Penrose pseudo-inverse of T_j, given by

T_j^+ = (T_j^T T_j)^{−1} T_j^T,  (A.4)

and α_j stands for the binomial coefficient.
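To make the reduction (A.1)-(A.4) concrete, here is a small pure-Python check for n = 2 and j = 2; the encoding (plain lists, hard-coded T_2) is ours, not the paper's.

```python
# x^[2] = (x1^2, x1x2, x2x1, x2^2) is redundant; the non-redundant power is
# x~^[2] = (x1^2, x1x2, x2^2), of size alpha_2 = C(n+j-1, j) = C(3, 2) = 3.

x1, x2 = 3.0, 5.0
x2_full = [x1*x1, x1*x2, x2*x1, x2*x2]      # x^[2] = x (x) x
x2_tilde = [x1*x1, x1*x2, x2*x2]            # x~^[2]

# T_2 maps x~^[2] to x^[2]: the repeated monomial x1x2 appears twice.
T2 = [[1, 0, 0],
      [0, 1, 0],
      [0, 1, 0],
      [0, 0, 1]]

lifted = [sum(T2[i][j] * x2_tilde[j] for j in range(3)) for i in range(4)]
assert lifted == x2_full               # x^[2] = T_2 x~^[2]   (A.2)

# T_2^+ = (T_2^T T_2)^{-1} T_2^T: here T_2^T T_2 = diag(1, 2, 1), so the
# pseudo-inverse simply averages the two copies of x1x2.
recovered = [x2_full[0], (x2_full[1] + x2_full[2]) / 2, x2_full[3]]
assert recovered == x2_tilde           # x~^[2] = T_2^+ x^[2] (A.3)
print("T_2 reduction verified")
```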
An important vector-valued function of a matrix, denoted vec(·), was defined in [9] as

vec_{pq×1}(A) = [ A._1 ; A._2 ; … ; A._q ].  (A.5)

A matrix-valued function of a vector, denoted mat_{(n,m)}(·), was defined in [1] as follows: if V is a vector of dimension p = n·m, then M = mat_{(n,m)}(V) is the n×m matrix verifying

V = vec(M).  (A.6)

Among the main properties of this product presented in [9], [1], we recall the following useful ones:

(A⊗B)(C⊗D) = (AC)⊗(BD),  (A.7)
(A⊗B)^T = A^T ⊗ B^T,  (A.8)
B⊗A = U_{r×p} (A⊗B) U_{q×s},  (A.9)
X⊗Y = U_{n×m} (Y⊗X),  (A.10)
vec(EAC) = (C^T ⊗ E) vec(A),  (A.11)
vec(A^T) = U_{p×q} vec(A),  (A.12)
∀i ≤ k:  X^[k] = U_{n^i×n^{k−i}} X^[k].  (A.13)

References

[1] E. Benhadj Braiek, F. Rotella, M. Benrejeb, Algebraic criteria for global stability analysis of nonlinear systems, SAMS Journal, Vol. 17, pp. 211-227, 1995.
[2] P. Borne, J.C. Gentina, On the stability of large nonlinear systems. Structured and simultaneous Lyapunov for system stability problem, Joint Aut. Cont. Conf., Austin, Texas, 1974.
[3] P. Borne, M. Benrejeb, On the stability of a class of interconnected systems. Application to the forced working condition, Actes 4th IFAC Symposium MTS, Fredericton, Canada, 1977.
[4] P. Borne, G. Dauphin-Tanguy, Singular perturbations, boundary layer problem, Systems and Control Encyclopedia, M.G. Singh ed., Pergamon Press, III I-2, 1987.
[5] H. Bouzaouache, E. Benhadj Braiek, On the stability analysis of nonlinear systems using polynomial Lyapunov functions, IMACS World Congress, Paris, 2005.
[6] H. Bouzaouache, Sur la représentation d'état des systèmes non linéaires singulièrement perturbés polynomiaux, Conférence Tunisienne de Génie Electrique CTGE, pp. 420-424, 2004.
[7] H. Bouzaouache, E. Benhadj Braiek, M. Benrejeb, A reduced optimal control for nonlinear singularly perturbed systems, Systems Analysis Modelling Simulation Journal, Vol. 43, pp. 75-87, 2003.
[8] S. Boyd, L.
El Ghaoui, E. Feron, V. Balakrishnan, Linear Matrix Inequalities in System and Control Theory, SIAM, Philadelphia, 1994.
[9] J.W. Brewer, Kronecker product and matrix calculus in system theory, IEEE Trans. on Circ. Syst., Vol. 25, No. 9, pp. 772-781, 1978.
[10] C.C. Chen, Research on the feedback control of nonlinear singularly perturbed systems, PhD thesis, Inst. Electrical Eng., National Sun Yat-sen Univ., Taiwan, ROC, 1994.
[11] F. Ghorbel, M.W. Spong, Integral manifolds of singularly perturbed systems with application to rigid-link flexible joint multibody systems, International Journal of Nonlinear Mechanics, Vol. 35, pp. 133-155, 2000.
[12] M. Innocenti, L. Greco, L. Pollini, Sliding mode control for two time scale systems: stability issues, Automatica, Vol. 39, pp. 273-280, 2003.
[13] P.V. Kokotovic, H.K. Khalil, J. O'Reilly, Singular Perturbation Methods in Control: Analysis and Design, Academic Press, New York, 1986.
[14] P.M. Sharkey, J. O'Reilly, Exact design manifold control of a class of NLSP systems, IEEE Trans. Automat. Contr., Vol. 32, pp. 933-937, 1987.
[15] J.J.E. Slotine, Applied Nonlinear Control, Prentice Hall, Englewood Cliffs, NJ, 1991.
[16] V.A. Sobolev, Integral manifolds and decomposition of singularly perturbed systems, Systems Control Letters, Vol. 5, pp. 169-179, 1984.
[17] M.W. Spong, K. Khorasani, P.V. Kokotovic, An integral manifold approach to feedback control of flexible joint robots, IEEE J. Robot. Automat., Vol. 3, No. 4, pp. 291-300, 1987.

Hajer Bouzaouache, Naceur Benhadj Braiek
Laboratory of Study and Automatic Control of Processes
Address: Polytechnic School of Tunisia (EPT), BP 743, 2078 La Marsa, Tunisia
E-mail: {hajer.bouzaouache, naceur.benhadj}@ept.rnu.tn
Received: November 6, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 2, pp. 224-237

Towards Structured Modelling with HyperDag P Systems

R. Nicolescu, M.J. Dinneen, Y.-B. Kim

Michael J.
Dinneen, Yun-Bum Kim and Radu Nicolescu
Department of Computer Science, University of Auckland, Private Bag 92019, Auckland, New Zealand
E-mail: {radu,mjd,yun}@cs.auckland.ac.nz

Abstract: Although P systems are computationally complete, many real-world models, such as socio-economic systems, databases, operating systems and distributed systems, seem to require more expressive power than provided by tree structures. Many such systems have a primary tree-like structure augmented with shared or secondary communication channels. Modelling these as tree-based systems, while theoretically possible, is not very appealing, because it typically needs artificial extensions that introduce additional complexities, inexistent in the originals. In this paper, we propose and define a new model called hyperdag P systems, in short, hP systems, which extend the definition of conventional P systems by allowing DAGs, interpreted as hypergraphs, instead of trees, as models for the membrane structure. We investigate the relation between our hP systems and neural P systems. Despite using an apparently restricted structure, i.e., a DAG instead of a general graph, we argue that hP systems have essentially the same computational power as tissue and neural P systems. We argue that hP systems offer a structured approach to membrane-based modelling that is often closer to the behavior and underlying structure of the modelled objects.

Keywords: hyperdag P systems, tissue and neural P systems, membrane structures.

1 Introduction

P systems provide a distributed computational model, based on the structure and interaction of living cells, introduced by G. Păun in 1998 [10]. The model was initially based on transition rules, but was later expanded into a large family of related models. Essentially, all versions of P systems have a structure consisting of cell-like membranes and a set of rules that govern their evolution over time.
Many of the "classical" versions use a structure where membranes correspond to nodes in a rooted tree. Such a structure is often visualized as a Venn diagram, where nesting denotes a parent-child relationship. For example, Figure 1 [12] shows the same P system structure with nine membranes, labelled 1, …, 9, both as a rooted tree and as a Venn diagram.

Figure 1: A P system structure represented as a tree and as a Venn diagram.

More recently, neural P systems [11], abbreviated as nP systems (also known as tissue P systems [7]), have been introduced, partially to overcome the limitations of the tree model. Essentially, these systems organize their cells in an arbitrary graph. For example, ignoring for the moment the actual contents of cells (states, objects, rules), Figure 2 illustrates the membrane structure of a simple nP system, consisting of three cells, σ1, σ2, σ3, where cell σ1 is the output cell.

Figure 2: An nP system structure represented as a digraph.

A large variety of rules have been used to describe the operational behavior of P systems, such as multiset rewriting rules, communication rules and membrane handling rules. Essentially, transition P systems and nP systems use multiset rewriting rules, P systems with symport/antiport rules operate by communicating immutable objects, and P systems with active membranes combine all three rule types. For a comprehensive overview and more details, we refer the reader to [11, 12]. Besides theoretical computer science and biology, P systems have been applied to a variety of other domains, ranging from linguistics [5] to theoretically efficient solutions of NP-complete problems [16], or to model distributed algorithms [3, 6].

Copyright © 2006-2010 by CCC Publications
The underlying tree structure provides good support for reasoning and formal verification, good potential for efficient implementation on multi-core architectures, and an excellent visualization, very appealing to practitioners.

Although P systems are computationally complete, many real-world models seem to require more expressive power, essentially trees augmented by shared or secondary communication channels. For example, the notion of a processing node having a unique parent is not true for (a) computer networks, where a computer could simultaneously be attached to several subnets (e.g., to an Ethernet bus and to a wireless cell); (b) living organisms, which may be the result of multiple inheritance (e.g., the evolutionary "tree" is not really a tree, because of lateral gene transfer [4]); and (c) socio-economic scenarios, where a player is often connected to and influenced by more than one factor [13, 14, 15].

Modelling these as tree-based systems, while theoretically possible, is not very appealing. Simulating shared or secondary channels requires artificial mechanisms that ripple data up and down the tree, via a common ancestor. This could, of course, limit the merits of using a formal model. Models based on general graphs, such as nP systems, while allowing any direct communications, also tend to obscure the structures already present in the modelled objects, limiting the advantages that a more structured approach could provide. Verification is more difficult without a clear modularization of concerns, practical parallel implementation could be less efficient if the locality of reference is not enforced, and visualizations are not very meaningful unless the primary structure is clearly emphasized. We do not think that we have to choose between structure and flexibility. We propose a solution that seems to combine both, i.e., flexibility without sacrificing the advantages of a structured approach.
Our main contribution in this paper is to propose a new model for P systems, called hyperdag P systems, in short, hP systems, that allows more flexible communications than tree-based models, while preserving a strong hierarchical structure. This model, defined in Section 3, (a) extends the tree structure of classical P systems to directed acyclic graphs (DAGs), (b) augments the operational rules of nP systems with broadcast facilities (via a go-sibling transfer tag), and (c) enables dynamical changes of the rewriting modes (e.g., to alternate between determinism and parallelism) and of the transfer modes (e.g., to switch between unicast and broadcast). In contrast, classical P systems, both tree- and graph-based, seem to focus on a statical approach.

We investigate the relation between our hP systems and nP systems. Despite using an apparently restricted structure, we show in Section 4 that our DAG-based model has the same computational power as graph-based tissue P systems and neural P systems. We argue that hP systems offer a structured approach to membrane-based modelling that is often closer to the behavior and underlying structure of the modelled objects. Because our extensions address the membrane topology, not the rules model, they can be applied to a variety of P system flavors, such as systems based on symport/antiport rules. We support our view with a realistic example (see Examples 11 and 12), inspired from computer networking, modelled as an hP system with a shared communication channel (broadcast channel).

Classical P systems allow a "nice" planar visualization, where the parent-child relationships between membranes are represented by Venn-like diagrams. We show in Section 5 that the extended membrane structure of hP systems can still be visualized by hierarchically nested planar regions.

2 Preliminaries

A (binary) relation R over two sets X and Y is a subset of their Cartesian product, R ⊆ X×Y.
For A ⊆ X and B ⊆ Y, we set R(A) = {y ∈ Y | ∃x ∈ A, (x,y) ∈ R} and R⁻¹(B) = {x ∈ X | ∃y ∈ B, (x,y) ∈ R}.

A digraph (directed graph) G is a pair (X,A), where X is a finite set of elements called nodes (or vertices), and A ⊆ X×X is a binary relation of elements called arcs. For an arc (x,y) ∈ A, x is a predecessor of y and y is a successor of x. A path of length n−1 is a sequence of n distinct nodes x1, …, xn such that {(x1,x2), …, (xn−1,xn)} ⊆ A. A cycle is a path x1, …, xn, where n ≥ 1 and (xn,x1) ∈ A. A DAG (directed acyclic graph) is a digraph (X,A) without cycles. For x ∈ X, A⁻¹(x) = A⁻¹({x}) are x's parents, A(x) = A({x}) are x's children, and A(A⁻¹(x))\{x} = A(A⁻¹({x}))\{x} are x's siblings (siblings defines a symmetric relation). A node x ∈ X is a source iff |A⁻¹(x)| = 0, and x ∈ X is a sink iff |A(x)| = 0. The height of a node x is the maximum length of all paths from x to a sink node. An arc (x,y) is transitive if there exists a path x1, …, xn, with x1 = x, xn = y and n > 2. DAGs without transitive arcs are here called canonical. A (rooted unordered) tree is a DAG with exactly one source, called the root, in which all other nodes have exactly one parent (predecessor). Sinks in a tree are also called leaves. A topological order of a DAG is a linear reordering of vertices in which each vertex x comes before all its children A(x). DAGs and trees are typically represented with parent-child arcs on the top-down axis, i.e., sources (roots) up and sinks (leaves) down. Example DAGs are shown in Figures 3 and 4.

We consider a variant hypergraph definition, based on multisets, as an extension of the classical definition [1], which is based on sets. A hypergraph is here a pair (X,E), where X is a finite set of elements called nodes (or vertices), and E is a finite multiset of subsets of X, i.e., e ⊆ X for each e ∈ E.
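The parent, child and sibling relations defined above are easy to compute directly from the arc set. The sketch below (helper names ours) does so for the height-1 DAG of Figure 3, with arcs {(d,a),(d,b),(d,c),(e,b),(e,c),(f,b),(f,c)}.

```python
delta = {("d", "a"), ("d", "b"), ("d", "c"),
         ("e", "b"), ("e", "c"), ("f", "b"), ("f", "c")}

def parents(x):   # A^{-1}(x)
    return {u for (u, v) in delta if v == x}

def children(x):  # A(x)
    return {v for (u, v) in delta if u == x}

def siblings(x):  # A(A^{-1}(x)) \ {x}
    return {v for p in parents(x) for v in children(p)} - {x}

assert parents("b") == {"d", "e", "f"}
assert siblings("a") == {"b", "c"}       # a shares its parent d with b and c
assert children("a") == set()            # a is a sink
assert parents("d") == set()             # d is a source, i.e. a hyperedge
print("DAG relations verified")
```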
By using a multiset of edges, instead of a more conventional set of edges, we introduce an intensional element, where two extensionally equivalent hyperedges (i.e., hyperedges containing the same nodes) are not necessarily equal. A graph is a set-based hypergraph, where hyperedges are known as edges and contain exactly two nodes. Alternatively, a graph (X,E) can be interpreted as a digraph (X,A), where A = {(x,y) | {x,y} ∈ E}. Hypergraphs (set or multiset based) can be represented by planar diagrams, where hyperedges are represented as regions delimited by images of Jordan curves (simple closed curves) [2].

With the above hypergraph definition, a height 1 DAG (X,A) can be interpreted as a hypergraph (X,E), where E is the multiset E = {A(x) | |A⁻¹(x)| = 0}. For example, Figure 3 represents, side by side, the DAG D = ({a,b,c,d,e,f}, {(d,a),(d,b),(d,c),(e,b),(e,c),(f,b),(f,c)}) and its corresponding hypergraph H = ({a,b,c},{d,e,f}), where d = {a,b,c}, e = {b,c}, f = {b,c}. Note that the apparently empty differences of regions are needed in the case of multiset-based hypergraphs, to support the intensional (as opposed to the extensional) aspect: here e ≠ f, despite containing the same nodes, b and c, and neither e nor f is included in d.

Generalizing the above hypergraph definition, a height n generalized hypergraph is a system (X,E), recursively built via a sequence of n hypergraphs (X1,E1), …, (Xn,En), where X1 = X, Xi ∩ Ei = ∅, Xi+1 =
we will later see that any generalized hypergraph that corresponds to a non-transitive dag can also be represented by hierarchically nested planar regions delimited by jordan curves, where arcs are represented by direct nesting. for example, figure 4 shows a height 2 dag and its corresponding height 2 hypergraph (x,e), where x = x1 = {a,b,c,d,e}, e1 = { f ,g,h}, e2 = {i}, e = { f ,g,h,i}. a b c d e a b c d e f g h i f g h i figure 4: a height 2 dag and its corresponding height 2 hypergraph. an alphabet o is a finite non-empty sets of objects. we will assume that the alphabet o is implicitly ordered. multisets over an alphabet o are represented as strings over o, such as on11 ...o nk k , where oi ∈ o, ni ≥ 0, and, in the canonical form, letters appear in sorted order, i.e., o1 < ··· < ok, and ni ≥ 1. the set of all multisets is denoted by o∗. for this representation, two strings are equivalent if they become equal after sorting, e.g., a2cbd0a and a3bc are equivalent representations of the same multiset {a,a,a,b,c}. under this convention, the empty string λ represents the empty multiset, and string concatenation represents multiset union, e.g., (a2c) ·(ab) = a3bc. 3 hyperdag p systems in this paper we use the definition of np systems, as given in [11], that coincides with an early definition of tissue p systems as given in [7]. our definition includes a small technical correction (slight ambiguity). for details, please see our technical report [8]. as in the mentioned np systems definition, we will use the following sets of tagged objects: ogo = {(a,go) | a ∈ o}, oout = {(a,out) | a ∈ o}, and we set otot = o ∪ ogo ∪ oout . for simplicity, we will use subscripts for these tagged objects, such as ago for (a,go) and aout for (a,out). we also define projection homomorphisms, here denoted in postfix notation: |o, |go, |out : o∗tot → o∗, by o|o = o,ogo|go = o,oout|out = o for o ∈ o, and otherwise λ . for example, a2a3gob4bgo|go = a3b. 
Besides the existing go and out tags, we consider three other object tags:

1. go-parent, abbreviated by the symbol ↑, indicating objects that will be sent to parents;
2. go-child, abbreviated by the symbol ↓, indicating objects that will be sent to children;
3. go-sibling, abbreviated by the symbol ↔, indicating objects that will be sent to siblings.

The precise semantics of these tags will be explained below, when we detail the hP system object transfer modes. In fact, we could discard the go tag, as it corresponds to the union of these new target tags (go-parent, go-child, go-sibling); however, we keep it here for its concise expressive power. We use notation similar to that of nP systems for these new tags, o↑, o↓, o↔, and postfix projections |↑, |↓, |↔. Other extension tags, including addressing mechanisms (such as from, to or via tags), are possible, and indeed seem natural, but this is beyond the scope of this article.

We will now define hP systems, as an apparent restriction of nP systems, where the underlying structure is a DAG, with several other adjustments.

Definition 1 (hyperdag P systems). An hP system (of degree m) is a system π = (O, σ1, …, σm, δ, I_out), where:

1. O is an ordered finite non-empty alphabet of objects;
2. σ1, …, σm are cells of the form σi = (Qi, si,0, wi,0, Pi), 1 ≤ i ≤ m, where:
   • Qi is a finite set (of states),
   • si,0 ∈ Qi is the initial state,
   • wi,0 ∈ O* is the initial multiset of objects,
   • Pi is a finite set of multiset rewriting rules of the form sx → s′x′ u↑ v↓ w↔ y_go z_out, where s, s′ ∈ Qi, x, x′ ∈ O*, u↑ ∈ O*↑, v↓ ∈ O*↓, w↔ ∈ O*↔, y_go ∈ O*_go and z_out ∈ O*_out, with the restriction that z_out = λ for all i ∈ {1,…,m}\I_out;
3. δ is a set of DAG parent-child arcs on {1,…,m}, i.e., δ ⊆ {1,…,m}×{1,…,m}, representing duplex communication channels between cells;
4. I_out ⊆ {1,…,m} indicates the output cells, the only cells allowed to send objects to the "environment".
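The multiset-string conventions of Section 2 and the projection homomorphisms used in the rule format above can be illustrated as follows; canon and project are our own helper names, and the tuple encoding of tagged objects is an assumption, not the paper's notation.

```python
import re
from collections import Counter

def canon(s):
    """Canonical form: letters sorted, exponents summed, o^0 dropped."""
    counts = Counter()
    for letter, digits in re.findall(r"([a-z])(\d*)", s):
        counts[letter] += int(digits) if digits else 1
    return "".join(o + (str(n) if n > 1 else "")
                   for o, n in sorted(counts.items()) if n > 0)

assert canon("a2cbd0a") == canon("a3bc") == "a3bc"  # equivalent strings
assert canon("") == ""                              # lambda, the empty multiset
assert canon("a2c" + "ab") == "a3bc"                # concatenation is union

def project(tagged, tag):
    """|_tag: keep the objects carrying the given tag and strip the tag."""
    return canon("".join(o * n for (o, t, n) in tagged if t == tag))

# a^2 a_go^3 b^4 b_go, projected by |_go, yields a^3 b:
w = [("a", "", 2), ("a", "go", 3), ("b", "", 4), ("b", "go", 1)]
assert project(w, "go") == "a3b"
assert project(w, "") == "a2b4"
print("multiset conventions verified")
```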
The essential novelty of our proposal is to replace the arbitrary arc set used in neural P systems by a more structured arc set δ (a DAG) or, otherwise interpreted, a generalized multiset-based hypergraph. This interpretation has actually suggested the name of our proposal, hyperdag P systems, and their abbreviation, hP systems. The changes in the rule format are mostly adaptations needed by the new topological structure. Here, we have reused and enhanced the rewriting rules used by nP systems [11]. However, we could adopt and adapt any other rule set, from other variants or extensions of P systems, such as rewriting, symport/antiport or boundary rules [12].

Definitions of configurations, transitions, computations and results of computations in hP systems are similar to the definitions used for nP systems (see also [8]), with the following essential additions and differences, here informally stated:

• The rewriting mode α and transfer mode β need not be fixed from the start, i.e., they may vary, for each cell σi and state s ∈ Qi.
• If the object transfer mode is repl (this is a deterministic step):
  - the objects tagged with ↑ will be sent to all the parents, replicated as necessary;
  - the objects tagged with ↓ will be sent to all the children, replicated as necessary;
  - the objects tagged with ↔ will be sent to all the siblings, of all sibling groups, replicated as necessary.

• If the object transfer mode is one (this is a nondeterministic step):
  - the objects tagged with ↑ will be sent to one of the parents, arbitrarily chosen;
  - the objects tagged with ↓ will be sent to one of the children, arbitrarily chosen;
  - the objects tagged with ↔ will be sent to one of the siblings, of one of the sibling groups, arbitrarily chosen.

• If the object transfer mode is spread (this is a nondeterministic step):
  - the objects tagged with ↑ will be split into submultisets and distributed among the parents, in an arbitrary way;
  - the objects tagged with ↓ will be split into submultisets and distributed among the children, in an arbitrary way;
  - the objects tagged with ↔ will be split into submultisets and distributed among the siblings and sibling groups, in an arbitrary way.

Figure 5 schematically shows the possible object transfers from a cell σi having two children, two parents, and hence two sibling groups, with one sibling in the first group and two siblings in the other. The above-mentioned transfer modes will select one, some or all of the illustrated transfer targets, deterministically (repl) or nondeterministically (one, spread).

Figure 5: An annotated hP system indicating possible transfers from cell σi. The parent-child axis is top-down. Plain lines indicate parent-child relations and dashed lines indicate siblings. Arrows at the end of long thick lines, plain or dashed, indicate possible transfer directions from cell σi.

More formal definitions follow.
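Before the formal definitions, the three transfer modes can be sketched in code for go-parent objects; the representation (lists of objects, integer cell labels) is ours, and one/spread use an explicit random source to model the nondeterministic choice.

```python
import random

def transfer_up(w, parents, mode, rng):
    """Distribute the up-tagged multiset w among the given parents."""
    out = {p: [] for p in parents}
    if mode == "repl":                       # deterministic: replicate to all
        for p in parents:
            out[p] = list(w)
    elif mode == "one":                      # nondeterministic: one target
        out[rng.choice(parents)] = list(w)
    elif mode == "spread":                   # nondeterministic: partition
        for obj in w:
            out[rng.choice(parents)].append(obj)
    return out

rng = random.Random(0)
w = ["a", "a", "b"]
repl = transfer_up(w, [1, 2], "repl", rng)
assert repl[1] == w and repl[2] == w         # both parents get a full copy
one = transfer_up(w, [1, 2], "one", rng)
assert sorted(one[1] + one[2]) == sorted(w)  # exactly one copy in total
spread = transfer_up(w, [1, 2], "spread", rng)
assert sorted(spread[1] + spread[2]) == sorted(w)  # a partition of w
print("transfer modes verified")
```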
Definition 2 (configurations). A configuration of the hP system π is an m-tuple of the form (s1w1, …, smwm), with si ∈ Qi and wi ∈ O*, for 1 ≤ i ≤ m. The m-tuple (s1,0 w1,0, …, sm,0 wm,0) is the initial configuration of π.

Definition 3 (rewriting and transfer modes). For an hP system of degree m:

• the object rewriting mode is a function α: ∪_{i∈{1,…,m}} {i}×Qi → {min, par, max};
• the object transfer mode is a function β: ∪_{i∈{1,…,m}} {i}×Qi → {repl, one, spread}.

Definition 4 (rewriting steps). For each cell σi with s, s′ ∈ Qi, x ∈ O*, y ∈ O*_tot, we define a rewriting step, denoted by ⇒_α, where α = α(i,s) ∈ {min, par, max}:

• sx ⇒_min s′y iff sw → s′w′ ∈ Pi, w ⊆ x, and y = (x − w) ∪ w′;
• sx ⇒_par s′y iff sw → s′w′ ∈ Pi, w^k ⊆ x, w^{k+1} ⊄ x, for some k ≥ 1, and y = (x − w^k) ∪ w′^k;
• sx ⇒_max s′y iff sw1 → s′w′1, …, swk → s′w′k ∈ Pi, k ≥ 1, such that w1…wk ⊆ x, y = (x − w1…wk) ∪ w′1…w′k, and there is no sw → s′w′ ∈ Pi such that w1…wk w ⊆ x (note that rules can be combined only if they start from the same state s and end in the same state s′).

Definition 5 (transition steps). Given two configurations c1 = (s1w1, …, smwm) and c2 = (s′1w″1, …, s′mw″m), we write c1 ⇒_{α,β} c2, for α and β as defined in Definition 3, if the conditions below are met. First, we apply rewriting steps (as defined in Definition 4) on each cell, i.e., si wi ⇒_{α(i,si)} s′i w′i, 1 ≤ i ≤ m. Secondly, we define z↑_{j,k}, z↓_{j,k}, z↔_{j,k}, the outgoing multisets from j to k, where j ∈ {1,…,m} and, respectively, k ∈ δ⁻¹(j), k ∈ δ(j), k ∈ δ(δ⁻¹(j))\{j}:

• If β(j, sj) = repl, then
  - z↑_{j,k} = w′j|↑, for k ∈ δ⁻¹(j);
  - z↓_{j,k} = w′j|↓, for k ∈ δ(j);
  - z↔_{j,k} = w′j|↔, for k ∈ δ(δ⁻¹(j))\{j}.
• if β(j, sj) = one, then
  – z↑j,kj = w′j|↑, for an arbitrary kj ∈ δ−1(j), and z↑j,k = λ for k ∈ δ−1(j) \ {kj};
  – z↓j,kj = w′j|↓, for an arbitrary kj ∈ δ(j), and z↓j,k = λ for k ∈ δ(j) \ {kj};
  – z↔j,kj = w′j|↔, for an arbitrary kj ∈ δ(δ−1(j)) \ {j}, and z↔j,k = λ for k ∈ δ(δ−1(j)) \ {j, kj}.
• if β(j, sj) = spread, then
  – {z↑j,k}k∈δ−1(j) is an arbitrary multiset partition of w′j|↑;
  – {z↓j,k}k∈δ(j) is an arbitrary multiset partition of w′j|↓;
  – {z↔j,k}k∈δ(δ−1(j))\{j} is an arbitrary multiset partition of w′j|↔.

finally, we set w′′i = w′i|o ∪ (∪j∈δ−1(i) z↑j,i) ∪ (∪j∈δ(i) z↓j,i) ∪ (∪j∈δ(δ−1(i))\{i} z↔j,i), for i ∈ {1, ..., m}.

definition 6 (halting and results) if no more transitions are possible, the hp system halts. for a halted hp system, the computational result is the multiset that was cumulatively sent out (to the "environment") from the output cells iout. the numerical result is the set of vectors consisting of the object multiplicities in the result multiset. within the family of p systems, two systems are functionally equivalent if they yield the same computational results.

example 7 consider two functionally equivalent hp systems, π1 and π2.
π1 = (o, σ1, σ2, σ3, δ, iout), where:
• o = {a};
• σ1 = ({s}, s, a, {sa → sa↓, sa → saout});
• σ2 = ({s}, s, λ, {sa → sa↑});
• σ3 = ({s}, s, λ, {sa → sa↑});
• δ = {(1, 2), (1, 3)};
• iout = {1}.
π2 = (o, σ1, σ2, σ3, σ4, σ5, δ, iout), where:
• o = {a};
• σ1 = ({s}, s, a, {sa → sa↔, sa → saout});
• σ2 = ({s}, s, λ, {sa → sa↔});
• σ3 = ({s}, s, λ, {sa → sa↔});
• σ4 = ({s}, s, λ, ∅);
• σ5 = ({s}, s, λ, ∅);
• δ = {(4, 1), (4, 2), (5, 1), (5, 3)};
• iout = {1}.

4 relations between p systems, neural p systems and hyperdag p systems

theorem 8 (hyperdag p systems include non-dissolving transition p systems).
any non-dissolving¹ transition p system can be simulated by an hp system, with the same number of steps and object transfers.

proof: given a non-dissolving transition p system πt [12], we build a functionally equivalent hp system πh by the following transformation f. essentially, we use the same elements, with minor adjustments. as the underlying structure, we can reuse the rooted tree structure of the p system, because any rooted tree is a dag.
πt = (o, c, µ, w1, ..., wm, r1, ..., rm, io), πh = f(πt) = (o′, σ′1, ..., σ′m, δ, iout).
• o′ = o;
• σ′i = (q′i, s′i,0, w′i,0, p′i), 1 ≤ i ≤ m, where:
  – q′i = {s}, where s ∉ o;
  – s′i,0 = s;
  – w′i,0 = wi;
  – p′i = {su → sv′ | u → v ∈ ri}, where v′ is the translation of v by the homomorphism (o ∪ o × {here, in, out})∗ → o∗tot defined by a → a, (a, here) → a, (a, out) → a↑, (a, in) → a↓;
• δ = µ;
• iout = {io};
• the object rewriting mode is the constant function max, i.e., α(i, s) = max, for i ∈ {1, ..., m}, s ∈ qi;
• the object transfer mode is the constant function spread, i.e., β(i, s) = spread, for i ∈ {1, ..., m}, s ∈ qi.
the tags go-child (↓) and go-parent (↑) correspond to the p system target indications in and out, respectively. an empty tag corresponds to the p system target indication here. the object rewriting and transfer modes of hp systems are a superset of the object rewriting and transfer modes of p systems. we omit here the rest of the proof, which shows that the two systems, πt and πh, yield the same computational results; it is straightforward but lengthy. □

¹ a membrane dissolves if we allow rules that tell a membrane to disappear, where its remaining objects go to its parent membrane; see [12].

remark 9 in analogy to p systems, it is straightforward to extend hp systems with additional features, such as dissolving membranes, priorities or polarities. however, to keep the arguments simple, we omit such extensions here.
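the translation homomorphism used in the proof of theorem 8 is mechanical; a minimal python sketch (the function name `translate` and the data layout — targeted objects as tuples — are our own conventions):

```python
def translate(v):
    """translate the right-hand side v of a transition p system rule into
    hp tags, per the homomorphism in theorem 8's proof:
      a -> a, (a, 'here') -> a, (a, 'out') -> a + '↑', (a, 'in') -> a + '↓'."""
    tag = {'here': '', 'out': '\u2191', 'in': '\u2193'}
    out = []
    for item in v:
        if isinstance(item, tuple):          # a targeted object (a, target)
            a, target = item
            out.append(a + tag[target])
        else:                                # a bare object, stays untagged
            out.append(item)
    return out
```

for instance, the right-hand side [(a, out), (b, in), c] becomes [a↑, b↓, c], matching the correspondence of tags and target indications stated above.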
proving that hp systems can also simulate np systems appears more daunting. here, however, we use a natural interpretation of hp systems in which the bulk of the computing is done by the sink nodes, while the upper nodes (parents) function mostly as communication channels.

remark 10 the combination of the go-sibling tag (↔) with the repl object transfer mode enables the efficient modelling of a communication bus among n nodes, using only one hyperedge or, in the corresponding dag, n arcs. in contrast, any formal system that uses graph edges (or digraph arcs) to model 1:1 communication channels needs n(n−1)/2 separate edges (or n(n−1) arcs) to model the associated complete subgraph (clique). we expect this modelling improvement to also translate into a complexity advantage, if we count the number of messages: in hp systems, a local broadcast needs only one message to the siblings, while graph- or digraph-based systems need n−1 messages.

example 11 figure 6 shows the structure of an hp system that models a computer network. four computers are connected to "ethernet bus 1", the other four computers are connected to "ethernet bus 2", while two of the first group and two of the second group are at the same time connected to the same wireless cell. the figure also suggests that "ethernet bus 1" and "ethernet bus 2" are themselves connected to a higher-level communication hub in a generalized hypergraph.

example 12 figure 7 shows the computer network of figure 6, modelled as a graph (if we omit the arrows) or as a digraph (if we consider the arrows). note that graph- or digraph-based models, such as np systems, do not support the grouping concept, i.e., there is no direct way to mark the nodes a, b, c, d as being part of "ethernet bus 1".

we can now state an important theorem comparing hp systems and np systems.

theorem 13 (hyperdag p systems can simulate symmetric neural p systems).
any symmetric np system can be simulated by an hp system, with the same number of steps and object transfers.

proof: the simulation details are given in our research report [8]. □

remark 14 we leave open here the case of non-symmetric np systems, which can also be simulated by hp systems, but at additional cost in terms of steps and object transfers.

figure 6: a computer network and its corresponding hypergraph representation (computers a–h, grouped by "ethernet bus 1", "ethernet bus 2" and a wireless cell).

figure 7: the digraph representation of the computer network of figure 6 (nodes a–h, with "ethernet bus 1", "ethernet bus 2" and the wireless cell as separate nodes).

5 planar representation of hyperdag p systems

classical tree-based p systems allow a "nice" planar representation, where the parent–child relationships between membranes are represented by venn-like diagrams. can we extend this representation to cover our dag-based hp systems? in this section, we show that any hp system, structurally based on a canonical dag, can still be intensionally represented by hierarchically nested planar regions, delimited by jordan curves (simple closed curves). conversely, we also show that any set of hierarchically nested planar regions delimited by jordan curves can be interpreted as a canonical dag, which can form the structural basis of a number of hp systems. we first show how to represent a canonical dag as a set of hierarchically nested planar regions.

algorithm 15 (algorithm for visually representing a canonical dag) without loss of generality, we consider a canonical dag (v, δ) of order n, where the vertices are indexed according to an arbitrary topological order implied by the arcs, considering parents before children, i.e., v = {vi | 1 ≤ i ≤ n}, where (vi, vj) ∈ δ implies i < j.
figure 8 shows, side by side, a simple height-1 canonical dag and its corresponding hypergraph representation. note the intensional representation (as opposed to the extensional one), where v2 is not totally included in v1, although all vertices included in v2, i.e., v4 and v5, are also included in v1. a possible topological order is v1, v2, v3, v4, v5.

figure 8: a simple canonical dag and its corresponding hypergraph representation.

for each vertex vi, we associate a distance ψi = 1/2^(n−i+1), for i ∈ {1, ..., n}. for figure 8, ψi = 1/32, 1/16, 1/8, 1/4, 1/2, for i ∈ {1, ..., 5}. we process the vertices in reverse topological order vn, ..., v1, at each step i representing the current vertex vi by a planar region ri. first, we set two parallel horizontal axes xo and xp, vertically separated by distance 3(n−1). secondly, we set points o1, ..., on on xo, such that oi and oi+1 are separated by distance 3, for 1 ≤ i ≤ n−1. we define oi as the origin point of vi, and write oi = origin(vi). finally, we set points p1, ..., pn on xp, such that pi and pi+1 are separated by distance 3, for 1 ≤ i ≤ n−1. we define pi as the corridor point of vi.

figure 9: construction of xo, xp, oi and pi, for the dag of figure 8, where n = 5.

if the current vertex vi is a sink, then ri is a circle with radius 1/2 centered at oi. if the current vertex vi is a non-sink, then ri is constructed as follows. assume that the children of vi are w1, ..., wni, and their (already created) regions are s1, ..., sni. consider line segments l0, l1, ..., lni, where l0 is bounded by oi and pi, and lj is bounded by pi and origin(wj), for j ∈ {1, ..., ni}. let l0, l1, ..., lni, t1, ..., tni be the regions enclosed by jordan curves around l0, l1, ..., lni, s1, ..., sni, at distance ψi, and let r′i = l0 ∪ (∪j=1,...,ni lj) ∪ (∪j=1,...,ni tj).
we define ri as the external contour of r′i. this definition discards all internal holes (such as the dashed enclosed regions of figure 10), if any, without introducing any additional containment relations between our regions. the details of our construction guarantee that no internal hole will ever contain an origin point. □

figure 10 shows, side by side, a dag and its corresponding planar region representation; internal holes are represented by dotted lines. our objective here was not to create "nice" visualizations, but to prove that it is possible to represent an arbitrary canonical dag, i.e., an arbitrary hp system structurally based on a canonical dag, by hierarchically nested planar regions. we next show that, for any finite set of hierarchically nested planar regions, we can build a corresponding canonical dag (i.e., the underlying structure of an hp system).

figure 10: a height-2 dag and its corresponding representation, built by algorithm 15.

algorithm 16 (algorithm for building a canonical dag from a finite set of hierarchically nested planar regions) assume that we have n hierarchically nested planar regions.
1. label each planar region by ri, i ∈ {1, ..., n}.
2. if ri directly nests rj, then draw an arc from a vertex vi to a vertex vj, i, j ∈ {1, ..., n}, i ≠ j. □

we now show that the digraph produced by algorithm 16 does not contain any cycles. our proof is by contradiction. assume that a digraph g produced by algorithm 16 contains a cycle vi, ..., vk, ..., vi. then every vertex in the cycle has an incoming arc. if vertex vk is a maximal element of the cycle with respect to direct nesting, then its corresponding planar region rk has the largest area among the planar regions in the cycle. since no other planar region in the cycle can contain rk, there is no arc incident to vertex vk — a contradiction. hence, there is no cycle in g.
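algorithms 15 and 16 are easy to prototype. the sketch below (names and data layout are our own) computes the fixed geometry of algorithm 15 and builds, then checks, the nesting dag of algorithm 16:

```python
from fractions import Fraction

def layout(n):
    """fixed geometry of algorithm 15: nesting distances psi_i = 1/2**(n-i+1),
    origin points o_i on axis xo and corridor points p_i on axis xp, both
    spaced 3 apart, with the two axes vertically separated by 3*(n-1)."""
    psi = [Fraction(1, 2 ** (n - i + 1)) for i in range(1, n + 1)]
    origins = [(3 * (i - 1), 0) for i in range(1, n + 1)]               # on xo
    corridors = [(3 * (i - 1), 3 * (n - 1)) for i in range(1, n + 1)]   # on xp
    return psi, origins, corridors

def nesting_dag(direct_nest):
    """algorithm 16: direct_nest maps each region index to the set of regions
    it directly nests; we draw one arc per pair, then confirm acyclicity by
    topological peeling, mirroring the maximal-region argument in the text."""
    arcs = {(i, j) for i, js in direct_nest.items() for j in js}
    nodes = set(direct_nest) | {j for js in direct_nest.values() for j in js}
    indeg = {v: 0 for v in nodes}
    for _, j in arcs:
        indeg[j] += 1
    frontier = [v for v in nodes if indeg[v] == 0]
    peeled = 0
    while frontier:
        v = frontier.pop()
        peeled += 1
        for a, b in arcs:
            if a == v:
                indeg[b] -= 1
                if indeg[b] == 0:
                    frontier.append(b)
    return sorted(arcs), peeled == len(nodes)    # acyclic iff all nodes peeled
```

for the dag of figure 8 (v1 nesting v3, v4, v5 and v2 nesting v4, v5), `layout(5)` reproduces the distances 1/32, ..., 1/2 and `nesting_dag` confirms acyclicity.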
remark 17 in [9] we present a solution to the problem of representing dags (that contain transitive arcs) by a set of simple regions, where direct containment denotes a parent–child relation.

6 summary

we have proposed a new model, as an extension of p systems, that provides a better communication structure and, we believe, is often more convenient for modelling real-world applications based on tree structures augmented with secondary or shared communication channels. we have shown that hp systems extend the basic functionality of transition p systems and np systems, even though the underlying structure of hp systems is different. in dag-based hp systems, we can have a natural separation of computing cells (sinks) from communication cells (hyperedges). this model also allows us to easily represent multiple inheritance or to distribute computational results (as specified by a dag) amongst several different parts of a membrane structure. we note that the operational behavior of hp systems is separate from the topological structure of a membrane system. in this paper, we illustrated hp systems using the computational rules of np systems, where multisets of objects are repeatedly changed within cells by a fixed set of multiset rewriting rules, or transferred between cells, using several possible transfer modes. finally, we provided an intuitive visualization of hp systems, by showing that any set of hierarchically nested planar regions (which represents any set of cells ordered by containment) is equivalent to, or modelled by, a dag without transitive arcs. we provided simple algorithms to translate between these two interpretations. this paper is part of an ongoing research effort dedicated to structured modelling and model checking of p systems.

dedication

this article is dedicated to mario j. pérez-jiménez, on the occasion of his 60th birthday (november 2008).

bibliography

[1] c.
berge, hypergraphs: combinatorics of finite sets, elsevier science publishers, 1989.
[2] c. carathéodory, theory of functions of a complex variable, vol. 1, chelsea publishing company, 1954.
[3] g. ciobanu, distributed algorithms over communicating membrane systems, biosystems, 70(2):123–133, 2003.
[4] w. f. doolittle, uprooting the tree of life, scientific american, 282(2):90–95, 2000.
[5] t.-o. ishdorj, m. ionescu, replicative-distribution rules in p systems with active membranes, proceedings of the first international colloquium on theoretical aspects of computing (ictac 2004), 68–83, 2004.
[6] c. li, validating p system as distributed computing models, master's thesis, 2008.
[7] c. martín-vide, g. păun, j. pazos, a. rodríguez-patón, tissue p systems, theoretical computer science, 296(2):295–326, 2003.
[8] r. nicolescu, m. j. dinneen, y.-b. kim, structured modelling with hyperdag p systems: part a, cdmtcs research report series, cdmtcs-342, 1–24, december 2008. http://www.cs.auckland.ac.nz/cdmtcs/researchreports/342hyperdaga.pdf
[9] r. nicolescu, m. j. dinneen, y.-b. kim, discovering the membrane topology of hyperdag p systems, proceedings of the tenth workshop on membrane computing (wmc10 2009), curtea de argeş, romania, august 24–27, 426–451, 2009.
[10] g. păun, computing with membranes, journal of computer and system sciences, 61(1):108–143, 2000 (and turku center for computer science, tucs report 208, november 1998).
[11] g. păun, membrane computing: an introduction, springer-verlag, 2002.
[12] g. păun, introduction to membrane computing, proceedings of the first brainstorming workshop on uncertainty in membrane computing, 17–65, 2004.
[13] g. păun, r. a. păun, membrane computing as a framework for modeling economic processes, seventh international symposium on symbolic and numeric algorithms for scientific computing (synasc 2005), 11–18, 2005.
[14] m. slikker, a.
van den nouweland, social and economic networks in cooperative game theory, kluwer academic publishers, 2001.
[15] v. i. voloshin, coloring mixed hypergraphs: theory, algorithms and applications, american mathematical society, 2002.
[16] c. zandron, c. ferretti, g. mauri, solving np-complete problems using p systems with active membranes, proceedings of the second international conference on unconventional models of computation, 289–301, 2000.

michael j. dinneen received his phd in 1996 from the university of victoria in canada. he is currently a senior lecturer at the university of auckland. his research interests are in combinatorial algorithms, graph algorithms, unconventional computing models and network design.

yun-bum kim is currently a phd student at the university of auckland, new zealand. in 2007 he completed an msc thesis on network optimization design problems, and he currently has interests in molecular and distributed computing.

radu nicolescu (phd bucharest, macm, memieee) is currently a senior lecturer at the university of auckland, new zealand. he has research interests in formal languages, information complexity and service-oriented computing.

int j comput commun, issn 1841-9836, vol. 7 (2012), no. 3 (september), pp. 459-472

inverse kinematics solution for robot manipulator based on neural network under joint subspace

yin feng, wang yao-nan, yang yi-min
the college of electrical and information engineering, hunan university, changsha, hunan province 410082, p.r. china
e-mail: yinfeng83@126.com, yaonan@hnu.cn, yimin-yang@126.com

abstract: neural networks, with their inherent learning ability, have been widely applied to solve robot manipulator inverse kinematics problems.
however, there are still two open problems: (1) without knowing the inverse kinematic expressions, these solutions face the difficulty of how to collect training sets, and (2) the gradient-based learning algorithms can cause a very slow training process, especially for a complex configuration or a large set of training data. unlike these traditional implementations, the proposed method trains the neural network in a joint subspace which can be easily calculated with an electromagnetism-like method. the kinematics equation and its inverse are a one-to-one mapping within this subspace; thus the constrained training sets can be easily collected by forward kinematics relations. for issue (2), this paper uses a novel learning algorithm called extreme learning machine (elm), which randomly chooses the input weights and analytically determines the output weights of single hidden layer feedforward neural networks (slfns). in theory, this algorithm tends to provide the best generalization performance at extremely fast learning speed. the results show that the proposed approach has not only greatly reduced the computation time but also improved the precision.

keywords: inverse kinematics, neural network, extreme learning machine.

1 introduction

the inverse kinematics (ik) problem for a serial-chain manipulator is to find the values of the joint positions given the position and orientation of the end-effector relative to the base. there are many solutions to the inverse kinematics problem [1], such as geometric, algebraic, and numerical iterative methods. in particular, some of the most popular methods are mainly based on inversion of the mapping established between the joint space and the task space by the jacobian matrix. this solution uses numerical iteration to invert the forward kinematic jacobian matrix and does not always guarantee producing all possible inverse kinematics solutions.
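a minimal sketch of the jacobian-pseudoinverse iteration just described, for an illustrative two-link planar arm with unit links (the arm model, helper names and step size are our assumptions, not from the paper):

```python
import numpy as np

def jacobian_ik_step(theta, fk, jac, p_target, alpha=0.5):
    """one pseudoinverse iteration of the classical numerical ik scheme:
    dtheta = J^+ (p_target - p_current); fk and jac are user-supplied
    forward kinematics and jacobian routines."""
    dtheta = np.linalg.pinv(jac(theta)) @ (np.asarray(p_target) - fk(theta))
    return theta + alpha * dtheta

# illustrative two-link planar arm (unit links) with its analytic jacobian
def arm_fk(t):
    return np.array([np.cos(t[0]) + np.cos(t[0] + t[1]),
                     np.sin(t[0]) + np.sin(t[0] + t[1])])

def arm_jac(t):
    s1, c1 = np.sin(t[0]), np.cos(t[0])
    s12, c12 = np.sin(t[0] + t[1]), np.cos(t[0] + t[1])
    return np.array([[-s1 - s12, -s12],
                     [ c1 + c12,  c12]])
```

iterating `jacobian_ik_step` from a nearby initial guess converges to one ik solution, but, as the text notes, which solution is reached depends on the starting point, and the iteration gives no enumeration of all solutions.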
the artificial neural network, which has significant flexibility and learning ability, has been used for the inverse kinematics problem. one solution followed a closed-loop control scheme where a neural network directly learns the nonlinear relationship between the displacement in the workspace and the control signal in the joint angle space to achieve a desired position ([2] and [3]). other schemes used neural networks to learn a mapping function from the world space to the joint space. although there are various neural networks, the multi-layer perceptron network (mlpn) and the radial basis function network (rbfn) are the most popular neural networks applied to functional approximation problems. in [4], the effects of structural parameters, iteration steps and different numbers of training points on the performance of the inverse kinematics approximation were investigated. the results showed that a more complex mlpn configuration is likely to produce a more accurate inverse kinematics approximation. however, it also leads to the number of iterations increasing significantly to satisfy the required training goal. similarly, the neural network's generalization ability seems to improve when the number of training sets is increased. however, if the numbers of hidden neurons or training sets are too large, the training process cannot even converge to an expected error goal in some cases. in [5], an mlpn with various structures of the input layer was proposed to solve the inverse kinematics problem of a 6-dof manipulator. three different forms representing the orientation of the end-effector with respect to the base were defined: a 3×3 rotation matrix, a set of 3 euler angles, and one angle and a 1×3 unit vector. another solution, combining an mlpn and a lookup table to solve the inverse kinematics problem of a redundant manipulator, was proposed in [6].
although the use of mlpns for the inverse kinematics problem is widespread, it has some significant disadvantages. for example, there is no reasonable mechanism to select a suitable network configuration relating to the system characteristics represented by the training sets. in addition, training an mlpn using the back-error propagation algorithm is complex and slow. for the complex mlpn structure required by a complex-configuration manipulator, or for a large set of training data, the training process is slow to converge to a specific goal. therefore, trends towards using rbfns, which are conceptually simpler and possess the ability to model any nonlinear function conveniently, have become more popular. in [7], a variety of network configurations based on rbfn were developed to explore the effect of various network configurations on the performance of the network. in [8], a novel architecture of rbfn with two hidden layers was developed for the inverse kinematics problem of a 3-link manipulator. a fusion approach was proposed in [9]; it used an rbfn for prediction of incremental joint angles, which in turn were transformed into absolute joint angles with the assistance of forward kinematics relations. another rbfn-based method was presented in [10]. it developed a structure of six parallel rbfns, each of which consists of six inputs, representing a location of the end-effector, and one output, the joint angle. thus, the group of six parallel rbfns (one for each joint angle) could perform an inverse kinematics approximation. in addition, some hybrid techniques made use of neural networks along with expert systems [11], fuzzy logic [12] and genetic algorithms [13] for solving the inverse kinematics. though these intelligent approaches can be applied to two- or three-dof planar robots, they often demand high performance computing systems and complex computer programming for complex robotic systems.
traditionally, all the parameters (weights and biases) of feedforward networks need to be tuned. for the past decades, gradient descent-based methods have mainly been used in various learning algorithms. it is clear that the learning process often needs many training patterns and iterations to cover the entire workspace. thus, it is not surprising that it may take several hours, or even several days, to train neural networks to solve the inverse kinematics. unlike these traditional implementations, this paper uses a novel learning algorithm called extreme learning machine (elm) for single hidden layer feedforward neural networks (slfns), which randomly chooses the input weights and biases, and analytically determines the output weights of the slfn [14, 15]. in theory, this algorithm tends to provide the best generalization performance at extremely fast learning speed. another issue of concern for solving the inverse kinematics using neural networks is the training data sets. as we know, the joint space of the robot can be considered as an inverse image of the cartesian space, and vice versa. thus, the forward kinematics can be assumed to be an inverse image of the inverse kinematics, and vice versa [9]. the pose p can be used as an input and the corresponding joint angle q as the output for the neural network training data. in other words, the q → p relationship is used while generating the data, whereas the p → q mapping is done while training the neural network. usually, inverse kinematics problems have multiple solutions. for example, the puma 560 robot has at most eight solutions when there are no joint limits imposed. hence, the inverse kinematics equation is a one-to-many mapping. unfortunately, the neural network then cannot match the actual output with the desired output, so the learning error of the neural network is hard to calculate during training. an effective solution is to constrain the training sets to only one solution set, so that a one-to-one mapping can be achieved.
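the one-to-many issue is easy to demonstrate on the standard two-link planar arm (a textbook model, used here only for illustration; the helper names are our own): the same end-effector point has an elbow-down and an elbow-up joint solution.

```python
import numpy as np

def two_link_fk(t1, t2, l1=1.0, l2=1.0):
    """forward kinematics of a planar two-link arm."""
    return np.array([l1 * np.cos(t1) + l2 * np.cos(t1 + t2),
                     l1 * np.sin(t1) + l2 * np.sin(t1 + t2)])

def two_link_ik(x, y, l1=1.0, l2=1.0):
    """both closed-form ik solutions (elbow-down and elbow-up) of a
    reachable point: the same pose maps back to two joint vectors."""
    c2 = (x * x + y * y - l1 * l1 - l2 * l2) / (2 * l1 * l2)
    s2 = np.sqrt(max(0.0, 1.0 - c2 * c2))
    sols = []
    for sgn in (+1.0, -1.0):
        t2 = np.arctan2(sgn * s2, c2)
        t1 = np.arctan2(y, x) - np.arctan2(l2 * np.sin(t2), l1 + l2 * np.cos(t2))
        sols.append((t1, t2))
    return sols
```

both joint vectors returned by `two_link_ik` reach the same point, which is exactly why the training set must be restricted to one solution branch before a network can learn the pose-to-angle map.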
for a simple structure, such as a two-link planar manipulator, the training sets can be collected based on the inverse kinematics equation, keeping only either the positive or the negative solution. however, this approach faces the difficulty of how to collect constrained data without knowing the inverse kinematic expressions of a complex robotic system. the present work attempts to resolve this crucial issue by using a novel heuristic algorithm, called the electromagnetism-like method (em) [16, 17], to determine a joint subspace which includes one and only one inverse kinematics solution for a given pose. for convenience's sake, a graphic depiction of the proposed method is illustrated by a 2d example, as shown in figure 1.

figure 1: an illustration of the proposed algorithm. points 1 and 2 are true solutions and point 3 is an approximate solution.

in figure 1, assume that there are two inverse solutions in the whole joint space. one approximate solution is denoted by (θ∗1, θ∗2). if we select appropriate dθ1 and dθ2 such that θ1 ∈ [θ∗1 − dθ1, θ∗1 + dθ1] and θ2 ∈ [θ∗2 − dθ2, θ∗2 + dθ2], this joint subspace includes just one true solution, point 1. based on this, the data required for training the neural network is derived from the joint subspace with the forward kinematics relations, instead of deriving a set of complex inverse kinematics equations over the whole joint space. then the true solution, point 1, can be approached by using the trained network. the proposed method can be summarized as follows:
1. given a desired coordinate of position and orientation of the end-effector, make use of em to calculate an approximate solution near one true solution.
2.
specify an appropriate value of dθk such that θk ∈ [θ∗k − dθk, θ∗k + dθk] (k = 1, 2, ..., n), where n is the number of joints and θ∗k is the approximate solution of the k-th joint variable calculated with em in step 1. for the sake of simplicity, all dθk can be set to the same value.
3. collect the training sets from the joint subspace, and train the neural network with elm.

for a new coordinate of position and orientation of the end-effector, the neural network usually needs to be retrained following the steps above. fortunately, our results show that the training process is very fast; thus, the retraining procedure appears to be acceptable.

2 calculation of the joint subspace with em

2.1 problem formulation

as shown in figure 2, the desired position vector and orientation matrix of a manipulator end-effector are denoted by pd and [rd] = [d1, d2, d3], where dj (j = 1, 2, 3) are unit vectors along the xd, yd, zd axes. ph is the current position vector of the end-effector. the current orientation matrix is defined by [rh] = [h1, h2, h3], where hj (j = 1, 2, 3) are unit vectors along the xh, yh, zh axes, and the joint variables are denoted by the n×1 vector θ = [θ1, θ2, ..., θn]t. the error between the current and the desired locations of the end-effector can be described by the following functions [18]:

figure 2: the current and the desired end-effector configurations.

position error: ∆p(θ) = ∥pd − ph(θ)∥ (1)
orientation error: ∆o(θ) = ∑_{j=1}^{3} (dj · hj(θ) − 1)² (2)
the total error: e(θ) = ∆p(θ) + ∆o(θ) (3)
where (·) denotes the vector dot product. furthermore, the total error can be chosen to be a weighted sum of the position and orientation components:
e(θ) = wp∆p(θ) + wo∆o(θ) (4)
where wp and wo are weighting factors assigned to position and orientation, respectively, such that wp + wo = 1.
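the error functions (1)–(4) can be written down directly. in the sketch below, `fk` stands for a user-supplied forward kinematics routine (an assumption on our part) returning the current position and the 3×3 orientation matrix [h1 h2 h3]:

```python
import numpy as np

def pose_error(theta, fk, p_d, R_d, wp=0.5, wo=0.5):
    """weighted pose error of eq. (4): wp * position error (eq. (1)) plus
    wo * orientation error (eq. (2)), with wp + wo = 1."""
    p_h, R_h = fk(theta)
    dp = np.linalg.norm(np.asarray(p_d) - p_h)                 # eq. (1)
    do = sum((float(np.dot(np.asarray(R_d)[:, j], R_h[:, j])) - 1.0) ** 2
             for j in range(3))                                # eq. (2)
    return wp * dp + wo * do                                   # eq. (4)
```

at an exact solution the current frame coincides with the desired one, each dot product dj · hj equals 1, and the error is zero.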
now the inverse kinematics problem is to find a solution θ∗ such that e(θ∗) ≤ ε (ε → 0). it is clear that this problem can be transformed into the following minimization problem:
min e(θ) s.t. θ ∈ {θ ∈ ℜn | lk ≤ θk ≤ uk, k = 1, 2, ..., n} (5)

2.2 brief of the electromagnetism-like method (em)

to solve the problem in (5), the general scheme of em is given by the following procedures: initialization, local search, calculation of charge and total force vector, and movement according to the total force.

initialization. the initialization procedure samples m points, θ1, ..., θm, randomly from the feasible domain of the joint variables, where θi = [θi1, ..., θin] (i = 1, ..., m). uniform sampling can be done by
θik = lk + rand · (uk − lk), k = 1, 2, ..., n (6)
the procedure ends with m points identified, and the point that has the best function value is stored in θbest.

local search. the local search procedure gathers local information and improves the current solutions. it can be applied to one or to many points for local refinement per iteration; the choice between these two options does not affect the convergence result.

calculation of charge and total force vector. the charges of the points are calculated according to their objective function values; the charge of each point is not constant and changes from iteration to iteration. the charge of the i-th point, qi, is evaluated as
qi = exp(−n (e(θi) − e(θbest)) / ∑_{k=1}^{m} (e(θk) − e(θbest))), i = 1, 2, ..., m (7)
in this way, points that have better objective values possess higher charges. notice that, unlike electrical charges, no signs are attached to the charge of an individual point in (7). instead, the direction of a particular force between two points is decided after comparing their objective function values.
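the charge computation of eq. (7), together with the force and movement steps described next, can be sketched in python (a sketch under the paper's equations; array shapes, helper names and the uniform step length are implementation assumptions):

```python
import math
import numpy as np

def charges(errors, n_dim):
    """em charges, eq. (7): the best point gets charge exp(0) = 1 and
    worse points get exponentially smaller charges."""
    best = min(errors)
    denom = sum(e - best for e in errors)
    if denom == 0:                        # degenerate case: all points equal
        return [1.0] * len(errors)
    return [math.exp(-n_dim * (e - best) / denom) for e in errors]

def em_step(points, errors, lower, upper, rng=None):
    """one em iteration: total force (eq. (8)) then movement (eq. (9)).
    points is an (m, n) array of candidate joint vectors; the best point
    is kept fixed, acting as the absolute attractor."""
    if rng is None:
        rng = np.random.default_rng(0)
    m, n_dim = points.shape
    q = charges(errors, n_dim)
    best = int(np.argmin(errors))
    moved = points.copy()
    for i in range(m):
        if i == best:
            continue
        F = np.zeros(n_dim)
        for j in range(m):
            if j == i:
                continue
            d = points[j] - points[i]
            dist2 = float(np.dot(d, d))
            if dist2 == 0.0:
                continue
            f = q[j] * q[i] / dist2
            F += d * f if errors[j] < errors[i] else -d * f   # attract / repel
        norm = np.linalg.norm(F)
        if norm > 0:
            step = rng.random() * F / norm       # random step length in [0, 1)
            feas = np.where(step > 0, upper - points[i], points[i] - lower)
            moved[i] = points[i] + step * feas   # stays inside [lower, upper]
    return moved
```

scaling each component of the normalized force by the remaining feasible range (the `rng` vector of eq. (9)) keeps every moved point within the joint limits.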
hence, the total force fi exerted on point i is computed as a sum over all other points j ≠ i:
fi = ∑_{j≠i} fij, where fij = (θj − θi) qjqi / ∥θj − θi∥² if e(θj) < e(θi), and fij = (θi − θj) qjqi / ∥θj − θi∥² otherwise (8)
according to (8), a point with a better objective function value attracts the other one; conversely, a point with a worse objective function value repels the other. since θbest has the minimum objective function value, it acts as an absolute point of attraction, pulling all other points in the population towards a better region.

movement according to the total force. after evaluating the total force vector fi, point i is moved in the direction of the force by a random step length, as in (9); the random step length λ is assumed to be uniformly distributed between 0 and 1:
θi = θi + λ (fi / ∥fi∥) rng, i = 1, 2, ..., m (9)
in (9), rng is a vector whose components denote the allowed feasible movement toward the upper bound uk or the lower bound lk of the joint variables. after finishing the above procedures, the positions of the points are updated, and one iteration of em is complete.

take figure 3 for example. there are three particles, with objective values f(x1) = 15, f(x2) = 5 and f(x3) = 10. because particle 1 is worse than particle 3, while particle 2 is better than particle 3, particle 1 exerts a repulsive force f13 on particle 3, and particle 2 attracts particle 3 towards its neighborhood (force f23). consequently, particle 3 moves along the total force f.

figure 3: an example of the attract-repel effect on particle number 3 (f13 is a repulsive force, f23 an attractive force, and f the total force).

2.3 performance evaluation of em in solving the inverse kinematics

this example is used to examine the precision of the approximate solution calculated by em, which directly impacts the choice of the interval width dθ. the robot structure for this example is based on the puma 560.
The link parameters are given in Table 1.

Table 1. The link parameters of the PUMA 560 robot

Joint | Link length (m) | Twist angle (deg) | Offset length (m) | Joint limitations (deg)
1 | 0 | -90 | 0.6604 | [-160, 160]
2 | 0.4320 | 0 | 0.2000 | [-225, 45]
3 | 0 | 90 | -0.0505 | [-45, 225]
4 | 0 | -90 | 0.4320 | [-110, 170]
5 | 0 | 90 | 0.0000 | [-100, 100]
6 | 0 | 0 | 0.0565 | [-266, 266]

The desired configuration of the end-effector is given by p_d = [0.7433, 0.3111, 0.7883] (m), with d1 = [-0.6366, 0.7712, -0.0084], d2 = [0.0227, 0.0296, 0.9993] and d3 = [0.7709, 0.6359, -0.0364]. Note that there are multiple solutions within the joint limitations shown in Table 1. For the sake of simplicity, the limitations of joints 1 and 3 are rearranged to [-120, 160] and [-45, 120], respectively. For the given coordinates of the end-effector there is an exact solution θ = [10 20 30 40 50 60] (degrees) within the adjusted joint limitations, so the error between an approximate solution and the true solution can be calculated easily. The stopping criterion for EM is defined by ε = 0.01; in other words, the calculation stops when the total error (see (3)) falls below ε. In this example, 100 trials were conducted for EM and the maximum absolute error at each joint angle is shown in Figure 4. From Figure 4, the widths of the joint limitations should be set to at least about 20°, i.e. dθ = 10°, which guarantees that the one-to-one mapping is achieved. It should be noted that EM is not suitable for high-precision applications: as can be seen in Figure 5, the number of evaluations increases drastically with the required precision. However, during the early stage of the computation the EM algorithm is highly efficient; thus EM is well suited for providing a good initial guess.

3 Model the inverse kinematics with a neural network

The architecture used for solving the inverse kinematics problem is shown in Figure 6.
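For reference, the PUMA 560 link parameters of Table 1 above can be assembled into a forward-kinematics routine (a sketch assuming the standard Denavit-Hartenberg convention; the paper does not state which DH variant it uses, so the resulting pose may differ from the authors' by a fixed frame choice):

```python
import numpy as np

# Table 1 parameters per joint: (link length a [m], twist alpha [deg], offset d [m])
DH = [(0.0,    -90, 0.6604),
      (0.4320,   0, 0.2000),
      (0.0,     90, -0.0505),
      (0.0,    -90, 0.4320),
      (0.0,     90, 0.0000),
      (0.0,      0, 0.0565)]

def dh_matrix(theta_deg, a, alpha_deg, d):
    """Single-link homogeneous transform in the standard DH convention."""
    th, al = np.radians(theta_deg), np.radians(alpha_deg)
    ct, st, ca, sa = np.cos(th), np.sin(th), np.cos(al), np.sin(al)
    return np.array([[ct, -st * ca,  st * sa, a * ct],
                     [st,  ct * ca, -ct * sa, a * st],
                     [0.,  sa,       ca,      d],
                     [0.,  0.,       0.,      1.]])

def forward_kinematics(joints_deg):
    """Compose the six link transforms; returns the 4x4 end-effector pose."""
    T = np.eye(4)
    for th, (a, al, d) in zip(joints_deg, DH):
        T = T @ dh_matrix(th, a, al, d)
    return T
```

A routine like this is what the total error E(θ) of (3) would be evaluated against when EM searches the joint box.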
The single-layer network consists of n outputs (the joint angles) and 12 inputs [n, s, a, p], which represent a location (position and orientation) of the end-effector. As mentioned earlier, the training set is constrained to only one solution set so that the one-to-one mapping can be achieved. For the present work, a fast and accurate learning algorithm called the extreme learning machine (ELM) is used to train the neural network that models the inverse kinematics of the robot.

Figure 4: Maximum absolute error at each joint angle among 100 trials.
Figure 5: The performance of EM (total error versus steps).
Figure 6: A general structure of the SLFN used to approximate the inverse kinematics (inputs: position and orientation of the end-effector; outputs: joint angles).

Test results show that the learning speed of the ELM algorithm is much faster than that of traditional methods; for example, ELM learns at least 1000 and 2000 times faster than BP and SVM, respectively, on the California Housing benchmark problem [14]. Thus this new training method is very suitable for solving the inverse kinematics.

3.1 Brief of the extreme learning machine (ELM)

ELM is a unified framework for single-hidden-layer feedforward networks (SLFNs) in which the hidden nodes are randomly generated, independently of the training data. The output of an SLFN with L hidden nodes can be represented by

f_L(x) = \sum_{i=1}^{L} \beta_i \, g(a_i, b_i, x), \quad x \in \mathbb{R}^n, \ a_i \in \mathbb{R}^n \qquad (10)

where a_i and b_i are the learning parameters of the hidden nodes and β_i is the weight connecting the i-th hidden node to the output node. g(a_i, b_i, x) is the output of the i-th hidden node with respect to the input x. Additive and RBF hidden nodes are often used in applications. For an additive hidden node with activation function g(x) (e.g.
sigmoid, threshold, sin, etc.), g(a_i, b_i, x) is given by

g(a_i, b_i, x) = g(a_i \cdot x + b_i), \quad b_i \in \mathbb{R} \qquad (11)

where a_i is the weight vector connecting the input layer to the i-th hidden node and b_i is the bias of the i-th hidden node; a_i · x denotes the inner product of the vectors. For an RBF hidden node with activation function g(x) (e.g. Gaussian), g(a_i, b_i, x) is given by

g(a_i, b_i, x) = g(b_i \| x - a_i \|), \quad b_i \in \mathbb{R}^+ \qquad (12)

where a_i and b_i are the center and impact factor of the i-th RBF node, and ℝ+ denotes the set of all positive real values. The RBF network is a special case of an SLFN with RBF nodes in its hidden layer: each RBF node has its own centroid and impact factor, and its output is a radially symmetric function of the distance between the input and the center. For a given set of training samples (x_j, t_j), j = 1, ..., N, in ℝ^n × ℝ^m, if the outputs of the network are equal to the targets, we have

f_L(x_j) = \sum_{i=1}^{L} \beta_i \, g(a_i, b_i, x_j) = t_j, \quad j = 1, 2, \dots, N \qquad (13)

The above equations can be written compactly as

H \beta = T \qquad (14)

where

H = \begin{bmatrix} g(a_1, b_1, x_1) & \cdots & g(a_L, b_L, x_1) \\ \vdots & \cdots & \vdots \\ g(a_1, b_1, x_N) & \cdots & g(a_L, b_L, x_N) \end{bmatrix}_{N \times L} \qquad (15)

\beta = \begin{bmatrix} \beta_1^T \\ \vdots \\ \beta_L^T \end{bmatrix}_{L \times m} \quad \text{and} \quad T = \begin{bmatrix} t_1^T \\ \vdots \\ t_N^T \end{bmatrix}_{N \times m} \qquad (16)

β^T is the transpose of the matrix or vector β. H is called the hidden-layer output matrix of the network; the i-th column of H is the i-th hidden node's output vector with respect to the inputs x_1, ..., x_N, and the j-th row of H is the output vector of the hidden layer with respect to the input x_j. Usually, when the number of training data is larger than the number of hidden nodes (N > L), one cannot expect an exact solution of system (14). After the hidden nodes are randomly generated and the training data are given, the hidden-layer output matrix H is known and need not be tuned.
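With random (a_i, b_i), assembling the matrix H of (15) for additive sigmoid nodes (11) is a one-liner in matrix form (a sketch; the variable names are ours):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def hidden_matrix(X, A, b):
    """H[j, i] = g(a_i . x_j + b_i) for additive sigmoid nodes (Eqs. 11, 15).

    X is N x n (samples in rows), A is L x n (one hidden node per row),
    b has length L; the result is the N x L matrix H."""
    return sigmoid(X @ A.T + b)
```

Each column of the result is one hidden node evaluated over all inputs, matching the column/row description of H above.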
Thus, training an SLFN simply amounts to solving the linear system (14) for the output weights β. Under the constraint of the minimum-norm least-squares solution, i.e. min ‖β‖ and min ‖Hβ − T‖, the solution of system (14) is given explicitly as

\hat{\beta} = H^{\dagger} T \qquad (17)

where H† is the Moore-Penrose generalized inverse of the hidden-layer output matrix H. The simple learning algorithm can be summarized as follows.

Algorithm ELM. Given a training set {(x_j, t_j) | x_j ∈ ℝ^n, t_j ∈ ℝ^m, j = 1, ..., N}, an activation function g(x) and a hidden-node number L:
Step 1: assign arbitrary input weights a_i and biases b_i, i = 1, ..., L;
Step 2: calculate the hidden-layer output matrix H;
Step 3: calculate the output weights β = H†T, where H, β and T are defined as in (15) and (16).

4 Performance evaluation and discussion

Example 1. This simple example demonstrates that a neural network trained with constrained data produces a better approximation of the inverse kinematics function. An RBFN is used to approximate the inverse kinematics function of a two-link manipulator. It consists of two revolute joints and two links of the same length, 30 mm. Two coordinate values x, y describe the position of the tip of the manipulator. The forward kinematics is

x = l_1 \cos\theta_1 + l_2 \cos(\theta_1 + \theta_2), \quad y = l_1 \sin\theta_1 + l_2 \sin(\theta_1 + \theta_2) \qquad (18)

The inverse kinematics can be described by

\theta_2 = \operatorname{atan2}\!\left( \pm\sqrt{1 - \left( \frac{x^2 + y^2 - l_1^2 - l_2^2}{2 l_1 l_2} \right)^{\!2}}, \ \frac{x^2 + y^2 - l_1^2 - l_2^2}{2 l_1 l_2} \right) \qquad (19)

\theta_1 = \operatorname{atan2}(y, x) - \operatorname{atan2}(l_2 \sin\theta_2, \ l_1 + l_2 \cos\theta_2) \qquad (20)

Given a desired configuration of the end-effector, there are usually two true solutions, corresponding to the lower-elbow and the upper-elbow structure, respectively. We test the neural network in three different cases:
1. the training set is randomly sampled from the whole joint space;
2. the training set is randomly sampled from the constrained joint space that contains only the positive solution (the + sign in (19));
3. the training set is randomly sampled from the sub joint space.

All simulations were carried out in a MATLAB 6.5 environment running on an Intel(R) Core(TM)2 Duo 3.00 GHz PC. The training of the neural network is executed with the MATLAB function "newrb". The root-mean-squared (RMS) error goal is 0.001 and the number of training samples is 1000.

Figure 7: The training convergence performance (case 1, training time 84.8590 s; case 2, training time 10.5620 s; case 3, training time 1.3440 s).

As can be seen in Figure 7, the same network trained with the constrained training data shows better convergence; moreover, the network trained within the sub joint space performs best. For case 1, the training set contains the many-to-one mapping from the joint space to the Cartesian space, which may be one reason for the training failure.

Example 2. In this example, the performance of the newly proposed ELM algorithm and of the gradient-based learning algorithm is compared on the inverse kinematics of the PUMA robot. The desired configuration of the end-effector is the same as in Section 2.3.

Test 1: training the network with the traditional algorithm. First, the approximate solution is calculated by EM. Setting dθ = 30°, one subspace is determined for each joint, as shown in Table 2.

Table 2. One group of joint subspaces

Joint number | Joint 1 | Joint 2 | Joint 3 | Joint 4 | Joint 5 | Joint 6
Subrange (deg) | [-20.1, 39.9] | [-8.7, 51.3] | [2.1, 62.1] | [2.9, 62.9] | [11.9, 71.9] | [37.6, 97.6]

Next, training sets of different sizes (100, 500 and 1000 samples, respectively) are sampled randomly from the sub joint space. Another 500 samples are used to test the performance of the neural network.
The root-mean-squared error goal and the maximum number of neurons are set to 10^{-8} and 500, respectively, and the spreads in the three cases are experimentally selected as 3, 1.2 and 1.2 so that the RBFN achieves an appropriate performance. The training steps are repeated until the network's root-mean-squared error falls below the goal or the maximum number of neurons is reached. Figure 8 shows the training convergence obtained with the different training sizes. The training time increases greatly with the number of training data, as can be seen in Figure 8. Although the training error fails to reach the goal of 10^{-8} with 500 and 1000 training samples, all three trained networks achieve a good approximation, since the training error reaches 10^{-4} in all three cases, which is an acceptable result for inverse kinematics. These conclusions are also confirmed by the following results. Figures 9, 10 and 11 show the testing root-mean-square (RMS) error at each joint angle for the corresponding networks trained above. The RMS error is very small. In addition, the networks trained with 500 and 1000 samples perform similarly, and the generalization of both is better than that of the network trained with 100 samples; this occurs because less training data reduces the generalization of the network. However, taking the training time into account, the network trained with the smaller data size appears to be a better choice.

Figure 8: Network training convergence with different training sizes (training size 1000, training time 100.0780 s; training size 500, training time 44.320 s; training size 100, training time 0.8750 s).
Figure 9: Absolute error at each joint angle using the 100-sample training set.

Test 2: training the network with ELM. In this example, a single feedforward network with a sigmoidal additive activation function is used.
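The ELM training procedure (Steps 1-3 above, with Eq. (17)) can be sketched end to end (a self-contained sketch in our own notation; the [-1, 1] weight range matches the setting used in Test 2):

```python
import numpy as np

def elm_train(X, T, L, rng=None):
    """Algorithm ELM: random sigmoid hidden layer, then beta = pinv(H) @ T."""
    if rng is None:
        rng = np.random.default_rng(0)
    n = X.shape[1]
    A = rng.uniform(-1.0, 1.0, (L, n))          # Step 1: random input weights
    b = rng.uniform(-1.0, 1.0, L)               # Step 1: random biases
    H = 1.0 / (1.0 + np.exp(-(X @ A.T + b)))    # Step 2: hidden output matrix
    beta = np.linalg.pinv(H) @ T                # Step 3: Eq. (17)
    return A, b, beta

def elm_predict(X, A, b, beta):
    H = 1.0 / (1.0 + np.exp(-(X @ A.T + b)))
    return H @ beta
```

The only trained quantities are the output weights β, which is why a single pseudoinverse replaces the iterative weight updates of gradient-based training.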
For ELM, the input weights and biases are randomly chosen from the range [-1, 1]. To compare the results of ELM with those of the gradient-based learning algorithm in Test 1, both groups of tests use the same training/testing sets. Figure 12 shows the training RMS errors for different numbers of hidden nodes in the three cases; the corresponding testing RMS errors are plotted in Figure 13. The average training times are 0.0014 s, 0.0056 s and 0.012 s, respectively. As observed from Figures 12 and 13, the networks trained with the three groups of training data generally perform similarly.

Figure 10: Absolute error at each joint angle using the 500-sample training set.
Figure 11: Absolute error at each joint angle using the 1000-sample training set.

Furthermore, the lowest validation error is achieved when the number of hidden nodes lies within the range [15, 50]. The results show that the generalization performance obtained with the ELM algorithm is very close to that of the gradient-based learning algorithm; however, the ELM algorithm is simpler to conduct and runs much faster. According to our results, the average learning speed of the ELM algorithm is at least 1000 times faster than that of the gradient-based learning algorithm.

Figure 12: The training RMS error (degrees) with ELM for training sizes (a) 100, (b) 500 and (c) 1000.

Example 3. This example demonstrates that the proposed method can be used for continuous joint-space trajectory planning.
The robot structure for this example is again based on the PUMA 560 robot. The desired trajectory of the end-effector is a circle centered at (0.2, 0.05, 0.5) (m) with respect to the base coordinate frame, with a radius of 0.2 (m). The trajectory is discretized into 72 equally spaced points. To ensure the existence of a solution, the joint limitations are relaxed in this example. Moreover, since multiple solutions do exist, a unique orientation is assigned in order to prevent a sudden jump to another solution, and the approximate solution for each successive point is taken from the solution of the preceding point: if the calculated solution at point k is denoted θ_k, the joint variable limitations for point k+1 are set to [θ_k − dθ, θ_k + dθ] instead of re-computing the approximate solution. The computed joint trajectories and the corresponding total error (the sum of the position and orientation errors) are plotted in Figures 14 and 15, respectively. It should be noted that the trajectory in Figure 14 is just one of the multiple possible trajectories for the PUMA robot.

Figure 13: The testing RMS error with ELM (training data sizes 100, 500 and 1000).
Figure 14: Computed trajectories of the joints (theta1 through theta6 versus track point).
Figure 15: Total error (position and orientation error) at each track point.

The results in Figure 15 show that the idea of using a neural network has produced an excellent approximation of the inverse kinematics function.
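The trajectory bookkeeping of Example 3 (72 points on the circle, then a sliding joint window around the previous solution) can be sketched as follows; the plane of the circle is not stated in the paper, so placing it parallel to the x-y plane is our assumption:

```python
import numpy as np

def circle_points(center, radius, n=72):
    """Discretize the desired circular path into n equally spaced points
    (assumed here to lie in a plane parallel to x-y at z = center[2])."""
    ang = np.linspace(0, 2 * np.pi, n, endpoint=False)
    return np.stack([center[0] + radius * np.cos(ang),
                     center[1] + radius * np.sin(ang),
                     np.full(n, center[2])], axis=1)

def next_subspace(theta_prev, dtheta):
    """Joint window [theta_k - dtheta, theta_k + dtheta] for point k+1,
    reusing the previous solution instead of re-running EM."""
    return theta_prev - dtheta, theta_prev + dtheta
```

Chaining `next_subspace` from point to point is what keeps the computed trajectory on a single solution branch.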
Although neural network solutions are usually not suited for high-precision robotic applications, high-precision results are achieved here. This occurs because the joint variables are limited to a small space when training the network.

5 Conclusions

The proposed hybrid approach combines the electromagnetism-like method and a neural network to solve the inverse kinematics problem. Unlike traditional neural network approaches that generate the training data from the whole joint space, the neural network in the proposed approach collects the training data from a sub joint space, in which the training set is constrained to only one solution set so that the one-to-one mapping is achieved. Another important feature of the proposed approach is the use of an efficient learning algorithm, ELM, to train the neural network. The learning speed of this novel training algorithm can be thousands of times faster than that of traditional feedforward network learning algorithms, while obtaining better generalization performance. The results show that the proposed hybrid approach not only greatly reduces the computation time but also improves the precision.

Bibliography

[1] Bruno Siciliano, Oussama Khatib, Springer Handbook of Robotics, Springer Press, 2008.
[2] H. Jack, D.M.A. Lee, R.O. Buchal, W.H. Elmaraghy, Neural networks and the inverse kinematics problem, Journal of Intelligent Manufacturing, 4:43-66, 2003.
[3] F.L. Lewis, Neural network control of robot manipulators, IEEE Expert, 11(3):64-75, 1996.
[4] B.B. Choi, C. Lawrence, Inverse kinematics problem in robotics using neural networks, NASA Technical Memorandum 105869.
[5] Z. Bingul, H.M. Ertunc, C. Oysu, Comparison of inverse kinematics solutions using neural network for 6R robot manipulator with offset, in Proceedings of the 2005 Congress on Computational Intelligence Methods and Applications, pp. 1-5.
[6] A.S. Morris, A. Mansor, Finding the inverse kinematics of manipulator arm using artificial neural network with lookup table, Robotica, 15:617-625, 1997.
[7] J.A. Driscoll, Comparison of neural network architectures for the modeling of robot inverse kinematics, in Proceedings of the 2000 IEEE, 3:44-51, 2000.
[8] S.S. Yang, M. Moghavvemi, John D. Tolman, Modelling of robot inverse kinematics using two ANN paradigms, in Proceedings of TENCON 2000, Intelligent Systems and Technologies for the New Millennium, 3:173-177, 2000.
[9] Shital S. Chiddarwar, N. Ramesh Babu, Comparison of RBF and MLP neural networks to solve inverse kinematic problem for 6R serial robot by a fusion approach, Engineering Applications of Artificial Intelligence, 23(7):1083-1092, 2010.
[10] P.Y. Zhang, T.S. Lu, L.B. Song, RBF networks-based inverse kinematics of 6R manipulator, International Journal of Advanced Manufacturing Technology, 26:144-147, 2004.
[11] Eimei Oyama, Arvin Agah, Karl F., A modular neural architecture for inverse kinematics model learning, Neurocomputing, 38(40):797-805, 2001.
[12] Srinivasan Alavandar, M.J. Nigam, Neuro-fuzzy based approach for inverse kinematics solution of industrial robot manipulators, International Journal of Computers, Communications & Control, 3(3):224-234, 2008.
[13] P. Kalra, N.R. Prakash, A neuro-genetic algorithm approach for solving inverse kinematics of robotic manipulators, IEEE International Conference on Systems, Man and Cybernetics, 2:1979-1984, 2003.
[14] Guang-Bin Huang, Qin-Yu Zhu, Chee-Kheong Siew, Extreme learning machine: a new learning scheme of feedforward neural networks, in Proceedings of the IEEE International Joint Conference on Neural Networks, 2:985-990, 2004.
[15] Guang-Bin Huang, Lei Chen, Enhanced random search based incremental extreme learning machine, Neurocomputing, 71(16-18):3460-3468, 2008.
[16] S.I. Birbil, S.C. Fang, An electromagnetism-like mechanism for global optimization, Journal of Global Optimization, 23(3):263-282, 2003.
[17] S.I. Birbil, S.C. Fang, R.L. Sheu, On the convergence of a population-based global optimization algorithm, Journal of Global Optimization, 30:301-318, 2004.
[18] L.C.T. Wang, C.C. Chen, A combined optimization method for solving the inverse kinematics problem of mechanical manipulators, IEEE Transactions on Robotics and Automation, 7(4):489-499, 1991.

International Journal of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 3, pp. 304-316

Surface Roughness Image Analysis Using Quasi-Fractal Characteristics and Fuzzy Clustering Methods

Tiberiu Vesselenyi, Ioan Dzitac, Simona Dzitac, Victor Vaida

Abstract: In this paper the authors describe the results of experiments on surface roughness image acquisition and processing, aimed at developing an automated roughness control system. This implies finding a characteristic roughness parameter (for example Ra) on the basis of the information contained in the image of the surface. To achieve this goal we use quasi-fractal characteristics and fuzzy clustering methods.

Keywords: image processing, surface roughness, quasi-fractal parameters, fuzzy clustering.

1 Introduction

The surface roughness of manufactured products is defined in the SR ISO 4287/2001 standard and other international standards. Simple and complex characterization parameters are explained in works like [1], which consider the use of stylus devices that measure roughness along a linear or curved path [3, 7, 12]. Although these devices have been continuously upgraded in order to increase measuring precision [3], they are not efficient enough for automated measuring systems, because the stylus must make contact with the measured surface and because the measurement takes a very long time. A newer technique in surface roughness measurement is the use of digital image acquisition and processing [4, 6].
In this case the camera is coupled to a microscope (below a magnification of x100) and the acquired images are processed with specially designed computer programs; thus one image of 24 mm2 corresponds to 100 stylus scans. In [10] a method for summit and directionality identification of textured images is defined using surface image analysis. In [2] it is shown that surfaces obtained by turning, milling and grinding present a high complexity when analyzed by optical or electron microscopes or by AFM (atomic force microscope). Isotropic surfaces obtained by machining can be characterized by the Mandelbrot-Weierstrass function using fractal methods. For the characterization of anisotropic surfaces some authors propose methods based on the two-dimensional FFT algorithm; this method has been used for the characterization of ground surfaces. The basic idea in [11] is to decompose the surface roughness (described in terms of amplitude, wavelength and direction) into convex elements (summits) and then analyze these structures with morphologic trees. The authors claim that clear correlations can be obtained between the resulting morphologic trees and the tribologic properties of the surfaces. In [4] the correspondence between the surface roughness obtained with a certain manufacturing method and its image was studied on the basis of a functional dependence between the roughness height and the grey-level values of the image. The authors studied these correspondences with the help of polynomial nets trained with experimental results, which take as inputs the cutting speed, feed and cutting depth, as well as the mean values of the image grey levels. The polynomial net is capable of estimating the surface roughness with an acceptable accuracy, which was validated with a series of experimental measurements. The advantage of this method is that it can be applied to online roughness estimation in turning processes.
The paper [9] proposes a new method to analyze and characterize surface roughness. On the basis of a three-step algorithm, textured images of manufactured surfaces obtained by casting, milling and grinding are classified. In the first step the image is processed by a frequency-normalized wavelet transform, obtaining a set of images at different scales and phases. In the second step characteristic parameter values are extracted, and in the third step the image classification is obtained using the set of extracted features.

Copyright © 2006-2008 by CCC Publications

From the synthesis of the presented works it can be concluded that the method of surface image processing is better than the methods using stylus-type measurement, because it is faster and there is no contact between the measuring instrument and the surface. It has also been shown that there is a correlation between the roughness height and the image grey levels, and that surface roughness estimation can be carried out in ways similar to texture analysis.

2 Acquisition and preprocessing of surface images

For image acquisition purposes, several manufactured roughness probes with known roughness parameters (stalî dovodca GOST 9378-80 e15718) were used, obtained by manufacturing operations such as cylindrical milling, plane milling, shaping, frontal grinding, plane grinding and polishing. In this paper only surface roughness images representing shaping, plane grinding and polishing are analyzed, because these images are more similar to each other and harder to classify. Four non-overlapping images of every probe's surface were taken using a CCD camera mounted on a Citival microscope at magnifications of x10 and x25. The resolution of the images was 640x480 pixels.
The correlation between surface roughness and the surface image has been studied in a large number of papers [4], showing a certain functional dependency between asperity height and image intensity. During our experiments, however, we observed that this correlation is more complex and depends to a very high degree on the illumination conditions of the probe. The usual processing phases for images that do not represent objects are presented in [14]. After image acquisition, a number of preprocessing operations had to be performed in order to obtain better image quality. The preprocessing steps were as follows:
- filtering: eliminate inherent image noise;
- establishing the region of interest: keep only the high-information regions of the image;
- elimination of uneven illumination effects: remove the higher intensity in the middle of the image, which is characteristic of images taken with microscopes (Figures 1 and 2);
- correction of probe rotation and position variations: as the images are anisotropic, rotation of the probe can alter the analysis results. Here the 2D fast Fourier transform described in [2] or the oriented Gabor filters described in [5] can be used. The authors tested several automatic image rotation algorithms and finally a 1D FFT method was used.
On the basis of the tested preprocessing methods, a program module was developed which performs all preprocessing steps automatically and which can eventually be included in an automated quality control system. After preprocessing, the image quality was good enough to perform the next step of image processing. Following recent research in texture analysis and image processing, a number of statistical methods (co-occurrence, statistical moments) and frequency-domain methods (Gabor filters, wavelet analysis) were tested in order to obtain automated recognition of surface roughness parameters, but these methods did not yield the desired results, so we focused our research on fractal methods.
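The uneven-illumination step above can be sketched as a background-subtraction pass (a minimal NumPy sketch with a coarse block-mean background model; the authors' actual preprocessing is not specified at this level of detail):

```python
import numpy as np

def flatten_illumination(img, block=32):
    """Estimate the smooth illumination field with a coarse block-mean
    background, then remove it while preserving the global mean level."""
    h, w = img.shape
    bg = np.zeros((h, w), dtype=float)
    for i in range(0, h, block):
        for j in range(0, w, block):
            bg[i:i + block, j:j + block] = img[i:i + block, j:j + block].mean()
    return img - bg + img.mean()
```

Any low-pass estimate of the background (Gaussian blur, polynomial surface fit) would play the same role; the block mean just keeps the sketch dependency-free.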
3 Fractal image processing characterization

Considering the goal of our research, the computation of the fractal dimension itself is less important from a practical point of view; it is more important to use fractal or pseudo-fractal parameters in order to discriminate surfaces with different roughness characteristics.

Figure 1: Surface image with higher center intensity.
Figure 2: Image after preprocessing (uneven lighting and rotation have been eliminated).

Fractal dimension computation of rough surfaces using the Weierstrass-Mandelbrot function is described in [2] and elsewhere. When this function is correlated to the power spectral density, the fractal dimension is correlated to the slope of the spectrum represented on a logarithmic scale. The Weierstrass-Mandelbrot function is difficult to apply in practice; that is why we had to use methods which are easier to implement as computer algorithms. These methods are the box-counting method (BC) and the frequency-domain fractal parameter (using power spectral density diagrams). Both methods were tested on the roughness probe images. The box-counting method (BC) is derived from the "compass dimension" and is closely related to the fractal dimension as stated by Mandelbrot with the relation:

D = \frac{\log N}{\log(1/r)} \qquad (1)

The compass dimension is obtained by measuring a curve (which can represent a section through a surface) with decreasing measuring units r_i, i = k, ..., 1, and storing the number of measures N_i for each r_i. The diagram of log(N_i) as a function of log(1/r_i) is drawn, obtaining a so-called Richardson plot. If the Richardson plot is a straight line, then the measured object is fractal and the slope of the plot is its compass dimension. The BC method uses rectangular boxes of decreasing edge length instead of the linear measure r_i.
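The box-counting idea of (1) can be sketched for a binary image (our own minimal 2D implementation, not the authors' BC3D code):

```python
import numpy as np

def box_count(mask, r):
    """Number of r x r boxes that contain at least one 'on' pixel."""
    h, w = mask.shape
    count = 0
    for i in range(0, h, r):
        for j in range(0, w, r):
            if mask[i:i + r, j:j + r].any():
                count += 1
    return count

def bc_dimension(mask, sizes=(1, 2, 4, 8, 16)):
    """Slope of the Richardson plot log N versus log(1/r) (Eq. 1)."""
    x = np.log(1.0 / np.array(sizes, dtype=float))
    y = np.log([box_count(mask, r) for r in sizes])
    slope, _ = np.polyfit(x, y, 1)
    return slope
```

The BC3D variant described in the text applies the same counting to 3D boxes over the grey-level surface instead of 2D boxes over a binary mask.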
The fractal dimension can also be computed on the basis of the power spectral density (PSD), as stated in [1]. If the PSD amplitude is represented as a function of the spatial frequency f in a logarithmic diagram, then the fractal parameter can be taken as the slope p'1 of the log(PSD) approximation line, and p'2 as the intersection of this line with the ordinate axis:

\log(PSD) = p'_1 \log(f) + p'_2 \qquad (2)

First we developed a 2D box-counting algorithm and then a 3D algorithm (which uses 3D boxes on a 3D matrix, as shown in Figure 2), both yielding satisfactory results. Although the obtained results show that the analyzed images do not have true fractal behavior (the resulting Richardson plot is not a rigorously straight line), the goal is to find correlations between the obtained parameters on one hand and the surface roughness on the other hand. In this research we used the BC3D (three-dimensional box-counting) method, but in the Richardson plot, instead of using only a linear approximation to define the parameters, we also used second- and third-degree polynomials. If we denote y = log(N_i) and x = log(1/r_i), we have the following relations:

y = p_1 x + p_2 \qquad (3)
y = p_3 x^2 + p_4 x + p_5 \qquad (4)
y = p_6 x^3 + p_7 x^2 + p_8 x + p_9 \qquad (5)

Examples of curve fitting for relations (3), (4) and (5) are given in Figures 3, 4 and 5. We then considered using the coefficients p1...p9 to characterize the roughness of the studied manufactured surfaces. We named these coefficients "quasi-fractal parameters" (QFP, meaning fractal-like parameters), highlighting the fact that they are not rigorously correlated to the fractal dimension. In order to study the possibilities of classifying images of rough surfaces with the above-defined parameters, we made two-dimensional representations that we denote QFP diagrams (quasi-fractal parameter diagrams). Each of these diagrams represents a two-dimensional space having as dimensions two of the QFPs defined above.
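Extracting p1-p9 from a Richardson plot then reduces to three polynomial fits of (3)-(5) (a sketch using `numpy.polyfit`; the grouping of the returned coefficients follows the equations above):

```python
import numpy as np

def qfp(x, y):
    """Quasi-fractal parameters p1..p9 (Eqs. 3-5): coefficients of the
    1st-, 2nd- and 3rd-degree fits to the Richardson plot (x, y)."""
    p1, p2 = np.polyfit(x, y, 1)
    p3, p4, p5 = np.polyfit(x, y, 2)
    p6, p7, p8, p9 = np.polyfit(x, y, 3)
    return (p1, p2, p3, p4, p5, p6, p7, p8, p9)
```

On a truly fractal (straight-line) plot the higher-order coefficients vanish, so their magnitudes measure exactly the deviation from fractal behavior discussed above.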
Figure 3: Linear curve fitting for points obtained with the BC3D algorithm applied to an image of a shaped surface.
Figure 4: Second-order polynomial curve fitting for points obtained with the BC3D algorithm applied to an image of a shaped surface.
Figure 5: Third-order polynomial curve fitting for points obtained with the BC3D algorithm applied to an image of a shaped surface.

4 Experimental considerations

In order to establish how the QFPs can characterize surface roughness, we acquired and processed four different images from each surface roughness probe, for three types of cutting operations: shaping with 4 different roughness values, grinding with 4 different roughness values and polishing with 3 different roughness values. Table 1 shows the sample categories, roughness values, sample codes and diagram symbols; in the diagram symbols the color represents the operation and the symbol represents a specific roughness value. Some examples of the analyzed images are shown in Figure 6. Images were also acquired at a magnification of x25, and for both magnifications the images were studied using QFP diagrams. Two examples of such diagrams are shown in Figures 7.a and 7.b. Comparing the diagrams in Figures 7.a and 7.b, it can be observed that some of the same studied samples have different locations in the parameter space. For example, the black squares (representing S4 samples) have one location for x10 magnification and another location for x25 magnification. This shows that the p1 and p2 parameters are not "true fractal" parameters, because they do not exhibit invariance to scale; but it also shows that while at x10 magnification we cannot really distinguish the S4 samples from the G-group samples, at x25 magnification the S4 samples are well distinguishable.

Figure 6: Examples of studied images (magnification x10, coding as stated in Table 1).
Figure 7: QFP diagrams for parameters p1-p2: a. magnification x10; b. magnification x25.
This leads to the conclusion that acquiring and analyzing sample images at different magnifications can help to better discriminate the samples. The selection of the optimum magnifications needs to be established experimentally on large sets of images; in this paper we present discrimination algorithms only for samples at x10 magnification. Establishing the box dimension range is also an issue, but it can likewise be settled by experiments on large sets of calibration images. The authors plan to present algorithms for these issues in a future work.

Figure 6: Examples of studied images (magnification x10, coding as stated in Table 1).
Figure 7: QFP diagrams for parameters p1-p2; a. magnification x10; b. magnification x25.

Table 1: Sample categories, roughness values (Ra), codes and diagram symbols.

Operation   Roughness (Ra)   Code   Diagram symbol
Shaping     12.5-6.3         S1     black diamond
            6.3-3.2          S2     black up-pointing triangle
            3.2-1.6          S3     black down-pointing triangle
            1.6-0.8          S4     black square
Grinding    3.2-1.6          G3     blue down-pointing triangle
            1.6-0.8          G4     blue square
            0.8-0.4          G5     blue circle
            0.4-0.2          G6     blue star
Polishing   0.2-0.1          P7     red plus sign
            0.1-0.05         P8     red x
            0.05-0.025       P9     red dot

We mentioned before that there is a close link between the fractal dimension and the power spectral density parameters obtained with FFT methods [1], [2]. The authors experimented with the PSD method too, in order to compare the two methods, using the same probes as in the first case. The results obtained were very close to those obtained with the box counting method; an issue still to be solved here is the automated selection of the analysis frequency range. Sample discrimination can be performed automatically by a series of clustering methods, such as fuzzy c-means or artificial neural networks.
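The PSD variant of relation (2) can also be sketched briefly. The snippet below is a simplified 1-D illustration of our own (the paper analyzes 2-D images, and automated frequency-range selection is left open there): it estimates the power spectrum with an FFT and fits the log-log slope p'1 and intercept p'2.

```python
import numpy as np

def psd_slope(profile, dx=1.0):
    """Fit log(PSD) = p'1 * log(f) + p'2 on a 1-D surface profile
    (a simplified 1-D variant of relation (2))."""
    n = len(profile)
    spec = np.abs(np.fft.rfft(profile - np.mean(profile))) ** 2
    f = np.fft.rfftfreq(n, d=dx)
    mask = f > 0                              # skip the zero-frequency bin
    p1p, p2p = np.polyfit(np.log(f[mask]),
                          np.log(np.fmax(spec[mask], 1e-30)), 1)
    return p1p, p2p
```

Rougher, less correlated profiles have flatter spectra: a Brownian-like profile yields a clearly more negative slope than white noise.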
The goal of applying clustering methods is to find the cluster center for each sample with a known roughness parameter (training phase). After the cluster center is found, it is used to classify samples of unknown roughness (recognition phase). In this paper we discuss only the training phase. Regardless of the kind of clustering method used, a good practice is to make the discrimination in successive steps:
1. Plot the QFP diagrams for combinations of quasi-fractal parameters taken two at a time (it is also possible to use higher-order QFP spaces, but these cannot be properly represented in diagrams);
2. Apply a clustering algorithm and find the cluster centers;
3. Observe which samples are well discriminated, store the cluster centers for these samples and eliminate them from the data set;
4. Restart from step 1 with the remaining samples, until all the samples are discriminated.

5 Fuzzy c-means clustering of quasi-fractal parameters

In order to perform sample discrimination, we chose the fuzzy c-means clustering method, which is implemented in several programming environments, for example in MATLAB. This method does not need large sets of training data and was suitable for our purpose. In this method, each sample is considered to belong to each cluster to some degree, defined by a membership value. The method was introduced in [13] as an enhancement of earlier clustering methods. It solves the clustering problem by grouping data sets in multi-dimensional spaces into a specified number of clusters. The method starts with an estimation of the cluster centers, which mark the central value of each cluster. At the same time, each sample is assigned a membership degree, which reflects the degree to which that sample belongs to a certain cluster.
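The alternating update at the core of fuzzy c-means fits in a few lines of numpy. This is a minimal sketch in the spirit of [13], not the MATLAB routine the authors used; the function names and the sample-based initialization are our own choices.

```python
import numpy as np

def _memberships(data, centers, m):
    # inverse-distance membership rule: u[k, i] is proportional to d(k, i)^(-2/(m-1))
    d = np.fmax(np.linalg.norm(data[None, :, :] - centers[:, None, :], axis=2), 1e-12)
    inv = d ** (-2.0 / (m - 1.0))
    return inv / inv.sum(axis=0)

def fuzzy_c_means(data, c, m=2.0, iters=100, seed=0):
    """Alternate membership and center updates; data is (n_samples, n_features).
    Returns (centers, memberships) with memberships of shape (c, n_samples)."""
    rng = np.random.default_rng(seed)
    centers = data[rng.choice(len(data), size=c, replace=False)]  # init on samples
    for _ in range(iters):
        u = _memberships(data, centers, m)
        um = u ** m
        centers = um @ data / um.sum(axis=1, keepdims=True)       # weighted means
    return centers, _memberships(data, centers, m)
```

Because memberships decrease monotonically with distance, each sample's largest membership always points to its nearest center.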
The initial cluster centers are then successively modified in order to minimize an objective function, namely the distance of each sample to a center, weighted by the sample's membership degree. In the following we present the results obtained by applying the fuzzy c-means clustering method to the QFP spaces of the described samples. The final clustering is achieved in four phases.

Figure 8: QFP diagram (a) and clustering (b) for phase 1. The black diamond markers in (b) represent the cluster centers and each color represents one cluster.
Figure 9: QFP diagram (a) and clustering (b) for phase 2. The black diamond markers in (b) represent the cluster centers.
Figure 10: QFP diagram (a) and clustering (b) for phase 3.
Figure 11: QFP diagram (a) and clustering (b) for phase 4.

In the first phase (Figure 8) we found a well discriminated clustering of samples S1, S3 and S4; in the second phase (Figure 9) we discriminate S2; in the third (Figure 10) P1, P2, P3 and G6; and finally, in phase four (Figure 11), G3, G4 and G5. The cluster centers found will later serve to classify any unknown sample. In order to increase the clustering precision, a combination of different diagrams can also be used.

6 Conclusions

In this paper new quasi-fractal parameters (p1...p9) were defined and a new type of diagram (the QFP diagram) was proposed to achieve roughness image recognition. The presented method is not an optimal one, since the selection of parameters for the different phases was made by hand; however, this selection needs to be done only once for a group of roughness samples obtained with certain manufacturing methods. The algorithms must also be tested on larger sets of images. As further developments, the selection of the box dimension range for the BC3D method and of the frequency range for the PSD method will be studied.
We will also study the influence of magnification on quasi-fractal parameter discrimination, and a method to find an optimal parameter selection.

Bibliography

[1] M. A. Costa, Fractal Description of Rough Surfaces for Haptic Display, PhD thesis, Stanford University, 2000.
[2] Z. Jiang, H. Wang, B. Fei, Research into the application of fractal geometry in characterising machined surfaces, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[3] R. K. Leach, NanoSurf IV: traceable measurement of surface texture at the National Physical Laboratory, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[4] B. Y. Lee, Y. S. Tang, Surface roughness inspection by computer vision in turning operations, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[5] D. M. Tsai, S. K. Wu, M. C. Chen, Optimal Gabor filter design for texture segmentation using stochastic optimization, Image and Vision Computing, Elsevier Science, 2000.
[6] S. Baleix, S. Le Roux, G. Bernhart, P. Lours, Surface and image analysis of oxides grown and spalled on heat resistant cast steels exposed to thermal cycles, Ecole des Mines d'Albi-Carmaux, Albi, France, Journal of Materials Processing Technology, Elsevier Science BV, 2001.
[7] S. Brinkmann, H. Bodschwinna, H. W. Lemke, Accessing roughness in three dimensions using Gaussian regression filtering, University of Hanover, Germany, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[8] S. Lu, Y. Gao, T. Xie, F. Xie, X. Q. Jiang, Z. Li, F. Wang, A novel contact/non-contact hybrid measurement system for surface topography characterization, Huazhong, Hong Kong University of Science and Technology, Second Research Institute of the Public Security Ministry, China, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[9] B. Josso, D. R. Burton, M. J.
Lalor, Wavelet strategy for surface roughness analysis and characterisation, Liverpool John Moores University, UK, Computer Methods in Applied Mechanics and Engineering, Elsevier Science, 2001.
[10] K. Yanagi, S. Hara, T. Endoh, Summit identification of anisotropic surface texture and directionality assessment based on asperity tip geometry, Nagaoka University of Technology, Tokyo Institute of Technology, Japan, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[11] H. Zahouani, M. Assoul, R. Vargiolu, T. Mathia, The morphological tree transform of surface motifs. Incidence in tribology, Ecole Centrale de Lyon, Université de Franche-Comté, France, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[12] M. Wieczorowski, Spiral sampling as a fast way of data acquisition in surface topography, International Journal of Machine Tools and Manufacture, Elsevier Science Ltd, 2001.
[13] J. C. Bezdek, Pattern Recognition with Fuzzy Objective Function Algorithms, Plenum Press, New York, 1981.
[14] C. Aguilera, M. Ramos, G. Roa, An automatic grading system for panels surfaces using artificial vision, International Journal of Computers, Communications and Control, Vol. I (2006), No. 2, pp. 15-22, 2006.

Tiberiu Vesselenyi
University of Oradea
Universitatii St. 1, 410087, Oradea, Romania
E-mail: tvesselenyi@yahoo.co.uk

Ioan Dzitac
Department of Economics, Agora University of Oradea
Piata Tineretului 8, Oradea 410526, Romania
E-mail: idzitac@univagora.ro

Simona Dzitac
University of Oradea
Universitatii St. 1, 410087, Oradea, Romania
E-mail: simona.dzitac@gmail.com

Victor Vaida
University of Oradea
Universitatii St. 1, 410087, Oradea, Romania
SC Electrocentrale Deva SA, Str.
Santierului, Nr. 1, Mintia, Romania
E-mail: vaida@termodeva.ro

Tiberiu Vesselenyi was born in Oradea, Romania, in 1957. He graduated from the University "Politehnica" of Timişoara in 1983. From 1983 to 1991 he worked at a machine building company in Oradea as a designer and CNC programmer. From 1991 to 1994 he was a research engineer at the Geothermal Energy Research Center in Oradea, and since 1994 he has been an associate professor at the University of Oradea, where he teaches robot and CNC programming. He earned a PhD in robotics at the University "Politehnica" of Timişoara. He has published over 150 papers in national and international conferences and journals, and is the author or co-author of 4 books.

Ioan Dzitac received an M.Sc. in mathematics (1977) and a Ph.D. in information science (2002) from "Babes-Bolyai" University of Cluj-Napoca. At this moment, he is an associate professor and head of the Economics Department at Agora University, Oradea, Romania. His current research interests include different aspects of parallel and distributed computing, applied mathematics and economic informatics. He has edited 6 conference proceedings, published 15 books and more than 50 scientific papers in journals and conference proceedings. He has been a member of the program committees of 27 international conferences.

Simona Dzitac received a B.Sc. (2000) and an M.Sc. (2001) in mathematics-physics, a B.Sc. (2005) and an M.Sc. (2007) in energy engineering from the University of Oradea, and a B.Sc. in economic informatics (2007) from the University of Craiova, Romania. At this moment, she is a PhD student in the energy engineering field and a researcher at the University of Oradea. Her current research interests include reliability, applied mathematics and computer science in engineering fields. She has published 5 books and 45 scientific papers in journals and conference proceedings.
Victor Vaida graduated from the Technical University of Timisoara, Faculty of Electrotechnics, being awarded a B.Sc. degree in electrical power engineering in 1968. In 1997 he was awarded a PhD in technical sciences by Transilvania University of Brasov. He completed several specialization trainings and courses: in automation, electric and information systems in power plants at the Siemens company in Germany (1972-1974), in the quality of electricity supply (2007-2008) at SIER Bucharest, and in business management at the Bucharest Institute of Management (1994). At present, he is the technical manager of SC Electrocentrale Deva and an associate professor at the University of Oradea, Faculty of Power Engineering. He is also an associate researcher at the Economy and Industry Institute of the Romanian Academy and the president of the Society of Power Engineers of Romania. His research activity includes 7 specialized books and 160 articles in the power field published in Romania and abroad, 27 contract/grant-based research projects and other R&D work.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 4, pp. 439-451

MPM Job-Shop under Availability Constraints
N. Zribi, L. Duţă, A. El Kamel

Nozha Zribi, École Centrale de Lille, France
Luminiţa Duţă, State University "Valahia" Tirgoviste, Romania

Abstract: A large part of the scheduling literature assumes that machines are available all the time. In this paper, the MPM job-shop scheduling problem, where machine maintenance has to be performed within given time intervals, inducing machine unavailability, is studied. Two approaches to solve the problem are proposed. The first is a two-phase approach, where the assignment and the sequencing are solved separately. The second is an integrated approach based on the exact resolution of the 2-job problem using the geometric approach.
Keywords: genetic algorithm, geometric approach, assignment heuristic

Most of the scheduling literature assumes that machines are available all the time. However, in many realistic situations, e.g., in typical industrial settings, machine breakdowns and preventive scheduled maintenance are rather common occurrences. These considerations increase the complexity of any scheduling problem, but make the problem closer to the industrial reality [26]. In this paper, we consider job-shop scheduling with multi-purpose machines and availability constraints. We consider the deterministic model, where the unavailability periods corresponding to maintenance tasks are known in advance. We also assume that preemption of operations is not allowed: more precisely, an operation Oij of job Ji on machine Mk starts only if its execution can be finished before Mk becomes unavailable. The problem considered is a generalization of the classical job-shop problem and of the multi-purpose machine problem studied in [12], where machines are available at all times. Compared to the literature dedicated to classical scheduling problems, studies dealing with limited machine availability are rather rare. Availability constraints were first introduced for single machines [1], [28] and parallel machines [24], [25]. Lee extensively investigated flow-shop scheduling problems with two machines [15], [18], [19]. In particular, the author defined the resumable, non-resumable and semi-resumable models. An operation is called resumable if it can be interrupted by an unavailability period and completed without penalty as soon as the machine becomes available again. If the part of the operation processed before the unavailability period must be partially (respectively fully) re-executed, then the operation is called semi-resumable (respectively non-resumable). Recently, flow-shop scheduling problems with two machines and resumable jobs have been treated in [9] and [14].
The job-shop problem under unavailability constraints has also been considered recently [30], [3], where the authors proposed a branch and bound algorithm for the job-shop problem with heads and tails and unavailability periods. The problem considered here is strongly NP-hard, since the problem without unavailability periods is already strongly NP-hard [12]. In this paper we propose two different approaches to solve this problem. The remainder of this paper is organized as follows. After a description of the considered problem in the following section, we first propose a two-phase method, where a heuristic is used to solve the assignment problem and a genetic algorithm is developed for the sequencing problem. An integrated method, based on the exact resolution of the 2-job problem, is then developed. A comparison between the two algorithms is given in Section 4.

Copyright © 2006-2009 by CCC Publications

1 Problem formulation

The MPM job-shop (job-shop with multi-purpose machines) with availability constraints (J(MPM), NCwin | Cmax; terminology defined in [29]) may be formulated as follows. There are n jobs J1, ..., Jn to be processed on a set of m machines R = {M1, ..., Mm}. Each machine Mr can process at most one job at a time. Each job Ji consists of a sequence of ni operations that must be accomplished according to its manufacturing process. Each operation Oij (i = 1, ..., n; j = 1, ..., ni) can be performed by any machine Mr in a given set µij ⊂ R, for pij time units. Each operation is non-preemptive, i.e., it must be accomplished without interruption. Moreover, we assume that machine Mr is unavailable during given periods corresponding to preventive maintenance. The starting times and durations of these tasks are fixed and known in advance. We denote by Kr the number of maintenance tasks on machine Mr; arl and drl represent respectively the starting and the finishing time of the l-th maintenance task on machine Mr.
The objective is to find a schedule, defined by the starting time sij and the completion time cij of each operation Oij, with a minimum makespan (max cij). The scheduling problem J(MPM), NCwin | Cmax can be decomposed into two subproblems:
• a routing subproblem, which consists in assigning operations to machines;
• an operation scheduling subproblem associated with each machine to minimize the makespan; this is a job-shop scheduling problem with availability constraints J, NCwin | Cmax.

2 Two-phase approach for the problem J(MPM), NCwin | Cmax

2.1 The routing problem

Since the precedence constraints can be relaxed following the decomposition of the problem into two separate stages, the assignment problem may be treated as a parallel machine problem with two additional constraints:
• an operation can be performed only by a machine belonging to a subset of the set of available machines (partial flexibility);
• machines are subject to several maintenance constraints: the planning horizon is divided into subintervals.

We propose a heuristic based on several priority rules taking these two additional constraints into account.

Assignment heuristic. We use a list algorithm based on priority rules in order to construct an initial assignment solution. Let us define the following parameters:
• rij: earliest starting time of operation Oij (Definition 1);
• tsk: availability date of machine Mk at iteration s, where s denotes the iteration number;
• ERk: set of operations which can be performed on machine Mk;
• EAsk: set of operations which can be assigned to machine Mk at iteration s;
• CMsk: load of machine Mk at iteration s.

Definition 1. To each operation Oij we associate an earliest starting time rij, calculated by the following formula:

r_{i,1} = 0, for all 1 ≤ i ≤ n;
r_{i,j+1} = r_{i,j} + p_{i,j}, for all 1 ≤ j ≤ ni − 1, 1 ≤ i ≤ n. (1)

In step 1, the different parameters are initialized.
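Relation (1) above is a simple prefix sum over each job's processing times; a short sketch (our own helper, not code from the paper):

```python
def earliest_start_times(processing_times):
    """Relation (1): r[i][0] = 0 and r[i][j+1] = r[i][j] + p[i][j].
    processing_times[i] is the list of durations of job i's operations."""
    r = []
    for p in processing_times:
        row, t = [], 0
        for dur in p:
            row.append(t)   # earliest start of this operation
            t += dur        # predecessor operations must finish first
        r.append(row)
    return r
```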
In step 2, for each machine we determine the set EAsk of operations such that rij ≤ tsk. In step 3, we evaluate the potential starting time of each operation on each possible machine. The unavailability periods are taken into account: in fact, we test whether the operation can be scheduled before the next unavailability period on that machine. A pair (operation, machine) is then selected using the following priority rule: the machine Mk with the smallest load (min CMsk) is selected; operations in EAk are sorted in non-decreasing order of card(µij) (priority is given to the least flexible operation); Mk is assigned to the first operation in EAk which can be scheduled before the next unavailability period. This priority rule fills the time intervals before the unavailability periods and takes into account both the load of the machines and the flexibility degree of each operation. In order to ensure a high quality of the solution, we chose to improve the assignment given by the assignment heuristic. To this end, a local improvement search was studied, based on a tabu algorithm, an adapted routing move technique and criteria adapted to the studied problem. In the next section, we give a description of the tabu algorithm.

A tabu search algorithm for the assignment problem

Optimization criteria: for a classical routing problem, where machines are available at all times, one generally chooses to minimize the workload of the most loaded machine, since it provides a lower bound for the makespan. We define, for each assignment S, a lower bound denoted LB(S) for the makespan corresponding to S. This lower bound is based on a relaxation into a set of single-machine problems taking the unavailability periods into account. The objective of the tabu search algorithm presented here is to minimize Cr = LB(S) and hence to pre-optimize the makespan.
Given an assignment S, we associate with each machine Mk a single-machine problem Πk with ready times (Definition 1), tails (Definition 2) and unavailability periods. A lower bound for Πk is the makespan of a preemptive schedule with unavailability periods, based on the Jackson preemptive schedule (JPS) algorithm. Such a schedule is calculated for each machine, and LB(S) is the maximum makespan value over these schedules.

Definition 2. After the completion of operation Oij, a time of qij has to elapse before job Ji is completely finished; qij is called the tail of operation Oij.

The preemptive schedule procedure constructs the optimal schedule when preempt-resume applies, and hence yields a lower bound for Πk, for the two following reasons:
1. The unavailability period is treated as an operation, so the problem here is equivalent to the preempt-resume case, where JPS gives the optimal solution.
2. The unavailability period will start right at its ready time and will never be preempted, since it has the largest tail among the available operations.

Description of the tabu search (TS) algorithm. TS was introduced by Glover as a general iterative meta-heuristic for solving combinatorial optimization problems [16]. The TS algorithm is as follows. The initial solution is obtained by applying the assignment heuristic described above. The solution is described as a list of operations with their corresponding machines. A routing move is defined by the relocation of a critical operation (an operation that belongs to the critical machine) to a feasible machine position. For a given solution, we consider every possible relocation of every reroutable critical operation. The routing move is based on the following steps:
1. Find the critical machine Mkc.
2.
Find an operation Oij that can be assigned to another machine Mk ∈ µij without increasing the criterion value.
3. Reassign Oij to Mk if possible.
The tabu list consists of pairs (op, Mo), where op denotes the operation that is moved from machine Mo to a different machine. The choice of the move is based on the value of Cr, which is the maximum makespan value of the preemptive schedules.

2.2 Genetic algorithm for the sequencing problem

After the assignment step, each operation is assigned to a fixed machine. Thus the MPM job-shop problem is reduced to a job-shop problem with availability constraints (JSPAC). The problem is then to assign a starting time sij and a completion time cij to each operation Oij (cij = sij + pij). The considered objective is to minimize the makespan (Cmax = max_{i,j} cij). We propose a genetic algorithm to optimize the makespan in a JSPAC.

Coding. According to the literature, two types of approaches exist. In the first, the schedule is directly coded in the chromosome. In the second, a scheduler is associated with the GA to transform the chromosome into an actual schedule. In this paper, the latter approach is used to code the GA chromosomes. In fact, we use a representation based on job operations. It consists in representing the schedule as a chain of nt operations (nt = Σ ni over 1 ≤ i ≤ n), where the operations of the same job are represented by the same symbol Ji, the job number. Each job Ji appears exactly ni times in the chain (ni is the number of operations of Ji). For example, for a job-shop problem of dimension 3*3 (3 jobs and 3 operations per job), an example of a chromosome is (1 2 1 3 1 3 2 2 3). The starting time and the completion time (sij, cij) are computed according to the order z of each task in the chain (chromosome), taking into account the unavailability periods of the machines.

Crossover and mutation. We use the GOX (generalized order crossover) operator, a swap-based mutation and an "intelligent" mutation operator.
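A decoder of this kind can be sketched as follows, under the non-preemptive rule stated earlier (an operation must finish before a maintenance window opens, or start after it closes). The data layout and names are our own illustration, not the authors' code.

```python
def decode(chrom, ops, unavail):
    """Decode a job-operation chromosome into a non-preemptive schedule.
    chrom: list of job ids, job i appearing len(ops[i]) times.
    ops[i]: list of (machine, duration) for job i, in process order.
    unavail[m]: list of (start, end) maintenance windows of machine m.
    Returns (schedule, makespan); schedule maps (job, op_index) -> (start, end)."""
    next_op = {j: 0 for j in set(chrom)}
    job_ready = {j: 0 for j in set(chrom)}
    mach_ready = {}
    schedule = {}
    for j in chrom:
        k = next_op[j]
        m, p = ops[j][k]
        t = max(job_ready[j], mach_ready.get(m, 0))
        for a, d in sorted(unavail.get(m, [])):
            if t < d and t + p > a:          # would overlap the window
                t = d                        # postpone past the window
        schedule[(j, k)] = (t, t + p)
        job_ready[j] = mach_ready[m] = t + p
        next_op[j] = k + 1
    makespan = max(end for _, end in schedule.values())
    return schedule, makespan
```

For instance, with one machine M1 unavailable on [4, 6], a 3-unit operation that would become ready at time 2 cannot fit before the window and is pushed to start at 6; reordering the chromosome changes which operation pays that penalty, which is exactly what the GA explores.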
"Intelligent" mutation operator: this operator reduces the idle time before unavailability periods in order to improve the makespan. The mutation heuristic, described by Algorithm 1, works as follows: for each unavailability period of the critical machine, we exchange an operation scheduled before the maintenance period with another operation which could begin before this maintenance period but is scheduled after it. All possible permutations are tested, and the permutation minimizing the makespan is selected.

Algorithm 1: heuristic for the "intelligent" mutation operator
  choose a chromosome X at random
  calculate the schedule and find the critical machine Mmax
  find OMmax: the set of operations Oij assigned to Mmax
  for each maintenance period D of Mmax
    for each operation Oij in OMmax finishing before D
      for each operation Oi'j' in OMmax which could begin before D but has been scheduled after D
        test the permutation of Oij and Oi'j'
  choose the permutation minimizing the makespan

2.3 Application example

Let us consider an example of an MPM job-shop with 15 jobs, 5 operations per job and 5 machines. Each machine is subject to two maintenance periods, as follows (M1: [201 250], [463 512] means that machine M1 is unavailable between the dates 201 and 250 and between the dates 463 and 512):
M1: [201 250], [463 512]
M2: [104 139], [520 588]
M3: [233 331], [499 528]
M4: [137 186], [507 556]
M5: [129 187], [783 881]

1. Assignment step. We first apply the assignment heuristic to obtain an initial solution. In order to evaluate the solution given by the assignment heuristic, we report in Table 1 the load of the different machines, as well as the makespan value of the preemptive schedule constructed using the Jackson rule.

Table 1: Results of the assignment heuristic

                      M1    M2    M3    M4    M5
Load                  714   782   868   789   839
Preemptive schedule   812   907   995   887   997

2. Sequencing problem.
el kamel m m m m m load 815 812 790 818 757 preemptive 913 915 917 916 915 schedule table 2: results of the ts algorithm 0 500 1000 1500 2000 2500 in s t 1 in s t 2 in s t 3 in s t 4 in s t 5 in s t 6 in s t 7 in s t 8 in s t 9 in s t 1 0 in s t 1 1 in s t 1 2 in s t 1 3 in s t 1 4 in s t 1 5 in s t 1 6 in s t 1 7 in s t 1 8 in s t 1 9 in s t 2 0 instances m ak es p a n cmax lbjmpm figure 1: simulation tests table 3 gives the value of the lower bounds based on preemptive schedule for the initial solution of assignment (solution 1) and the solution obtained after applying ts (solution 2) as well as the makespan found by the genetic algorithm. these results show that ts improves the global solution of the problem and ga gives a good solution comparing to the lower bound. we propose in the next section an integrated approach based on the exact resolution of the 2-job problem. solution 1 solution 2 preemptive 997 917 schedule makespan 1023 968 table 3: results of the genetic algorithm other experiments were performed on randomly generated instances with more or less availability periods. we report in this section different results for three classes of instances. each class has five instances. the number of jobs for each class is 10 , 15, 10 and 20 respectively. the number of machines is equal to 5 , 10, 10 and 10 respectively. the processing time for each operation is randomly selected in [|40,150|]. for each machine, the maintenance tasks occur after at least one operation. the starting time for each maintenance task differs from machine to another. the duration of a maintenance task on a machine is the average of the processing times of operations. for the ga, the mutation and crossover probabilities are fixed to: (pcrossover = ., pmutation = .). in figure 1 we compare the result of the two-phases approach with a lower bound denoted lbjmpm that we have developed for the problem [31]. 
Compared with LBJMPMAC, our approach gives interesting results (RDJMPMAC .%) with a short computational time. It is worth noting that LBJMPM, which is based on a lower bound for the parallel machine problem, is more interesting for instances with high flexibility.

3 An integrated approach for J(MPM), NCwin | Cmax

This approach is based on the exact resolution of the problem J(MPM), NCwin | n = 2 | Cmax. A polynomial algorithm is developed to solve this problem.

3.1 Polynomial algorithm for J, NCwin | n = 2 | Cmax

State of the art. The geometric approach was first introduced by Akers and Friedman (1955) in [2]. It reduces the two-job job-shop scheduling problem to the search for a shortest path, and thus gives a polynomial algorithm to solve it. The first step of the geometric approach is the representation of the scheduling problem in a 2-dimensional plane with obstacles, which represent the machine conflicts between the operations of the two jobs [22]. More precisely, each job Ji is represented by an axis with ni intervals, according to its manufacturing process. Each interval corresponds to an operation Oij and has a length of pij (Fig. 2). An interval of job J1 and an interval of job J2 form an obstacle if the corresponding operations share the same machine (Fig. 2). The horizontal and vertical lines crossing the final point F, which corresponds to the completion of the two jobs, are considered as the final obstacle. A feasible solution of the scheduling problem is then a path going from the origin O to the final point F. Such a path consists of horizontal, vertical and diagonal legs. A horizontal (resp. vertical) leg represents the exclusive progression of job J1 (resp. J2), whereas diagonal legs correspond to the simultaneous execution of the two jobs. Moreover, any path must avoid the interior of the obstacles. This is due to the fact that two operations cannot be executed simultaneously on the same machine and are not preemptable.
The length of a horizontal or vertical segment is equal to its usual length, while the length of a diagonal segment is equal to the length of its projection on either axis, which is the time spent in the simultaneous processing of two operations. The shortest path problem in the plane can be transformed into an unrestricted shortest path problem in an acyclic network (see Fig. 2), where the set of vertices corresponds to the origin O, the final point F and the north-west and south-east corners of the obstacles. Each vertex has at most two successors, obtained by going diagonally until hitting an obstacle D. If the obstacle D is the final obstacle, the vertex F is the only successor of the node; otherwise the NW and SE corners of obstacle D are its immediate successors (see Fig. 2).

Figure 2: Classic geometric approach for J1 = {(M1, 1), (M2, 2), (M3, 1)} and J2 = {(M2, 2), (M3, 1), (M1, 1)}: the plane with obstacles, and the corresponding network with vertices O, NW1, SE1 and F.

The temporized geometric approach (TGA), developed by Aggoune [3], [4], is an extension of the geometric approach which exactly solves the problem J, NCwin | n = 2 | Cmax. It integrates the evolution of time, and thus the availability of the machines, based on the definition and introduction of new vertices, as well as on a new, dynamic way to progress from one vertex to its successors.

Vertex characterization and definitions in TGA: in the classical geometric approach, the vertices of the network are the north-west (NW) and south-east (SE) corners of the obstacles hit when going diagonally in the plane. These corners are located at the extremities of the intervals corresponding to operations in conflict. Each vertex can then be defined by its coordinates in the plane: the x-coordinate (resp. y-coordinate) of the vertex corresponds to the operation of job J1 (resp. J2) to be executed. In TGA [4], some vertices can be located between two lines bounding an operation, i.e., inside the intervals.
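Under the max-metric distance just described, the classical geometric approach (without availability constraints) fits in a short memoized recursion: from each vertex, go diagonally, find the first obstacle crossed, and branch over its NW and SE corners. The sketch below is our illustration of the Akers-Friedman construction, not Aggoune's TGA.

```python
from functools import lru_cache

def two_job_makespan(job1, job2):
    """Shortest O-to-F path in the plane with obstacles (classical geometric
    approach, no availability constraints). Jobs are lists of (machine, time)."""
    def bounds(job):
        xs = [0]
        for _, p in job:
            xs.append(xs[-1] + p)
        return xs
    X1, X2 = bounds(job1), bounds(job2)
    L1, L2 = X1[-1], X2[-1]
    # obstacle (a1, b1, a2, b2): intervals of the two jobs sharing a machine
    obstacles = [(X1[j], X1[j + 1], X2[k], X2[k + 1])
                 for j, (m1, _) in enumerate(job1)
                 for k, (m2, _) in enumerate(job2) if m1 == m2]

    @lru_cache(maxsize=None)
    def shortest(x, y):
        d = y - x
        hit, best_t = None, None
        for a1, b1, a2, b2 in obstacles:
            if a2 - b1 < d < b2 - a1:            # diagonal crosses the interior
                t = max(a1 - x, a2 - y)          # distance to the entry point
                if t >= 0 and (best_t is None or t < best_t):
                    hit, best_t = (a1, b1, a2, b2), t
        if hit is None:
            return max(L1 - x, L2 - y)           # free diagonal run to F
        a1, b1, a2, b2 = hit
        # branch over the first obstacle hit, via its NW and SE corners
        return min(max(a1 - x, b2 - y) + shortest(a1, b2),
                   max(b1 - x, a2 - y) + shortest(b1, a2))

    return shortest(0, 0)
```

On the two jobs of Figure 2 this returns a makespan of 5: J2 runs M2, M3, M1 back to back while J1 waits for M2 to be freed.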
For each vertex, each coordinate is additionally annotated with the duration of the already-processed part of the associated operation. Moreover, an earliest starting time h(s) is associated with each vertex s; h(s) is the length of the shortest path from the origin to s. The set of vertices of the network constructed by TGA is composed of the three following types:

• Regular vertices, located at the intersection of a horizontal and a vertical line, to which the NW and SE corners of the obstacles belong.
• Singular vertices, located on a horizontal (resp. vertical) line, which means that the execution of the current operation of job J_2 (resp. J_1) has not started yet.
• Waiting vertices, also located at the intersection of two lines, for which the execution of the operations of both jobs J_1 and J_2 has not started yet.

A singular vertex is created if the progression of only one job is possible (availability problem for the other job), whereas a waiting vertex is created if the progression is possible for neither of the two jobs. A waiting vertex is always a duplication of the regular vertex having the same geometric coordinates but not the same earliest starting time. The progression works as follows:

• If the operations of the two jobs cannot start at time h(s), the earliest starting time of vertex s, a waiting vertex is created.
• If there is an availability problem in the direction of J_1 (resp. J_2), the progression is made along the vertical (resp. horizontal) line, which means the execution of the operations of job J_2 (resp. J_1) only, until job J_1 (resp. J_2) becomes available again. A singular vertex, from which a diagonal progression is possible, is added as a successor of s.
• If there is no availability problem, that is, if the operations of the two jobs can be executed at time h(s), the progression works as in the classical geometric approach.
3.2 An extended approach

We propose a generalization of TGA (GTGA) in order to deal with the flexibility property of J(MPM), NC_win | n = 2 | C_max. This generalization is based on the works of Aggoune [4] and Mati et al. [21] on the flexible job-shop without availability constraints. As for the job-shop problem, the scheduling of the MPM job-shop can be represented in the 2-dimensional plane, with potential obstacles that depend on the assignment of machines for the two jobs. Let us define the vertices of the network describing the progression in the plane, the successors of each vertex, and the distance between any two vertices. We develop the algorithm SuccVertex to find the successors of each vertex s = ((k_1, ∆_k1), (k_2, ∆_k2)). The algorithm SuccVertex works in three steps.

• Step 1 is an initialization step. The set P_h (resp. P_v) is defined to keep the machines of job J_1 (resp. J_2) allowing to progress until meeting a horizontal and/or a vertical line. This set is used to progress in the next iteration of the program in the case of a diagonal progression. E_v is the set of the possible machines of O_v and E_h is the set of the possible machines of O_h.
• In Step 2, we first check the availability of the two machines for O_v and O_h. If these machines are available, we progress diagonally until a vertical line (end of an operation of job 1) and/or a horizontal line (end of an operation of job 2) is met, depending on the durations of the current operations. The set P_h or P_v is defined to keep the machines of job J_1 or J_2, respectively, in the case of a diagonal progression. If an availability problem occurs in one of the two directions, the algorithm SuccVertexAvailability is used to define the successors of s in this case. The machines concerned by this case are memorized in the sets Pv_availability, Rh_availability, Ph_availability and Rv_availability; these sets are used by the algorithm SuccVertexAvailability.
The SE and NW corners of the potential obstacles that could be reached from s are added as successors of s during the diagonal progression.

• In Step 3, we update the current time current_time and the sets E_v, E_h, V and H. The algorithm SuccVertex stops when the final obstacle is hit, or when the diagonal progression is not possible because of availability problems or an unavoidable obstacle.

The algorithm SuccVertexAvailability allows progressing horizontally or vertically (availability problem) until one of the following occurs: the progression in the two directions becomes possible (end of the unavailability period), in which case a singular or a regular vertex is added as a successor of s; another availability problem occurs, in which case a waiting vertex is added as a successor of s; or an unavoidable obstacle is hit.

Distance between two vertices s and s′. The distance between a vertex v_i and its successor v_i′ is calculated inside the two developed algorithms using the variable current_time. In fact, all machines used to progress from v_i to v_i′ are fixed.

Remark: if F is a successor of s, the distance between s and F is calculated using only available machines, if possible, or the machines becoming available first; in fact, we neglect the other paths.

Theorem 1. The set of vertices constructed by applying the algorithm SuccVertex is sufficient to determine the optimal schedule.

Proof. The correctness of Theorem 1 follows from the fact that TGA gives the optimal schedule in the case of the classical job-shop [3], and the developed algorithm checks all possible machines for each operation.

3.3 The general job-shop problem with multi-purpose machines and availability constraints

From the results of the previous section, we can deduce a greedy heuristic to compute a solution for J(MPM), NC_win | C_max. This heuristic works as follows:

1. The first two jobs are optimally scheduled using the GTGA algorithm.
2.
Additional unavailability periods, corresponding to the execution of the operations of the two scheduled jobs, are fixed on each machine.

3. The algorithm is applied to the next two jobs of the sequence, taking into account the initial and the new unavailability periods.

This procedure continues until all jobs are treated. If the number of jobs is odd, we need an insertion procedure to schedule the last job. It is based on the following rule: each operation of the last job is inserted in such a way that it begins as early as possible; if we have the choice between two machines, we choose the machine giving the smallest idle time.

4 Experimental results

To perform an experimental evaluation of the proposed approaches, we use ten classical flexible job-shop instances [13]. In order to provide proper experimental settings, two availability periods are generated randomly for each machine. It is worth noting that these results were also confirmed by several other experiments based on randomly generated instances with different sizes and/or availability periods [31]. In Table 4, we report the makespan values given respectively by the two-phase approach and the integrated approach.

Table 4: comparison of the two approaches

Instance | Size (n×m×nt) | Two-phase approach | Integrated approach
Inst1  | 10×5×50 | 852  | 833
Inst2  | 10×5×50 | 641  | 631
Inst3  | 10×5×50 | 697  | 707
Inst4  | 10×5×50 | 710  | 703
Inst5  | 10×5×50 | 633  | 625
Inst6  | 15×5×75 | 945  | 983
Inst7  | 15×5×75 | 897  | 891
Inst8  | 15×5×75 | 1023 | 1017
Inst9  | 15×5×75 | 1001 | 997
Inst10 | 15×5×75 | 998  | 1002

These simulations show that the two-phase approach gives interesting results compared with the integrated approach. The main advantage of the two-phase approach is its computation time: the integrated approach is rather more demanding, mainly in terms of computing time. Besides, we note that the solutions given by the integrated approach vary with the initial sequence of jobs.
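The pairwise scheme of Section 3.3 can be sketched as a small driver; GTGA itself is beyond a short listing, so a deliberately naive two-job solver stands in for it here (all names are ours, for illustration only):

```python
# Driver for the greedy heuristic of Section 3.3: schedule jobs two at a
# time with a two-job solver (GTGA in the paper; any callable here), then
# freeze the resulting busy periods as extra unavailability on each machine.

def greedy_pairing(jobs, unavailability, two_job_solver):
    """jobs: list of jobs; unavailability: {machine: [(start, end), ...]}."""
    schedule = []
    for i in range(0, len(jobs) - 1, 2):
        ops = two_job_solver(jobs[i], jobs[i + 1], unavailability)
        schedule.extend(ops)
        for machine, start, end in ops:      # freeze as new unavailability
            unavailability.setdefault(machine, []).append((start, end))
    # an odd last job would be handled by the insertion rule described above
    return schedule

def toy_solver(a, b, unavail):
    """Placeholder: serialize both jobs after all existing busy periods."""
    ops, t = [], max((e for ivs in unavail.values() for _, e in ivs), default=0)
    for machine, p in a + b:
        ops.append((machine, t, t + p))
        t += p
    return ops

jobs = [[(1, 1), (2, 2)], [(2, 2), (1, 1)], [(1, 1), (3, 1)], [(3, 2), (1, 1)]]
sched = greedy_pairing(jobs, {}, toy_solver)
print(sched[-1])  # (1, 10, 11): the second pair starts after the first
```

Note that the outcome depends on the order in which the jobs are paired, which is precisely the sequence-dependence observed for the integrated approach.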
Hence, this approach can be improved by adding an optimization algorithm for the initial sequence of jobs. Meanwhile, it clearly appears worthwhile to apply this polynomial algorithm within a branch-and-bound algorithm, as done by Jurisch for the classical MPM job-shop without availability constraints [17].

5 Conclusion

We have investigated in this paper MPM job-shop scheduling problems under availability constraints. We have proposed two kinds of methods. The first one solves the assignment and the sequencing problems separately. The second one is based on an extension of the geometric approach to deal with the machine availability and the flexibility property of the problem. We are now working, with an industrial partner, on the development of a maintenance management system including these optimizations in a multi-objective environment. In fact, the performance analysis of a scheduling problem involves more than one criterion [7]. We are focusing on two main criteria: the total maintenance costs and the delivery processes. Another aspect we will focus on is real-time approaches to solve scheduling problems with unanticipated interruptions [5].

Bibliography

[1] Adiri I., Bruno J., Frostig E., Rinnooy Kan A.H.G.: Single machine flow-time scheduling with a single breakdown. Acta Informatica, 26, 679-696.
[2] Akers S.B., Friedman J.: A non-numerical approach to production scheduling problems. Operations Research, vol. 3, 1955, pp. 429-442.
[3] Aggoune R.: Ordonnancement d'ateliers sous contraintes de disponibilité des machines [Shop scheduling under machine availability constraints]. Ph.D. thesis, Université de Metz, France, 2002.
[4] Aggoune R.: Two-job shop scheduling problems with availability constraints. ICAPS 2004, June 3-7, 2004, Whistler, British Columbia, Canada.
[5] Duta L.: Contribution à l'étude de la conduite des systèmes de désassemblage [Contribution to the study of the control of disassembly systems]. Ph.D. thesis, 2006, Université de Franche-Comté / Université de Bucarest.
[6] Duta L., Filip F.G., Henrioud J.-M.,
Popescu: Disassembly line scheduling with genetic algorithms. International Journal of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, vol. III (2008), no. 3, pp. 270-280.
[7] Filip F.G., Neagu G., Donciulescu D.: Job shop scheduling optimization in real time production control. Computers in Industry, 4(4), North-Holland, Amsterdam, 1983, 395-403.
[8] Ruiz-Torres A.J., Nakatani K.: Application of real-time simulation to assign due dates on logistic-manufacturing networks. Proceedings of the 1998 Winter Simulation Conference.
[9] Blazewicz J., Breit J., Formanowicz P., Kubiak W., Schmidt G.: Heuristic algorithms for the two-machine flowshop problem with limited machine availability. Omega Journal (2001), 29, 599-608.
[10] Carlier J.: Scheduling jobs with release dates and tails on identical machines to minimize the makespan. European Journal of Operational Research (1987), 29: 298-306, North-Holland.
[11] Carlier J.: The one-machine sequencing problem. European Journal of Operational Research (1982), 11: 42-47.
[12] Jurisch B.: Scheduling jobs in shops with multi-purpose machines. Ph.D. thesis, Universität Osnabrück, 1992.
[13] Hurink J., Jurisch B., Thole M.: Tabu search for the job-shop scheduling problem with multi-purpose machines. Operations Research Spektrum, vol. 15, 1994, pp. 205-215.
[14] Kubiak W., Blazewicz J., Formanowicz P., Breit J., Schmidt G.: Two-machine flow shops with limited machine availability. European Journal of Operational Research (2002), 136: 528-540.
[15] Lee C.Y.: Machine scheduling with an availability constraint. Journal of Global Optimization (1996), 9: 395-416.
[16] Glover F., Laguna M.: Tabu Search. Kluwer Publishers, Boston, 1997.
[17] Jurisch B.: Lower bounds for the job-shop scheduling problem on multi-purpose machines. Discrete Applied Mathematics, 58, 1995, 145-156.
[18] Lee C.Y.: Minimizing the makespan in two-machine flowshop scheduling with availability constraint.
Operations Research Letters (1997), 20: 129-139.
[19] Lee C.Y.: Two-machine flowshop scheduling with availability constraints. European Journal of Operational Research (1999), 114: 420-429.
[20] Lee K.M., Yamakawa T.: A genetic algorithm for general machine scheduling problems. Int. Conf. on Conventional and Knowledge-Based Electronics Systems, vol. 2, pp. 60-66, Australia, 1998.
[21] Mati Y., Xie X.: Un algorithme polynomial pour le job shop flexible avec blocage : cas de deux jobs [A polynomial algorithm for the flexible job shop with blocking: the two-job case]. MOSIM'2003, April 23-25, 2003, Toulouse, France.
[22] Mati Y., Xie X.: The complexity of two-job shop problems with multi-purpose unrelated machines. European Journal of Operational Research, 152(1), 159-169, 2004.
[23] Nowicki E., Smutnicki C.: A fast taboo search algorithm for the job-shop problem. Management Science (1996), vol. 42, no. 6, pp. 797-813.
[24] Schmidt G.: Scheduling on semi-identical processors. Z. Oper. Res. (1984), A28, 153-162.
[25] Schmidt G.: Scheduling independent tasks with deadlines on semi-identical processors. Journal of the Operational Research Society, 39, 271-277, 1988.
[26] Sanjay J., William J. Foley: Impact of interruptions of schedule execution in flexible manufacturing systems. The International Journal of Flexible Manufacturing Systems, 14, 319-344, 2002.
[27] Roy B., Sussmann B.: Les problèmes d'ordonnancement avec contraintes disjonctives [Scheduling problems with disjunctive constraints]. Technical Report 9 bis, SEMA, Paris, France, December 1964.
[28] Leon V.J., Wu S.D.: On scheduling with ready-times, due-dates and vacations. Naval Research Logistics, 39: 53-65, 1992.
[29] Schmidt G.: Scheduling with limited machine availability. European Journal of Operational Research (2000), 121, 1-15.
[30] Mauguière Ph., Billaut J.-C., Bouquard J.-L.: New single machine and job-shop scheduling problems with availability constraints. Journal of Scheduling, 2004.
[31] Zribi N.: Ordonnancement des job-shops flexibles sous contraintes de disponibilité des machines [Scheduling of flexible job-shops under machine availability constraints].
Ph.D. thesis, Ecole Centrale de Lille, France, 2005.

Nozha Zribi obtained an engineer diploma in 2002, a Master of Science and a European master in computer engineering in 2002, and a Ph.D. in operational research from the Ecole Centrale de Lille in 2005. Since 2007, she has been a research engineer. Her interests are logistics, software design and implementation, and SEA.

Luminiţa Duţă is currently an associate professor (lecturer) at "Valahia" University of Targoviste, Romania. She took her Ph.D. from the Université de Franche-Comté, Besançon (France), and from the Technical University "Politehnica" of Bucharest, in the automation and control field. Her research interests include the use of decision support systems in complex process control, and meta-heuristics. She is a member of IFAC Technical Committees 5.2 and 5.4.

Abdelkader El Kamel (M'96-SM'00) received the engineering diploma in 1990, the Ph.D. in 1993 and the "Habilitation à Diriger des Recherches" in 2000. He is currently a professor at the Ecole Centrale de Lille and a visiting professor in different countries (China, Chile, India and Tunisia). He is the author or coauthor of more than 100 journal articles, book chapters, plenary sessions, and communications in international conferences. His current research interests include intelligent control of complex systems under uncertainty, and the modeling, design and real-time monitoring of autonomous dynamical systems and mobile cooperative robots. Dr. El Kamel was the president of the IEEE-SMC conference held in Tunisia in 2002. He was program chair or IPC member of several IEEE, IFAC and WAC conferences, member of the AdCom of the IEEE France Section, president of the IEEE-SMC TC on "Complex Systems under Uncertainty", member of the IFAC TC 3.2 and president of the IEEE-SMC France Chapter.

International Journal of Computers, Communications & Control, vol. I (2006), no. 4, pp.
110-125

Ant Colony with Dynamic Local Search for the Time Scheduling of Transport Networks

Salah Zidi, Salah Maouche, Slim Hammadi

Abstract: This article presents an ant colony optimization approach for the time scheduling of public transport traffic. The assistance of a decision support system becomes necessary for the real-time regulation of such transport networks, since the size of the search space increases exponentially with the number of vehicles and stops. We therefore propose an ant colony algorithm with dynamic local search for the case of unpredictable disturbances. This approach consists in applying a local search window whose dimensions increase with the iterations. It treats the regulation problem as an optimization problem and provides the regulator with relevant decisions. A regulated timetable is proposed as a solution, aiming at minimizing the waiting time of passengers. We ensure the three most important criteria of regulation, which are punctuality, regularity and correspondence.

Keywords: urban transportation systems, real-time scheduling, ant colony optimization.

1 Introduction

In real time, a transport system can be affected in an unpredictable way by incidents which cause a delay between the theoretical time schedules and the real ones. In these conditions, we have to re-adjust the planning (timetable) made previously, in order to return quickly to the theoretical schedules: this is real-time regulation. Within this context we developed a scheduling algorithm which allows us to ensure the three most important criteria: punctuality, regularity of the passage of vehicles at stops, and correspondence between the lines of the different transport modes. It is an ant colony algorithm whose main goal is to find, for every movement between two successive stops, the delay which must be applied to the concerned vehicles in order to optimize these three regulation criteria.
At the beginning, our regulation algorithm encountered difficulties in escaping from local optima. In fact, the optimization problem which we have to solve is complicated and the search space is large. To manage these difficulties we propose the idea of dynamic local search: a new method which consists in applying a local search window whose dimensions increase with the iterations. This article contains five parts. In the first, we present the regulation problem. The second part is a presentation of the ant colony algorithm. In the third part we detail our regulation algorithm, and we also explain the simulation results of this approach on the urban transport network of Lille. We finish with a conclusion.

2 Real-time regulation

The planning process of a public transport company consists in establishing different timetables that describe trips according to the lines, frequencies, transport demand, and travel times in the network. These trips are then transformed into blocks and assigned to vehicles. A crew scheduling process finally follows this vehicle scheduling [1]. Hence, the vehicle schedules are fixed for every timetable period. This type of vehicle scheduling is called predictive scheduling. It is based on a periodic review of demand and resource availability in order to create arrival and departure times for the vehicles at the different stops of the network. However, in reality, travel times and transport demands are not fixed, because of random external influences that affect the traffic within the network and cause disturbances. These disturbances can be caused, for example, by traffic jams, accidents, or strikes.
Consequently, the theoretical schedules resulting from the planning process cannot be followed exactly, which compels trips to start late and makes customers wait longer. Therefore, to reduce the effects of the disturbances, the theoretical schedules have to be adapted to the real traffic conditions through regulation, or rescheduling, tasks [2]. This process is then called reactive scheduling. It consists in creating new schedules that increase the level of service by taking operational decisions, such as the injection of an extra vehicle into the network, or the deviation of the routes of some vehicles. The real-time traffic management of an urban transport system is presented in Figure 1.

Figure 1: process of real-time management of public transport traffic

Presently, it is a human operator, the regulator, who performs these real-time tasks and controls the global network traffic by treating the information provided by the Automatic Vehicle Monitoring (AVM) system and the vehicle drivers. The level of service can be represented by different regulation criteria, such as the regularity, punctuality, and connection criteria. The choice of the criteria depends on the regulation objectives, obtained from the nature of the disturbances. However, the regulator is usually overloaded with information, which complicates his decision-making task. In addition, despite the AVM system's assistance, the regulator spends more than 50% of his work time in communication with the vehicle drivers. Hence, the regulator has to carry out difficult tasks that are often beyond the human scale, especially if many disturbances occur simultaneously, which calls for the assistance of a decision support system [3]. Within this context, researchers began to think about the development of computer systems for the regulators, using different tools such as fuzzy logic [4], multi-agent systems [5] and evolutionary algorithms [6].
In our work we apply an ant colony optimization approach, and we introduce a new idea of dynamic local search.

3 Ant colony optimization (ACO)

Ant colonies are able to organize their foraging behavior in a seemingly efficient way without any centralized control [7]. This self-organizing structure is carried out via stigmergic communication, i.e. communication by changing the environment, in this case by laying down pheromone trails. Initially, ants have no idea of where the food is in the environment, so they wander randomly, leaving a pheromone trail. When an ant finds food, it wanders back to the nest. Initially these paths will be arbitrary, but when an ant follows a shorter path it will be able to follow that path more often within the same time period than an ant following a longer path, so there is a positive reinforcement process whereby the shorter paths get stronger. A simple version of this is illustrated in Figure 2, where ants have two possible routes from a nest to a food source. If two ants set out at the same time, one taking route A and one taking route B, which is twice as long, then the ant taking A will have traveled back and forth between the nest and the food source twice in the same time that the other ant has traveled back and forth once. Therefore there will be a stronger pheromone trail on route A than on route B. This idea can be effectively scaled up to solving route-finding problems such as the TSP, with performance as good as or better than existing heuristics.

Figure 2: a simple ant foraging problem.

Ant colony algorithms, inspired by natural ants, were invented in 1992 by Marco Dorigo of the Free University of Brussels, in his Ph.D. thesis [8]. In one iteration of this algorithm, N ants build N solutions according to decisions based on heuristic criteria and on the quantity of pheromone. This quantity is updated by examining the solutions.
It is strengthened for the decisions having given better solutions and decreased for the others. This mechanism allows the solutions to improve gradually over the iterations. Ant colony optimization has been applied to diverse optimization problems, such as the traveling salesman problem [9], robotics problems [10], industrial problems [11], the CARP (capacitated arc routing problem) [12] and the knapsack problem [13]. There are also several versions of the ant colony algorithm, such as the Ant System [14], the Min-Max Ant System [15], ASrank [16] and the Ant Colony System [9].

4 Rescheduling algorithm

4.1 Notation

We use the following notation on the regulation horizon:

SH: set of stops of the regulation horizon.
VH: set of the vehicles in the regulation horizon.
s_k^r: k-th stop situated on line r.
v_i^l: i-th vehicle on line l.
veh+(v_i^l, s_j^m, s_k^r): first successor of v_i^l traveling from s_j^m to s_k^r.
al_ij^lm: stop variable of v_i^l at s_j^m.
x_ijk^lmr: destination variable of v_i^l from s_j^m to s_k^r.
ta_ij^lm: arrival time of v_i^l at s_j^m.
td_ij^lm: departure time of v_i^l from s_j^m.
nmont_ij^lm: number of persons who board v_i^l at s_j^m.
ndesc_ij^lm: number of persons who alight from v_i^l at the stop s_j^m.
ρ_ii′j^ll′m: rate of correspondence from v_i^l to v_i′^l′ at s_j^m.
µ(∆t_i, s_j^m, s_k^r): arrival rate of the passengers traveling between these stops during ∆t_i.
n_i′jk^l′mr: number of passengers in v_i′^l′ traveling from s_j^m to s_k^r.
y_ii′j^ll′m: correspondence variable from v_i^l to v_i′^l′ at the stop s_j^m (equal to 1 if a correspondence is possible and 0 otherwise).
w_ii′j^ll′m: number of persons transferring from v_i^l to v_i′^l′ at s_j^m.
c_ij^lm: load of v_i^l at its departure from s_j^m.
AT: total waiting time of the passengers in the regulation horizon.
AT0: initial waiting time of passengers at the different stops of the regulation horizon, according to the disturbed schedules.
TT: total duration of transfers in the regulation horizon.
TT0: initial transfer (connection) time between the different vehicles of the regulation horizon, according to the disturbed schedules.
RT: total duration of routes in the regulation horizon.
RT0: initial route time for the different vehicles of the regulation horizon, according to the disturbed schedules.

4.2 Criteria of regulation

To obtain a good quality of service in a transport network, several criteria must be ensured during off-line planning and on-line regulation, such as safety, regularity, and punctuality. In the present problem, we chose the criteria of regularity, punctuality and correspondence: they are the most important and the most used by regulators and researchers.

Regularity. This criterion expresses the preservation of the regularity of the time intervals which separate the successive passings of vehicles. It concerns the minimization of the passenger wait at stops. The calculation of the traveler wait at a stop s_j^m depends on the interval separating two successive vehicles and on the number of travelers at this stop. We suppose that, in a given period of the day, v_i′^l′ is the vehicle succeeding v_i^l at the stop s_j^m. The time interval which separates the two passages is:

∆t = ta_i′j^l′m − td_ij^lm   (1)

We consider the distribution of the passenger arrivals, µ_sjm(t), at the stop s_j^m. We can then calculate, according to Figure 3, the passenger wait during ∆t:

attente(∆t, s_j^m) = ∫_0^∆t µ_sjm(t)(∆t − t) dt   (2)

The distribution of the passenger arrivals at stops is often considered as a non-stationary process [6]. However, for reduced intervals (2 to 4 minutes), or for intervals situated in homogeneous periods, we can consider a constant passenger flow µ_sjm.
Consequently, the number of persons who arrive at s_j^m during ∆t is µ_sjm × ∆t, and the average wait becomes:

attente(∆t, s_j^m) = µ_sjm × ∆t² / 2   (3)

Figure 3: distribution of the passenger arrivals at a stop

In the case of a wider interval not belonging to a homogeneous period, the interval can be divided into several reduced intervals to simplify the calculation of the traveler waits: ∆t = ⋃_{i=1..n} ∆t_i. The number of persons arriving at s_j^m during ∆t_i, with an arrival rate equal to µ_i, is µ_i × ∆t_i. Their average wait is then:

attente(∆t_i, s_j^m) = µ_i × ∆t_i × (∆t_i/2 + Σ_{i′=i+1}^n ∆t_i′)   (4)

Indeed, the average wait of the travelers who arrived during ∆t_i is (∆t_i/2 + Σ_{i′=i+1}^n ∆t_i′). We can thus calculate the average wait during ∆t at the stop s_j^m:

attente(∆t, s_j^m) = Σ_{i=1}^n µ_i × ∆t_i × (∆t_i/2 + Σ_{i′=i+1}^n ∆t_i′)   (5)

We can now formulate, in the following equation, the total wait at the stop s_j^m of the travelers who go to s_k^r during the interval ∆t which separates the successive passages of the two vehicles v_i^l and v_i′^l′ (with v_i′^l′ = veh+(v_i^l, s_j^m, s_k^r)):

attente(∆t, s_j^m, s_k^r) = Σ_{i=1}^n µ(∆t_i, s_j^m, s_k^r) × ∆t_i × (∆t_i/2 + Σ_{i′=i+1}^n ∆t_i′)   (6)

The waiting time of all the passengers at s_j^m is then the sum of the waits over all the vehicles which pass by this stop, as described below:

attente(s_j^m) = Σ_{v_i^l ∈ VH} (al_ij^lm × Σ_{s_k^r > s_j^m} attente(ta_i′j^l′m − td_ij^lm, s_j^m, s_k^r))   (7)

Finally, since the criterion of regularity concerns the total wait AT of passengers at the stops of the regulation horizon, AT is obtained by adding the waits at the different concerned stops (equations 8 and 9).
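The per-interval waiting formulas can be checked numerically. A minimal sketch of equations (3) and (5), assuming a constant arrival rate within each sub-interval (the function names are ours):

```python
# Passenger waiting time at a stop over the headway between two vehicles.
# Constant arrival rate mu (Equation 3): wait = mu * dt**2 / 2.
# Piecewise-constant rates mu_i over sub-intervals dt_i (Equation 5):
# each piece contributes mu_i * dt_i * (dt_i / 2 + time remaining after it).

def wait_constant(mu, dt):
    return mu * dt ** 2 / 2

def wait_piecewise(pieces):
    """pieces: [(mu_i, dt_i), ...] covering the headway in order."""
    total, remaining = 0.0, sum(dt for _, dt in pieces)
    for mu, dt in pieces:
        remaining -= dt                      # time left after this piece
        total += mu * dt * (dt / 2 + remaining)
    return total

print(wait_constant(2.0, 4.0))                    # 16.0 passenger-minutes
print(wait_piecewise([(2.0, 2.0), (2.0, 2.0)]))   # 16.0: both formulas agree
```

With a uniform rate, the subdivided formula collapses to the single-interval one, as expected from the derivation above.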
AT = Σ_{s_j^m ∈ SH} attente(s_j^m)   (8)

AT = Σ_{s_j^m ∈ SH} Σ_{v_i^l ∈ VH} (al_ij^lm × Σ_{s_k^r > s_j^m} attente(ta_i′j^l′m − td_ij^lm, s_j^m, s_k^r))   (9)

Correspondence. The correspondence criterion is associated with the duration of transfers between vehicles in the disrupted zone. We can suppose that the number of persons in transfer at a stop s_j^m is proportional to the number of passengers who alight at this stop, with rate ρ_ii′j^ll′m, so w_ii′j^ll′m = ρ_ii′j^ll′m × ndesc_ij^lm. We can deduce the total duration of transfers TT, which is equal to the sum of the durations of the correspondences between the various vehicles at the concerned stops of the network:

TT = Σ_{v_i^l ∈ VH} Σ_{v_i′^l′ ∈ VH} Σ_{s_j^m ∈ SH} y_ii′j^ll′m × w_ii′j^ll′m × (td_i′j^l′m − ta_ij^lm)   (10)

Punctuality. The punctuality criterion deals with the route duration of the different vehicles. It is computed after an estimation of the vehicle loads via the arrival rates of the passengers at the stops, and also the initial real loads, which are assumed known. Hence,

RT = Σ_{v_i^l ∈ VH} Σ_{s_j^m ∈ SH} al_ij^lm × c_ij′^lm′ × (td_ij^lm − td_ij′^lm′)   (11)

In fact, the loads are determined by the numbers of alighting and boarding persons, according to the arrival rates or to the origin-destination matrix. This can be written as

c_ij^lm = c_ij′^lm′ − ndesc_ij^lm + nmont_ij^lm   (12)

4.3 Principle of the rescheduling algorithm

At first, we have a set of stops and routes of the disrupted zone. We add, for every inter-stop (the route between two successive stations), a set of fictitious arcs which have no physical existence but which we consider as delays to be applied on this route. In Figure 4, for example, we dispose of 5 arcs between every two successive stops. The first arc represents the real arc, with 0 minutes of delay, and the others carry 1, 2, 3 or 4 minutes of delay, which can be spent at one of the two stops or on the route between them.
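The delay-arc encoding just described can be sketched as follows; the timetable representation is hypothetical, chosen only to show how the chosen arcs shift a vehicle's subsequent passage times:

```python
# Decision arcs of Figure 4: between two successive stops a vehicle may take
# one of k = 5 arcs carrying 0..4 minutes of extra delay. An ant assigns one
# arc per (vehicle, inter-stop); applying them shifts every later passage
# time of the vehicle by the accumulated delay.

def apply_delays(timetable, delays):
    """timetable: {vehicle: [passage times]}; delays: {vehicle: [arc choices]}."""
    regulated = {}
    for vehicle, times in timetable.items():
        acc, out = 0, [times[0]]
        for t, d in zip(times[1:], delays[vehicle]):
            acc += d                         # delays accumulate downstream
            out.append(t + acc)
        regulated[vehicle] = out
    return regulated

theoretical = {"bus7": [0, 5, 9, 14]}
print(apply_delays(theoretical, {"bus7": [2, 0, 1]}))  # {'bus7': [0, 7, 11, 17]}
```

An ant's complete solution is then one such delay vector per vehicle of the disrupted zone, and the regulated timetable is what the criteria of Section 4.2 are evaluated on.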
Figure 4: graphic presentation of the decision arcs

The objective of the ant colony algorithm is to find the delay to apply to every vehicle, with the aim of finding schedules that satisfy the different criteria, and then to propose a regulated timetable. So, for every vehicle, ants move between stops and search for the best arcs (delays) to take, up to the arrival terminus. An ant has to propose all the delays which must be applied between the departure and the arrival.

4.4 Objective function

The objective function to optimize is an aggregation of three criteria representing the total travel time, the total waiting time, and the total transfer time. This aggregation relies on weight parameters representing the relative importance of the criteria according to the disturbances and the regulator's objectives. For instance, if no connection is involved in the disturbance, the weight parameter associated with the transfer criterion would surely be null. Hence, the objective function to maximize is written as

f = Σ_{c=1}^3 α_c Cr_c   (13)

where Cr_1 = (AT0 − AT) for the regularity, Cr_2 = (TT0 − TT) for the transfer (correspondence), Cr_3 = (RT0 − RT) for the punctuality, and α_1, α_2 and α_3 are the weight parameters for the regularity, transfer and punctuality criteria, respectively, with Σ_{c=1}^3 α_c = 1.

4.5 Generation of a new solution

From a stop s, with probability p, an ant uses the first method, which chooses an arc i with the probability p_1 described by equation (14). In this case we give the same probability to the k arcs (in our case k = 5) of being chosen. This choice allows a better exploration of the search space.

p_1 = 1/k if i ∈ Ω_s, 0 otherwise   (14)

With probability 1 − p, the ant uses the second method, more intelligent than the first, which relies on the pheromone trail τ(t): it chooses an arc according to the probability given by equation (15).
p_2 = \tau_i(t) / \sum_{j=1}^{k} \tau_j(t) if i \in \Omega_s, 0 otherwise   (15)

where \Omega_s is the set of arcs that have stop s as departure.

4.6 Update of the pheromone trail

By analogy with nature, every ant leaves a quantity of pheromone on every chosen arc. We reinforce the pheromone trail on the chosen arcs while taking evaporation into account. This is represented by equation (16), which includes a persistence term \rho and a reinforcement term \Delta\tau_a, the quantity added by an ant a:

\tau_i(t + 1) = \rho \tau_i(t) + \sum_{all ants} \Delta\tau_a   (16)

where \Delta\tau_a = F_a is the objective function value of the solution proposed by ant a.

4.7 Algorithm structure

Every ant moves from one stop to the next using the solution-generation method described previously, adding each chosen arc to its road until the terminus stop; then the ant starts again from the departure stop for every vehicle of the disrupted zone. As soon as the ant has chosen the regulation decision (the delay) to apply to every vehicle, we build the new regulated timetable and compute the three criteria (regularity, punctuality and transfer) and then the objective function, which we compare with those found by the previous ants, keeping the best (maximal) one. When all the ants have treated all the transport services, we update the pheromone trail on every arc. The algorithm structure is presented in figure 5.

Figure 5: Ant colony algorithm for the regulation

We stop the algorithm when the best solution found by the ants has remained unchanged for a number of iterations I1, or when a maximum number of iterations I2 is reached.

4.8 Dynamic local search

Metaheuristic algorithms have shown an important efficiency in escaping from local optima. However, many difficulties persist for the more intricate problems and for wide search spaces. In the first algorithm, described above, we proposed the second generation method based on this search idea.
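The arc-selection rule of equations (14)-(15) and the pheromone update of equation (16) can be sketched as follows. The function names, the persistence value and the deposited quantity are assumptions for illustration:

```python
import random

# Illustrative sketch of equations (14)-(16); parameter values are assumed.

def choose_arc(tau, p_explore, rng=random):
    """With probability p, pick uniformly among the k delay arcs (eq. 14);
    otherwise pick proportionally to the pheromone trail (eq. 15)."""
    k = len(tau)
    if rng.random() < p_explore:
        return rng.randrange(k)          # uniform exploration, prob. 1/k
    total = sum(tau)
    r = rng.random() * total             # roulette-wheel on tau_i(t)
    acc = 0.0
    for i, t in enumerate(tau):
        acc += t
        if r < acc:
            return i
    return k - 1

def update_pheromone(tau, contributions, rho=0.9):
    """tau_i(t+1) = rho * tau_i(t) + sum over ants of delta_tau_a (eq. 16),
    where delta_tau_a = F_a is deposited on the arcs the ant chose."""
    return [rho * t + sum(f for arc, f in contributions if arc == i)
            for i, t in enumerate(tau)]

tau = [1.0] * 5                          # k = 5 decision arcs (0..4 minutes)
arc = choose_arc(tau, p_explore=0.1)
tau = update_pheromone(tau, [(arc, 160.0)])  # one ant deposits F_a = 160
```

In the full algorithm this choice is repeated at every inter-stop and for every vehicle of the disrupted zone; the sketch shows a single decision.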
According to our early execution tests of the second algorithm, we noticed that the non-zero decisions (the delays to be applied) are always located in one part of the disrupted spatiotemporal zone. So, we adopted another approach based on local search, which is one of the most frequently applied ideas with ant colony algorithms [12] as well as with other heuristics such as genetic algorithms [17]. This idea, which shows promising results, is based on a procedure that aims to improve the solutions already found: it restricts the search to the area where the early solutions are located. Nevertheless, we still risk getting trapped in regions of local optima, which leads to poor-quality solutions. So, we used a new idea of dynamic local search. It restricts the search area by means of spatiotemporal windows whose dimensions increase with the iterations. We begin the investigation in a small zone of the search space and look for decisions that can differ from zero inside this zone, while applying only the zero-delay decision on the rest of the search space. After some iterations, we increase the spatiotemporal dimensions of this zone and investigate the search space for new solutions. We then compare the results and keep increasing the local search zone, every few iterations, until its dimension equals the complete space.

Figure 6: Rescheduling ant colony algorithm with dynamic local search

Figure 6 shows our improved algorithm compared to the one illustrated in figure 5. This algorithm includes the search process with the growing zone. In fact, we look for optimal decisions between stations s1 and s2 (spatial limitation) and between times t1 and t2 (temporal limitation).
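The growing spatiotemporal window described above can be sketched as follows. The step size (one station and one time unit per side) and the example limits are assumptions for illustration:

```python
# Sketch of the growing spatiotemporal window of the dynamic local search.

def grow_window(window, n_stations, t_min, t_max):
    """Widen [s1, s2] by one station on each side and [t1, t2] by one time
    step on each side, clamped to the complete search space."""
    s1, s2, t1, t2 = window
    return (max(0, s1 - 1), min(n_stations - 1, s2 + 1),
            max(t_min, t1 - 1), min(t_max, t2 + 1))

# Start around the disturbance and enlarge every few iterations until the
# window covers the complete space; decisions outside the window stay 0.
window = (3, 4, 10, 12)
for _ in range(4):
    window = grow_window(window, n_stations=8, t_min=0, t_max=30)
print(window)  # (0, 7, 6, 16)
```

After enough growth steps the spatial window spans all stations, at which point the procedure coincides with the algorithm without local search.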
We assign the decision zero outside of this zone. After some iterations, we remove a station from s1 and add a station to s2 to enlarge the spatial zone, and we do the same for the temporal window. Since the pheromone trails are not re-initialized, the solutions of the first zone are reinforced more than the others and therefore have a greater chance of being chosen.

5 Simulation and results

We applied this work to scenarios inspired by a real transportation system in Lille, in the north of France. We used an algorithm with 100 ants, two maximal numbers of iterations I1 = 10 and I2 = 500, and a probability p = 10%.

5.1 Scenario 1

Line 0 has a frequency of one bus every 10 minutes. A disturbance, caused by the vehicle v_3^0, is detected at tpert = 12:01 am, at its departure from the stop s_2^0. The incident consists of a traffic accident between two cars, which slows down the vehicles. The delay of the disrupted vehicle at its arrival at s_3^0 is estimated at 5 minutes. We suppose that no correspondence is involved in the disturbance, so we are only interested in the regularity and punctuality criteria. We assume that the studied horizon is included in a homogeneous period of the day; the arrival rate of passengers can then be taken constant at all the stations. Let \mu = 2 passengers per minute.

Example 1 (\alpha_1 = \alpha_2 = 0, \alpha_3 = 1, F = 0):

        s_0^0 s_1^0 s_2^0 s_3^0 s_4^0 s_5^0 s_6^0 s_7^0
v_0^0     0     0     0     0     0     0     0     0
v_1^0     0     0     0     0     0     0     0     0
v_2^0     0     0     0     0     0     0     0     0
v_3^0     0     0     0     0     0     0     0     0
v_4^0     0     0     0     0     0     0     0     0
v_5^0     0     0     0     0     0     0     0     0
v_6^0     0     0     0     0     0     0     0     0

Example 2 (\alpha_1 = 1, \alpha_2 = \alpha_3 = 0, F = 160):

        s_0^0 s_1^0 s_2^0 s_3^0 s_4^0 s_5^0 s_6^0 s_7^0
v_0^0     0     0     0     0     0     0     0     0
v_1^0     0     0     0     0     0     0     0     0
v_2^0     0     0     0     0     0     2     1     0
v_3^0     0     0     0     0     0     0     0     0
v_4^0     0     0     0     3     0     0     0     0
v_5^0     0     0     0     1     0     0     0     0
v_6^0     0     0     0     0     0     0     0     0

Table 1: Results of the monocriterion regulation

Table 1 shows the delays to be applied to the vehicles of the regulation horizon. They constitute the regulation decisions proposed by our ant colony algorithm.
In the first two examples, we consider a monocriterion regulation where we optimize only the punctuality in the first and the regularity in the second. In example 1, the optimal solution for the punctuality corresponds to null decisions in all the cells of the table, so the objective function F is null. In example 2 the algorithm is allowed to delay vehicles in order to adjust the intervals that separate them. We notice that vehicles v_2^0 and v_4^0, before and after the disrupted vehicle, were delayed 3 minutes, and v_5^0 was delayed 1 minute.

Figure 7: Vehicle schedules for example 2 of scenario 1

Figure 7 shows the ability of the scheduling algorithm to ensure the regularity of the vehicles in the perturbed zone.

Example 3 (\alpha_1 = 0.9, \alpha_2 = 0, \alpha_3 = 0.1, F = 20.89):

        s_0^0 s_1^0 s_2^0 s_3^0 s_4^0 s_5^0 s_6^0 s_7^0
v_0^0     0     0     0     0     0     0     0     0
v_1^0     0     0     0     0     0     0     0     0
v_2^0     0     0     0     0     1     0     0     0
v_3^0     0     0     0     0     0     0     0     0
v_4^0     0     0     0     1     0     0     0     0
v_5^0     0     0     0     0     0     0     0     0
v_6^0     0     0     0     0     0     0     0     0

Example 4 (\alpha_1 = 0.98, \alpha_2 = 0, \alpha_3 = 0.02, F = 132.78):

        s_0^0 s_1^0 s_2^0 s_3^0 s_4^0 s_5^0 s_6^0 s_7^0
v_0^0     0     0     0     0     0     0     0     0
v_1^0     0     0     0     0     0     0     0     0
v_2^0     0     0     0     0     3     0     0     0
v_3^0     0     0     0     0     0     0     0     0
v_4^0     0     0     0     3     0     0     0     0
v_5^0     0     0     0     1     0     0     0     0
v_6^0     0     0     0     0     0     0     0     0

Table 2: Results of the multicriteria regulation

Example 3 favors the punctuality criterion even though \alpha_1 > \alpha_3. We notice a difference with example 2 in the number of non-zero decisions: only vehicles v_2^0 and v_4^0 have to be delayed, by 1 minute each, and we obtain an objective function F = 20.89. In example 4 the result is very similar to example 2: v_2^0 and v_4^0 were again delayed 3 minutes, but for vehicle v_2^0 the delay is applied at s_4^0 instead; since the number of passengers at these stops is higher, a delay there is more beneficial for the punctuality. We obtained F = 132.78. In this first scenario the execution time is always between 7 and 8 seconds.
5.2 Scenario 2

The disturbance is detected at tpert = 12:24. It is caused by a technical problem on the tram line T, obliging the tram v_3^t to stand still for 7 minutes at stop s_2^t. This tram line has a frequency of one vehicle every 10 minutes. The stop is situated 10 minutes away from a connection node, n, where a connection is planned at 12:40 with a bus of line B, which has a frequency of one bus every 20 minutes. However, because of the disturbance, the tram would arrive at n at 12:43, so the connection would not occur. We assume that the studied horizon is included in a homogeneous period of the day, so we can take a constant arrival rate of passengers at all the stations (\mu = 2). Additionally, we assume that the connection rates between the two concerned lines are constant. Hence, we suppose that the number of passengers in the tram arriving at the node and willing to take a bus of line B is proportional to the load of the tram, with a rate of 10%; the connection rate from the buses to the trams is 20%. We applied our algorithm to a first example that takes into account the regularity criterion (table 3). The best solution is obtained within 20 s, with a maximal value of the objective function F = 362 passenger-minutes, that is, a decrease in waiting of 10 minutes for more than 36 passengers. We notice the important number of decisions (delays) on the disrupted line T, needed to adjust the intervals before and after the disrupted vehicle.

Line T (\alpha_1 = 1, \alpha_2 = \alpha_3 = 0):

        s_0^t s_1^t s_2^t s_3^t   n   s_5^t s_6^t s_7^t s_8^t
v_0^t     0     0     0     0     0     0     0     0     0
v_1^t     0     0     0     3     0     0     3     0     0
v_2^t     0     0     0     0     0     0     0     0     0
v_3^t     0     0     3     2     0     0     0     1     0
v_4^t     0     0     0     3     0     0     1     0     0
v_5^t     0     0     0     0     0     0     0     2     2
v_6^t     0     0     0     0     0     0     0     0     0

Line B:

        s_0^b s_1^b s_2^b s_3^b   n   s_5^b s_6^b s_7^b s_8^b
v_0^b     0     0     0     0     0     0     0     0     0
v_1^b     0     0     0     0     0     0     0     0     0
v_2^b     0     0     0     0     0     0     0     0     0
v_3^b     0     0     0     0     0     0     0     0     0
v_4^b     0     0     0     0     0     0     0     0     0

Table 3: Results of example 1 for scenario 2

The second example concerns the transfer criterion (table 4).
So, the delays were applied before the stop n, and we obtained an objective function F = 208.

Line T (\alpha_1 = 0, \alpha_2 = 1, \alpha_3 = 0):

        s_0^t s_1^t s_2^t s_3^t   n   s_5^t s_6^t s_7^t s_8^t
v_0^t     0     0     0     0     0     0     0     0     0
v_1^t     0     0     0     0     0     0     0     0     0
v_2^t     0     0     0     0     0     0     0     0     0
v_3^t     0     0     0     3     3     0     0     0     0
v_4^t     0     0     0     0     0     0     0     0     0
v_5^t     0     3     0     3     0     0     0     0     0
v_6^t     0     0     0     0     0     0     0     0     0

Line B:

        s_0^b s_1^b s_2^b s_3^b   n   s_5^b s_6^b s_7^b s_8^b
v_0^b     0     0     0     0     0     0     0     0     0
v_1^b     0     0     0     0     0     0     0     0     0
v_2^b     0     0     0     2     0     0     0     0     0
v_3^b     0     0     0     0     0     0     0     0     0
v_4^b     0     0     0     0     0     0     0     0     0

Table 4: Results of example 2 for scenario 2

Figure 8: Vehicle schedules for example 2 of scenario 2

Figure 8 shows the ability of the scheduling algorithm to ensure the correspondence in the perturbed zone. We also executed the algorithm on another multicriteria example, with \alpha_1 = 0.4, \alpha_2 = 0.58 and \alpha_3 = 0.02. The number of non-zero decisions decreased because the punctuality criterion was taken into account, and we obtained F = 104.

5.3 Comparison between the two rescheduling ant colony algorithms

We applied both rescheduling algorithms, without local search and with dynamic local search, to real transport scenarios. Table 5 presents the results of these simulations. We notice that the algorithm with dynamic local search is faster: its execution time is always lower than that of the first algorithm, for the same number of iterations. The ant colony algorithm with dynamic local search also proposes an important improvement of the solutions with regard to the algorithm without local search.
                              ACA                 ACA with dynamic local search
Scenario  Example      F      Exec. time (ms)       F        Exec. time (ms)
1         1            0       7390                 0         7390
1         2          160       7157               182         6922
1         3           20.89    7047                20.89      6890
1         4          132.78    7468               132.94      6860
2         1          192      10094               212        10000
2         2          526.4    10150               533.99      8290
2         3          394      10000               391.956     9370
3         1          362      28000               362        29000
3         2          208      31000               232        22800
3         3          104.88   28930               104.88     28060
3         4          114.39      28.31            115.23     26040
4         1          192      10500               192         7828
4         2          290      10109               310         8578
4         3            0       5000                 0         5000
4         4           74.8     8700                74.8       8672

Table 5: Results of the two rescheduling ant colony algorithms

Regarding the optima found, we notice that the new algorithm is also effective: the results of the algorithm with dynamic local search are better than those of the other algorithm in 7 of the 15 examples. The local search saves execution time and improves the exploitation of the search space; indeed, it is easier and faster to look for optima in a smaller zone. This time parameter is very important for our real-time regulation problem. Moreover, the dynamic dimension allows more investigation than the classic local search idea.

6 Conclusion

A disturbance in a transport system affects, first of all, the vehicle schedules. In this article we presented an ant colony algorithm for the time-based regulation of a multimodal transport network, in real time. The results of this algorithm, applied to real scenarios of the transportation system of Lille, showed the efficiency of our approach. The execution time also matters: our objective is to quickly propose a solution before the disturbance propagates. For a set of five stops, for example, there are 4^4 = 256 possible solutions for every vehicle, so the problem is complex, and we also noticed convergence problems of our algorithm caused by the large dimension of the search space. The idea of the dynamic local search allowed us to reduce the execution times and improve the solutions, without decreasing the exploration and exploitation of the search space.
References

[1] D. Huisman, R. Freling, A. P. M. Wagelmans, "A dynamic approach to vehicle scheduling", Econometric Institute, Erasmus University, Rotterdam, The Netherlands, Rep. EI2001-17, 2001.

[2] S. Maouche, H. Laichour, S. Hayat, "Amélioration de la qualité de correspondances dans les réseaux de transport", rapport final GRRT, avril 2001.

[3] P. Borne, B. Fayech, S. Hammadi, S. Maouche, "Decision support system for urban transportation networks", IEEE SMC Part C: Applications and Reviews, special issue on decision technologies in honour of Prof. Madan Singh, Vol. 33, No. 1, pp. 67-77, 2003.

[4] A. Soulhi, "Contribution de l'intelligence artificielle à l'aide à la décision dans la gestion des systèmes de transport urbain", thèse de doctorat, Université des Sciences et Technologies de Lille, France, 2000.

[5] H. Laichour, "Modélisation multi-agent et aide à la décision : application à la régulation des correspondances dans les réseaux de transport urbain", thèse de doctorat, Université des Sciences et Technologies de Lille, France, 2002.

[6] B. Fayech, "Régulation des réseaux de transport multimodal : systèmes multi-agent et algorithmes évolutionnistes", thèse de doctorat, Université des Sciences et Technologies de Lille, France, 2003.

[7] J. D. Moss, C. G. Johnson, "An ant colony algorithm for multiple sequence alignment in bioinformatics", in D. W. Pearson, N. C. Steele, R. F. Albrecht (eds.), Artificial Neural Networks and Genetic Algorithms, pp. 182-186, Springer, April 2003.

[8] M. Dorigo, "Optimization, Learning and Natural Algorithms", Ph.D. thesis, Politecnico di Milano, Italy (in Italian), 1992.

[9] M. Dorigo, L. M. Gambardella, "Ant colonies for the traveling salesman problem", BioSystems, 43:73-81, 1997. Also Technical Report TR/IRIDIA/1996-3, IRIDIA, Université Libre de Bruxelles.

[10] N. Monmarché, "Algorithme de fourmis artificielles : application à la classification et à l'optimisation", thèse, Université François Rabelais, Tours, 2000.

[11] C. Gagné, W. L. Price, "Optimisation par colonie de fourmis pour un problème d'ordonnancement industriel avec temps de réglages dépendants de la séquence", 3e Conférence Francophone de Modélisation et Simulation MOSIM'01, Troyes, France, 2001.

[12] P. Lacomme, C. Prins, A. Tanguy, "Optimisation par colonies de fourmis pour les tournées sur arcs", 4e Conférence Francophone de Modélisation et Simulation MOSIM'03, Toulouse, France, 2003.

[13] I. Alaya, C. Solnon, K. Ghédira, "Algorithme fourmi avec différentes stratégies phéromonales pour le sac à dos multidimensionnel", MHOSI'05, Hammamet, Tunisie, 2005.

[14] M. Dorigo, V. Maniezzo, A. Colorni, "The Ant System: optimization by a colony of cooperating agents", IEEE Transactions on Systems, Man and Cybernetics, Part B, 26(1):29-41, 1996.

[15] T. Stützle, H. Hoos, "MAX-MIN Ant System and local search for combinatorial optimization problems", in S. Voss, S. Martello, I. H. Osman, C. Roucairol (eds.), Meta-Heuristics: Advances and Trends in Local Search Paradigms for Optimization, pp. 137-154, Kluwer, Boston, 1998.

[16] B. Bullnheimer, R. F. Hartl, C. Strauss, "A new rank-based version of the Ant System: a computational study", Technical Report POM-03/97, Institute of Management Science, University of Vienna, 1997; accepted for publication in the Central European Journal for Operations Research and Economics.

[17] P. Lacomme, C. Prins, W. Ramdane-Chérif, "Competitive genetic algorithms for the capacitated arc routing problem and its extensions", in E. J. W. Boers et al. (eds.), Applications of Evolutionary Computing, pp. 473-483, Lecture Notes in Computer Science 2037, Springer, 2001.
Acknowledgement: The authors wish to acknowledge the support of the EU through the FEDER grant # obj2-2005/3-4.1-253presrge-7820.

Salah Zidi, Salah Maouche and Slim Hammadi
Université des Sciences et Technologies de Lille and Ecole Centrale de Lille
UFR I.E.E.A., Bâtiment P2, Bureau 308, Cité Scientifique
59655 Villeneuve d'Ascq Cedex, France
E-mail: salah.zidi@ed.univ-lille1.fr, salah.maouche@univ-lille1.fr, slim.hammadi@ec-lille.fr

Received: November 11, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 634-641

Parallel Simulation of Quantum Search

S. Caraiman, V. Manta

Simona Caraiman, Vasile Manta
"Gheorghe Asachi" Technical University of Iasi
Romania, 700050 Iasi, 27 Dimitrie Mangeron
E-mail: {sarustei,vmanta}@cs.tuiasi.ro

Abstract: Simulation of quantum computers using classical computers is a computationally hard problem, requiring a huge amount of operations and storage. Parallelization can alleviate this problem, allowing more qubits to be simulated at the same time, or the same number of qubits to be simulated in less time. A promising approach is to execute these simulators on grid systems that can provide access to high performance resources. In this paper we present a parallel implementation of the qc-lib quantum computer simulator deployed as a grid service. Using a specific scheme for partitioning the terms describing quantum states, and an efficient parallelization of the general single qubit operator and of the controlled operators, very good speed-ups were obtained for the simulation of the quantum search problem.

Keywords: quantum computer simulation, parallel computing, quantum search.

1 Introduction

The research in quantum informatics has gained immense interest due to the remarkable results obtained for the factorization [11] and search [6] problems. These results prove the huge computational power of a quantum machine with respect to classical computers.
However, building quantum computers represents an immense technological challenge and, at present, quantum hardware is only available in research labs. Under these circumstances quantum simulators have become valuable instruments for developing and testing quantum algorithms and for simulating the physical models used in the implementation of a quantum processor. According to Feynman's paper [3], classical computers will never be able to simulate quantum systems in polynomial time. The simulation of 29 qubits (quantum bits) uses 32 GB of memory [1], and any additional qubit doubles the resources needed: time, memory, computational power and space. In this paper we present a solution based on grid computing for the quantum simulation problem. Our simulator relies on parallel processing for storing quantum states and applying quantum operators. Deploying this solution on grid systems provides access to high performance computing devices for simulation, and availability in the context of collaboration through the means of virtual organizations. Our quantum simulator, GQCL, partitions the terms corresponding to a quantum state between several processing nodes using a scheme that minimizes communication between nodes during the application of quantum operators. In a previous paper [1] we described the development of a grid service that provides this functionality to client applications by enabling the Quantum Computation Language [9] through a parallel implementation of the qc-lib simulator [8]. The results recorded for the application of the Hadamard transform illustrate the performance of this approach [1]. In the following we present the parallelization of the general single qubit operator, of the controlled operators and of the measurement process. This allows us to study the performance of our simulator on the quantum search problem.
Copyright © 2006-2010 by CCC Publications

2 Basic concepts in quantum computing

The quantum analogue of the classical bit is the qubit. A qubit is a quantum system whose states can be completely described by superpositions of two orthonormal basis states, labeled |0⟩ and |1⟩ (in a Hilbert space H = C^2, |0⟩ = (1 0)^T, |1⟩ = (0 1)^T). Any state |ψ⟩ can be described by

|ψ⟩ = α|0⟩ + β|1⟩, |α|^2 + |β|^2 = 1, (1)

where α and β are complex numbers. Thus, unlike the classical bit, the qubit can also be in a state different from |0⟩ and |1⟩: linear combinations of states can be formed, called superpositions (eq. 1). When measuring a qubit, either the result 0 is obtained, with probability |α|^2, or 1, with probability |β|^2. The sum of the probabilities must be 1, so the state of a qubit is a unit vector in a complex two-dimensional vector space. A collection of n qubits is called a quantum register of dimension n. The general state of an n-qubit register is

|ψ⟩ = \sum_{i=0}^{2^n-1} a_i |i⟩, (2)

where a_i ∈ C and \sum_{i=0}^{2^n-1} |a_i|^2 = 1. This means that the state of an n-qubit register is represented by a complex unit vector in the Hilbert space H_{2^n}. The quantum analogue of the classical NOT gate is labeled X and can be defined such that X|0⟩ = |1⟩ and X|1⟩ = |0⟩. The quantum NOT gate acts similarly to its classical counterpart although, unlike in the classical case, its action is linear: the state α|0⟩ + β|1⟩ is transformed into the corresponding state β|0⟩ + α|1⟩. A convenient way of representing the action of the quantum NOT gate is in matrix form:

X = ( 0 1
      1 0 ). (3)

Controlled gates are quantum logic gates acting on more than one qubit. The notion of controlled gate allows the implementation of if-else constructs. Quantum controlled gates use a control qubit to determine whether a specific unitary action is applied to a target qubit. The controlled-NOT operator (CNOT) is the prototypical multi-qubit gate.
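The single-qubit formalism of equations (1) and (3) can be checked numerically; a minimal sketch, with amplitudes chosen purely for illustration:

```python
import math

# Numerical sketch of equation (1): a single-qubit state as a pair of
# complex amplitudes.

alpha, beta = 1 / math.sqrt(2), 1j / math.sqrt(2)
assert abs(abs(alpha) ** 2 + abs(beta) ** 2 - 1.0) < 1e-12  # normalization

# Measurement probabilities: outcome 0 with |alpha|^2, outcome 1 with |beta|^2.
p0, p1 = abs(alpha) ** 2, abs(beta) ** 2
print(round(p0, 3), round(p1, 3))  # 0.5 0.5

# The X (quantum NOT) gate of equation (3) swaps the amplitudes:
# alpha|0> + beta|1>  ->  beta|0> + alpha|1>.
alpha, beta = beta, alpha
```

The swap in the last line is exactly the linear action described in the text, applied to both basis components at once.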
The first parameter of a CNOT gate is the control qubit. If this qubit is in state |0⟩, the target qubit is left unchanged, and if the control qubit is in state |1⟩, the target qubit is flipped: |00⟩ → |00⟩; |01⟩ → |01⟩; |10⟩ → |11⟩; |11⟩ → |10⟩. The CNOT operator is a generalization of the classical XOR, since its action can be summarized as |x, y⟩ → |x, x ⊕ y⟩, where ⊕ is addition modulo two. The matrix representation of CNOT is:

CNOT = ( 1 0 0 0
         0 1 0 0
         0 0 0 1
         0 0 1 0 ). (4)

There are several other multi-qubit gates; nevertheless, the controlled-NOT gate and the single qubit gates represent the prototypes for any other quantum gate because of the following remarkable universality result: any multi-qubit gate can be built out of CNOT gates and single qubit gates. This statement is the quantum analogue of the universality of the classical NAND gate.

3 Parallel simulation of quantum computation

The state of an n-qubit register is represented by a complex unit vector in the Hilbert space H_{2^n}. Storing a complex number a = x + iy on a classical computer requires storing the pair of real numbers (x, y), for which the 8-byte representation is preferred. Thus, in order to store an n-qubit quantum register on a conventional (classical) computer, at least 2^{n+4} bytes are required. The memory needed for simulating an n-qubit quantum computer grows exponentially with n. For example, when n = 24 (n = 36), at least 256 MB (1 TB) of memory is required to store a single arbitrary state |ψ⟩. The time evolution of an n-qubit quantum register is determined by a unitary operator defined on the H_{2^n} space. The matrix dimension is 2^n × 2^n. In general, 2^n × 2^n space and 2^n(2^{n+1} − 1) arithmetic operations are needed to execute such an evolution step.
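The CNOT action |x, y⟩ → |x, x ⊕ y⟩ described above can be sketched as a permutation of state-vector amplitudes. The bit convention (control = high-order bit) matches the matrix of equation (4); the code itself is an illustration, not the paper's implementation:

```python
# Sketch of CNOT on a 2-qubit state vector indexed by basis states
# 00, 01, 10, 11 (control = high-order bit, as in equation (4)).

def cnot(state):
    out = [0j] * 4
    for idx, amp in enumerate(state):
        x, y = idx >> 1, idx & 1          # control and target bits
        out[(x << 1) | (x ^ y)] += amp    # flip target only when control is 1
    return out

state = [0, 0, 1, 0]   # |10>
state = cnot(state)    # amplitude moves to |11>
```

Applying `cnot` twice returns any state to itself, reflecting the fact that CNOT is its own inverse.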
Thus, the simulation of a quantum computer using a classical device represents a computationally hard problem, and memory and processor capacity place drastic limitations on the size of the quantum computer that can be simulated. Because of the exponential behavior of quantum systems, simulation on classical computers requires exponential memory space and the execution of an exponential number of operations. It is obvious that the simulation of quantum problems of interesting sizes requires high performance computing devices. Parallel computing can represent a solution to this problem [5, 7, 10, 13]. Nevertheless, the development of a quantum simulator must consider another important aspect: it has to be easily accessible. This conflicts with the first requirement, that it be parallel, which deeply restricts the group of potential users. A solution based on the concept of grid systems can be used to resolve this contradiction and to provide the scientific community with a useful and easily accessible instrument. The grid concept addresses the problem of coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations [4]. A grid enabled quantum computer simulator is GQCL [1]. This simulator allows the use of the QCL [9] quantum programming language to implement quantum algorithms, and the quantum programs are executed using a parallel version of the qc-lib simulation library [8]. A specific data partitioning scheme and efficient storage of quantum states yield the very good speedup and efficiency of this parallel implementation, which will be discussed in the following.

3.1 Overview of the GQCL grid service for quantum simulation

Our quantum computer simulator is based on the qc-lib simulation library, which provides a framework to execute programs written in a quantum programming language, QCL, in the absence of quantum hardware.
The reasons that led to this choice of quantum programming language are detailed in [1] and mainly concern the representation of the quantum state using complex numbers, the possibility to write complex quantum operators, the classical extension and its universality. QCL was conceived by Ömer [9]; the first version appeared in 1998 and the last one in 2004. It is open source, runs under the Linux operating system, and is a procedural high level language with a C-like syntax. qc-lib is a C++ library for the simulation of quantum computers at an abstract functional level [8], and it is used as the back end interpreter for QCL. For the execution of quantum programs written in QCL we developed a parallel version of the qc-lib simulator in which the terms representing quantum states are distributed across multiple processing nodes. We have chosen to expose the parallel implementation of qc-lib through a Globus Toolkit 4 (GT4 for short) grid service. Parallelization has been achieved using the LAM 7.1.2/7.1.4 implementation of the MPI-2 standard.

Figure 1: Distribution of the 16 basis states of a 4-qubit register using 2^3 = 8 processing nodes. The processing nodes represent the corners of a 3-dimensional hypercube.

In GQCL, the execution of the MPI implementation of qc-lib is enabled through the means of a wrapper service based on the factory/instance architecture [12]. In GQCL, the instance service is responsible for actually managing the quantum simulation as a grid job. Through the use of a WS-Resource, the instance service starts and monitors the job state, notifying the client application of any relevant changes. File staging is also automated, and when the job finishes the client is given access to the results of the simulation. The architectural details of GQCL and its advantages are presented in [1].
In the following we discuss the complete parallel implementation of the simulation library, addressing the representation of quantum states using multiple processing nodes, the application of single- and multi-qubit quantum operators, and measurements.

3.2 Parallel implementation of the qc library

A quantum register in QCL contains a number of basis vectors, each with a corresponding amplitude. When the qubits forming the quantum register are in a superposition of states, the number of vectors grows exponentially. In qc-lib, the superposition state of a 1-qubit register is represented by two basis vectors (terms) for which the corresponding complex amplitudes must be stored: 2 × complex<double> = 32 bytes. When applying a quantum operator, two term lists are created: one for storing the terms of the current state and one to accumulate the result of an operation on the state. This gives a total of 64 bytes/term, which for an n-qubit register requires 2^{n+6} bytes. Thus, for n = 25 (n = 29) qubits, 2 GB (32 GB) of memory is necessary. In order to provide an efficient parallel execution of qc-lib, we take advantage of the specific form of the quantum computation process and distribute the 2^n basis states of a quantum register to 2^p processors based on the p least significant bits. In this representation, the processing nodes and the basis vectors are actually considered the coordinates of an n-dimensional hypercube (figure 1). Each processing node applies quantum operators only to local terms and communicates the generated terms to the corresponding processing nodes if necessary. Another feature of our implementation is that only the non-zero amplitude terms of a quantum state are stored, thus diminishing the communication costs in the early stages of some operator executions and the space required to store a quantum state.

The general single qubit operator. Communication between processing nodes is only necessary when applying the operator to a qubit determining the data distribution. Applying a general single qubit operator

U = ( u11 u12
      u21 u22 )

on a single qubit with state |ψ⟩ = α|0⟩ + β|1⟩ yields

U|ψ⟩ = (αu11 + βu12)|0⟩ + (αu21 + βu22)|1⟩   (5)

If 2 processors are used, then each processor holds the amplitude of one basis state. Applying operator U locally on each processor, terms are created that are not owned by the processor, so communication of these terms is needed:

p0: α|0⟩ → u11α|0⟩ + u21α|1⟩
p1: β|1⟩ → u12β|0⟩ + u22β|1⟩

and after communication p0 holds u11α|0⟩ + u12β|0⟩ while p1 holds u21α|1⟩ + u22β|1⟩. For each term in the initial state at most two terms are created, out of which at most one needs to be communicated. When working, for example, with an n-qubit register and 2^p processors, communication is necessary only when applying a single qubit operator to any of the qubits that form the distribution key. For the rest of the qubits, all the terms needed for computing the amplitudes of the resulting state are locally owned by each processor. Moreover, in the first case, for each processor, all the remotely owned terms are owned by the same other processor, as a single bit is flipped in the distribution key. The parallel implementation of the general single qubit operator enables the parallel execution of the NOT, Hadamard, amplitude phase shift and exponentiation gates.

Controlled gates. For the CNOT (controlled-NOT) operator in the parallel implementation of qc-lib, when the control qubit is in state |0⟩ the state of the target qubit doesn't change, so no new terms are generated and the amplitudes of the existing terms are left unchanged. When the control qubit is in state |1⟩, the state of the target qubit is flipped. In this case new terms are generated that need to be communicated to another processing node if the target qubit is part of the distribution key.
For example, working with 4 processing nodes and applying CNOT to the least significant qubit of a 3-qubit register initially in the state α|100⟩ + β|011⟩, with qubit 2 as the control qubit, the following evolution is obtained:

P0: α|100⟩ → α|101⟩
P1: −
P2: −
P3: β|011⟩ → β|011⟩

and after communication: P0: −; P1: α|101⟩; P2: −; P3: β|011⟩.

In qc-lib the CNOT operator can act on two registers: the control register and the target register. These registers can represent substates of the quantum basis state (sub-registers). In this case, the CNOT gate inverts the state of the target (sub-)register if the control (sub-)register is in the state |1⟩^{⊗c}, where c is the number of qubits in the control register. For example, let a be a 1-qubit register in state |0⟩, b a 2-qubit register in state α_0|01⟩ + β_0|10⟩ and c a 2-qubit register in state α_1|10⟩ + β_1|11⟩. Applying CNOT to target register b with control register c, the state of the entire quantum memory distributed to 8 processing nodes evolves as:

P2: |10010⟩, |11010⟩ → |10010⟩, |11100⟩
P4: |10100⟩, |11100⟩ → |10100⟩, |11010⟩

and after communication: P2: |10010⟩, |11010⟩; P4: |10100⟩, |11100⟩.

As in the case of the general single-qubit operator, each processing node communicates with at most one other processing node. The index of the node involved in the communication is determined by the qubits of the target (sub-)register that are part of the distribution key of the whole quantum memory.
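The controlled-NOT rule just illustrated can be sketched as follows (our own simplified Python, with an owner check showing when a term would have to be communicated):

```python
def apply_cnot(state, control, target, p):
    """Flip `target` in every term whose `control` bit is 1; report terms
    whose owning node (low p bits) changes and would need communication."""
    result, moved = {}, []
    for idx, amp in state.items():
        new = idx ^ (1 << target) if (idx >> control) & 1 else idx
        result[new] = result.get(new, 0) + amp
        if (new ^ idx) & ((1 << p) - 1):      # flipped bit is in the distribution key
            moved.append((idx, new))
    return result, moved

# The 3-qubit example above: alpha|100> + beta|011>, control = 2, target = 0.
# The |100> -> |101> term changes its low bits from 00 to 01, i.e. P0 -> P1.
state = {0b100: 2 ** -0.5, 0b011: 2 ** -0.5}
result, moved = apply_cnot(state, control=2, target=0, p=2)
```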
The CPhase operator is another example of a 2-qubit quantum gate implemented in qc-lib; it performs a controlled phase shift of the amplitudes. Its inputs are a rotation angle θ and a control qubit that acts in the same manner as in the CNOT case. The amplitudes of the basis states where the control qubit is |0⟩ are left unchanged, and if the control qubit is in state |1⟩, the amplitudes of the basis states are multiplied by e^{iθ}. The matrix form of the CPhase operator is:

CPhase = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & e^{iθ} \end{pmatrix}.    (6)

The CPhase operator can also act on (sub-)registers of the quantum memory, and its parallel implementation is analogous to that of the CNOT operator. One important difference between the two implementations is that applying the CPhase operator requires no communication between processing nodes: the action of this operator does not generate new terms, and only the amplitudes of the local terms are modified.

Measurement of quantum states. In qc-lib, the measurement of an n-qubit quantum register is simulated in O(2^n) time. Let |ψ⟩ = \sum_{j=0}^{2^n−1} α_j|j⟩ be the state of an n-qubit quantum register. The measurement step is simulated in the following manner:

1. randomly generate a number p, 0 ≤ p < 1;
2. randomly generate a positive integer x, smaller than the number of terms with non-zero amplitude;
3. determine an integer i, 0 ≤ i < 2^n − 1, such that \sum_{j=x}^{i−1} |α_j|^2 ≤ p < \sum_{j=x}^{i} |α_j|^2.

The integer i is the representation of the measured state. After measurement, the state of the register becomes |i⟩. Because the terms of the quantum register are distributed across processing nodes, the measurement operation requires communication between these nodes, both to correctly select the term i and to collapse the register into the state |i⟩. Thus, a master process is responsible for randomly generating the numbers p and x and for computing the sum. Synchronization between processing nodes is achieved using MPI_Bcast operations, such that the master process can receive the norm of the amplitude of a term j from the owning processing node. After the number i has been selected, all processing nodes know it and can collapse the state of the quantum register to |i⟩.
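A serial sketch of this measurement procedure (our own simplified reading of steps 1-3, scanning the non-zero terms cyclically from the random offset x; the parallel version distributes this scan across nodes):

```python
import random

def measure(state, rng=random):
    """Pick a basis state with probability |amplitude|^2 from a normalized
    sparse state {basis_index: amplitude}, then collapse to that state."""
    p = rng.random()                       # step 1: 0 <= p < 1
    terms = sorted(state.items())          # non-zero amplitude terms
    x = rng.randrange(len(terms))          # step 2: random starting term
    terms = terms[x:] + terms[:x]          # scan cyclically from term x
    acc = 0.0
    for i, amp in terms:                   # step 3: cumulative |amp|^2 sums
        acc += abs(amp) ** 2
        if p < acc:
            return i, {i: 1.0}             # measured value, collapsed register
    return terms[-1][0], {terms[-1][0]: 1.0}   # guard against rounding error
```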
4 Simulation of quantum search in GQCL

Many problems in classical computing can be reformulated as the search for a unique element that satisfies a certain predefined condition [2]. If there is no additional information about the search condition, the best classical algorithm is a brute-force search, in which the elements are sequentially tested against the search condition. For a list of N elements, this algorithm executes an average of N/2 comparisons. By exploiting the advantages of quantum parallelism and the interference of quantum states, Grover formulated a quantum algorithm that finds the searched element in an unstructured database in only O(√N) steps [6]. Grover's algorithm is based on the concept of amplitude amplification. Its principle is to encode the elements of the data set as quantum states of a quantum register and to apply an operator G whose effect is to raise the probability that the system finds itself in the marked state (the state encoding the solution of the search problem). Because only unitary transformations act upon the system, probability is conserved: as the probability of finding the system in the desired state grows, the probabilities of all other (unmarked) states are correspondingly diminished. Applying Grover's operator G a certain number of times drives the probability of the marked state very close to 1. To achieve this behavior of the quantum system, a Grover iteration first inverts the phase of the amplitude of the marked state and then inverts the amplitudes of all states about the mean. The inversion of the solution state can be obtained using a so-called "black box" function (known as a quantum oracle), which only has to identify whether a certain record is a member of the solution set; the mechanism is therefore very general.

Figure 2: (a) Execution time for Grover's algorithm; (b) speed-up for Grover's algorithm.
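The amplitude-amplification loop just described can be sketched on a dense statevector (a textbook serial version for illustration, not our parallel implementation):

```python
import math

def grover_search(n, marked):
    """Grover's algorithm on n qubits: phase-flip the marked state, then
    invert all amplitudes about the mean, ~(pi/4)*sqrt(N) times."""
    N = 1 << n
    amp = [1 / math.sqrt(N)] * N            # uniform superposition
    iterations = int(round(math.pi / 4 * math.sqrt(N)))
    for _ in range(iterations):
        amp[marked] = -amp[marked]          # oracle: flip phase of the solution
        mean = sum(amp) / N
        amp = [2 * mean - a for a in amp]   # inversion about the mean
    return amp

amps = grover_search(8, marked=42)
# the probability |amp|^2 of the marked state is now close to 1
```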
After one application of the Grover operator, the amplitude of the marked state grows by O(1/√N), while the amplitudes of the unmarked states decrease correspondingly. To obtain a probability close to 1 for the solution state, the Grover iteration must be applied O(√N) times. There is a finite probability that the search does not succeed, in which case Grover's algorithm must be repeated.

The performance of our GQCL quantum simulator is evaluated with respect to the quantum search problem. In order to compare the running times for different problem sizes meaningfully, we measure the execution time of a single application of Grover's algorithm. In Figure 2 we present the results obtained for different problem sizes on various numbers of processing nodes. It can be observed from Figure 2(a) that the run time grows by an average factor of about 2.8 for each additional qubit. This is because each extra qubit doubles the problem size and increases the number of applied Grover iterations by a factor of √2. The variation of the speed-up with the number of processing nodes is presented in Figure 2(b). For only 19 qubits we already obtain a speed-up of 7.9, and the measurements reveal a growing trend of the speed-up as the problem size increases.

5 Conclusions

Classical sequential computers impose drastic limitations on the simulation of quantum computation. Quantum computer simulators have become an attractive alternative for experimenting with quantum algorithms, but their purpose cannot be achieved without significant computing resources. A promising approach is to execute these simulators in grid systems, which can provide high-performance resources. The quantum computer simulator described in this paper relies on the parallel processing implemented in qc-lib.
Besides the parallelization of the general single-qubit operator, we have also described the parallelization of the controlled gates (CNOT, CPhase) and of the measurement process. The efficient representation and partitioning of the quantum states over the distributed memory of a computer cluster allowed very good speed-ups to be recorded for the execution of Grover's search algorithm.

Bibliography

[1] S. Caraiman, A. Archip, and V. Manta. A grid enabled quantum computer simulator. In Proc. of SYNASC '09. IEEE Computer Society, 2009.
[2] S. Caraiman and V. Manta. New applications of quantum algorithms to computer graphics: the quantum random sample consensus algorithm. In Proc. of ACM CF '09, pages 81-88, New York, NY, USA, 2009. ACM.
[3] R. Feynman. Simulating physics with computers. Int. J. Theor. Phys., 21(6):467-488, 1982.
[4] I. Foster, C. Kesselman, and S. Tuecke. The anatomy of the Grid: enabling scalable virtual organizations. Int. J. High Perform. Comput. Appl., 15(3):200-222, 2001.
[5] I. Glendinning and B. Ömer. Parallelization of the QC-lib quantum computer simulator library. In Proc. of PPAM 2003, volume 3019 of LNCS, pages 461-468. Springer, 2004.
[6] L. Grover. A fast quantum mechanical algorithm for database search. In Proc. of the 28th Annual ACM STOC, pages 212-219, 1996.
[7] J. Niwa, K. Matsumoto, and H. Imai. General-purpose parallel simulator for quantum computing. In Proc. of UMC '02, pages 230-251, London, UK, 2002. Springer-Verlag.
[8] B. Ömer. Simulation of quantum computers, 1996. http://tph.tuwien.ac.at/ oemer/doc/qcsim.ps.
[9] B. Ömer. Structured quantum programming. PhD thesis, TU Vienna, 2003.
[10] K. De Raedt, K. Michielsen, H. De Raedt, B. Trieu, G. Arnold, M. Richter, T. Lippert, H. Watanabe, and N. Ito. Massively parallel quantum computer simulator. Computer Physics Communications, 176(2):121-136, 2007.
[11] P. Shor. Algorithms for quantum computation: discrete logarithms and factoring. In Proc.
of SFCS '94, pages 124-134. IEEE Computer Society, 1994.
[12] B. Sotomayor. The Globus Toolkit 4 programmer's tutorial, 2005. http://gdp.globus.org/gt4-tutorial/multiplehtml/index.html.
[13] F. Tabakin and B. Juliá-Díaz. QCMPI: a parallel environment for quantum computing. Computer Physics Communications, 180(6):948-964, 2009.

International Journal of Computers, Communications & Control Vol. II (2007), No. 1, pp. 48-55

Advances in Intelligent Tutoring Systems: Problem-Solving Modes and Model of Hints

Alla Anohina

Abstract: The paper focuses on the issues of providing adaptive support for learners in intelligent tutoring systems while they solve practical problems. The results of an analysis of learner-support policies in existing intelligent tutoring systems are given and the revealed problems are emphasized. The concept and the architectural parts of an intelligent tutoring system are defined. An approach is described that gives systems of this kind greater adaptive abilities by offering two modes of problem-solving and using a two-layer model of hints; it is currently being implemented in the intelligent tutoring system for the minimax algorithm. In accordance with the proposed approach, the learner solves problems in the mode that is most appropriate for him/her and receives the most suitable hint.

Keywords: intelligent tutoring system, problem-solving mode, hint

1 Introduction

The emergence of a knowledge society and the growing demand for a highly skilled and educated labor force call for changes in traditional teaching and learning processes. One direction of change is the integration of various kinds of computer-based learning systems as supplements to conventional teaching methods. However, a software system must have intelligent and adaptive abilities in order to take over the role of a teacher effectively.
This idea is not a new one: it has been exploited in intelligent tutoring systems for more than 30 years, since the earliest such system, SCHOLAR [1], appeared. During this time a huge number of intelligent tutoring systems have been implemented for different areas, for example, mathematics, physics, medicine, informatics and computer science [2, 3, 4, 5, 6, 7, 8, 9, 10]. Nowadays the development of systems of this kind has gained new impetus with the appearance of the agent paradigm [11]. However, the adaptive abilities of intelligent tutoring systems are still not high enough, particularly regarding the modes of solving practical problems and the support of the learner in this process.

Solving domain problems is an important part of intelligent tutoring systems, as it allows the acquired theoretical knowledge to be deepened in practice, but mere solving is unlikely to lead to improved skills or a deeper understanding of the subject matter. Learning often takes place best when the learner receives feedback from the system. Feedback is a way to improve the learning process on the basis of continuous assessment of learning results, analysis of their quality, and performance of the necessary corrections. Feedback encourages desired learning behavior and discourages undesired behavior, allows the learner to understand how successfully he/she acts and whether he/she applies relevant knowledge, and provides opportunities to correct misconceptions. In the case of intelligent tutoring systems, feedback comprises the various reactions of the system to the learner's learning behaviour. A hint, in its turn, is only one form of feedback. Unfortunately, little prior research has been devoted to the general issues of hint formation in intelligent tutoring systems. The most significant work is [12], which describes the results of studying the hints used by experienced tutors and attempts to formulate a strategy for using hints in intelligent tutoring systems.
According to [12], hints encourage the student to engage in active cognitive processes that are thought to promote deeper understanding and long-term retention. As pointed out in [13], the developed intelligent tutoring systems have relatively simple and inflexible hinting policies, which most often demand that the learner follow a prescribed problem-solving strategy; therefore, hints are always aimed at the next step that should be taken according to the strategy. The authors draw attention to two problems: the inflexible choice of the steps targeted by hints, and the progression of hints from the most general to the most specific.

The analysis of existing intelligent tutoring systems allows the following conclusions to be drawn about the reactions of a system to the actions of a learner. Typically the system gives the learner immediate feedback after each performed action or step during problem-solving, irrespective of whether the action or step was correct or incorrect. Such a policy prevents the learner from proceeding along a wrong solution path. Examples of immediate feedback are found in [5, 8, 9, 14, 15]. But perhaps a learner would like to make a series of steps, receive feedback about their correctness only afterwards, and find out for himself/herself which step has led to the incorrect solution. The system usually provides a special button or tool which the learner can use to request a hint. In AlgeBrain [5] such a tool is an animated agent. The system responds with two types of support: a generalized "here's what I'm expecting you to do at this point" help text and a hint specific to the current state of the problem. In Andes [9] there are two buttons. One of them gives "What's wrong with that?" help on an incorrect entry. The other button provides a hint about the next step in problem-solving.
Typically hints are organized in a range from the most general to the most specific. The general hint, as a rule, contains minimal information about an error. The informativeness of subsequent hints then increases. The most specific hint clearly specifies or shows what should be done. Hints are given sequentially. A number of systems use this approach, for example, [5, 8, 10, 14, 16, 17]. The organization of hints from the most general to the most specific is not flexible enough. An insufficient amount of information in a hint can cause frustration and the desire to request subsequent hints without any attempt by the learner to solve the problem. Information that spells out the necessary actions after the first request for a hint, in its turn, contradicts the learning process. Thus, mechanisms are needed that implement individual system reactions for each learner, giving an amount of information that helps while still providing a certain cognitive load. An example of adaptive hinting is described in [18]. The authors use the learner's proficiencies to select an appropriate hint. A learner with high proficiency in a particular skill receives a more subtle hint; a less proficient learner is presented with a more obvious hint. The authors point out that this is better than requiring learners to wade through several levels of hints before they receive material that is appropriate to their knowledge level.

This paper describes an approach which provides greater adaptive abilities for intelligent tutoring systems by supporting two modes of problem-solving and using a two-layer model of hints. Thus, the learner solves problems in the mode that is most appropriate for him/her and receives the most suitable hint. The aforementioned approach is currently being implemented in the intelligent tutoring system for the minimax algorithm. The paper is organized as follows.
Section 2 defines the concept and the architectural parts of an intelligent tutoring system. The developed approach, based on two modes of problem-solving and a two-layer model of hints, is discussed in Section 3. Section 4 describes the intelligent tutoring system for the minimax algorithm, in which the proposed approach is being implemented. Finally, conclusions are presented and some directions for future work are outlined.

2 Intelligent tutoring systems

Despite the broad variety of developed systems, an unequivocal and exhaustive definition of an intelligent tutoring system still does not exist. However, it is possible to list the most often mentioned characteristics of systems of this kind [10, 19, 20, 21, 22, 23, 24]. An intelligent tutoring system is a computer-based system. It is an intelligent system because it uses principles and methods of artificial intelligence [25], such as knowledge representation, inference mechanisms and machine learning, in its structure and operation. An intelligent tutoring system is an adaptive system, as it alters aspects of its structure, functionality or interface for the particular user and his/her changing needs over time [26]. It emulates a human teacher, tries to provide the benefits of individual (one-on-one) tutoring, and is based on the theory of learning and cognition. Furthermore, intelligent tutoring systems are characterized by the fact that they store three basic kinds of knowledge [20, 27]: domain knowledge, knowledge about learners, and pedagogical knowledge. These knowledge types determine the three main parts of the system's architecture: the domain knowledge, the student diagnosis module, and the pedagogical module. An intelligent tutoring system, like any other software that communicates intensively with users, also needs a part of the architecture responsible for the interaction between the system and the learner.
This is the communication module, or interface, which controls screen layouts, interaction tools, etc. In addition, each system can contain further components whose presence depends on the following factors: features of the problem domain, the separation of particular functions of the basic constituent parts into isolated components of the structure, the technology used for system implementation, and additional functional capabilities of the system. The general architecture of an intelligent tutoring system is shown in Figure 1.

Figure 1: The general architecture of an intelligent tutoring system (adopted from [28])

The domain knowledge is the knowledge the system is teaching. Most often it is incorporated in the expert model, which represents the skills and expertise that an expert in a particular domain holds. The model serves as a standard for evaluating the learner's performance and knowledge level. The domain knowledge can include fragments of theoretical materials, texts of practical tasks and attributes related to them, explanatory units, rules and principles used in the domain, etc. Typically it is represented within the system using logical, procedural, network or structured knowledge representation schemes [29]. Moreover, the domain knowledge is organized in a certain way, commonly as a hierarchy; for example, a topic includes some units, which consist of several chapters. The expert module generates solutions of problems for further comparison with the solutions of the learner. The student diagnosis module carries out the student diagnosis process, which collects information about the learner, analyzes it and stores it in the student model. The student model is formed for a particular learner and serves as an input to the pedagogical module, which tailors the learning process to the needs of the learner.
The model contains the learner's identifying information, information on the learner's current knowledge level, information about the learner's cognitive, emotional and psychological features, his/her past experience and interests, and the learner's usage of the system's options. The pedagogical module provides a knowledge infrastructure for adapting the learning process to the characteristics and needs of a learner without the intervention of a human teacher. It implements the learning process on the basis of the teaching strategies and instructions held in the pedagogical model. The primary tasks of this module are the selection and sequencing of the learning material most suitable for the learner, determining the type and content of feedback and help, and answering questions from the learner.

3 The proposed approach

3.1 The problem-solving modes

Generally, there are two possibilities regarding the moments at which feedback is delivered: immediate feedback after each step or action in problem-solving, and feedback after the submission of a whole solution to the problem. This is the basis for the two modes of problem-solving in the proposed approach. In the completeness mode a learner chooses the moments of feedback presentation in order to check the correctness of a series of steps. So, he/she can perform one or more steps of solving a problem and then request checking of the performed steps. The system provides feedback about the correctness of his/her previous actions, and the learner must determine for himself/herself which step has led to the incorrect solution. This mode is similar to reinforcement learning [30], which is widely used in artificial intelligence. In the step-by-step mode the system monitors each problem-solving step and gives feedback about its correctness.
There are four variations of the step-by-step mode regarding the kind of information given to the learner:

• The learner receives both positive and negative feedback. When the learner has performed a correct action, he/she is praised (receives positive feedback, or a reward). If the step was incorrect, criticism (negative feedback) is given to the learner. Moreover, negative feedback can be given in two different forms: only as a text informing the learner that the action was incorrect, or as a text about the incorrect step together with a hint about how to improve his/her operation.

• The learner receives only negative feedback. In this case negative feedback can also be given in the two different ways described above.

In the completeness mode the learner is not praised or criticized for each performed step. Instead, he/she receives a total estimation of all the performed actions. The estimation specifies how far the learner is from his/her goal: the correct solution of the problem. The total estimation can have a positive or negative deviation, depending on the difference between the number of correctly and incorrectly performed steps. When this difference exceeds some admissible value, a hint can be given to the learner. Thus, there are two variations of the completeness mode regarding the kind of feedback:

• The learner receives only a total estimation of the performed steps.

• The learner receives a total estimation of the performed steps together with a hint about how to improve his/her operation.

It is obvious that the learner must be given the opportunity to change the problem-solving mode and the kind of feedback, as well as to request a hint in the case when he/she receives only the text of feedback. The general scheme of the problem-solving modes and kinds of feedback is displayed in Figure 2.
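As an illustration of the completeness-mode estimation and hint trigger described above (names and the threshold here are hypothetical, not the system's actual code):

```python
def completeness_feedback(steps_correct, admissible=2):
    """Completeness-mode feedback: given one boolean per performed step,
    return the total estimation (correct minus incorrect steps) and
    whether the negative deviation exceeds the admissible value,
    triggering a hint."""
    correct = sum(steps_correct)
    incorrect = len(steps_correct) - correct
    estimation = correct - incorrect          # positive or negative deviation
    give_hint = (incorrect - correct) > admissible
    return estimation, give_hint
```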
It is necessary to stress that the described problem-solving modes can be implemented only if the process of finding a problem solution consists of several (homogeneous or heterogeneous) steps.

Figure 2: The problem-solving modes and kinds of feedback for an intelligent tutoring system

3.2 The model of hints

The model of hints in the proposed approach defines two layers (Figure 3): a layer of general hint categories and a layer of hints within the general categories. There are three general hint categories: general hints, hints of average informativeness, and specific hints. Each category contains one or more hints, which are also ranked from less informative to more informative. The model allows the learner to receive the hint that is most suitable for him/her. Before the learner starts to work in the problem-solving mode, a test should be taken in order to determine the general hint category suitable for the learner. Later, when requesting help during problem-solving, the learner will receive the middle hint (by rank) from the hint category suitable for him/her. If, after receiving a hint, the learner is not capable of executing a correct action, he/she is presented with the subsequent hint. The process continues until he/she reaches the last hint for the given error. Such an approach spares the learner from being presented with uninformative hints. On the contrary, the learner receives in good time a hint providing both help and a certain cognitive load, thereby reducing the chance of frustration, floundering and loss of interest in learning.

Figure 3: The two-layer model of hints (adopted from [28])
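A sketch of the two-layer model (the category names follow the text; the hint texts and function names are purely illustrative, invented for this example):

```python
# Layer 1: three general hint categories; layer 2: hints within a category,
# ranked from less informative to more informative.
HINTS = {
    "general":  ["Check the arcs of the tree.",
                 "Some arcs are not allowed in this kind of game tree."],
    "average":  ["Arcs may only connect adjacent levels.",
                 "Look for arcs between nodes at the same level."],
    "specific": ["The arc you kept joins two nodes on the same level; remove it."],
}

def next_hint(category, attempt):
    """Hint for the given failed attempt (0-based): start at the middle
    hint of the learner's category and never go past the last hint."""
    hints = HINTS[category]
    start = len(hints) // 2
    return hints[min(start + attempt, len(hints) - 1)]
```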
4 The intelligent tutoring system for the minimax algorithm

The described approach is being implemented in the intelligent tutoring system for a topic of the learning course "Fundamentals of Artificial Intelligence" at the Faculty of Computer Science and Information Technology of Riga Technical University. The topic is related to the algorithm for implementing two-person games with full information, i.e., the minimax algorithm [29], an example of which is given in detail in [31]. All practical problems aimed at developing skills in applying the minimax algorithm consist of a sequence of homogeneous steps. Let us consider one of them. A game tree is presented to the learner. It contains arcs that are inadmissible in a game with full information: arcs between nodes at the same level or across levels. As a rule, there are from 3 to 5 wrong arcs in the tree. The learner should find them and remove them from the tree. Thus, the removal of one (wrong or correct) arc is a step in this task. It is obvious that tasks for the minimax algorithm are a fine example of an opportunity to provide two problem-solving modes in the system.

At present the following tasks regarding the developed approach are completed: possible mistakes and the hints corresponding to them have been defined for each problem, the sets of hints for each mistake have been divided into categories according to the two-layer model of hints described in the previous section, and the user interface of both problem-solving modes has been developed. The architecture of the system corresponds to the general architecture of an intelligent tutoring system displayed in Figure 1. The system carries out an assessment of the learner's initial knowledge level on the topic, provides the theoretical knowledge acquisition mode and the practical problem-solving mode, with preliminary determination of the problem-solving mode and the category of hints most suitable for the learner, and gives a final assessment of the achieved knowledge level.
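For reference, the minimax algorithm that the described tasks exercise can be sketched as a minimal recursive version on a game tree given as nested lists (leaves are static evaluations; this is a textbook sketch, not the system's code):

```python
def minimax(node, maximizing=True):
    """Value of a game tree node: leaves return their evaluation; inner
    nodes alternate between maximizing and minimizing players."""
    if not isinstance(node, list):          # leaf: static evaluation
        return node
    values = [minimax(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)

tree = [[3, 12], [2, 4], [14, 1]]           # MAX root over three MIN nodes
# MIN values of the subtrees are 3, 2, 1, so MAX chooses 3
```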
5 Conclusions and future work

The adaptive abilities of intelligent tutoring systems are not high enough, especially regarding the problem-solving modes offered to a learner and the ordering of hints from the most general to the most specific. The paper has presented an approach which allows the learner to work in the problem-solving mode that is most appropriate for him/her and to receive the most suitable hint. The proposed two-layer model of hints can reduce the frustration, floundering and loss of interest in learning that are inevitable when the learner receives too little support during problem-solving. At present, the proposed approach is at a development stage. Firstly, it is necessary to determine how the test for the problem-solving mode and the category of hints most suitable for the learner may be implemented. Secondly, both the psychological and the pedagogical foundations of the proposed approach should be specified. After the implementation of the approach in the intelligent tutoring system for the minimax algorithm, the system will be tested in the corresponding learning course, together with a questioning of the learners and the subsequent interpretation of the received results.

References

[1] J. R. Carbonell, AI in CAI: an artificial intelligence approach to computer-assisted instruction, IEEE Transactions on Man-Machine Systems, Vol. 11, No. 4, pp. 190-202, 1970.
[2] J. S. Brown and R. R. Burton, A paradigmatic example of an artificially intelligent instructional system, International Journal of Man-Machine Studies, Vol. 10, pp. 323-339, 1978.
[3] J. S. Brown, R. R. Burton and J. de Kleer, Pedagogical, natural language, and knowledge engineering techniques in SOPHIE I, II and III, in D. H. Sleeman and J. S. Brown (eds): Intelligent Tutoring Systems, Academic Press, London, 1982.
[4] J. R. Anderson and B. J. Reiser, The LISP tutor, Byte Magazine, Vol. 10, pp. 159-175, 1985.
[5] S. R. Alpert, M. K. Singley and P. G. Fairweather, Deploying intelligent tutors on the web: an architecture and an example, International Journal of Artificial Intelligence in Education, Vol. 10, No. 2, pp. 183-197, 1999.
[6] V. Devedzic, J. Debenham and D.
Popovic, Teaching formal languages by an intelligent tutoring system, Educational Technology & Society, Vol. 3, No. 2, pp. 36-49, 2000.
[7] M. Hospers, E. Kroezen, A. Nijholt, R. den Akker and D. Heylen, An agent-based intelligent tutoring system for nurse education, in J. Nealon and A. Moreno (eds): Applications of Intelligent Agents in Health Care, Birkhauser Publishing Ltd, Basel, Switzerland, 2003.
[8] N. Matsuda and K. VanLehn, Advanced Geometry Tutor: an intelligent tutor that teaches proof-writing with construction, Proceedings of the 12th International Conference on Artificial Intelligence in Education, Amsterdam, pp. 443-450, 2005.
[9] K. VanLehn, C. Lynch, K. Schulze, J. A. Shapiro, R. Shelby, L. Taylor, D. Treacy, A. Weinstein and M. Wintersgill, The Andes physics tutoring system: lessons learned, International Journal of Artificial Intelligence in Education, Vol. 15, No. 3, pp. 147-204, 2005.
[10] R. S. Crowley and O. Medvedeva, An intelligent tutoring system for visual classification problem solving, Artificial Intelligence in Medicine, Vol. 36, No. 1, pp. 85-117, 2006.
[11] A. Anohina, Agents in intelligent tutoring systems: state of the art, Scientific Proceedings of Riga Technical University, Computer Science, 5th series, Vol. 22, pp. 110-121, 2005.
[12] G. Hume, J. A. Michael, A. A. Rovick and M. Evens, Hinting as a tactic in one-on-one tutoring, The Journal of the Learning Sciences, Vol. 5, No. 1, pp. 23-47, 1996.
[13] N. Matsuda and K. VanLehn, Modeling hinting strategies for geometry theorem proving, Proceedings of the 9th International Conference on User Modeling, Johnstown, PA, USA, pp. 373-377, 2003.
[14] Kinshuk, T. Lin, A. Yang and A. Patel, Plug-able intelligent tutoring and authoring: an integrated approach to problem-based learning, International Journal of Continuing Engineering Education and Life-Long Learning, Vol. 13, No. 1/2, pp. 95-105, 2002.
[15] M. A. Nunes, L. L. Dihl, L. M. Fraga, C. R. Woszezenki, L. Oliveira, D. J. Francisco, G.
machado, c. nogueira and m. notargiacomo, ivte pedagogical game for distance learning, proceedings of aset conference, melbourne, 2002. [16] p. suraweera, an animated pedagogical agent for sql-tutor, honours project hons 08/99, 1999. [17] m. kalayar, h. ikematsu, t. hirashima and a. takeuchi, intelligent tutoring system for search algorithm, proceedings of icce, seoul, korea, pp. 1369-1376, 2001. [18] m. stern, j. beck and b. p. woolf, adaptation of problem presentation and feedback in an intelligent mathematics tutor, in c. frasson, g. gauthier and a. lesgold (eds): intelligent tutoring systems, springer-verlag, new york, 1996. [19] b. a. cheikes, gia: an agent-based architecture for intelligent tutoring systems, proceedings of the cikm’95 workshop on intelligent information agents, baltimore, maryland, usa, 1995. [20] n. capuano, m. de santo, m. marsella, m. molinara and s. salerno, a multi-agent architecture for intelligent tutoring, proceedings of the international conference on advances in infrastructure for electronic business, science, and education on the internet ssgrr 2000, l’aquila, 2000. [21] a. m. bell and s. ramachandran, an intelligent tutoring system for remote sensing and image interpretation, proceedings of the interservice/industry training, simulation, and education conference (i/itsec), orlando, florida, usa, 2003. [22] c. j. butz, s. hua and r. b. maguire, a web-based intelligent tutoring system for computer programming, proceedings of the ieee/wic/acm international conference on web intelligence (wi’04), beijing, china, pp. 159-165, 2004. advances in intelligent tutoring systems: problem-solving modes and model of hints 55 [23] e. remolina, s. ramachandran, d. fu, r. stottler and w. r. howse, intelligent simulation-based tutor for flight training, proceedings of the interservice/industry training, simulation, and education conference (i/itsec), orlando, florida, usa, 2004. [24] j. m. gascueña and a. 
fernández-caballero, an agent-based intelligent tutoring system for enhancing e-learning/ e-teaching, international journal of instructional technology and distance learning, vol. 2, no. 11, pp. 11-24, 2005. [25] p. brusilovsky and c. peylo, adaptive and intelligent web-based educational systems, international journal of artificial intelligence in education, vol. 13, pp. 156-169, 2003. [26] d. r. benyon and d. m. murray, adaptive systems: from intelligent tutoring to autonomous agents, knowledge-based systems, vol. 6, no. 4, pp. 197-219, 1993. [27] c. frasson, t. mengelle and e. aimeur, using pedagogical agents in a multi-strategic intelligent tutoring system, proceedings of the 8th world conference on artificial intelligence in education ai-ed97, workshop v: pedagogical agents, kobe, japan, pp. 40-47, 1997. [28] a. anohina, the problem-solving modes and a two-layer model of hints in the intelligent tutoring system for minimax algorithm, proceedings of the 1st international conference on virtual learning, bucharest, romania, pp. 105-112, 2006. [29] g. f. luger, artificial intelligence: structures and strategies for complex problem solving, addison wesley, 2001. [30] s. russell and p. norvig, artificial intelligence: a modern approach, prentice hall, 2003. [31] a. anohina, intelligent tutoring system for minimax algorithm, scientific proceedings of riga technical university, computer science, 5th series, vol. 22, pp. 122-130, 2005. alla anohina riga technical university department of systems theory and design kalku street 1, riga, latvia, lv-1658 e-mail: alla.anohina@cs.rtu.lv received: november 8, 2006 editor’s note about the author: alla anohina is an assistant at the department of system theory and design of riga technical university in latvia. she got m.sc.ing. in 2002 from riga technical university and received werner von siemens excellence award, award of latvian fund of education and award and memorial medal of latvian academy of sciences, lattelecom ltd. 
and the latvian fund of education for the best master's thesis in the year 2002. her main research fields are intelligent tutoring systems, computer-assisted assessment systems and artificial intelligence. she is currently finishing her ph.d. thesis, whose main topic is the development of an intelligent supporting system for adaptive learning and knowledge assessment. she has five years' experience of teaching computer science, both at riga technical university and at other educational institutions of latvia, and has participated in several research projects related to the development of knowledge assessment software. international journal of computers, communications & control vol. ii (2007), no. 2, pp. 121-131 modelling of wastewater treatment plant for monitoring and control purposes by state-space wavelet networks adam borowa, mietek a. brdys, krzysztof mazur abstract: most industrial processes are nonlinear, non-stationary and dynamical, with at least a few different time scales in their internal dynamics and with states that are hard to measure. a biological wastewater treatment plant falls into this category. the paper considers modelling such processes for monitoring and control purposes by using state-space wavelet neural networks (sswn). the modelling method is illustrated on the bioreactors of a wastewater treatment plant. the learning algorithms and basis functions (multidimensional wavelets) are also proposed. simulation results based on a real data record are presented. keywords: neural network models, model approximation, learning algorithms, waste treatment. 1 introduction biological wastewater treatment plants (wwtp) are very important due to their ability to neutralise the results of human activity. figure 1: activated sludge reactor (bioreactor) with secondary clarifier. a typical wwtp consists of three phases of treatment: mechanical, biological and chemical.
an example of the biological part of a wwtp (bioreactor with secondary clarifier) is shown in figure 1. before biological treatment the wastewater passes through mechanical treatment, where coarse particles, inorganic solids and suspended particulate matter are removed. chemical treatment may be implemented before, after or within biological treatment. activated sludge is responsible for nitrogen and phosphorus removal in the bioreactor. biological treatment, owing to nitrogen and phosphorus removal, consists of three phases: anaerobic, anoxic and aerobic. figure 1 also shows the recirculations (from the secondary clarifier to the anoxic zone and from the anoxic to the anaerobic zone). control of these recirculations, of the air flow rate to the aerobic zone and of the excess sludge flow rate is very important for the process quality [8]. the wastewater treatment process is very complex due to its specific features, such as highly non-linear and multiple time scale dynamics, varying influent flow, and a high-dimensional state vector with many states not accessible by hard sensors; see [4] for details. due to nationwide regulations, which impose high standards on treated wastewater quality, a need for better treatment still exists. there are two solutions to this problem: the first is to enlarge the wwtp, and the second is to implement an intelligent control system. it is obvious that the latter is better in an economic sense. the main purpose of control of a wwtp is to keep all quality parameters of the treated waste below the specified norms. because of the complexity of the process, advanced control technologies are required. a multilevel-multilayer hierarchical control structure was recently derived [1] and further developed in [4]. copyright © 2006-2007 by ccc publications figure 2: multilevel-multilayer hierarchical control structure. this structure (figure 2) consists of 3 control levels: the supervisory control level, the optimising control level and the follow-up control level, each with different control objectives. the optimising control level (ocl) uses a robust model predictive control (rmpc) algorithm and is decomposed into three layers, each with a different control horizon and time scale. this work focuses on modelling the wwtp for rmpc and supervisory control. a good candidate for modelling a dynamical, non-linear system with multiple time scales is a state-space wavelet network (sswn). using wavelets as the basis functions solves the multiple time scale problem. the rest of the article is organised as follows. section 2 states the problem. wavelet networks are presented in section 3. a learning algorithm is proposed in section 4. the stability of the sswn is discussed in section 5. the application and results are shown in sections 6 and 7. 2 problem statement the aim of the paper is to verify the possibility of applying sswn modelling of a wwtp for monitoring and control purposes. 2.1 monitoring of wwtp for supervisory control purposes monitoring the wwtp state is essential for supervisory control purposes [4]. most of the wwtp states are not measurable. nevertheless, an sswn for state monitoring can be built. first, a model of the wwtp in simba has to be calibrated; methods for simba model calibration are presented in [2]. once a calibrated simba model is available, it can be used as a data generator for learning the sswn. 2.2 progress of the hierarchical intelligent control structure the layer decomposition of the ocl incurs a loss of optimality and certain problems with accommodating the constraints into the layer optimisation tasks.
as the new sswn model has the ability to combine the two time scales (fast and medium), the fast and medium layers can be combined into one layer, hence eliminating the above problems. 3 wavelet network a wavelet network is a neural network with one hidden layer whose basis functions are wavelets. wavelets are specific mathematical functions and are described below. neurons made up of wavelets are called wavelons. 3.1 wavelets any function satisfying the conditions (1) and (2), where Ψ(ω) is the fourier transform of ψ(t), is called a mother wavelet. it is required that cψ = ∫0∞ (|Ψ(ω)|^2 / ω) dω < ∞ (1) and ∫ ψ(t) dt = 0 (2). examples of wavelets are the haar wavelet and the morlet wavelet, shown in figure 3 and defined by equations (3) and (4) respectively: ψ(x) = 1 for 0 ≤ x < 0.5, ψ(x) = −1 for 0.5 ≤ x < 1, ψ(x) = 0 elsewhere (3); ψ(x) = exp(−x^2/2) ∗ cos(5x) (4). figure 3: haar and morlet mother wavelets. a family of wavelets is made by dilating and translating the mother wavelet; see equation (5), where d and t represent the dilation and translation parameters respectively: ψdt(x) = √d ψ(d(x − t)) (5). the translation and dilation parameters determine the position (time) and scale (frequency) domains. the wavelets represented by equation (5) may be designed to be orthonormal. 3.2 multidimensional wavelets the multidimensional wavelet (6) was first presented by zhang and benveniste in [11]; in equation (6), ψs(x) is a one-dimensional wavelet, x is a vector and ψ(x) is a scalar: ψ(x) = ψ(x1, ..., xn) = ∏j=1..n ψs(xj) (6). a few months later zhang [12] presented a new multidimensional wavelet, the radial wavelet, given by (7) and (8), where ψ(x) is a one-dimensional wavelet: ψ(x) = ψs(||x||) (7), ||x|| = (x^T x)^(1/2) (8). it is logical that for multivariable function approximation the multidimensional wavelets are desirable.
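as an illustration, the mother wavelets (3)-(4) and the dilated/translated family member of equation (5) can be written down directly; the sketch below is ours (the paper gives no code), including the numerical check of the zero-mean condition (2):

```python
import math

def haar(x):
    """haar mother wavelet, equation (3)."""
    if 0.0 <= x < 0.5:
        return 1.0
    if 0.5 <= x < 1.0:
        return -1.0
    return 0.0

def morlet(x):
    """morlet mother wavelet, equation (4): exp(-x^2/2) * cos(5x)."""
    return math.exp(-x * x / 2.0) * math.cos(5.0 * x)

def dilate_translate(psi, d, t):
    """family member of equation (5): psi_dt(x) = sqrt(d) * psi(d * (x - t))."""
    return lambda x: math.sqrt(d) * psi(d * (x - t))

def integral(f, a, b, n=200000):
    """midpoint-rule quadrature, used only to check condition (2) numerically."""
    h = (b - a) / n
    return sum(f(a + (i + 0.5) * h) for i in range(n)) * h
```

with this, integral(haar, -1, 2) and integral(morlet, -10, 10) are both close to zero; note that the morlet wavelet as defined in (4) satisfies condition (2) only approximately.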
the multidimensional wavelet (6) was tested and some problems were encountered. first, the optimisation problem to be solved by the learning algorithm was often ill-conditioned. second, a large n relatively easily led to overparameterization. only when the radial wavelet (equations (7) and (8)) was used did the network learning finish successfully. the radial multidimensional wavelet used in the paper is given by equations (9)-(13) and shown in figure 4. figure 4: multidimensional radial wavelon. z = [x, u] (9), dj = [d1,j, ..., dk+m+n,j] (10), tj = [t1,j, ..., tk+m+n,j] (11), a = diag(dj) ∗ (z − tj)^T (12), aj = r(a(z, d, t)) = (a^T a)^(1/2) (13). the one-dimensional wavelet used in this application was the morlet wavelet (4). the proposed structure for a wavelon was described in [12]. 3.3 feed-forward and input-output wavelet networks. it was shown in [5] that a feed-forward wavelet network (ffwn) of the form (14) is a universal approximator for any function in a sobolev space. wavelet networks have more freedom than other neural networks because of the number of optimised parameters for each wavelon. z = Σi=1..n wi ψ(di(x − ti)) (14). the wavelet network for a one-dimensional input is described by equation (14), where di and ti are the dilation and translation parameters respectively, wi are linear weights and n is the number of wavelons. a feed-forward wavelet network (figure 5) was first presented by zhang and benveniste [11]. a dynamical wavelet network (dwn) was presented in [7]; the dwn structure is shown in figure 6. it is the commonly used input-output structure, but it has some drawbacks that do not permit this dwn to be used for wwtp modelling. since 1992 many wavelet networks have been presented and used in various areas of human activity. figure 5: feed-forward wavelet network. figure 6: dynamical wavelet network.
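equations (9)-(13) amount to shifting the joint state-input vector, scaling it componentwise, and feeding the euclidean norm of the result into a one-dimensional wavelet. a minimal sketch in python (the function names are our own):

```python
import math

def morlet(r):
    """one-dimensional morlet wavelet, equation (4)."""
    return math.exp(-r * r / 2.0) * math.cos(5.0 * r)

def radial_wavelon(x, u, d, t, psi=morlet):
    """radial wavelon of equations (9)-(13): z = [x, u]; the shifted input
    z - t is scaled componentwise by d and the one-dimensional wavelet is
    applied to the euclidean norm (a^T a)^(1/2) of the scaled vector."""
    z = list(x) + list(u)                                # equation (9)
    a = [dj * (zj - tj) for dj, zj, tj in zip(d, z, t)]  # equation (12)
    r = math.sqrt(sum(ai * ai for ai in a))              # equation (13)
    return psi(r)
```

at z = t the norm is zero, so the wavelon outputs psi(0), which equals 1 for the morlet wavelet.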
3.4 state space wavelet network the state space wavelet network (sswn) is not as well known as the ffwn or dwn, but it has specific advantages over the aforementioned networks. due to its internal state-space component, the sswn better captures the modelled plant structure; therefore the modelling error is smaller and the learning process is faster. nevertheless, there is no proof that such a network is a universal approximator. the sswn with multidimensional wavelets is shown in figure 7 and described by equations (15) and (16), where n is the number of outputs, m + n is the number of state variables, k is the number of inputs (control and disturbance), and l is the number of wavelons. the number of parameters to be estimated during the learning process is then 2∗l∗(n + m + k) + l∗(n + m). it combines the state-space architecture of a dynamic neural network [10] with the multidimensional wavelons as the processing nodes. figure 7: structure of state space wavelet network with multidimensional wavelons. xi(k + 1) = Σj=1..l wi,j ∗ ψj(x(k), u(k)), for i = 1, ..., n + m (15); yi(k) = Σj=1..l wi,j ∗ ψj(x(k), u(k)), for i = 1, ..., n (16). 4 learning algorithm while searching for the right structure of the wavelet network, the selection of the learning algorithm was also investigated. the search over a large set of combinatorial optimization algorithms included: the simple genetic algorithm (sga), sga with elitism, the evolutionary algorithm (ea), ea with sbx crossover, simulated annealing (sa) and a parallel hybrid of sa and ea using computer grid technology. the sa algorithm turned out to be the best for our problems. 4.1 simulated annealing whilst the inspiration for ga and ea came from biology (genetics), sa was inspired by thermodynamics (statistical mechanics). the algorithm was motivated by the mechanism of growing a single crystal from a melt [6]: it was found that slow cooling (annealing) of melted metal leads to a low-energy state, while fast cooling does not.
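the state and output updates (15)-(16) are plain weighted sums of the shared wavelon activations, so one simulation step of the sswn can be sketched as follows (an illustrative sketch with our own names; each wavelon is any callable taking the current state and input vectors):

```python
def sswn_step(x, u, w_state, w_out, wavelons):
    """one sswn step, equations (15)-(16): every state component x_i(k+1)
    and every output y_i(k) is a weighted sum of the same wavelon
    activations psi_j(x(k), u(k)).  w_state is an (n+m) x l weight
    matrix, w_out an n x l weight matrix, wavelons a list of l callables."""
    act = [psi(x, u) for psi in wavelons]   # psi_j(x(k), u(k)), shared
    x_next = [sum(w * a for w, a in zip(row, act)) for row in w_state]  # (15)
    y = [sum(w * a for w, a in zip(row, act)) for row in w_out]         # (16)
    return x_next, y
```

iterating sswn_step with the returned x_next realises the recurrent (state-space) behaviour that distinguishes the sswn from a feed-forward wavelet network.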
a simple algorithm based on monte carlo search was proposed in (metropolis 1953), which then became an important part of the sa algorithm [6]. this algorithm works on a chain of atoms (s). in each step one atom from the chain is disturbed and the new chain energy e(s′) is calculated. the difference between the energies of the chains s and s′ is calculated by equation (17): ∆e = e(s′) − e(s) (17). if ∆e < 0 then the new chain s′ is accepted; otherwise the new chain is conditionally accepted with a probability given by the boltzmann probability factor (18), where kb is boltzmann's constant and t is a temperature factor: p(∆e) = exp(−∆e/(kb t)) (18). since t is not a real temperature and kb is a constant, the product kb ∗ t may be replaced in practical implementations by a single factor t. the annealing schedule was added in [6] in order to formalise the sa algorithm. the sa algorithm works iteratively as follows. for a given temperature t0 the metropolis monte carlo (mmc) method is applied. when the chain of atoms is considered stable, a new t is computed and mmc is applied again; this procedure continues until the temperature reaches 0 or the energy reaches its optimum. the new t is computed according to the annealing schedule given by equation (19), where t0 is the initial temperature and k is the sa iteration counter: t(k) = t0 / (1 + ln(k)) (19). for practical purposes, the chain of atoms is considered stable when the mmc iteration counter reaches an established number. 4.2 a simple method for sswn learning with simulated annealing algorithm in the paper the learning procedure was implemented as follows: – initialize a chain of atoms, – set the initial temperature, – establish the number of mmc iterations needed to obtain a stable chain, – run the iterative sa algorithm. the initialization of the chain of atoms is made using a random number generator with upper and lower constraints for the weights and wavelet coefficients. the initial temperature is chosen by the user (between 0 and 1).
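the acceptance rule (17)-(18) and the cooling schedule (19) translate directly into code; a small sketch (names are ours, with kb ∗ t already replaced by the single practical factor t as discussed above):

```python
import math
import random

def accept(delta_e, t, rng=random.random):
    """metropolis acceptance, equations (17)-(18): always accept an energy
    decrease; accept an increase with probability exp(-delta_e / t)."""
    if delta_e < 0:
        return True
    return rng() < math.exp(-delta_e / t)

def temperature(k, t0):
    """annealing schedule, equation (19): t(k) = t0 / (1 + ln(k)), k >= 1."""
    return t0 / (1.0 + math.log(k))
```

the rng argument is injectable only to make the acceptance rule testable; in an actual run the default uniform generator is used.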
the number of mmc iterations is proportional to the length of the chain of atoms. only one chain of atoms needs to be used in the optimisation, because the convergence of sa is independent of the initialization due to the exploratory nature of the method. before sa is applied, the energy function must be defined. the energy function is given by equation (20), where y is the plant output, ŷ is the sswn output, n is the number of sswn outputs and J is the number of samples taken for network learning: e = (1/(n∗J)) Σi=1..n Σj=1..J |yi,j − ŷi,j| (20). it is important to normalize the data before learning the network. 5 stability of sswn it was proved that under certain conditions state space neural networks (ssnn) can be made stable [9]; the sufficient conditions for stability suitably constrain the network weights. it is not the subject of this work to prove the sswn stability. however, we demonstrate by simulation that stability holds if the network parameters are constrained. the discharge of a randomly chosen initial state of the network is illustrated in figure 8. after the discharge has finished, the network output accurately follows the plant output. figure 8: discharge of randomly chosen initial state of sswn. 6 application after the sswn structure and learning algorithm have been chosen, the inputs, outputs, size of the state vector and number of wavelons must be fixed. figure 9: learning results for different m and l. the presented sswn was applied to wastewater treatment plant modelling.
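the energy function (20) is simply the mean absolute error over all outputs and all learning samples; a sketch under our own naming:

```python
def energy(y_plant, y_net):
    """energy function of equation (20): mean absolute deviation between
    the plant outputs y and the network outputs y-hat, averaged over the
    n outputs and the J learning samples.  both arguments are n x J
    lists of lists."""
    n = len(y_plant)
    J = len(y_plant[0])
    return sum(abs(y_plant[i][j] - y_net[i][j])
               for i in range(n) for j in range(J)) / (n * J)
```

this value plays the role of e(s) in the metropolis step: disturbing one "atom" (one weight or wavelet coefficient) changes the network outputs and hence the energy of the chain.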
the modelled plant has 4 control inputs (the flow rate to the aerobic zone, 2 recirculation flow rates and the excess sludge flow rate), 4 measured or estimated disturbance inputs, including the inflow and its quality (cod, bod, tss), and 4 outputs: the outflow (q), the concentrations of nitrate and nitrite nitrogen (sno) and of nh4+ + nh3 nitrogen (snh) in the effluent, and the concentration of oxygen in the aerobic zone (so). still, the number of state-space variables m and the number of wavelons l are unknown. fortunately, we know the size of the state of the modelled plant, which is 14 for each zone of biological treatment. therefore m was searched over the set of values 14, 21 and 28; a larger m implied a huge set of optimized parameters. in order to reduce the computational burden, m wavelons were applied. the network was parameterized by 980, 1911 and 3136 parameters, respectively. results of the learning (limited to a single output) for these three parameterization examples are shown in figure 9. finally m and l were both selected as 28, giving a small modelling error and an acceptable learning time. 7 results the results of long-term learning are shown in figure 10. the learning time was around 36 hours with the simulated annealing algorithm. the bold line shows the modelled plant output, while the dashed line illustrates the network output. the mean modelling error was about 1.84. figure 10: comparison of model and plant outputs. 8 conclusions a structure of a dynamical wavelet network called the state space wavelet network (sswn) has been proposed. the multidimensional radial wavelon has been proposed as the network processing node. simulated annealing for sswn learning has been derived and validated by application to modelling a wastewater treatment plant. it has been demonstrated that such a network is able to approximate this dynamic, nonlinear, non-stationary process with several different time scales. references [1] m. a. brdys, m. grochowski, k.
duzinkiewicz, w. chotkowski, y. liu, "design of control structure for integrated wastewater treatment plant – sewer systems", international conference on technology, automation and control of wastewater and drinking water systems tiaswik'02, gdansk – sobieszewo, poland, june 19-21, 2002. [2] w. chotkowski, j. mąkinia, m. a. brdys, k. duzinkiewicz, k. konarczak, "mathematical modelling of the processes in integrated municipal wastewater systems", proc. of the 9th ifac/ifors/imacs/ifip symposium on large scale systems: theory and applications, bucharest, july 18-20, 2001. [3] i. daubechies, ten lectures on wavelets, cbms-nsf regional series in applied mathematics, siam, philadelphia, 1992. [4] m. grochowski, m. a. brdys, t. gmiński, "intelligent control structure for control of integrated wastewater systems", ifac 10th symposium on large scale systems: theory and applications, osaka, japan, july 26-28, 2004. [5] a. juditsky, q. zhang, b. delyon, p-y. glorennec, a. benveniste, "wavelets in identification", rapport de recherche no. 2315, 1994. [6] s. kirkpatrick, c. d. gelatt, m. p. vecchi, "optimization by simulated annealing", science, vol. 220, pp. 671-680, 1983. [7] y. oussar, i. rivals, l. personnaz, g. dreyfus, "training wavelet networks for nonlinear dynamic input-output modeling", neurocomputing, vol. 20, pp. 173-188, 1998. [8] g. olsson, r. newell, wastewater treatment systems: modelling, diagnosis and control, iwa publishing, london, 1999. [9] e. n. sanchez, j. p. perez, "input-to-state stability (iss) analysis for dynamic neural networks", ieee transactions on circuits and systems – i: fundamental theory and applications, vol. 46, no. 11, pp. 1395-1398, november 1999. [10] j. m. zamarreno, v. pastora, "state space neural network: properties and applications", neural networks, vol. 11, pp. 1099-1112, 1998. [11] q. zhang, a. benveniste, "wavelet networks", ieee trans.
on neural networks, vol. 3, no. 6, pp. 889-898, nov. 1992. [12] q. zhang, "wavelet network: the radial structure and an efficient initialization procedure", technical report of linköping university, lith-isy-i-1423, october 1992. [13] j. zhao, b. chen, j. shen, "multidimensional non-orthogonal wavelet basis function neural network for dynamic process fault diagnosis", computers and chemical engineering, vol. 23, pp. 83-92, 1998. adam borowa, krzysztof mazur, gdansk university of technology, department of automatic control, ul. g. narutowicza 11/12, 80-952 gdansk, poland, e-mail: aborowa@ely.pg.gda.pl, kmazur@ely.pg.gda.pl. mietek a. brdys, the university of birmingham, school of engineering, department of electronic, electrical and computer engineering, birmingham b15 2tt, uk, e-mail: m.brdys@bham.ac.uk. received: march 12, 2007. adam borowa received his m.sc. degree in control engineering in 2002 from the electrical and control engineering department at gdansk university of technology. soon after, he became a ph.d. student in this department. from 2001 to 2002 he served his apprenticeship at the wastewater treatment plant at swarzewo. he has published 6 papers. he focuses mainly on modelling and monitoring of large-scale systems, especially processes with many time scales. mietek a. brdys received the m.sc. degree in electronic engineering and the ph.d. and d.sc. degrees in control systems from the institute of automatic control at the warsaw university of technology in 1970, 1974 and 1980, respectively. from 1974 to 1983, he held the posts of assistant professor and associate professor at the warsaw university of technology. in 1992 he became full professor of control systems in poland. between 1978 and 1995, he held various visiting faculty positions at the university of minnesota, city university, de montfort university and the university polytechnic of catalunya.
since january 1989, he has held the post of senior lecturer in the school of electronic, electrical and computer engineering at the university of birmingham. since february 2001 he has also held the post of full professor of control systems in the department of automatic control at gdansk university of technology. he has served as a consultant for the honeywell systems and research center in minneapolis, gec marconi, and water authorities in the uk, france, spain, germany and poland. he is head of the interdisciplinary research network on decision support and control systems at the university of birmingham and head of the intelligent decision support and control systems group at the technical university of gdansk. his research is supported by the uk and polish research councils, industry and the european commission. he is author and co-author of about 200 refereed papers and six books. his current research includes intelligent decision support and control of complex uncertain systems, robust monitoring and control, and softly switched robustly feasible model predictive control. the applications include environmental systems, technological processes, autonomous intelligent vehicles and defence systems. he is a chartered engineer, a member of the iee, a senior member of ieee, a fellow of the ima and a vice-chair of the ifac technical committee on large scale complex systems. he is the ipc chair of the 11th ifac symposium on large scale complex systems, gdansk, july 23-25, 2007. krzysztof mazur received his m.sc. degree in control engineering from the electrical and control engineering department at gdansk university of technology in 2005. he is currently a ph.d. student in this department. his research interests are in the areas of modelling, control and monitoring of large-scale systems. he is a co-author of 4 publications. international journal of computers, communications & control vol. i (2006), no. 1, pp.
25-32 one more universality result for p systems with objects on membranes gheorghe păun abstract: we continue here the attempt to bridge brane calculi with membrane computing, following the investigation started in [2]. specifically, we consider p systems with objects placed on membranes and processed by membrane operations. the operations used in this paper are membrane creation (cre) and membrane dissolution (dis), defined in a way which recalls the operations pino, exo from the brane calculus of [1]. for p systems based on these operations we prove universality for one of the two possible variants of the operations; for the other variant the problem remains open. keywords: membrane computing, brane calculi, matrix grammar, universality 1 introduction this paper is a direct continuation of [2], where a first step was made to bridge membrane computing [4], [5], [6] and brane calculi [1]. the main point of this effort is to define p systems which work with multisets of objects placed on the membranes rather than inside the compartments defined by the membranes, and to process these multisets by means of operations with membranes rather than by multiset rewriting rules acting only on objects. the operations pino, exo, mate, drip were formalized in [2] as membrane computing rules and used to define p systems based on them. the universality of the mate, drip operations was proved in [2] (for systems using simultaneously at most eleven membranes at any step of a computation). we give here a universality result for two other operations, membrane creation (cre) and membrane dissolution (dis), which have the same syntax as the pino, exo operations but a different interpretation in what concerns the contents of the handled membranes – details can be found in section 3 below. actually, as was the case in [2] with pino, exo, we have two variants of each of the operations cre, dis.
for one of these variants we prove turing completeness, while the case of the other variant remains open (we believe that a similar result holds true). 2 prerequisites all notions of formal language theory we use are elementary and standard, and can be found in any basic monograph on formal language theory. for the sake of completeness, we introduce below only the notion of matrix grammars with appearance checking – after specifying that by re we denote the family of recursively enumerable languages, and by psre the family of parikh images of languages from re (the parikh mapping associated with an alphabet v is denoted by ψv). a matrix grammar with appearance checking [3] is a construct g = (n, t, s, m, f), where n, t are disjoint alphabets (of non-terminals and terminals, respectively), s ∈ n (the axiom), m is a finite set of matrices, that is, sequences of the form (a1 → x1, . . . , an → xn), n ≥ 1, of context-free rules over n ∪ t, and f is a set of occurrences of rules in the matrices of m. for w, z ∈ (n ∪ t)∗ we write w =⇒ z if there is a matrix (a1 → x1, . . . , an → xn) in m and strings wi ∈ (n ∪ t)∗, 1 ≤ i ≤ n + 1, such that w = w1, z = wn+1, and, for all 1 ≤ i ≤ n, either (1) wi = w′i ai w′′i and wi+1 = w′i xi w′′i, for some w′i, w′′i ∈ (n ∪ t)∗, or (2) wi = wi+1, ai does not appear in wi, and the rule ai → xi appears in f. (if applicable, the rules from f should be applied, but if they cannot be applied, then we may skip them. that is why the rules from f are said to be applied in the appearance checking mode.) if f = ∅, then the grammar is said to be without appearance checking. the language generated by g is defined by l(g) = {w ∈ t∗ | s =⇒∗ w}, where =⇒∗ is the reflexive and transitive closure of the relation =⇒. the family of languages of this form is denoted by matac; it is known that matac = re.
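to make the derivation relation concrete, one step w =⇒ z can be simulated for single-letter nonterminals. the helper below is ours, and for simplicity it always rewrites the leftmost occurrence, whereas the definition allows any occurrence:

```python
def apply_matrix(w, matrix, ac_rules=frozenset()):
    """apply one matrix (a1 -> x1, ..., an -> xn) to the string w.
    rules listed in ac_rules are applied in the appearance-checking
    mode: if their left-hand side is absent, they are skipped.  a
    non-checking rule whose left-hand side is absent blocks the whole
    matrix (returns None)."""
    for lhs, rhs in matrix:
        i = w.find(lhs)
        if i < 0:
            if (lhs, rhs) in ac_rules:
                continue                      # skipped, case (2)
            return None                       # matrix not applicable
        w = w[:i] + rhs + w[i + len(lhs):]    # rewrite, case (1)
    return w
```

for example, applying the matrix (X → Y, A → #) with A → # in the appearance-checking set to the string "X" yields "Y": A is absent, so the checking rule is skipped rather than blocking the derivation.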
we say that a matrix grammar with appearance checking g = (n, t, s, m, f) is in the z-binary normal form if n = n1 ∪ n2 ∪ {s, z, #}, with these three sets mutually disjoint, and the matrices in m are in one of the following forms: 1. (s → xa), with x ∈ n1, a ∈ n2, 2. (x → y, a → w), with x, y ∈ n1, a ∈ n2, w ∈ (n2 ∪ t)∗, |w| ≤ 2, 3. (x → y, a → #), with x ∈ n1, y ∈ n1 ∪ {z}, a ∈ n2, 4. (z → λ). moreover, there is only one matrix of type 1, f consists exactly of all rules a → # appearing in matrices of type 3, and, if a sentential form generated by g contains the symbol z, then it is of the form zw, for some w ∈ (t ∪ {#})∗ (that is, the appearance of z makes sure that, except for z, all symbols are either terminal or the trap-symbol #). the matrix of type 4 is used only once, in the last step of a derivation. for each language l ∈ re there is a matrix grammar with appearance checking g in the z-binary normal form such that l = l(g). as usual, we represent multisets over an alphabet v by strings over v, with the obvious observation that all permutations of a string represent the same multiset. 3 p systems using the cre/dis operations we start by recalling from [2] the formalization of the operations pino, exo in terms of membrane computing. a membrane is represented, as usual, by a pair of square brackets, [ ], but we associate here with membranes multisets of objects (corresponding to the proteins embedded in real membranes). a membrane with an associated multiset u (represented by a string) is written in the form [ ]u; we also say that the membrane is marked with the multiset u. the following four operations were defined in [2]: pinoi: [ ]uav → [ [ ]ux ]v, (1) exoi: [ [ ]ua ]v → [ ]uxv, (2) pinoe: [ ]uav → [ [ ]v ]ux, (3) exoe: [ [ ]u ]av → [ ]uxv. (4) in all cases a ∈ v, u, x ∈ v∗, v ∈ v+, with ux ∈ v+ for the pino rules, where v is a given alphabet of objects.
In each case, multisets of proteins are transferred from input membranes to output membranes as indicated in the rules, with the protein a evolved into the multiset x (which can be empty). The subscripts i and e stand for "internal" and "external", respectively, pointing to the "main" membrane of the operation in each case. It is important to note that the multisets u, v and the protein a marking the left-hand membranes of these rules correspond to the multisets u, v, x from the right-hand side of the rules; specifically, the multiset uxv resulting when applying the rule is precisely split into ux and v, with these two multisets assigned to the two new membranes.

The rules are applied as follows. Assume that we have a membrane [ ]_{zuav}, for a ∈ V, u, v, z ∈ V*. By a pino_i rule as in (1) we obtain any one of the pairs of membranes [ [ ]_{z_1 ux} ]_{z_2 v} such that z = z_1 z_2, z_1, z_2 ∈ V*, and by a pino_e rule as in (3) we obtain any one of the pairs of membranes [ [ ]_{z_1 v} ]_{z_2 ux} such that z = z_1 z_2, z_1, z_2 ∈ V*. In the case of the two exo operations, the result is uniquely determined: from a pair of membranes [ [ ]_{z_1 ua} ]_{z_2 v}, by an exo_i rule as in (2) we obtain the membrane [ ]_{z_1 z_2 uxv}, and from [ [ ]_{z_1 u} ]_{z_2 av}, by an exo_e rule as in (4) we obtain the same membrane [ ]_{z_1 z_2 uxv}. The contents of the membranes involved in these operations is transferred from the input membranes to the output membranes in the same way as in brane calculi (P, Q represent here the possible contents of the respective membranes):

pino_i : [ P ]_{uav} → [ [ ]_{ux} P ]_v,
exo_i : [ [ P ]_{ua} Q ]_v → P [ Q ]_{uxv},
pino_e : [ P ]_{uav} → [ [ ]_v P ]_{ux},
exo_e : [ [ P ]_u Q ]_{av} → P [ Q ]_{uxv}.
One more universality result for P systems with objects on membranes

Here we change the interpretation of these rules, as suggested below (because the new semantics do not correspond to the operations pino, exo, we change the names of the operations to cre, dis, for "membrane creation" and "membrane dissolution"):

cre_i : [ P ]_{uav} → [ [ P ]_{ux} ]_v,
dis_i : [ [ P ]_{ua} Q ]_v → [ P Q ]_{uxv},
cre_e : [ P ]_{uav} → [ [ P ]_v ]_{ux},
dis_e : [ [ P ]_u Q ]_{av} → [ P Q ]_{uxv}.

That is, when a membrane is created inside an existing membrane, the new membrane contains all previously existing membranes, and when a membrane is dissolved, its contents remains inside the membrane where it was placed before the operation. The interpretation of the latter operation is rather similar to the usual dissolution operation in membrane computing, while membrane creation is understood as doubling the existing membrane, with a distribution of the multiset marking the initial membrane to the two new membranes. Using rules as defined above, we can define a P system as Π = (A, μ, u_1, ..., u_m, R), where:

1. A is an alphabet (finite, non-empty) of objects;
2. μ is a membrane structure with m ≥ 2 membranes;
3. u_1, ..., u_m are multisets of objects (represented by strings over A) bound to the m membranes of μ at the beginning of the computation; the skin membrane is marked with u_1 = λ;
4. R is a finite set of cre, dis rules, of the forms specified above, with the objects from the set A.

For a rule of any type, with u, a, v as above, |uav| is called the weight of the rule. In what follows, the skin membrane plays no role in the computation: no rule can be applied to it. We also stress the fact that there is no object in the compartments of μ; a membrane can contain other membranes inside, but in-between membranes there is nothing.
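The cre_i/dis_i semantics can be illustrated with a small Python sketch. This is a hypothetical model, not code from the paper: markings are multisets (Counter) over single-character objects, the linear nesting of membranes is a list of markings from innermost to outermost, and the leftover marking z is kept, deterministically, on the new inner membrane (in general its split between the two membranes is non-deterministic):

```python
from collections import Counter

def contains(marking, need):
    """True if multiset `marking` contains multiset `need`."""
    return all(marking[sym] >= k for sym, k in need.items())

def cre_i(chain, k, u, a, v, x):
    """cre_i: [P]_uav -> [[P]_ux]_v applied to membrane k of `chain`
    (markings listed innermost first); the leftover marking z is kept
    on the new inner membrane, a simplification of the rule."""
    need = Counter(u + a + v)
    if not contains(chain[k], need):
        raise ValueError("cre_i not applicable")
    z = chain[k] - need
    return chain[:k] + [Counter(u + x) + z, Counter(v)] + chain[k + 1:]

def dis_i(chain, k, u, a, x, v):
    """dis_i: [[P]_ua Q]_v -> [P Q]_uxv, merging membranes k and k+1."""
    if not (contains(chain[k], Counter(u + a)) and contains(chain[k + 1], Counter(v))):
        raise ValueError("dis_i not applicable")
    merged = (chain[k] - Counter(a)) + chain[k + 1] + Counter(x)
    return chain[:k] + [merged] + chain[k + 2:]

# A membrane marked with the multiset {s, a, b}: create, then dissolve.
chain = [Counter("sab")]
chain = cre_i(chain, 0, u="s", a="a", v="b", x="cc")  # -> [ [ ]_{scc} ]_{b}
chain = dis_i(chain, 0, u="s", a="c", x="d", v="b")   # -> [ ]_{sbcd}
print(sorted(chain[0].elements()))  # ['b', 'c', 'd', 's']
```

Note how dis_i restores a single membrane whose marking merges the two previous markings, with a rewritten into x, mirroring the displayed rules.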
When using a rule of any type, we say that the membranes from its left-hand side are involved in the rule; they are all "consumed", and the membranes from the right-hand side of the rule are produced instead. Similarly, the object a specified in the left-hand side of a rule is "consumed" and replaced by the multiset x. The evolution of the system is defined in the standard way used in membrane computing, with the rules applied in the non-deterministic maximally parallel manner, with each membrane involved in at most one rule. Thus, the parallelism is maximal at the level of membranes – each membrane which can evolve has to do it – but each multiset of objects evolves in a sequential manner, as only one rule can act on any multiset in a transition step. More precise details can be found in [2]. A computation which starts from the initial configuration is successful if (i) it halts, that is, it reaches a configuration where no rule can be applied, and (ii) in the halting configuration there are only two membranes, the skin (marked with λ) and an inner one. The result of a successful computation is the vector of multiplicities of the objects which mark the inner membrane in the halting configuration. The set of all vectors computed in this way by Π is denoted by Ps(Π). The family of all sets of vectors Ps(Π) computed by P systems Π using at any moment during a computation at most m membranes, and cre_i, dis_i rules of weight at most p, q, respectively, is denoted by PsOP_m(cre_p, dis_q). When one of the parameters m, p, q is not bounded we replace it with ∗. We end this section by pointing out some relations which follow directly from the definitions (and from the Turing-Church thesis).

Lemma 1. (i) PsOP_m(cre_p, dis_q) ⊆ PsOP_{m′}(cre_{p′}, dis_{q′}), for all m ≤ m′, p ≤ p′, q ≤ q′. (ii) PsOP_∗(cre_∗, dis_∗) ⊆ PsRE.

We also recall the main result from [2]: PsOP_{11}(mate_5, drip_5) = PsRE (the notation is self-explanatory).
4 Universality for the cre/dis Operations

In the case of the cre, dis operations as defined above, we cannot generate vectors of norm 0 or 1: in each rule [ ]_{uav} → [ [ ]_{ux} ]_v, [ [ ]_{ua} ]_v → [ ]_{uxv} (necessary in the last step of any computation in order to get only one internal membrane) we have imposed |uxv| ≥ 2. That is why the universality below is obtained modulo vectors of the form (0, ..., 0) and (0, ..., 0, 1, 0, ..., 0). We denote by Ps′RE and Ps′OP_m(cre_p, dis_q) the sets of vectors from PsRE and PsOP_m(cre_p, dis_q) having the sum of elements greater than or equal to 2.

Theorem 2. Ps′RE = Ps′OP_m(cre_p, dis_q) for all m ≥ 7, p ≥ 4, and q ≥ 4.

Proof. Let us consider a language L ∈ RE = MAT_ac, L ⊆ V²V∗, for an alphabet V with n symbols. We write this language in the form L = ⋃_{a,b∈V} {ab} ∂ˡ_{ab}(L). Let G_{ab} = (N_{ab}, V, S_{ab}, M_{ab}, F_{ab}) be a matrix grammar with appearance checking such that L(G_{ab}) = ∂ˡ_{ab}(L), for a, b ∈ V. We consider these grammars G_{ab} in the Z-binary normal form, with the notations from Section 2 (hence N_{ab} = N_{ab,1} ∪ N_{ab,2} ∪ {S_{ab}, Z_{ab}, #}), and we construct the matrix grammar G = (N, V, S, M, F) with

N = N_1 ∪ N_2 ∪ {Z_{ab} | a, b ∈ V} ∪ {S, #},
N_1 = ⋃_{a,b∈V} N_{ab,1},  N_2 = ⋃_{a,b∈V} N_{ab,2},
M = {(S → X A) | for (S_{ab} → X A) ∈ M_{ab}, a, b ∈ V}
  ∪ {(X → Y, A → w) | for (X → Y, A → w) ∈ M_{ab}, a, b ∈ V}
  ∪ {(Z_{ab} → ab) | for (Z_{ab} → λ) ∈ M_{ab}, a, b ∈ V}.

Obviously, L(G) = L. We assume that all two-rule matrices from M are injectively labeled, in the form m_l : (X → Y, A → x), l ∈ Lab, for a set of labels Lab. Starting from the grammar G we now construct the P system Π = (A, [ [ ] ], λ, S_1 S_2, R), with the alphabet

A = {Y, Y′, Y″, Y‴, Y^{iv}, Y^{v}, Y^{vi}, Y^{vii}, Y^{viii}, Y^{ix}, Y^{x} | Y ∈ N_1}
  ∪ {α, α′, α″ | α ∈ N_2 ∪ V} ∪ {Ā | A ∈ N_2}
  ∪ {Z_{ab}, Z′_{ab}, Z″_{ab}, Z‴_{ab} | a, b ∈ V}
  ∪ {E, H, H′, S_1, S_2, S_3, c_1, ..., c_{11}, c_0, c′_0, c″_0, c′_3, c″_3, d_1, d_2, d′_1, d′_2, f′, f″, #},

and the rules from the set R as constructed below.
Any computation starts from the configuration [ [ ]_{S_1 S_2} ]_λ, by using the following rules:

Step 1: [ ]_{S_1 S_2} → [ [ ]_X ]_{S_2},
Step 2: [ [ ]_X ]_{S_2} → [ ]_{X c_0 d_1 S_2},
Step 3: [ ]_{X S_2 c_0 d_1} → [ [ ]_{X S_3} ]_{c_0 d_1},
Step 4: [ ]_{S_3 X} → [ [ ]_{E Ā} ]_X,  [ ]_{c_0 d_1} → [ [ ]_{c′_0} ]_{d_1},
Step 5: [ [ ]_{E Ā} ]_X → [ ]_{E Ā X},  [ [ ]_{c′_0} ]_{d_1} → [ ]_{c″_0 d_1},
Step 6: [ ]_{X Ā E} → [ [ ]_{X A} ]_E,  [ ]_{c″_0 d_1} → [ [ ]_{c_1} ]_{d_1},

for each matrix (S_{ab} → X A) ∈ M_{ab}, for a, b ∈ V. The rules are used as indicated above, with two rules applied simultaneously in steps 4, 5, and 6. The only possible branching is in step 3, when instead of the rule [ ]_{X S_2 c_0 d_1} → [ [ ]_{X S_3} ]_{c_0 d_1} we can also use the rule [ ]_{c_0 d_1} → [ [ ]_{c′_0} ]_{d_1}. In this way we obtain the membranes [ [ ]_{c′_0} ]_{d_1}, with X S_2 distributed among them. Because S_3 will never be introduced, we continue only with rules which process membranes marked with c_i and d_1, namely the rules from the third column of Table 1; in this way, the computation will never stop, both because we can return again and again to a pair of membranes of the form [ [ ]_{c_1} ]_{d_1}, and because pairs of membranes marked with c′_3 will appear and introduce trap objects/membranes – see also below. The evolution of the membrane structure is indicated in Figure 1.

Initial: [ [ ]_{S_1 S_2} ]_λ
Step 1: [ [ [ ]_X ]_{S_2} ]_λ
Step 2: [ [ ]_{X c_0 d_1 S_2} ]_λ
Step 3: [ [ [ ]_{X S_3} ]_{c_0 d_1} ]_λ
Step 4: [ [ [ [ [ ]_{E Ā} ]_X ]_{c′_0} ]_{d_1} ]_λ
Step 5: [ [ [ ]_{E Ā X} ]_{c″_0 d_1} ]_λ
Step 6: [ [ [ [ [ ]_{X A} ]_E ]_{c_1} ]_{d_1} ]_λ

Figure 1: The evolution of membranes at the beginning of computations.

Thus, we end with a configuration of the form [ [ [ [ [ ]_{X A} ]_E ]_{c_1} ]_{d_1} ]_λ. The rules for simulating the two-rule matrices from M are indicated in Table 1; by w′ we denote here the string obtained from w by priming one symbol; if w = λ, then w′ = f′, hence α′ = f′, α″ = f″ and, in row 6, α = λ.
Step | m_l : (X → Y, A → w) | m_l : (X → Y, B → #) | control membranes
1 | [ [ ]_X ]_E → [ ]_{X_l E} | [ [ ]_X ]_E → [ ]_{X_l E} | [ [ ]_{c_1} ]_{d_1} → [ ]_{c_2 c_3 d_1}
2 | [ ]_{A E X_l} → [ [ ]_{w′} ]_{E X_l} | [ ]_{X_l B E} → [ [ ]_{X_l ##} ]_E | [ ]_{c_3 c_2 d_1} → [ [ ]_{c′_3} ]_{c_2 d_1}
3 | [ [ ]_{E X_l} ]_{c′_3} → [ ]_{E Y′ c′_3} | [ [ ]_{X_l E} ]_{c′_3} → [ ]_{Y^{vi} H E c′_3} | [ ]_{c_2 d_1} → [ [ ]_{c_4} ]_{d_1}
4 | [ ]_{c′_3 Y′ E} → [ [ ]_{c′_3 Y″} ]_E | [ ]_{Y^{vi} c′_3 E H} → [ [ ]_{c′_3 Y^{vii}} ]_{E H} | [ [ ]_{c_4} ]_{d_1} → [ ]_{c_5 d_1}
5 | [ [ ]_{α′} ]_{c′_3 Y″} → [ ]_{α″ c′_3 Y″} | [ ]_{Y^{vii} c′_3} → [ [ ]_{Y^{viii}} ]_{c′_3}, [ ]_{H E} → [ [ ]_{H′} ]_E | [ ]_{c_5 d_1} → [ [ ]_{c_6} ]_{d_1}
6 | [ [ ]_{α″ c′_3 Y″} ]_E → [ ]_{α c′_3 Y″ E} | [ [ ]_{c′_3} ]_{H′} → [ ]_{c″_3 H′} | [ [ ]_{c_6} ]_{d_1} → [ ]_{c_7 d_1}
7 | [ ]_{c′_3 Y″ E} → [ [ ]_{c′_3 Y‴} ]_E | [ [ ]_{Y^{viii}} ]_{c″_3 H′} → [ ]_{Y^{ix} c″_3 H′} | [ ]_{c_7 d_1} → [ [ ]_{c_8} ]_{d_1}
8 | [ [ ]_{c′_3 Y‴} ]_E → [ ]_{Y‴ E} | [ [ ]_{Y^{ix} c″_3 H′} ]_E → [ ]_{Y^{ix} H′ E} | [ [ ]_{c_8} ]_{d_1} → [ ]_{c_9 d_1}
9 | [ ]_{Y‴ E} → [ [ ]_{Y^{iv}} ]_E | [ ]_{Y^{ix} H′ E} → [ [ ]_{Y^{x}} ]_E | [ ]_{c_9 d_1} → [ [ ]_{c_{10}} ]_{d_1}
10 | [ [ ]_{Y^{iv}} ]_E → [ ]_{Y^{v} E} | [ [ ]_{Y^{x}} ]_E → [ ]_{Y^{x} E} | [ [ ]_{c_{10}} ]_{d_1} → [ ]_{c_{11} d_1}
11 | [ ]_{Y^{v} E} → [ [ ]_Y ]_E | [ ]_{Y^{x} E} → [ [ ]_Y ]_E | [ ]_{c_{11} d_1} → [ [ ]_{c_1} ]_{d_1}

Table 1: Rules for simulating two-rule matrices.

We also consider the rules

[ ]_{X_l E} → [ [ ]_{##} ]_E, for each matrix m_l : (X → Y, A → w),
[ [ ]_{H′} ]_E → [ ]_{## E},
[ ]_{##} → [ [ ]_# ]_#,
[ [ ]_# ]_# → [ ]_{##}.

The simulation of matrices in G is performed by modifying the marking of the central membranes, those emerging from the initial membranes with markings X A and E, with these operations assisted by the two membranes with markings c_1 and d_1 and their successors, which are external to the central membranes where the sentential form of G is produced. Throughout the computation, the membranes remain embedded one in another, in a linear manner, never having two membranes on the same level (here lies the essential difference between the interpretation of the cre, dis operations and the interpretation of the pino, exo operations from [1], [2]). The evolution of the membranes and of their relevant markings can be followed in Figure 2.
If in the second step the rule [ ]_{A E X_l} → [ [ ]_{w′} ]_{E X_l} is not applicable (hence the matrix m_l cannot be applied), then the rule [ ]_{X_l E} → [ [ ]_{##} ]_E will be applied, introducing the trap-object #, and the computation will never halt.

Starting: [ [ [ [ [ ]_X ]_E ]_{c_1} ]_{d_1} ]_λ
Step 1: [ [ [ ]_{X_l E A} ]_{c_2 c_3 d_1} ]_λ
Step 2: [ [ [ [ [ ]_{w′} ]_{E X_l} ]_{c′_3} ]_{c_2 d_1} ]_λ
Step 3: [ [ [ [ [ ]_{α′} ]_{E Y′ c′_3} ]_{c_4} ]_{d_1} ]_λ
Step 4: [ [ [ [ [ ]_{α′} ]_{c′_3 Y″} ]_E ]_{c_5 d_1} ]_λ
Step 5: [ [ [ [ [ ]_{α″ c′_3 Y″} ]_E ]_{c_6} ]_{d_1} ]_λ
Step 6: [ [ [ ]_{α c′_3 Y″ E} ]_{c_7 d_1} ]_λ
Step 7: [ [ [ [ [ ]_{c′_3 Y‴} ]_E ]_{c_8} ]_{d_1} ]_λ
Step 8: [ [ [ ]_{Y‴ E} ]_{c_9 d_1} ]_λ
Step 9: [ [ [ [ [ ]_{Y^{iv}} ]_E ]_{c_{10}} ]_{d_1} ]_λ
Step 10: [ [ [ ]_{Y^{v} E} ]_{c_{11} d_1} ]_λ
Step 11: [ [ [ [ [ ]_Y ]_E ]_{c_1} ]_{d_1} ]_λ

Figure 2: The evolution of membranes when simulating m_l : (X → Y, A → w).

The evolution of membranes in the case of the simulation of a matrix m_l : (X → Y, B → #) can be followed in Figure 3. This time, if B is present, in step 2 we have to use the rule [ ]_{X_l B E} → [ [ ]_{X_l ##} ]_E, and the computation will never halt. If no copy of B is present, then the central membrane does not evolve, waiting for the membrane marked with c′_3 to be produced; this membrane can be used in the next step for evolving the central membrane.

Starting: [ [ [ [ [ ]_X ]_E ]_{c_1} ]_{d_1} ]_λ
Step 1: [ [ [ ]_{X_l E} ]_{c_2 c_3 d_1} ]_λ
Step 2: [ [ [ [ ]_{X_l E} ]_{c′_3} ]_{c_2 d_1} ]_λ
Step 3: [ [ [ [ ]_{Y^{vi} H E c′_3} ]_{c_4} ]_{d_1} ]_λ
Step 4: [ [ [ [ ]_{c′_3 Y^{vii}} ]_{E H} ]_{c_5 d_1} ]_λ
Step 5: [ [ [ [ [ [ [ ]_{Y^{viii}} ]_{c′_3} ]_{H′} ]_E ]_{c_6} ]_{d_1} ]_λ
Step 6: [ [ [ [ [ ]_{Y^{viii}} ]_{c″_3 H′} ]_E ]_{c_7 d_1} ]_λ
Step 7: [ [ [ [ [ ]_{Y^{ix} c″_3 H′} ]_E ]_{c_8} ]_{d_1} ]_λ
Step 8: [ [ [ ]_{Y^{ix} H′ E} ]_{c_9 d_1} ]_λ
Step 9: [ [ [ [ [ ]_{Y^{x}} ]_E ]_{c_{10}} ]_{d_1} ]_λ
Step 10: [ [ [ ]_{Y^{x} E} ]_{c_{11} d_1} ]_λ
Step 11: [ [ [ [ [ ]_Y ]_E ]_{c_1} ]_{d_1} ]_λ

Figure 3: The evolution of membranes when simulating m_l : (X → Y, B → #).

Another step when we can apply a rule different from that indicated in Table 1 is step 4, when we can also use the rule [ ]_{H E} → [ [ ]_{H′} ]_E.
In this way, we pass to the configuration of membranes [ [ [ [ ]_{H′ w_1} ]_{E w_2} ]_{c_5 d_1} ]_λ, where w_1 w_2 = Y^{vi} c′_3. No rule can be applied to the two inner membranes other than [ [ ]_{H′} ]_E → [ ]_{## E}, and again the computation will never stop. Therefore, the simulation of matrices in G must be done as above, and in this way we return to a configuration like the one we started with, with four membranes marked with X, E, c_1, d_1, respectively (the central membranes also carrying the symbols of the current sentential form of G which is simulated in Π). Note that the rules used for simulating a matrix m_l : (X → Y, A → w) cannot be mixed with the rules used for simulating a matrix m_{l′} : (X′ → Y′, A′ → #), because of the injective labeling of matrices from M and because of the priming of symbols from N_1. The process can be iterated, hence at some moment we introduce the symbol Z_{ab} identified by the symbols from N_1 used. The respective configuration is of the form [ [ [ [ [ ]_{Z_{ab}} ]_E ]_{c_1} ]_{d_1} ]_λ. The central membrane will "swallow" all other membranes, also removing all auxiliary objects. To this aim, we use the following rules:

Step 1: [ [ ]_{Z_{ab}} ]_E → [ ]_{Z′_{ab} E},
Step 2: [ [ ]_{Z′_{ab} E} ]_{c_2 c_3} → [ ]_{Z′_{ab} b c_2 c_3},
Step 3: [ ]_{Z′_{ab} c_2 c_3 d_1} → [ [ ]_{Z′_{ab}} ]_{c_3 d_1},
Step 4: [ [ ]_{Z′_{ab}} ]_{c_3 d_1} → [ ]_{Z″_{ab} c_3 d_1},
Step 5: [ ]_{Z″_{ab} c_3 d_1} → [ [ ]_{Z″_{ab}} ]_{d_1},
Step 6: [ [ ]_{Z″_{ab}} ]_{d_1} → [ ]_{Z‴_{ab} d_1},
Step 7: [ ]_{Z‴_{ab} d_1 b} → [ [ ]_{Z‴_{ab}} ]_b,
Step 8: [ [ ]_{Z‴_{ab}} ]_b → [ ]_{ab},

for all a, b ∈ V. Furthermore, we consider the rules

[ ]_{Z′_{ab} E} → [ [ ]_{##} ]_E,
[ [ ]_{c′_3} ]_{c′_3} → [ ]_{## c′_3},
[ ]_{# a} → [ [ ]_{##} ]_a, for all a ∈ V.

The first of these rules is used in step 2 if the rule [ [ ]_{Z′_{ab} E} ]_{c_2 c_3} → [ ]_{Z′_{ab} b c_2 c_3} is not used – the objects c_2 c_3 d_1 might be used at that time by the rule [ ]_{c_3 c_2 d_1} → [ [ ]_{c′_3} ]_{c_2 d_1} from Table 1.
Similarly, if this last rule is used in step 3 instead of the rule [ ]_{Z′_{ab} c_2 c_3 d_1} → [ [ ]_{Z′_{ab}} ]_{c_3 d_1}, then a membrane marked with c′_3 is introduced, which will never be removed. In particular, after 11 steps, we introduce another membrane marked with c′_3, and then the rule [ [ ]_{c′_3} ]_{c′_3} → [ ]_{## c′_3} is used, preventing the termination of the computation. In conclusion, the evolution of the membranes in the final stage of the computation is as indicated in Figure 4.

Starting: [ [ [ [ [ ]_{Z_{ab}} ]_E ]_{c_1} ]_{d_1} ]_λ
Step 1: [ [ [ ]_{Z′_{ab} E} ]_{c_2 c_3 d_1} ]_λ
Step 2: [ [ ]_{Z′_{ab} b c_2 c_3 d_1} ]_λ
Step 3: [ [ [ ]_{Z′_{ab}} ]_{c_3 d_1} ]_λ
Step 4: [ [ ]_{Z″_{ab} c_3 d_1} ]_λ
Step 5: [ [ [ ]_{Z″_{ab}} ]_{d_1} ]_λ
Step 6: [ [ ]_{Z‴_{ab} d_1} ]_λ
Step 7: [ [ [ ]_{Z‴_{ab}} ]_b ]_λ
Step 8: [ [ ]_{ab} ]_λ

Figure 4: The evolution of membranes at the end of computations.

The equality ψ_V(L(G)) = Ps(Π) follows from the previous explanations. With the observation that the maximal number of membranes present in the system is seven – in step 5 of Figure 3 (during the simulation of matrices with a rule to be used in the appearance checking mode) – and that the rules have the weights specified in the theorem, we conclude the proof.

5 Final Remarks

The case of using the operations cre_e, dis_e remains as a task for the reader, and the same for other operations from brane calculi – see also [2] for related problems. Improvements of the result in Theorem 2 are also plausible in what concerns the degree of context-sensitivity of the rules (and maybe also in what concerns the number of membranes). The same problems can be formulated for the result from [2]. As a general research topic, it remains to systematically investigate P systems with multisets of objects placed on membranes (maybe also in the compartments), processed by membrane-handling operations as in brane calculi (maybe also by local multiset rewriting rules).

References

[1] L. Cardelli, Brane calculi. Interactions of biological membranes, Proc.
Computational Methods in Systems Biology, 2004, Springer-Verlag, Berlin, to appear.
[2] L. Cardelli, Gh. Păun, An universality result for a (mem)brane calculus based on mate/drip operations, Intern. J. Foundations of Computer Sci., 17, 1 (2006), 49–68.
[3] J. Dassow, Gh. Păun, Regulated Rewriting in Formal Language Theory, Springer-Verlag, Berlin, 1989.
[4] Gh. Păun, Computing with membranes, Journal of Computer and System Sciences, 61, 1 (2000), 108–143 (and Turku Center for Computer Science–TUCS Report 208, November 1998, www.tucs.fi).
[5] Gh. Păun, Membrane Computing. An Introduction, Springer-Verlag, Berlin, 2002.
[6] The membrane computing web page: http://psystems.disco.unimib.it.

Institute of Mathematics of the Romanian Academy
PO Box 1-764, 014700 Bucureşti, Romania
and
Research Group on Natural Computing
Department of Computer Science and Artificial Intelligence, University of Sevilla
Avda. Reina Mercedes s/n, 41012 Sevilla, Spain
E-mail: george.paun@imar.ro, gpaun@us.es

Editor's note about the author:

Gheorghe Păun (born on December 6, 1950) graduated from the Faculty of Mathematics of the University of Bucharest in 1974 and received his PhD at the same faculty in 1977. He has won many scholarships, in Germany, Finland, the Netherlands, Spain, etc. Presently he is a senior researcher at the Institute of Mathematics of the Romanian Academy, Bucharest, and a Ramón y Cajal research professor at the University of Sevilla, Spain. Since 1997 he has been a corresponding member of the Romanian Academy, and since 2006 a member of Academia Europaea. His main research fields are formal language theory (regulated rewriting, contextual grammars, grammar systems), automata theory, combinatorics on words, computational linguistics, DNA computing, and membrane computing (this last area was initiated by him in 1998). He has (co)authored and (co)edited more than fifty books in these areas, and he has (co)authored more than 400 research papers.
In the last two decades he has visited many universities in Europe, the USA, Canada, and Japan, also participating in many international conferences, several times as an invited speaker. He is a member of the editorial boards of numerous computer science journals and of professional associations.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 2, pp. 61-71

Solution to the Reliability Problem of Radial Electric Distribution Systems through Artificial Intelligence

E. López, J. Campos, C. Tardon, F. Salgado, J. Tardon, R. López

Abstract: In this paper the reliability of the electric distribution system is recognized as an artificial intelligence problem. How this idea is applied in the evaluation of reliability is detailed. Concepts such as the intelligence matrix and the inter-feeder route are defined. From the latter, a reliability prediction strategy for medium voltage networks is proposed and tested.

Keywords: artificial intelligence, agent, searching, primary route, secondary route

1 Introduction

It has been established that, within an electric system, nearly 80% of the faults occur in the distribution system (Billinton and Allan, 1998). Legal aspects protect the customers from faults in the electric system, so it is necessary for electrical companies to guarantee a high level of security, quality, and reliability in the service. Companies are affected not only by demanding norms but also by high financial losses due to non-sold energy and penalties. Reliability is evaluated using Markovian models (Brown and Gupta, 1998; Asgapoor and Mathine, 1997; Brown et al., 1997; Brown and Ochoa, 1998) or different analytic methods (Billinton and Wang, 1999) that, according to past behavior, extrapolate its future behavior. This paper develops a method that makes possible the prediction of the distribution system's behavior in terms of reliability, addressing it through artificial intelligence (AI).
The method is capable of giving information in terms of representative indexes at the load points and for the overall system, including alternative supply in case of failure of the principal feeder. When considering AI, we intend to model the distribution system's reliability and to develop concepts to be used in the solution of a generic problem that requires topological analysis. Therefore, emphasis is placed on search problems as AI tasks. Within the wide world of AI we find search problems (Russell and Norvig, 1996). Once the problem is defined (this action is called "abstraction"), the process starts from an initial state. Then operators are applied to the present state, resulting in a new set of states; this process is called state "expansion". The essence of the search is to choose one option and to set all the others aside, to be expanded afterwards in case the first choice does not reach a solution. The dilemma of which state to expand first is resolved by the "search strategy". The search process can be viewed as the construction of a search tree superposed on the space of states, where the root node is the initial state and the leaf nodes are states that have no successor in the tree (because they have not been expanded yet, or because they have already been expanded but generate an empty state). An efficient search strategy is "depth-first search". This strategy always expands one of the nodes at the deepest level of the tree, and only when the search reaches a dead end (a goal state or one without expansion) does it continue by expanding nodes at a shallower level.

2 Proposed Method

The algorithm has the mission of finding the failure modes of the system. The hypotheses are: the protection elements are 100% reliable, and a fault is determined by a first-order cut. The first step of the method is to establish a protocol of nodal assignment.
It consists in labelling a bus or node of the distribution system with number 1 and the branches with ascending numbers from the departure node to the load endpoints, in the form shown by the arrows in Figure 1.

Figure 1: Basic system.

Next the state space is defined, which corresponds to the environment in which the agent can move to find the solution. Accordingly, for the basic test system (BS) of Figure 1, the states are defined by each of the buses of the system. The BS consists of 16 buses and 5 load points (LP). There are 16 possible states through which the agent can "travel" in order to satisfy the goal test. The information related to the topology of the BS is held in the so-called "intelligence matrix" (IM), which forms the basis of the search algorithm. Part of the IM corresponding to the BS is shown in Figure 2.

Figure 2: Part of the IM for the basic system.

The numbers of the rows and columns are the nodes (buses). The procedure to enter the data consists of entering the components between two consecutive nodes, identifying each with a letter and associating it with the respective element of the IM. For the BS, between nodes 1 and 2 there is a line, indicated by an L in position [1,2] of the matrix, and the same for element [2,1]. When there is a switching or protection device (as between nodes 2 and 3), or when there is a line and a disconnect, the element [2,3] is defined with an L (line), and in position [3,2] an S (disconnect) is entered, indicating that the switching device is closer to bus 3. This directional quality, differentiating the common elements of the system from the switching and/or protection devices, is decisive for the optimal resolution of the problem; this is the reason for the name "intelligence matrix".
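The construction just described can be sketched as follows. The topology below is a hypothetical fragment, not the exact 16-bus BS of Figure 1; 'L' = line, 'S' = disconnect, 'F' = fuse, and the device letter is stored on the side of the node it is closer to, which is what makes the matrix directional:

```python
# Build a 1-indexed intelligence matrix (IM) from a list of branches.
def make_im(n_nodes, elements):
    """elements: (node_i, node_j, label_at_[i][j], label_at_[j][i])."""
    im = [[""] * (n_nodes + 1) for _ in range(n_nodes + 1)]  # 1-based indexing
    for i, j, lab_ij, lab_ji in elements:
        im[i][j], im[j][i] = lab_ij, lab_ji
    return im

im = make_im(4, [
    (1, 2, "L", "L"),  # plain line between buses 1 and 2
    (2, 3, "L", "S"),  # line, with a disconnect closer to bus 3
    (3, 4, "L", "F"),  # line, with a fuse closer to bus 4
])
print(im[3][2], im[2][3])  # S L
```

Blank entries stand for the non-existence of elements, exactly as the blank spaces in Figure 3.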
Specifically, for the evaluation of the future reliability indicators, in the BS the following are identified:

Initial state: the search begins from the load point to be evaluated.
Operators: the agent can "jump" or "travel" from a particular state to the next closest one, i.e., from [3,2] to [2,1], either towards the generation node (ascending) or towards another load point (descending).
State space: the states expanded by the agent from the load point (the initial state).
Route: the path from one state to another within the state space.
Goal test: it corresponds to having expanded all the possible states in which any eventual fault may produce a loss of supplied energy to the load point under evaluation.
Cost of route: a unit cost to travel from one state to the next nearest one.
Solution: the result of the search algorithm once the goal is fulfilled.

In accordance with the above, the complete IM for the BS is shown in Figure 3; the non-existence of elements in the system is indicated by blank spaces. In order to find the failure modes of the system, the algorithm must identify the primary route and the secondary routes (López et al., 1998). The IM of the BS shows that the diagonal divides the information of the different routes, due to the ordered assignment of node numbers. The algorithm uses "best-first search", so the best expansion option occurs first; in this case it corresponds to the primary route, leaving the secondary routes in a waiting queue for the next expansion.

2.1 Getting the Primary Route

The primary route is the minimal-cost route from the load point to the feeding node. The information about the primary route is contained under the principal diagonal of the IM. The primary route of each load point is stored in a matrix called the "matrix of primary routes" (MPR).
Each row contains the primary route corresponding to one load point. The MPR is of order m × n (load points × nodes) and is initially a null matrix. Its formation is ordered, starting with the agent at the first load point (LP1). In the intelligence matrix, the row corresponding to the node number of the load point is checked; then the elements contained under the diagonal are examined. When an element is found within the search sector, the number of the next node is entered in the row of the matrix of primary routes; this node is given by the number of the column of the found element. Then the row corresponding to the next node of the primary route is examined, and so on, until node 1 of the feeding is reached. Figure 4 outlines how to find the primary route for load point LP1, which corresponds to node 16. Figure 5 shows the BS's matrix of primary routes (MPR of the BS). Regarding the agent as a "problem-solving agent" (Russell and Norvig, 1996) and a state as a simple data structure, the agent is placed in a route-determination problem. The agent corresponds to a resolution algorithm that uses a sequence of actions to follow in order to reach its goal. The "resolutor agent" of the simplified problem of finding the primary route corresponds to Figure 6. This agent is executed for each load point (LP) and ends, or reaches its goal, when the feeding node (node 1) is found. The route of the agent corresponds to the one shown above in Figure 4. It is observed in Figure 6 that the agent has two rules (if condition, then action). They take the method from a basic, incipient system to an expert one, which has the particularity of providing great flexibility when incorporating new knowledge: incorporating a new rule is enough, without changing the algorithm's behavior.
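The below-diagonal walk described above can be sketched as follows; the 4-bus IM is hypothetical (not the 16-bus BS of the paper), and in a radial system each row has a single below-diagonal entry, the parent bus:

```python
def primary_route(im, load_node):
    """Follow, in each row of the IM, the single entry below the diagonal
    (the parent bus in the radial tree) from the load point down to node 1."""
    route = [load_node]
    node = load_node
    while node != 1:
        node = next(col for col in range(1, node) if im[node][col])
        route.append(node)
    return route

# Hypothetical 4-bus radial feeder:
im = [[""] * 5 for _ in range(5)]
im[1][2] = im[2][1] = "L"
im[2][3], im[3][2] = "L", "S"
im[3][4], im[4][3] = "L", "F"

print(primary_route(im, 4))  # [4, 3, 2, 1]
```

Run once per load point, the resulting routes would fill the rows of the MPR.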
2.2 Secondary Routes

At this moment the "resolutor agent" of the primary route has crossed the system and filled the matrix of primary routes, MPR. A vector called "cont" is added, which stores the number of nodes in each primary route. By knowing the dimensions of each primary route (cont), the agent can track the secondary routes, "looking" for them above the diagonal of the IM, for each of the nodes that the primary route contains. This is done by expanding first the secondary routes nearer to the feeding and leaving in delay the remaining routes at the nodes nearest to the load point; for this, the MPR is traversed in inverse order. For example, for load point LP1, first the secondary routes originating in node 1 are expanded, then those of nodes 2, 7, and 8, until arriving at node 16, which is the node containing the load point. The search for these routes is similar to that made for the primary routes, but with some restrictions. One of them is that the agent must finish the search the moment a fuse is found and must begin to expand the next route that is in delay. As secondary routes are subdivided into different branches, it is necessary to store the nodes that, at the moment of the traversal (on the diagonal of the matrix), have to wait until the previous one at its deeper level is expanded (neighboring load point or fuse). For this, it is strictly necessary to store four data items for the node to expand:

Figure 3: Intelligence matrix corresponding to the BS.
Figure 4: Searching the primary route for LP1 (node 16).
Figure 5: Matrix of primary routes (MPR) for the BS.

• The preceding node, required in case there is a switching or protection device at the arrival of the node; these elements can imply the end of the route search or a change from repairing time to switching time.
• The node to expand, which indicates in which row of the intelligence matrix to look for the elements that form the secondary route, and whether there are more elements that will need to be expanded further ahead.
• The node towards which the agent travels, which gives the direction in which the search takes place.
• The condition in which it is found, considering the time of repairing or switching, an essential condition for the calculation of the unavailability.

These data are stored in a matrix of 4 rows and an initially undetermined number of columns, filled during the traversal. This matrix is named "extend" and determines the end of the searching process, when the last element in delay has been expanded. The simplified agent to find the secondary routes is summarized in Figure 7. This agent is executed for each primary route, i.e., for each load point it finds all the corresponding secondary routes. It also conditions on switching time the rest of the route in case a disconnect (S) is found, and ends the route when a fuse (F) is found. The end of this agent corresponds to the expansion of the last secondary route (on the primary route), when there are no nodes in delay waiting to be expanded.

Figure 6: Simplified agent to find the primary route.

2.3 Rate of Failure and Unavailability

For a load point it is possible to calculate its rate of failure and its unavailability by adding those that correspond to the primary and secondary routes. The outage time is the quotient between the total unavailability of the load point and its corresponding rate of failure:

rate of failure: λ(LP) = λ_PR(LP) + λ_SR(LP)   (1)
unavailability: U(LP) = U_PR(LP) + U_SR(LP)   (2)
outage time: r(LP) = U(LP) / λ(LP)   (3)

In the primary route, the information stored in the MPR is checked.
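Equations (1)-(3) can be sketched as follows; the per-element failure rates and times are illustrative, and the helper name is hypothetical (not from the paper):

```python
def load_point_indices(primary_elems, secondary_elems):
    """Reliability indices for one load point, per equations (1)-(3).
    primary_elems: (failure_rate, repair_time) pairs on the primary route.
    secondary_elems: (failure_rate, outage_time) pairs, where outage_time is
    the repair time or, beyond a disconnect, the switching time.
    Returns (lambda_LP, U_LP, r_LP)."""
    lam_pr = sum(l for l, _ in primary_elems)
    u_pr = sum(l * t for l, t in primary_elems)
    lam_sr = sum(l for l, _ in secondary_elems)
    u_sr = sum(l * t for l, t in secondary_elems)
    lam = lam_pr + lam_sr        # (1) failure rate
    u = u_pr + u_sr              # (2) unavailability
    return lam, u, u / lam       # (3) outage time r = U / lambda

lam, u, r = load_point_indices([(0.1, 4.0)], [(0.2, 1.0)])
print(lam, u, r)
```

Each element contributes λ_i to the rate and λ_i · t_i to the unavailability, so the outage time is a failure-rate-weighted average of the individual outage durations.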
A rate of failure is assigned to each element; this information has been entered in a data-entry routine parallel to the IM. The sum of the individual failure rates provides the rate of failure of the primary route for the load point, λpr(LP). The unavailability of the primary route is the sum of the individual unavailabilities of the elements involved in it, without considering the presence of the switching devices. In the secondary routes, while the search is made, the individual failure rates of the elements in each route are added, and at the same time the unavailability of the secondary routes is calculated. At the beginning, the unavailability is related to the repair time of the element; when a switching device is found, the unavailability becomes associated with the switching time, and this is kept from that node to the end of the corresponding secondary route. As the search advances, the individual unavailabilities are added up. At the end of the process the results are the unavailability Usr(LP) and the rate of failure λsr(LP) of the secondary routes per load point. The way the algorithm works is shown in Figure 9 (obtaining the failure mode for LP1). The point where the disconnect switch lies is indicated with a different color; it conditions the use of the switching time for the rest of the route.

2.4 Considering the alternative feeding

A system with alternative feeding diminishes the unavailability times for the customer. Before making any calculation, it is necessary to know where the alternative feeding is. To this end, the "inter-feeding route" that joins the main feeding with the alternative feeding is defined. The corresponding agent is similar to the "resolutor agent" that finds the primary route (below the diagonal of the matrix), but instead of beginning at the load point (LP), it starts its nodal search where the alternative feeding is.
This route is stored in a vector called "ralt". In the first place, the inter-feeding route is located. Then it is compared with the primary route of each load point, establishing the intersection point between them. Next, the segment between the main feeding and the intersection point is tracked, where the agent has to determine whether there is a switching device in this sector. If there is none in this route, the calculation is made in the traditional way, as if the alternative feeding had no effect. If there is a switching device between the main feeding and the intersection point of the routes, then all the elements found between the switching device and the main feeding are conditioned, in case of a possible fault, to consider only the switching time, since that sector is isolated by connecting the alternative supply. This lightens the search, since the agent skips looking for other switching devices in that sector when assigning times. The rest of the system is analyzed as if there were no alternative feeding. Considering the BS with alternative feeding at node 11, nodes 1, 2, 3, 4 and 11 form the inter-feeding route for that configuration. For LP3 (node 15), the intersection of the primary route, the inter-feeding route and the switching device at [3,2] are shown in Figure 11; the existing switching device conditions a great part of the system to a switching time. The same example is considered for evaluating LP1 (node 16). The primary route of this load point intersects the inter-feeding route only at node 2, and there is no switching device between node 2 and the main feeding, so the alternative supply does not influence the index calculation.

Figure 7: Simplified agent finding the secondary routes

3 Applications

The algorithm was applied to several distribution systems.
In this document four of them are presented: the basic system (BS), two RBTS feeders (Billinton and Kumar, 1989; Allan et al., 1991) and a real system.

3.1 Basic system

This system was created to satisfy the need for a small distribution system (DS) that still had the complexity and diversity of a real one. The system has 5 load points and a primary feeding at node 1. The elements of a conventional DS are also present, such as lines, cables, fuses, transformers, switching devices, and so on. The failure modes and the switching and repair times used correspond to those indicated in IEEE Std. 493 (1990). The results are indicated in Table 1.

Figure 8: Failure mode of LP1 (node 16)

Figure 9: Inter-feeding route; alternative supply at node 11

3.2 RBTS system (Roy Billinton Test System)

This is a test system that is divided into 6 buses. Bus 1 is the feeding of the system, whereas buses 2 to 6 are the load buses. The system works at different voltage levels: 220, 138, 33 and 11 kV. For the purpose of testing the algorithm, the reliability of feeder 1 of buses 2 and 4 is evaluated, corresponding to the 11 kV level. Table 2 gives the resulting reliability indexes.

3.3 The CGE feeder

This feeder is part of the CGE distribution system. It has 70 elements and 33 load points. For its evaluation, the same considerations as for the experimental feeder are applied. Three evaluations of the system were made. Case 1: future reliability evaluation. Case 2: future reliability evaluation considering the alternative feeding at node 27. Case 3: future reliability evaluation considering the alternative feeding at node 55. Table 3 shows the results of the reliability evaluation for the three cases.

Figure 10: Switching zone and LP3 alternative supplies

Table 1: Reliability indexes of the basic system
(NERC: National Electric Reliability Council; CIER: Regional Electric Inter-American Council)

                          Main feeding only   Alt. node 5   Alt. node 11
NERC   SAIFI              0.3963              0.3963        0.3963
       SAIDI              4.4847              4.0171        4.1588
       CAIDI              11.3159             10.1359       10.4935
       ENS (MWh/year)     21.396              19.102        19.759
CIER   F                  0.3964              0.3964        0.3964
       T                  4.4576              3.9616        4.1166
       D                  11.2442             9.9931        10.3841
       END                21.396              19.102        19.759

4 Analysis of results

The future reliability indexes for each system were obtained as presented by NERC, i.e., those associated with the customer. The indexes corresponding to the CIER, referring to the power (kVA), were also calculated. The latter are of great importance because they are the ones authorized by the CNE (National Energy Council) in Chile through the general regulation of the electric service law (1980). Table 1 shows the difference between the alternative feeding at some of the BS's nodes with respect to the same system with only a main feeding. An important improvement in the reliability indexes is observed when the alternative feeding is considered. If the analysis is centered on one of the indexes, such as the average fault time, applying the alternative feeding at node 5 yields a time reduction of 11%; applying it at node 11, the reduction in fault time was around 7.6%. This is justified because node 5 is at the feeder's largest load point and there is a switching device immediately before the connection of the new feeding. The RBTS system was useful for confirming the good behavior of the proposed algorithm. This system, though small, was very easy to follow with a hand calculator, in addition to the complete results already given in Brown et al. (1997).
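As a numeric cross-check, CAIDI is by definition SAIDI/SAIFI, and the average-fault-time reductions quoted above (about 11% for node 5 and 7.6% for node 11) follow from the CIER T row of Table 1. A quick verification in Python:

```python
# NERC rows of Table 1: main feeding only, alt. feeding in node 5, in node 11
saifi = [0.3963, 0.3963, 0.3963]
saidi = [4.4847, 4.0171, 4.1588]
caidi = [11.3159, 10.1359, 10.4935]
for f, d, c in zip(saifi, saidi, caidi):
    assert abs(d / f - c) < 0.01   # CAIDI = SAIDI / SAIFI, up to rounding

# CIER T row: the quoted average-fault-time reductions
t = [4.4576, 3.9616, 4.1166]
assert abs((t[0] - t[1]) / t[0] - 0.11) < 0.005    # node 5: about 11%
assert abs((t[0] - t[2]) / t[0] - 0.076) < 0.005   # node 11: about 7.6%
```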
Regarding the results (Table 2), the most important point is the effect produced on the reliability by connecting the client to the main feeding with one disconnect switch (bus 2) or with two disconnect switches isolating the client on both sides (bus 4). This difference is only meaningful because both bus feeders consider an alternative feeding.

Table 2: Reliability indexes of the RBTS systems

                   RBTS 2.1   RBTS 4.1
NERC   SAIFI       0.2488     0.3021
       SAIDI       3.6193     3.4694
       CAIDI       14.5443    11.4849
       ENS         13.155     12.196
CIER   F           0.2477     0.3031
       T           3.6090     3.4746
       D           14.5696    11.4630
       END         13.155     12.196

With this specific configuration and under similar conditions, the best reliability is obtained in the bus 4 feeder. This is clear since, with the possibility of having an alternative feeding from both ends, it is most advisable to keep the client with the switching option. This means that, facing a possible fault, the client can be isolated from the end where the fault is produced and can stay connected to the other end where the feeding remains, therefore diminishing the unavailability times and improving the reliability of the system. For the CGE feeder, just like in the previous cases, good results were obtained, within the established limits for F and T (Ministry of Mining, 1980). In Table 3 it is possible to see that the unavailability index decreases when the alternative feeding is considered. For the alternative feeding at node 27, the algorithm obtained a decrease of 15.6% in the unavailability, whereas if the alternative feeding is at node 55, the decrease in the unavailability is 24.5%. The difference between these values is directly related to the number of existing switching devices. All results were checked with theoretical or manual analysis to assure the correct execution of the developed algorithm.
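The quoted unavailability decreases (15.6% for node 27 and 24.5% for node 55) can likewise be reproduced from the CIER T row of Table 3:

```python
# CIER T of Table 3: case 1 (no alt.), case 2 (alt. node 27), case 3 (alt. node 55)
t = [3.2547, 2.7458, 2.4573]
dec27 = (t[0] - t[1]) / t[0]
dec55 = (t[0] - t[2]) / t[0]
assert abs(dec27 - 0.156) < 0.001   # 15.6% decrease with alt. feeding at node 27
assert abs(dec55 - 0.245) < 0.001   # 24.5% decrease with alt. feeding at node 55
```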
Table 3: Reliability indexes of the CGE system

                   Case 1    Case 2    Case 3
NERC   SAIFI       0.3222    0.3222    0.3222
       SAIDI       2.8079    2.3575    2.1948
       CAIDI       8.7139    7.3162    6.8112
       ENS         12.410    10.470    9.3700
CIER   F           0.3198    0.3198    0.3198
       T           3.2547    2.7458    2.4573
       D           10.176    8.5849    7.6830
       END         12.410    10.470    9.3700

5 Conclusions

This paper deals with the problem of the future reliability of a distribution system through AI. The solution relies on the "searching" process, which belongs to the field of AI. The approach presented in this paper is a powerful tool for distribution system reliability prediction. Its effectiveness was demonstrated in the many evaluations made on systems with completely different characteristics. As expected, the results obtained were consistent with the theoretical and practical considerations of the problem. The model proposes the concepts of "intelligence matrix" and "agent". A very remarkable aspect of the matrix-agent conjunction is the ease with which it deals with the protection elements and switching devices, valuing the importance of the strategic location of these elements. Furthermore, the "intelligence matrix" provides the conditions for the "agent" to work efficiently within the topological search: its connections allow the agent to run efficiently and rapidly to complete the layout of the routes involved in the distribution reliability calculation. This point is the key to the success of the failure-mode search tree. A second significant contribution corresponds to the "inter-feeding" route concept, which strengthens the model when the option of alternative feeding is considered. With its help, a decrease in unnecessary expense of time and computer memory was obtained. This, in turn, becomes a potential tool for evaluating the ideal positioning of an alternative feeding point in a real distribution system.
From a more general perspective, we can conclude that the use of this model prevents significant economic losses that electric companies could incur when lacking a suitable control. Finally, the development of this research revealed the need to study in more depth the ordering of the switching and protection devices, which can lead to the best reliability of the system.

References

[1] Asgarpoor S. and M. Mathine, Reliability evaluation of distribution systems with non-exponential down times, IEEE Transactions on Power Systems, Vol. 12, No. 2, 1997.
[2] Allan R., R. Billinton, I. Sjarief, L. Goel and K. S. So, A reliability test system for educational purposes - basic distribution system data and results, IEEE Transactions on Power Systems, Vol. 6, No. 2, 1991.
[3] Billinton R. and Allan R., Reliability Assessment of Large Electric Power Systems, Kluwer Academic Publishers, Boston, USA, 1988.
[4] Billinton R. and S. Kumar, A reliability test system for educational purposes - basic data, IEEE Transactions on Power Systems, Vol. 4, No. 3, 1989.
[5] Billinton R. and S. Jonnavithula, A test system for teaching overall power system reliability assessment, IEEE Transactions on Power Systems, Vol. 11, No. 4, 1996.
[6] Billinton R. and P. Wang, Teaching distribution system reliability evaluation using Monte Carlo simulation, IEEE Transactions on Power Systems, Vol. 14, No. 2, 1999.
[7] Brown R., S. Gupta, R. D. Christie, S. S. Venkata and R. Fletcher, Distribution system reliability assessment using hierarchical Markov modeling, IEEE Transactions on Power Delivery, Vol. 11, No. 4, 1996.
[8] Brown R., S. Gupta, R. D. Christie, S. S. Venkata and R. Fletcher, Distribution system reliability assessment: momentary interruptions and storms, IEEE Transactions on Power Delivery, Vol. 12, No. 4, 1997.
[9] Brown R. and J. Ochoa, Distribution system reliability: default data and model validation, IEEE Transactions on Power Systems, Vol. 13, No. 2, 1998.
[10] IEEE Std 493, IEEE Recommended Practice for the Design of Reliable Industrial and Commercial Power Systems, 1990.
[11] López E., Tardon C., Contreras V., Reliability evaluation of primary distribution systems via search tree, Proc. of International Congress of ACCA/IFAC Automatic Control Association 2000, Santiago, Chile, 1998.
[12] Ministry of Mining, General Law of Electric Services, Official Newspaper of the Republic of Chile, 1998.
[13] Russell S. and P. Norvig, Artificial Intelligence: A Modern Approach, Prentice Hall & Hispanoamericana S.A., 1996.

E. López, J. Campos, C. Tardon, F. Salgado
University of Concepción, Chile, Department of Electrical Engineering
E-mail: elopez@die.udec.cl

J. Tardon
STS-SAESA-FRONTEL
E-mail: jtardon@saesa.cl

R. López
Supelec, LEP-University of Paris XI
E-mail: rodrigo.lopez@supelec.fr

International Journal of Computers Communications & Control
ISSN 1841-9836, 10(3):298-307, June, 2015.

Threshold Based Best Custodian Routing Protocol for Delay Tolerant Network

Q. Ayub, M. S. Mohd Zahid, S. Rashid, A. Hanan Abdullah

Qaisar Ayub*, Soperi Mohd Zahid, Sulma Rashid, Abdul Hanan Abdullah
Faculty of Computing, Universiti Teknologi Malaysia, UTM Skudai, 81310 Johor, Malaysia
sheikhqaisar@gmail.com, soperi@utm.my, sulmaqaiser@gmail.com, hanan@utm.my
*Corresponding author: sheikhqaisar@gmail.com

Abstract: Delay tolerant network (DTN) is a kind of network in which the source may not be able to establish a stable and uninterrupted path to the destination due to network partitioning, dynamic topology change and frequent disconnections. In order to deal with disruptions and disconnections, a store-carry-forward paradigm is used, in which a node stores incoming messages in its buffer, carries them while moving and forwards them when it comes within the transmission range of other nodes.
Message forwarding plays an important role in increasing message delivery. For instance, probabilistic routing protocols forward a message to a node having a high probability value of meeting the message destination. These protocols cannot handle the situation in which a node continually transmits messages even when the probability difference is very small. In this paper, we propose a routing protocol known as Threshold Based Best Custodian Routing Protocol (TBBCRP) for delay tolerant networks. We propose a threshold-based method to compute the quality value, which is the ability of a node to carry a message. A self-learning mechanism is used to remove delivered messages from the network. Moreover, a buffer-aware mechanism is used that ensures the availability of buffer space at the receiver before message transmission. We have compared the performance of TBBCRP with Epidemic, PRoPHET and delegated forwarding. The proposed TBBCRP outperforms them in terms of maximizing the delivery probability and reducing the number of transmissions and message drops.

Keywords: delay tolerant network, store-carry-forward, routing protocols, algorithms.

1 Introduction

With the advancement in communication technologies [1-2] it is now possible to interconnect mobile nodes and stand-alone computers and provide an innovative way to join social and business communities. Compared with other communication architectures such as LAN and WLAN, mobile ad hoc networks have gained more popularity. In ad hoc network routing protocols [3-6], the source and destination establish an end-to-end path prior to the transmission of data. This prerequisite is impossible in highly disrupted wireless applications such as wildlife monitoring, deep-space communication and military networks. Such environments suffer frequent disconnections, dynamic topology change and network partitioning due to node mobility.
In addition, limited network resources, for instance buffer space, bandwidth, energy and processing power of nodes, make routing a real challenge. Delay tolerant network (DTN) [7] is a kind of network that aims to provide communication via opportunistically connected mobile nodes. A method called store, carry and forward is used, in which nodes store messages in their buffers, carry them while moving and forward them when connected to other nodes. The DTN routing protocols can be classified as single copy and multi copy.

Copyright © 2006-2015 by CCC Publications

In single copy protocols, a unique copy of each message exists in the entire network [8,21]. These protocols are able to operate under limited resources but reduce the delivery ratio and raise the delivery delay. Multi copy routing protocols transmit redundant copies of each message to all connected nodes [9-13]; therefore, a message can reach its destination via multiple intermediate nodes. As a result, multi copy routing protocols minimize the delivery delay and maximize the delivery [14-18]. Multi copy routing protocols are more robust, but unreliable due to the consumption of a high volume of network resources. Probabilistic routing protocols were proposed to reduce resource consumption; they consider node behavior, such as movement pattern and encounter history [12,14,27], before the transmission of a message. A carrier node with a probabilistic protocol continues to forward messages to highly probable relay nodes. This issue was addressed in [19], where Vijay Erramilli et al. proposed a new message forwarding technique called delegated forwarding. In this method, each node maintains a quality metric and forwards a message to another node only if that node has a higher quality metric than any the message has seen. Later, in [20][23-24], the authors presented variations of delegated forwarding.
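The delegated-forwarding rule just described can be sketched as follows; the function and variable names are illustrative, not taken from [19]:

```python
def should_delegate(highest_seen, peer_quality):
    """Delegated-forwarding test: replicate a message to a peer only if
    the peer's quality exceeds the highest quality the message has seen
    so far; the message then remembers the new maximum.

    Returns (forward?, updated highest-quality-seen)."""
    if peer_quality > highest_seen:
        return True, peer_quality      # forward and raise the threshold
    return False, highest_seen         # peer is no better; keep the copy
```

Because the threshold only ever rises, each message is replicated along a chain of strictly improving custodians, which is what limits the number of transmissions.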
The previous works have not defined a method to compute the quality value of nodes. The contributions of this paper are as follows:

• We propose a routing protocol called Threshold Based Best Custodian Routing Protocol for delay tolerant networks (TBBCRP).
• We use a self-learning method to remove previously delivered messages from the network.
• A threshold-based method is used to assign the quality value to network nodes.
• We compare the performance of TBBCRP with Epidemic, PRoPHET and delegated forwarding in terms of minimizing the number of transmissions, number of drops and overhead while raising the delivery probability.

1.1 Review of DTN routing protocols

In dynamic surroundings with intermittent topology change, node mobility and frequent network partitions, the problem is to select a suitable relay node for a message. In addition, scarce network resources, such as limited buffer space, bandwidth and low power of nodes, make data routing even more challenging. Therefore, the design of a good routing protocol must focus on minimizing the consumption of network resources while delivering more messages to their destinations. In flooding-based routing protocols such as the Epidemic protocol [13], each node encounter results in an exchange of messages; the encountering nodes further diffuse the message copies and the process continues. Although in the Epidemic protocol each message may have more than one path to its destination, the protocol consumes a high volume of network resources. Controlling the creation of message copies can reduce the resource requirement. Against this background, quota-based routing protocols emerged, where each node is given the opportunity to transmit n copies of a message, for example Spray and Wait [10], Spray and Focus [11], Binary Spray and Wait and QoN Spray and Wait [26]. The Spray and Wait algorithm consists of a spray phase, where a node spreads n message copies to its neighbors, called relays.
If the destination is not found in the spray phase, each node waits until it contacts the message destination. In the Spray and Wait protocol, message transmission is limited to the neighboring nodes only. This problem was solved in the Binary Spray and Wait protocol, in which a source node, on an encounter, forwards n/2 message copies to the connected node while keeping n/2. In addition, the receiver node is also allowed to distribute its n/2 message copies. This hierarchical forwarding improves the performance of Spray and Wait and increases message delivery. The spraying protocols work well when node movement is independent and identically distributed (IID), which is not the case in real-world scenarios, where each node exhibits its own movement pattern. These challenges motivated researchers to introduce various utility functions into spraying algorithms. For example, the Spray and Focus protocol [11] starts by distributing n/2 message copies like Binary Spray and Wait; however, when a node is left with only one copy of a message, it shifts to the focus phase, where the node forwards the message to a neighbor based on its suitability to meet the destination. The suitability is determined by the time since the two nodes last saw each other. Quality of Node Spray and Wait [26] improves the performance of the Binary Spray and Wait algorithm by introducing QoN (quality of node). The QoN is represented by an integer number that describes how frequently one node encounters another node in a given time interval. The primary objective of quota-based routing protocols is to control the transmission of message copies. Although spraying algorithms have exploited the encounter history of nodes, other factors such as mobility patterns or positional coordinates can improve the routing procedure.
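The n/2 splitting of Binary Spray and Wait described above can be illustrated as follows; the handling of odd copy counts (the carrier keeps the larger half) is an assumption for the sketch:

```python
def binary_spray(copies):
    """One Binary Spray and Wait encounter: hand floor(n/2) copies to the
    peer and keep the rest. With a single copy left, the node enters the
    wait phase and only delivers directly to the destination.

    Returns (copies kept, copies handed to the peer)."""
    if copies <= 1:
        return copies, 0          # wait phase: no more spraying
    give = copies // 2
    return copies - give, give
```

Starting from n copies, the budget halves at each hop, so the spray phase ends after about log2(n) levels of the forwarding tree.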
For example, in [28] the authors design mobility-based spraying strategies: most social first, most mobile first (MMF) and last seen first (LSF). Shen Ling et al. [22] observe node mobility and introduce an influence factor determined by the mobility of the nodes. In [12], Lindgren et al. introduce the PRoPHET protocol, which reduces the number of message transmissions by introducing new metrics called delivery predictability and transitive connectivity. Nodes are allowed to receive a message only if they show a high value of predictability and transitivity. As expected, the PRoPHET protocol minimizes the number of transmissions compared to the Epidemic protocol. A variation of PRoPHET, PROCS [29], introduces a new message forwarding method that observes the movement patterns of nodes and their time sequence. In [19], Vijay Erramilli et al. propose a new message forwarding technique called delegated forwarding, in which each node maintains a quality metric and forwards a message to another node only if it has a higher quality metric than any the message has seen. The protocol works well and controls the transmission of messages at the relay nodes; however, it does not provide any solution to control the transmission of the source messages. In [20] the authors modified the algorithm by updating the source message with the probability value of the high-quality node; in this way, replication is also controlled for source messages. In [23] the authors refine delegated forwarding by designing cost-based drop and transmission methods: messages that are close to their destinations are assigned high priorities by defining a replication count number called the delegation number. Yunsheng Wang et al. [30] proposed single-copy multicast, multi-copy multicast and delegated forwarding multicast algorithms.
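For reference, PRoPHET's delivery predictability updates from Lindgren et al. [12] can be sketched as below; the constants are the defaults suggested in that paper, not values used in this work:

```python
# Defaults suggested in Lindgren et al. [12]
P_INIT, GAMMA, BETA = 0.75, 0.98, 0.25

def on_encounter(p_ab):
    """Direct predictability update when node a meets node b."""
    return p_ab + (1.0 - p_ab) * P_INIT

def age(p_ab, k):
    """Age the predictability over k elapsed time units since the last update."""
    return p_ab * GAMMA ** k

def transitive(p_ac, p_ab, p_bc):
    """Transitive update: a may reach c through the intermediate node b."""
    return p_ac + (1.0 - p_ac) * p_ab * p_bc * BETA
```

A carrier compares these predictabilities on each contact and hands the message over whenever the peer's value for the destination is higher, which is exactly the behavior TBBCRP's threshold method refines.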
1.2 The proposed Threshold Based Best Custodian Routing Protocol

In the proposed TBBCRP, each network node maintains time information about its encounters with other nodes in a vector called the previous encounter vector (PEV). The PEV consists of the node id (NID) and the encounter time (ET). We use the ET to assign the quality value, which describes the likelihood of a future encounter between the same nodes. A high quality value indicates that the node is more likely to deliver the message. The quality information is stored in the quality vector, which consists of the node id and the quality value (QV).

Principle 1. When nodes contact each other, they assign quality values by using upstream and downstream time thresholds:

TD = CurrentTime − ET(Ni). (1)

When two nodes encounter each other for the first time, they initialize the default quality in the quality vector and the current time in the ET field of the PEV. If the nodes have encountered each other previously, the time difference (TD) is computed by subtracting the ET from the current time, using Equation 1, and the threshold streams module is invoked. The threshold stream module updates the quality value for a node by mapping the TD onto the pre-defined collection of threshold queues. Table 1 shows the meaning of the variables used in TBBCRP.

Table 1: Meaning of variables used in TBBCRP

Symbol     Description
Ni, Nj     node i and node j
PEV, ET    previous encounter vector, encounter time
QV         quality value
TD         time difference
QP         quality points
VD         vector delivered

Figure 1: Structure of the threshold queue

Figure 1 shows the structure of the threshold queue, which is divided into upstream time thresholds and downstream time thresholds. The upstream time thresholds define a lower bound of upstream limit (LBUL) and an upper bound of upstream limit (UBUL). The upstream thresholds are used to decrement quality values, while the downstream thresholds are used to increase quality values.
Nodes encountering each other after a large interval of time show a high TD value, and the corresponding quality points (QP) are subtracted from the quality value of the node. When the time difference is above the UBUL, the quality value of the node is reset to zero. The downstream thresholds start after the LBUL and define a lower bound of downstream limit (LBDL); they are used to increment the quality value of nodes. For instance, nodes encountering each other after a small interval of time are expected to encounter each other again, so the TD is mapped and the corresponding quality points (QP) are added to the quality value. When the time difference is lower than the LBDL, the maximum quality points (QP) are assigned to the quality value (QV).

Figure 2: Threshold Based Best Custodian Routing Protocol example

Figure 2 shows the algorithmic flow of the threshold-based method when node X and node Y have established connectivity. Nodes X and Y each maintain a previous encounter vector (PEV) and a quality vector. In step one, X maps TDy, which represents the time elapsed since X last saw Y. In step two, the corresponding quality value QVy is assigned by X. The same steps are followed by node Y.
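The threshold-stream update of Principle 1 can be sketched as follows; the paper fixes no numeric thresholds or quality points, so the constants and the `qp_for` mapping below are illustrative assumptions:

```python
QP_MAX = 10  # illustrative maximum quality points (not fixed by the paper)

def update_quality(qv, td, lbdl, lbul, ubul, qp_for):
    """Threshold-stream update of Principle 1, with TD = current time - ET.

    td >  UBUL : encounter too old -> quality reset to zero
    td >= LBUL : upstream band     -> subtract the mapped quality points
    td <  LBDL : very recent       -> add the maximum quality points
    otherwise  : downstream band   -> add the mapped quality points

    `qp_for(td)` maps a time difference onto its quality points in the
    threshold queue."""
    if td > ubul:
        return 0
    if td >= lbul:
        return max(0, qv - qp_for(td))
    if td < lbdl:
        return qv + QP_MAX
    return qv + qp_for(td)
```

With, say, `lbdl=10`, `lbul=100`, `ubul=1000` and a flat two-point mapping, a node seen 5 time units ago gains the full `QP_MAX`, one seen 500 units ago loses points, and one not seen for 2000 units is reset to zero.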
inspired by immunity based routing protocol, we have defined a de-centralized mechanism to remove the previously delivered messages. principal-2. the algorithm states that when a node deliverers a message to the current connection as a final recipient then it stores the message id in vector delivered (vd) and remove it from the buffer. the tbbcrp , each network node maintains a vector called vector delivered (vd). when a node forwards the message copy to a connection as final recipients it inserts the message id in vd and removes the message from the list of its carried messages. this module is invoked after the threshold computation and before the transmission of messages. figure 3: exchange of previously delivered messages figure 3 show the technical flow of removing delivered messages. accordingly, on encounterthreshold based best custodian routing protocol for delay tolerant network 303 ing, ni forward vdi [16-17] to nj. vdi hold ids of delivered messages known by ni. nj subtracts vdi from vdj to get vdrequired that holds list of delivered messages not known by nj and send it to ni by using eq. (2). v drequired = (v di − v dj) (2) ni computes v dremove by intersecting vdrequired from vdi and send it to nj by using eq. (3). v dremove = (v drequired ∩ v di) (3) finally, nj removes the v dremove messages from buffer and update vdj by using eq. (4). v dj = v dremove ∪ v dj (4) 1.4 control on message replication by buffer space the dtn message consists of message header and payload header. the payload header contains the actual contents of message. the message header is collection of control information such as message identification, hop count, and time to live. the dtn node utilizes control information to forward and drop messages. in tbbcrp, we have included a new data field in message header called as recent quality (rq). the rq is an integer value initialized with zero for the messages generated by the source. 
Principle 3. When a transmitter forwards a message copy, it updates the RQ of the message with the quality value of the receiver, while the receiver updates the RQ of the message to its own quality value.

Principle 3 concerns the implementation of delegated forwarding in the message header; we used the same concept in DF++ [24]. Briefly, after transmitting a message copy, the sender node updates the RQ in the message header to the quality value of the node that receives the message. Similarly, the receiving node updates the message RQ to its own quality value. The idea is that neither the receiver nor the transmitter will replicate the message until an encountered node has a higher quality value than the RQ.

Principle 4. The transmitter forwards the message only if the QV of the receiver to meet the message destination is greater than the RQ and the available buffer space at the receiver is able to store the message.

High-quality nodes are likely to encounter the message destination. However, forwarding a message to a congested high-quality node may degrade network performance, since the congested node will drop previously stored messages to accommodate the new one. In our previous work, DF++ [24] and CFBARP [25], an adaptive mechanism was defined to deal with buffer space. Hence, a node forwards a message only if the quality value of the receiver is high and the available buffer can accommodate the incoming message. After transmission, the transmitter subtracts the message size from the available buffer.

2 Simulation and results

This section provides a performance analysis of the existing and proposed routing protocols, in terms of minimizing message transmissions and message drops and raising the delivery probability, using the ONE simulator [31]. ONE is an event-driven simulator written in Java that has been designed to evaluate DTN applications.
The realism of the simulation has been increased by using a city-based environment consisting of pedestrians, cars, and trains. The pedestrians were divided into two groups of 40 members each. The pedestrians move according to the shortest-path map-based movement model at speeds between 0.5 km/h and 1.5 km/h; each pedestrian carries a mobile device with a 2 MB buffer. The transmission range of the mobile nodes is 10 meters. The 40 cars move via map-route movement at speeds between 10 km/h and 50 km/h. Finally, six trains move via map-route movement at speeds between 7 km/h and 10 km/h. Message sizes are drawn randomly from the range 100 kB-300 kB, and the inter-message creation interval is 25 s-35 s. The bandwidth is equally distributed at 2 Mbps.

Figure 4: Transmissions by varying number of nodes.

Figure 4 compares the existing PRoPHET, Epidemic, and Delegation Forwarding protocols with the proposed TBBCRP routing protocol in terms of the number of message transmissions. The flooding-based Epidemic protocol shows the highest number of transmissions. Delegation Forwarding controls message diffusion better than Epidemic but still forwards far more messages than PRoPHET and TBBCRP. Message transmissions grow with the number of nodes, because with more nodes message exchange becomes more frequent. Such traffic could be sustained under infinite buffer space; in the present environment, however, the buffer is a limited resource, so a better-quality node that receives a message while having no free space mechanically triggers a drop event. Further, because each message exists in multiple copies, the same high-quality node may repeatedly receive the dropped messages, causing high transmission counts, message drops, and wasted node energy. The proposed TBBCRP routing protocol reduces the number of message transmissions.
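The scenario above can be summarized as a configuration sketch. The values are transcribed from the text; the key names are our own, not ONE simulator settings. Note that 40 + 40 pedestrians, 40 cars, and 6 trains give 126 nodes, which matches the smallest node count reported in the result figures.

```python
# Illustrative summary of the simulated city scenario (names are ours).
scenario = {
    "pedestrians": {"count": 80, "speed_kmh": (0.5, 1.5),
                    "movement": "ShortestPathMapBased",
                    "buffer_bytes": 2 * 1024 * 1024, "range_m": 10},
    "cars":   {"count": 40, "speed_kmh": (10, 50), "movement": "MapRoute"},
    "trains": {"count": 6,  "speed_kmh": (7, 10),  "movement": "MapRoute"},
    "message_size_bytes": (100_000, 300_000),
    "creation_interval_s": (25, 35),
    "bandwidth_mbps": 2,
}

# Base node population of the scenario
total_nodes = sum(scenario[g]["count"] for g in ("pedestrians", "cars", "trains"))
```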
Figure 5: Messages dropped by varying number of nodes.

Figure 5 depicts the message-drop results as the number of nodes increases. Increasing the number of nodes raises the number of dropped messages, because buffer space is finite and nodes cannot accommodate all incoming messages. For instance, at higher node counts such as 186, 216, and 246, even protocols like PRoPHET and Delegation Forwarding drop a large number of messages: when the encounter rate among nodes is high, multiple nodes become highly likely to receive each message. The proposed TBBCRP shows a consistent stance across all network traffic levels.

Figure 6: Delivery by varying number of nodes.

Figure 6 plots the message delivery probability of the existing and proposed routing protocols as the number of nodes increases. It can be observed that at lower node counts, such as 126 and 156, protocols such as PRoPHET, Epidemic, and Delegation Forwarding deliver more messages. Nevertheless, as the number of nodes grows to 186, 216, and 246, fewer messages find their destinations, because messages are dropped before reaching them.

3 Conclusion

In this paper we have proposed a routing protocol for delay-tolerant networks called the Threshold-Based Best Custodian Routing Protocol (TBBCRP). A threshold-based method has been proposed to compute the quality value of nodes, that is, their ability to carry a message. Moreover, a self-learning method has been used to remove previously delivered messages from the network. The proposed protocol outperforms existing strategies in terms of maximizing the delivery probability and reducing the number of transmissions and message drops.

Acknowledgments

This work is financed by an institutional scholarship provided by UTM and the Ministry of Higher Education of Malaysia.
Bibliography

[1] Ariyavisitakul, S. L. (2000); Turbo space-time processing to improve wireless channel capacity, IEEE Transactions on Communications, 48(8): 1347-1359.
[2] Lim, Y., Kim, J., Min, S. L., Ma, J. S. (2001); Performance evaluation of the Bluetooth-based public Internet access point, Proceedings of the 15th International Conference on Information Networking, 643-648.
[3] Latiff, L. A., Fisal, N. (2003); Routing protocols in wireless mobile ad hoc network: a review, The 9th Asia-Pacific Conference on Communications (APCC 2003), 600-604.
[4] Murthy, S., Garcia-Luna-Aceves, J. J. (1996); An efficient routing protocol for wireless networks, Mobile Networks and Applications, 1(2): 183-197.
[5] Perkins, C. E., Royer, E. M. (1999); Ad-hoc on-demand distance vector routing, IEEE WMCSA 99, 90-100.
[6] Johnson, D. B., Maltz, D. A. (1996); Dynamic source routing in ad hoc wireless networks, Mobile Computing, 153-181.
[7] Fall, K. (2003); A delay-tolerant network architecture for challenged internets, SIGCOMM 03: Proceedings of the 2003 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications, ACM, New York, NY, USA, 27-34.
[8] Spyropoulos, T., Psounis, K., Raghavendra, C. S. (2004); Single-copy routing in intermittently connected mobile networks, Proc. IEEE Conf. on Sensor and Ad Hoc Communications and Networks (SECON), 235-244.
[9] Ramanathan, R., Hansen, R., Basu, P., Rosales-Hain, R., Krishnan, R. (2007); Prioritized epidemic routing for opportunistic networks, Proc. of the 1st International MobiSys Workshop on Mobile Opportunistic Networking, ACM, 62-66.
[10] Spyropoulos, T., Psounis, K., Raghavendra, C. (2005); Spray and wait: an efficient routing scheme for intermittently connected mobile networks, Proceedings of the 2005 ACM SIGCOMM Workshop on Delay-Tolerant Networking, ACM, 252-259.
[11] Spyropoulos, T., Psounis, K., Raghavendra, C. S. (2007); Spray and focus: efficient mobility-assisted routing for heterogeneous and correlated mobility, Fifth Annual IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops 07), IEEE, 79-85.
[12] Lindgren, A., Doria, A., Schelen, O. (2004); Probabilistic routing in intermittently connected networks, Service Assurance with Partial and Intermittent Resources, Springer, 239-254.
[13] Vahdat, A., Becker, D. et al. (2000); Epidemic routing for partially connected ad hoc networks, Technical Report CS-200006, Duke University.
[14] de Oliveira, E. C., de Albuquerque, C. V. (2009); NECTAR: a DTN routing protocol based on neighborhood contact history, Proceedings of the 2009 ACM Symposium on Applied Computing, ACM, 40-46.
[15] Bulut, E., Geyik, S. C., Szymanski, B. K. (2010); Conditional shortest path routing in delay tolerant networks, IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM), IEEE, 1-6.
[16] Srinivasa, S., Krishnamurthy, S. (2009); CREST: an opportunistic forwarding protocol based on conditional residual time, 6th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON 09), IEEE, 1-9.
[17] Hua, D., Du, X., Qian, Y., Yan, S. (2009); A DTN routing protocol based on hierarchy forwarding and cluster control, International Conference on Computational Intelligence and Security (CIS 09), IEEE, 2: 397-401.
[18] Wang, G., Wang, B., Gao, Y. (2010); Dynamic spray and wait routing algorithm with quality of node in delay tolerant network, International Conference on Communications and Mobile Computing (CMC), IEEE, 3: 452-456.
[19] Erramilli, V., et al. (2008); Delegation forwarding, ACM MobiHoc 08, 251-260.
[20] Chen, X., et al. (2009); Probability delegation forwarding in delay tolerant networks, IEEE ICCCN 09, 1-6.
[21] Spyropoulos, T., Psounis, K., Raghavendra, C. S. (2007); Utility-based message replication for intermittently connected heterogeneous wireless networks, Proc. of IEEE WoWMoM Workshop on Autonomic and Opportunistic Communications (AOC), INRIA Technical Report RR-6129, June 2007.
[22] Ling, S., Wei, W. (2009); Feedback adaptive routing algorithm for DTN, WRI International Conference on Communications and Mobile Computing (CMC 09), IEEE, 2: 267-271.
[23] Liu, C., Wu, J. (2009); An optimal probabilistic forwarding protocol in delay tolerant networks, ACM MobiHoc 09, 105-114.
[24] Ayub, Q., Zahid, M. S. M., Rashid, S., Abdullah, A. H. (2013); DF++: an adaptive buffer-aware probabilistic delegation forwarding protocol for delay tolerant network, Cluster Computing, 1-8.
[25] Ayub, Q., Zahid, M. S. M., Abdullah, A. H., Rashid, S. (2013); Connection frequency buffer aware routing protocol for delay tolerant network, Journal of Electrical Engineering and Technology, 8(3): 649-657.
[26] Wang, G., Wang, B., Gao, Y. (2010); Dynamic spray and wait routing algorithm with quality of node in delay tolerant network, International Conference on Communications and Mobile Computing (CMC), IEEE, 3: 452-456.
[27] Nelson, S. C., Bakht, M., Kravets, R. (2009); Encounter-based routing in DTNs, IEEE INFOCOM 2009, IEEE, 846-854.
[28] Elwhishi, A., Ho, P.-H., Naik, S., Shihada, B. (2011); Contention aware routing for intermittently connected mobile networks, AFIN 2011, The Third International Conference on Advances in Future Internet, 8-15.
[29] Jathar, R., Gupta, A. (2010); Probabilistic routing using contact sequencing in delay tolerant networks, 2010 Second International Conference on Communication Systems and Networks (COMSNETS), IEEE, 1-10.
[30] Wang, Y., Li, X., Wu, J.
(2010); Multicasting in delay tolerant networks: delegation forwarding, Global Telecommunications Conference (GLOBECOM 2010), IEEE, 1-5.
[31] Keranen, A., Ott, J., Karkkainen, T. (2009); The ONE simulator for DTN protocol evaluation, Proc. of the 2nd International Conference on Simulation Tools and Techniques, ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering), 1-10.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 2, pp. 135-148

Feedback Gain Design Method for the Full-Order Flux Observer in Sensorless Control of Induction Motor

Abderrahmane Bouhenna, Abdellah Mansouri, Mohammed Chenafa, Abdelkader Belaidi

Abstract: This paper deals with a feedback gain design method for the full-order flux observer with an adaptive speed loop, which reduces the unstable operation region of this observer to a line in the torque-speed plane. The stability in regenerating mode is studied using a necessary condition of stability based on the determinant of a matrix and a linearized model. Simulation results in which the proposed observer is compared with an existing solution (where the unstable region is not totally removed) are presented to validate the proposed observer design.

Keywords: induction motor, full-order flux observer, sensorless control, stability analysis, adaptive speed estimator, regenerating mode

1 Introduction

Speed-sensorless control of induction motor drives has developed significantly in recent years. The speed adaptive full-order observers introduced in [8], [15] are promising flux estimators for induction motor drives. The speed adaptive observer consists of a state variable observer augmented with a speed adaptation loop. The observer gain and the speed adaptation law determine the properties of the observer. The speed adaptation law is based on the cross product of the current estimation error and the estimated rotor flux.
The adaptation law was originally derived using Lyapunov stability theory [8]. However, the stability of the adaptation law is not guaranteed, and stability problems exist in the regenerating mode. The derivation in [8] neglects a term including the actual rotor flux (which is not measurable), and the positive-realness condition is not satisfied, as shown in [5]. Some limits of operation were quickly highlighted [9], [13]. In particular, a well-known instability region was described in regenerating mode; thus, drive stability cannot be guaranteed when this type of observer is associated with field-oriented control. Much work has been done to reduce this region of instability, which is due to inadequate observer design [1, 2, 5, 14]. In this paper, we describe the design of an adaptation law that minimizes the instability region of an adaptive speed estimator.

The paper is organized as follows. The induction motor model and the speed adaptive flux observer are defined in Sections 2 and 3, respectively. We introduce the observer gain design in Section 4, leading to a reduced instability region limited to a line. Finally, simulation results are presented and discussed in Section 5, where the proposed observer is compared with an existing solution [5, 13].

2 Induction motor model

The induction motor is described by the following state equations in the synchronously rotating reference frame, with complex notation:

$$\frac{d}{dt}x = A(\omega,\omega_s)\,x + B u_s \qquad (1)$$
$$i_s = C x \qquad (2)$$

where $x = [\psi_r \;\; i_s]^T$ and

$$A=\begin{bmatrix} -\left(\dfrac{1}{T_r}+j\omega_{sl}\right) & \dfrac{L_m}{T_r} \\[2mm] \dfrac{L_m}{b}\left(\dfrac{1}{T_r}-j\omega\right) & -(a+j\omega_s) \end{bmatrix},\quad B=\begin{bmatrix} 0 \\ \dfrac{1}{\sigma L_s} \end{bmatrix},\quad C=\begin{bmatrix} 0_{(2\times2)} & I \end{bmatrix},\quad I=\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} \qquad (3)$$

Copyright © 2006-2008 by CCC Publications

The mechanical equation is:

$$\frac{d}{dt}\omega = \frac{p^2 L_m}{J L_r}\,\Im\!\left(i_s \psi_r^{*}\right) - \frac{p\,T_L}{J} \qquad (4)$$

where $^*$ denotes the conjugate, $j$ the imaginary unit and $\Im$ the imaginary part.
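As a numerical companion to Eqs. (1)-(3), the complex state matrix can be evaluated directly. This sketch uses the motor parameters listed in the paper's appendix; the operating point chosen is illustrative, not from the paper.

```python
# Motor parameters (from the paper's appendix)
Ls, Lr, Lm, Rs, Rr = 0.472, 0.4721, 0.4475, 9.65, 4.3
sigma = 1 - Lm**2 / (Ls * Lr)                     # leakage coefficient
a = (Lr**2 * Rs + Lm**2 * Rr) / (sigma * Ls * Lr**2)
b = sigma * Ls * Lr
Tr = Lr / Rr                                      # rotor time constant

def state_matrix(w, ws):
    """Complex 2x2 matrix A(w, ws) of Eq. (3); states are (psi_r, i_s)."""
    w_sl = ws - w                                 # slip angular frequency
    return [[-(1/Tr + 1j*w_sl),                 Lm/Tr],
            [(Lm/b)*(1/Tr - 1j*w),  -(a + 1j*ws)]]

A = state_matrix(w=100.0, ws=100.5)
```

Using plain nested lists of Python complex numbers keeps the sketch dependency-free; a real implementation would use a 4x4 real matrix or a complex array library.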
Notation: $\psi_r$: rotor flux; $i_s$: stator current; $u_s$: stator voltage; $\omega_s$: stator angular frequency; $\omega$: motor angular speed; $\omega_{sl} = \omega_s - \omega$: slip angular frequency; $R_s, R_r$: stator and rotor resistances; $L_s, L_r$: stator and rotor self-inductances; $L_m$: mutual inductance; $T_L$: load torque; $J$: rotor inertia; $p$: number of pole pairs; $T_r = L_r/R_r$: rotor time constant; $a = (L_r^2 R_s + L_m^2 R_r)/(\sigma L_s L_r^2)$; $b = \sigma L_s L_r$; $\sigma = 1 - L_m^2/(L_s L_r)$: leakage coefficient.

3 Adaptive observer

The conventional full-order observer, which estimates the stator current and the rotor flux together [10, 11], is written as the following state equation:

$$\frac{d}{dt}\hat{x} = \hat{A}(\hat\omega,\hat\omega_s)\,\hat{x} + B u_s + G\,(i_s - \hat{i}_s) \qquad (5)$$
$$\hat{i}_s = C\hat{x} \qquad (6)$$

where $\hat{\ }$ denotes estimated values and $G = [g_1 \;\; g_2]^T$ is the observer gain matrix. We assume that all machine parameters are perfectly known except the motor speed. Under the assumption of constant angular rotor speed, $\dot\omega = 0$ (i.e. the speed variations are slow with respect to the electrical modes) [8], [5], the speed adaptation law is [8]:

$$\frac{d}{dt}\hat\omega = \lambda\,\frac{L_m}{b}\,(e_{id}\hat\psi_{rq} - e_{iq}\hat\psi_{rd}) \qquad (7)$$

where $\lambda$ is a positive constant, tuned in (7) to improve the observer dynamics. In practice, a proportional-integral action is used in order to improve the dynamic behavior of the estimator:

$$\frac{d}{dt}\hat\omega = k_p\frac{d}{dt}(e_{id}\hat\psi_{rq} - e_{iq}\hat\psi_{rd}) + k_i\,(e_{id}\hat\psi_{rq} - e_{iq}\hat\psi_{rd}) \qquad (8)$$

where $e_{id} = i_{sd} - \hat{i}_{sd}$, $e_{iq} = i_{sq} - \hat{i}_{sq}$, $(i_{sd}, i_{sq})$ are the $(d,q)$ components of the stator current, and $(\psi_{rd}, \psi_{rq})$ are the $(d,q)$ components of the rotor flux. The speed adaptive observer scheme with the speed adaptation mechanism is presented in Fig. 1.

4 Observer gain design

4.1 Linearized model

The nonlinear and complicated dynamics of the speed adaptive observer can be studied via small-signal linearization. It is useful to proceed with a local analysis based on the principle of stability in the first approximation [12, 7]. We choose the particular form $g_1 = g_1 I_{2\times2}$, where $I_{2\times2}$ is the identity matrix, and $g_2 = 0_{2\times2}$. The complete adaptive observer may then be written as equation (10).
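As an implementation aside, the PI adaptation law (8) can be realized in discrete time with a forward-Euler step, keeping the previous value of the adaptation signal to approximate its derivative. The naming and the sample time `dt` below are our own assumptions, not from the paper.

```python
def speed_adaptation_step(e_id, e_iq, psi_rd_hat, psi_rq_hat,
                          w_hat, eps_prev, kp, ki, dt):
    """One sampling period of the PI law (8).

    Returns the updated speed estimate and the adaptation signal,
    which the caller feeds back as eps_prev on the next step.
    """
    # Adaptation signal: cross product of the current estimation error
    # and the estimated rotor flux (d,q components)
    eps = e_id * psi_rq_hat - e_iq * psi_rd_hat
    # kp * d(eps)/dt integrated over dt ~ kp * (eps - eps_prev);
    # the ki term is integrated by a simple Euler step
    w_hat = w_hat + kp * (eps - eps_prev) + ki * eps * dt
    return w_hat, eps
```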
Note that, under the assumption $\dot\omega = 0$, the motor model (1) may be written as (9):

$$\begin{cases} \dfrac{d}{dt}\psi_r = -\left(\dfrac{1}{T_r}+j\omega_{sl}\right)\psi_r + \dfrac{L_m}{T_r}\,i_s \\[2mm] \dfrac{d}{dt}i_s = \dfrac{L_m}{b}\left(\dfrac{1}{T_r}-j\omega\right)\psi_r - (a+j\omega_s)\,i_s + \dfrac{1}{\sigma L_s}\,u_s \\[2mm] \dfrac{d}{dt}\omega = 0 \end{cases} \qquad (9)$$

Figure 1: Speed adaptive observer.

With $e_i = i_s - \hat{i}_s$, the complete adaptive observer is:

$$\begin{cases} \dfrac{d}{dt}\hat\psi_r = -\left(\dfrac{1}{T_r}+j\hat\omega_{sl}\right)\hat\psi_r + \dfrac{L_m}{T_r}\,\hat{i}_s + g_1 e_i \\[2mm] \dfrac{d}{dt}\hat{i}_s = \dfrac{L_m}{b}\left(\dfrac{1}{T_r}-j\hat\omega\right)\hat\psi_r - (a+j\hat\omega_s)\,\hat{i}_s + \dfrac{1}{\sigma L_s}\,u_s \\[2mm] \dfrac{d}{dt}\hat\omega = k_p\dfrac{d}{dt}(e_{id}\hat\psi_{rq}-e_{iq}\hat\psi_{rd}) + k_i\,(e_{id}\hat\psi_{rq}-e_{iq}\hat\psi_{rd}) \end{cases} \qquad (10)$$

We investigate the stability of the observer by linearizing the two systems (10) and (9) around an equilibrium operating point. Define the new state vectors $x = x_o + \delta x$ with $x_o = [\psi_{ro}\;\; i_{so}\;\; \omega_o]^T$, $\delta x = [\delta\psi_r\;\; \delta i_s\;\; \delta\omega]^T$, and $\hat x = \hat x_o + \delta\hat x$ with $\hat x_o = [\hat\psi_{ro}\;\; \hat i_{so}\;\; \hat\omega_o]^T$, $\delta\hat x = [\delta\hat\psi_r\;\; \delta\hat i_s\;\; \delta\hat\omega]^T$. The reference frame is synchronized with the estimated rotor flux ($\hat\psi_{rqo}=0$), so its two components are $\hat\psi_{rd} = \hat\psi_o + \delta\hat\psi_{rd}$ and $\hat\psi_{rq} = \delta\hat\psi_{rq}$. In these two systems the stator frequencies are regarded as identical: $\omega_s = \hat\omega_s$ [5]. Preserving only the dynamic parts, the two systems (9) and (10) become, after linearization:

$$\begin{cases} \dfrac{d}{dt}\delta\psi_r = -\left(\dfrac{1}{T_r}+j\omega_{slo}\right)\delta\psi_r + \dfrac{L_m}{T_r}\,\delta i_s - j\psi_o\,\delta\omega_{sl} \\[2mm] \dfrac{d}{dt}\delta i_s = \dfrac{L_m}{b}\left(\dfrac{1}{T_r}-j\omega_o\right)\delta\psi_r - (a+j\omega_{so})\,\delta i_s + \dfrac{1}{\sigma L_s}\,\delta u_s - j\dfrac{L_m}{b}\psi_o\,\delta\omega - j i_{so}\,\delta\omega_s \\[2mm] \dfrac{d}{dt}\delta\omega = 0 \end{cases} \qquad (11)$$

$$\begin{cases} \dfrac{d}{dt}\delta\hat\psi_r = -\left(\dfrac{1}{T_r}+j\hat\omega_{slo}\right)\delta\hat\psi_r + \dfrac{L_m}{T_r}\,\delta\hat i_s - j\hat\psi_o\,\delta\hat\omega_{sl} + g_1\,\delta e_i \\[2mm] \dfrac{d}{dt}\delta\hat i_s = \dfrac{L_m}{b}\left(\dfrac{1}{T_r}-j\hat\omega_o\right)\delta\hat\psi_r - (a+j\hat\omega_{so})\,\delta\hat i_s + \dfrac{1}{\sigma L_s}\,\delta u_s - j\dfrac{L_m}{b}\hat\psi_o\,\delta\hat\omega - j\hat i_{so}\,\delta\hat\omega_s \\[2mm] \dfrac{d}{dt}\delta\hat\omega = -k_p\left(-\dfrac{L_m}{b}\omega_o\hat\psi_o\,\delta\hat\psi_{rd} + \dfrac{L_m}{bT_r}\hat\psi_o\,\delta\hat\psi_{rq} - \omega_{so}\hat\psi_o\,\delta\hat i_{sd} - a\hat\psi_o\,\delta\hat i_{sq} - \dfrac{L_m}{b}\hat\psi_o\,\delta\hat\omega_s\right) - k_i\left(-e_{ido}\,\delta\hat\psi_{rq} + e_{iqo}\,\delta\hat\psi_{rd} + \hat\psi_o\,\delta e_{iq}\right) \end{cases}$$
(12)

Defining $\delta e = [\delta e_\psi\;\; \delta e_i\;\; \delta e_\omega]^T$, the system describing the estimation error is as follows:

$$\begin{cases} \dfrac{d}{dt}\delta e_\psi = -\left(\dfrac{1}{T_r}+j\omega_{slo}\right)\delta e_\psi + \left(\dfrac{L_m}{T_r}-g_1\right)\delta e_i - j e_{\psi o}\,\delta\omega_{sl} + j e_{\omega o}\,\delta\hat\psi_r + j\hat\psi_o\,\delta e_\omega \\[2mm] \dfrac{d}{dt}\delta e_i = \dfrac{L_m}{b}\left(\dfrac{1}{T_r}-j\omega_o\right)\delta e_\psi - (a+j\omega_{so})\,\delta e_i - j\dfrac{L_m}{b}e_{\psi o}\,\delta\omega - j\dfrac{L_m}{b}e_{\omega o}\,\delta\hat\psi_r - j e_{io}\,\delta\omega_s - j\dfrac{L_m}{b}\hat\psi_o\,\delta e_\omega \\[2mm] \dfrac{d}{dt}\delta e_\omega = k_p\left(-\dfrac{L_m}{b}\omega_o\hat\psi_o\,\delta e_{\psi d} + \dfrac{L_m}{bT_r}\hat\psi_o\,\delta e_{\psi q} - \omega_{so}\hat\psi_o\,\delta e_{id} - a\hat\psi_o\,\delta e_{iq} - \dfrac{L_m}{b}\hat\psi_o\,\delta e_\omega\right) + k_i\left(-e_{ido}\,\delta\hat\psi_{rq} + e_{iqo}\,\delta\hat\psi_{rd} + \hat\psi_o\,\delta e_{iq}\right) \end{cases} \qquad (13)$$

Separating each state into d and q components, we obtain the corresponding state matrix $\hat A_1$:

$$\hat A_1 = \begin{bmatrix} -\dfrac{1}{T_r} & \omega_{slo} & \dfrac{L_m}{T_r}-g_1 & 0 & 0 \\[1mm] -\omega_{slo} & -\dfrac{1}{T_r} & 0 & \dfrac{L_m}{T_r}-g_1 & \hat\psi_o \\[1mm] \dfrac{L_m}{bT_r} & \dfrac{L_m}{b}\omega_o & -a & \omega_{so} & 0 \\[1mm] -\dfrac{L_m}{b}\omega_o & \dfrac{L_m}{bT_r} & -\omega_{so} & -a & -\dfrac{L_m}{b}\hat\psi_o \\[1mm] -\dfrac{L_m}{b}k_p\omega_o\hat\psi_o & \dfrac{L_m}{bT_r}k_p\hat\psi_o & -k_p\omega_{so}\hat\psi_o & (k_i-ak_p)\hat\psi_o & -\dfrac{L_m}{b}k_p\hat\psi_o \end{bmatrix} \qquad (14)$$

Note that the dynamic matrix $\hat A_1$ depends on the operating condition. Using the necessary condition for stability based on the determinant of (14) [4], it is possible to obtain analytic conditions on local stability, as reported in the next section.

4.2 Stability criterion

We use the following property:

$$\det(\hat A_1) = \prod_{i=1}^{5}\lambda_i \qquad (15)$$

where $\lambda_i$ are the eigenvalues of the matrix $\hat A_1$. The determinant of $\hat A_1$ is:

$$\det(\hat A_1) = -\frac{L_m\hat\psi_o^2\,k_i\,\omega_{so}\left[(\omega_{so}-\omega_o)\,a\,b\,T_r + L_m^2\omega_o - L_m\omega_o g_1 T_r + \omega_{so}\,b\right]}{b^2 T_r} \qquad (16)$$

The condition $\det(\hat A_1) = 0$ leads to:

$$\omega_{so} = 0, \qquad (17a)$$
$$\omega_{so} = \omega_o\,\frac{g_1 L_m + R_s L_r}{R_r L_s + R_s L_r}. \qquad (17b)$$

These conditions of stability may be expressed in the torque/speed plane. Let us consider the mechanical equation:

$$\frac{d}{dt}\omega = \frac{p^2 L_m}{J L_r}\,\Im\!\left(i_s\psi_r^{*}\right) - \frac{p\,T_L}{J}. \qquad (18)$$

Under RFOC conditions and steady state ($\hat\psi_{rqo}=\psi_{rqo}=0$), we obtain:

$$0 = p\,\frac{L_m}{L_r}\,\hat\psi_o\,i_{sqo} - T_{Lo} \qquad (19)$$

hence

$$i_{sqo} = \frac{L_r}{p L_m\hat\psi_o}\,T_{Lo}. \qquad (20)$$

From system (1), under the same conditions, we find:

$$\omega_{slo} = \frac{L_m}{T_r\hat\psi_o}\,i_{sqo}.$$
(21)

Finally, using $\omega_{so} = \omega_{slo} + \omega_o$, equations (17a) and (17b) become:

$$T_{Lo} = -\frac{p\hat\psi_o^2}{R_r}\,\omega_o \qquad (22a)$$
$$T_{Lo} = -\frac{p\hat\psi_o^2}{R_r}\,\frac{\left(1-\dfrac{g_1 L_m}{R_r L_s}\right)\omega_o}{1+\dfrac{T_r}{T_s}} \qquad (22b)$$

with $T_s = L_s/R_s$. The above relations describe two lines, defining the two well-known instability regions in regenerating mode. A sufficient condition for instability is then:

$$\det(\hat A_1) > 0. \qquad (23)$$

Condition (23) defines a set of which the instability region is a subset. In order to complete the study of local stability, we plot, for each eigenvalue, the locus in the torque/speed plane where the conditions $\Re(\lambda_i) > 0$, $i = 1\ldots5$, are verified. On the one hand, if we choose a zero observer gain, as in [9],

$$g_1 = 0 \qquad (24)$$

we obtain the instability region limited by the lines $D_1$ and $D_2$ (Fig. 2), where $\Re(\lambda_i) > 0$, $i = 1\ldots5$, denote the positive real parts of the eigenvalues $\lambda_i$ of the state matrix $\hat A_1$. The eigenvalues correspond respectively to the state variables $\delta e_{\psi rd}$, $\delta e_{\psi rq}$, $\delta e_{id}$, $\delta e_{iq}$ and $\delta e_\omega$. The line $D_2$ is given by:

$$T_{Lo} = -\frac{p\hat\psi_o^2}{R_r}\,\frac{\omega_o}{1+\dfrac{T_r}{T_s}} \qquad (25)$$

On the other hand, in order to reduce (though not totally remove) the unstable region, a real-valued observer gain was considered in [13], which corresponds to the region limited by the lines $D_1$ and $D_3$ (Fig. 3). The selected value of the parameter $g_1$ is:

$$g_1 = -0.25\,R_s \qquad (26)$$

It should be noted that the curves corresponding to the zero observer gain are similar, except that the unstable region is larger. The line $D_3$ is given by:

$$T_{Lo} = -\frac{p\hat\psi_o^2}{R_r}\,\frac{\left(1+\dfrac{0.25\,R_s L_m}{R_r L_s}\right)\omega_o}{1+\dfrac{T_r}{T_s}} \qquad (27)$$

The principle of the instability reduction proposed here consists in computing the feedback gain so that the unstable region is limited to the inobservability line $D_1$. We note that, whatever the structure of the matrix $G$, $D_1$ is always defined by $\omega_{so} = 0$.
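As a quick numerical check of the claim in this section: with the gain of Eq. (29), $g_1 = -L_r R_s/L_m$, the bracketed factor of $\det(\hat A_1)$ in Eq. (16) vanishes identically at $\omega_{so} = 0$, so the only zero-determinant locus left is the inobservability line $D_1$. The parameters are taken from the paper's appendix; the script is a sanity check of the algebra, not part of the paper.

```python
# Motor parameters (from the paper's appendix)
Ls, Lr, Lm, Rs, Rr = 0.472, 0.4721, 0.4475, 9.65, 4.3
sigma = 1 - Lm**2 / (Ls * Lr)
a = (Lr**2 * Rs + Lm**2 * Rr) / (sigma * Ls * Lr**2)
b = sigma * Ls * Lr
Tr = Lr / Rr
g1_proposed = -Lr * Rs / Lm            # Eq. (29)

def det_factor(w_so, w_o, g1):
    # Bracketed factor of det(A1) in Eq. (16), i.e. the LHS of Eq. (28)
    return (w_so - w_o)*a*b*Tr + Lm**2*w_o - Lm*w_o*g1*Tr + w_so*b

# With the proposed gain the factor is zero at w_so = 0 for any w_o ...
residual = max(abs(det_factor(0.0, w_o, g1_proposed))
               for w_o in (-150.0, 10.0, 100.0))
# ... whereas the zero-gain observer keeps a second root (line D2)
residual_zero_gain = abs(det_factor(0.0, 100.0, 0.0))
```

`residual` stays at floating-point noise while `residual_zero_gain` is far from zero, mirroring the analytic conditions (17a)-(17b).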
Figure 2: Torque/speed plane, $g_1 = 0$; regions $(\omega_o, T_{Lo})$ where $\Re(\lambda_i) > 0$, $i = 1\ldots5$, with the lines $D_1$ ($\omega_{so}=0$) and $D_2$.

Figure 3: Torque/speed plane, $g_1 = -0.25R_s$; regions $(\omega_o, T_{Lo})$ where $\Re(\lambda_i) > 0$, $i = 1\ldots5$, with the lines $D_1$ ($\omega_{so}=0$), $D_2$ and $D_3$.

From equation (16), we can write the condition $\omega_{so} = 0$ in the form of equation (28):

$$(\omega_{so}-\omega_o)\,a\,b\,T_r + L_m^2\omega_o - L_m\omega_o g_1 T_r + \omega_{so}\,b = 0 \qquad (28)$$

which can be achieved by choosing the following observer gain:

$$g_1 = -\frac{L_r R_s}{L_m} \qquad (29)$$

The straight line $D_1$ corresponds to zero synchronous speed, $\omega_s = 0$. It is known in the literature as the inobservability line (normally referred to as DC excitation) [6, 3] and seems to be a generic problem for sensorless control of induction motors.

5 Simulation results

In order to validate the proposed design, the regenerating-mode low-speed operation of the speed adaptive observer was investigated by means of simulations. A rotor flux oriented control (RFOC) is simulated using Matlab/Simulink software. The block diagram of the control system is shown in Fig. 4. The flux reference is fixed to the nominal value $\psi_{ref_o}$, where "ref" denotes the reference value.

Figure 4: Block diagram of the sensorless RFOC induction motor simulator.

The proposed observer is compared with an existing solution [13].
In order to validate the proposed design, we studied a conventional test used by industrial drive designers: a very slow, progressive load-torque increase under constant speed. Fig. 5 depicts results in regenerating mode obtained using the observer gain $g_1 = -0.25R_s$ [13], [5]. The speed reference was set to $-25$ rad/s (dashed line) and a rated-load torque ramp was applied at t = 0. After applying the load progressively, the drive should operate in the regenerating mode. However, the actual angular speed and actual flux of the motor collapse and the system becomes unstable. Fig. 6 presents results obtained using the proposed observer design; the system behaves stably.

In Fig. 7, the observer gain $g_1 = -0.25R_s$ was used and the real speed diverges. The first subplot shows the reference (dashed line) and actual angular speed. The second subplot shows the rated-load torque ramp. The third subplot presents the actual flux components $(\psi_{r\alpha}, \psi_{r\beta})$ in the stator reference frame. The fourth subplot shows the control voltages. The fifth and sixth subplots present, respectively, the currents and the current norm. We note that when the load torque increases, the control voltage, the current, and the current norm increase too. In Fig. 8, the proposed observer design was used and the system becomes stable: the real rotor angular speed converges well towards the reference value in response to the same rated-load torque ramp. Note the behavior of the actual flux at t ≈ 3.75 s, when the real angular rotor speed crosses the line $D_1 = D_3$; the system becomes unobservable at this time.

Figure 5: A rated-load torque ramp is applied with the observer gain $g_1 = -0.25R_s$. The first subplot shows the region $(\omega_o, T_{Lo})$ with the two lines $D_1$ and $D_3$; the second subplot shows the actual angular speed.
Figure 6: The proposed observer design was used. The first subplot shows the region $(\omega_o, T_{Lo})$ with the line $D_1 = D_3$; the second subplot shows the actual angular speed.

Figure 7: Instability phenomenon with the observer gain $g_1 = -0.25R_s$ (angular speed, load torque, flux components, control voltages, currents and current norm versus time).

Figure 8: The instability was removed by the proposed observer design $g_1 = -L_r R_s/L_m$.

6 Appendix

6.1 Induction motor parameters

Voltage rating: 380 V; current rating: 2.2 A; number of phases: 3; rated power: 1.1 kW; frequency: 50 Hz; rated speed: 1430 rpm; $p = 2$; $L_s = 0.472$ H; $L_r = 0.4721$ H; $L_m = 0.4475$ H; $R_s = 9.65\ \Omega$; $R_r = 4.3\ \Omega$.

7 Conclusions

The feedback gain design method proposed in this paper reduces the instability region of the adaptive observer to the inobservability line $D_1$ ($\omega_{so} = 0$). The observer using the proposed gain does not have the unstable region, as shown by means of the speed/torque plane and a linearized model.
The stability of the regenerating-mode operation was also confirmed by simulations.

Bibliography

[1] A. Bouhenna, C. Chaigne, N. Bensiali, E. Etien, G. Champenois, Design of speed adaptation law in sensorless control of induction motor in regenerating mode, Simulation Modelling Practice and Theory, Elsevier, Vol. 15, No. 7, pp. 847-863, 2007.
[2] A. Bouhenna, Contribution à la commande sans capteur mécanique de la machine asynchrone en mode générateur à basse vitesse, Thèse de doctorat en sciences, Université des Sciences et de la Technologie d'Oran, Algérie, October 2007.
[3] C. Canudas de Wit, A. Youssef, J. P. Barbot, Ph. Martin, F. Malrait, Observability conditions of induction motors at low frequencies, Proc. CDC, Sydney, pp. 1-7, 2000.
[4] E. Etien, N. Bensiali, C. Chaigne, G. Champenois, Adaptive speed observers for sensorless control of induction motors: a new criterion of stability, International Review of Electrical Engineering, Vol. 1, pp. 36-43, 2006.
[5] M. Hinkkanen, Stabilization of regenerating-mode operation in sensorless induction motor drives by full-order flux observer design, IEEE Trans. Ind. Electron., Vol. 51, pp. 1318-1328, 2004.
[6] H. Hofmann, S. Sanders, Speed-sensorless vector torque control of induction machines using a two-time-scale approach, IEEE Trans. on Ind. Appl., Vol. 34, pp. 169-177, 1998.
[7] H. K. Khalil, Nonlinear Systems, Macmillan, New York, 1983.
[8] H. Kubota, K. Matsuse, DSP-based speed adaptive flux observer of induction motor, IEEE Trans. Ind. Appl., Vol. 29, pp. 344-348, 1993.
[9] H. Kubota, I. Sato, Regenerating mode low speed operation of sensorless induction motor drive with adaptive observer, IEEE Trans. Ind. Appl., Vol. 38, pp. 1081-1086, 2002.
[10] A. Mansouri, M. Chenafa, A. Bouhenna, E.
Etien, Powerful nonlinear observer associated with field-oriented control of an induction motor, International Journal of Applied Mathematics and Computer Science, Vol. 14, No. 2, pp. 209-220, 2004.
[11] M. Chenafa, A. Mansouri, A. Bouhenna, E. Etien, A. Belaidi, M. A. Denai, Global stability of linearizing control with a new robust nonlinear observer of the induction motor, International Journal of Applied Mathematics and Computer Science, Vol. 15, No. 2, pp. 235-243, 2005.
[12] M. Montanari, S. Peresada, A. Tilli, Observerless scheme for sensorless control of induction motor: stability analysis and design procedure, Proc. of the 10th Mediterranean Conference on Control and Automation, MED'02, Lisbon, 2002.
[13] S. Suwankawin, S. Sangwongwanich, A speed-sensorless IM drive with decoupling control and stability analysis of speed estimation, IEEE Trans. Ind. Electron., Vol. 49, pp. 444-455, 2002.
[14] S. Suwankawin, S. Sangwongwanich, Design strategy of an adaptive full-order observer for speed-sensorless induction-motor drives: tracking performance and stabilization, IEEE Trans. Ind. Electron., Vol. 53, pp. 96-119, 2006.
[15] G. Yang, T. Chin, Adaptive-speed identification scheme for a vector-controlled speed sensorless inverter-induction motor drive, IEEE Trans. Ind. Appl., Vol. 29, pp. 820-825, 1993.

A. Bouhenna, A. Mansouri, M. Chenafa, A. Belaidi
E.N.S.E.T. d'Oran, Laboratoire d'Automatique et d'Analyses des Systèmes (L.A.A.S.), Département de Génie Électrique, B.P. 1523, El M'naouer, Oran, Algérie
E-mail: bouhenna @ enset-oran.dz (abouhenna @ yahoo.fr)
Received: October 10, 2007.

Abderrahmane Bouhenna was born in 1955. He received the Dipl. Eng. degree in electronic engineering, the M.S. degree, and the doctorate in automatic control from U.S.T.O., Oran, Algeria, in 1980, 1987 and 2007, respectively. He is currently a professor and researcher in the Laboratory of Automatic Control and Systems Analysis at ENSET of Oran (Algeria). He works on sensorless control and observers of induction motors and has obtained results in this domain on the stabilization of the observers and the control of the induction machine in regenerating mode at low speed.

Abdellah Mansouri was born in Oran, Algeria, in 1953. He received his B.S. degree in electronic engineering from USTO (Algeria) in 1979, the M.S. degree in engineering control from USTO in 1991, and the Ph.D. degree in engineering control from USTO in 2004. He is currently Professor of automatic control at ENSET of Oran (Algeria). His research interests are nonlinear control and observers applied to induction motors and manipulator robots.

Mohammed Chenafa was born in Oran, Algeria, in 1954. He received his B.S. degree in electronic engineering from USTO (Algeria) in 1979, the M.S. degree in signal processing and robotics from USTO in 1998, and the Ph.D. degree in engineering control from USTO in 2005. He is currently Professor of automatic control at ENSET of Oran (Algeria). His research interests are nonlinear control and observers applied to induction motors and manipulator robots.

Abdelkader Belaidi, Professor. In 1981 he obtained a Ph.D. in radiation physics at the University of East Anglia, Norwich, England. His fields of interest are collision damage in materials and neural networks and fuzzy logic. He is now a professor of physics and applied computing at the Higher School of Education, Oran, Algeria. He is also head of the Automatic Control and Systems Analysis Laboratory (LAAS).

International Journal of Computers, Communications & Control, Vol. II (2007), No. 4, pp.
375-387

Fuzzy and Neural Controllers for a Pneumatic Actuator

Tiberiu Vesselenyi, Simona Dziţac, Ioan Dziţac, Mişu-Jan Manolescu

Abstract: There is a great diversity of ways to use fuzzy inference in robot control systems, regarding either the place it takes in the control scheme or the form and type of the inference algorithms used. On the other hand, the ability of artificial neural networks to simulate nonlinear systems is used in various research efforts to develop automated control systems for industrial processes. In these applications of neural networks there are two important steps: system identification (development of a neural process model) and control design (definition of the neural control structure). In this paper we present several modelling applications, using fuzzy and neural controllers, developed for a pneumatic actuator containing a force and a position sensor, which can be used for robotic grinding operations. Following the simulations, one of the algorithms was tested on an experimental setup. The paper also presents the development of a NARMA-L2 neural controller for a pneumatic actuator using position feedback. The structure was trained and validated, with good results.

Keywords: fuzzy control, neural control, force-position feedback, pneumatic actuator.

1 Introduction

There is a great diversity in the ways fuzzy inference and neural networks can be used in robot operation control, regarding either the place the controller takes in the control scheme or the type of fuzzy or neural controller. From the studied references the conclusion can be drawn that fuzzy inference is used (among others) in trajectory generation [3], in robot model design [2], instead of P.I.D. controllers [4], or in combination with these [6]. A detailed presentation of general-purpose fuzzy controllers is given in [5]. The same work shows that fuzzy controllers can be made to behave similarly to classical ones (quasi-P.I.D.).
Other studies emphasize the importance of parameter adjustment and note that fuzzy controllers can be adjusted more easily [6]. Because a large part of fuzzy inference systems have been implemented on a heuristic basis (the membership functions are usually chosen upon the educated guess of specialists), there is no guarantee of reliable operation or stability of the system in unforeseen conditions; experimental tests must therefore be considered. A great number of studies in this field aim at developing methodologies for the synthesis and analysis of fuzzy inference systems, in robotics [2], [4] or in the larger field of control systems [5] (i.e. the study of the stability of fuzzy controllers). There are also works on fuzzy models of robots (used in direct and inverse kinematics [3] or in inverse dynamics [1]), which can replace analytical models and shorten computing times. Many studies attempt a systematic approach to fuzzy system design (the development of a design methodology) that can eliminate the subjectivity in choosing the membership functions and rule sets, as in [2], where a clear method is presented for a rigorous selection of fuzzy inference parameters. In order to model and test fuzzy and neural control in the design phase, an adequate programming environment must be selected. For this purpose we chose the MATLAB programming environment, because it offers predefined functions for developing fuzzy and neural control systems. These functions are linked to external modules such as the "inference system" and the "fuzzy engine", and the Simulink module can also use these functions. User applications can be linked to these modules through the predefined functions.
The typical base structure of a fuzzy system implements a model that makes the correspondence:

crisp input value → input membership functions → inference rules → output characteristics → output membership functions → crisp output value.

(Copyright © 2006-2007 by CCC Publications)

A typical fuzzy inference system supposes a user-defined set of parameters which try to encode the characteristics of the model's variables. If instead a process model is wanted for which certain experimental input-output data sets exist, the fuzzy system parameters can be generated automatically, that is, system identification can be performed. In this case the identification strategy can be a neuro-fuzzy approach, which acquires knowledge from the presented data set in order to generate the membership function parameters. In the MATLAB environment the adjustment of these parameters can be done with a module which works similarly to a neural network, named ANFIS (Adaptive Neural Fuzzy Inference System). Error backpropagation is used as the learning algorithm, and the optimization is made by a gradient method minimizing the quadratic error sum. In [10], a methodology is presented for designing an adaptive fuzzy logic controller: "The neurofuzzy controller is first trained using data from an approximate analytical model of a cellular network, then the controller is fine tuned and adapted to the unique cell dwell time and call holding time distributions of a particular cell in the network". The ability of neural networks to simulate nonlinear systems is used in several studies [13] to develop control systems for industrial processes. When using neural networks for process control there are two steps: system identification (development of the process neural model) and control design (development of the neural control system).
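As a toy illustration of the gradient-based tuning that ANFIS performs on membership function parameters, the following sketch adjusts the center of a single Gaussian membership function by gradient descent on a synthetic data set. The function shape, target center and learning rate are invented for illustration and are not taken from the paper's MATLAB model.

```python
import numpy as np

def gauss(x, c, s=0.3):
    """Gaussian membership value for center c and spread s."""
    return np.exp(-0.5 * ((x - c) / s) ** 2)

# synthetic input-output data: the "true" membership peaks at x = 0.4
x = np.linspace(0.0, 1.0, 50)
target = gauss(x, 0.4)

c = 0.0        # deliberately bad initial center
lr = 0.1       # illustrative learning rate
for _ in range(500):
    y = gauss(x, c)
    # gradient of the mean squared error; d(gauss)/dc = gauss * (x - c) / s^2
    grad = np.mean(2.0 * (y - target) * y * (x - c) / 0.3 ** 2)
    c -= lr * grad
print(round(c, 2))  # converges near the data's true center, 0.4
```

The same idea, applied simultaneously to all membership parameters and the rule consequents, is what the backpropagation/gradient loop inside ANFIS does.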
In the system identification step the neural model of the controlled process is developed, and in the second step this model is used to obtain the neural network that will control the process. The training of the process neural model is made "offline" (or "batch processing"), but the training and optimization of the neural controller must be made "online" using training data sets. A chaos search immune algorithm is proposed in [12] by integrating the chaos optimization algorithm and the clonal selection algorithm: "First, optimization variables are expressed by chaotic variables through solution space transformation. Then, taking advantage of the ergodic and stochastic properties of chaotic variables, a chaos search is performed in the neighborhoods of high affinity antibodies to exploit the local solution space, and the motion of the chaotic variables in their ergodic space is used to explore the whole solution space. Furthermore, a generalized radial basis function neuro-fuzzy controller [...] is constructed and designed automatically".

2 Simulation of the Pneumatic System with a Fuzzy Controller

2.1 General Considerations

The general scheme of the automated grinding system is presented in Figure 1. This system is used to grind metal probes for microscope observations. The "command module" is a programmable computing unit on which the control algorithm runs (the fuzzy or neural controller, in this case a Pentium IV PC); it can transmit signals to the execution unit and receive data from the sensors (by means of a DAQ card with analog and digital channels). The "execution module" contains the execution elements, which are simple and proportional electro-valves; the "force sensor" and "position sensor" are the sensors used to generate the feedback signals. The pneumatic setup of the force-position feedback system (FPFS) is shown in Figure 2. For the position feedback system (PFS) the force sensor is missing.
In the position feedback case the system has to move the probe, approaching the grinding surface to an approximate distance, and then move it at a smaller speed until it touches the surface.

Figure 1: Scheme of the automated metal probe grinding system

In the force-position feedback case the system has to fulfil the objectives represented in Figure 3: move the probe near the grinding wheel surface, then move it at reduced speed until the force reaches a reference value, and then maintain the probe pushed on the surface with the reference force. The strategies described above are graphically expressed in the diagrams of Figure 3.

Figure 2: Scheme of the pneumatic setup for the fuzzy FPFS case (the complete case)

The following have to be defined:
• two input linguistic variables, "position error" and "force error", and one output linguistic variable named "control value";
• the position reference is given by the superior limit of the uncertainty domain d2;
• the final control is made by the "force" variable;
• for the position, two ranges must be defined (big and small), and for the force seven ranges (big negative, medium negative, small negative, zero, small positive, medium positive, big positive).

Figure 3: Functional diagrams for position (a) and force (b) control

2.2 Fuzzy Position Feedback System

The model of the fuzzy PFS is presented in Figure 4. In this case the system is used without force reaction; this model was made only as a preliminary study. In Figure 5 the system's inference diagrams are shown:
• the Gaussian input membership functions represent 5 linguistic ranges of the "position error";
• the triangular output membership functions likewise represent 5 linguistic ranges of the output variable "control value".
Defuzzification is made by a "centroid" function.
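A minimal Mamdani-style sketch of such a position controller, with 5 Gaussian input sets, 5 triangular output sets and centroid defuzzification, can be written as follows. The centers, spreads and universes below are illustrative assumptions; the paper does not list the parameters of the actual MATLAB model.

```python
import numpy as np

def gauss(x, c, s):
    """Gaussian membership value of x for center c, spread s."""
    return np.exp(-0.5 * ((x - c) / s) ** 2)

def tri(u, a, b, c):
    """Triangular membership over the output universe u."""
    return np.maximum(np.minimum((u - a) / (b - a), (c - u) / (c - b)), 0.0)

# hypothetical linguistic ranges for "position error" and "control value"
centers = [-1.0, -0.5, 0.0, 0.5, 1.0]      # NB, NS, ZE, PS, PB
spread = 0.25
u = np.linspace(-1.0, 1.0, 201)            # output universe
out_sets = [tri(u, c - 0.5, c, c + 0.5) for c in centers]

def fuzzy_pfs(err):
    """One inference step: fuzzify -> fire rules -> aggregate -> centroid."""
    fire = [gauss(err, c, spread) for c in centers]           # rule strengths
    agg = np.max([w * s for w, s in zip(fire, out_sets)], axis=0)
    return np.sum(u * agg) / np.sum(agg)                      # centroid defuzz.

print(fuzzy_pfs(0.0))   # zero error gives a control value of (about) zero
```

Each rule here is simply "if error is X then control is X", scaled by the firing strength (product implication); a real rule base would map error ranges to control ranges chosen by the designer.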
Figure 4: Fuzzy PFS model

The results of the simulation are given in Figure 6 for several random step reference values. We can observe how the actual controlled position follows the reference position. The delays are relatively large, but considering that the pneumatic system acts similarly to a damper, this range of delay is acceptable.

2.3 Fuzzy Force-Position Feedback System

The fuzzy FPFS is presented in Figure 7.

Figure 5: Inference diagram of the fuzzy PFS model
Figure 6: Reference (r) and actual (a) signal diagrams resulting from the system simulation (fuzzy PFS case)
Figure 7: Fuzzy FPFS model

For both inputs ("force error" as well as "position error") the membership functions are of "Gaussian" type, and for the output membership functions the "triangular" type was used. Simulation results for step-type references are given in Figures 8 and 9. Analyzing the result diagrams, we can conclude that the control works correctly.

Figure 8: Reference and response force values
Figure 9: Reference and response position values

3 Experimental Setup for the Fuzzy FPFS

The experimental setup used to test the fuzzy FPFS operation is shown in Figure 10. The base idea of this concept was the use of a PCI6023E DAQ card (from National Instruments), for which there are predefined acquisition functions in the MATLAB "Data Acquisition Toolbox". This makes it possible to perform data acquisition, fuzzy inference and command signal generation from the same program. Even though the designed system introduces considerable delays in comparison with a true real-time system (which would use xPCTarget modules), it is good enough for testing the controllers' accuracy.

Figure 10: Experimental setup scheme

The DAQ card has 8 analog input channels (ACH1...8) at 12 bits, of which ACH1 and ACH2 were used.
A major disadvantage of the PCI6023 DAQ card is the absence of an analog output, which would be used to command the pressure regulator proportional electro-valve (Figure 10). This issue was solved by using 6 of the 8 digital channels of the digital I/O port (DIO in Figure 10) and a D/A (digital-to-analog) converter. We thus obtain 64 pressure values from the 6 bits available, which suffices for the experiments. We cannot use all 8 channels of the DIO port because 2 channels must be used to command the 2 on-off electro-valves that change the piston's movement direction.

4 Neural Position Feedback System

In the case of the neural PFS there are two steps to complete: system identification (development of the process neural model) and control design (development of the neural control system) [3]. In the system identification step the neural model of the controlled process is developed, and in the second step this model is used to obtain the neural network that will control the process.

4.1 Identification Step

In the identification step a convenient structure for the process model must be found, and then the neural network is trained on training data sets in order to obtain the values of the weights. A widely used standard structure, representative for nonlinear discrete systems, is the NARMA (Nonlinear AutoRegressive Moving Average) model [3], given by the relation:

y(k + d) = N[y(k), y(k−1), ..., y(k−n+1), u(k), u(k−1), ..., u(k−n+1)]   (1)

in which u(k) is the system's input and y(k) is its output. In order to identify the process, the network is trained to approximate the nonlinear function N.

4.2 Control Step

If the goal of the system is to follow a reference trajectory, y(k + d) = yr(k + d), a nonlinear controller has to be developed:

u(k) = G[y(k), y(k−1), ..., y(k−n+1), yr(k + d), u(k−1), ..., u(k−m+1)]   (2)

In order to generate the function G, which minimizes the quadratic mean error, a dynamic backpropagation learning algorithm should be used, which is hard to implement and very slow. That is the reason why approximate models are usually used. Such an approximate model is given by the relation:

y(k + d) = f[y(k), y(k−1), ..., y(k−n+1), u(k−1), ..., u(k−m+1)] + g[y(k), y(k−1), ..., y(k−n+1), u(k−1), ..., u(k−m+1)] · u(k)   (3)

This model is in a form in which the input u(k) is not contained in the nonlinear terms, and if y(k + d) = yr(k + d), it can then be written that:

u(k) = ( yr(k + d) − f[y(k), y(k−1), ..., y(k−n+1), u(k−1), ..., u(k−n+1)] ) / g[y(k), y(k−1), ..., y(k−n+1), u(k−1), ..., u(k−n+1)]   (4)

In this form it is necessary to find the input value u(k) based on the output y(k) of the same step, which is inconvenient; it is better to use the form:

y(k + d) = f[y(k), y(k−1), ..., y(k−n+1), u(k), u(k−1), ..., u(k−n+1)] + g[y(k), y(k−1), ..., y(k−n+1), u(k), ..., u(k−n+1)] · u(k + 1)   (5)

for d ≥ 2. The structure of the process model neural network is given in Figure 11.

Figure 11: Process neural model based on relation (5)

The controller expression derived from relation (5) is:

u(k + 1) = ( yr(k + d) − f[y(k), ..., y(k−n+1), u(k), ..., u(k−n+1)] ) / g[y(k), ..., y(k−n+1), u(k), ..., u(k−n+1)]   (6)

for d ≥ 2. Figure 12 presents the control scheme in which yr is generated by the neural model ("reference model"). In this case the controller has to perform only a few computations, and the neural model can be trained off-line. The method can be applied in industrial robot control, but it has not yet been tested with pneumatic actuators. The real advantage of using this kind of controller for industrial robots would be the use of a parallel system for each axis of the robot.
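The mechanism behind the control law (6) can be illustrated on a toy plant that is exactly affine in the control. Here f and g are assumed known closed-form functions (in the paper they are the two sub-networks of the trained NARMA-L2 model), the plant itself is invented, and the one-step case d = 1 is used for brevity:

```python
# Toy affine-in-control plant y(k+1) = f(y) + g(y) u(k), standing in for the
# NARMA-L2 approximation; f and g are illustrative, not from the paper.
f = lambda y: 0.5 * y
g = lambda y: 1.0 + 0.1 * y ** 2

def narma_l2_control(y, yr):
    """Control law u = (yr - f(y)) / g(y), the d = 1 analogue of relation (6)."""
    return (yr - f(y)) / g(y)

y, yr = 3.0, 1.0
for k in range(3):
    u = narma_l2_control(y, yr)
    y = f(y) + g(y) * u     # plant update lands exactly on the reference
print(y)  # 1.0
```

Because u enters the model linearly, inverting the model costs one subtraction and one division per step, which is why the on-line controller is so cheap once the networks for f and g are trained off-line.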
The Simulink module of MATLAB was used as support for the computational implementation of the simulation. Simulink contains predefined blocks to generate the neural model, as well as a GUI for parameter setting, training and validation of the controller.

Figure 12: Control scheme designed on the basis of relation (6)

4.3 Neural Position Feedback System Simulation

For this case a proportional valve and two on-off electro-valves have been used. The pneumatic design scheme is the same as the one presented in Figure 2; only the controller structure is different. This model is very complex, and even though the PC used for simulation (Intel Pentium IV, 2 GHz) has significant computing power, the training cycles are very long (on the order of hundreds of minutes). That is why we used the simplified process model shown in Figure 13. The simplification consists of neglecting the air compressibility terms and eliminating the air from the cylinder's inactive chamber.

Figure 13: Simplified process scheme

The presented model is then integrated with the controller model, giving the final version shown in Figure 14.

Figure 14: Process with controller scheme

After the model design, the training, validation and test data sets were generated with the help of the process model. On this basis the neural model of the process and the controller were obtained. The training error as a function of the training epochs is presented in Figure 15. A number of 3000 data samples in 300 epochs were used as training parameters. As can be seen from the diagram, the convergence of the network is good. Simulations were carried out for different position reference values from 0.025 to 0.06. In Figure 16 a sample of the obtained response is shown for a position reference value of 0.042.
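The identification step described above (generate input-output data from the process, then fit a model to it) can be sketched with a linear ARX stand-in fitted by least squares. The paper trains a neural network for the nonlinear map N of relation (1); the plant, signals and gains below are invented so the example stays self-contained:

```python
import numpy as np

rng = np.random.default_rng(0)
a_true, b_true = 0.8, 0.5          # "unknown" plant: y(k+1) = a y(k) + b u(k)

# generate a training data set by exciting the plant with random inputs
u = rng.standard_normal(300)
y = np.zeros(301)
for k in range(300):
    y[k + 1] = a_true * y[k] + b_true * u[k]

# regressor matrix of lagged signals [y(k), u(k)] -> target y(k+1)
Phi = np.column_stack([y[:-1], u])
theta, *_ = np.linalg.lstsq(Phi, y[1:], rcond=None)
print(np.round(theta, 3))   # recovers roughly [0.8, 0.5]
```

Replacing the linear regression with a neural network trained on the same lagged-signal regressors gives exactly the NARMA identification scheme of Section 4.1.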
Figure 15: Training of the neural network
Figure 16: Reference (r) and actual (a) values of the simulation results

5 Results and Conclusions

The structure of the NARMA-L2 neural controller for a pneumatic actuator has been trained and validated, with good results. After the analysis of the simulation results and the experimental system operation, it can be said that the fuzzy controller works properly for this application but needs further adjustments in order to increase the robustness of the control. In actual operation there are unwanted effects:
• the position control error is constant, but the actual position remains below the reference value by about 6% for reference values from 6 to 9;
• the uncertainty domain in which the probe can meet the grinding wheel and in which the control is acceptable is small (about 0.003 m), while in practice it is about 0.005 to 0.006 m;
• the force control error increases with the reference, limiting the application domain in which the error is still acceptable.

Although in some domains the control is acceptable, further adjustments of the control parameters are needed. The study of the neural controller simulation showed that the overshoot error of the system was only 3%. Further studies must be carried out to implement the neural controller for the FPFS case and to test it experimentally.

Bibliography

[1] Amann P., Perronne J.M., Gissinger G.L., Frank P.M., Identification of fuzzy relational models for fault detection, Control Engineering Practice 9, 555, 2001.
[2] Emami M.R., Goldenberg A.A., Burhan T.R., Systematic design and analysis of fuzzy-logic control and application to robotics, Part I. Modeling, Robotics and Autonomous Systems 33, pp. 65-88, 2000.
[3] Jang, R., MATLAB Fuzzy Toolbox, The MathWorks, Inc., Revision 1.12, 2000.
[4] Novakovic, B., Scap, D., Novakovic, D., An analytic approach to fuzzy robot control synthesis, Engineering Applications of Artificial Intelligence 13, pp. 71-83, 2000.
[5] Preitl, St., Precup, E., Introducere în conducerea fuzzy a proceselor, Ed. Tehnică, Bucureşti, 1997.
[6] Reznik L., Ghanayem O., Bourmistrov A., PID plus fuzzy controller structures as a design base for industrial applications, Engineering Applications of Artificial Intelligence 13, pp. 419-430, 2000.
[7] Vesselenyi T., Automated flexible cell for microstructure recognition, PhD thesis, Universitatea "Politehnica" Timişoara, 2005.
[8] Chen M.M., Fairwather J.A., Green S.A., EduMech, Mechatronic Instructional Systems. Case study: Pneumatics Systems, production of Shandor Motion Systems, Inc., 1999.
[9] Harbick K., Sukhatme S., Speed control of a pneumatic monopod using a neural network, www.harbick-ann.com, 2002.
[10] Raad R., Raad I., Neuro-fuzzy admission control in cellular networks, 10th IEEE Singapore International Conference on Communication Systems, pp. 1-7, 2006.
[11] Wenmei H., Yong Y., Yali T., Adaptive neuron control based on predictive model in pneumatic servo system, 2002.
[12] Zuo X.Q., Fan Y.S., A chaos search immune algorithm with its application to neuro-fuzzy controller design, Chaos, Solitons & Fractals, Vol. 30, Issue 1, pp. 94-109, 2006.
[13] Zhang J., Knoll A., Schmidt R., A neuro-fuzzy control model for fine-positioning of manipulators, Robotics and Autonomous Systems 32, pp. 101-113, 2000.

Tiberiu Vesselenyi, University of Oradea, Universităţii St. 1, 410087 Oradea, Romania, tvesselenyi@yahoo.co.uk

Simona Dziţac, University of Oradea, Universităţii St.
1, 410087 Oradea, Romania, sdzitac@rdslink.ro

Ioan Dziţac, Department of Economic Informatics, Agora University of Oradea, Piaţa Tineretului 8, 410526 Oradea, Romania, idzitac@univagora.ro

Mişu-Jan Manolescu, Agora University, Piaţa Tineretului 8, 410526 Oradea, Romania, rectorat@univagora.ro

Received: February 14, 2007

Tiberiu Vesselenyi was born in Oradea, Romania, in 1957; he graduated from the University "Politehnica" of Timişoara in 1983. From 1983 to 1991 he worked at a machine building company in Oradea as a designer and CNC programmer. From 1991 to 1994 he was a research engineer at the "Geothermal Energy Research Center" in Oradea, and since 1994 he has been associate professor at the University of Oradea, where he teaches robot and CNC programming. He earned a PhD in robotics at the University "Politehnica" of Timişoara. He has published over 150 papers in national and international conferences and journals, and is author or co-author of 4 books.

Simona Dziţac received B.Sc. (2000) and M.Sc. (2001) degrees in mathematics-physics and B.Sc. (2005) and M.Sc. (2007) degrees in energy engineering from the University of Oradea, and a B.Sc. in economic informatics (2007) from the University of Craiova, Romania. At this moment she is a PhD student in the energy engineering field and a researcher at the University of Oradea. Her current research interests include reliability, applied mathematics and computer science in engineering fields. She has published 4 books and 38 scientific papers in journals and conference proceedings.

Ioan Dziţac received an M.Sc. in mathematics (1977) and a PhD in information science (2002) from "Babeş-Bolyai" University of Cluj-Napoca. At this moment he is associate professor and head of the Economic Informatics Department at Agora University, Oradea, Romania. His current research interests include different aspects of parallel and distributed computing, applied mathematics and economic informatics. He has edited 4 conference proceedings and published 12 books and 47 scientific papers in journals and conference proceedings. He has been a member of the program committees of 18 international conferences.

Mişu-Jan Manolescu received an M.Sc. in electro-mechanics (1984) from the University of Craiova, a PhD in microwaves (1994) from the University of Oradea, and a PhD in human resources management (2000) from the University of Craiova. He is now professor and president of Agora University, Oradea, Romania. His current research interests include different aspects of knowledge management and knowledge engineering. He has edited 4 conference proceedings and published 9 books and 63 scientific papers in journals and conference proceedings. He has been a member of the program committees of 5 international conferences.

Int. J. Comput. Commun., ISSN 1841-9836, Vol. 7 (2012), No. 4 (November), pp. 776-784

Stability of Discrete-Time Systems with Time-Varying Delay: Delay Decomposition Approach

S.B. Stojanovic, D.Lj. Debeljkovic, N. Dimitrijevic

Sreten B. Stojanovic, University of Nis, Faculty of Technology, Serbia, 16000 Leskovac, Bulevar Oslobodjenja 124, e-mail: ssreten@ptt.rs

Dragutin Lj. Debeljkovic, Nebojsa Dimitrijevic, University of Belgrade, Faculty of Mechanical Engineering, Serbia, 11120 Beograd, Kraljice Marije 16, e-mail: ddebeljkovic@mas.bg.ac.rs, ndimitri@verat.net

Abstract: This article deals with the problem of obtaining delay-dependent stability conditions for a class of discrete-time systems with interval time-varying delay. By decomposing the delay interval into two unequal subintervals with a tuning parameter α, a new interval delay-dependent Lyapunov-Krasovskii functional is constructed, from which novel delay-dependent stability conditions are derived in terms of linear matrix inequalities. This leads to a reduction of conservatism in terms of the upper bounds of the maximum time-delay.
The numerical examples show that the obtained result is less conservative than some existing ones in the literature.

Keywords: time-delay systems, interval time-varying delay, asymptotic stability, delay-dependent stability, Lyapunov-Krasovskii methods.

1 Introduction

Time-delay frequently occurs in many practical systems, such as manufacturing, telecommunication and economic systems. Since time-delay is an important source of instability and poor performance, considerable attention has been paid to the problems of stability analysis and controller synthesis for continuous time-delay systems (see e.g. [3-5, 10, 11, 17-21, 23-26] and the references therein). Conversely, less attention has been devoted to the corresponding results for discrete-time delay systems (see e.g. [1, 2, 6-9, 12-15, 22, 24]). This is mainly due to the fact that such systems can be transformed into augmented systems without delay. This augmentation of the system is, however, inappropriate for systems with unknown delays and for systems with time-varying delay, which are the subject of analysis in this work. Recently, increasing attention has been devoted to the problem of delay-dependent stability of linear systems with time-varying delay, both continuous-time (see e.g. [5, 10, 11, 18-21, 23, 25, 26]) and discrete-time (see e.g. [1, 2, 7-9, 12, 14, 15, 24]), and a great number of delay-dependent stability criteria have been derived. The key point in deriving delay-dependent stability criteria is the choice of an appropriate Lyapunov-Krasovskii functional (LKF). It is known that the existence of a complete quadratic Lyapunov-Krasovskii functional (CQLKF) is a sufficient and necessary condition for the asymptotic stability of a time-delay system. Using the CQLKF, one can obtain a maximum allowable upper bound (MAUB) of the delay which is very close to the analytical delay limit for stability.
However, the CQLKF leads to a complicated system of partial differential equations, yielding infinite-dimensional linear matrix inequalities (LMIs). Therefore, to develop simpler stability criteria, many authors have used special forms of the LKF rather than the CQLKF, which give LMIs of finite order and a reduced value of the MAUB. Further, to reduce the conservativeness of the existing results, some new analysis methods have been proposed, such as the descriptor system transformation method [3-5], the free weighting matrix method [8, 11, 23], the matrix inequality method [10, 17, 18] and the input-output approach [19]. (Copyright © 2006-2012 by CCC Publications) Using these methods, many stability criteria have been derived by checking the variation of the LKF over the whole interval of the time-delay. In contrast to this approach, in [24, 25], in order to obtain less conservative stability conditions, the interval of the time-delay is divided into multiple equidistant subintervals and an interval delay-dependent LKF (ID-D LKF) is constructed. By checking the variation of the ID-D LKF defined on the subintervals, some new delay-dependent stability criteria are derived. It is worth pointing out that the main difference between the LKF and the ID-D LKF lies in that the latter allows taking different weighting matrices on different subintervals. Therefore the ID-D LKF, as expected, will yield less conservative delay-dependent stability criteria. Inspired by the idea of Zhu and Yang [26] of splitting the delay of continuous-time systems into two unequal subintervals, a new method is developed in this paper for the stability analysis of discrete-time systems with time-varying delay. The delay interval [k−hm, k−1] in the ID-D LKF is divided into two unequal subintervals, [k−hm, k−α−1] and [k−α, k−1], where 0 < α < hm is a tuning parameter. The new ID-D LKF is constructed with different weighting matrices on the various subintervals.
Free-weighting matrices and model transformations are not used in deriving the delay-dependent criterion. It is shown that the presented stability condition is much less conservative than the existing ones [1, 2, 6-9, 13-15, 22, 24], because it gives a larger value of the MAUB. The derived condition can be seen as an extension of the methods in [24, 25], wherein the whole delay range is divided into N ≥ 2 equal subintervals. As the number of subintervals in [24, 25] grows beyond two, the decomposition approach is more complex and the resulting stability conditions are more conservative and difficult to implement. To demonstrate the effectiveness of the proposed method, numerical examples are given in Section 3.

Notation: ℜ^n and Z+ denote the n-dimensional Euclidean space and the set of positive integers, respectively. The notation P > 0 (P ≥ 0) means that the matrix P is real symmetric and positive definite (positive semi-definite). For real symmetric matrices P and Q, the notation P > Q (P ≥ Q) means that the matrix P − Q is positive definite (positive semi-definite). I is an identity matrix of appropriate dimension. The superscript "T" represents the transpose. In symmetric block matrices or complex matrix expressions, an asterisk (∗) represents a term induced by symmetry. If the dimensions of matrices are not explicitly given, they are assumed to be compatible for algebraic operations.

2 Main Results

Consider the following system with an interval time-varying delay:

x(k + 1) = A x(k) + B x(k − h(k))   (1)

where x(k) ∈ ℜ^n is the state at instant k, A ∈ ℜ^{n×n} and B ∈ ℜ^{n×n} are constant matrices, and h(k) is a positive integer representing the time-delay of the system, which we assume to be time-dependent and to satisfy:

0 ≤ h(k) ≤ hm   (2)

where hm is a known positive and finite integer. The aim of this article is to establish sufficient conditions that guarantee the delay-dependent stability of the system (1) and that are less conservative than the existing results in the literature.
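The augmentation mentioned in the introduction can be sketched for a constant delay h: stacking x(k), ..., x(k−h) gives a delay-free system z(k+1) = M z(k), whose asymptotic stability reduces to the spectral radius condition ρ(M) < 1. This is exactly what fails for unknown or time-varying h(k). The matrices below are illustrative, chosen in the style of the numerical examples common in this literature:

```python
import numpy as np

def augmented(A, B, h):
    """Delay-free companion matrix for x(k+1) = A x(k) + B x(k-h)."""
    n = A.shape[0]
    M = np.zeros((n * (h + 1), n * (h + 1)))
    M[:n, :n] = A                 # current state block
    M[:n, n * h:] = B             # delayed state block
    for i in range(h):            # shift register carrying past states
        M[n * (i + 1):n * (i + 2), n * i:n * (i + 1)] = np.eye(n)
    return M

A = np.array([[0.8, 0.0], [0.05, 0.9]])
B = np.array([[-0.1, 0.0], [-0.2, -0.1]])
M = augmented(A, B, h=3)
print(max(abs(np.linalg.eigvals(M))) < 1)   # True: stable for this constant h
```

Note that the augmented matrix has dimension n(h+1), so this check also becomes impractical for large delay bounds, which is another reason LMI criteria such as Theorem 2 below are preferred.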
We first introduce the following result, which will be used in the proof of our main results.

Lemma 1. Let y(k) = x(k+1) − x(k). For any matrix R > 0 and integers 0 ≤ h1 < h2 [24]:

−(h2 − h1) Σ_{m=k−h2}^{k−1−h1} y^T(m) R y(m) ≤ [x(k−h1); x(k−h2)]^T [−R, R; R, −R] [x(k−h1); x(k−h2)]
= −[x(k−h1) − x(k−h2)]^T R [x(k−h1) − x(k−h2)]   (3)

Theorem 2. For given scalars hm (hm > 0) and α (0 < α < hm), the system described by (1)-(2) is asymptotically stable if there exist matrices P = P^T > 0, Qi = Qi^T ≥ 0 and Zi = Zi^T > 0 (i = 1, 2, 3) such that the following LMIs hold:

Φ = [Φ11, Φ12, 0, 0, Φ15; ∗, Φ22, Φ23, 0, Φ25; ∗, ∗, Φ33, Φ34, 0; ∗, ∗, ∗, Φ44, 0; ∗, ∗, ∗, ∗, Φ55] < 0   (4)

Ψ = [Ψ11, Ψ12, Ψ13, 0, Ψ15; ∗, Ψ22, Ψ23, Ψ24, Ψ25; ∗, ∗, Ψ33, 0, 0; ∗, ∗, ∗, Ψ44, 0; ∗, ∗, ∗, ∗, Ψ55] < 0   (5)

where

Φ11 = A^T P A − P + Q1 + Q3 − (1/α)(Z1 + Z3),
Φ12 = A^T P B + (1/α)(Z1 + Z3),
Φ15 = (A − I)^T U1,
Φ22 = B^T P B − Q3 − (1/α)(2Z1 + Z3),
Φ23 = (1/α) Z1,
Φ25 = B^T U1,
Φ33 = −Q1 + Q2 − (1/α) Z1 − (1/(hm − α)) Z2,
Φ34 = (1/(hm − α)) Z2,
Φ44 = −Q2 − (1/(hm − α)) Z2,
Φ55 = −U1,
Ψ11 = Φ11,
Ψ12 = A^T P B,
Ψ13 = (1/α)(Z1 + Z3),
Ψ15 = (A − I)^T U2,
Ψ22 = B^T P B − Q3 − (1/(hm − α))(2Z2 + Z3),
Ψ23 = (1/(hm − α))(Z2 + Z3),
Ψ24 = (1/(hm − α)) Z2,
Ψ25 = B^T U2,
Ψ33 = −Q1 + Q2 − (1/α)(Z1 + Z3) − (1/(hm − α))(Z2 + Z3),
Ψ44 = −Q2 − (1/(hm − α)) Z2,
Ψ55 = −U2,
U1 = α Z1 + (hm − α) Z2 + α Z3,
U2 = α Z1 + (hm − α) Z2 + hm Z3

Proof: Construct the interval delay-dependent LKF as

V(k) = V1(k) + V2(k) + V3(k)   (6)

where

V1(k) = x^T(k) P x(k)   (7)

V2(k) = Σ_{i=k−α}^{k−1} x^T(i) Q1 x(i) + Σ_{i=k−hm}^{k−1−α} x^T(i) Q2 x(i) + Σ_{i=k−h(k)}^{k−1} x^T(i) Q3 x(i)   (8)

V3(k) = Σ_{i=−α}^{−1} Σ_{j=k+i}^{k−1} y^T(j) Z1 y(j) + Σ_{i=−hm}^{−1−α} Σ_{j=k+i}^{k−1} y^T(j) Z2 y(j) + Σ_{i=−h(k)}^{−1} Σ_{j=k+i}^{k−1} y^T(j) Z3 y(j)   (9)

with P = P^T > 0, Qi = Qi^T ≥ 0 and Zi = Zi^T > 0 (i = 1, 2, 3).
note that the delay interval [k−h_m, k−1] in the lkf is divided into two unequal subintervals, [k−h_m, k−α−1] and [k−α, k−1], where 0 < α < h_m is a tuning parameter. taking the difference ∆V_i(k) = V_i(k+1) − V_i(k), we obtain
$$ \Delta V_1(k) = x^T(k)\big(A^T P A - P\big)x(k) + 2 x^T(k) A^T P B\, x(k-h(k)) + x^T(k-h(k))\, B^T P B\, x(k-h(k)) \quad (10) $$
$$ \Delta V_2(k) = x^T(k) Q_1 x(k) - x^T(k-\alpha) Q_1 x(k-\alpha) + x^T(k-\alpha) Q_2 x(k-\alpha) - x^T(k-h_m) Q_2 x(k-h_m) + x^T(k) Q_3 x(k) - x^T(k-h(k)) Q_3 x(k-h(k)) \quad (11) $$
$$ \Delta V_3(k) = \sum_{i=-\alpha}^{-1} \big[ y^T(k) Z_1 y(k) - y^T(k+i) Z_1 y(k+i) \big] + \sum_{i=-h_m}^{-1-\alpha} \big[ y^T(k) Z_2 y(k) - y^T(k+i) Z_2 y(k+i) \big] + \sum_{i=-h(k)}^{-1} \big[ y^T(k) Z_3 y(k) - y^T(k+i) Z_3 y(k+i) \big] $$
$$ = y^T(k) \big[ \alpha Z_1 + (h_m-\alpha) Z_2 + h(k) Z_3 \big] y(k) - \sum_{m=k-\alpha}^{k-1} y^T(m) Z_1 y(m) - \sum_{m=k-h_m}^{k-1-\alpha} y^T(m) Z_2 y(m) - \sum_{m=k-h(k)}^{k-1} y^T(m) Z_3 y(m) \quad (12) $$
it is known from (2) that, for any k ∈ Z^+, h(k) ∈ [0, α] or h(k) ∈ [α+1, h_m]. define the two sets
$$ \Omega_1 = \{ k : h(k) \in [0, \alpha],\ k \in Z^+ \} \quad (13) $$
$$ \Omega_2 = \{ k : h(k) \in [\alpha+1, h_m],\ k \in Z^+ \} \quad (14) $$
in the following, we discuss the variation of ∆V(k) in two cases (k ∈ Ω_1 and k ∈ Ω_2).

case 1. for k ∈ Ω_1, i.e. 0 ≤ h(k) ≤ α,
$$ \sum_{m=k-\alpha}^{k-1} y^T(m) Z_1 y(m) = \sum_{m=k-\alpha}^{k-1-h(k)} y^T(m) Z_1 y(m) + \sum_{m=k-h(k)}^{k-1} y^T(m) Z_1 y(m) \quad (15) $$
$$ \Delta V_3(k) = y^T(k)\big[ \alpha Z_1 + (h_m-\alpha) Z_2 + h(k) Z_3 \big] y(k) - \sum_{m=k-\alpha}^{k-1-h(k)} y^T(m) Z_1 y(m) - \sum_{m=k-h(k)}^{k-1} y^T(m)(Z_1+Z_3) y(m) - \sum_{m=k-h_m}^{k-1-\alpha} y^T(m) Z_2 y(m) \quad (16) $$
because Z_1 + Z_3 > 0, h(k) ≤ α and α − h(k) ≤ α, using lemma 1 it follows that
$$ -\sum_{m=k-h(k)}^{k-1} y^T(m)(Z_1+Z_3) y(m) \le -\tfrac{1}{h(k)} \big[x(k)-x(k-h(k))\big]^T (Z_1+Z_3) \big[x(k)-x(k-h(k))\big] $$
$$ \le -\tfrac{1}{\alpha} x^T(k)(Z_1+Z_3)x(k) + \tfrac{2}{\alpha} x^T(k)(Z_1+Z_3)\, x(k-h(k)) - \tfrac{1}{\alpha} x^T(k-h(k))(Z_1+Z_3)\, x(k-h(k)) \quad (17) $$
$$ -\sum_{m=k-\alpha}^{k-1-h(k)} y^T(m) Z_1 y(m) \le -\tfrac{1}{\alpha-h(k)} \big[x(k-h(k))-x(k-\alpha)\big]^T Z_1 \big[x(k-h(k))-x(k-\alpha)\big] $$
$$ \le -\tfrac{1}{\alpha} x^T(k-h(k)) Z_1 x(k-h(k)) + \tfrac{2}{\alpha} x^T(k-h(k)) Z_1 x(k-\alpha) - \tfrac{1}{\alpha} x^T(k-\alpha) Z_1 x(k-\alpha) \quad (18) $$
$$ -\sum_{m=k-h_m}^{k-1-\alpha} y^T(m) Z_2 y(m) \le -\tfrac{1}{h_m-\alpha} \big[x(k-\alpha)-x(k-h_m)\big]^T Z_2 \big[x(k-\alpha)-x(k-h_m)\big] $$
$$ = -\tfrac{1}{h_m-\alpha} x^T(k-\alpha) Z_2 x(k-\alpha) + \tfrac{2}{h_m-\alpha} x^T(k-\alpha) Z_2 x(k-h_m) - \tfrac{1}{h_m-\alpha} x^T(k-h_m) Z_2 x(k-h_m) \quad (19) $$
combining (10)-(19) yields ∆V(k) ≤ ξ^T(k) Φ̂ ξ(k), where
$$ \hat\Phi = \begin{bmatrix} \phi_{11} + (A-I)^T U_1 (A-I) & \phi_{12} + (A-I)^T U_1 B & 0 & 0 \\ * & \phi_{22} + B^T U_1 B & \phi_{23} & 0 \\ * & * & \phi_{33} & \phi_{34} \\ * & * & * & \phi_{44} \end{bmatrix}, \qquad \xi(k) = \big[\, x^T(k) \;\; x^T(k-h(k)) \;\; x^T(k-\alpha) \;\; x^T(k-h_m) \,\big]^T \quad (20) $$
obviously, ∆V(k) < 0 for k ∈ Ω_1 if Φ̂ < 0. using the schur complement, it is easy to see that ∆V(k) < 0 holds if Φ < 0 and h(k) ∈ [0, α].

case 2. similarly, for k ∈ Ω_2, i.e. α+1 ≤ h(k) ≤ h_m, using the schur complement it is easy to see that ∆V(k) < 0 holds if Ψ < 0.

from the above discussion we see that ∆V(k) < 0 for all k ∈ Z^+ if (4)-(5) hold, which completes the proof. □

remark 3. theorem 2 presents a stability result which depends on the maximum delay bound h_m. the conditions in theorem 2 are expressed in terms of lmis, and therefore they can be easily checked using standard numerical software.

remark 4. the delay interval [k−h_m, k−1] in the interval delay-dependent lkf is divided into two unequal subintervals, [k−h_m, k−α−1] and [k−α, k−1], where 0 < α < h_m is a tuning parameter.
consequently, different weighting matrices are used in the lyapunov functional on the two subintervals, and the information on the delayed state x(k−α) can be taken fully into consideration. further, using the subintervals and lemma 1, the upper bounds of some terms in ∆V_3(k) are estimated more accurately than by the previous methods, since the upper bound h_m of the delay h(k) on the interval 0 ≤ h(k) ≤ h_m is replaced with the two less conservative upper bounds α and h_m on the subintervals 0 ≤ h(k) ≤ α and α < h(k) ≤ h_m, respectively. so the decomposition method presented in theorem 2 can increase the value of the maub. an algorithm for seeking a value of α (0 < α < h_m) subject to (4)-(5), such that the maub h_m attains its maximal value, can easily be obtained.

algorithm 5.
step 1. let h = 0 and α = 0.
step 2. h = h + 1.
step 3. α = α + 1.
step 4. if the inequalities (4)-(5) are feasible, then α_m = α, α = 0 and go to step 2; otherwise, go to step 5.
step 5. if α = h − 1, go to step 6; otherwise, go to step 3.
step 6. the maximal delay is h_m = h − 1 and the minimal value of the tuning parameter α is α_m.

3 numerical examples

in this section, two examples are presented. the obtained results are compared with several existing criteria from the literature.

example 1. consider system (1) with the time-varying delay h(k) satisfying (2) and
$$ A = \begin{bmatrix} 0.8 & 0 \\ 0 & \lambda \end{bmatrix}, \qquad B = \begin{bmatrix} -0.1 & 0 \\ -0.1 & -0.1 \end{bmatrix}, \qquad \lambda \in \{0.91,\ 0.97\} $$

case 1 (λ = 0.91). this system was considered in [13] and [22]. table 1 lists the maub of the delay obtained from theorem 2 of this paper. for comparison, the results from [13, 22] are also listed in the table. it is clear that theorem 2 leads to better results than those in [13, 22].

table 1. comparison of the maub of the delay based on different existing methods for λ = 0.91

method                      h_m
[13]                        41
[22, corollary 1]           42
theorem 2 in this paper     46 (for α = 19, ..., 30)

case 2 (λ = 0.97).
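algorithm 5 above can be sketched as the nested search below; `is_feasible(h, alpha)` stands in for an actual lmi solver call (e.g. through a semidefinite programming package) and is replaced here by a toy placeholder, used only to make the sketch executable (it ignores alpha and declares feasibility up to h = 46, the maub reported for example 1, case 1):

```python
# a sketch of algorithm 5: increase h and, for each h, search the tuning
# parameter alpha in {1, ..., h-1}; once the lmis (4)-(5) are infeasible
# for every alpha, the previous h is the maub.
def is_feasible(h, alpha):
    # toy placeholder for the lmi feasibility test of (4)-(5)
    return h <= 46

def max_delay_bound():
    h, alpha_m = 2, None
    while True:
        alpha = next((a for a in range(1, h) if is_feasible(h, a)), None)
        if alpha is None:
            return h - 1, alpha_m   # h_m (maub) and minimal feasible alpha
        alpha_m, h = alpha, h + 1

print(max_delay_bound())            # prints (46, 1) with the toy oracle
```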
for comparison, the results from [2, 6, 7, 9] and from this paper are listed in table 2. it is clear that theorem 2 gives much better results than the existing delay-dependent criteria.

example 2. consider system (1) with the time-varying delay h(k) satisfying (2) and

case 1 [1, 14, 15]:
$$ A = \begin{bmatrix} 0.6 & 0 \\ 0.35 & 0.7 \end{bmatrix}, \qquad B = \begin{bmatrix} 0.1 & 0 \\ 0.2 & 0.1 \end{bmatrix} $$

case 2 [8, 9, 15, 24]:
$$ A = \begin{bmatrix} 0.8 & 0 \\ 0.05 & 0.9 \end{bmatrix}, \qquad B = \begin{bmatrix} -0.1 & 0 \\ -0.2 & -0.1 \end{bmatrix} $$

table 2. comparison of the maub of the delay based on different existing methods for λ = 0.97

method                      h_m
[9, theorem 1]              4
[7, theorem 3]              8
[6, lemma 2]                8
[2, theorem 1]              10
theorem 2 in this paper     17 (for α = 9, 10, 11)

for the above systems, the lmi conditions for delay-independent stability [16],
$$ P = P^T > 0, \quad Q = Q^T > 0, \quad \begin{bmatrix} -P+Q & 0 & A^T P \\ * & -Q & B^T P \\ * & * & -P \end{bmatrix} < 0, $$
are feasible, from which we deduce that both systems are stable for 0 ≤ h(k) < ∞. using the existing delay-dependent criteria, only finite values of the maub can be obtained to guarantee the stability of the given systems. tables 3 (case 1) and 4 (case 2) list the intervals of the time delay for the different methods. based on theorem 2 of this paper, very large numerical values of the maub are obtained (h_m → ∞). hence, using theorem 2 leads to better results than those in [1, 8, 9, 14, 15, 25].

table 3. interval of time delay for case 1

method                                  interval
[1, th. 3.1]                            2 ≤ h(k) ≤ 10
[15, theorem 1], [15, theorem 2]        2 ≤ h(k) ≤ 13
[14, theorem 3.2]                       0 ≤ h(k) ≤ 12
[2, theorem 1]                          2 ≤ h(k) ≤ 15
theorem 2 in this paper                 0 ≤ h(k) ≤ 10 · 10^21

table 4. interval of time delay for case 2

method                      interval
[9, theorem 1]              0 ≤ h(k) ≤ 6
[15, theorem 2]             0 ≤ h(k) ≤ 10
[8, theorem 1]              0 ≤ h(k) ≤ 12
[24, theorem 5]             2 ≤ h(k) ≤ 19
[24, theorem 7]             2 ≤ h(k) ≤ 20
theorem 2 in this paper     0 ≤ h(k) ≤ 9.61 · 10^8

4 conclusion

in this paper, the problem of obtaining delay-dependent stability conditions for a class of systems with interval time-varying delay is discussed. a new interval delay-dependent lyapunov–krasovskii functional is constructed by splitting the delay interval into two unequal subintervals by a tuning parameter α. free-weighting matrices and model transformation are not used in deriving the delay-dependent criteria. numerical examples show that the results proposed in this paper are much less conservative, in terms of the maximum allowable upper bound of the delay, than the existing results in the literature.

bibliography

[1] e.k. boukas, discrete-time systems with time-varying time delay: stability and stabilizability, mathematical problems in engineering, article id 42489: 1-10, 2006.
[2] k.f. chen and i-k fong, stability of discrete-time uncertain systems with a time-varying state delay, proc. imeche, part i: j. systems and control engineering, 222: 493-500, 2008.
[3] e. fridman and u. shaked, a descriptor system approach to h∞ control of linear time-delay systems, ieee transactions on automatic control, 47(2): 253-270, 2002.
[4] e. fridman and u. shaked, h∞ control of linear state delay descriptor systems: an lmi approach, linear algebra and its applications, 351-352: 271-302, 2002.
[5] e. fridman, new lyapunov–krasovskii functionals for stability of linear retarded and neutral type systems, systems and control letters, 43: 309-319, 2001.
[6] e. fridman and u. shaked, delay-dependent h∞ control of uncertain discrete delay systems, european journal of control, 11: 29-37, 2005.
[7] e. fridman and u.
shaked, stability and guaranteed cost control of uncertain discrete delay systems, international journal of control, 78(4): 235-246, 2005.
[8] h. gao and t. chen, new results on stability of discrete-time systems with time-varying state delay, ieee transactions on automatic control, 52: 328-334, 2007.
[9] h. gao, j. lam and y. wang, delay-dependent output-feedback stabilization of discrete-time systems with time-varying state delay, iee proc.: control theory and applications, 151(6): 691-698, 2004.
[10] q.l. han and d. yue, absolute stability of lur'e systems with time-varying delay, iet control theory & applications, 1(3): 854-859, 2007.
[11] y. he, m. wu, j.h. she and g.p. liu, parameter-dependent lyapunov functional for stability of time-delay systems with polytopic-type uncertainties, ieee transactions on automatic control, 49: 828-832, 2004.
[12] x. jiang, q.l. han and x.h. yu, stability criteria for linear discrete-time systems with interval-like time-varying delay, proc. american control conference, new orleans, usa, 2817-2822, 2005.
[13] y.s. lee and w.h. kwon, delay-dependent robust stabilization of uncertain discrete-time state-delayed systems, proc. 15th ifac world congress, 15(1), barcelona, spain, 2002.
[14] v. leite and m. miranda, robust stabilization of discrete-time systems with time-varying delay: an lmi approach, mathematical problems in engineering, article id 876509, 15 pages, 2008.
[15] x.g. liu, r.r. martin, m. wu and m.l. tang, delay-dependent robust stabilization of discrete-time systems with time-varying delay, iee proc.: control theory and applications, 153(6): 689-702, 2006.
[16] m.s. mahmoud, robust control and filtering for time-delay systems, marcel dekker, new york, 2000.
[17] y.s. moon, p. park and w.h. kwon, robust stabilization of uncertain input-delayed systems using reduction method, automatica, 37: 307-312, 2001.
[18] p. park and j.w. ko, stability and robust stability for systems with a time-varying delay, automatica, 43: 1855-1858, 2007.
[19] e. shustin and e. fridman, on delay-derivative-dependent stability of systems with fast-varying delays, automatica, 43: 1649-1655, 2007.
[20] m. wu, y. he, j.h. she and g.p. liu, delay-dependent criteria for robust stability of time-varying delay systems, automatica, 40: 1435-1439, 2004.
[21] s. xu and j. lam, improved delay-dependent stability criteria for time-delay systems, ieee transactions on automatic control, 50(3): 384-387, 2005.
[22] s. xu, j. lam and y. zou, improved conditions for delay-dependent robust stability and stabilization of uncertain discrete time-delay systems, asian journal of control, 7(3): 344-348, 2005.
[23] d. yue and q.l. han, a delay-dependent stability criterion of neutral systems and its application to a partial element equivalent circuit model, ieee transactions on circuits and systems-ii, 51(12): 685-689, 2004.
[24] d. yue, e. tian and y. zhang, a piecewise analysis method to stability analysis of linear continuous/discrete systems with time-varying delay, international journal of robust and nonlinear control, 19: 1493-1518, 2009.
[25] x.m. zhang and q.l. han, a delay decomposition approach to delay-dependent stability for linear systems with time-varying delays, international journal of robust and nonlinear control, 19: 1922-1930, 2009.
[26] x.l. zhu and g.h. yang, new results of stability analysis for systems with time-varying delay, international journal of robust and nonlinear control, 20: 596-606, 2010.

international journal of computers, communications & control vol. ii (2007), no. 3, pp. 279-287

lorenz system stabilization using fuzzy controllers

radu-emil precup, marius l. tomescu, ştefan preitl

abstract: the paper suggests a takagi-sugeno (ts) fuzzy logic controller (flc) designed to stabilize the lorenz chaotic system. the stability analysis of the fuzzy control system is performed using the barbashin-krasovskii theorem.
this paper proves that if the derivative of the lyapunov function is negative semi-definite for each fuzzy rule, then the controlled lorenz system is asymptotically stable in the sense of lyapunov. the stability theorem suggested here offers sufficient conditions for the stability of the lorenz system controlled by ts flcs. an illustrative example describes the application of the new stability analysis method.
keywords: chaotic systems, fuzzy control, lyapunov functions, nonlinear equations and systems.

1 introduction

chaotic systems exhibit exponential sensitivity to small perturbations and also have a large variety of distinct possible dynamical motions. these properties will be reviewed in this paper along with their consequences and implications for the active control of chaotic systems using small control signals. chaos control refers to a process wherein a tiny perturbation is applied to a chaotic system in order to achieve a desirable (chaotic, periodic, or stationary) behavior [9]. the idea of chaos control was formulated in 1990 at the university of maryland [5], where a method for stabilizing an unstable periodic orbit was suggested. the basic idea is that a significant change in the behavior of a chaotic system can be made by a very small correction of its parameters. there exist three historically earliest and most actively developing directions of research in chaos control: open-loop control based on periodic system excitation, referred to also as nonfeedback control; the method of poincaré map linearization, called also the ott, grebogi and yorke (ogy) method [5]; and the method of time-delayed feedback (pyragas method) [7, 8]. lima and pettini [3] proposed a disturbance-based technique for stabilizing a chaotic system towards a periodic state. in this case the periodicity is fixed by the frequency of a control signal disturbing the parameter space. such a technique has been called "suppression of chaos" or "nonfeedback control".
its implementation can be complicated by the fact that it needs a preliminary learning task of the system response to possible disturbances of variable amplitude. the ogy method [5] stabilizes unstable periodic orbits (upos) found in the chaotic regime via small feedback disturbances applied to an accessible parameter. the control disturbance is applied when the orbit crosses a given poincaré section, such that the trajectory will be close to the stable manifold of the desired upo. in this method, in the limit of zero noise, the orbit of the controlled system is identical to the upo of the uncontrolled system and the feedback disturbance vanishes. a drawback of the ogy method is that it becomes difficult to apply to very fast systems, since it requires a detailed computer-aided analysis of the system at each crossing of the poincaré section. also, noise can result in occasional bursts where the trajectory moves far away from the controlled periodic orbit. an alternative method of feedback stabilization of upos, introduced by pyragas [7], consists of a continuous linear feedback applied at each computational time step. as in the ogy case, in this method the controlled orbit coincides with the upo of the uncontrolled system and the feedback vanishes for zero noise when control is achieved. the feedback procedure can be applied without a priori knowledge of the location of the periodic orbit, using a version in which the feedback term contains a delayed variable whose delay corresponds to the period of the upo. moreover, it is expected that it can be used for fast systems, since no parameters are changed on a fast time-scale, and the method does not require a computer-aided analysis of the system. for some systems this method is robust even in the presence of considerable noise [7]. a disadvantage of pyragas's method is that it achieves control only over a limited range of the parameter space, i.e. a given orbit will eventually become unstable in the controlled system as the parameters are varied more deeply into the chaotic regime. the use of delayed feedback also increases the dimensionality of the system.

copyright © 2006-2007 by ccc publications

the paper is organized as follows. the accepted class of fuzzy logic control systems with takagi-sugeno (ts) fuzzy logic controllers (flcs) is described in the next section. section 3 is focused on the design of stable fuzzy logic control systems based on the new stability analysis method, expressed in terms of a theorem formulated on the basis of the barbashin-krasovskii theorem. then, section 4 performs an analysis of the lorenz equation that exhibits chaotic behavior. section 5 is dedicated to the stable design of a ts flc to stabilize the lorenz chaotic system, and section 6 concludes the paper.

2 accepted class of fuzzy logic control systems

fuzzy logic control has become an important methodology in control engineering because it can offer superior performance indices and a better trade-off between system robustness and sensitivity, which results in better handling of nonlinear control problems than traditional methods. calvo and cartwright [1] introduced the idea of fuzzy control of chaotic systems. hua o. wang and kazuo tanaka proposed a stability design approach for the lorenz system [10], based on ts fuzzy models using a linear matrix inequality (lmi) technique. in [11], oscar calvo proposed a mamdani flc for the control of chaos in chua's circuit. ahmad m. harb and issam al-smadi presented in [11] a mamdani flc to control the lorenz equation and chua's circuit to a stable constant or periodic solution, where a single tuning parameter is chosen in the case of the lorenz system and the flc adjusts this parameter. in this paper the fuzzy logic control system is accepted to consist of a process and a ts flc, as shown in figure 1. the flc consists of r fuzzy rules.
the process of extracting the knowledge of human operators in the form of fuzzy control rules is by no means trivial, nor is the process of deriving the rules based on heuristics and a good understanding of the process and of control systems theory.

figure 1: fuzzy logic control system structure.

let X be a universe of discourse. consider the nonlinear autonomous system of the following form, representing the state-space equations of the controlled process:
$$ \dot x = f(x) + b(x)\,u, \qquad x(t_0) = x_0 \quad (1) $$
where:
− x ∈ X, x = [x1, x2, ..., xn]^T is the state vector,
− f(x) = [f1(x), f2(x), ..., fn(x)]^T and b(x) = [b1(x), b2(x), ..., bn(x)]^T are functions describing the dynamics of the plant,
− u is the control signal applied to the process, calculated by the weighted sum defuzzification method,
− the time variable t has been omitted to simplify the formulation,
− x(t0) is the initial state vector at time t0.

the i-th fuzzy control rule in the fuzzy rule base of the ts flc is of the form (2):
rule i: if x1 is Xi,1 and ... and xn is Xi,n then u = ui(x), i = 1, ..., r, r ∈ ℕ*, (2)
where Xi,1, Xi,2, ..., Xi,n are fuzzy sets that describe the linguistic terms (lts) of the input variables, u = ui(x) is the control output of rule i, which can be a single value or a function of the state vector x, and the function and is a t-norm. each fuzzy rule generates an activation degree
$$ \alpha_i(x(t)) = \mathrm{and}\big( \mu_{i,1}(x_1(t)),\ \mu_{i,2}(x_2(t)),\ \ldots,\ \mu_{i,n}(x_n(t)) \big), \qquad \alpha_i \in [0,1], \quad i = 1, \ldots, r \quad (3) $$
it is assumed that for any x ∈ X in the input universe of discourse there exists, among all rules, at least one nonzero activation degree αi ∈ [0, 1], i = 1, ..., r. the control signal u, which must be applied to the process, is a function of αi and ui. applying the weighted sum defuzzification method, the output of the flc is
$$ u = \frac{\sum_{i=1}^{r} \alpha_i u_i}{\sum_{i=1}^{r} \alpha_i} \quad (4) $$
where r is the total number of rules.
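the inference scheme (2)-(4) can be sketched compactly in code; the triangular membership functions and the two rules below are illustrative assumptions, not taken from the paper:

```python
# weighted-sum defuzzification (4) for a ts flc: each rule contributes its
# output u_i weighted by its activation degree alpha_i, computed with the
# min t-norm as in (3); membership shapes and rules are illustrative only
def tri(x, a, b, c):
    """triangular membership function with support [a, c] and peak at b."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def flc_output(x1, x2, rules):
    """rules: list of (mu1, mu2, u_i) with mu* membership functions."""
    num = den = 0.0
    for mu1, mu2, u_i in rules:
        alpha = min(mu1(x1), mu2(x2))   # t-norm 'and', cf. (3)
        num += alpha * u_i
        den += alpha
    return num / den                     # weighted sum, cf. (4)

# two-rule toy example: 'negative' and 'positive' terms on both inputs
N = lambda x: tri(x, -2.0, -1.0, 0.0)
P = lambda x: tri(x, 0.0, 1.0, 2.0)
rules = [(N, N, +1.0), (P, P, -1.0)]
print(flc_output(-0.5, -0.5, rules))     # only the first rule fires here
```

note that (4) requires at least one nonzero activation degree at the evaluated state, which holds at the point used above.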
3 stability analysis of fuzzy logic control systems

the stability analysis presented in this section is based on the barbashin-krasovskii theorem presented in [2]. this section concentrates on the formulation and proof of theorem 3, which ensures sufficient conditions for the stability of nonlinear systems controlled by ts flcs. the function V(x) = x^T P x is considered, where P ∈ ℝ^{n×n} is a positive definite matrix. from this it results that V is positive definite and has continuous partial derivatives. the derivative of V under the conditions (1) is:
$$ \dot V(x) = \dot x^T P x + x^T P \dot x = \big( f(x) + b(x)u(x) \big)^T P x + x^T P \big( f(x) + b(x)u(x) \big) = F(x) + B(x)\,u(x) \quad (5) $$
where F(x) = f(x)^T P x + x^T P f(x) and B(x) = b(x)^T P x + x^T P b(x).

definition 1. if V(x) = x^T P x is defined on a domain X containing the origin, then for any fuzzy rule the derivative v̇_i = F + B u_i is defined.

proposition 2. for any input x0 ∈ X it results that umin(x0) ≤ u(x0) ≤ umax(x0), where umin(x0) = min(u1(x0), ..., ur(x0)) and umax(x0) = max(u1(x0), ..., ur(x0)).

proof. let x0 ∈ X; then among all rules two rules can be found, with indices p and q, such that up(x0) = umin(x0) and uq(x0) = umax(x0). hence the following result is valid:
$$ u_{\min}(x_0) = \frac{\sum_{i=1}^r \alpha_i(x_0)\, u_{\min}(x_0)}{\sum_{i=1}^r \alpha_i(x_0)} \le \frac{\sum_{i=1}^r \alpha_i(x_0)\, u_i(x_0)}{\sum_{i=1}^r \alpha_i(x_0)} \le \frac{\sum_{i=1}^r \alpha_i(x_0)\, u_{\max}(x_0)}{\sum_{i=1}^r \alpha_i(x_0)} = u_{\max}(x_0) $$
$$ \Rightarrow \quad u_{\min}(x) \le u(x) \le u_{\max}(x), \quad \forall x \in X \quad (6) $$
this property permits the formulation of theorem 3, which outlines the stability analysis approach.

theorem 3. let x = 0 be an equilibrium point of (1). let V(x) = x^T P x be a positive definite function on a domain X containing the origin, such that v̇_i(x) ≤ 0, i = 1, ..., r, x ∈ X. let S = { x ∈ X | v̇(x) = 0 } and suppose that no solution can stay identically in S except the trivial solution x(t) ≡ 0. then, the origin is asymptotically stable.

proof.
it should be proved that v̇ is negative semidefinite under the conditions (1). from the conditions of theorem 3 one may write:
$$ \dot V_i(x) = F(x) + B(x)\,u_i \le 0, \quad i = 1, \ldots, r, \quad x \in X \quad (7) $$
from proposition 2 it is obtained that for any x ∈ X there exist two rules, with indices p and q, such that up(x) = umin(x) and uq(x) = umax(x). three possible cases should be considered as follows:

case 1: if B(x) is strictly positive, from proposition 2 the result is:
$$ u_p(x) \le u(x) \le u_q(x) \ \Rightarrow\ F(x) + B(x)u_p(x) \le F(x) + B(x)u(x) \le F(x) + B(x)u_q(x) \le 0 \ \Rightarrow\ \dot V_p(x) \le \dot V(x) \le \dot V_q(x) \le 0 \quad (8) $$
therefore v̇(x) ≤ 0.

case 2: if B(x) is strictly negative, proposition 2 yields:
$$ u_p(x) \le u(x) \le u_q(x) \ \Rightarrow\ 0 \ge F(x) + B(x)u_p(x) \ge F(x) + B(x)u(x) \ge F(x) + B(x)u_q(x) \ \Rightarrow\ 0 \ge \dot V_p(x) \ge \dot V(x) \ge \dot V_q(x) \quad (9) $$
therefore once more v̇(x) ≤ 0.

case 3: if B(x) = 0, from (7) we have v̇(x) = F(x) ≤ 0.

from the above cases it is justified to conclude that, whatever the value of u is, the result will be v̇ ≤ 0. consequently, all conditions of the barbashin-krasovskii theorem [2] are satisfied, so the equilibrium point at the origin is globally asymptotically stable.

4 properties of the lorenz equations

this section presents an overview of dynamic chaotic processes with focus on the lorenz system, referred to also as the lorenz equation or attractor [4]. modern discussions of chaos are mainly based on the works about the lorenz attractor. the lorenz equation is commonly defined as three coupled ordinary differential equations, expressed in (10), modelling the convective motion of a fluid cell which is warmed from below and cooled from above:
$$ \frac{dx}{dt} = \sigma(y - x), \qquad \frac{dy}{dt} = x(\rho - z) - y, \qquad \frac{dz}{dt} = xy - \beta z \quad (10) $$
where the three parameters σ, ρ, β > 0 are called the prandtl number, the rayleigh number and the physical proportion, respectively. these constant parameters determine the behavior of the system, and the three equations exhibit chaotic behavior, i.e. they are extremely sensitive to initial conditions. a small change in initial conditions leads quickly to large differences in the corresponding solutions. the classic values used to demonstrate chaos are σ = 10 and β = 8/3. it is important to note that x, y, z are not spatial coordinates. the variable x is proportional to the intensity of the convective motion, while y is proportional to the temperature difference between the ascending and descending currents; similar signs of x and y denote that warm fluid is rising and cold fluid is descending. the variable z is proportional to the distortion of the vertical temperature profile from linearity, a positive value indicating that the strongest gradients occur near the boundaries.

the essential properties of the lorenz equation can be summarized as follows:

nonlinearity. the two nonlinearities are xy and xz.

symmetry. the equations are invariant under (x, y) → (−x, −y). in other words, if (x(t), y(t), z(t)) is a solution, (−x(t), −y(t), z(t)) will also be a solution.

volume contraction. the lorenz system is dissipative, i.e. volumes in phase space contract under the flow.

fixed points. in order to solve (10) for the fixed points, let
$$ f(x) = \begin{bmatrix} \sigma(y-x) \\ x(\rho - z) - y \\ xy - \beta z \end{bmatrix} $$
and it is necessary to solve f(x) = 0. it is clear that one of the fixed points is s0 = (0, 0, 0), and with some algebraic operations one may determine that
$$ s_{1,2} = \left( \pm\sqrt{\beta(\rho-1)},\ \pm\sqrt{\beta(\rho-1)},\ \rho-1 \right) $$
are equilibrium points, which are real when ρ > 1.

invariance. the z-axis is invariant, meaning that a solution that starts on the z-axis (i.e. x = y = 0) will remain on the z-axis. in addition, the solution will tend towards the origin if the initial conditions belong to the z-axis.

solutions stay close to the origin. if σ, ρ, β > 0, then all solutions of the lorenz equation will enter an ellipsoid centered at (0, 0, 2ρ) in finite time. in addition, the solution will remain inside the ellipsoid once it has entered.
it follows by definition that the ellipsoid is an attracting set.

5 design of the stable fuzzy logic control system

the design of the fuzzy logic control system with the ts flc starts by rewriting the ordinary differential equation (10) in the following form, representing the state-space equations of the controlled process:
$$ \dot x = \begin{bmatrix} \sigma(x_2 - x_1) \\ x_1(\rho - x_3) - x_2 \\ x_1 x_2 - \beta x_3 \end{bmatrix} + \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix} u, \qquad x(t_0) = x_0 \quad (11) $$
next, the fuzzification module of the ts flc is set according to figure 2, showing the membership functions that describe the linguistic terms (lts) of the linguistic variables x1 and x2. the lts representing "positive", "zero" and "negative" values are denoted by p, z and n, respectively. the inference engine employs the fuzzy logic operators and and or, implemented by the min and max functions, respectively. the inference engine is assisted by the complete set of fuzzy control rules illustrated in table 1, and the weighted sum defuzzification method is utilized. summarizing, the only parameters to be calculated are the consequents ui, i = 1, ..., 9, in the 9 fuzzy control rules.

figure 2: membership functions of x1 and x2.

table 1. fuzzy control rule base

rule    antecedent (x1, x2)    consequent u
1       p, p                   u1
2       n, n                   u2
3       p, n                   u3
4       n, p                   u4
5       p, z                   u5
6       n, z                   u6
7       z, p                   u7
8       z, n                   u8
9       z, z                   u9

theorem 3 will be applied as follows to find the values of ui for which the system (11) can be stabilized with the above described ts flc. let X = [−40, 40] × [−40, 40] × [−40, 40], which contains the origin. the lyapunov function candidate V(x) = (1/2)(x1² + x2² + x3²) is considered, which is a continuously differentiable positive definite function on the domain X. the total derivative of V with respect to time, using (11), is:
$$ \dot V(x) = -\sigma x_1^2 - x_2^2 - \beta x_3^2 + x_1 x_2 (\sigma + \rho) + x_1 u \quad (12) $$
from (12) it is obvious that v̇(0) = 0 ⇔ x = 0, and this implies S = {0}. further on, each fuzzy control rule is analyzed. for rule i the derivative is v̇_i(x) = −σx1² − x2² − βx3² + x1x2(σ+ρ) + x1ui, and the consequent ui is chosen such that v̇_i(x) ≤ 0 on X:

• rule 1 (x1 is p and x2 is p): u1 = −40(σ+ρ) is chosen.
• rule 2 (x1 is n and x2 is n): u2 = 40(σ+ρ) is chosen.
• rule 3 (x1 is p and x2 is n): u3 = 0 is chosen.
• rule 4 (x1 is n and x2 is p): u4 = 0 is chosen.
• rule 5 (x1 is p and x2 is z): u5 = −10(σ+ρ) is chosen.
• rule 6 (x1 is n and x2 is z): u6 = 10(σ+ρ) is chosen.
• rule 7 (x1 is z and x2 is p): u7 = −x2(σ+ρ) is chosen.
• rule 8 (x1 is z and x2 is n): u8 = −x2(σ+ρ) is chosen.
• rule 9 (x1 is z and x2 is z): u9 = −x2(σ+ρ) is chosen.

concluding, due to theorem 3 it results that the system composed of this ts flc and the lorenz process described by (11) is globally asymptotically stable in the sense of lyapunov at the origin.
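the cancellation behind the consequent u = −x2(σ+ρ) used in rules 7-9 can be spot-checked numerically: substituting it into (12) removes the indefinite cross term x1x2(σ+ρ), leaving v̇(x) = −σx1² − x2² − βx3² ≤ 0. a minimal check (an illustration only, not the full nine-rule controller) follows:

```python
import numpy as np

# numerical check of (12): with u = -x2*(sigma+rho), the cross term
# x1*x2*(sigma+rho) cancels and v-dot = -sigma*x1^2 - x2^2 - beta*x3^2.
# parameter values follow section 5 (sigma = 10, rho = 28, beta = 8/3)
sigma, rho, beta = 10.0, 28.0, 8.0 / 3.0

def lorenz_controlled(x, u):
    x1, x2, x3 = x
    return np.array([sigma * (x2 - x1) + u,       # right-hand side of (11)
                     x1 * (rho - x3) - x2,
                     x1 * x2 - beta * x3])

def v_dot(x, u):
    # d/dt of v(x) = (x1^2 + x2^2 + x3^2)/2 along (11) is x . xdot
    return float(x @ lorenz_controlled(x, u))

rng = np.random.default_rng(1)
for x in rng.uniform(-40.0, 40.0, size=(1000, 3)):   # sample the domain X
    u = -x[1] * (sigma + rho)
    expected = -sigma * x[0]**2 - x[1]**2 - beta * x[2]**2
    assert abs(v_dot(x, u) - expected) < 1e-6 * max(1.0, abs(expected))

x0 = np.array([1.0, -1.0, 1.0])                      # initial state of section 5
print(v_dot(x0, -x0[1] * (sigma + rho)))             # negative: v decreases
```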
considering the values of the process parameters σ = 10, ρ = 28, β = 8/3 and the initial state x1(0) = 1, x2(0) = −1, x3(0) = 1, the responses of x1, x2 and x3 versus time in the closed-loop system are shown in figures 3-5, while figure 6 shows the phase portraits.

figure 3: state variable x1 versus time of the lorenz system without flc (a) and with flc (b).
figure 4: state variable x2 versus time of the lorenz system without flc (a) and with flc (b).
figure 5: state variable x3 versus time of the lorenz system without flc (a) and with flc (b).
figure 6: phase portraits of the lorenz system without control (a) and with flc (b).

6 conclusions

the paper has proposed a simple and efficient fuzzy logic control solution employing a ts flc meant for stabilizing the lorenz system. the fuzzy logic controller design is assisted by the stability approach stated and proved in terms of theorem 3. theorem 3 has a general character, guaranteeing sufficient stability conditions for fuzzy logic control systems with ts flcs. this approach decomposes the stability analysis into the analysis of each rule, so the complexity is drastically reduced. the new stability analysis approach is different from lyapunov's theorem and allows more applications. in particular, it is well suited to controlling processes where the derivative of the lyapunov function candidate is not negative definite; therefore, applying theorem 3 to nonlinear processes controlled by ts flcs can be successful for a wide area of nonlinear dynamic systems. digital simulations illustrated in this paper show that the proposed method is simpler than the nonfeedback control method proposed by lima and pettini [3], than the ogy method proposed by ott, grebogi and yorke [5], and than the pyragas method [7]. besides, the controller structure presented in section 5 can be implemented as a low-cost automation solution [6]. further research will be dedicated to offering other low-cost fuzzy solutions for chaotic systems.
Bibliography

[1] Calvo O., Cartwright J. H. E., Fuzzy control of chaos, International Journal of Bifurcation and Chaos, Vol. 8, No. 8, pp. 1743-1747, 1998.
[2] Khalil H. K., Nonlinear Systems, 3rd Edition, Prentice Hall, Englewood Cliffs, NJ, 2002.
[3] Lima R., Pettini M., Suppression of chaos by resonant parametric perturbations, Physical Review A, Vol. 41, pp. 726-733, 1990.
[4] Lorenz E. N., The Essence of Chaos, University of Washington Press, 1993.
[5] Ott E., Grebogi C., Yorke J. A., Controlling chaos, Physical Review Letters, Vol. 64, pp. 1196-1199, 1990.
[6] Precup R.-E., Preitl S., Optimisation criteria in development of fuzzy controllers with dynamics, Engineering Applications of Artificial Intelligence, Vol. 17, No. 6, pp. 661-674, 2004.
[7] Pyragas K., Continuous control of chaos by self-controlling feedback, Physics Letters A, Vol. 170, pp. 421-427, 1992.
[8] Pyragas K., Tamaševičius A., Experimental control of chaos by delayed self-controlling feedback, Physics Letters A, Vol. 180, pp. 99-102, 1993.
[9] Schuster H. G., Handbook of Chaos Control: Foundations and Applications, Wiley-VCH Verlag GmbH, 1999.
[10] Wang H. O., Tanaka K., Fuzzy modeling and control of chaotic systems, in Integration of Fuzzy Logic and Chaos Theory, Springer-Verlag, Berlin, Heidelberg, 2006.
[11] Zhong L., Halang W. A., Chen G. (Eds.), Integration of Fuzzy Logic and Chaos Theory, Springer-Verlag, Berlin, Heidelberg, 2006.

Radu-Emil Precup
"Politehnica" University of Timisoara
Department of Automation and Applied Informatics
Bd. V. Parvan 2, RO-300223 Timisoara, Romania
E-mail: rprecup@aut.utt.ro

Marius L. Tomescu
"Aurel Vlaicu" University
Computer Science Faculty
Complex Universitar M, Str. Elena Dragoi 2, RO-310330 Arad, Romania
E-mail: tom_uav@yahoo.com

Ştefan Preitl
"Politehnica" University of Timisoara
Department of Automation and Applied Informatics
Bd. V.
Parvan 2, RO-300223 Timisoara, Romania
E-mail: spreitl@aut.utt.ro

Received: December 24, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 1, pp. 6-7

Editorial – Special Issue on Collaboration Support Systems (CSS)

H.S. Ko, S.Y. Nof

Recent and emerging advances in computer and information science and technology have created a powerful computing and communication environment. It enables effective interactions and collaboration among groups of people and systems (and systems-of-systems) beyond traditional restrictions of time and space. The evolution of hardware (e.g., pervasive computing devices, wireless sensor networks, nano-electronics) and software (e.g., multi-agent systems, workflow and information integration, interaction models and protocols) technology, and their flexible teaming, has further enabled diverse forms of collaboration. Over the last few decades, numerous collaboration methodologies, tools and applications have emerged in various domains to provide better-quality services, helping to solve domain-specific, highly complex problems. The development of collaboration tools and methodologies has increased the domain knowledge that can be discovered and shared by individuals, as well as the level and intensity of interactions and collaboration, which can dramatically decrease problem complexity and increase solution quality. At the same time, inefficient interactions, task and information overload, and ineffective collaboration remain prevalent. In spite of the considerable progress in collaboration tools and methods, a sound foundation of collaboration science is yet to be established; the lack of such a foundation is a major obstacle to raising the sophistication and anticipated benefits of collaboration methods.
Recently, however, Collaboration Control Theory (CCT) models and techniques have been proposed as a foundation for designing collaboration support systems (CSS), and new features of CSS are being investigated in various areas based on CCT. In order to give readers a significant opportunity to investigate novel and forthcoming problems in collaboration research, the guest editors have invited authors from various disciplines and focused on collaborative design and modeling features in the respective areas. After careful and rigorous review and revision processes, eight articles have been selected for their special quality and relevance to CSS. The selected articles are organized along two main topics in CSS: 1) CSS models and theories; and 2) CSS methods and applications. In the area of CSS models and theories:

• "Swarming models for facilitating collaborative decisions" – Zamfirescu and Filip introduce the use of swarming models (stigmergic mechanisms) to build collaborative support systems for complex cognitive tasks, exemplifying them through an experiment on group decision processes (GDP) in e-meetings.

• "Design of protocols for task administration in collaborative production systems" – Ko and Nof investigate the design of task administration protocols for collaboration support in production systems, where those protocols, as control mechanisms, can manage complicated events in the collaborative task workflow environment and overcome limitations of coordination protocols.

• "Mining authoritativeness of collaborative innovation partners" – Engler and Kusiak present a novel approach to automatically determine the authoritativeness of entities for collaboration and demonstrate the use of a mining schema for identifying collaboration partners over the Internet.
• "Reference architecture for collaborative design" – Huang, Yang, Chen, and Nof present a reference architecture for collaborative design (CD) as a framework for analyzing and supporting CD, then describe and illustrate the dimensions forming the architecture as a cube of design aspect, design stage, and collaboration scope.

The next group of articles focuses on CSS methods and applications:

• "Coordinating aerial robots and unattended ground sensors for intelligent surveillance systems" – de Freitas, Heimfarth, Allgayer, Wagner, Larsson, Pereira, and Ferreira present a system solution enabling interoperability and coordination support for heterogeneous sensor networks composed of low-end ground sensor nodes and mobile sensors carried by autonomous aerial robots.

• "Introducing collaborative practices in small medium enterprises" – Antonelli and Chiabert propose a methodology to evaluate the possibility of using PLM as a framework that exploits collaboration links within an enterprise, based on an exhaustive analysis of the PLM impact on different aspects of the enterprise.

• "A software system development life cycle model for improved stakeholders' communication and collaboration" – Cohen, Dori, and de Haan describe a collaborative software system development life cycle model using Object Process Methodology (OPM), which includes various stakeholders and variables, and considers multiple aspects in collaboratively developing off-the-shelf software.
• "Gaze, posture and gesture recognition to minimize focus shifts for intelligent operating rooms in a collaborative support system" – Wachs describes the design of an intelligent, collaborative system framework, which integrates machine vision, voice recognition and computer graphics-projection techniques to improve operating rooms for surgery through highly intuitive, natural and multimodal interactions.

These articles are based on related scientific investigations around the world and reflect well the broad and challenging area of CSS. The guest editors wish to thank all the contributing authors, the referees, and the editorial office colleagues who have all endeavored to bring this special issue on CSS to you.

H.S. Ko and S.Y. Nof
West Lafayette, Indiana, USA
Special Issue Guest Editors

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 2, pp. 167-177

Hierarchical Distributed Reasoning System for Geometric Image Generation

Nicolae Ţăndăreanu, Mihaela Verona Ghindeanu, Sergiu Andrei Nicolescu
University of Craiova, Romania
Department of Mathematics and Computer Science
A.I. Cuza St., No. 13, 200585
E-mail: ntand@rdslink.ro, mghindeanu@yahoo.com

Abstract: The concept of hierarchical reasoning system was introduced in [5], where an intuitive method to build such systems based on their inputs is given. In this paper we formalize several concepts which open a possible research line concerning the use of these structures. A hierarchical reasoning system H is a directed graph organized on several levels, such that each node of level j is a hyper-schema of order j. As a mathematical structure, H is an abstract one, and a special kind of formal computation is introduced for it. As a result of this computation we obtain a set F(H) of formulas. We explain what we understand by an interpretation of H and define its corresponding semantical computation.
By means of an interpretation I(H) for H, and applying the rules of the semantical computation, each element w ∈ F(H) becomes some object I(w) of a given space. We exemplify these concepts and show that for two distinct interpretations I1(H) and I2(H) of the same system H, a given formula w ∈ F(H) is transformed into a sentence I1(w) of a natural language, whereas I2(w) is a geometric image. A short description of a Java implementation of a hierarchical system generating images is also given in a separate section. By examples we show that the mechanism introduced in this paper allows us to model distributed knowledge. Finally, several open problems are specified.

Keywords: semantic schema, interpretation, hyper-schema, distributed reasoning system, geometrical image generation

1 Introduction

Various kinds of mechanisms for image synthesis have been presented and implemented on computers. The panel of mathematical models for this subject includes rewriting systems and graph-based models. Picture-processing grammars ([2]), picture grammars ([3]), stochastic grammars ([14]) and L-systems are some of the rewriting systems used to process images. L-systems are a class of string rewriting mechanisms originally developed by a biologist, A. Lindenmayer, in 1968 ([7]). The original emphases were on plant topology: spatial relations between cells or larger plant modules. L-systems are a practical tool for generating fractal forms. Today these models are applied in architecture, physiology ([1]) and music. In order to interpret an L-system as music, the LMUSe system ([9]) maps any of the turtle's 3D movements, its orientation directions (forward, up, and left), its drawing line length, and its thickness into musical pitches, note durations and volume. A great number of research works and practical implementations have confirmed the interest of mathematicians and computer scientists in developing and applying the methods of graph theory.
These methods have been applied to obtain new knowledge representation models and to process images. A very productive notion with large applications in knowledge representation is that of conceptual graph, a notion introduced in the literature by J. F. Sowa ([8], [10]). Several applications of graph-based methods can be found in [6] (low-level processing of digital images, learning algorithms for high-level computer vision and pattern recognition). The concept of semantic schema was introduced in [11] as an extension of semantic networks. This structure is obtained by means of a labeled graph and a Peano algebra built over the edge labels. Since then, many applications of this structure have been presented (new semantics in logic programming, knowledge representation for intelligent dialog systems, etc.). In [12] we defined a new mechanism for generating images, similar to edge rewriting in that both approaches can be used to define complex images based on other, simpler images. In the mentioned paper the concept of hierarchical distributed reasoning system was introduced. Each leaf of the system is given by a semantic schema. The other nodes are hyper-schemas ([12]). We presented an intuitive method to obtain geometrical images. The leaves receive the inputs of the system as semantic schemas and, by appending proper interpretations, they obtain the graphical illustrations of the received inputs. In this manner the leaves obtain the initial images. Then, at the upper levels, these images are combined by hyper-schemas to obtain complex images. We obtained a bottom-up method to build images from initiators. In this paper we obtain the following results:

• Starting with the concept of hierarchical distributed reasoning system (HDR system) introduced in [12], in Section 3 we define a formal computation in such a structure.
As a result of this computation we obtain a set F(H) of formulas for an arbitrary HDR system H. This is the formal computation in an HDR system.

• An HDR system H is an abstract structure. In Section 4 we introduce the concept of interpretation for H. By means of an interpretation I(H) for H, each element of F(H) becomes some object of a given space. This gives the semantical computation. Both the formal and semantical computations are exemplified. We show that for two distinct interpretations I1(H) and I2(H) of the same system H we can generate sentences in a natural language giving the reasoning conclusions, and geometrical images, respectively.

• A short description of a Java implementation of an HDR system is also given in Section 5.

• By examples we show that the mechanism introduced in this paper allows us to model distributed knowledge.

• The last section contains the conclusions and future works. Several open problems are specified in this section.

2 Basic concepts

Consider a symbol θ of arity 2. A θ-semantic schema ([11]), or shortly a θ-schema, is a system S = (X, A0, A, R), where:

• X is a finite non-empty set of symbols named object symbols;
• A0 is a finite non-empty set of elements named label symbols, and A0 ⊆ A ⊆ Ā0, where Ā0 is the Peano θ-algebra generated by A0;
• R ⊆ X × A × X is a non-empty set of relations which fulfills the following conditions:
1. (x, θ(u, v), y) ∈ R ⇒ ∃z ∈ X : (x, u, z) ∈ R, (z, v, y) ∈ R
2. θ(u, v) ∈ A, (x, u, z) ∈ R, (z, v, y) ∈ R ⇒ (x, θ(u, v), y) ∈ R
3. {α | ∃(x, α, y) ∈ R} = A

An element of R0 = R ∩ (X × A0 × X) is a regular arc of S. We denote by Ded(S) the least set satisfying the following properties ([13]):

• if (x, a, y) ∈ R0 then ([x, y], a) ∈ Ded(S);
• if ([xi, ..., xk], u) ∈ Ded(S) and ([xk, ..., xr], v) ∈ Ded(S), i < k < r, and θ(u, v) ∈ A, then ([xi, ..., xr], θ(u, v)) ∈ Ded(S).

An element of Ded(S) is a deductive path of S.
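For small schemas, Ded(S) can be computed directly as a fixpoint. The sketch below is one possible Python encoding (representing labels of A0 as strings and θ-terms as nested tuples ("theta", u, v) is our choice, not the paper's):

```python
def deductive_paths(R, A):
    """Compute the least set Ded(S) by fixpoint iteration: start from the
    regular arcs, then join ([xi..xk], u) with ([xk..xr], v) whenever
    theta(u, v) belongs to A."""
    # regular arcs: labels from A0 (plain strings in this encoding)
    ded = {((x, y), a) for (x, a, y) in R if not isinstance(a, tuple)}
    changed = True
    while changed:
        changed = False
        for (p1, u) in list(ded):
            for (p2, v) in list(ded):
                joined = (p1 + p2[1:], ("theta", u, v))
                if p1[-1] == p2[0] and ("theta", u, v) in A and joined not in ded:
                    ded.add(joined)
                    changed = True
    return ded

# Schema S1 from the paper's example: x1 -a-> x2 -b-> x3, with theta(a, b) in A
R1 = {("x1", "a", "x2"), ("x2", "b", "x3")}
A1 = {"a", "b", ("theta", "a", "b")}
assert (("x1", "x2", "x3"), ("theta", "a", "b")) in deductive_paths(R1, A1)
```

Condition 3 on A guarantees termination for finite R, since only labels of A can appear and each join strictly extends a path inside a finite node set.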
Let us consider the schemas S1 = (X1, A01, A1, R1) and S2 = (X2, A02, A2, R2). In the remainder of this section we describe a new structure which captures a special kind of cooperation between S1 and S2. If d1 = ([x, ..., y], u) ∈ Ded(Si) and d2 = ([y, ..., z], v) ∈ Ded(S3−i), where i ∈ {1, 2}, then we say that d1 is connected to the right by d2, or that d2 is connected to the left by d1. We say that d1 is connected by d2 if d1 is connected to the right or to the left by d2. We consider the sets of deductive paths L1 ⊆ Ded(S1) and L2 ⊆ Ded(S2). We say that L1 ∪ L2 is a pairwise connected set of deductive paths if every deductive path of Li is connected by some deductive path of L3−i. For each i ∈ {1, 2} we consider a set Vi of symbols such that Vi ∩ (A1 ∪ A2) = ∅. We also consider a set Ei such that Ei ⊆ Xi × Vi × Xi, card(Ei) = card(Li) and E1 ∩ E2 = ∅, together with a bijective mapping gi : Li −→ Ei such that gi(d) = (x, e, y), where d = ([x, ..., y], θ(u, v)) ∈ Li. This mapping transforms each deductive path ([x, ..., y], θ(u, v)) of Li into a regular arc (x, e, y). Shortly, we say that the path ([x, ..., y], θ(u, v)) is designated by (x, e, y). We can now define a cooperating structure of hyper-schemas. A hyper-schema of order zero is a semantic schema. Consider the hyper-schemas S1 and S2 of order zero. A hyper-schema of order one over S1 and S2, obtained by means of L1 and L2, is a θ-schema S which includes the regular arcs obtained from L1 and L2 ([12]). We denote by Hyp1({S1, S2}) the set of all hyper-schemas of first order over S1 and S2. In general, we write S ∈ Hypk({S1, S2}) and we name S a hyper-schema of order k if S1 and S2 are hyper-schemas of order j ≤ k − 1 and at least one of them has order k − 1. An HDR system ([12]) is a tuple H = (Q1, Q2, ..., Qk), where k ≥ 2 and

• Q1 = {S1, ..., Sn1}, n1 > 1, constitutes the first level of the system. The entities {S1, ..., Sn1} are hyper-schemas of order zero. The set Q1 gives the leaves of H.
• Q2 = {Sn1+1, ..., Sn2}, n2 ≥ n1 + 1, gives the second level of the system, and Sn1+1, ..., Sn2 are hyper-schemas of order 1. More precisely, for every m ∈ {n1 + 1, ..., n2} there are m1, m2 ∈ {1, ..., n1}, m1 ≠ m2, such that Sm ∈ Hyp1({Sm1, Sm2}).

• For j ∈ {3, ..., k}, Qj = {Snj−1+1, ..., Snj} represents the j-th level of the system, where nj ≥ nj−1 + 1. For every m ∈ {nj−1 + 1, ..., nj} there are m1 ∈ {nj−2, ..., nj−1} and m2 ∈ {1, ..., nj−1} such that Sm ∈ Hypj−1({Sm1, Sm2}).

3 Formal computations in HDR systems

Suppose that H = (Q1, Q2, ..., Qk) is an HDR system. The components of H are the hyper-schemas S1, ..., Snk. We can visualize H as a graph structure. In order to obtain this structure we represent each hyper-schema by a node and we draw two directed arcs, from Sr to Sj and from Sr to Sm, if Sr ∈ Hypp({Sj, Sm}) for some p. The structure obtained in this manner is not a tree. This can be observed in Figure 1: there are two distinct paths from S7 to S2 and there is no root of this structure.

Figure 1: The graph structure of H.

For each i ∈ {1, ..., nk} we consider that Si is given by the tuple Si = (Xi, A0i, Ai, Ri) and we denote R0i = Ri ∩ (Xi × A0i × Xi). For each r ∈ {n1 + 1, ..., nk} such that Sr is a hyper-schema over Sj and Sm in H we consider:

• the connected sets Lj,r ⊆ Ded(Sj) and Lm,r ⊆ Ded(Sm);
• the sets Ej,r, Em,r and the transformational mappings gj,r : Lj,r −→ Ej,r, gm,r : Lm,r −→ Em,r.

By the assumptions of the previous section we have R0r ⊇ Ej,r ∪ Em,r. We denote N0r = Ej,r ∪ Em,r. Obviously we have the following property:

Proposition 1. N0i = ∅ if and only if Si is a leaf of H.

For a symbol h of arity 1 we consider the set

M = ⋃_{i=1}^{nk} { h([x, y], a) | (x, a, y) ∈ R0i \ N0i },

where we used the notation h([x, y], a) instead of h(([x, y], a)). We consider the symbols σ1, ...
, σnk of arity 2 and denote by HH the Peano {σ1, ..., σnk}-algebra generated by M. We consider the alphabet Z including the symbols σi, the elements of Xi, the elements of Ai, the left and right parentheses, the square brackets [ and ], the symbol h and the comma. As in the theory of formal languages, the set Z* contains all the words over Z. Because a hyper-schema is a semantic schema, we have the following property:

Proposition 2. If Si is a hyper-schema of H and ([x1, ..., xk+1], θ(u, v)) ∈ Ded(Si), then there is r, uniquely determined, such that ([x1, ..., xr+1], u) ∈ Ded(Si) and ([xr+1, ..., xk+1], v) ∈ Ded(Si).

Definition 1. Let w1, w2 ∈ Z*. We define the following binary relation on Z*, denoted by ⇒H:

• for i ∈ {1, ..., nk}, if (x, e, y) ∈ R0i \ N0i, then w1([x, y], e)w2 ⇒H w1h([x, y], e)w2;
• for i ∈ {1, ..., nk}, if (x, e, y) ∈ N0i, then w1([x, y], e)w2 ⇒H w1dw2, where d is the deductive path designated by (x, e, y);
• if ([x1, ..., xk+1], θ(u, v)) ∈ Ded(Si), i ∈ {1, ..., nk}, ([x1, ..., xr+1], u) ∈ Ded(Si) and ([xr+1, ..., xk+1], v) ∈ Ded(Si), then: w1([x1, ..., xk+1], θ(u, v))w2 ⇒H w1σi(([x1, ..., xr+1], u), ([xr+1, ..., xk+1], v))w2.

The reflexive and transitive closure of ⇒H is denoted by ⇒*H. We denote F(Si) = {w ∈ HH | ∃d ∈ Ded(Si) : d ⇒*H w} and F(H) = ⋃_{i=1}^{nk} F(Si). Let us exemplify this computation. We consider the hyper-schemas S1 and S2 of order zero from Figure 2 and the hyper-schema of order 1 from Figure 3.

Figure 2: Semantic schemas S1 and S2 of order zero.

If we take
Figure 3: Hyper-schema S3 ∈ Hyp1({S1, S2}).

• L1,3 = {([x1, x2, x3], θ(a, b))}, L2,3 = {([x3, x6, x5], θ(b, c))}
• E1,3 = {(x1, e1, x3)}, E2,3 = {(x3, e2, x5)}
• g1,3([x1, x2, x3], θ(a, b)) = (x1, e1, x3), g2,3([x3, x6, x5], θ(b, c)) = (x3, e2, x5)

then we obtain the following computations:

• ([x1, x3, x5], θ(e1, e2)) ⇒H σ3(([x1, x3], e1), ([x3, x5], e2))
• ([x1, x3], e1) ⇒H ([x1, x2, x3], θ(a, b)) ⇒H σ1(([x1, x2], a), ([x2, x3], b)) ⇒*H σ1(h([x1, x2], a), h([x2, x3], b)) ∈ F(S1)
• ([x3, x5], e2) ⇒H ([x3, x6, x5], θ(b, c)) ⇒H σ2(([x3, x6], b), ([x6, x5], c)) ⇒*H σ2(h([x3, x6], b), h([x6, x5], c)) ∈ F(S2)

In conclusion, ([x1, x3, x5], θ(e1, e2)) ⇒*H σ3(σ1(h([x1, x2], a), h([x2, x3], b)), σ2(h([x3, x6], b), h([x6, x5], c))), and the last formula is an element of F(H), where H = (Q1, Q2), Q1 = {S1, S2} and Q2 = {S3}.

4 Semantical computations in HDR systems

The semantical computation in an HDR system H transforms every formula of F(H) into an object of some space. In this section we describe this transformational process. Let us consider the HDR system H = (Q1, Q2, ..., Qk) and an element w ∈ F(H). If d = ([x1, ..., xk], θ(u, v)) ∈ Ded(Si) and d ⇒*H w, then we write sort(w) = θ(u, v).

Definition 2. An interpretation for H is a system I = (Ob, ob, Alg), where:

• Ob is a set of objects;
• ob : X −→ Ob, where X = ⋃_{i=1}^{nk} Xi, is a mapping that "interprets" each node as an object;
• Alg = ⋃_{i=1}^{nk} {Alg^i_u}_{u∈Ai}, where Alg^i_u is an algorithm with two input arguments and one output argument such that, if gj,k([x, ..., y], θ(u, v)) = (x, e, y), then Alg^k_e = Alg^j_θ(u,v).

Definition 3. The valuation mapping valH of the HDR system H is defined as follows:

• if w = h([x, y], a) ∈ F(H) then valH(w) = ⋃_{i=1}^{nk} {Alg^i_a(ob(x), ob(y))};
• if w = σj(w1, w2) ∈ F(H), w1 ∈ F(H), w2 ∈ F(H) and sort(w) = α, then valH(w) = {Alg^j_α(o1, o2) | ok ∈ valH(wk), k = 1, 2}.

In order to exemplify the computations we consider again the HDR system H from Section 3. We define an interpretation of H by means of some sentential forms. Such a structure is a sentence containing two variables. If we substitute each variable by an object, then a sentential form becomes a sentence in a natural language. We shall consider the following sentential forms:

p1(x, y) = "x is the father of y"; p2(x, y) = "x is the mother of y"; p3(x, y) = "x is the brother of y"; p4(x, y) = "x likes to eat y"; q1(x, y) = "x is the grandmother of y"; q2(x, y) = "a brother of x likes to eat y"; r(x, y) = "a nephew of x likes to eat y".

We consider the following algorithms:

algorithm Alg^1_a(o1, o2) { return p1(o1, o2) };
algorithm Alg^1_b(o1, o2) { return p2(o1, o2) };
algorithm Alg^2_b(o1, o2) { return p3(o1, o2) };
algorithm Alg^2_c(o1, o2) { return p4(o1, o2) };
algorithm Alg^1_θ(a,b)(o1, o2) { if o1 = p1(t1, t2), o2 = p2(t2, t3) then return q1(t1, t3) };
algorithm Alg^2_θ(b,c)(o1, o2) { if o1 = p3(t1, t2), o2 = p4(t2, t3) then return q2(t1, t3) };
algorithm Alg^1_e1(o1, o2) { return q1(o1, o2) };
algorithm Alg^1_e2(o1, o2) { return q2(o1, o2) };
algorithm Alg^3_θ(e1,e2)(o1, o2) { if o1 = q1(t1, t2), o2 = q2(t2, t3) then return r(t1, t3) }

Consider the interpretation I1 = (Ob1, ob1, Alg1) of the system H, where we specify only the entities needed to exemplify the computation:

• Ob1 = {Peter, Helen, John, Sorin, pizza}
• ob1(x1) = Peter, ob1(x2) = Helen, ob1(x3) = John, ob1(x6) = Sorin, ob1(x5) = pizza
• Alg1 = {Alg^1_a, Alg^1_b, Alg^2_b, Alg^2_c, Alg^1_θ(a,b), Alg^2_θ(b,c), Alg^3_e1, Alg^3_e2, Alg^3_θ(e1,e2)}, where Alg^3_e1 = Alg^1_θ(a,b) and Alg^3_e2 = Alg^2_θ(b,c).
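This interpretation can be executed directly. The sketch below is a speculative Python rendering in which facts are tag-prefixed tuples and each combination algorithm returns None when its pattern does not match (our conventions, not the paper's); combine implements the pairwise valuation of Definition 3.

```python
# Combination algorithms; tags p1..p4, q1, q2, r mirror the sentential forms.
def alg_theta_ab(o1, o2):
    if o1[0] == "p1" and o2[0] == "p2" and o1[2] == o2[1]:
        return ("q1", o1[1], o2[2])     # grandmother-of, as in the paper

def alg_theta_bc(o1, o2):
    if o1[0] == "p3" and o2[0] == "p4" and o1[2] == o2[1]:
        return ("q2", o1[1], o2[2])     # a brother of x likes to eat y

def alg_theta_e1e2(o1, o2):
    if o1[0] == "q1" and o2[0] == "q2" and o1[2] == o2[1]:
        return ("r", o1[1], o2[2])      # a nephew of x likes to eat y

def combine(alg, set1, set2):
    """Valuation of sigma_j(w1, w2): apply alg to all operand pairings."""
    return {alg(o1, o2) for o1 in set1 for o2 in set2} - {None}

val_a  = {("p1", "peter", "helen")}                          # h([x1,x2],a)
val_b1 = {("p2", "helen", "john"), ("p3", "helen", "john")}  # h([x2,x3],b)
val_b2 = {("p2", "john", "sorin"), ("p3", "john", "sorin")}  # h([x3,x6],b)
val_c  = {("p4", "sorin", "pizza")}                          # h([x6,x5],c)

alpha = combine(alg_theta_ab, val_a, val_b1)
beta = combine(alg_theta_bc, val_b2, val_c)
result = combine(alg_theta_e1e2, alpha, beta)
assert result == {("r", "peter", "pizza")}
tag, x, y = next(iter(result))
print(f"a nephew of {x} likes to eat {y}")
```

The non-matching pairings simply drop out, which reproduces the singleton sets obtained in the hand computation that follows.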
It is not difficult to observe that for the formula w = σ3(σ1(h([x1, x2], a), h([x2, x3], b)), σ2(h([x3, x6], b), h([x6, x5], c))) = σ3(α, β) from the last part of the previous section we obtain the following computations:

valH(α) = {Alg^1_e1(o3, o4) | o3 ∈ valH(h([x1, x2], a)), o4 ∈ valH(h([x2, x3], b))}
valH(h([x1, x2], a)) = {Alg^1_a(Peter, Helen)} = {p1(Peter, Helen)}
valH(h([x2, x3], b)) = {Alg^1_b(Helen, John), Alg^2_b(Helen, John)} = {p2(Helen, John), p3(Helen, John)}

Therefore valH(α) = {Alg^1_e1(p1(Peter, Helen), p2(Helen, John)), Alg^1_e1(p1(Peter, Helen), p3(Helen, John))} = {q1(Peter, John)}.

valH(β) = {Alg^2_e2(o5, o6) | o5 ∈ valH(h([x3, x6], b)), o6 ∈ valH(h([x6, x5], c))}
valH(h([x3, x6], b)) = {Alg^1_b(John, Sorin), Alg^2_b(John, Sorin)} = {p2(John, Sorin), p3(John, Sorin)}
valH(h([x6, x5], c)) = {Alg^2_c(Sorin, pizza)} = {p4(Sorin, pizza)}

Therefore valH(β) = {Alg^2_e2(p2(John, Sorin), p4(Sorin, pizza)), Alg^2_e2(p3(John, Sorin), p4(Sorin, pizza))} = {q2(John, pizza)}.

Finally, from valH(α) and valH(β) we deduce valH(w) = {Alg^3_θ(e1,e2)(q1(Peter, John), q2(John, pizza))} = {"a nephew of Peter likes to eat pizza"}.

We observe that the conclusion obtained by H can be obtained neither by S1 alone nor by S2 alone. This explains why H is named a distributed system.

Figure 4: The image generated by I2.

We now give a short description of another interpretation I2 for the same system H. As a result we obtain geometrical images.
• Ob2 = {1, (3, 3), (3, 1.5)}
• ob2(x1) = 1, ob2(x2) = ob2(x6) = (3, 3), ob2(x3) = 1, ob2(x5) = (3, 1.5)
• Alg^1_a(p, q) { return the interior of the circle with radius p and center q }
• Alg^1_b(p, q) { return the interior of the square centered at p, with sides of length 2*q parallel to the coordinate axes }
• Alg^1_θ(a,b)(α, β) { if α = Alg^1_a(p, q) and β = Alg^1_b(q, r) then return β \ α }
• Alg^2_b(p, q) { return the exterior of the circle with radius p and center q }
• Alg^2_c(p, q) { return the interior of the rectangle centered at p, with side lengths specified by q, parallel to the coordinate axes }
• Alg^2_θ(b,c)(α, β) { if α = Alg^1_θ(a,b)(p, q) and β = Alg^2_θ(b,c)(q, r) then return β ∩ α }
• Alg^3_θ(e1,e2)(α, β) { if α = Alg^2_a(p, q) and β = Alg^2_c(q, r) then return β ∪ α }

For the same formula w ∈ F(H) as in the previous computation, the object valH(w) given by I2 is shown in Figure 4.

5 A Java implementation

If we denote by A the set consisting of some geometrical object names, then each system input is a word w = a1 ... ak over the alphabet V = A ∪ {+, −} with the following properties:

• ai = +/− means a left/right rotation by a specific angle, denoted by δ, and drawing a line along the current direction;
• ai = oj means drawing the graphical illustration of the object oj such that its entry direction lies on the current direction.

In our implementation, each geometrical object used in the generation method is an instance of a class named Object. Graphically, it is a representation of a figure inside a square. Every instance of this class can have one of the following types: circle, triangle, star and square, corresponding to the figure it consists of. Other members of this class are the entry direction and the exit direction, related to some corner of the object. The corner corresponding to the entry direction becomes the entry point of the object; similarly for the exit point. The main routine of the algorithm is createHDRS (Algorithm 2).
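Before the pseudocode, the gist of the level-building process can be sketched in a few lines of Python (class and function names are our stand-ins for the paper's Java types; to avoid duplicates we only join paths in the end-to-start orientation, although Algorithm 3 accepts either orientation):

```python
class HyperSchema:
    """Node of the HDR graph; leaves have order 0."""
    def __init__(self, children, order, maximal_path):
        self.children = children          # the two sub-schemas (empty for a leaf)
        self.order = order
        self.maximal_path = maximal_path  # tuple of node names

def connected_paths(p1, p2):
    """Algorithm 3: two maximal paths are connected if one ends where
    the other begins."""
    return p1[-1] == p2[0] or p2[-1] == p1[0]

def build_levels(leaf_paths, max_order=10):
    """Gist of createHDRS/createHypSchs: level j pairs an order-(j-1)
    hyper-schema with any schema whose maximal path continues its own."""
    schemas = [HyperSchema([], 0, p) for p in leaf_paths]
    for order in range(1, max_order + 1):
        new = [HyperSchema([a, b], order, a.maximal_path + b.maximal_path[1:])
               for a in schemas if a.order == order - 1
               for b in schemas
               if b is not a and a.maximal_path[-1] == b.maximal_path[0]]
        if not new:                       # mirrors "while createHypSchs(order)"
            break
        schemas.extend(new)
    return schemas

# The paper's example: S1 with maximal path x1..x3, S2 with x3..x5
hdrs = build_levels([("x1", "x2", "x3"), ("x3", "x6", "x5")])
assert [s.maximal_path for s in hdrs if s.order == 1] == \
       [("x1", "x2", "x3", "x6", "x5")]
```

On the two example leaves this produces exactly one order-1 hyper-schema and then stops, because no remaining pair of maximal paths is connected.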
The construction of the system starts by defining the schemas of the agents (steps 1-4). The hyper-schemas of order one corresponding to the managers of the second level are constructed in steps 7-14. The condition for a hyper-schema to exist over two schemas is that their maximal paths are connected deductive paths. This property is verified using the routine connectedPaths (Algorithm 3). If the second level of the system was successfully defined (the if condition of step 15), then the process of creating new levels in the HDRS continues using the while loop of step 17. The hyper-schemas of orders greater than 2 are created using the routine createHypSchs (Algorithm 4). The geometrical objects that are used for the image generation process are introduced using the first window of the application.

Figure 5: First window of the application.

For each object the user must specify the type, the entry and exit points (the corners are numbered starting from the bottom-left) and, related to them, the entry and the exit directions. Also, using the controls of the first window, the input descriptions can be edited (see Figure 5). The second window of the application gives the outputs provided by the system's reasoning components (see Figure 6). It consists of three buttons and a panel. The application can draw at most 1000 images, with at most 50 geometrical objects per image.

6 Conclusions and future works

In this paper we formalized the syntactical and semantical computations in an HDR system. We exemplified these computations, and for an HDR system H we gave two interpretations: one interpretation generates phrases and the other generates geometrical images. These examples give an idea of the generative power of our mechanism. They also show that distributed reasoning can be modeled by an HDR system.
A short description of a Java implementation of an HDR system generating images has also been given. We intend to develop the applications of HDR systems. First, we intend to use mobile agents to process such systems ([4]). Second, we intend to use HDR systems in e-learning. The basic idea comes from the fact that a link in an HTML document gives a reference to another document of similar structure.

Figure 6: Second window of the application: (a) the initiators and some images obtained by the managers of the Q2 and Q3 levels; (b) images obtained at the 4th level of the system.

Algorithm 2 Procedure createHDRS
procedure createHDRS
1.  for i ← 1, noCmd
2.    call create-schema(commands[i], schema[i], agent[i])
3.    maximalPath[i] ← schema[i].getMaximalPath()
4.  endfor
5.  noAg ← noCmd
6.  noKM ← noAg + 1
7.  for i, j ← 1, noAg; j ≠ i
8.    if connectedPaths(maximalPath[i], maximalPath[j])
9.      call create-hyperSch(hypSch[noKM], schema[i], schema[j])
10.     hypSch[noKM].order ← 1
11.     maximalPath[noKM] ← hypSch[noKM].getMaximalPath()
12.     noKM ← noKM + 1
13.   endif
14. endfor
15. if noKM > noAg + 1
16.   order ← 2
17.   while createHypSchs(order)
18.     order ← order + 1
19.   endwhile
20. endif
endprocedure

Algorithm 3 Function connectedPaths
function connectedPaths(path1, path2)
1. if path1.lastNode = path2.firstNode
2.   return true
3. endif
4. if path1.firstNode = path2.lastNode
5.   return true
6. endif
7. return false
endfunction

Algorithm 4 Function createHypSchs
function createHypSchs(order)
1.  newHypSch ← false
2.  for i ← noKM − 1, noAg
3.    if hypSch[i].order ≠ order − 1
4.      continue
5.    endif
6.    for j ← 1, noKM − 1; j ≠ i
7.      if connectedPaths(maximalPath[i], maximalPath[j])
8.        newHypSch ← true
9.        if j ≤ noAg
10.         call create-hyperSch(hypSch[noKM], hypSch[i], schema[j])
11.       else
12.         call create-hyperSch(hypSch[noKM], hypSch[i], hypSch[j])
13.
        endif
14.     hypSch[noKM].order ← order
15.     maximalPath[noKM] ← hypSch[noKM].getMaximalPath()
16.     noKM ← noKM + 1
17.   endif
18.  endfor
19. endfor
20. return newHypSch
endfunction

Bibliography

[1] Allen M., Prusinkiewicz P., DeJong T. (2004) Systems for modeling the architecture and physiology of growing trees: the L-PEACH model, Proceedings of the 4th International Workshop on Functional-Structural Plant Models, pp. 220-225.
[2] Chang Shi-Kuo (1970) The analysis of two-dimensional patterns using picture processing grammars, Proceedings of the Second Annual ACM Symposium on Theory of Computing, pp. 206-216.
[3] Drewes F., Ewert S., Klempien-Hinrichs R., Kreowski H.-J. (2003) Computing raster images from grid picture grammars, Journal of Automata, Languages and Combinatorics, Vol. 8, Issue 3, pp. 499-519.
[4] Dzitac I., Bărbat B. E. (2009) Artificial Intelligence + Distributed Systems = Agents, Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV, No. 1, pp. 17-26.
[5] Ghindeanu M. (2008) Constructing architectures for an hierarchical distributed reasoning system based on its inputs, International Multi-Conference on Engineering and Technological Innovation, USA, pp. 231-234.
[6] Kandel A., Bunke H., Last M. (Eds.) (2007) Applied Graph Theory in Computer Vision and Pattern Recognition, Springer, Studies in Computational Intelligence 52.
[7] Lindenmayer A. (1968) Mathematical models for cellular interaction in development, Parts I and II, Journal of Theoretical Biology (18), pp. 280-315.
[8] Priss U., Corbett D., Angelova G. (Eds.) (2002) Conceptual Structures: Integration and Interfaces, 10th Int. Conf. on Conceptual Structures, ICCS 2002.
[9] Sharp D. (1998) LMUSe version 0.7b, http://www.geocities.com/athens/academy/8764/lmuse/lmusetxt.html
[10] Sowa J. F.
(1984) conceptual structures: information processing in mind and machine, addison-wesley
[11] ţăndăreanu n. (2004) semantic schemas and applications in logical representation of knowledge, proceedings of the 10th international conference on cybernetics and information technologies, systems and applications, usa, vol. iii, pp. 82-87
[12] ţăndăreanu n., ghindeanu m. (2008) hierarchical semantic structures applied in automatic image generation, proceedings of the 11th iasted international conference on intelligent systems and control, isbn: 978-0-88986-777-2
[13] ţăndăreanu n., ghindeanu m. (2008) path-based reasoning in semantic schemas, annals of the university of craiova, mathematics and computer science series, vol. 35, pp. 171-181
[14] zhu song-chun, mumford d. (2006) a stochastic grammar of images, foundations and trends in computer graphics and vision, vol. 2, issue 4, pp. 259-362

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 3, pp. 318-318

book review
florin gheorghe filip

introduction to discrete event systems by christos g. cassandras and stéphane lafortune, 2008, springer science + business media llc, xxiv + 776 p., isbn: 978-0-387-33332-8.

discrete event systems (des) represent a class of dynamic systems whose states can take values only in a discrete-value space and can change only as a result of asynchronous occurrences of planned or spontaneous events. at present, there are ever more numerous subclasses of discrete event systems. queuing systems, computer and communication systems, manufacturing and traffic systems are only a few examples of des described in the book. this is a second, revised and expanded edition of a book written by professors cassandras of boston university and lafortune of the university of michigan.
it builds upon cassandras' book entitled "discrete event systems: modelling and performance analysis", which was published in 1993 by irwin and aksen associates and received in 1999 the "harold chestnut" prize of ifac (international federation of automatic control) for the best textbook in control engineering. the authors perceived the necessity of the book as a consequence of several observations, as follows. there are more and more complex real-life technical objects which can be viewed as instances of the des class. they are to be understood, their operations are to be controlled and, if possible, optimized. the traditional methods, which are based on differential or difference equations, have proved effective in controlling many systems whose state trajectories are represented by functions continuous in time, but they are not adequate for des, whose sample paths (state trajectories) are described by piece-wise constant functions of time. consequently, there is a need to develop new modelling frameworks, analysis techniques, design tools, testing methods, and systematic control and optimization procedures for this new generation of highly complex systems. to fulfil their objective the authors adopt a multidisciplinary approach by building on control theory ("for performance optimization via feedback control"), computer science ("for modelling and verification of event-driven processes"), and operations research ("for analysis and simulation of stochastic des"). the authors also propose modelling frameworks, describe new analysis and control methods which are specific to des, and introduce new paradigms that allow combining mathematical tools with the processing of experimental data. in the preface, the authors emphasize the critical role of the electronic computer in performing various activities such as system analysis, design, and control.
the authors are well aware of the existence of numerous books and papers addressing various subclasses of des, such as language and automata theory, petri nets, queuing models, markov chains, discrete event simulation, perturbation analysis and so on, all using specific representations of the objects studied. the authors aim at proposing in the book a unified modelling framework with a view to enabling a coherent and systematic study of the objects which belong to almost all the des subclasses mentioned above. consequently, two discrete event modelling formalisms are utilized throughout the book to represent the state transition structures: automata and, to a lesser extent, petri nets. in section 1.3.3, the authors introduce three levels of abstraction in the study of des (untimed, or logical; timed; and stochastic) to describe sequences of events. the levels of abstraction are used to gradually refine the presentation of the notions and methods contained in the book. the first chapter of the book contains an introduction to system concepts and parallel presentations of the main concepts of continuous variable dynamic systems (cds) and discrete event driven systems. it also introduces hybrid systems, which behave most of the time as cds, except at certain time moments when discrete events cause discontinuities in the state trajectory. chapters 2, 3 and 4 contain a study of des at the logical (or untimed) level of abstraction. language models of des and the representation of languages by automata are described in chapter 2. software tools for the analysis of des are presented too. supervisory control issues are studied in chapter 3, which also contains a detailed presentation of decentralized control. in chapter 4, petri net concepts are addressed.
chapter 5 refines the models presented in chapters 2 (automata) and 4 (petri nets) to include time through the clock mechanism and gives an introduction to hybrid systems. chapters 6-11 utilise the third (stochastic) level of abstraction. three chapters (7, 8, and 9) contain aspects which are presented in the "traditional" manner of stochastic models based on probability theory. markov chains and classical queuing theory models are presented in chapters 7 and 8, respectively. control and decision models based on markov chains are described in chapter 9. chapters 10 and 11 present several concepts and techniques which rely heavily on the use of the computer and do not require adopting the assumptions which were necessary when using classical stochastic models. chapter 10 contains an introduction to discrete event simulation, including a presentation of languages and corresponding software products (in section 10.4). chapter 11 presents sensitivity analysis and concurrent estimation methods, including the new "infinitesimal perturbation analysis" (ipa). the book also contains auxiliary material such as: a) a review of probability theory, and b) a description of the ipa estimator. a web site (http://vita.bu.edu/cgc/book), which is continuously maintained, can help the reader in his/her study. this is a high quality book, rich in content, up-to-date, and well written. the presentation style utilized throughout the book is a formal one. the various examples presented and the references made to relevant web sites increase the value and usability of the book. the book can be of great value for various categories of senior undergraduate and postgraduate students and for people who are interested in control, communications, and computer science, as well as in manufacturing and industrial engineering. therefore, i warmly recommend it to the readers of the international journal of computers, communications, and control (ijccc).
florin gheorghe filip
romanian academy
125, calea victoriei, 010071 bucharest-1, romania
e-mail: ffilip@acad.ro

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 1, pp. 125-137

swarming models for facilitating collaborative decisions
c.b. zamfirescu, f.g. filip

constantin-bala zamfirescu
lucian blaga university of sibiu, faculty of engineering, department of computer science and automatic control
romania, 69121 sibiu, 17 emil cioran
e-mail: zbc@acm.org

florin gheorghe filip
1. romanian academy ince and bar, romania, 010071 bucharest, 125 calea victoriei, and
2. nat. institute for informatics - ici, romania, 011455 bucharest, 8-10 b-dul maresal al. averescu
e-mail: filipf@acad.ro

abstract: the paper highlights the computational power of swarming models (i.e., stigmergic mechanisms) for building collaborative support systems for complex cognitive tasks such as the facilitation of group decision processes (gdp) in e-meetings. unlike traditional approaches that minimize the cognitive complexity by incorporating the facilitation knowledge into the system, stigmergic coordination mechanisms minimize the complexity by providing the system with emergent functionalities that are shaped by the environment itself, through the possibility to structure it in terms of high-level cognitive artefacts. this is illustrated by conducting a socio-simulation experiment for an envisioned collaborative software tool that acts as a stigmergic environment for modelling the gdp. the results are superior when the users are allowed to increase the representational complexity of a gdp model with cognitive artefacts that support guidance and action in the conceptual problem space.

keywords: collaborative working environments, group decision support systems, facilitation, social simulation, stigmergy, swarming models of computation.
1 introduction

the simplicity of swarming models is becoming ever more popular in the design of decentralized systems that are developed to run in open, dynamic, and unknown environments. inspired by the behaviour of social insects, marine animals, birds, and even humans, the concept covers a broad spectrum of mechanisms able to generate an intelligent collective behaviour. these mechanisms are simply identified as stigmergic coordination [1]. the most cited example of swarming models is the food foraging behaviour in ant colonies [2]. each ant senses the signs (pheromones) in its environment and acts in accordance with them, without any direct communication with other ants from the colony. if there is no sign in the environment, the ant executes a randomized search for food. when an ant discovers a food source, it drops a smelling chemical substance (a pheromone) on its way back to the nest while carrying a bit of food; thus the ant creates a pheromone trail between nest and food source. when an ant senses the pheromone trails, it will follow the most intense one to the food source, since the intensity of the pheromone signifies the shortest path toward the food discovered until that point in time; when the ant arrives at the food source, it will return with food while depositing more pheromones, intensifying the trail. this simple reactive behaviour of each ant results in an emergent intelligent behaviour of the colony, which is able to find the shortest path from the nest to the food source without any central coordination. this behaviour is achieved despite the limited sensorial aptitude of an ant, which senses only the local pheromone trails, with no mental plan on how to find the shortest path and no knowledge about the environment in which it acts.
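the foraging loop described above can be sketched in a few lines. the sketch below is a toy "double bridge" experiment with two routes between nest and food; the route lengths, evaporation rate and deposit rule are illustrative choices of ours, not taken from the paper:

```python
import random

def double_bridge(trips=2000, evaporation=0.01, seed=1):
    # each trip, an ant picks a route with probability proportional to
    # its pheromone level; pheromone evaporates a little everywhere and
    # is deposited on the chosen route, more per trip on the short one
    rng = random.Random(seed)
    length = {"short": 1.0, "long": 2.0}
    pheromone = {"short": 1.0, "long": 1.0}
    for _ in range(trips):
        total = pheromone["short"] + pheromone["long"]
        route = "short" if rng.random() < pheromone["short"] / total else "long"
        for r in pheromone:
            pheromone[r] *= 1.0 - evaporation    # evaporation
        pheromone[route] += 1.0 / length[route]  # deposit on the way back
    return pheromone

p = double_bridge()
```

the positive feedback (more pheromone, more traffic, more pheromone) drives the colony toward the short route, while evaporation keeps the long one from being locked in.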
consequently, the ant's behaviour is an emergent property induced at the same time by two exogenous factors: 1) the environment (the surface of the terrain and the evaporation mechanism for the pheromones), and 2) the ants' actions over the environment (the pheromone trails). despite its behavioural simplicity (i.e., stimuli-response rules), a single stigmergic agent can emulate any turing machine and can execute any symbolic or sub-symbolic algorithm proposed by the ai (artificial intelligence) research mainstream [3]. nevertheless, most computer applications address problems outside the domain of human cognitive abilities, serving merely as cognitive support systems through an active and semantically rich environment [4]. therefore swarming models of computation are seen as a feasible approach to construct systems that are not limited to the classical optimization problem of finding the shortest route [5], but are able to support human decisions as well [6, 7, 8]. one application domain with a high level of cognitive complexity is the facilitation of gdp in e-meetings. the complexity associated with the construction, coordination and execution of a gdp is well recognized in the research field of group decision support systems (gdss) [9]. for this reason gdss has seldom been a full success story and a widely adopted technology, as had been foreseen by its pioneers. all the applications developed to support group facilitation in e-meetings follow the traditional centralized approach, where the system explicitly codifies the facilitation knowledge. examples include software tools that embed knowledge about the collaborative patterns of interaction [10] and workflows [11] for the most frequently used gdps.
these applications basically suffer from the same obstacles met in the traditional ai mainstream, such as [12]: 1) the restrictions on codifying the human's knowledge into the computing system; 2) the lack of self-development capabilities for this knowledge; 3) the black-box perspective over a system disconnected from the environment from which the relevant knowledge is extracted. in contrast to the traditional approach that codifies the facilitation knowledge into the gdss, this paper illustrates how the stigmergic mechanisms of swarming models may be employed to build emergent and self-organizing functionalities that support group facilitation in e-meetings. in this particular case, the stigmergic coordination mechanisms are implemented over the conceptual environment of the gdp modelling space, which is exploited by the users of an e-meeting system. unlike conventional approaches that minimize the cognitive complexity associated with the construction and execution of a gdp by incorporating the facilitation knowledge into the system, stigmergic coordination mechanisms minimize it by structuring a shared conceptual environment populated with the cognitive artefacts that represent the basic skills of conducting an e-meeting. in this way the users collectively construct and interpret the facilitation knowledge through successive uses of the system. the remaining part of this paper is organized as follows. the next section presents a brief analysis of the innate relationship between distributed cognition and stigmergy, as it has been presented by many authors within the web 2.0 technology mainstream. section 3 describes the main components of an envisioned collaborative software tool that acts as a stigmergic environment for modelling the gdp: the structure of the semantic environment, the low-level behaviour of the users in interacting with this environment, and the high-level cognitive constructs that may be employed by the users to structure the gdp.
these components are implemented and tested in a socio-simulation experiment which is described in section 4. the experimental results show clear self-organizing capabilities, but also a strong dependency of the system's performance on the users' ability to structure the stigmergic environment. from the engineering standpoint of constructing purposeful facilitation tools for e-meetings, these results are discussed and summarized in the last section.

2 stigmergy and cognition

most of the research in cognitive science is rooted in the basic assumption that cognition is purely an internal process of representation and manipulation of knowledge, disconnected from the environment. nevertheless, recent studies in cognitive science reveal the essential role of the environment (physical or artificial) in mediating knowledge, by facilitating its external representation and the information exchange among these representations [13]. it becomes clear 'that individuals are socially and culturally situated and that the environment needs to be considered in order to understand cognition' [14]. by admitting the critical role of the environment in cognition, researchers became increasingly aware of the relationship between cognition and stigmergy. even if the term stigmergy has primarily been used for typically reactive (non-rational) agents, its relationship with cognition was investigated for the first time by susi and ziemke [15]. the authors conclude with the assertion that the conceptual framework of stigmergy offers a common denominator for social science theories (i.e., activity theory; situated and distributed cognition). for example, a cave painting is an emblematic case of stigmergic coordination, where people used the physical environment to indirectly communicate their knowledge in ancient human society.
the relevance to cognition of the stigmergic coordination mechanisms in human society is illustrated by parunak in several examples of social activities [6]. albeit the use of stigmergic mechanisms may be observed in various social activities, it is more evident in the digital world. in a comprehensive study of collaborative support systems, elliot [16] found that stigmergy is a coordination mechanism inherent not only in collaborative processes over physical environments, but also in a range of collaborative support systems. for instance, the plethora of applications that are considered to be web 2.0 technology (especially media such as wikis and community blogging) are generally recognized to be stigmergic systems. moreover, applications such as google's pagerank system, ebay's online auctioning, amazon's recommender systems [6], wikipedia.org, open source software and the multiplayer social environment second life [17] employ stigmergic coordination mechanisms to exhibit the functionalities of an intelligent collective behaviour. in these stigmergic systems the users exploit their digital environment through the use of engineered artefacts that may be annotated with symbolic information representing the humans' cognition [7]. given that the essential capability of any stigmergic system is to transfer the cognitive complexity from the humans to the environment [18], the problem-solving capabilities of the users decisively depend on how the problem is represented in the digital environment. a standard representation of the problem in the environment is realized as a composition of cognitive artefacts linked in a weighted graph. basically, this graph signifies a navigation map that supports the cognitive effort to find and reach any artefact from the place where it is needed.
the artefacts commonly stand for the possible states of the problem, while the links are the set of possible actions that guide the decision process (the conceptual navigation) from one state to another in the problem space. similar to the intensity of the pheromone trails in the case of real ants, these actions are weighted in order to discriminate the most effective ones. as a result, the improvement of problem-solving ability requires two corresponding processes [8]: 1) the augmentation of the environment with additional states and actions, to increase the accuracy of the problem representation, and 2) the improvement of the preference function for an action, in order to compensate for the expansion of the exploration space. the wide employment of stigmergic patterns of interaction in collaborative working environments has been triggered by their fundamental advantage of preventing the humans' cognition from being exposed to the complexity of the environment [19]. as users interact only locally, there is no need for task allocation, the tasks being preferentially performed by the most expert users, since they are the most attracted to act and finalize a task with minimal effort [20]. moreover, there is no need for prediction, since the environment records actions in the problem space and unexpected events are automatically traced through the outcome of the users' actions over the environment. all these advantages make the stigmergic models of computation a suitable approach to support the cognitive complexity of facilitating a gdp, a process that runs in a dynamic, open and uncertain environment.

3 a simulation model for group decisions in e-meetings

the wide range of tools that support group decisions in e-meetings falls under the gdss general umbrella term. a gdss is defined as an interactive computer-based environment that supports a concerted and coordinated team effort towards the completion of joint tasks [21].
a gdss is composed of a set of highly configurable collaborative "tools" (e.g., brainstorming, voting and ranking, multi-criteria analysis, etc.) that require a high level of expertise to be used effectively for complex decisions [22]. the strong relationship between the gdp outcome and the presence of a skilful facilitator to direct the joint decision process is thoroughly presented in many field studies of gdss research [23]. the lack of access of many organizations to a well-trained gdss facilitator is recognized to be one of the main obstacles limiting the adoption of gdss technology [24]. to reduce the dependence on the facilitator, the participant-driven gdss was proposed as the most promising research direction to leverage the skills and abilities of each group member [25]. however, this approach is highly constrained by the cognitive complexity associated with the construction, coordination and execution of a gdp by inexperienced users. to overcome the problem of cognitive complexity, briggs and de vreede [24] introduced the thinklet (tl) concept as a discrete facilitation unit that integrates a specific tool, its configuration and a script to use it: a predefined interaction protocol among users that is enforced and mediated by a specific collaborative tool. this concept was anticipated by the declarative model of the experienced decision maker (medm), which was proposed in the late 1980s to help the user of a dss build the model, select the appropriate solver, and evaluate various solutions provided by the computerized algorithms in so-called "mixed knowledge" dss [22, 26]. the tls are considered to be the smallest pieces of essential knowledge needed to design collaborative processes.
examples include [24]: strawpoll (evaluate), used to reveal the agreements or disagreements within a group; leafhopper (diverge), used when the participants do not know in advance the topics for discussion, or have different interests or levels of expertise; expertchoice (organize), used when the ideas of the group are organized by a single participant; reviewreflect (converge), used when different reviews of a document from group members should be brought to an agreement; etc. with tls, the conceptual model for a gdp takes the form of a shared plan of collaborative actions [24, 27]. each collaborative action is an interaction protocol embodied in a tl. as with any plan, the model for a gdp may be hierarchically decomposed into sub-plans at different levels of abstraction. this conceptual structure of a gdp model is acknowledged in every application domain of gdss technology, such as project management [28], user requirement elicitation [29], crisis response management [30], scenario design [31], risk identification [32], etc. with a view to envisioning a collaborative software tool that acts as a stigmergic environment for modelling the gdp (in the same way in which a collaborative cad software acts as a stigmergic environment for architectural design [33]), we developed a socio-simulation model that mimics the users' conceptual 'navigation' over the semantic structure of the problem space for facilitating e-meetings. as for any stigmergic system, the simulation model entails the description of the agents' behaviour and of the structure of the shared environment in which the agents are localized and over which they move. for the gdss domain, the agents are the users responsible for defining, executing and evaluating a gdp model, which is a path through the conceptual space of the available tls.
the environment is the collaborative facilitation tool that supports the conceptual representation of the problem space, comprising all the tls discovered and documented by the users' community (so far there are over 70 tls acknowledged in the literature [34]).

3.1 the semantic environment for facilitating the e-meetings

according to parunak [6], a stigmergic environment assumes the definition of three main components: 1) topology, 2) states, and 3) processes. structurally, the topology may be viewed as a fully connected weighted graph that codifies the facilitation knowledge of group decisions in e-meetings. this knowledge presumes correlated information among the users and the tls, reflecting the users' evaluation of the performance of a tl (a node in the graph) relative to a problem type. the performance is stored, for each problem type, in a variable associated with each edge of the graph. the problem type is simply codified through a unique id, to distinguish among different performances when they are read, during the modelling phase of the gdp, or modified, after the gdp has been executed and evaluated by the agents. the evaluation of a gdp model entails a subjective assessment of the model against some performance criteria after its execution. typically, a gdp's performance may be quantified in terms of efficiency, effectiveness and users' satisfaction, as illustrated in several studies from the gdss research field [35]. the performances from all the graph's edges describe the state of the environment over time. usually, the environment executes a set of processes on the variables (such as aggregation and evaporation in the case of ants).
for our case, we apply a simple weighted additive rule to simulate the aggregation of performances:

p_jk(tl_k, t) = p_jk(tl_k, t-1) + up_jk(tl_k) / w    (1)

where: t represents the temporal component of the model, which is incremented by one for each successive use of the gdss; k is the tl's identification index from the set of tls used to model the gdp; up_jk(tl_k) is the user's performance evaluation of the k-th tl, seen from the side of tl j, at moment t; p_jk(tl_k, t) and p_jk(tl_k, t-1) are the new and previous values of the (collective) performance stored on the edge between tls j and k; w is a tuning parameter, arbitrarily chosen, which weights the impact of the last evaluation.

3.2 the agents' behaviour over the semantic environment

the agents are the users who interact with the envisioned collaborative tool to model the gdp. conceptually, at any point in time an agent is "located" in a node (tl) of the cognitive environment of the problem space, performing one of the following basic actions: 1) it evaluates the preference for the next possible tl (or tls) to be executed, given the current execution context of the gdp; 2) it selects the next best tl (or group of tls) for further completing the gdp model; 3) it executes the tl (or group of tls) from the model, and finally 4) it evaluates the performance of the executed tls. the evaluation activity is simulated using formula (1), while the first three actions are simulated with luce's selection axiom [36]:

P_jk = e^(p_jk(tl_k)/T) / Σ_{i=1..m} e^(p_ji(tl_i)/T)    (2)

where P_jk represents the preference for an alternative tl, i.e. the probability of selecting tl k from tl j; i is the index of the tls connected from the side of node j (in fact all the m tls available in the problem space, as long as the graph is fully connected); and T is a parameter used to define the deviation from a purely rational behaviour.
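equations (1) and (2) are straightforward to implement; the following is a minimal, hypothetical rendering (the function names and the default w are our own choices, not from the paper):

```python
import math
import random

def update_performance(p_prev, user_eval, w=10.0):
    # eq. (1): fold a user's evaluation of a tl into the collective
    # performance stored on an edge, damped by the tuning parameter w
    return p_prev + user_eval / w

def select_next_tl(edge_performances, temperature=0.7):
    # eq. (2): luce's selection axiom (a boltzmann/softmax choice);
    # low temperature -> near-deterministic, high -> near-random
    weights = [math.exp(p / temperature) for p in edge_performances]
    total = sum(weights)
    probs = [wt / total for wt in weights]
    r, acc = random.random(), 0.0
    for idx, pr in enumerate(probs):
        acc += pr
        if r <= acc:
            return idx, probs
    return len(probs) - 1, probs
```

note how the normalization in (2) makes unexploited tls lose preference whenever another edge's performance grows, which is what mimics pheromone evaporation.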
formula (2) is the most common model of stochastic decisions, due to its correlation with psycho-social observations of human behaviour in several domains. as a result of the normalization, the preferences for the unexploited tls diminish after each performance update. this mechanism replicates the pheromone evaporation process of real ants (e.g., even if a tl has been positively evaluated after an execution of a gdp model, the associated preference will decrease once a better alternative is discovered and more frequently used). the uncertainty associated with the construction of preferences is generally modelled in equation (2) with the parameter T, which ranges between 0 (when the selection is deterministic, as in the ideal case of a perfectly informed decision) and 1 (when the selection is completely random, as in the case of a completely irrational decision). note that luce's selection axiom does not specify the reasons for the uncertainty, which for the modelling of a gdp may cover any aspect of complexity, unfeasibility, cost or even refusal to evaluate the performance of a tl after its execution.

3.3 navigation strategies over the semantic environment

the agents, who reflect the users in modelling a gdp, engage in means-ends reasoning activities to achieve the group decision goal. during the execution of the model they must be able to adapt to the changes and uncertainties associated with both the execution and the decision goal. moreover, when the decision problem is not completely clear, or too complex to be fully tackled, the users define intermediate sub-goals to be subsequently achieved. as a result, the design of a gdp model is most often done incrementally and interleaved with its execution.
table 1: different modelling strategies to construct a gdp model

design strategy (ds) | execution | problem's goal | codification
ds1                  | certain   | stable         | the problem type is codified through a unique id
ds2                  | uncertain | stable         | the problem type is codified as a variation from the current state of execution to the desired one
ds3                  | uncertain | unstable       | the problem type is codified as a variation from the current state of execution to any future possible state

in table 1 we have summarized three basic design strategies (ds), with direct implications on the way the gdp model is decomposed on different levels of abstraction. these are:

• ds1, which corresponds to the traditional use of a gdss, where the gdp model is predefined by the facilitator. in this case, the user provides a complete structure of the gdp model, the facilitator having a complete vision over the execution plan for the gdp. it includes all the necessary collaborative actions (in the form of tls) together with their precedence constraints. this design strategy relates to the hypothetical conditions where the execution context remains stable in time as regards the tl execution's outcome and the decision's objectives. thus, each problem type is identically codified in all the edges that connect the tls used in modelling the gdp, and relates to a low level of semantic structuring of the environment.

• ds2, which corresponds to problem types with stable objectives but an uncertain execution outcome for the tls of the gdp model. in this case, after the execution of each tl, the remaining gdp needs to be adjusted. this design strategy is codified in the conceptual graph of gdp modelling with different ids for each sub-problem, corresponding to the variation from the current state of execution to the desired one.

• ds3, which relates to complex plans of actions for the gdp model, where both the outcome of an activity and the decision's objectives are unstable during the execution.
in this case, when the decision problem is not entirely clear, or too complex for designing an entire gdp model, the users define intermediate sub-goals that are subsequently dealt with. this design strategy is codified in the conceptual graph for the gdp modelling with different ids for each sub-problem, corresponding to the variation from the current state of execution to any future possible one.

4 experimental results

to evaluate the design strategies for modelling the gdp, we conducted a virtual experiment following the research methodology proposed by carley [37], implementing in the netlogo multi-agent simulation environment [38] the model described in the previous section. in the experiment the users ("turtles") engage in the facilitation of e-meetings, trying to define the gdp model for a problem type by moving in the conceptual graph of tls (the nodes and edges are also implemented as "turtles"). the number of tls that compose the graph is arbitrarily chosen from the interface (the "num-tls" variable in figure 1), while their utilities for a certain problem type are predefined with random values between 0 and 1 when the experiments are initialized. note that the netlogo implementation includes some additional variables required to analyse whether the model presents results similar to those reported in traditional ethnographical studies. this issue is beyond the scope of this paper and is detailed in [39].

figure 1: the interface of the model in the netlogo environment

in the next sections, the normalized performance of the gdp models and its associated entropy are presented for 100 successive explorations (iterations) under the three design strategies (the "planning-degree" variable in figure 1): ds1, ds2 and ds3. an exploration stands for a complete execution cycle of a gdp.
It includes three consecutive phases: 1) finding a suitable model through the successive selection (using equation (2)) of the TLs that compose the GDP for the given problem type; 2) executing the identified model and assessing its performance by reading and averaging the predefined utility values of all the TLs that compose the GDP model; 3) evaluating the model by updating the performance value (using equation (1)). The statistics are aggregated from 30 experiments for a relatively simple problem type of 5 successive TLs. The parameter t from equation (2) is set to 0.7 (the "pheromone-sensitivity" variable from Figure 1) to favour a faster convergence rate in finding a suitable solution in a problem space composed of 70 TLs.

From the engineering viewpoint, the design strategies have direct implications for the way in which the cognitive environment for modelling the GDP is structured. As described before, the structure of the environment is simply reflected in the possibility to semantically decompose the problem into sub-problems on different levels of abstraction. In our implementation, this design issue implies recording with specific IDs the performance of each sub-problem type that emerges from the decomposition process.

4.1 The Impact of the Modelling Strategies on the Model's Performance

Figure 2 shows the aggregate performance (a relative number between 0 and 1, resulting from averaging the predefined utility values of all the TLs that compose the GDP model) from 30 experiments of a GDP modelling process for the defined design strategies DS1, DS2 and DS3. As might be expected, the performance fits an exponential function, a typical behaviour for a stigmergic system [40, 41, 42].
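The three-phase exploration cycle can be sketched in Python as follows. This is a minimal stand-in: equations (1) and (2) are not reproduced in this excerpt, so the pheromone-sensitive selection and the smoothing update below are plausible placeholder forms (with an assumed smoothing rate `alpha`), not the paper's exact formulas:

```python
import random

def explore(utilities, perf, n_tls=5, t=0.7, alpha=0.5, rng=random):
    """One exploration, i.e. a complete GDP execution cycle (sketch).

    utilities: predefined utility in [0, 1] for each TL (set at init);
    perf:      recorded performance per TL, updated across explorations;
    t:         stand-in for the pheromone-sensitivity parameter of eq. (2);
    alpha:     stand-in smoothing rate for the update of eq. (1).
    """
    # Phase 1: find a model by successively selecting the TLs that
    # compose the GDP; preference grows with recorded performance.
    model = []
    for _ in range(n_tls):
        tls = list(perf)
        weights = [max(perf[k], 1e-6) ** (1.0 / t) for k in tls]
        model.append(rng.choices(tls, weights)[0])
    # Phase 2: execute and assess, averaging the predefined utilities.
    performance = sum(utilities[k] for k in model) / len(model)
    # Phase 3: evaluate, updating the recorded performance of the TLs used.
    for k in model:
        perf[k] = (1 - alpha) * perf[k] + alpha * performance
    return model, performance
```

Iterating this cycle (100 explorations over a space of 70 TLs in the paper's setting) reinforces high-utility TLs, which is what produces the exponential-looking performance curves of Figure 2.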
Figure 2: The GDP models' performance for the defined modelling strategies

Figure 3: The distribution of the GDP models' performance for the three modelling strategies

Like any heuristic model, the stigmergic coordination mechanisms do not guarantee finding an optimal solution, but a near-optimal or acceptable one. As a consequence, from one experiment to another there are some variations in the convergence values of the performance. Figure 3 illustrates, in box-and-whisker diagrams, the distribution of performance for the same experimental data depicted in Figure 2. The three strategies show different performance and convergence towards an optimal solution. In contrast with DS1, DS2 benefits from prior experience not only in relation to the entire problem type but also from the intermediate solutions used to model the sub-problems into which the initial problem has been decomposed. In addition, DS3 increases the granularity by adding the opportunity to decompose the problem's objective. As a consequence, the figures show the influence of problem-type decomposition on the GDP model's performance. As might have been expected, DS3 shows the best performance results and a low inconsistency among the identified solutions for a GDP model. However, DS2 converges faster to an optimal solution and to a lower inconsistency among the feasible solutions (the deviation of the average solution from those with maximal and minimal performance). This can be explained by the additional constraint of having stable objectives during the modelling process.

4.2 The Cognitive Complexity Associated with Each Modelling Strategy

The self-organization of relations between TLs (i.e., the performance update after successive evaluations) entails a decrease of freedom, due to the emergence of contextual constraints that reduce the probability of selecting some TLs (i.e., the preference for the available TLs as defined in equation (2)).
For a problem type, the degree of freedom corresponds to the probabilistic distribution of preferences over the selection alternatives, which is equivalent to the Shannon normalized entropy [40, 43]. The Shannon normalized entropy for the selection of a TL is given by:

E(p_{jk}) = - \sum_{k=1}^{m} p_{jk} \ln(p_{jk}) / \ln(m)   (3)

where p_{jk} represents the preference, i.e. the selection probability, of the TL k from the TL j; k is the index for the TLs connected from node j (in fact, all the m TLs available in the problem space). When the recorded performance is equal for all the available modelling alternatives, the user considers the entire problem space when selecting a feasible TL (the probabilities from equation (3) being equally distributed entails an entropy equal to 1). Conversely, when the recorded performance favours a single alternative, the user will have no freedom in the selection of the best TL (all the probabilities from equation (2) being 0 except for the best alternative, which is 1, entails an entropy equal to 0). Thus, the entropy associated with TL selection is a measure of the cognitive complexity of modelling the GDP. Moreover, it is a local metric that can be computed for each TL-selection activity in modelling the GDP. Figure 4 shows the cognitive complexity associated with GDP modelling for the design strategies DS1, DS2 and DS3. The data are obtained for the same experimental settings as introduced at the beginning of this section. Because this measure is computed on the basis of the local data for each selection action (the performances available on the edges from the current TL), the figure corresponds to the average of the entropies over all the TL selection actions needed to complete the GDP model (5 successive TLs, in this case). Similarly to the distribution of the GDP models' performance, compared with DS1 the normalized entropy for DS2 and DS3 converges faster to 0.
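Equation (3) is straightforward to compute; the sketch below mirrors it directly (terms with zero probability contribute nothing, by the usual convention 0·ln 0 = 0):

```python
import math

def normalized_entropy(prefs):
    """Shannon normalized entropy of the TL-selection preferences,
    as in equation (3): E = -sum_k p_k ln(p_k) / ln(m), where m is
    the number of TLs available from the current node."""
    m = len(prefs)
    return -sum(p * math.log(p) for p in prefs if p > 0) / math.log(m)
```

A uniform distribution gives entropy 1 (full freedom of selection), while a distribution concentrated on a single TL gives entropy 0 (no freedom), matching the two limiting cases discussed above.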
For DS3, the gradual increase in the complexity of the representations for a GDP model in the conceptual problem space (through the definition of different sub-plans on different semantic levels) makes the design of the GDP process more manageable, in the sense that it reduces the overall complexity of representing the GDP model. Nevertheless, to reduce the cognitive complexity of modelling the GDP, the users should be supported in using this representation for guidance and action in the conceptual problem space. This entails the reflection of the abstract representations into the conceptual environment through an increased semantic complexity (by adding new problem types, codified with distinct IDs in our implementation), which facilitates the synergy among partially overlapping GDPs. Only in this way may the users move from one sub-plan to another, or from low-level detailing with TLs to a high-level plan, while being fully supported by the stigmergic coordination mechanisms at the same time.

Figure 4: The normalized entropy of the GDP modelling for the three design strategies

5 Summary and Conclusions

Stigmergic mechanisms are widely employed in human society, from the classical examples of large-scale complex adaptive systems such as stock markets, supply logistics and cultural memes to the more recent range of collaborative working environments such as wikis, Google, social networking and open-source software. Continuing our work on achieving inter-paradigmatic synergy between symbolic and sub-symbolic reasoning for large-scale complex systems [44, 45], the paper highlighted the computational power of stigmergic coordination mechanisms to build collaborative support systems for complex decisions such as the facilitation of GDPs in e-meetings.
Unlike conventional approaches that minimize the cognitive complexity by incorporating the facilitation knowledge into the system, stigmergic coordination mechanisms minimize it by offering emergent functionalities that are built up not only from the users' actions (through the selection of the right modelling components) but also from the environment itself (through the possibility to structure it in terms of high-level cognitive artefacts such as plans or sub-plans). This was demonstrated by implementing and testing, in a socio-simulation experiment, an envisioned collaborative software tool that acts as a stigmergic environment for modelling the GDP. The results show superior outcomes when the users are supported to gradually increase the complexity of the representations for a GDP model, which is in turn reflected in the relational complexity of the conceptual problem space.

Bibliography

[1] P.P. Grassé. La théorie de la stigmergie: essai d'interprétation du comportement des termites constructeurs, Insectes Sociaux, 6: 41-81, 1953.
[2] P. Valckenaers, H. Van Brussel, Hadeli, O. Bochmann, B. Saint Germain, C. Zamfirescu. On the design of emergent systems: an investigation of integration and interoperability issues, Engineering Applications of Artificial Intelligence, 16(4): 377-393, 2003.
[3] H.V.D. Parunak, S.A. Brueckner. The cognitive aptitude of swarming agents, http://www.newvectors.net/staff/parunakv/casa.pdf, 2009.
[4] M.A. Rodriguez. Faith in the algorithm, part 1: beyond the Turing test, http://faithinthealgorithm.net/, 2009.
[5] S.C. Negulescu, C.V. Kifor, C. Oprean. Ant colony solving multiple constraints problem: vehicle route allocation, International Journal of Computers, Communications and Control, 3(4): 366-373, 2008.
[6] H.V.D. Parunak. A survey of environments and mechanisms for human-human stigmergy, Lecture Notes on Artificial Intelligence, 3830: 163-186, Springer, 2006.
[7] A. Ricci, A.
Omicini, M. Viroli, L. Gardelli, E. Oliva. Cognitive stigmergy: towards a framework based on agents and artifacts, Lecture Notes in Computer Science, 4389: 124-140, Springer, 2007.
[8] F. Heylighen, J. Bollen. The world-wide web as a super-brain: from metaphor to model, Proceedings of Cybernetics and Systems '96, Austrian Society for Cybernetics, 917-922, 1996.
[9] G.J. de Vreede, J.A. Boonstra, F. Niederman. What is effective GSS facilitation? A qualitative inquiry into participants' perceptions, Proceedings of the Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2002.
[10] G.L. Kolfschoten, W. Veen. Tool support for GSS session design, Proceedings of the Annual Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2005.
[11] A.V. Lopez, Q. Booker, N.S. Shkarayeva, R.O. Briggs, J.F. Nunamaker Jr. Embedding facilitation in group support systems to manage distributed group behavior, Proceedings of the Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2002.
[12] R.A. Brooks. Intelligence without representation, Artificial Intelligence Journal, 47: 139-159, 1991.
[13] E. Hutchins. Cognition in the Wild, MIT Press, Cambridge, 1995.
[14] J. Rambusch, T. Susi, T. Ziemke. Artefacts as mediators of distributed social cognition: a case study, Proceedings of the Annual Conference of the Cognitive Science Society, 1113-1118, Erlbaum, Mahwah, NJ, 2004.
[15] T. Susi, T. Ziemke. Social cognition, artefacts, and stigmergy: a comparative analysis of theoretical frameworks for the understanding of artefact-mediated collaborative activity, Cognitive Systems Research, 2(4): 273-290, 2001.
[16] M.A. Elliott. Stigmergic collaboration: a theoretical framework for mass collaboration, PhD thesis, Centre for Ideas, Victorian College of the Arts, The University of Melbourne, 2007.
[17] M.A. Elliott.
Stigmergic collaboration: the evolution of group work, M/C Journal, 9(2), http://journal.media-culture.org.au/0605/03-elliott.php, 2006.
[18] H.V.D. Parunak, T.C. Belding, R. Bisson, S.A. Brueckner, E. Downs, R. Hilscher. Stigmergic reasoning over hierarchical task networks, Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 1195-1196, 2009.
[19] I. Dzitac, B.E. Barbat. Artificial intelligence + distributed systems = agents, International Journal of Computers, Communications and Control, 4(1): 17-26, 2009.
[20] F. Heylighen. Accelerating socio-technological evolution: from ephemeralization and stigmergy to the global brain. In: Globalization as an Evolutionary Process: Modeling Global Change (G. Modelski, T. Devezas, W. Thompson, eds.), Routledge, London, 286-335, 2007.
[21] G. DeSanctis, B. Gallupe. A foundation for the study of group decision support systems, Management Science, 589-609, 1987.
[22] F.G. Filip. Decision support and control for large-scale complex systems, Annual Reviews in Control, 32(1): 61-70, 2008.
[23] F. Niederman, C.M. Beise, P.M. Beranek. Issues and concerns about computer-supported meetings: the facilitator's perspective, MIS Quarterly, 20(1): 1-22, 1996.
[24] R.O. Briggs, G.J. de Vreede, J.F. Nunamaker Jr. Collaboration engineering with thinkLets to pursue sustained success with group support systems, Journal of Management Information Systems, 19(4): 31-63, 2003.
[25] J.H. Helquist, J. Kruse, M. Adkins. Developing large scale participant-driven group support systems: an approach to facilitating large groups, Proceedings of the First HICSS Symposium on Field and Case Studies of Collaboration, IEEE Computer Society Press, Los Alamitos, CA, 2006.
[26] F.G. Filip. System analysis and expert systems techniques for operative decision making, Journal of Systems Analysis, Modelling, Simulation, 8(2): 296-404, 1990.
[27] C.B. Zamfirescu.
An agent-oriented approach for supporting self-facilitation in group decisions, Studies in Informatics and Control, 12(2): 137-148, 2003.
[28] R.J. Harder, J.M. Keeter, B.W. Woodcock, J.W. Ferguson, F.W. Wills. Insights in implementing collaboration engineering, Proceedings of the Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2005.
[29] M. de Hengst, E. de Kar, J. Appelman. Designing mobile information services: user requirements elicitation with GSS design and application of a repeatable process, Proceedings of the Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2004.
[30] J.H. Appelman, J. Driel. Crisis-response in the port of Rotterdam: can we do without a facilitator in distributed settings?, Proceedings of the Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2005.
[31] B. Enserink. Creating a scenario logic: design and application of a repeatable methodology, Proceedings of the Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2003.
[32] G.J. de Vreede, R.O. Briggs. Collaboration engineering: designing repeatable processes for high-value collaborative tasks, Proceedings of the Hawaii International Conference on System Sciences, IEEE Computer Society Press, 2005.
[33] L.R. Christensen. The logic of practices of stigmergy: representational artifacts in architectural design, Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work, ACM, New York, 559-568, 2008.
[34] G.J. de Vreede, R.O. Briggs, G.L. Kolfschoten. ThinkLets: a pattern language for facilitated and practitioner-guided collaboration processes, International Journal of Computer Applications in Technology, 25: 140-154, 2006.
[35] R.O. Briggs, B.A. Reinig, G.J. de Vreede. Meeting satisfaction for technology-supported groups: an empirical validation of a goal-attainment model, Small Group Research, 37, 2006.
[36] D. Luce.
Individual Choice Behavior, Wiley, New York, 1959.
[37] K.M. Carley. On generating hypotheses using computer simulations, Proceedings of the 1999 International Symposium on Command and Control Research and Technology, Evidence Based Research, Vienna, 1999.
[38] U. Wilensky. NetLogo, http://ccl.northwestern.edu/netlogo/, Center for Connected Learning and Computer-Based Modeling, Northwestern University, Evanston, 1999.
[39] C.B. Zamfirescu. Anthropocentric Group Decision Support Systems, PhD thesis, Politehnica University of Bucharest, 2006.
[40] S. Guerin, D. Kunkle. Emergence of constraint in self-organizing systems, Nonlinear Dynamics, Psychology, and Life Sciences, 8(2): 131-146, 2004.
[41] B.E. Barbat, S.C. Negulescu, C.B. Zamfirescu. Human-driven stigmergic control. Moving the threshold, Proceedings of the IMACS World Congress on Scientific Computation, Applied Mathematics and Simulation, Paris, 2005.
[42] H.V.D. Parunak, S.A. Brueckner, J.A. Sauter, R. Matthews. Global convergence of local agent behaviors, Proceedings of the International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), Utrecht, Netherlands, 2005.
[43] H.V.D. Parunak, S. Brueckner. Entropy and self-organization in multi-agent systems, Proceedings of the Fifth International Conference on Autonomous Agents, Montreal, Canada, 124-130, 2001.
[44] B.E. Bărbat, C.B. Zamfirescu, S.C. Negulescu. The best from ants and humans: synergy in agent-based systems, Studies in Informatics and Control Journal, 13(1): 47-59, 2004.
[45] F.G. Filip, K. Leiviskä. Large-scale complex systems. In: Handbook of Automation (S. Nof, ed.), 619-638, Springer, 2009.

Constantin-Bala Zamfirescu is currently an associate professor (lecturer) at the "Lucian Blaga" University of Sibiu, Romania. He received his PhD from the "Politehnica" University of Bucharest in the field of automation and control.
His research interests include the employment of the agent-oriented engineering paradigm in group decision support and manufacturing control systems. He is a member of IFAC Technical Committee 5.4, EU COST Action 298 and the EU NoE for the Advancement of Artificial Cognitive Systems, Interaction and Robotics.

Florin Gheorghe Filip, member of the Romanian Academy (elected in 1991), received his M.Sc. and Ph.D. in control engineering from the Technical University "Politehnica" Bucharest in 1970 and 1982, respectively. He has been with the National R&D Institute for Informatics (ICI) since 1970 and was the managing director of ICI from 1991 to 1997. He was elected in 2000 (and re-elected in 2002 and 2006) as a vice-president of the Romanian Academy. He was the chair of IFAC Technical Committee 5.4 (2002-2008). His main scientific interests include: control and optimization of large-scale systems, decision support systems, technology management, and models for the knowledge-information society. He is author/co-author of six books, editor of nineteen volumes and author/co-author of over two hundred articles published in contributed volumes and scientific journals.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. VI (2011), No. 1 (March), pp. 125-133

Ontology Model of a Robotics Agents Community

H. Latorre, K. Harispe, R. Salinas, G. Lefranc

Homero Latorre, Karina Harispe
Universidad Tecnológica Metropolitana de Chile
Departamento de Informática y Computación
José Pedro Alessandri 1242, Ñuñoa, Santiago, Chile
E-mail: helatorre@gmail.com

Renato Salinas
Universidad de Santiago de Chile
Departamento de Ingeniería Mecánica
Avda. Bernardo O'Higgins 3363, Estación Central, Santiago, Chile
E-mail: renato.salinas@usach.cl

Gastón Lefranc
Pontificia Universidad Católica de Valparaíso
Escuela de Ingeniería Eléctrica
Avda. Brasil 2147,
Valparaíso, Chile
E-mail: glefranc@ucv.cl

Abstract: This paper presents an ontology model of necessities and decisions for a cooperative community of heterogeneous robotic agents. Based on an ant community, it defines the characteristics of a collaborative and cooperative community of robots, using multi-agent theory, where each robot shares information with the other robots in the community to accomplish a common objective.
Keywords: robotics, intelligent agents, multi-agents, collaborative and cooperative robotics, long-range-dependent, network layer, network traffic, self-similar process.

1 Introduction

Collaborative and cooperative tasks are used among robotic agents, where knowledge is shared; communication exists so that robots can tell each other where the area for the tasks is located and which decisions they have to make, according to the necessities. These works have hierarchy characteristics similar to those presented in nature, especially in the animal kingdom, where the most interesting are the behaviours of bees, ants and termites, which communicate where the food is and the decisions they have to take, according to the necessities. Based on this group behaviour, these insects are intelligent agents designed by nature. The agents can be simulated in a computer and placed in a group of robots that will be able to do tasks in a coordinated way, where the decisions are taken with the group characteristics. The tasks in a community of collaborative and cooperative robots have increased in complexity, meaning the tasks cannot be done with only one robot, but need a heterogeneous community of robots. This heterogeneity of collective robots produces new challenges and adds a new difficulty to collective robotics, such as the aptitude to coordinate individuals with multiple qualities in an intelligent way, and to solve a specific goal working together while taking all the characteristics into account.
Copyright © 2006-2011 by CCC Publications

To solve a certain task, each of them fulfils a specific function with which the rest must be able to coexist, achieving satisfactory results by means of the union of their qualities and working together [1]. There exist several methodologies for building multi-agent systems. Few of them address the information domain of the system, although the representation of the system in the information domain is as important as the various agents' information-domain views. Heterogeneous systems can contain agents with differing data models, a case that can occur when reusing previously built agents or integrating legacy components into the system. Most existing methodologies lack specific guidance on the development of the information-domain specification for a multi-agent system and for the agents in the system. An appropriate methodology for developing an ontology must be defined for designers to use when specifying domain representations in multi-agent systems. The existing methodologies for designing domain ontologies are built to describe everything about a specific domain; however, this is not appropriate for multi-agent systems, because the system ontology should only specify the information required for proper system execution. The system ontology acts as a prerequisite for future reuse of the system, as the ontology specifies the view of the information domain used by the multi-agent system. Any system that reuses the developed multi-agent system must ensure that the previously developed system ontology does not conflict with the ontology being used in the new system [2]. Once the system ontology is constructed, a multi-agent system design methodology should allow the analyst to specify objects from the data model as parameters in the conversations between the agents.
To ensure the proper functionality of the multi-agent system, the designer must be able to verify that the agents have the necessary information required for system execution. Since the information is represented in the classes of the data model, the design methodology must show the classes passed between agents [3, 4]. In this paper, an ontology model of necessities and decisions for a cooperative community of heterogeneous robotic agents, based on an ant community, is presented. It defines the characteristics of a collaborative and cooperative community of robots, using multi-agent theory, where each robot shares information with the other robots in the community to accomplish a common objective.

2 Description of the Problem

In general, the complexity of the tasks, added to the lack of constant structure in the environment, does not allow a homogeneous and structured community to fulfil the stated aims in an ideal way. Nevertheless, these limitations of capacities can be covered if different technologies are integrated, applying heterogeneity in the architecture together with intelligent behaviours handled by means of multi-agent systems (MAS). It is important to create MAS capable of controlling a heterogeneous community of robots in order to exhibit cooperation and collaboration among them, reaching the proposed aim. There are two concepts that have to be clarified: how the members of the community must interrelate at the moment of doing the work; and cooperation versus collaboration. Cooperation and collaboration are not the same thing. To collaborate is to contribute, to give help (in knowledge or work) to do something, to share the same purpose and common goals. To cooperate is collective work with common goals; it means working together simultaneously, having interactions within the collective work and an interchange of ideas [5].
The cooperative and collaborative environments seek to transform heterogeneous groups into intelligent, flexible and autonomous communities. The community does not need subordination; this means that the community has no teachers and apprentices, nor masters and slaves; the agency is a mere channel of communication towards the exterior, for sending results or requests for support [6, 7]. In some communities, the robots communicate with a robot agency to send reports outside of the community. In a heterogeneous community of robots, the communication between the robots must be egalitarian, without intermediaries. Each of them has to have the capacity of direct connection with its partners, without first establishing communication with the agency. Briefly, it is important to have a MAS capable of producing cooperation and collaboration, generating intelligence, flexibility and autonomy, without subordination, arriving at results obtained from actions realized as a consequence of consensus decisions [1, 8-10].

3 Description of the Organization of the Community

The community is formed by heterogeneous robots; each one is specialized in specific functions, which helps in the achievement of the common goals. Inside the community, every robot must report its characteristics to the rest of the community, in order to generate knowledge that allows determining which of its qualities are suited to specific works. Every robot has responsibilities, which will be resolved by consensus of the community, and it will have to carry out its actions without being an obstacle to its companions, giving any help that is necessary in unforeseen moments. To do the work, and unlike existing robotic communities, a leader does not exist; hence the communication, transfer of information and making of decisions is "all with all", where every contribution has the same value for the community.
If we base the study on a community of ants, these have behaviours that lead them to achieve their goals; though an ant alone is not capable of feeding and defending its anthill, millions of them can do it. In turn, a robot is not capable of performing tasks that need greater capacities than those it possesses, but a group of them that brings together as a whole all the necessary characteristics can perform them. Considering a group of agents, natural or artificial, which must reach an aim, it can be said that: (a) the ants (natural agents) possess intrinsic behaviours of communication, work and decision making; (b) the robots (artificial agents) possess behaviour learned or incorporated by means of programming, which is based on predefined ontologies. To obtain a "natural behaviour" in a robotic community, the modularization of every action realized by the natural agents in an artificial agent is based on the observation of nature.

4 Systems of Ontologies of Communication

Every robot must know the addresses of its community, or detect what signals are available, to be able to send requests for connection and initiate the communicative act that will coordinate the execution of the tasks to be realized. For this connection, three agents are needed: one agent capable of detecting signals and checking the existing addresses, one agent to request connection, and another one to accept the request (Fig. 1). On the other hand, whenever a new member joins the community, it has to report to its companions its characteristics and the functions for which it was designed. For that purpose, it needs an agent that allows it to deliver its "curriculum" to the rest of the community, and in turn it needs an agent capable of capturing the information relating to each of its new collaborators (Fig. 1). During the accomplishment of the tasks, certain needs arise when an individual is not capable of solving the task.
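The connection-and-curriculum exchange described above can be sketched as follows. This is an illustrative Python sketch, not the paper's FIPA implementation; the names `Curriculum`, `Community`, `request_connection` and `who_can` are our own:

```python
from dataclasses import dataclass, field

@dataclass(frozen=True)
class Curriculum:
    """The 'curriculum' a newcomer reports on joining (illustrative)."""
    robot_id: str
    capabilities: frozenset  # functions the robot was designed for

@dataclass
class Community:
    # Shared knowledge: every member's reported capabilities.
    members: dict = field(default_factory=dict)

    def request_connection(self, curriculum: Curriculum) -> bool:
        # Accepting agent: admit the requester and record its curriculum
        # so the community can later match qualities to specific works.
        self.members[curriculum.robot_id] = curriculum.capabilities
        return True  # acceptance of the connection request

    def who_can(self, required: frozenset) -> list:
        # Egalitarian lookup ("all with all"): any member can query which
        # companions cover the required capabilities, with no intermediary.
        return [rid for rid, caps in self.members.items() if required <= caps]
```

Once every member's curriculum is shared, matching a need to a capable companion is a local query rather than a request routed through the agency.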
In this case, it requests help from the others. This implies that every individual has the aptitude to communicate its needs and, in turn, to process requests raised by others. A way of solving this problem is by means of agents in charge of these tasks (Fig. 2). Besides, it is necessary to have an agent capable of handling the need and generating an action that could cover it, based on the knowledge it has, or declaring itself unable to do anything useful. In brief, an agent capable of making an individual decision is needed.

5 Ontology System for the Making of Decisions

In the execution of a cooperative and collaborative work, the communities of robotic agents face problems that need rapid solutions, even if their behaviour is intrinsic in them and presents certain characteristics of collective memory. They have to take decisions that in some cases involve sacrificing some members of the colony. For this, when a need is communicated and several individual answers with the aptitude to cover the need are received, the community must decide which of the offers is better, or how to coordinate them. Different concepts based on the ontologies of decision on which the robotic MAS is sustained can be applied. The last instance, before making the pertinent assignments to cover a need, is the taking of decisions (D), where the conducts of the community are reflected with the greatest force, from coordination up to cooperation. Unlike the traditional focus in collective robotics, in the community of robotic agents a new concept appears: consensus. Any decision taken must be made by mutual agreement of the whole community that is directly faced with the problem (Fig. 3).
The making of decisions takes three different forms, commonly used among human groups:

Decision by similarity (DS): this one is taken when more than one individual presents the same solution, expressed in different ways; the community may take them as different, but it has to have the aptitude to detect the similarities among them and to obtain a unique solution. When there are significant differences between the offers made, it must be feasible to unite the similarities among them and to optimize the differences to obtain the final solution that will be applied.

Decision by quality (DC): this one turns out to be one of the most complex, since the robots must be capable of determining which of all the offers is the optimal, or best, solution. This requires that the community possesses another human characteristic, intelligence, which artificial intelligence brings, besides needing a greater quantity of resources assigned to the making of decisions.

Figure 1: FIPA protocol for a community of robotic agents

Figure 2: Architecture of knowledge behaviours for a community of robotic agents

Decision by majority (DM): this one refers to the choice of the solution generated the most times; this solution goes hand in hand, in the majority of cases, with the decision by similarity, since it is only slightly probable that two or more robot agents with different characteristics and particular knowledge generate exactly equal solutions. The decision by majority implies reducing the number of offers by means of assimilation, then determining which solution is accepted by the greatest number of members. The behaviour (similar to the human being) assigned to the community of robotic agents is directly related to the solution of problems.
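The three decision forms can be sketched as small Python functions. This is an illustrative sketch under stated assumptions: the `canonical` normaliser (for DS) and the `score` evaluator (for DC) stand in for community capabilities the paper leaves abstract:

```python
from collections import Counter

def decide_by_majority(offers):
    # DM: after assimilating equivalent offers, keep the one proposed
    # by the greatest number of members.
    return Counter(offers).most_common(1)[0][0]

def decide_by_quality(offers, score):
    # DC: select the offer the community evaluates as optimal; the
    # `score` function stands in for the community's intelligence.
    return max(offers, key=score)

def decide_by_similarity(offers, canonical):
    # DS: merge offers that are the same solution expressed differently;
    # `canonical` is an illustrative normaliser, then majority applies.
    return decide_by_majority([canonical(o) for o in offers])
```

Note how DS reduces to DM once equivalent offers have been assimilated, which mirrors the observation in the text that the two decisions usually go hand in hand.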
for this reason, the actions to be taken are carried out in a sequenced way, taking care not to omit any of them, so as to avoid failures at the moment of determining the suitable way of handling a situation that arises during the performance of the work required to fulfill a goal. on the other hand, the satisfiers are generally the inactive individuals of the colony, whether they are in the nest or have stopped their work inside the community. these robot agents are capable of solving problems and of covering the needs of their companions, as described in the previous point. the general way of obtaining the satisfier determined by the process that culminates in decision making is by means of communicative acts, which allow information about what is sought to be delivered to other members of the colony so that they can provide it. this sequence of actions is represented graphically in fig. 4.

figure 3: architecture for decision making in a community of robotic agents.
figure 4: sequence of actions for the search of a satisfactory decision.

in a robotic community, every individual must have a small knowledge base handled by an agent, which will determine which actions can be used in the future and which cannot, besides determining which situations must be reported to the rest of the community to create a collective memory. likewise, the robots must possess an agent capable of keeping a record of mistakes and correcting them, also registering the above-mentioned alterations. it must also have the ability to eliminate knowledge that is unnecessary or obsolete, to avoid garbage information inside the robot and its community.

6 ontologies for needs

these types of ontologies define the particularities of every type of existing need.
nowadays, there are four basic needs: nourishment, affiliation, assignment and repair. each of them possesses certain attributes by means of which it can be associated with an individual and with a task, thereby facilitating decision making in the active community. each of these needs has a particular class:

class nourishment: this one describes the levels of energy discharge that cause a problem in a robot, and defines the possible solution to that mishap. table 1 shows the subclasses that make up the class nourishment.

class affiliation: this one describes the most particular need of all, affiliation. it takes place upon the arrival of an individual to the active community and defines what must be fulfilled for the newcomer to be accepted by those who will be its companions once its affiliation to the group has taken place. table 2 shows the subclasses that make up the class affiliation.

class assignment: this one describes the need for a union between an individual and a task. when a certain task arises, someone must be found to carry it out; if compatibility exists with the inactive robot agents, these will be assigned; otherwise, a request will be made to the robot agency. combinations may occur when more than one individual is needed and full coverage of the satisfier does not exist in the active community. table 3 shows the subclasses that make up the class assignment.

class repair: this one describes possible damage to some part or piece of an individual, defining the possibilities of repairing it, changing it or sending it to the workshop, depending on the severity of the fault. table 4 shows the subclasses that make up the class repair.

table 1: subclasses of the class nourishment
subclass: definition
level of hunger: measured in per cent; it refers to the discharge level of the robot's nourishment battery.
intermediate state: the state to which the individual will change while it is recharged (whether it leaves, stays active or waits).
satisfier: determined, depending on the discharge, as either a charger or a derrick.
way of connection: determines the type of connector between the discharged individual and the charger individual.

table 2: subclasses of the class affiliation
subclass: definition
active community: determines the identifier of the active community to which an individual is sent.
individual: determines the identifier of the individual who is sent.
minimal percentage of acceptance: determines the minimal percentage of approval that must exist for the individual to be accepted as a member of the community.

table 3: subclasses of the class assignment
subclass: definition
labor: definition of the task to be carried out.
satisfier: determination of the type of suitable individual.
compatibility with oi: existence of compatibility with the inactive robot agents inside the active community (an alphanumeric string whose first character is the number of compatible robot agents, followed by their identifiers).
request to robot agency: request package for compatible individuals to be sent to the agency (a string of characters whose first character defines the quantity, followed by the description of the satisfier).

in the communities of robotic agents, it is necessary to define each and every one of the ontologies that form the knowledge base of the robotic agents, which are fundamental to defining the tasks that the community is capable of carrying out.
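for illustration, the need classes and the subclasses of tables 1-3 could be encoded as plain data structures. the following python sketch is an assumption of ours: the field names paraphrase the table entries and are not defined by the paper.

```python
# illustrative encoding of three of the four need classes (tables 1-3);
# field names are assumed paraphrases of the table entries.
from dataclasses import dataclass

@dataclass
class Nourishment:
    level_of_hunger_pct: float   # battery discharge level, in per cent
    intermediate_state: str      # state while recharging: 'leaves', 'active' or 'wait'
    satisfier: str               # 'charger' or 'derrick', depending on the discharge
    way_of_connection: str       # connector type between discharged robot and charger

@dataclass
class Affiliation:
    active_community_id: str     # community the individual is sent to
    individual_id: str           # individual being sent
    min_acceptance_pct: float    # minimal approval needed to join

@dataclass
class Assignment:
    labor: str                   # task to carry out
    satisfier: str               # type of suitable individual
    compatibility_with_oi: str   # e.g. '2r01r07': count, then identifiers
    request_to_agency: str       # e.g. '1welder': quantity, then description

need = Nourishment(85.0, "wait", "charger", "plug-a")
print(need.satisfier)  # 'charger'
```

in a deployed system these records would be serialized into the communicative acts exchanged between agents.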
in this work, a proposal is presented for the ontologies of connection, of decision and of needs for a particular community of robotic agents, which, combined with the development of intelligent agents, manages to form a type of social behavior very similar to the conduct presented by social groups such as ants, which possess a highly developed instinct for collaboration and cooperation.

table 4: subclasses of the class repair
subclass: definition
damaged piece: determines which piece presents damage.
level of damage: determines, as a percentage, the severity of the fault.
capacity of change: determines whether it is possible to replace the damaged piece or not.
intermediate state: the state to which the individual will change while it is repaired (whether it leaves, stays active or waits).
satisfier: determines which repairer is the most suitable, or whether it is more suitable to request a derrick.

figure 5: ontology of needs

figure 5 presents a graph of the components of the classes of the ontology of needs.

7 conclusions

in the communities of robotic agents, it is necessary to define each and every one of the ontologies that form the knowledge base of the robotic agents, which are fundamental to defining the tasks that the community is capable of carrying out. in this work, a proposal for an ontology model for a cooperative community of heterogeneous robotic agents is presented. it defines the characteristics of a collaborative and cooperative community of robots, using multi-agent theory, where each robot shares information with the other robots in the community to accomplish a common objective.
the ontologies are for connection, for decisions and for needs, for a particular community of robotic agents; combined with the development of intelligent agents, they manage to form a type of social behavior very similar to that of social groups such as ants.

bibliography
[1] lefranc, g., "colony of robots: new challenge", int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), suppl. issue: ijccc 2008, pp. 92-107.
[2] ellips masehian and davoud sedighizade, "classic and heuristic approaches in robot motion planning – a chronological review", proceedings of world academy of science, engineering and technology, vol. 23, august 2007.
[3] mcguinness, d.l., fikes, r., rice, j. and wilder, s., "an environment for merging and testing large ontologies", principles of knowledge representation and reasoning: proceedings of the seventh international conference, a. g. cohn, f. giunchiglia and b. selman, editors, san francisco, ca, morgan kaufmann publishers, 2000.
[4] supnithi t., inaba a., ikeda m., toyoda j., mizoguchi r., 1999, "learning goal ontology supported by learning theories for opportunistic group formation", proc. of aied99, pp. 67-74.
[5] brophy s., biswas g., katzlberger t., bransford j. and schwartz d., 1999, "teachable agents: combining insights from learning theory and computer science", in s. p. lajoie and m. vivet (eds.), artificial intelligence in education, pp. 21-28.
[6] sloczinski, h., lucila maria costi santarosa, "aprendizagem coletiva em curso mediado pela web" [collective learning in a web-mediated course], vii congresso iberoamericano de informática educativa, 2003.
[7] nwana h., 1996, "software agents: an overview", the knowledge engineering review, vol. 11, no. 3, pp. 205-244, cambridge university press.
[8] paquette g., 1999, "meta-knowledge representation for learning scenarios engineering", proceedings of ai-ed, 1999.
[9] posadas j., 2003,
"arquitectura para el control de robots móviles mediante delegación de códigos y agentes" [an architecture for mobile robot control through delegation of code and agents], tesis doctoral, universidad politécnica de valencia.
[10] soller, a., linton, f., goodman, b., and lesgold, a., 1999, "toward intelligent analysis and support of collaborative learning interaction", proceedings of the ninth international conference on artificial intelligence in education, le mans, france, pp. 75-82.
[11] russell, s., norvig, p., 1995, "artificial intelligence: a modern approach", prentice hall.
[12] wooldridge m. and jennings r., 1995, "intelligent agents: theory and practice", the knowledge engineering review, 10(2), pp. 115-152.

international journal of computers, communications & control vol. i (2006), no. 4, pp. 35-44

numerical aspects and performances of trajectory planning methods of flexible axes

jean-yves dieulot, issam thimoumi, frédéric colas, richard béarée

abstract: adequate path planning design is an important stage in the control of flexible axes, because it can cancel the vibrations induced by oscillating modes. among the bang-bang profiles linked to optimal control, jerk assignment (jerk being the derivative of acceleration) and input shapers have been investigated. theoretical results show their performance and robustness with respect to natural frequency mismatch. practical validation on a real robot arm shows the relevance of the jerk algorithm, which is more robust while offering the same productivity performance as input shaping techniques.

keywords: flexible axes, vibrations, bang-bang laws, input shapers, motion planning.

1 introduction

vibration control of flexible cartesian robots or axes is strongly related to productivity. indeed, lighter structures allow material savings and higher performance in terms of speed, acceleration, etc., which, in turn, cause undesirable oscillations. in practice, classical servo control fails to reduce vibrations, which, in many cases, restrains the operating speed [1].
the only way to damp the oscillations consists of designing suitably smooth reference trajectories, which should nevertheless be as fast as possible [2]. many methods have been introduced, using inversion of the motion equations or a combination of smooth trigonometric or polynomial functions, e.g. [3]. very often, though, these methods face a robustness problem with respect to uncertainties in the dominant frequency, which is the main model parameter to be taken into account in the design of anti-vibration algorithms. near-optimal bang-bang methods have been designed to solve the trade-off between productivity and vibrational behavior. these mainly consist of convolving a series of impulses with the control input, where the inter-pulse duration is chosen so that the resulting oscillations are in phase opposition [4]. these techniques are known as input shapers and have proven quite effective on linear systems, e.g. [5, 6]. on the other hand, the most popular damping scheme consists of a bang-bang profile in jerk (the derivative of acceleration), which has been proven to be related to input shapers with negative weights [7]. the goal of the paper is to compare different bang-bang laws (shapers, jerk, etc.) in terms of productivity and sensitivity to parameter uncertainty, for a classical model of a flexible axis controlled in closed loop. in the first part, different trajectory planning methods are presented, together with theoretical results in terms of vibration attenuation and robustness. the second part deals with an experimental validation on a real industrial pick-and-place robot, which allows a fair comparison of these methods in terms of productivity.
2 preshaping laws

2.1 architecture of cnc axes

control of a cnc axis or a cartesian robot arm is decomposed into three phases [1]:
• geometric motion planning, which provides the desired path within the whole workspace (in 2- or 3-d);
• a time-dependent path planning stage, which should of course take saturations into account but, as will be shown later, should also be designed to cancel undesirable vibrations;
• a servo controller, which should help to reject disturbances and to track the reference trajectory.

figure 1: control strategy of a cnc axis

in practice, the model of a flexible axis controlled with cascaded loops is [1]

$\frac{x(s)}{\bar{x}_{ref}(s)} = \frac{1 + \frac{k_{fv}}{k_v}s}{1 + \frac{s}{k_v}}\cdot\frac{1 + \frac{2\zeta}{\omega}s}{1 + \frac{2\zeta}{\omega}s + \frac{s^2}{\omega^2}}$ (1)

where x is the position of the motor, $\bar{x}_{ref}$ is the reference trajectory, $k_v$ and $k_{fv}$ are servo gains (position and speed gains, respectively), and $\zeta$ and $\omega$ are respectively the damping ratio (assumed very small, $\zeta \ll 1$) and the natural frequency of the axis, modeled as a two-mass spring-damper unit. the main purpose of this paper is the design of the time-dependent path planning for the closed-loop system represented by equation (1).

2.2 smooth continuous laws

a first way of designing the reference trajectory would be to invert the whole path planning, as in the so-called flatness approach [3]. in practice, though, model parameters vary, since the structure consists of an assembly of flexible beams. the linear approximation of equation (1) is indeed valid around one set-point, which is the case during the short periods corresponding to the start and stop procedures, but not during the whole course. for point-to-point displacements, a stage of acceleration and deceleration is necessary.
in cnc control, these stages are driven in a very sharp way, commonly as a bang-bang in acceleration, which consequently excites the flexible mode of equation (1) and generates high-amplitude vibrations. smooth trajectories based on the natural frequency of the closed-loop system (1) can thus be designed, and an obvious method consists of introducing laws such as trigonometric trajectories (fig. 2). the maximal vibratory error for a "sine" (square-sine [2]) profile is

$(\varepsilon_{vib})_{max} = \bar{x}_{ref}\,\frac{\sin(\pi\tau/2)}{\pi\tau(\tau^2 - 1)}$ (2)

where $\tau = t_f\,\omega/2\pi$, $t_f$ is the motion duration and $\bar{x}_{ref}$ is the target position. in practice, the error is significantly reduced for a movement whose duration exceeds 4 times the natural period. an alternative consists of building smooth polynomials that verify smoothness constraints on the dynamics, such as minimizing the amplitude of the jerk during the whole displacement; the corresponding polynomial is

$x_{ref}(t) = \frac{30\,t^3\,\bar{x}_{ref}}{t_f^5}\left(\frac{t^2}{5} - \frac{t_f\,t}{2} + \frac{t_f^2}{3}\right)$ (3)

figure 2: smooth control laws

and the corresponding maximum error again decreases after a few periods [2]:

$(\varepsilon_{vib})_{max} = \frac{15\,\bar{x}_{ref}}{\pi^5\tau^5}\left[3\pi\tau\cos(\pi\tau) + (\pi^2\tau^2 - 3)\sin(\pi\tau)\right]$ (4)

2.3 jerk laws

contrary to the previous laws, it is also possible to use "bang-bang" or discontinuous laws for the feedforward control of cnc axes, which are related to time-optimal control as proven in [8].

figure 3: bang-bang profile (jerk-limited)

the maximum theoretical error for a bang-bang in acceleration is

$\max_{t\ge t_f}\left|\frac{\varepsilon(t)}{\bar{x}_{ref}}\right| = \left(\mathrm{sinc}\left(\frac{\pi\tau}{2}\right)\right)^2,\quad \tau = t_f\,\omega/2\pi .$

for a profile with a bang-bang in jerk, one obtains [2]:

$\max_{t\ge t_f}\left|\frac{\varepsilon(t)}{\bar{x}_{ref}}\right| = \frac{4 a_m}{\omega^2}\cdot\sqrt{\frac{1 + k_{fv}^2\,\omega^2/k_v^2}{1 + \omega^2/k_v^2}}\cdot\left|\mathrm{sinc}\left(\frac{t_j\,\omega}{2}\right)\right|\,\psi(t_j, t_a, t_v)$ (5)

where $t_f = 4t_j + 2t_a + t_v$ (see fig. 3), $a_m$ is the maximum acceleration, and

$\psi(t_j, t_a, t_v) = \left|\sin\left((t_a + t_j)\,\omega/2\right)\,\sin\left((t_a + 2t_j + t_v)\,\omega/2\right)\right| .$

in theory, it is possible to find a value of the jerk yielding zero vibration, namely

$t_j = \frac{2\pi}{\omega}\quad\text{and}\quad j = \frac{a_m\,\omega}{2\pi}$ (6)

the theoretical robustness with respect to an uncertainty in the natural period t is shown below.

figure 4: residual vibrations

2.4 input shapers

the principle of input shapers was introduced by singer and seering [4] and consists of convolving the reference or the input with a series of well-chosen impulses. jerk motion planning can be considered as input shaping with negative weights. the underlying idea is that the oscillation will be compensated by the oscillation induced by a shifted pulse, as shown in fig. 6; a shaper is thus a series of shifted pulses, which can be written

figure 5: input shaper principle
figure 6: superposition of oscillations

$f(s) = a_0 + \sum_{i=1}^{n} a_i e^{-s t_i}$ (7)

where $a_0 + \sum_{i=1}^{n} a_i = 1$. the response of a second-order system is

$x(t) = \sum_{i=1}^{n} \bar{x}_{ref}\left[\frac{a_i\,\omega}{\sqrt{1-\zeta^2}}\,e^{-\zeta\omega(t - t_i)}\right]\sin\left(\omega\sqrt{1-\zeta^2}\,(t - t_i)\right)$

• the zero-vibration (zv) shaper is a two-pulse filter for which the vibration at the stop stage should be zero; one obtains the following equations:

$a_0 + a_1 e^{\zeta\omega t_1}\cos\left(\omega\sqrt{1-\zeta^2}\,t_1\right) = 0$
$a_1 e^{\zeta\omega t_1}\sin\left(\omega\sqrt{1-\zeta^2}\,t_1\right) = 0$
$a_0 + a_1 = 1$

• the zero-vibration derivative (zvd) shaper is a three-pulse filter for which the position and its derivative at the stop stage should be zero; its pulses are:

$\begin{bmatrix} a_i \\ t_i \end{bmatrix} = \begin{bmatrix} \frac{1}{d(\zeta)} & \frac{2e^{-\zeta\pi/\sqrt{1-\zeta^2}}}{d(\zeta)} & \frac{e^{-2\zeta\pi/\sqrt{1-\zeta^2}}}{d(\zeta)} \\ 0 & \frac{\pi}{\omega\sqrt{1-\zeta^2}} & \frac{2\pi}{\omega\sqrt{1-\zeta^2}} \end{bmatrix}$

where $d(\zeta) = 1 + 2e^{-\zeta\pi/\sqrt{1-\zeta^2}} + e^{-2\zeta\pi/\sqrt{1-\zeta^2}}$.

figure 7: shaper robustness

figure 7 shows shaper robustness: the percentage of residual vibration is plotted versus the ratio $\omega_1/\omega$, where $\omega_1 = 2\pi/t_1$ (a specified-insensitivity shaper is a 4-pulse shaper [9]).
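the zv and zvd pulse sequences can be computed directly from ω and ζ. the following python sketch is ours (function names are assumptions); it uses the standard closed forms implied by the zv equations and the zvd pulse matrix above.

```python
# sketch: zv and zvd shaper pulses for a mode with natural frequency w (rad/s)
# and damping ratio zeta; amplitudes a_i sum to 1, times t_i in seconds.
import math

def zv_shaper(w, zeta):
    wd = w * math.sqrt(1 - zeta**2)        # damped frequency
    k = math.exp(-zeta * math.pi / math.sqrt(1 - zeta**2))
    a = [1 / (1 + k), k / (1 + k)]         # solves the zv equations above
    t = [0.0, math.pi / wd]                # second pulse half a damped period later
    return a, t

def zvd_shaper(w, zeta):
    wd = w * math.sqrt(1 - zeta**2)
    k = math.exp(-zeta * math.pi / math.sqrt(1 - zeta**2))
    d = 1 + 2*k + k**2                     # d(zeta) from the zvd matrix
    a = [1/d, 2*k/d, k**2/d]
    t = [0.0, math.pi / wd, 2*math.pi / wd]
    return a, t

# example with the axis parameters identified in section 3.1 (fn = 9.8 hz)
a, t = zvd_shaper(2 * math.pi * 9.8, 0.0132)
print([round(x, 3) for x in a])            # amplitudes close to [0.25, 0.5, 0.25]
```

the shaped command is then the convolution of this pulse train with the unshaped reference, which is what lengthens the cycle time by t1 (zv) or 2·t1 (zvd).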
the more pulses, the more robust the shaper is with respect to natural frequency uncertainty. however, adding pulses leads to higher cycle times, as will be shown later on. one can see that the zv shaper, which performs quite well, is not really robust with respect to natural frequency mismatch. when applied to equation (1), the zv shaper gives a maximal error of

$\max_{t\ge t_f}\left|\frac{\varepsilon(t)}{\bar{x}_{ref}}\right| = \frac{a}{2 + 2t^2\omega^2}\sqrt{\left(1 + \cos\left(\tfrac{\pi\omega}{\omega_1}\right) - t\omega\sin\left(\tfrac{\pi\omega}{\omega_1}\right)\right)^2 + \left(t\omega + t\omega\cos\left(\tfrac{\pi\omega}{\omega_1}\right) + \sin\left(\tfrac{\pi\omega}{\omega_1}\right)\right)^2}$

with a theoretical cycle time of $t_f = \frac{1}{2}\left(\sqrt{16\bar{x}_{ref}/a} - t_1\right)$. the zvd shaper gives a maximal error of

$\max_{t\ge t_f}\left|\frac{\varepsilon(t)}{\bar{x}_{ref}}\right| = \frac{a}{2 + 2t^2\omega^2}\sqrt{\left(t\omega + t\omega\cos\left(\tfrac{\pi\omega}{\omega_1}\right) + \cos\left(\tfrac{\pi\omega}{\omega_1}\right)\sin\left(\tfrac{\pi\omega}{\omega_1}\right) + \sin\left(\tfrac{\pi\omega}{\omega_1}\right)\right)^2 + \left(1 + \cos\left(\tfrac{\pi\omega}{\omega_1}\right) + t\omega\sin\left(\tfrac{\pi\omega}{\omega_1}\right) + t\omega\cos\left(\tfrac{\pi\omega}{\omega_1}\right)\sin\left(\tfrac{\pi\omega}{\omega_1}\right)\right)^2}$

and its cycle time is $t_f = \frac{1}{2}\left(\sqrt{16\bar{x}_{ref}/a} + t_1\right)$, which is much longer than before and comparable to the cycle time of the bang-bang in jerk.

3 experimental comparisons of path planning laws

3.1 material and methods

the experimental validation was carried out on a 3-axis robot (fig. 8), equipped with a real-time "dspace 1103" control card. the available measurements on the motor side come from the actuator axis encoders [10]. when only the horizontal axis is moving, it will be assumed that the axis stiffness remains almost constant (with an actual variation of 30%) and the system can be considered linear. the validation was undertaken for a displacement on the x axis, with x2 varying from x20 = 0 to x2 = 900 mm, y2 = 0 mm and a height z = 315 mm.

figure 8: overview of the first test-setup prototype (stroke [mm]: x-1000, y-400, z-800; max. feedrate: 120 m.min−1; max. acceleration: 4 m.s−2).

according to the experimental results, the natural frequency was calculated as fn = 9.8 hz, and the damping ratio was taken as ζ = 0.0132.
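with the identified natural frequency (fn = 9.8 hz, section 3.1), the zero-vibration jerk time of equation (6) and the residual-vibration factor of equation (5) can be evaluated numerically. a small illustrative python sketch (variable names are ours):

```python
# sketch: zero-vibration jerk tuning (eq. 6) for the identified axis,
# fn = 9.8 hz as measured in section 3.1.
import math

fn = 9.8                      # identified natural frequency, hz
w = 2 * math.pi * fn          # natural pulsation, rad/s

t_j = 2 * math.pi / w         # eq. (6): jerk time equal to one natural period

def residual_factor(t_j, w):
    """|sinc(t_j * w / 2)| factor of eq. (5); vanishes when t_j spans one period."""
    x = t_j * w / 2
    return abs(math.sin(x) / x)

print(round(t_j, 4))                            # ~0.102 s
print(round(residual_factor(t_j, w), 6))        # ~0: vibration cancelled at the nominal frequency
print(round(residual_factor(t_j, 0.8 * w), 3))  # nonzero under a 20% frequency mismatch
```

the last line illustrates the robustness question studied in section 3: a fixed t_j cancels the sinc factor only at the nominal frequency, and a mismatch leaves residual vibration.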
in this case, the optimal value of the jerk given by equation (6), which limits the amplitude of vibrations, is 30 m.s−3. the parameters of the industrial control loop were kp = 14 s−1 and kv = 2 a.rd.s−1.

3.2 experimental results

one can see from fig. 9 that the curve follows a sinc(.) evolution: fine jerk tuning is indeed effective. when the jerk parameter is not well tuned, the vibrational behavior deteriorates slightly. as for the zvd shaper (fig. 10), one finds what was simulated: zero vibration is obtained when a = ω1/ω is even, and the curve is flat around the points cancelling vibrations. results for the zv shaper are alike and not shown here.

3.3 comparison and discussion

one can compare experimentally the maximal error for the three laws: figure 11 shows that the zv shaper is not robust at all. the jerk law is quite robust compared to input shapers, because the jerk is a continuous derivative of the acceleration during the whole motion, whereas the zvd cancels vibrations only at the point where zero vibration is desired. for a value of a = 1, the three curves have the same percentage of residual vibration (same performance). when a = 0, one finds again the classical bang-bang in acceleration, for which by definition 100% vibration occurs (fig. 11).
the cycle time is defined as the instant at which the motor position reaches its reference within 0.2 mm. figure 12 shows the evolution of the cycle time for the different laws. one can see that the zv shaper is quicker by 13%.

figure 9: jerk-limited residual vibrations (%). (a): variation of maximal residual vibration as a function of a = ωj/ω, ωj = 2π/tj; (b): amplitude of vibrations at jerk j = 100; (c): amplitude of vibrations at jerk j = 30.
figure 10: zvd residual vibrations (%). (a): variation of maximal residual vibration as a function of a = ω1/ω; (b): amplitude of vibrations at a = 0.5; (c): amplitude of vibrations at a = 1.
figure 11: comparison of planning laws: percentage of maximum vibrations.
figure 12: comparison of planning laws: cycle time.

as a conclusion, the jerk bang-bang algorithm is more robust to parameter variations than a zv shaper, which has approximately the same cycle time. another advantage is that this algorithm is now implemented in most industrial cnc control devices. when the frequency is always the same, fixed and well estimated, one can expect to gain some productivity by using a zv shaper instead. a possibility to cope with non-linearities consists of using either "gain scheduling" or adaptive/iterative algorithms, e.g. [11, 12].

4 summary and conclusions

the design and computation of near-optimal reference trajectories for cnc devices is a difficult challenge, which has to examine the trade-off between the cancellation of undesirable vibrations and rapidity. this design can be achieved using continuous "jerk" (derivative of acceleration) laws or input shapers, which consist of convolving pulses with the control input.
theoretical and empirical results show the relevance of using jerk laws, due to their productivity performance at comparable robustness properties (with respect to natural frequency uncertainties). a good calibration of the model parameters would allow the use of a two-pulse shaper, which is faster but less robust. in practice, however, these algorithms have to be adapted before being implemented in cnc units, because of neglected dynamics (such as dry friction) which may cause static errors.

references
[1] béarée r., barre p.-j., bloch s., influence of high feed rate machine tool control parameters on the contouring accuracy, j. intell. robotic systems, vol. 40, 2004, pp. 321-342.
[2] barre p.-j., béarée r., borne p., dumetz e., influence of a jerk controlled movement law on the vibratory behaviour of high-dynamics systems, j. of intell. robot. systems, vol. 42, 2005, pp. 275-293.
[3] fliess m., lévine j., martin p., rouchon p., flatness and defect of nonlinear systems: introductory theory and applications, int. j. control, vol. 61, 1995, pp. 1327-1361.
[4] singer n., seering w., preshaping command inputs to reduce system vibration, j. dynam. syst., meas. contr., vol. 112, 1990, pp. 76-82.
[5] meckl p. h., seering w. p., experimental evaluation of shaped inputs to reduce vibration of a cartesian robot, j. of dyn. systems, meas. control, vol. 112, 1990, pp. 159-165.
[6] peláez g., gu. pelaez, j.m. perez, a. vizán, e. bautista, input shaping reference commands for trajectory following cartesian machines, control eng. practice, vol. 13, 2005, pp. 941-958.
[7] singhose w., singer n., seering w., time-optimal negative input shapers, j. dynam. syst., meas., contr., vol. 119, 1997, pp. 198-205.
[8] lau m. a., pao l. y., input shaping and time-optimal control of flexible structures, automatica, vol. 39, 2003, pp. 893-900.
[9] singhose w. e., derezinski s. j., singer n. c., extra insensitive input shaper for controlling flexible spacecraft, j. guidance control dynamics, vol.
19, 1996, pp. 385-391.
[10] dumetz e., dieulot j.-y., barre p.-j., colas f., control of an industrial robot using acceleration feedback, j. intell. robotic systems, vol. 46, 2006, pp. 111-128.
[11] md zain m.z., tokhi m.o., mohamed z., hybrid learning control schemes with input shaping of a flexible manipulator system, mechatronics, vol. 16, 2006, pp. 209-219.
[12] cutforth c. f., pao l. y., adaptive input shaping for maneuvering flexible structures, automatica, vol. 40, 2004, pp. 685-693.

jean-yves dieulot, ensam de lille, research technological team cemodyne, 8 bd louis xiv, 59000 lille cedex, e-mail: jean-yves.dieulot@polytech-lille.fr

received: november 6, 2006

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 719-726

a homogeneous algorithm for motion estimation and compensation by using cellular neural networks

c. grava, a. gacsádi, i. buciu
cristian grava, alexandru gacsádi, ioan buciu, university of oradea, faculty of electrical engineering and information technology, oradea, romania, e-mail: {cgrava,agacsadi,ibuciu}@uoradea.ro

abstract: in this paper we present an original implementation of a homogeneous algorithm for motion estimation and compensation in image sequences using cellular neural networks (cnn). cnns have proven their efficiency in real-time image processing, because they can be implemented on a cnn chip or emulated on a field programmable gate array (fpga). the motion information is obtained using a cnn implementation of the well-known horn & schunck method. this information is then used in a cnn implementation of a motion-compensation method. through our algorithm we obtain a homogeneous implementation for real-time applications in artificial vision or medical imaging. the algorithm is illustrated on some classical sequences and the results confirm its validity.
keywords: cellular neural networks, motion estimation, horn & schunck method.

1 introduction

motion estimation and compensation algorithms have been developed for different applications such as artificial vision, video information compression, medical imaging, digital and high-definition television, video-telephony, virtual reality and multimedia techniques. motion estimation allows one to reduce the temporal redundancy in a sequence of images in order to reduce the transmission rate, and has been widely used in television signal coding (e.g. motion-compensated (mc) prediction, mc interpolation) and videoconference services [1]. to avoid the limitations of sequential implementations, in this paper we propose a fully parallel solution for motion estimation and compensation using cnns [2], as a competing alternative to classical computational techniques. the advantage of algorithms that can be implemented on cnns is that these kinds of neural networks already exist in hardware [3], [4], so real-time applications can be obtained. in our case, because motion estimation and compensation methods generally have a great computational cost, we develop homogeneous algorithms (that is, both the estimation part and the compensation part are implemented in the cnn environment) for real-time applications in artificial vision or medical imaging. after a short introduction, in the second part of this paper we present an overview of motion estimation and compensation methods, followed by a section that presents the cnn implementation of the horn and schunck motion estimation method and a section that presents the cnn implementation of the motion compensation. in the section dedicated to experimental results we present results that confirm the validity of our algorithm, and we end the paper with a section of conclusions, also giving some perspectives on our work.

2 overview of motion estimation and compensation methods

the most used motion estimation methods are [5]:
• differential methods (or gradient methods), in which the motion is estimated based on the spatial and temporal gradients of the images [6], [7], [1];
• block-based methods (or correlative methods). these methods can be classified into phase-correlation methods and block-matching methods. in the case of phase-correlation methods, the motion is estimated based on the fourier phase difference between two blocks from two successive images. these methods are less used in practice because of their high noise sensitivity. in the case of block-matching methods, the location of the block (in the following or previous images) that best matches the reference block in the current image is searched for, based on a certain matching or difference criterion. both methods are usually applied in the case of translational movement, but they can also be adapted to other spatial models of the movement [8].

the principle of almost all motion estimation methods is that the brightness intensity of each pixel is constant along the motion trajectory, or is modified in a predictable way. this hypothesis of brightness intensity preservation of each point (x, y, t) along the motion trajectory can be expressed through the equation of the displaced frame difference (dfd) between the instants t and t − ∆t [9]:

$dfd(x, y) = \phi(x - d_x, y - d_y, t - dt) - \phi(x, y, t),$ (1)

where φ(x, y, t) denotes the brightness distribution of the image at moment t and $d = [d_x, d_y]^t$ is the displacement vector between the moments t and t′ = t − ∆t (dx and dy being the displacements in the x and y directions, respectively). differential motion estimation methods are based on the spatial and temporal gradients of a sequence of images. if the brightness intensity of a pixel does not vary in time, then the total derivative dφ/dt = 0 [1].
the first-order taylor development of this last relation results in the "equation of movement constraint" (emc), or "optical flow equation" (ofe), which links the spatial and temporal gradients of the brightness intensity [9]:

$\frac{\partial\phi}{\partial x}\frac{\partial x}{\partial t} + \frac{\partial\phi}{\partial y}\frac{\partial y}{\partial t} + \frac{\partial\phi}{\partial t} = 0.$ (2)

we can rewrite this as:

$\phi_x\,\nu_x + \phi_y\,\nu_y + \phi_t = 0,$ (3)

where φx and φy are the spatial gradients, φt is the temporal gradient of the brightness intensity, and νx = dx/dt, νy = dy/dt are the velocities in the x and y directions [1]. as can be observed in eq. 3, the optical flow equation has two unknowns (νx, νy); hence the system is under-determined, leading to an ill-posed problem. in order to obtain both movement components (νx, νy), a second constraint has to be introduced, to obtain a fully determined system (two equations with two unknowns). one of the possible constraints is offered by the well-known horn & schunck motion estimation method [1], which assumes that all neighboring pixels have similar movement (the velocity field is said to be uniform or smooth). it follows that an energy has to be minimized:

$e^2 = e^2_{flow} + \gamma\,e^2_{uniformity}$ (4)

the first term corresponds to the difference related to the projection of the velocity vectors on the spatial gradient (as in the emc), and the second term corresponds to the difference with respect to a smooth field, γ being the weighting factor between the two terms. the uniformity constraint is expressed by the equation:

$e^2_{uniformity} = \left(\frac{\partial\nu_x}{\partial x}\right)^2 + \left(\frac{\partial\nu_x}{\partial y}\right)^2 + \left(\frac{\partial\nu_y}{\partial x}\right)^2 + \left(\frac{\partial\nu_y}{\partial y}\right)^2$ (5)

$e^2_{flow} = \left(\phi_x\,\nu_x + \phi_y\,\nu_y + \phi_t\right)^2$ (6)

the solution is obtained by a gauss-seidel minimization [1]. the energy will be considered minimized when the error (that is, the difference between two successive values of (νx, νy)) is minimal, or when the chosen maximum number of iterations is reached.
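before its cnn mapping, the gauss-seidel iteration described above can be illustrated with a scalar (non-cnn) python sketch on tiny images. the forward-difference gradients and clamped borders are simplifying assumptions of ours, and alpha plays the role of the weighting term γ:

```python
# minimal pure-python horn & schunck sketch; gauss-seidel style in-place update.
# gradient scheme and border handling are illustrative simplifications.

def horn_schunck(f1, f2, alpha=1.0, n_iter=50):
    h, w = len(f1), len(f1[0])
    u = [[0.0] * w for _ in range(h)]   # velocity in x
    v = [[0.0] * w for _ in range(h)]   # velocity in y
    # simple forward-difference spatial gradients and temporal gradient
    fx = [[f1[y][min(x + 1, w - 1)] - f1[y][x] for x in range(w)] for y in range(h)]
    fy = [[f1[min(y + 1, h - 1)][x] - f1[y][x] for x in range(w)] for y in range(h)]
    ft = [[f2[y][x] - f1[y][x] for x in range(w)] for y in range(h)]

    def avg(field, y, x):
        # 4-neighbour mean with clamped borders (the "smooth field" constraint)
        return (field[max(y - 1, 0)][x] + field[min(y + 1, h - 1)][x]
                + field[y][max(x - 1, 0)] + field[y][min(x + 1, w - 1)]) / 4

    for _ in range(n_iter):
        for y in range(h):
            for x in range(w):
                ub, vb = avg(u, y, x), avg(v, y, x)
                num = fx[y][x] * ub + fy[y][x] * vb + ft[y][x]
                den = alpha**2 + fx[y][x]**2 + fy[y][x]**2
                u[y][x] = ub - fx[y][x] * num / den   # in-place: gauss-seidel
                v[y][x] = vb - fy[y][x] * num / den
    return u, v
```

on a brightness ramp translated by one pixel, the recovered u field converges toward 1 in the interior; the cnn version performs the same local averaging and correction through template operations instead of explicit loops.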
This method is not limited to translations, as the block-matching method is, and its computations are cheaper, but the motion amplitude has to be small (less than about three pixels) because of the assumptions behind the Taylor expansion. The usual way to avoid this small-amplitude constraint is the multi-resolution technique [1]. Using two consecutive frames φ_i(x, y, t_i) and φ_f(x, y, t_f) of a sequence (Fig. 1), the application of a motion estimation algorithm such as the Horn & Schunck method yields, for each pixel, an estimate of its motion along both directions (x, y) of the coordinate system attached to the image plane.

Figure 1: The illustration of motion compensation.

The purpose of motion compensation is, based on the estimated motion information and starting from a reference image (the initial image φ_i in Fig. 1), to obtain an estimate of the real comparison image (the final image φ_f) that was used in the motion estimation process [10].

3 The CNN implementation of the Horn & Schunck motion estimation method

Regarding CNN gray-scale image processing, template design based on variational computing is generally possible if the design constraints are respected [11], [12]. In order to determine the templates analytically, cost functions (energies) are used in the design step. An important design aspect is how to associate each energy function with one of the A, B, C or D templates, as well as how to choose the number of CNN layers. Taking into account the characteristics of existing CNN chips, it is recommended to use only single-layer CNNs and only A and B templates. Weights can be introduced into the cost functions in order to keep the state values in the linear zone of the state-output transfer characteristic. The motion estimation uses the two images φ_1(x, y, t) and φ_2(x, y, t + ∆t), a two-layer CNN structure and the HOSCH.TEM template (see Fig. 2).
After the cost function minimization [13], the HOSCH.TEM template results in:

Figure 2: Two-layer CNN structure for the proposed Horn & Schunck motion estimation method.

• polarization images z_1 = φ_x φ_t and z_2 = φ_y φ_t;
• nonlinear templates A_1 and A_2:

0    a    0
a  1−4a   a
0    a    0

where the parameter a also includes the γ parameter from Eq. (4), weighted by the constants that result from the energy minimization;
• a nonlinear D-type template:

0    0    0
0  d_kl   0
0    0    0

where each element d_kl is expressed as follows: d_11(ν_x) = (φ_x)² · ν_x, d_12(ν_x) = φ_x φ_y · ν_x, d_21(ν_y) = φ_x φ_y · ν_y and d_22(ν_y) = (φ_y)² · ν_y, respectively.

In Fig. 3 the images with the estimated motion for the real "taxi" sequence are presented. The first two images are two frames of the "Hamburg taxi" sequence, a well-known benchmark in motion estimation, and the last two images show the motion (or displacement) images ν_x and ν_y, i.e. the displacement along the two spatial directions. The combination of these two images can also be represented as a single image with vectors corresponding to the displacement of each pixel of the reference image, as shown in the section with further experimental results. For better visualization, the motion images ν_x and ν_y are not calibrated in the CNN domain; if these images are to be used for image compensation, they have to be calibrated in the CNN domain [−1, +1].

Figure 3: Two images of the Hamburg taxi real sequence, used in our experiments.

4 The CNN implementation of motion compensation

In order to develop a motion compensation algorithm that can be directly implemented on a CNN chip, the algorithm has to be decomposed into elementary steps that can then be implemented on the existing hardware.
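To give a feel for how a feedback template of the form [0 a 0; a 1−4a a; 0 a 0] acts on an image, the following rough discrete-time emulation applies one synchronous template update per call. This is an illustrative sketch only; a real CNN evolves in continuous time and also involves the B template and bias.

```python
import numpy as np

def apply_template(state, a):
    """One synchronous update with the 3x3 feedback template
    [0 a 0; a 1-4a a; 0 a 0]: a discrete Laplacian-style smoothing step
    (periodic boundaries, for simplicity)."""
    n4 = (np.roll(state, 1, 0) + np.roll(state, -1, 0)
          + np.roll(state, 1, 1) + np.roll(state, -1, 1))
    return (1 - 4 * a) * state + a * n4
```

Since the template coefficients sum to one, the update preserves the image mean while progressively smoothing it for 0 < a < 1/4, which is the diffusion-like behaviour exploited by the uniformity term of the energy.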
As a result of the motion estimation process, a pixel can be stationary or can change its position in one of the eight elementary directions: N, N-E, E, S-E, S, S-W, W, N-W. After the application of a motion estimation technique, the pixels of the intermediary image frame φ(x, y, t), corresponding to any given moment t ∈ (t_i, t_f) (t being the time position of the intermediary image between the initial and the final image), can be classified in the following categories (see Fig. 1):

• pixels of type "a", which have an identical position in the two consecutive images; these pixels change, at a given moment t ∈ (t_i, t_f), neither their positions nor their values;
• pixels of type "b", which will move because the corresponding pixels in the two images containing the motion information have a value greater than one elementary value (which can be viewed as a quantum, or a threshold). The values of these pixels do not change but, as intermediary images are inserted between the initial and final images φ_i and φ_f, their positions change successively by one elementary value (one quantum) corresponding to the spatial discretization. The maximum number of intermediary images that can be inserted equals the maximum number of elementary values (quanta) that can be identified in the images containing the motion (or displacement) information;
• pixels of type "c", which change their values because they will be covered by pixels arriving in their position and overlapping the initial pixel:

c(t) = shift(b(t));   (7)

• pixels of type "d", with unknown values, which result from the displacement of type "b" pixels: a location is vacated but no pixel arrives in it.
At each step of the movement, the values of these pixels can be determined through spatial CNN cubic-spline interpolation [10]:

d(t) = ⌊d(t − 1) · c(t) + b(t) · c(t)⌋ · b(1);   (8)

• pixels of type "e", which at the current processing time have the same value as in the initial image φ_i, due to the movement of the pixels (arrivals and departures); the values of these pixels are restored from the initial image:

e(t) = c(t − 1) · b(t) · c(t) · d(t).   (9)

Each intermediary step results in an associated image; to create this intermediary image the following operations have to be performed:
• determination of the type "c" pixels, i.e. the displacement by one pixel in the direction resulting from the motion information;
• interpolation, in order to determine the values of the unknown (type "d") pixels.

For each intermediary image, the value of a pixel results from the determination of the type of that pixel; the state of a pixel can change during the processing. The initial and final images φ_i and φ_f and the images containing the motion information have the same dimensions. In this paper all images are converted to standard CNN gray-scale images, with values between −1 and +1 (see Fig. 4). The convention is that pixels with negative values code a displacement to the left (Fig. 4 a) or upwards (Fig. 4 b), and pixels with positive values code displacements to the right (Fig. 4 a) or downwards (Fig. 4 b). The values of the pixels coding the motion are multiples of the minimum detectable motion value. In order to detect the pixels that change their position, and to move those pixels, the TRESHOLD.TEM and SHIFT.TEM template families are used [14]. The values of the type "d" pixels can be determined either for each intermediate image or only for the final image.
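The per-quantum pixel bookkeeping above can be illustrated, in much simplified form, by collapsing all elementary steps into a single integer forward warp. In this sketch the holes (the type "d" locations) are simply refilled from the reference image instead of using the paper's CNN spline interpolation; it is an illustrative stand-in, not the authors' algorithm.

```python
import numpy as np

def forward_warp(img, dx, dy):
    """Forward-warp img by integer displacement fields dx, dy.
    Moving pixels ('b') overwrite the pixels they land on ('c');
    vacated locations that receive nothing ('d') are crudely refilled
    from the reference image."""
    h, w = img.shape
    out = np.full((h, w), np.nan)
    ys, xs = np.mgrid[0:h, 0:w]
    ty = np.clip(ys + dy, 0, h - 1)
    tx = np.clip(xs + dx, 0, w - 1)
    moving = (dx != 0) | (dy != 0)
    # stationary pixels first, then moving pixels, so that arrivals overwrite
    out[ys[~moving], xs[~moving]] = img[~moving]
    out[ty[moving], tx[moving]] = img[moving]
    holes = np.isnan(out)          # nothing arrived and the source moved away
    out[holes] = img[holes]
    return out
```

A pixel with displacement (1, 0) thus reappears one column to the right in the compensated image, exactly the single-quantum step the CNN algorithm performs once per intermediary image.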
In order to avoid modifying the type "a" pixels, and to restore the type "e" pixels, mask images are created during the processing, using equations (7), (8) and (9).

Figure 4: Conventions in the images containing the motion information.

5 Experimental results

In this section some experimental results obtained using CADETWIN (the CNN Application Development Environment and Toolkit under Windows [14]) are presented. The images containing the motion estimates ν_x and ν_y are calibrated in the CNN domain [−1, +1]. The processing time depends on the number of interpolations and on the number of motion quanta resulting from the spatial discretization, i.e. on the total number of images inserted between the initial and final images φ_i and φ_f. Due to the parallel processing, this total processing time is independent of the dimensions of the original images and of the number of moving pixels. In order to illustrate the implemented method on real images and for a complex movement, starting from an initial image of the well-known "table tennis" sequence we simulated a complex motion using the free-form deformation principle, resulting in three real images of a sequence, φ_1, φ_2, φ_3 (see Fig. 5). Figure 5 also shows the "motion estimation field" obtained after applying our CNN implementation of the Horn & Schunck motion estimation method. Starting from the first image of the real sequence (φ_1) and this motion estimation field, we can obtain the "motion compensated image" φ̂_2, which represents an estimate of the real image φ_2. As can be observed, the biggest errors between the real image and the motion compensated image (φ_2 − φ̂_2) are located in regions with a high intensity gradient, which usually correspond to regions with different motions.
Another cause of these errors could be the discrete nature of the image spatial support and the interpolations that are necessary.

6 Conclusions

Generally, in a serial implementation of a motion compensation algorithm the processing time depends on the image dimensions. Our CNN motion estimation and compensation algorithm uses only 3×3 templates, so it can be directly implemented on the CNN Universal Chip [4], and the image processing becomes completely parallel. The advantage of using the CNN hardware platform is that the total processing time does not depend on the image dimensions, only on the number of displacement steps that have to be performed; real-time operation thus becomes possible, with applications in artificial vision and medical imaging. Taking into account that our CNN motion estimation and compensation algorithm is based on the minimization of cost functions (usually partial differential equations), resulting in nonlinear templates, our attention is now focused on an FPGA implementation of the algorithm on a digital emulator of the CNN [11], [12].

7 Acknowledgement

This work was partially supported by a grant from the Romanian National University Research Council, PNCDI program ID-668/2008.

Bibliography

[1] Horn B.K.P. and Schunck B.G., Determining optical flow, Artificial Intelligence, Vol. 17, pp. 185-203, 1981.
[2] Chua L.O. and Yang L., Cellular neural networks: Theory, IEEE Transactions on Circuits and Systems, Vol. 35, pp. 1257-1272, 1988.
[3] Cembrano G.L., Rodríguez-Vázquez A., Espejo-Meana S. and Domínguez-Castro R., ACE16k: A 128×128 focal plane analog processor with digital I/O, Int. J. Neural Syst., Vol. 17, issue 6, pp. 427-434, 2003.
[4] Roska T.
and Chua L.O., The CNN universal machine: An analogic array computer, IEEE Transactions on Circuits and Systems, Vol. 40, pp. 167-173, 1993.
[5] Konrad J., Motion detection and estimation, Image Processing Handbook, Networking and Multimedia, pp. 207-227, 2000.
[6] Bruhn A., Weickert J., Feddern C., Kohlberger T. and Schnörr C., Real-time optic flow computation with variational methods, Computer Analysis of Images and Patterns, pp. 222-229, 2003.
[7] Brox T., Bruhn A., Papenberg N. and Weickert J., High accuracy optical flow estimation based on a theory for warping, ECCV, pp. 25-36, 2004.
[8] Wei W., Hou Z.-X. and Guo Y.-C., A displacement search algorithm for deformable block matching motion estimation, Proc. of the IEEE International Symposium on Communications and Information Technology, pp. 457-460, 2005.
[9] Barron J.L., Fleet D.J. and Beauchemin S., Performance of optical flow techniques, International Journal of Computer Vision, Vol. 12, issue 1, pp. 43-77, 1994.
[10] Grava C., Gacsádi A., Gordan C., Maghiar T. and Bondor K., Motion compensation using cellular neural networks, Proc. of the European Conference on Circuit Theory and Design (ECCTD), Vol. I, pp. I-397-I-400, Krakow, Poland, 2003.
[11] Kincses Z., Nagy Z. and Szolgay P., Implementation of nonlinear template runner emulated digital CNN-UM on FPGA, Proc. of the 10th International Workshop on Cellular Neural Networks and Their Applications, pp. 186-190, Istanbul, Turkey, 2006.
[12] Nagy Z., Vörösházi Zs. and Szolgay P., Emulated digital CNN-UM solution of partial differential equations, Int. Journal of Circuit Theory and Applications, Vol. 34, issue 4, pp. 445-470, 2006.
[13] Gacsádi A., Grava C., Tiponut V. and Szolgay P., A CNN implementation of the Horn & Schunck motion estimation method, Proc. of the 10th International Workshop on Cellular Neural Networks and Their Applications, pp. 381-385, Istanbul, Turkey, 2006.
[14] *** CADETWIN, CNN Application Development Environment and Toolkit under Windows.
Version 3.0, Analogical and Neural Computing Laboratory, Hungarian Academy of Sciences, Budapest, 1999.

International Journal of Computers, Communications & Control Vol. II (2007), No. 2, pp. 195-204

Robust PID Decentralized Controller Design Using LMI

Danica Rosinová, Vojtech Veselý

Abstract: A new LMI-based method for robust stability analysis of linear uncertain systems with a PID controller is proposed. A general constrained structure of the controller matrix is considered, appropriate for both output feedback and decentralized control, and the corresponding guaranteed cost control design scheme is presented. A sufficient robust stability condition is developed for an extended quadratic performance index that includes the first derivative of the state vector in order to damp oscillations. The obtained stability condition is formulated for a parameter-dependent Lyapunov function.

Keywords: uncertain systems, robust stability, decentralized control, linear matrix inequalities (LMI), Lyapunov function

1 Introduction

Robust stability and robust control belong to the fundamental problems in control theory and practice; various approaches have been proposed in this field to cope with the uncertainties that always appear in a real plant ([2]; [8]; [7]; [5]; [4]). The development of linear matrix inequality (LMI) computational techniques has brought an efficient tool for solving a large set of convex problems in polynomial time (e.g. [2]). Significant effort has therefore been made to formulate crucial control problems algebraically ([12]), so that the numerical LMI solution can be adopted. This approach is advantageously used for solving control problems for linear systems with a convex (affine or polytopic) uncertainty domain. However, many important problems in linear control design, such as decentralized control, simultaneous static output feedback (SOF) or, more generally, structured linear control problems, have been proven to be NP-hard ([1]).
Intensive research has been devoted to overcoming nonconvexity and transforming the nonconvex or NP-hard problem into a convex optimisation problem in the LMI framework. Various techniques have been developed using inner or outer convex approximations of the respective nonconvex domain. The common tool in both inner and outer approximation is the use of linearization, or convexification. In ([6]; [3]) a general convexifying algorithm for nonconvex functions, together with potential convexifying functions for both the continuous- and discrete-time case, has been proposed. The linearization approach for continuous- and discrete-time system design was independently used in ([13]; [11]).

Proportional-integral-derivative (PID) controllers are among the most popular in the industrial world. The derivative part of the controller, however, causes difficulties when uncertainties are considered. In the multivariable PID control schemes using LMI developed recently ([14]), the incorporation of the derivative part requires the inversion of a matrix, which does not allow uncertainties to be included. The other way to cope with the derivative part is to assume the special case where the output and its derivative are state variables; robust PID controllers for first- and second-order SISO systems have been proposed for this case in ([7]).

In this paper a state-space approach to the design of decentralized (multi-loop) robust PID controllers with guaranteed cost is proposed for linear uncertain systems, using a new quadratic cost function. The major contribution is the treatment of the derivative part in the robust control framework. We adopt a new LMI formulation of the PID control problem that is appropriate for polytopic uncertain systems. The robust PID control scheme is developed for a structured control gain matrix, thus enabling decentralized PID control design. In Section 2 the robust control design problem with a structured control gain matrix is formulated in general.
The robust optimal control design procedure in state space with the extended cost function is proposed in Section 3. The main result, the robust PID controller design approach, is provided in Section 4; the developed approach is appropriate for a decentralized control structure. In Section 5 the results are illustrated on examples.

Copyright © 2006-2007 by CCC Publications

2 Problem formulation and preliminaries

Consider a linear affine uncertain system

δx(t) = (A + ∆A)x(t) + (B + ∆B)u(t),
y(t) = Cx(t),   (1)

where δx(t) = ẋ(t) for a continuous-time system and δx(t) = x(t + 1) for a discrete-time system; x(t) ∈ R^n, u(t) ∈ R^m, y(t) ∈ R^l are the state, control and output vectors, respectively; A, B, C are known constant matrices of appropriate dimensions corresponding to the nominal system; and ∆A, ∆B are uncertainty matrices of the respective dimensions. The uncertainties are considered to be affine, of the form

∆A = Σ_{j=1}^{p} ε_j Ã_j,   ∆B = Σ_{j=1}^{p} ε_j B̃_j,   (2)

where ε_j ∈ [ε̲_j, ε̄_j] are unknown uncertainty parameters and Ã_j, B̃_j, j = 1, 2, ..., p, are constant matrices of the respective dimensions and structure. The uncertain system (1), (2) can be equivalently described by a polytopic model given by its vertices

{(A_1, B_1, C), (A_2, B_2, C), ..., (A_N, B_N, C)},   N = 2^p.

The decentralized feedback control law is considered in the form

u(t) = FCx(t),   (3)

where F is a matrix corresponding to the decentralized controller. The uncertain closed-loop polytopic system is then

δx(t) = A_C(α)x(t),   (4)

where

A_C(α) ∈ { Σ_{i=1}^{N} α_i A_Ci :  Σ_{i=1}^{N} α_i = 1,  α_i ≥ 0 },   A_Ci = A_i + B_i FC.   (5)

To assess the performance quality, a quadratic cost function known from LQ theory is often used. However, in practice the response rate or the overshoot are often limited; therefore we include in the cost function an additional derivative term for the state variable, to open the possibility of damping the oscillations and limiting the response rate.
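The passage from the affine description (2) to the polytopic vertex set can be made concrete with a short sketch (a hypothetical helper, not from the paper): each of the p uncertain parameters is set to an extreme value, giving N = 2^p vertex pairs (A_i, B_i).

```python
import numpy as np
from itertools import product

def polytope_vertices(A, B, A_t, B_t, eps_max):
    """Vertices (Ai, Bi) of the polytopic model: every uncertain
    parameter eps_j taken at -eps_max[j] or +eps_max[j] (N = 2**p)."""
    verts = []
    for signs in product((-1.0, 1.0), repeat=len(A_t)):
        dA = sum(s * e * M for s, e, M in zip(signs, eps_max, A_t))
        dB = sum(s * e * M for s, e, M in zip(signs, eps_max, B_t))
        verts.append((A + dA, B + dB))
    return verts
```

The extended cost function with the derivative term announced above is introduced next.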
J_c = ∫_0^∞ [x(t)^T Q x(t) + u(t)^T R u(t) + δx(t)^T S δx(t)] dt

for a continuous-time system and

J_d = Σ_{k=0}^{∞} [x(t)^T Q x(t) + u(t)^T R u(t) + δx(t)^T S δx(t)]   (6)

for a discrete-time system, where Q, S ∈ R^{n×n} and R ∈ R^{m×m} are symmetric positive definite matrices. The concept of guaranteed cost control is used in the standard way: let there exist a feedback gain matrix F_0 and a constant J_0 such that

J ≤ J_0   (7)

holds for the closed-loop system (4), (5); then the respective control (3) is called a guaranteed cost control and the value J_0 is the guaranteed cost. The main aim of this paper is to develop a decentralized PID control algorithm that stabilizes the uncertain system (1) with guaranteed cost with respect to the cost function (6). We start with basic notions concerning Lyapunov stability and convexifying functions. In the following we use the D-stability concept ([4]) to obtain the respective stability conditions in a more general form.

Definition 1 (D-stability). Consider the D-domain in the complex plane defined as

D = { s ∈ C : [1  s]* [ r11  r12 ; r12*  r22 ] [1 ; s] < 0 }.

The linear system (1) is D-stable if all its poles lie in the D-domain.

(To simplify the reading of the formulas, scalar values of the parameters r_ij are used in Definition 1; in general the stability domain can be defined using matrix-valued parameters R_ij of the respective dimensions.) The standard choice of r_ij is r11 = 0, r12 = 1, r22 = 0 for a continuous-time system and r11 = −1, r12 = 0, r22 = 1 for a discrete-time system. Quadratic D-stability is equivalent to the existence of one Lyapunov function for the whole set describing the uncertain system model.

Definition 2
(Quadratic D-stability). The uncertain system (4) is quadratically D-stable if and only if there exists a symmetric positive definite matrix P such that

r12 P A_C(α) + r12* A_C(α)^T P + r11 P + r22 A_C(α)^T P A_C(α) < 0.   (8)

Instead of quadratic stability, a robust stability notion is considered, based on the parameter-dependent Lyapunov function (PDLF) defined as

P(α) = Σ_{i=1}^{N} α_i P_i,   where P_i = P_i^T > 0,   (9)

which gives less conservative results than quadratic stability with a unique Lyapunov function.

Definition 3 ([5]). System (4) is robustly D-stable in the convex uncertainty domain (5) with the parameter-dependent Lyapunov function (9) if and only if there exists a matrix P(α) = P(α)^T > 0 such that

r12 P(α) A_C(α) + r12* A_C(α)^T P(α) + r11 P(α) + r22 A_C(α)^T P(α) A_C(α) < 0   (10)

for all α such that A_C(α) is given by (5). A sufficient robust D-stability condition that can be considered not too conservative has been proposed in ([9]); it is recalled in the following lemma.

Lemma 4. If there exist matrices E ∈ R^{n×n}, G ∈ R^{n×n} and N symmetric positive definite matrices P_i ∈ R^{n×n} such that for all i = 1, ..., N:

[ r11 P_i + A_Ci^T E^T + E A_Ci    r12 P_i − E + A_Ci^T G  ]
[ r12* P_i − E^T + G^T A_Ci        r22 P_i − (G + G^T)     ]  < 0,   (11)

then system (4) is robustly D-stable.

Note that the matrices E and G are not restricted to any special form; they are included to relax the conservatism of the sufficient condition. To transform the nonconvex problem of structured control (decentralized control in our case) into a convex form, convexifying (linearizing) functions can be used ([6]; [3]; [11]; [13]).
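For the continuous-time case (r11 = 0, r12 = 1, r22 = 0), condition (8) reduces to P A_Ci + A_Ci^T P < 0 at every vertex with a single P. A cheap numerical check in that spirit can be sketched as follows (our own sketch, cruder than the lemma: it solves a Lyapunov equation for the "average" vertex and tests the resulting P everywhere).

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov

def quadratic_stability_check(vertices):
    """Try a common Lyapunov matrix: solve A_avg^T P + P A_avg = -I for the
    average vertex, then require P > 0 and Ai^T P + P Ai < 0 at each vertex.
    Success proves quadratic stability; failure proves nothing (the PDLF
    condition of the paper is less conservative)."""
    A_avg = sum(vertices) / len(vertices)
    n = A_avg.shape[0]
    P = solve_continuous_lyapunov(A_avg.T, -np.eye(n))
    pos_def = np.linalg.eigvalsh(P).min() > 0
    decreasing = all(np.linalg.eigvalsh(Ai.T @ P + P @ Ai).max() < 0
                     for Ai in vertices)
    return P, pos_def and decreasing
```

The convexifying functions used to handle the structured (nonconvex) case are introduced next.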
The respective potential convexifying functions for X^{-1} and XWX have been proposed in linearized form: the linearization of X^{-1} ∈ R^{n×n} about the value X_k > 0 is

Φ(X^{-1}, X_k) = X_k^{-1} − X_k^{-1}(X − X_k)X_k^{-1},   (12)

and the linearization of XWX ∈ R^{n×n} about X_k is

Ψ(XWX, X_k) = −X_k W X_k + X W X_k + X_k W X.   (13)

Both functions defined in (12) and (13) meet one of the basic requirements on a convexifying function: they are equal to the original nonconvex term if and only if X_k = X. However, the question of how to choose an appropriate convexifying function remains open. In the sequel, X > 0 denotes a positive definite matrix; * in matrices denotes the respective transposed term making the matrix symmetric; I denotes the identity matrix and 0 denotes the zero matrix of the respective dimensions.

3 Robust optimal controller design

In this section a new design algorithm for optimal control with guaranteed cost is developed using a parameter-dependent Lyapunov function and the convexifying approach, employing an iterative procedure. The proposed control design approach uses a sufficient stability condition inspired by the result of ([9]). The next theorem provides a new form of the robust stability condition for a linear uncertain system with guaranteed cost.

Theorem 5. Consider the uncertain linear system (1), (2) with static output feedback (3) and cost function (6). The following statements are equivalent:

i) The closed-loop system (4) is robustly D-stable with PDLF (9) and guaranteed cost with respect to the cost function (6): J ≤ J_0 = x^T(0)P(α)x(0).

ii) There exist matrices P(α) > 0 defined by (9) such that

r12 P(α) A_C(α) + r12* A_C(α)^T P(α) + r22 A_C(α)^T P(α) A_C(α) + r11 P(α) + Q + C^T F^T R F C + A_C(α)^T S A_C(α) < 0.   (14)

iii) There exist matrices P(α) > 0 defined by (9) and H, G and F of the respective dimensions such that

[ r11 P(α) + A_C(α)^T H^T + H A_C(α) + Q + C^T F^T R F C    r12* P(α) − H^T + G^T A_C(α) ]
[ *                                                        r22 P(α) − (G + G^T) + S     ]  < 0.   (15)

A_Ci = (A_i + B_i FC) denotes the i-th closed-loop system vertex.
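A quick numerical sanity check of the convexifying property of (12): since X^{-1} is matrix-convex on the positive definite cone, the linearization Φ is a tangent under-estimator, i.e. X^{-1} − Φ(X^{-1}, X_k) is positive semidefinite, with equality at X = X_k. The sketch below uses arbitrarily chosen test matrices X and X_k.

```python
import numpy as np

def lin_inv(X, Xk):
    """Phi(X^-1, Xk) of eq. (12): first-order expansion of X^-1 about Xk."""
    Xk_inv = np.linalg.inv(Xk)
    return Xk_inv - Xk_inv @ (X - Xk) @ Xk_inv

rng = np.random.default_rng(0)
M = rng.standard_normal((4, 4))
X = M @ M.T + 4 * np.eye(4)      # a positive definite test point
Xk = np.eye(4)                   # expansion point
gap = np.linalg.inv(X) - lin_inv(X, Xk)   # should be positive semidefinite
```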
The matrix F is the guaranteed cost decentralized control gain for the uncertain system (4), (5).

Proof. For brevity, the detailed steps of the proof are omitted where standard tools are applied.

(i) ⇔ (ii): The proof is analogous to that in ([10]). (ii) ⇒ (i) is shown by taking V(t) = x(t)^T P(α) x(t) as a candidate Lyapunov function for (4) and writing δV(t), where δV(t) = V̇(t) for a continuous-time system and δV(t) = V(t + 1) − V(t) for a discrete-time system:

δV(t) = r12* δx(t)^T P(α) x(t) + r12 x(t)^T P(α) δx(t) + r11 x(t)^T P(α) x(t) + r22 δx(t)^T P(α) δx(t).   (16)

Substituting δx from (4) into (16) and comparing with (14) provides D-stability of the considered system when the latter inequality holds. The guaranteed cost is proved by summing or integrating both sides of the following inequality for t from 0 to ∞:

δV(t) < −x(t)^T [Q + C^T F^T R F C + A_C(α)^T S A_C(α)] x(t).

(i) ⇒ (ii) can be proved by contradiction.

(ii) ⇔ (iii): The proof follows the same steps as the proof of Lemma 4. (iii) ⇒ (ii) is proved in the standard way, multiplying both sides of (15) from the left by the full-rank matrix [I  A_C(α)^T] and from the right by [I ; A_C(α)]:

[ I   A_C(α)^T ] {l.h.s. of (15)} [ I ; A_C(α) ] < 0.

(ii) ⇒ (iii) follows from applying a Schur complement to (14), rewritten as

r12 P(α) A_C(α) + r12* A_C(α)^T P(α) + Q + C^T F^T R F C + r11 P(α) + A_C(α)^T [r22 P(α) + S] A_C(α) < 0,

therefore

[ X11  X12 ; X12^T  X22 ] < 0,

where

X11 = r11 P(α) + r12 P(α) A_C(α) + r12* A_C(α)^T P(α) + Q + C^T F^T R F C,
X12 = A_C(α)^T [r22 P(α) + S],
X22 = −[r22 P(α) + S],

which for H = r12 P(α), G = [r22 P(α) + S] gives (15).

The guaranteed cost control design is based on the robust stability condition (15). Since the matrix inequality (15) is not an LMI, we use an inner approximation for the continuous-time system, applying the linearization formula (13) together with the respective quadratic forms to obtain an LMI formulation, which is then solved by an iterative procedure.
4 PID robust controller design for continuous-time systems

The PID control algorithm is considered as

u(t) = K_P y(t) + K_I ∫_0^t y(τ)dτ + F_d C_d ẋ(t).   (17)

The proportional and integral terms can be included in the state vector in the common way, by defining the auxiliary state z = ∫_0^t y(τ)dτ, i.e. ż(t) = y(t) = Cx(t). The closed-loop system for the PI part of the controller is then

ẋ_n = [ẋ ; ż] = [ A + ∆A   0 ; C   0 ] [x ; z] + [ B + ∆B ; 0 ] u(t)

with

u(t) = FCx(t) + F_d C_d ẋ(t),   (18)

where FCx(t) and F_d C_d ẋ(t) correspond to the PI and D terms of the PID controller, respectively. The resulting closed-loop system with the PID controller (17) is then

ẋ_n(t) = A_C(α)x_n(t) + B(α)[F_d C_d   0] ẋ_n(t),   (19)

where the PI controller term is included in A_C(α). (For brevity we omit the argument t.) To simplify the notation, in the following we consider a PD controller (which is equivalent to assuming that the I term of the PID controller has already been included in the system dynamics in the way outlined above); the closed loop is then described by

ẋ(t) = A_C(α)x(t) + B(α)F_d C_d ẋ(t).   (20)

Let us consider the following performance index, which formally corresponds to (6):

J_s = ∫_0^∞ [x ; ẋ]^T [ Q + C^T F^T R F C   0 ; 0   S ] [x ; ẋ] dt.   (21)

Then for the Lyapunov function (9) we have the necessary and sufficient condition for robust stability with guaranteed cost in the form (14), i.e. for a continuous-time system:

[x ; ẋ]^T [ Q + C^T F^T R F C   P(α) ; P(α)   S ] [x ; ẋ] < 0.   (22)

The main result on robust PID stabilization is summarized in the next theorem.

Theorem 6. Consider the continuous-time uncertain linear system (1), (2) with PID controller (17) and cost function (21). The following statements are equivalent:

• The closed-loop system (19) is robustly D-stable with PDLF (9) and guaranteed cost with respect to the cost function (21): J ≤ J_0 = x^T(0)P(α)x(0).
• There exist matrices P(α) > 0 defined by (9) and H, G, F and F_d of the respective dimensions such that

[ A_Ci^T H^T + H A_Ci + Q + C^T F^T R F C    P_i − M_di^T H + G^T A_Ci ]
[ *                                          −M_di^T G − G^T M_di + S ]  < 0,   (23)

where A_Ci = (A_i + B_i FC) denotes the i-th closed-loop system vertex and M_di includes the derivative part of the PID controller: M_di = I − B_i F_d C_d.

Proof. Owing to (20), for any matrices H and G:

(−x^T H − ẋ^T G^T)(ẋ − A_C(α)x − B(α)F_d C_d ẋ) + (ẋ − A_C(α)x − B(α)F_d C_d ẋ)^T (−H^T x − G ẋ) = 0.   (24)

Summing up the l.h.s. of (24) and (22) and taking into account linearity w.r.t. α, we obtain condition (23).

Theorem 6 provides the robust stability condition for the linear uncertain system with a PID controller. Notice that the derivative term does not appear in a matrix inversion, which allows the uncertainty of the control matrix B to be included in the stability condition. For the PID control design, the unknown matrices H, G, F and F_d have to be solved from (23) (recall that A_Ci = (A_i + B_i FC) and M_di = I − B_i F_d C_d). The inequality (23) is then not an LMI; to cope with the products of unknown matrices, the linearizing approach using (13) has been adopted and an iterative PID control design algorithm based on an LMI (a 4×4 block matrix) has been developed. The resulting closed-loop system with the PD controller is

ẋ(t) = (I − B_i F_d C_d)^{-1}(A_i + B_i FC)x(t),   i = 1, ..., N.   (25)

The extension of the proposed algorithm to decentralized control design is straightforward: since the matrices F and F_d are assumed to have a prescribed structure, it is enough to prescribe the decentralized structure for both matrices.

5 Examples

In this section the major contribution of the proposed approach, the design of robust derivative feedback, is illustrated on examples. The results obtained using the proposed new iterative algorithm based on (23) to design PD controllers are provided and discussed. The impact of the choice of the matrix S is studied as well.
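The state augmentation for the PI part and the closed-loop map (25), both of which underlie the examples below, can be sketched as follows (generic helper functions written for this article; assumed shapes: A is n×n, B is n×m, C and C_d are l×n, F and F_d are m×l).

```python
import numpy as np

def augment_pi(A, B, C):
    """Augment the state with z = integral of y, so that a static gain on
    the augmented state realises the PI part: xn = [x; z], zdot = C x."""
    n, m = B.shape
    l = C.shape[0]
    An = np.block([[A, np.zeros((n, l))],
                   [C, np.zeros((l, l))]])
    Bn = np.vstack([B, np.zeros((l, m))])
    return An, Bn

def closed_loop_pd(A, B, C, Cd, F, Fd):
    """Closed loop of eq. (25): xdot = (I - B Fd Cd)^-1 (A + B F C) x."""
    n = A.shape[0]
    return np.linalg.solve(np.eye(n) - B @ Fd @ Cd, A + B @ F @ C)
```

With zero gains the closed-loop map reduces to the open-loop A, which is a convenient consistency check before inserting designed F and F_d matrices.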
We consider affine models of the uncertain system (1), (2) with a symmetric uncertainty domain:

ε̲_j = −q,   ε̄_j = q.   (26)

Example 1. Consider the uncertain system (1), (2) where:

A = [ −4.3650  −0.6723   −0.3363
       7.0880  −6.5570   −4.6010
      −2.4100   7.5840  −14.3100 ]

B = [ 2.3740   0.7485
      1.3660   3.4440
      0.9461  −9.6190 ]

C = C_d = [ 0 1 0
            0 0 1 ]

uncertainty parameter q = 1; uncertainty matrices:

Ã_1 = [ −0.5608   0.8553   0.5892
         0.6698  −1.3750  −0.9909
         3.1917   1.7971  −2.5887 ]

B̃_1 = [ −0.1602  −0.3521
         0.1162  −2.4839
        −0.1106  −4.6057 ]

Ã_2 = [  0.6698  −1.3750  −0.9909
        −2.8963  −1.5292  10.5160
        −3.5777   2.8389   1.9087 ]

B̃_2 = [  0.1562   0.1306
        −0.4958   4.0379
        −0.0306   0.8947 ]

The uncertain system can be described by four vertices; the corresponding maximal eigenvalues of the open-loop system in the vertices are, respectively: −4.0896 ± 2.1956i; −3.9243; 1.5014; −4.9595. Notice that the open-loop uncertain system is unstable (see the third vertex). A stabilizing optimal PD controller has been designed, with optimality considered in the sense of guaranteed cost w.r.t. the cost function (21) with matrices R = I_{2×2}, Q = 0.001 · I_{3×3}. The results summarized in Table 1 indicate the differences between the results obtained for different values of S.

Table 1: Example 1

S         | F (proportional part)              | F_d (derivative part)              | max. eig. in vertices
1e-6 · I  | [−1.0567 −0.5643; −2.1825 −1.4969] | [−0.3126 −0.2243; −0.0967  0.0330] | −4.8644; −2.4074; −3.8368 ± 1.1165i; −4.7436
0.1 · I   | [−1.0724 −0.5818; −2.1941 −1.4642] | [−0.3227 −0.2186; −0.0969  0.0340] | −4.9546; −2.2211; −3.7823 ± 1.4723i; −4.7751

Example 2. Consider the uncertain system (1), (2) where:

A = [ −2.9800   0.9300   0       −0.0340
      −0.9900  −0.2100   0.0350  −0.0011
       0        0        0        1
       0.3900  −5.5550   0       −1.8900 ]

Ã_1 = [ 0  1.5  0  0
        0  0    0  0
        0  0    0  0
        0  0    0  0 ]

B = [ −0.0320
       0
       0
      −1.6000 ]

B̃_1 = 0,

C = [ 0 0 1 0
      0 0 0 1 ]

The results are summarized in Table 2 for R = 1, Q = 0.0005 · I_{4×4} and various values of the matrix S in the cost function.
As indicated in Table 2, increasing values of S slow down the response (the maximal closed-loop eigenvalue is shifted towards zero), as assumed.

6 Conclusion

A new robust PID controller design method based on LMI is proposed for uncertain linear systems. The important feature of this PID design approach is that the derivative term appears in a form that enables the model uncertainties to be considered. Since a structured feedback matrix is assumed, this approach is appropriate for decentralized PID control design. The guaranteed cost control is proposed with a new quadratic cost function, including a derivative term for the state vector as a tool to influence the overshoot and response rate. The obtained results are illustrated on examples showing the robust PID control design and the influence of the choice of matrix S in the extended cost function.

Acknowledgment

The work has been supported by the Slovak Scientific Grant Agency, grant No. 1/3841/06.

Table 2: Comparison for various S, Example 2

S                  q_max    Max. closed-loop eigenvalue
10e-8 · I_{4×4}    1.10     −0.1890
0.1 · I_{4×4}      1.10     −0.1101
0.2 · I_{4×4}      1.10     −0.0863
0.29 · I_{4×4}     1.02     −0.0590

References

[1] Blondel, V. and J.N. Tsitsiklis (1997). NP-hardness of some linear control design problems. SIAM J. Control Optim., 35, pp. 2118-2127.
[2] Boyd, S., L. El Ghaoui, E. Feron and V. Balakrishnan (1994). Linear Matrix Inequalities in System and Control Theory. SIAM Studies in Applied Mathematics, Philadelphia.
[3] Han, J. and R.E. Skelton (2003). An LMI optimization approach for structured linear controllers. In: Proc. 42nd IEEE CDC, Hawaii, USA, pp. 5143-5148.
[4] Henrion, D., D. Arzelier and D. Peaucelle (2002). Positive polynomial matrices and improved LMI robustness conditions. In: 15th IFAC World Congress, Barcelona, Spain.
[5] de Oliveira, M.C., J. Bernussou and J.C. Geromel (1999). A new discrete-time robust stability condition. Systems and Control Letters, 37, pp. 261-265.
[6] de Oliveira, M.C., J.F. Camino and R.E. Skelton (2000). A convexifying algorithm for the design of structured linear controllers. In: Proc. 39th IEEE CDC, Sydney, Australia, pp. 2781-2786.
[7] Ge, Ming, Min-Sen Chiu and Qing-Guo Wang (2002). Robust PID controller design via LMI approach. Journal of Process Control, 12, pp. 3-13.
[8] Gyurkovics, E. and T. Takacs (2000). Stabilisation of discrete-time interconnected systems under control constraints. IEE Proceedings Control Theory and Applications, 147, no. 2, pp. 137-144.
[9] Peaucelle, D., D. Arzelier, O. Bachelier and J. Bernussou (2000). A new robust D-stability condition for real convex polytopic uncertainty. Systems and Control Letters, 40, pp. 21-30.
[10] Rosinová, D., V. Veselý and V. Kučera (2003). A necessary and sufficient condition for static output feedback stabilizability of linear discrete-time systems. Kybernetika, 39, pp. 447-459.
[11] Rosinová, D. and V. Veselý (2003). Robust output feedback design of discrete-time systems – linear matrix inequality methods. In: 2nd IFAC Conf. CSD'03 (CD-ROM), Bratislava, Slovakia.
[12] Skelton, R.E., T. Iwasaki and K. Grigoriadis (1998). A Unified Algebraic Approach to Linear Control Design. Taylor and Francis.
[13] Veselý, V. (2003). Robust output feedback synthesis: LMI approach. In: 2nd IFAC Conference CSD'03 (CD-ROM), Bratislava, Slovakia.
[14] Zheng Feng, Qing-Guo Wang and Tong Heng Lee (2002). On the design of multivariable PID controllers via LMI approach. Automatica, 38, pp. 517-526.

Danica Rosinová and Vojtech Veselý
Slovak University of Technology
Institute for Control and Industrial Informatics
Ilkovičova 3, 81219 Bratislava, Slovakia
{danica.rosinova;vojtech.vesely}@stuba.sk

Received: December 24, 2006

Danica Rosinová was born in Bratislava, Slovak Republic, in 1961. She received her MSc. and PhD from the Slovak University of Technology in 1985 and 1996.
Since 1985 she has been with the Department of Automatic Control Systems, now the Institute for Control and Industrial Informatics, at the Faculty of Electrical Engineering and Information Technology, STU Bratislava. Since 2006 she has been an associate professor. Her research interests concentrate on robust control, large-scale systems theory and optimization.

Vojtech Veselý was born in 1940. Since 1964 he has worked at the Department of Automatic Control Systems at the Faculty of Electrical Engineering and Information Technology, Slovak University of Technology in Bratislava. Since 1986 he has been a full professor. His research interests include power system control, robust control, decentralized control of large-scale systems, process control and optimization. He is the author and coauthor of more than 270 scientific and technical papers, and he has successfully supervised 19 PhD students to date.

International Journal of Computers Communications & Control
ISSN 1841-9836, 11(2):179-193, April 2016.

Design of a Fuzzy Networked Control Systems. Priority Exchange Scheduling Algorithm

H. Benítez-Pérez, J. Ortega-Arjona, J.A. Rojas-Vargas, A. Durán-Chavesti

H. Benítez-Pérez
Universidad Nacional Autónoma de México
Apdo. Postal 20-726, Admón. 20, Del. A. Obregón, México D.F., CP. 01000.
hector.benitez@iimas.unam.mx

Jorge Ortega-Arjona
Facultad de Ciencias UNAM
Av. Universidad 3000, C.U., México D.F.
jloa@ciencias.unam.mx

Jared A. Rojas-Vargas
IIMAS UNAM
Cto. Escolar 3000, C.U., México D.F.
jared_36_23@hotmail.com

A. Durán-Chavesti*
Universidad Nacional Autónoma de México
Apdo. Postal 20-726, Admón. 20, Del. A. Obregón, México D.F., CP. 01000.
*Corresponding author: adrian.chavesti@iimas.unam.mx

Abstract: This work presents a supervisory control strategy for networked control systems (NCSs). It covers the identification and control of the plant using fuzzy theory.
The fuzzy model incorporates the delay dynamics within the fuzzy rules, based upon a real-time hierarchical scheduling strategy. A hierarchical priority exchange scheduling algorithm is used, based upon a codesign strategy that follows the mutual correlation between the control and network algorithms, in order to bound the time delays. A magnetic levitation system is presented as a case study.

Keywords: fuzzy control, networked control system, time delay codesign.

Copyright © 2006-2016 by CCC Publications

1 Introduction

The control design and stability analysis of network-based control systems (NCSs) have been studied in recent years [14], [8] and [24] based upon a codesign strategy. The main advantages of this kind of system are low cost, small volume of wiring, distributed processing, simple installation, maintenance and reliability. In an NCS, one of the key issues is the effect of the network-induced delay on the system performance. The delay can be constant, time-varying, or even random, depending on the scheduler, network type, architecture, operating systems, etc. [24]. One strategy to be followed is codesign, since it takes both desired procedures into account. Nilsson analyzes several important facets of NCSs [15]. He introduces models for the delays in an NCS, first as a fixed delay, then as independently random, and finally as a Markov process. The author introduces optimal stochastic control theorems for NCSs based upon the independently random and Markovian delay models. [18] introduces static and dynamic scheduling policies for the transmission of sensor data in a continuous-time LTI system, together with the notion of the maximum allowable transfer interval (MATI), the longest time after which a sensor should transmit its data. [18] derived bounds on the MATI such that the NCS is stable. This MATI ensures that the Lyapunov function of the system under consideration is strictly decreasing at all times. [22] extends the
work of Walsh: he developed a theorem which ensures the decrease of a Lyapunov function for a discrete-time LTI system at each sampling instant, using two different bounds. These results are less conservative than those of Walsh, because the system's Lyapunov function is not required to be strictly decreasing at all times. Besides, although the strategy, as well as the case study, is similar to the work presented in [13], the fuzzy control proposed here follows each local time delay produced by the scheduling algorithm, which is dynamic and reactive to external task modifications (as priority exchange proposes). Although the results are stable in both cases, the challenge here is to dismiss dynamic local time delays without forcing the system bounds. It is important to mention that this work follows the expressions designed in [3] and [4], with the characteristic of real local time delays and local gain control design following Eqn. 10 and the LMI procedure presented in Section 4. [7], [17], [20] and [21] introduce a number of different linear matrix inequality (LMI) tools for analyzing and designing optimal switched NCSs. [23] takes into consideration both the network-induced delay and the time delay in the plant; a controller design method is proposed using the delay-dependent approach. An appropriate Lyapunov functional candidate is utilized to obtain a memoryless feedback controller, derived by solving a set of linear matrix inequalities (LMIs). [19] models the network-induced delays of NCSs as interval variables governed by a Markov chain. Using the upper and lower bounds of the delays, a discrete-time Markovian jump system with norm-bounded uncertainties is presented to model the NCSs. Based on this model, the H∞ state feedback controller can be constructed via a set of LMIs.
Recently, [9] introduced a new (descriptor) model transformation for delay-dependent stability of systems with time-varying delays in terms of LMIs; she also refines recent results on delay-dependent H∞ control and extends them to the case of time-varying delays. Alternatively, [10] considers both the network-induced delay and the time delay in the plant, and thus introduces a controller design method using the delay-dependent approach. An appropriate Lyapunov functional candidate is used to obtain a memoryless feedback controller, derived by solving a set of linear matrix inequalities (LMIs) [6]. [11] models the network-induced delays of NCSs as interval variables governed by a Markov chain. Using the upper and lower bounds of the delays, a discrete-time Markovian jump system with norm-bounded uncertainties is presented to model the NCSs. Based on this model, an H∞ state feedback controller can be constructed via a set of LMIs. An interesting approximation has been presented by [2], where the incorporation of time delays has been proposed following a state space representation.

2 Systems Proposal

Based on this review, this paper defines a model (Fig. 1) that integrates the time delays for a class of nonlinear systems, where the actual proposal is the enhancement of the states in order to represent the control and plant states, to fulfill a complete modeling of the time delays according to the priority exchange dynamic scheduling algorithm. It comprises two types of fuzzy rules: one that models the dynamics of the plant and another that introduces the network-induced time delay. It involves estimating the time delay based on the scheduling behaviour, where the fuzzy rules are of the form:

if x_i(k) is µ_ij then x^n_j(k + 1) = A_j x(k) + B_0 u(k)    (1)

i = 1...n,  j = 1...r,  h = 1...s

Figure 1: Fuzzy model proposed.
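The rule antecedents in (1) are evaluated with Gaussian membership functions and the resulting firing strengths are normalized into rule weights. A minimal Python sketch of that mechanism, with made-up centers and widths (illustrative values only):

```python
import math

def gaussian_mu(x, c, sigma):
    """Membership of state value x in a rule with center c and width sigma."""
    return math.exp(-((x - c) ** 2) / sigma ** 2)

def rule_weights(x_states, centers, sigmas):
    """Firing strength of each rule (product of memberships) and the
    normalized weights, which sum to one by construction."""
    psi = []
    for c_row, s_row in zip(centers, sigmas):   # one row per rule j
        p = 1.0
        for x, c, s in zip(x_states, c_row, s_row):
            p *= gaussian_mu(x, c, s)
        psi.append(p)
    total = sum(psi)
    return [p / total for p in psi]

# Two states, three rules; centers/widths are hypothetical
x = [0.008, 0.1]
centers = [[0.006, 0.0], [0.009, 0.0], [0.013, 0.0]]
sigmas  = [[0.003, 0.5], [0.003, 0.5], [0.003, 0.5]]
r = rule_weights(x, centers, sigmas)
print(r, sum(r))   # weights in (0, 1], summing to 1
```

The aggregated model output is then the weight-blended combination of the local rule consequents.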
The overall system is:

x̂(k + 1) = Σ_{j=1}^{r} x^n_j + Σ_{h=1}^{s} x^d_h = Σ_{j=1}^{r} r_j A_j x(k) + Σ_{h=1}^{s} s_h B_0 u(k)    (2)

where x_i is the i-th state of the plant, µ_ij is the membership function of the i-th state and the j-th rule, and s is the total number of local time delays. A_j ∈ R^{n×n}, B_{0,h} ∈ R^n, x ∈ R^n, u(k) ∈ R, with n states and r nominal fuzzy rules; s_h is the nominal selection of the current fuzzy rule. The superscripts n and d denote the nominal and the delayed model, respectively. The firing strength ψ_j is defined as the product of the membership functions µ_ij:

µ_ij = exp( −(x_i − c_ij)² / σ²_ij )    (3)

ψ_j = Π_{i=1}^{m} µ_ij    (4)

r_j = ψ_j / Σ_{k=1}^{r} ψ_k    (5)

0 < r_j ≤ 1,  Σ_{j=1}^{r} r_j(x) = 1    (6)

For the s fuzzy rules with delay τ_cah, ν_h is the Gaussian membership function of the time delay with center α_h and standard deviation β_h:

s_h = ν_h / Σ_{k=1}^{s} ν_k    (7)

ν_h = exp( −(τ_cah − α_h)² / β²_h )    (8)

The proposed decomposition in terms of a feedback state space representation has been reviewed by [4], where the indexing is defined by the time delays as local and bounded situations through the network. Firstly, with augmented states and the related bounded time delays of plant and controller, following the strategies presented in [3] and [4], the control structure is modified according to a particular control gain per local time-delay scenario and different local operational points from a particular case study. Here the strategy is modified by designing the local control laws as gains rather than as a dynamic state feedback control. The results shown in Section 5 are quite promising in that respect. The model is presented as follows:

x = [ x_c
      x_p ]    (9)

[ x_c(k + 1) ]   [ Σ_{j=1}^{n} Σ_{i=1}^{n} ( h_j h_i ( B^p_j x_c(k − t_cai) ) + h_j A^p_j x_p(k) ) ]
[ x_p(k + 1) ] = [ Σ_{j=1}^{n} Σ_{i=1}^{n} ( h_j h_i ( F^c_j ( C^i_p x_p(k − t_sci) ) ) + h_j F^c_j ) ]    (10)

where the delays are independent, based upon the times obtained from the scheduling approximation:

t_ca1 + t_sc1 < t_ca2 + t_sc2 < ...
... < t_cam + t_scm < T    (11)

Now, the difference of a candidate Lyapunov function is expressed as:

∆V(k) = V(k + 1) − V(k)    (12)

and the related Lyapunov function is:

V(k) = x(k)^T P x(k)    (13)

Each of the fuzzy rules is given as an expression of the local delays in the current condition, from the plant towards the controller and vice versa:

[ x_c ]       [ x_c(k); x_c(k − t_ca1); x_c(k − t_ca2); ...; x_c(k − t_cam);
[ x_p ]   =     x_p(k); x_p(k − t_sc1); x_p(k − t_sc2); x_p(k − t_sc3); ...; x_p(k − t_scm) ]    (14)

For each rule, there is a delay related to a particular condition of the plant and controller. Each of the rules may be updated through a learning procedure or an LMI process. Each of the rules is unique at every specific time. In this case, these are associated to a particular relationship of the last equation. In terms of the Lyapunov candidate, this is expressed as in Eqn. 15, which is consistent with Eqn. 8:

V(k + 1) − V(k) = [ x_c(k + 1); x_p(k + 1) ]^T P [ x_c(k + 1); x_p(k + 1) ] − [ x_c(k); x_p(k) ]^T P [ x_c(k); x_p(k) ]    (15)

V(k + 1) − V(k) = [ Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( B^p_j x_c(k − t_caj) ) + h_j A^p_j x_p(k) );
                    Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( F^c_j x_p(k − t_scj) ) ) ]^T P
                  [ Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( B^p_j x_c(k − t_caj) ) + h_j A^p_j x_p(k) );
                    Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( F^c_j ( C^i_p x_p(k − t_scj) ) ) ) ]
                  − [ x_c(k); x_p(k) ]^T P [ x_c(k); x_p(k) ]    (16)

Therefore:

V(k + 1) − V(k) = [ x_c(k + 1); x_p(k + 1) ]^T P [ x_c(k + 1); x_p(k + 1) ]
                  − [ x_c(k); x_c(k − t_ca1); ...; x_c(k − t_cam); x_p(k); x_p(k − t_sc1); ...; x_p(k − t_scm) ]^T
                    P [ x_c(k); x_c(k − t_ca1); ...; x_c(k − t_cam); x_p(k); x_p(k − t_sc1); ...; x_p(k − t_scm) ]    (17)

where t_caj and t_scj are the related time delays.
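For a fixed, stable closed-loop matrix, a matrix P for the Lyapunov candidate (13) that makes ∆V(k) in (12) negative can be computed by solving the discrete Lyapunov equation AᵀPA − P = −Q. A minimal NumPy sketch, using a hypothetical 2×2 matrix rather than the augmented system of (14):

```python
import numpy as np

A = np.array([[0.5, 0.1],
              [0.0, 0.8]])   # hypothetical stable discrete-time closed-loop matrix
Q = np.eye(2)

# Vectorize A^T P A - P = -Q using vec(A X B) = (B^T kron A) vec(X),
# which gives (A^T kron A^T - I) vec(P) = -vec(Q).
n = A.shape[0]
K = np.kron(A.T, A.T) - np.eye(n * n)
P = np.linalg.solve(K, -Q.flatten(order="F")).reshape((n, n), order="F")

residual = A.T @ P @ A - P + Q
print(np.max(np.abs(residual)))   # ~0: P certifies V(k+1) - V(k) < 0
```

Since A is Schur stable and Q is positive definite, the resulting P is positive definite, so V(k) = xᵀPx is a valid Lyapunov function for this matrix.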
Considering the fuzzy system representation:

V(k + 1) − V(k) = [ Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( B^p_j ( C^i_c x_c(k − t_caj) ) ) + h_i A^p_i x_p(k) );
                    Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( F^c_j ( C^i_p x_p(k − t_scj) ) ) ) ]^T P    (18)

                  [ Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( B^p_j ( C^i_c x_c(k − t_caj) ) ) + h_i A^p_i x_p(k) );
                    Σ_{j=2}^{m} Σ_{i=2}^{m} ( h_j h_i ( F^c_j ( C^i_p x_p(k − t_scj) ) ) ) ]
                  − [ x_c(k); x_c(k − t_ca1); ...; x_c(k − t_cam); x_p(k); x_p(k − t_sc1); ...; x_p(k − t_scm) ]^T
                    P [ x_c(k); x_c(k − t_ca1); ...; x_c(k − t_cam); x_p(k); x_p(k − t_sc1); ...; x_p(k − t_scm) ]    (19)

If only one of the time delays is considered:

0 > [ x_c(k + 1); x_p(k + 1) ]^T P [ x_c(k + 1); x_p(k + 1) ]
    − [ x_c(k); x_c(k − t_caj); x_p(k); x_p(k − t_scj) ]^T P [ x_c(k); x_c(k − t_caj); x_p(k); x_p(k − t_scj) ]    (20)

Here every time delay is local, independent and bounded according to the dynamic scheduling algorithm, which is based upon the structural codesign of the next section.

3 Structural Codesign

The codesign proposal follows the interaction between schedulability and stability analysis, following an online approximation. In fact, according to the dynamic scheduling algorithm proposal, which is based upon the structural codesign strategy, these time delays can be seen as a phase modification within the communication period of the involved processes. This scenario presents a complete phase modification of the entire system. The communication network plays a key role in defining the behavior of the dynamic system in terms of time variance, giving a nonlinear behavior. In order to understand such nonlinear behavior, time delays are incorporated through the use of real-time system theory, which allows time delays to be bounded even in the case of causal modifications due to external effects, based upon priority exchange [4].
This algorithm bounds the time delays through a real-time scheduling algorithm within the communication network. According to Fig. 3, structural reconfiguration takes place as a result of the priority exchange scheduling algorithm and the associated user request. This reconfiguration causes a control law modification [19], which is the actual control law reconfiguration. The scheduling approach potentially modifies the execution and communication frequency of tasks in order to give certain priority to some of them during a bounded time, as shown in Fig. 3. Furthermore, although in this kind of strategy tasks modify their priority, this implies neither that the periods nor the consumption times are modified. Therefore, the tasks have a bounded delay within the sampling time, which is reflected as a change in the phase. Potential modifications of the scheduling approach deploy changes in the priorities that affect the time delays and the respective control law. The delays are measured as ∆t and bounded within the inherent control period according to Eqn. 11. Now, by taking partial results from the scheduling algorithm, such as t_sj and the related ∆t, the actual time delays are used in the control law for parameter design. The involved time delays are depicted as τ_ij and come from this scheduling design. Other delays, such as actuator and control delays, are not used in the design of the control law, although they play an important role. Therefore, scheduling and control analysis merge together when the time delays are completely bounded, even in the case of time variance. The main restriction is in terms of predictable time delays. The objective here is to present a reconfiguration control strategy developed from the time delay knowledge, following a scheduling approximation where the time delays are known and bounded according to the scheduling algorithm used. The scheduling strategy proposed here aims to tackle local faults in terms of fault tolerance. In this situation, current time delays are inevitable.
The classical earliest deadline first (EDF) algorithm plus the priority exchange (PE) algorithm [4] are used here to decompose the time lines and the respective time delays when present. For instance, time delays are supervised for a set of n tasks with consumption times C_1, ..., C_n and periods T_1, ..., T_n    (21)

Priority is assigned by the well-known EDF algorithm, which establishes that the process with the closest deadline has the highest priority [12]. However, when an aperiodic task appears, it is necessary to deploy other algorithms to cope with concurrent conditions. To do so, the PE algorithm is used to manage the spare time left by the EDF algorithm. The PE algorithm [6] uses a virtual server that deploys a periodic task with the highest priority, in order to provide enough computing resources for aperiodic tasks. This simple procedure gives an approximate, deterministic, and dynamic behavior within the group of included processes. In this case, time delays can be deterministic and bounded. As an example, consider the group of tasks shown in Table 1. In this case, the consumption times as well as the periods are given in integer units. Remember: the server task is the time given for an aperiodic task to take place in the system.

Name     Consumption (in units)    Period (in units)
Task 1   2                         9
Task 2   1                         9
Task 3   2                         10
Server   1                         6

Table 1: First example for the PE algorithm.

The result of the ordering based upon PE is presented in Fig. 2.

Figure 2: Related organization for PE of the tasks in Table 1.

Based on this dynamic scheduling algorithm, time delays are given as current calculations in terms of task ordering. In this case, every time the scheduling algorithm takes place, the global time delays are modified in the short and long term. For instance, consider the following example, in which four tasks are set, and two aperiodic tasks take place at different times, giving different events with different time delays.
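For the task set of Table 1, schedulability can be sanity-checked before applying PE: the EDF utilization Σ C_i/T_i = 2/9 + 1/9 + 2/10 + 1/6 = 0.7 ≤ 1, and a unit-step EDF simulation over one hyperperiod confirms that no deadline is missed. The sketch below is my own illustrative simulator, not the authors' scheduler, and it treats the server as an ordinary periodic task:

```python
from math import lcm

tasks = [(2, 9), (1, 9), (2, 10), (1, 6)]   # (consumption, period), incl. server

# EDF bound for periodic tasks with implicit deadlines
util = sum(c / t for c, t in tasks)
print("utilization =", util)

def edf_simulate(tasks):
    """Unit-step EDF simulation over one hyperperiod.
    Returns True if every job meets its deadline."""
    horizon = lcm(*(t for _, t in tasks))
    jobs = []                                # [remaining units, absolute deadline]
    for now in range(horizon):
        for c, t in tasks:                   # release new jobs at period boundaries
            if now % t == 0:
                jobs.append([c, now + t])
        if any(rem > 0 and dl <= now for rem, dl in jobs):
            return False                     # a pending job passed its deadline
        ready = [j for j in jobs if j[0] > 0]
        if ready:
            min(ready, key=lambda j: j[1])[0] -= 1   # run earliest-deadline job
    return all(rem == 0 for rem, _ in jobs)

print(edf_simulate(tasks))
```

The slack visible in the simulation (utilization 0.7) is exactly what the PE server exchanges with aperiodic arrivals.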
The resulting task ordering, using the PE algorithm, is shown in Fig. 3, where time delays clearly appear. From this, a resulting ordering of different tiny time delays is given for two scenarios, as shown in Fig. 4. These two scenarios present two different local time delays that need to be taken into account beforehand, in order to settle the related delays according to the scheduling approach and control design. These time delays can be expressed in terms of local relations between both dynamical systems. These relations are the actual and possible delays, bounded as marked limits of possible and current scenarios. Then, the delays may be expressed as local summations with a high degree of certainty (as presented in [13]).

Name                      Consumption (in units)    Period (in units)
Task 1                    2                         9
Task 2                    1                         9
Task 3                    2                         10
Server                    1                         6
Aperiodic task 1 (AP1)    0.9                       occurs at 9
Aperiodic task 2 (AP2)    1.0                       occurs at 13

Table 2: Second example for PE.

Figure 3: Related time delays depicted according to both scenarios.

In this last example, during the second scenario, a total delay is given as:

total delay = consumption_time_delay_aperiodic_task1 + consumption_time_delay_task1 + t_sc2 + consumption_time_delay_task2 + consumption_time_delay_aperiodic_task2 + consumption_time_delay_task3

Now, from this example, l_p is equal to 2 and l_c is equal to 3, where l_p and l_c are the total numbers of local delays within one scenario from sensor to controller and from controller to actuator, respectively. In this case, the local time delays presented in the general Eqn. 14 are the result of the iteration of the scheduling algorithm. In the approximation presented in this paper, the local delays amount to around four time delays, as given by the last expression, called the total delay. The approach followed in the control reconfiguration does not take the scheduler decision into account in a direct manner.
It takes the time delays as bounded values, already defined, and uses them to design a suitable control law. Therefore, according to the current plant state values, the related fuzzy rule is selected. For an NCS, the communication network strongly affects the dynamics of the system, expressed as a time variance that exposes a nonlinear behaviour. Such nonlinearity is addressed by incorporating the time delays. From real-time system theory, it is known that time delays are bounded even in the case of causal modifications due to external effects.

4 Case Study

The case study consists of a simulation of a magnetic levitation system whose sensors and actuators are operated by a "host". The signals from the sensors are sent by the host through an Ethernet 10/100 network and received by a "server", where the control input is calculated and sent back over the Ethernet network to the host. Fig. 5 shows the current system configuration in its real state.

Figure 4: Current configuration of the magnetic levitation system.

The system consists of a coil inside a cabin; the coil levitates a steel ball that rests on a black post. The elevation of the ball is measured from the post using a light sensor inside the post. The aim of the experiment is to design a controller that levitates the steel ball following a desired trajectory.

Figure 5: Maglev system.

The nonlinear equations for the magnetic levitation system are:

ẋ1 = x2
ẋ2 = −K_m x3² / (2 M_b x1²) + g
ẋ3 = (1 / L_c)(−R x3 + u)

where R = R_c + R_s and:

u = V_c, input voltage
R_c, electromagnet resistance
R_s, resistor in series with the coil
K_m, constant of the electromagnet force
M_b, mass of the ball
g, gravitational constant
L_c, coil inductance

The values of the parameters are provided in [16].
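From the model above, the equilibrium coil current and input voltage for a desired ball position follow directly: setting ẋ2 = 0 gives x3* = x1*·sqrt(2 g M_b / K_m), and setting ẋ3 = 0 gives u* = R x3*. A small sketch, in which the numerical parameter values are placeholders (the real ones are provided in [16]):

```python
import math

def maglev_equilibrium(x1_des, Km, Mb, R, g=9.81):
    """Equilibrium current x3* and voltage u* for ball position x1_des,
    from x2_dot = -Km*x3^2/(2*Mb*x1^2) + g = 0 and x3_dot = (-R*x3 + u)/Lc = 0."""
    x3 = x1_des * math.sqrt(2.0 * g * Mb / Km)
    return x3, R * x3

# Placeholder parameters (illustrative only; see [16] for the real values)
Km, Mb, R = 6.5e-5, 0.068, 11.0
x1 = 0.009                                    # desired position, meters
x3, u = maglev_equilibrium(x1, Km, Mb, R)

# Verify the acceleration balance at the equilibrium point
x2_dot = -Km * x3**2 / (2.0 * Mb * x1**2) + 9.81
print(x3, u, x2_dot)                          # x2_dot ~ 0
```

Such equilibrium points are natural candidates for the operating points around which the local linear models of the fuzzy rules are built.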
The method generated three rules for the nominal fuzzy control, and the range of the delay was divided into six parts, so the delayed fuzzy control has six fuzzy rules. For the fuzzy model, three feedback vectors F_j were designed to ensure the stability of the overall system. Following Eqn. 10 and solving Eqn. 20 through LMI, it is possible to verify the stability in an asymptotic procedure.

5 Results

The fuzzy control laws are designed according to equations (17)-(20), where the objective is to find a common positive definite matrix P satisfying the linear matrix inequality. Two tests are performed to prove the effectiveness of the proposed method. In all tests, the reference trajectory applied is a sine signal to be followed by the steel ball. Three fuzzy rules are defined to approximate the magnetic levitation system by means of three linear models, as follows:

Rule 1: if x1(t) is about 0.006 m, then x(k + 1) = A1 x(k) + B1 u(k)
Rule 2: if x1(t) is about 0.009 m, then x(k + 1) = A2 x(k) + B2 u(k)
Rule 3: if x1(t) is about 0.013 m, then x(k + 1) = A3 x(k) + B3 u(k)

where x1 is the ball position in meters and

A1 = [ 1.0016  0.0010   0
       3.2718  1.0016  −0.0550
       0       0        0.9737 ]

A2 = [ 1.0011  0.0010   0
       2.1808  1.0011  −0.0175
       0       0        0.9737 ]

A3 = [ 1.0012  0.0010   0
       2.3774  1.0012  −0.0212
       0       0        0.9737 ]

B1 = B2 = B3 = [ 0; 0; 0.0024 ]

The control gains obtained by means of MATLAB's LMI toolbox are:

F1 = [ −51650  −1102  379 ]
F2 = [ −48530  −1058  341 ]
F3 = [ 22546   −479   128 ]

These control gain values guarantee the stability of the system in the presence of local time delays according to Table 3. In this case, the local time delays are responsive to an aperiodic external task that appears at determined times,
with the following positive definite matrix P:

P = [  0.1980   0.0042  −0.0007
       0.0042   0.0001  −0.0000
      −0.0007  −0.0000   0.0000 ]    (22)

In order to prove the effectiveness of the proposed method, two experiments were performed. In the first scenario, the plant tracks a reference signal (sine wave) and the transmission tasks were the following (Table 3):

Name             Consumption (in milliseconds)    Period (in milliseconds)
Task 1           2                                10
Task 2           1                                12
Task 3           2                                14
Aperiodic task   1                                90

Table 3: PE task set.

The task activation was implemented using Stateflow, as shown in Fig. 6, where, according to Table 3, task 1 is the controller transmission task and has priority one, task 2 is the sensor transmission task and has priority two, and tasks 3 and 4 are the transmission task and the sporadic transmission task from other nodes. The system response obtained in this first experiment is shown in Fig. 7 (without time delays).

Figure 6: Activation of the tasks following Table 3.

Figure 7: Ball position response in the first scenario.

The second test applies a time delay smaller than the sampling period, according to the total time delay. Fig. 8 shows the behavior of the system, which maintains stability with a delay of 2 ms and a sampling period of 90 ms related to the aperiodic task. The behavior is very similar to that of the system without time delay.

Figure 8: Ball position response in the second scenario.

6 Conclusion

A supervisory fuzzy control has been established to minimize the effects caused by the time delay due to communication over the network, designed through a codesign strategy. This approach introduces the time delay produced by the scheduling approach, named the priority exchange procedure. With this fuzzy model, a fuzzy control is designed and a stability analysis is proposed for this controller.
This approach shows that a system with a time delay smaller than the sampling period, but with a complex behaviour, maintains stability; the stability analysis for time-varying delay and a bound for this delay remain future work. Although the example related to the time delays is fairly illustrative, it becomes challenging in terms of the dynamic scheduling approach, where the local time delays are pursued according to Eqn. 10 in a general form and implemented through the Stateflow tool in each node.

Acknowledgments

The authors acknowledge the support of UNAM-PAPIIT IN100813, CONACYT 176556 and PICCO 10-53.

Bibliography

[1] Almeida, L. et al. (2002); The FTT-CAN protocol: why and how. IEEE Transactions on Industrial Electronics, 49(6):1189-1201.
[2] Benítez-Pérez, H. et al. (2013); Networked control systems design considering scheduling restrictions and local faults using local state estimation. International Journal of Innovative Computing, Information and Control (IJICIC), 9(8):3225-3239.
[3] Benítez-Pérez, H. et al. (2012); Networked control systems design considering scheduling restrictions. International Journal on Advanced Fuzzy Systems, http://dx.doi.org/10.1155/2012/927878.
[4] Benítez-Pérez, H. et al. (2012); Networked control systems design considering scheduling restrictions and local faults. International Journal of Innovative Computing, Information and Control (IJICIC), 8(10):8515-8526.
[5] Benítez-Pérez, H.; García-Nocetti, F. (2005); Reconfigurable Distributed Control, Springer.
[6] Buttazzo, G. (2004); Hard Real-Time Computing Systems, Springer.
[7] Czornik, A.; Swierniak, A. (2003); On direct controllability of discrete time jump linear system. Tech. rep., Mathematical Biosciences Institute, The Ohio State University.
[8] Eidson, J.C. et
al. (2002); IEEE-1588 standard for a precision clock synchronization protocol for networked measurement and control systems. IEEE Standard, 1588-2002.
[9] Fridman, E.; Shaked, U. (2003); Delay-dependent stability and H∞ control: constant and time-varying delays. International Journal of Control, 76(1):48-60.
[10] Lian, F. et al. (2002); Network design consideration for distributed control systems. IEEE Transactions on Control Systems Technology, 10(2):297-307.
[11] Liu, J. (2000); Real-Time Systems, Prentice Hall.
[12] Méndez-Monroy, P.E.; Benítez-Pérez, H. (2009); Supervisory fuzzy control for networked control systems. International Journal of Innovative Computing, Information and Control Express Letters, ICIC-EL, 3-2, 233-240.
[13] Méndez-Monroy, P.E.; Benítez-Pérez, H. (2011); Codesign strategy based upon fuzzy control for networked control systems and a scheduling algorithm. IEEE International Conference on Networking, Sensing and Control (ICNSC), 221-226.
[14] Moarref, M.; Rodrigues, L. (2015); Piecewise affine networked control systems. IEEE Transactions on Control of Network Systems, DOI 10.1109/TCNS.2015.2428452.
[15] Nilsson, J. (1998); Real-Time Control Systems with Delays. Ph.D. thesis, Lund Institute of Technology, Dept. of Automatic Control.
[16] Quanser Inc. (2006); Magnetic Levitation Experiment, Quanser Consulting.
[17] Tzes, A. et al. (2005); Development and experimental verification of a mobile client-centric networked controlled system. European Journal of Control, 11(3):229-241.
[18] Walsh, G.C. et al. (1999); Stability analysis of networked control systems. American Control Conference, 2876-2880.
[19] Wang, Y.; Sun, Z. (2007); H-inf control of networked control systems via LMI approach. International Journal of Innovative Computing, Information and Control, 3(2):343-352.
[20] Xiao, L. et al. (2000); Control with random communication delays via a discrete-time jump system approach. American Control Conference, 3:2199-2204.
design of a fuzzy networked control systems. priority exchange scheduling algorithm 193
[21] yu, m. et al. (2003); an lmi approach to networked control systems with data packet dropout and transmission delays, international journal of hybrid systems, 3(2): 3545-3550.
[22] zhang, w. (2001); stability analysis of networked control systems, ph.d. thesis, case western reserve university, dept. of electrical engineering and computer science.
[23] zhu, x. et al. (2008); state feedback control design of networked control systems with time delay in the plant, international journal of innovative computing, information and control, 4(2): 283-290.
[24] http://www.mathworks.com/help/robust/lmis.html (retrieved on july 20, 2015).

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. iv (2009), no. 1, pp. 82-91
development journey of qadpz, a desktop grid computing platform
monica vlădoiu, zoran constantinescu
monica vlădoiu, petroleum-gas university of ploieşti, department of informatics, bd. bucureşti, nr. 39, ploieşti, romania, e-mail: mmvladoiu@acm.org
zoran constantinescu, zealsoft ltd., str. tg. neamţ, nr. 60, bucureşti, romania, e-mail: zoran@unde.ro
abstract: in this paper we present qadpz, an open source system for desktop grid computing, which enables users of a local network or of the internet to share resources. qadpz allows a centralized management and use of the computational resources of idle computers from a network of desktop computers. qadpz users can submit compute-intensive applications to the system, which are then automatically scheduled for execution. the scheduling is performed according to the hardware and software requirements of the application. users can later monitor and control the execution of the applications. each application consists of one or more tasks.
applications can be independent, when the composing tasks do not require any interaction, or parallel, when the tasks communicate with each other during the computation. the paper describes both the qadpz functionality and the process of design and implementation, with focus on requirements, architecture, user interface and security. some future work ideas are also presented.
keywords: desktop grid computing, distributed and parallel computing.
1 introduction
grid computing and peer-to-peer (p2p) computing are both concerned with the pooling and coordinated use of resources within distributed communities, and are constructed as overlay structures that operate largely independently of institutional relationships [1]. the grid is foreseen as a system that coordinates distributed resources using standard, open, general-purpose protocols and interfaces to deliver nontrivial qualities of service [1, 2]. grid computing systems can be classified into two broad types: heavy-weight, feature-rich systems that provide access to large-scale, intra- and inter-institutional resources, such as clusters or multiprocessors, and desktop grids, in which cycles are scavenged from idle desktop computers. p2p networks are typically used for connecting nodes via largely ad-hoc connections. a pure p2p network does not have the notion of clients or servers, but only equal peer nodes that simultaneously function as both "clients" and "servers" to the other nodes on the network [3]. this paper deals with qadpz [’kwod ’pi: ’si:], an open source system for desktop grid computing, which enables users from a local network or the internet to share their resources [4, 5]. qadpz (quite advanced distributed parallel zystem) is a system for heterogeneous desktop grid computing that allows a centralized management and use of the computational resources of idle computers from a network of desktop computers.
qadpz users can submit compute-intensive applications to the system, which are then automatically scheduled for execution. applications can be independent, when the composing tasks do not require any interaction, or they can be parallel, when the tasks communicate with each other during the computation. thus, the system provides support for both task- and data-parallelism. here are some important features of qadpz [4]:
• native support for multiple operating systems: linux, windows, macos and unix;
• support for legacy applications, which for different reasons could not be rewritten;
• an object-oriented development framework that supports both low-level programming languages such as c and c++, and high-level language applications (such as lisp, python, or java), and that provides for using such applications in a computation;
copyright © 2006-2009 by ccc publications
• a master-worker model that is improved with some refined capabilities: pushing of work units, pipelining, sending more work units at a time, an adaptive number of workers, an adaptive timeout interval for work units, and the use of multithreading [6];
• a master can act as a client to another master. that makes it possible to create a distributed master, which consists of independent master nodes that communicate with each other, thus creating a virtual master;
• an extended c/c++ api, which supports the creation of lightweight tasks and parallel computing using the message-passing paradigm (mpi) [7];
• low-level optimizations: on-the-fly compression and encryption for communication. to increase performance, an experimental, adaptive compression algorithm, which can transparently choose from different algorithms, is also provided;
• efficient communication by using two different protocols (udp and tcp/ip);
• autonomic computing characteristics: self-knowledge, self-configuration, self-optimization and self-healing [8].
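the improved master-worker model enumerated above can be sketched as follows. this is a minimal, sequential simulation, not the actual qadpz api: the names and the batch-pushing policy are illustrative. the master pushes several work units at a time to each worker instead of waiting for individual requests, which helps hide communication latency:

```c
#define NUNITS   8   /* total work units in the job */
#define NWORKERS 2   /* simulated workers */
#define BATCH    2   /* work units pushed to a worker at a time */

/* a work unit's computation: here just squaring (illustrative payload) */
static int process_unit(int x) { return x * x; }

/* master loop: push BATCH units per worker until the job is exhausted,
   then reduce the collected results */
long run_master(void) {
    int input[NUNITS], result[NUNITS], next = 0;
    for (int i = 0; i < NUNITS; i++) input[i] = i;
    while (next < NUNITS) {
        for (int w = 0; w < NWORKERS && next < NUNITS; w++)
            for (int b = 0; b < BATCH && next < NUNITS; b++, next++)
                result[next] = process_unit(input[next]); /* "worker" runs it */
    }
    long sum = 0;
    for (int i = 0; i < NUNITS; i++) sum += result[i];
    return sum; /* sum of squares 0..7 */
}
```

in the real system the inner assignment would be an asynchronous message to a remote slave, and pipelining would overlap the transfer of the next batch with the computation of the current one.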
2 justification for a new desktop grid system
the idea of using the idle computational resources from existing desktop computers is not new, though the use of such distributed systems, especially in a research environment, has been limited. this is due to the lack of supporting applications, and to challenges regarding security, management, and standardization. the need to develop qadpz has arisen from the following main reasons:
o many existing systems were highly specialized in a very limited set of computationally challenging problems, and hence did not allow the execution of a general application. for example, seti@home was programmed to solve one specific task: the analysis of data from telescopes [9, 10]. similarly, distributed.net aimed to test the vulnerability of some particular encryption schemes [11];
o at the time of the development, the source code was generally not available, hence making difficult the extension or analysis of any new, non-standard application. commercial systems such as entropia, office grid and united devices offered numerous features, but they were not free [4, 12]. on the other hand, some open source systems were available, e.g. xtremweb [13], boinc [14, 15], condor [16], but they were limited in functionality;
o very few existing systems allowed specific considerations to be made with respect to the challenges of computationally intensive applications, especially those of scientific computing and visualization [4]. systems like boinc and bayanihan [12] allowed only task parallelism, where there was no communication between the running tasks during a computation. most computationally intensive applications need such communication;
o most of the existing systems usually had a complicated deployment procedure, requiring high-level, privileged access to the desktop computers, which made it very hard to use such systems on a larger scale, and also made further maintenance of the computers complicated, e.g. condor and the globus toolkit [12, 17, 18];
o many of today’s networks are heterogeneous, thus requiring a distributed computing system with support for various architectures and different types of operating systems. the java language provides the incentives for such requirements, and many java-based systems emerged: jxta, bayanihan, xtremweb, javelin [12]. there were very few systems supporting different architectures and operating systems in native mode, some of them being condor and boinc. there were also systems which ran on only one type of operating system, either windows or unix, thus limiting their usability in heterogeneous environments (for instance, entropia [12]).
3 qadpz requirements
given the reasons mentioned in the previous section, we have set up a set of requirements that a successful desktop grid computing system should satisfy to support computationally intensive applications. the overall goal of the system was to be friendly, flexible and suitable to a variety of needs. the main prerequisite has therefore been an open architecture that could evolve in pace with the needs and challenges of the real world. two sets of requirements for qadpz have been specified: one for the system as a whole, mostly from a functional point of view, and another for the system interface. additionally, a set of non-functional requirements that concern the development of the platform itself has been established. the system requirements are concerned mainly with the sharing and management of both resources and application jobs, in a heterogeneous environment. they also involve performance and usability of the system, as required by our conceptual model (extended master-worker). the system interface covers both user interfaces and programming interfaces [4, 6]. the system requirements are listed further on:
o resource sharing: idle computational cycles, storage space, specific data, etc.
of the desktop machines which contribute to the system;
o resource management: efficient management of the available shared resources, which remain under the control of their owners via use policies and specific mechanisms;
o job management: users should be able to submit, monitor and control the execution of computational jobs on the system;
o heterogeneity: the ability to work on a network of heterogeneous desktop computers, with different architectures (intel, risc, etc.) and different operating systems (unix, windows, mac os, linux);
o simple installation and minimal maintenance;
o parallel programming support: support for different parallel programming paradigms, for example both task- and data-parallelism, by using well-known standards;
o network support: the ability to work both in a lan environment and on the internet;
o communications: the higher-level communication protocol should rely on both tcp/ip and udp/ip, this dual support increasing the efficiency;
o autonomic features: support for different autonomicity aspects: self-management, self-optimization, self-healing, self-configuration, and self-knowledge;
o performance measurements, which could be exploited for better usage of the available resources;
o on-line/off-line support for both batch applications (the user submits jobs which will be executed at a later time) and interactive applications (the user can inspect the partial results and interact with the execution of the application).
the interface requirements can be split up into two parts. the first is the user interface, that is, the graphical interface which human users use to access the system; using this interface, the users can either monitor or control the behavior of the system. the other is the programming interface (api), which allows different user applications to interact with the system.
the interface requirements are listed below:
o personalization: different levels of access for various users, according to their personal skills and preferences;
o job management interface: a simple, platform-independent, graphical user interface to allow submission, monitoring and control of the different computational jobs;
o resource sharing interface: a simple, intuitive graphical user interface, which allows the control of shared resources.
the main non-functional requirements concern object-oriented programming, for its well-known advantages, and open source development, which is a natural choice for modern research, as it encourages integration, cooperation and the boosting of new ideas [19].
figure 1: the qadpz close-up architecture
4 qadpz architecture
the qadpz system has a centralized architecture, based on the client-server model, which is the most common paradigm used in distributed computing. in our case, the server manages the available computational resources of the desktop computers. the client is a process that needs computational services in order to accomplish a certain work. it sends a request to the server, in which it asks for the execution of a concrete task that is covered by the services. usually, the server carries out the task and sends back the result to the client. in our situation, the server has two parts: a single master, which accepts new requests from the clients, and multiple slaves, which handle those requests. the system consists of three types of entities: master, client, and slave (figure 1). the control and data flow in the system are separated. the data files (represented by binary, input, and output files) that are necessary to run the applications are not sent to the master. they are stored on one or more data servers. the smallest independent execution unit of qadpz is called a task.
to facilitate the management, multiple tasks can be grouped into jobs. different types of jobs can be submitted to the system: programs written in scripting languages (e.g. lisp, java, python), legacy applications and parallel programs (mpi). a job can consist of independent tasks, which do not require any kind of communication between them. this is usually called task parallelism. jobs can also consist of parallel tasks, where different tasks running on different computers can communicate with each other. inter-slave communication is accomplished using an mpi subset. the current implementation of qadpz considers only one central master node. this can be an inconvenience in certain situations, when computers located in different networks are used together. however, our high-level communication protocol between the entities allows a master to act as a client to another master, thus making it possible to create a virtual master consisting of independent master nodes which communicate with each other.
4.1 master
the main role of the master is to start and control the tasks, and to keep track of the availability, capabilities and configuration of the slaves. the master is responsible for managing the available resources, and it always has an up-to-date overview of the system resources. it knows which slaves can accept jobs for execution and how to contact them. it also schedules the computational tasks submitted by any authorized user. jobs are sent to the appropriate slave based on the hardware and software requirements from the job description. tasks can be started, stopped, or re-scheduled by the master. users create tasks that can be submitted to the master by using a client, which acts as an interface to the system. to make this possible, the master keeps a database of authorized users (figure 2).
figure 2: simplified uml diagram of qadpz’s architecture
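the requirement-driven scheduling described above (jobs sent to the slaves that match the hardware and software requirements in the job description) can be sketched as a simple matching routine; the structures and field names here are illustrative assumptions, not the actual qadpz data types:

```c
#include <string.h>

/* illustrative descriptions of a slave and of a task's requirements */
struct slave_info { const char *os; const char *arch; int is_free; };
struct task_req   { const char *os; const char *arch; };

/* return the index of the first free slave whose platform matches the
   task's requirements, or -1 when no suitable slave is available */
int schedule(const struct task_req *t, const struct slave_info *s, int n) {
    for (int i = 0; i < n; i++)
        if (s[i].is_free
            && strcmp(s[i].os, t->os) == 0
            && strcmp(s[i].arch, t->arch) == 0)
            return i;
    return -1;
}
```

a real master would also weigh load, speed and owner policies before picking among the matching slaves, and would re-schedule a task when its slave becomes unavailable.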
4.2 slave
each computer contributing with computing resources to the system is called a slave and has two roles. first, it has to report the shared resources to the master. these are mainly computational resources (cpu cycles), but can also be storage space, input or output devices, etc. the slave periodically sends to the master information about the system, which describes the hardware architecture of the slave (cpu type, cpu speed, physical memory, etc.), the software environment available on that architecture (operating system, available applications or libraries), and the resources available on that slave (figure 5). secondly, the slave can accept computational jobs from the master. after accepting a computational request from the master, the slave downloads the corresponding binaries and data files for the task, executes the task, and uploads the result files after finishing. this can be done only when the slave is free, and no interactive, local user is using the resources. the presence of a user logging into a slave computer is automatically detected, and the task is killed or moved to another slave to minimize the disturbance to the regular computer users. the slave decides for itself whether or not to accept a computational job to be run (by setting some configuration parameters). the user can configure different times of day when the slave may accept computational jobs, and can also disable the slave at any time. the slave component runs as a small background process on the user’s desktop. it starts automatically when the system starts. the program does not need any special privileges to run, which makes it very easy to install and control by an ordinary user. below we present a simple example of how to create a computational application to be executed on a slave.
// slavedumb - a simple example of how to create a computational job
#include "slaveserv.h"

// flags set by the callbacks below (declared here so the example is complete)
static int istaskstop = 0;
static int istaskctrl = 0;

// callback functions for notification from the slave service
void taskstop () {
  istaskstop = 1;
  dbug_print("taskstop");
}

void taskctrl (const char *arg) {
  istaskctrl = 1;
  dbug_print("taskctrl arg=%s", arg);
}

// this is the exec loop on each task-thread
int taskexec (char *data, char *datares, char *userdata) {
  int isfinished = 0;
  dbug_print("task started");
  // set callback functions
  q2adpz_slv_setcb_task_stop (taskstop);
  q2adpz_slv_setcb_task_ctrl (taskctrl);
  dbug_print("input data '%s'", data);
  // start main task loop
  while (!isfinished) {
    // do some crunching of the data
    { ... if (ok) isfinished = 1; }
    // task needs to be stopped
    if (istaskstop) {
      ...
      dbug_print("task stop executed");
      break;
    }
    if (istaskctrl) {
      ...
      q2adpz_slv_task_status (task_ok, "task ctrl");
      dbug_print("task ctrl executed");
    }
    // if crunching finished
    if (isfinished) {
      dbug_print("task finished res='%s'", datares);
      break;
    }
  } // while
  return 0;
}

4.3 client
the client represents the interface for submitting jobs into the system. there are two execution modes for the client: a batch mode and an interactive mode. in the batch mode, a project file describes a job by specifying the required resources and how to start the tasks. this information is sent to the master, which is responsible for scheduling the tasks. the client can detach from the master and connect later for the results. each project is described by using the xml language. in the interactive mode, the client remains connected to the master for the entire execution of the job. also, the client can get a direct connection to each of the slaves involved in the computation.
the client has a lot of freedom over the creation and control of new tasks: it can dynamically create new tasks, send messages to the tasks already in execution, and receive feedback from the running tasks, either through the master node, or by means of direct communication with the slaves running the respective tasks. an example of a job description in xml was listed beneath; the xml markup of that example was lost during text extraction, and only the element values survive: a job of three tasks, with data directory ./datafiles/, data server http://www-data/qadpz/cgi-bin, input file simple/source.txt, output file simple/dest.txt, and one task per platform, each with the numeric parameters 1, 1, 3600: a linux/i386 task (library http://www-data/qadpz/app/lib/linux/i386/libslv-app.so, executable simple), a win32/i386 task (library http://www-data/qadpz/app/lib/win32/i386/slv_app.dll, executable simple.exe), and a sunos/sun4u task (library http://www-data/qadpz/app/lib/sunos/sun4u/libslv-app.so, executable simple).
4.4 jobs, tasks and subtasks
the qadpz users can submit, monitor, and control computing applications to be executed on the computers that share resources. tasks can be binary programs, which can run on any of the sharing computers. a task comes in the form of an executable program, compiled for a specific architecture and operating system. for better performance, a task can also be in the form of a shared (dynamic) library, which can be loaded more efficiently by the slave program. as an alternative to native binary programs for a specific platform, a task can also be an interpreted or precompiled program. for example, it can be a compiled java application or an interpreted program (e.g. perl, python), which further needs, respectively, a java virtual machine or a specific interpreter on the host computer. multiple tasks which are related to each other can be grouped into a job, which is actually what a user submits to the system (see the job life cycle in figure 3). a job can be composed of one or more tasks. using jobs provides for easier structuring and management of the computational applications for both the user and the system.
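the grouping of tasks into a job described above can be sketched with two small records; the field names are assumptions for illustration, not the actual qadpz types. a job is considered finished only when every one of its component tasks has finished:

```c
#define MAXTASKS 8

/* illustrative task and job records */
struct task { int id; int done; };
struct job  { const char *owner; int ntasks; struct task tasks[MAXTASKS]; };

/* a job completes only when all of its component tasks have completed */
int job_done(const struct job *j) {
    for (int i = 0; i < j->ntasks; i++)
        if (!j->tasks[i].done)
            return 0;
    return 1;
}
```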
each job is assigned uniquely to one user; however, a user can have multiple jobs submitted to the system at the same time. the tasks that correspond to a job can be independent or not at execution time. tasks can further be divided into subtasks, consisting of finer work units that are executed within a task. subtasks are used for interactive applications, which require a permanent connection between a client and the slaves. they are usually generated at run-time at the client, and sent for execution to an already running task, which can solve them. the main reason for having subtasks is to execute smaller units of work efficiently, without the overhead of starting a new task each time.
figure 3: qadpz job life
4.5 user interface
the qadpz user interface provides for a user-friendly environment in which the user can interact with the system. this interaction mainly involves the submission, monitoring, and management of the submitted computational applications, along with resource monitoring and control. the first interface is the job-monitoring interface, a web-based interface that provides detailed information about all existing jobs in the system. the user can browse the jobs, see their status, and view their component tasks. s/he can also easily create new jobs and tasks. using this interface, each job can be stopped or deleted (figure 4). the second interface is also web-based and provides information related to the resources in the system. basically, it gives a list of the slaves registered in the system and their current status (figure 5). the owner of a desktop computer running a slave is given an interactive application which permits easy configuration of the slave. the user has complete control over the slave running on her computer.
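the owner-side controls described in section 4.2 and above (disabling the slave, allowed times of day, yielding to an interactive user) amount to a small acceptance policy on the slave. this is an illustrative sketch; the field names are assumptions rather than the real configuration parameters:

```c
/* illustrative owner policy for a slave */
struct slave_policy {
    int enabled;     /* the owner may disable the slave entirely */
    int start_hour;  /* jobs accepted only in [start_hour, end_hour) */
    int end_hour;
};

/* accept a job only if the slave is enabled, no interactive user is
   logged in, and the current hour falls inside the allowed window */
int accept_job(const struct slave_policy *p, int user_present, int hour) {
    if (!p->enabled || user_present)
        return 0;
    if (p->start_hour <= p->end_hour)                    /* e.g. 9-17 */
        return hour >= p->start_hour && hour < p->end_hour;
    return hour >= p->start_hour || hour < p->end_hour;  /* wraps midnight */
}
```

the same predicate, re-evaluated periodically, also covers the kill/migrate behaviour: when it turns false while a task is running, the slave stops the task and the master re-schedules it elsewhere.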
figure 4: job-monitoring web interface
figure 5: slave information/configuration interface
5 security
because of the unreliability of the udp protocol, which is our first option for the low-level communication protocol due to its benefits, it is not guaranteed that the execution tasks arriving at the slave computers have really been sent by the master. this is a serious security threat, since it would allow a malicious hacker to submit an arbitrary piece of code to the slave nodes (ip spoofing). for that reason, and at the cost of a decreased performance, all communication from clients to master and from master to slaves is encrypted and/or signed. in particular, the data flow from client to master has to be authorized by the name and password of a qadpz user, and encrypted using the master public key. a master private key signs the data flow from master to slaves, and its authenticity is verified using the master public key on the slave nodes. it is important to note that the data flow from slaves to master and from master to clients is neither encrypted nor signed, which means that a malicious hacker can monitor (packet sniffing) or alter (ip spoofing) the data or control information arriving back at master or client nodes, and thus put the slave nodes and/or the master node out of operation, modify the resulting data that are submitted by the slaves, or do any other kind of harm to the computational process. in other words, the current qadpz security scheme is designed to protect the security of the computers in the network, i.e. a malicious hacker cannot submit an alien piece of code to be executed instead of a user's computational task. however, this scheme does not protect the qadpz user data; we plan to provide optional data integrity in future versions of the system. security of the system is handled in two ways. on the one hand, only registered users are allowed to submit applications for execution.
this is done by using a user/password scheme, and it allows a simple access control to the computational resources. the qadpz system manages its own user database, completely independent of any of the underlying operating systems, thus simplifying users’ access to the system. the qadpz administrator can create new users by using some supporting tools. on the other hand, security involves the encryption of the messages exchanged between the various components of qadpz. this is done by using public-key encryption, and it provides an additional level of protection against malicious attacks.
6 conclusions and future work
the present paper reveals the development experience of qadpz, a desktop grid computing environment. we have summarized the main features of the system that make it a powerful platform for running computationally intensive applications. the reasons that have justified the endeavor of developing a new desktop grid platform are also presented. the qadpz requirements have included all the core capabilities that a successful desktop grid system should provide [12]. we presented the detailed architecture of the system, along with some of the design details. when we started this work, our main goal was to build an easy-to-use, open source system that provides the complex functionality that users expect from such a platform [4, 12]. it is worth mentioning that qadpz has already been downloaded by over a thousand users [20]. many of them use it for their daily tasks, and we have received valuable feedback from them [4]. further on, we present some future work ideas that aim to improve the qadpz system:
o many areas of the qadpz system are incomplete.
for example, many large-scale parallel problems require checkpointing: running a parallel application for hours or even days and losing all the results due to one failing node is unacceptable;
o data integrity is an important issue, especially in an open environment (the internet);
o improved support for user data security: computation result data can be encrypted and/or signed, so that the user of the system can be sure the received data is correct;
o users could be provided with different scheduling algorithms to choose from, according to the type of their problem;
o a more complete implementation of the mpi layer and the development of a complete library of the collective communication routines;
o adding a set of transparent profiling tools for evaluating the performance of the different components, which is crucially important when running parallel applications;
o decentralizing the system by employing p2p services, which would permit a group of users to form an ad-hoc set of shared resources, moving towards a p2p architecture;
o interconnection with a grid computing environment, which must be decentralized, robust, highly available, and scalable [21], while efficiently mapping application instances to the available resources in the system.
these future developments of qadpz subscribe to the belief that the vision that motivates both grid and p2p, i.e. that of "a worldwide computer within which access to resources and services can be negotiated as and when needed, will only become real if we are successful in developing a technology that combines important elements of p2p and grid computing" [1].
bibliography
[1] i. foster and a. iamnitchi, on death, taxes, and the convergence of peer-to-peer and grid computing, in 2nd int. workshop on p2p systems (iptps 2003), pp. 118-128, 2003.
[2] i. foster, c. kesselman, the grid: blueprint for a new computing infrastructure, boston: morgan kaufmann, 2004.
[3] j. i. khan and a. wierzbicki, eds., foundation of peer-to-peer computing, special issue, elsevier journal of computer communication, vol. 31, issue 2, feb. 2008.
[4] z. constantinescu, a desktop grid computing approach for scientific computing and visualization, phd thesis, norwegian university of science and technology, trondheim, norway, 2008.
[5] qadpz, [online] available: http://qadpz.sourceforge.net [accessed august 1, 2008].
[6] m. vladoiu, z. constantinescu, an extended master-worker model for a desktop grid computing platform (qadpz), in 3rd int. conference on software and data technologies (icsoft 2008), pp. 169-174, 2008.
[7] z. constantinescu, j. holmen, p. petrovic, using distributed computing in computational fluid dynamics, in 15th int. conf. on parallel computational fluid dynamics (parcfd 2003), pp. 123-129, 2003.
[8] z. constantinescu, towards an autonomic distributed computing environment, in 14th int. workshop on autonomic computing systems, 14th int. conf. on database and expert systems applications (dexa 2003), pp. 694-698, 2003.
[9] d. p. anderson, j. cobb, e. korpela, m. lebofsky, d. werthimer, seti@home: an experiment in public-resource computing, communications of the acm, vol. 45, no. 11, pp. 56-61, 2002.
[10] seti@home, [online] available: setiathome.ssl.berkeley.edu [accessed may 5, 2003].
[11] distributed.net, [online] available: http://distributed.net [accessed may 5, 2008].
[12] m. vladoiu, z. constantinescu, a taxonomy for desktop grids from users' perspective, in int. conference on parallel and distributed computing (icpdc 2008), world congress on engineering (wce 2008), pp. 599-605, 2008.
[13] c. germain, v. neri, g. fedak and f. cappello, xtremweb: building an experimental platform for global computing, in 1st ieee/acm workshop on grid computing (grid2000), pp. 91-101, 2000.
[14] d. p. anderson, boinc: a system for public-resource computing and storage, in 5th ieee/acm international workshop on grid computing, pp. 365-372, 2004.
[15] boinc: open source software for volunteer computing and grid computing, [online] available: http://boinc.berkeley.edu [accessed november 25, 2007].
[16] j. basney, m. livny, managing network resources in condor, in proc. of the 9th ieee symposium on high performance distributed computing (hpdc9), pp. 298-299, 2000.
[17] globus, [online] available: http://www.globus.org [accessed may 15, 2008].
[18] i. foster and c. kesselman, globus: a metacomputing infrastructure toolkit, intl. j. supercomputer applications, vol. 11, no. 2, pp. 115-128, 1997.
[19] j. cassens, z. constantinescu, free software: an adequate form of software for research and education in informatics?, in linuxtag 2003 conference, pp. 5-10, 2003.
[20] sourceforge, [online] available: http://sourceforge.net [accessed april 1, 2008].
[21] f. berman, g. fox, a.j.g. hey, grid computing: making the global infrastructure a reality, new york: j. wiley, 2003.
monica vlădoiu got her msc (1991) and phd (2002) in the department of computer science of the polytechnic university of bucharest, romania. since then, she has been with the dept. of informatics, petroleum-gas university of ploieşti (upg), romania. her main research interests include digital libraries, learning objects, multimedia databases, reflective and blended learning, desktop grid computing and e-society. she has published over 30 research papers concerning these topics and she has (co-)authored 3 books.
zoran constantinescu got his msc (1997) in the dept. of computer science of the polytechnic university of bucharest, romania. since then, he has been working both in the software engineering industry and in higher education. he got his doctoral degree in computer science (2008) from the norwegian university of science and technology, trondheim, norway.
his research interests include parallel and distributed computing, desktop grid computing, gps systems and embedded systems. he has published over 20 research papers dealing with the above-mentioned topics. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 2, pp. 149-160. a neural approach of multimodel representation of complex processes nesrine elfelly, jean-yves dieulot, pierre borne abstract: the multimodel approach was recently developed to deal with the issues of complex-process modeling and control. despite its success in different fields, it is still faced with some design problems, in particular the determination of the models and of an adequate method for computing validities. in this paper, we propose a neural approach to derive different models describing the process in different operating conditions. the implementation of this approach requires two main steps. the first step consists in exciting the system with a rich (e.g. pseudo-random) signal and collecting measurements. these measurements are classified by using an adequate kohonen self-organizing neural network. the second step is a parametric identification of the base-models, using the classification results for order and parameter estimation. the suggested approach is implemented and tested with two processes and compared to the classical modeling approach. the obtained results turn out to be satisfactory and show good precision. they also allow some interpretations to be drawn about the adequate validity-computation method based on the classification results. keywords: complex processes, modeling, multimodel approach, kohonen map 1 introduction nowadays, technological developments increase the complexity of systems. this complexity can be caused by non-linearity, instability, a wide operating domain, variations of system parameters or external perturbations.
as a result, it is often difficult or even impossible to propose a simple model which could account for the whole process complexity by using physical laws. consequently, it is very useful to focus on advanced and practical approaches in order to handle this complexity. the multimodel approach has recently been developed and applied in several science and engineering domains. it was proposed as an efficient and powerful method to cope with modeling and control difficulties when complex non-linear and/or uncertain processes are concerned. the multimodel approach supposes the definition of a set of models. it then becomes possible to replace the unique model by a set of simpler models, thus forming a so-called models' base. each model of this base describes the behavior of the considered process at a specific operating point. the objective of the multimodel approach is to decrease the process complexity by studying it under certain specific conditions. several researchers have been interested in multimodel analysis and control approaches [7, 8, 19] and many applications have been proposed in different contexts. in spite of its success in many fields (academic, biomedical, etc.), the multimodel approach remains confronted with several difficulties, such as the calculation of the models' validities, the adequate technique of fusion or switching between models, as well as the determination of the models' base. indeed, in 1985, takagi and sugeno [21] suggested a fuzzy process representation by the contribution of local models. this approach has been applied in many fields of activity but often faced problems related to the lack of information about the system structure or the uncertainty of its parameters. besides, it can sometimes lead to a large number of models, which generates a high computational burden when designing the control algorithm. this has led several researchers to develop other approaches in order to cope with these difficulties.
for uncertain complex systems with bounded parameters, some approaches were developed for the determination of the models' base [9, 11]. an extension for uncertain discrete systems has been proposed by mezghani [13]. the case of multivariable systems was addressed by raissi [16]. the previously named approaches require the knowledge of the parameters' variation limits, which is generally not available, in particular for uncertain systems whose parameter variation domains are unknown or poorly known. other related studies [12, 19] suggest the system be described by a set of local models, often defined by using a reference model and some linearization methods. in another context, both fuzzy logic and neural networks were used for multimodel control [1, 18]. an idea was to use neural approaches for complex systems modeling. these methods have the advantage of requiring very little information about the considered process and are useful for uncertain systems. within this context, very few studies [2, 3, 15, 17, 20, 22] were proposed, and they do not address strongly non-linear systems. in this paper, a neural approach for the determination of the models' base for uncertain complex systems is proposed, in particular for those which exhibit strong non-linearities. the proposed approach requires little a priori knowledge about the considered system; input/output information alone can be sufficient. in the following section, the different steps of the proposed modeling approach are detailed. two simulation examples and some interpretations are then presented. the evaluation of the suggested modeling strategy is the topic of the last section. the paper ends with a conclusion. 2 complex systems' modeling: neural approach in this section, a models'-base determination approach for uncertain complex systems is described.
this approach requires neither a global model definition nor the knowledge of parameter variation domains; only input/output information is needed. the suggested approach allows the generation of the base-models' structure and parameters. the application of this approach first requires classifying the numerical data by exploiting a kohonen map and determining the number of models. secondly, a structural and parametric identification of the different base-models is carried out by using the classification results. then, the validity index of each model is computed. finally, the multimodel output is obtained by the fusion of the models' outputs weighted by their validity indexes. 2.1 classification of the numerical data by using a kohonen map as a first step, the output or input/output measurements collected from the considered process have to be classified in order to identify operating clusters from which the models' base will be deduced. this classification is carried out by using a self-organizing kohonen map. self-organizing kohonen map methodology the self-organizing map (som) represents a specific kind of neural network. in its original form, the som was invented by the founder of the neural networks research center, professor teuvo kohonen, in 1981-82. the special property of the som is that it effectively creates spatially organized internal representations of various features of input signals and their abstractions [10]. the schematic representation of this network is given in figure 1. neurons in the target layer are ordered and correspond to cells of a bi-dimensional map. every neuron of the input layer is connected to every neuron of the output layer. the classification strategy consists in applying the kohonen rule. this rule is characterized by unsupervised competitive learning, where a competition takes place before the modification of the network weights. only the neurons which win the competition have the right to change their weights.
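this competitive update is compact in code; the following is an illustrative winner-take-all sketch, not the authors' implementation — the learning rate, epoch count and initialization are arbitrary choices made here, and the map-neighborhood update of a full som is omitted:

```python
import numpy as np

def kohonen_classify(data, n_clusters, n_epochs=50, lr=0.1, seed=0):
    """Winner-take-all Kohonen learning: each output neuron's weight
    vector drifts toward the center of one operating cluster."""
    rng = np.random.default_rng(seed)
    # weights initialized from randomly chosen input vectors
    weights = data[rng.choice(len(data), n_clusters, replace=False)].astype(float).copy()
    for _ in range(n_epochs):
        for x in rng.permutation(data):
            d = np.linalg.norm(weights - x, axis=1)        # distance to every neuron
            winner = np.argmin(d)                          # nearest neuron wins
            weights[winner] += lr * (x - weights[winner])  # only the winner moves
    labels = np.array([np.argmin(np.linalg.norm(weights - x, axis=1)) for x in data])
    return weights, labels
```

a full som also pulls the winner's map neighbors toward the input with a shrinking neighborhood radius; for the cluster identification discussed here, the pure competitive form is enough to illustrate the idea.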
the kohonen rule works as follows (figure 1: kohonen map): weights are initialized to random values; an input vector is presented to the network; the distance between the input vector and the weights connecting the inputs to each output neuron is computed; the neuron corresponding to the smallest distance, i.e. the one nearest to the input vector, wins the competition, and the weights connecting the inputs to this neuron are modified accordingly [5]. this procedure is repeated several times until the weights stabilize. at the end of the learning stage, the representative vectors of the different clusters and their centers are obtained. determination of the operating-system clusters the determination of the operating-system clusters requires firstly that the considered system be excited. secondly, the number of clusters has to be determined. the third step consists in classifying the data by using a kohonen neural network. the first step consists in applying an input signal and then collecting (output or input/output) measurements that will be used for classification. the excitation signal must be rich enough and persistently exciting, with well-chosen parameters, in order to allow a full excitation of the operating dynamics and to take into consideration the non-linear aspect of the considered process. for the second step, the method proposed by talmoudi has been adopted for the determination of the adequate number of clusters, which corresponds to the number of models [22]. within this scope, a kohonen network with a large number n of neurons in the output layer has been considered. at the end of the learning procedure, if the repartition of the classes is not good, the clusters i having a number of elements n_ci verifying relation (1) will have to be removed [22]: n_ci ≤ (1/2) (n_h / n), (1) where n_h represents the number of considered measurements.
otherwise, the number of neurons in the output layer is increased and the training is restarted. the same procedure is repeated until a satisfactory number of clusters is obtained. afterwards, the data classification is tackled by using a kohonen network for which the number of neurons in the input layer equals the number of system variables to be considered and the number of neurons in the output layer equals the number of clusters determined with the help of the method previously described. the classification results will then be exploited for the identification of the different base-models. 2.2 parametric identification of the base-models in this section, the orders of the models are estimated in a first step. the chosen method is the so-called instrumental determinant ratio test. this method is mainly based on conditions concerning a matrix called the "information matrix", which contains the input/output measurements [4]. this matrix is described as follows: q_m = (1/n_h) ∑_{k=1}^{n_h} [u(k), u(k+1), u(k−1), …, u(k−m+1), u(k+m)]^t [y(k+1), u(k+1), …, y(k+m), u(k+m)], (2) where n_h is the number of observations. the instrumental determinants' ratio (rdi) is given by: rdi(m) = |det(q_m) / det(q_{m+1})|. (3) for every value of m, the determination procedure of the order consists in building the matrices q_m and q_{m+1} and in evaluating the ratio rdi(m); the retained order m is the first value for which the ratio rdi(m) increases sharply. as a second step, the parametric identification issue consists in calculating the values of the parameters of the corresponding model equation, given several experimental measurements which describe the dynamic behavior of the system. as previously mentioned, the data classification gives a certain repartition of the clusters. for each cluster, input/output measurements are collected.
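the parametric identification mentioned next relies on recursive least squares; the sketch below is a standard textbook rls form for an arx structure (an illustration under assumptions made here, not necessarily the authors' exact implementation):

```python
import numpy as np

def rls_identify(u, y, order, lam=1.0):
    """Recursive least squares for an ARX model
    y(k) = -a1 y(k-1) - ... - a_m y(k-m) + b1 u(k-1) + ... + b_m u(k-m).
    Returns the parameter vector [a1..am, b1..bm]."""
    n_par = 2 * order
    theta = np.zeros(n_par)
    P = 1e4 * np.eye(n_par)                        # large initial covariance
    for k in range(order, len(y)):
        # regressor: past outputs (negated) and past inputs, most recent first
        phi = np.concatenate([-y[k - order:k][::-1], u[k - order:k][::-1]])
        K = P @ phi / (lam + phi @ P @ phi)        # gain vector
        theta = theta + K * (y[k] - phi @ theta)   # prediction-error update
        P = (P - np.outer(K, phi @ P)) / lam       # covariance update
    return theta
```

with lam = 1 this is plain rls; a forgetting factor slightly below 1 would let the estimates track the slowly varying parameters of the examples below.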
these measurements allow the identification of the corresponding model. for this, the recursive least-squares method (rls) [4] was applied to achieve the parameter estimation. 2.3 computation of validities the validity coefficient is a number belonging to the interval [0, 1]. it represents the relevance degree of each base-model, calculated at each instant. in the literature, several methods have been proposed to deal with the validity issue. in our study, the residues' approach was adopted for the calculation of the validities. this method is based on the measurement of the distance between the process and the considered model. for example, the residue can be given by the following expression: r_i = |y − y_i|, i = 1, …, n, (4) where n is the number of base-models, y is the process output and y_i is the output of the model m_i. if this residue value is equal to zero, the corresponding model m_i perfectly represents the process at that time. on the contrary, a non-null value indicates that the model m_i represents the system only partially. the normalized residues are given by: r′_i = r_i / ∑_{j=1}^{n} r_j. (5) within the context of the residues' approach, several methods have been proposed for the calculation of the validities [6, 13, 14]. only two methods will be considered here: the simple and the reinforced validities. the validities are given by: v_i = 1 − r′_i. (6) the simple and reinforced validities are defined by using the following formulas. simple validities: the normalized simple validities are defined so that their sum equals 1 at each time: v_simp_i = v_i / (n − 1). (7) reinforced validities: for this type of validities, the reinforcement expression is introduced as: v′_renf_i = v_i ∏_{j=1, j≠i}^{n} (1 − v_j). (8) the normalized reinforced validities can be written as follows: v_renf_i = v′_renf_i / ∑_{j=1}^{n} v′_renf_j. (9)
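equations (4)-(9) translate directly into a few lines of code; the sketch below is illustrative (the function name and array layout are choices made here, and at least one non-zero residue is assumed):

```python
import numpy as np

def validities(y, y_models):
    """Simple and reinforced validities at one instant.
    y: measured process output; y_models: outputs of the n base-models."""
    r = np.abs(y - np.asarray(y_models, dtype=float))   # residues, eq. (4)
    rp = r / r.sum()                                    # normalized residues, eq. (5)
    v = 1.0 - rp                                        # validities, eq. (6)
    n = len(y_models)
    v_simp = v / (n - 1)                                # simple validities, eq. (7)
    v_renf = np.array([v[i] * np.prod(np.delete(1.0 - v, i))
                       for i in range(n)])              # reinforcement, eq. (8)
    v_renf = v_renf / v_renf.sum()                      # normalization, eq. (9)
    return v_simp, v_renf
```

the multimodel output of the next subsection is then simply the validity-weighted sum of the model outputs, e.g. np.dot(v_renf, y_models).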
2.4 computation of the multimodel output the multimodel output is calculated by a fusion of the models' outputs weighted by their respective validity indexes, as illustrated by the following expression: y_mm(k) = ∑_{i=1}^{n} v_i(k) y_i(k), (10) where v_i(k) can be a simple or a reinforced validity, for which ∑_{i=1}^{n} v_i(k) = 1. 3 simulation examples in order to underline the interest and the performance of the proposed approach, some simulation examples are carried out. 3.1 example 1: second-order discrete system the considered system is a complex discrete system whose evolution is described by the following equation: y(k) = −a_1(k) y(k−1) − a_2(k) y(k−2) + b_1(k) u(k−1) + b_2(k) u(k−2). (11) the variation laws of the different parameters of the process are given by: a_1(k) = 0.04 sin(0.035k) − 0.8, a_2(k) = 0.005 sin(0.03k) + 0.1, (12) b_1(k) = 0.02 sin(0.03k) + 0.5, b_2(k) = 0.01 sin(0.035k) + 0.2. (13) first, the system is excited by a uniform random signal u(k). then, the measurements y(k) and y(k−1) are collected at different instants. the adequate number of clusters, determined by using the method described in section 2.1, is equal to three. the numerical data are fed into a kohonen network which presents two neurons in the input layer and three neurons in the output layer. the classification results are given in figure 2. from each of the three data sets relative to the various clusters, the orders and the parameters of the transfer functions relative to the three base-models are estimated. figure 3 shows the evolutions of rdi_i(m) for the three models.
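the identification data for this example can be reproduced directly from equations (11)-(13); a generation sketch follows (the horizon length, input range and seed are arbitrary choices made here, not taken from the paper):

```python
import numpy as np

def simulate_example1(n_steps=500, seed=0):
    """Simulate the slowly time-varying second-order system of eq. (11)
    under the parameter laws of eqs. (12)-(13)."""
    rng = np.random.default_rng(seed)
    u = rng.uniform(0.0, 1.0, n_steps)       # uniform random excitation
    y = np.zeros(n_steps)
    for k in range(2, n_steps):
        a1 = 0.04 * np.sin(0.035 * k) - 0.8
        a2 = 0.005 * np.sin(0.03 * k) + 0.1
        b1 = 0.02 * np.sin(0.03 * k) + 0.5
        b2 = 0.01 * np.sin(0.035 * k) + 0.2
        y[k] = -a1 * y[k-1] - a2 * y[k-2] + b1 * u[k-1] + b2 * u[k-2]
    return u, y
```

the pairs (y(k), y(k−1)) collected this way are the two-dimensional vectors fed to the kohonen network in this example.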
figure 2: classification results (y(k) versus y(k−1), with the clusters' centers). figure 3: evolutions of the rdi for the three base-models (m_1 = 2, m_2 = 2, m_3 = 2). it appears clearly that the estimated orders of the three models are equal to two. this result is predictable, since the considered system is a second-order one. the application of the recursive least-squares method allows us to write the following transfer functions: f_1(z^−1) = (0.33549 z^−1 − 0.047793 z^−2) / (1 − 0.83697 z^−1 − 0.039754 z^−2), (14) f_2(z^−1) = (0.41086 z^−1 − 0.021659 z^−2) / (1 − 0.8633 z^−1 − 0.055068 z^−2), (15) f_3(z^−1) = (0.36985 z^−1 − 0.0079934 z^−2) / (1 − 0.9234 z^−1 − 0.060727 z^−2). (16) in order to evaluate the obtained global model, a validation step is worked out, where inputs different from those used for classification are fed into the system. then, the real and the multimodel outputs are compared. let us consider the following input sequence: u(k) = 1 + sin(0.08k). (17) the validation results are given in figures 4 and 5, with y the real output of the system and y_mm_r, y_mm_s the multimodel outputs obtained by using respectively the methods of reinforced and simple validities: y_mm_r(k) = ∑_{i=1}^{n} v_renf_i(k) y_i(k), y_mm_s(k) = ∑_{i=1}^{n} v_simp_i(k) y_i(k), (18) and e_r, e_s the relative errors between the real and the multimodel outputs: e_r(k) = |(y(k) − y_mm_r(k)) / y(k)|, e_s(k) = |(y(k) − y_mm_s(k)) / y(k)|. (19)
figure 4: real and multimodel outputs (y(t), y_mm_r(t), y_mm_s(t)). figure 5: evolutions of the relative errors (e_r(t), e_s(t)). it can be seen that the multimodel output obtained by the fusion of the base-models' outputs weighted by the reinforced validities follows the real output with a negligible error (e_r(t)). this error is more important when the simple validities method is applied (e_s(t)). this allows us to conclude that, for this kind of system, the reinforced validities method is more appropriate than the simple validities one. 3.2 example 2: second-order continuous system with input/output-dependent parameters as a second simulation example, we consider the system whose evolution is described by the following equations: a_0 y + a_1 ẏ + ÿ = u + b u̇, (20) a_0(y) = 0.3 + sigm(y − 2), a_1(y) = sat(y^2), b(u) = sat(u), (21) where sigm is the sigmoid function and sat is the saturation function. the considered system is complex and strongly non-linear, with parameters being functions of both the input and the output, which makes the modeling task difficult. the chosen excitation signal (figure 6: evolution of the excitation signal) is a sine curve distorted by a uniform random signal, since this input is richer than a simple random signal and allows considering the complex and non-linear aspects of the system. it is worth noting that the signal parameters (frequency, amplitude) need to be adjusted in order to obtain good results. moreover, in order to describe the system dynamics, the number of variables used for classification is increased: not only the output data y(k) and y(k−1) are considered, but also the input data u(k−1).
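assembling the three-dimensional classification vectors [y(k), y(k−1), u(k−1)] from the sampled signals is a simple sliding-window operation; a sketch (the helper name is a choice made here, for illustration):

```python
import numpy as np

def build_features(u, y):
    """Stack [y(k), y(k-1), u(k-1)] rows for the Kohonen classification
    of the second example (one row per usable instant k)."""
    y = np.asarray(y)
    u = np.asarray(u)
    # y[1:] is y(k), y[:-1] is y(k-1), u[:-1] is u(k-1)
    return np.column_stack([y[1:], y[:-1], u[:-1]])
```

each row of the returned matrix is one input vector for the three-neuron input layer of the kohonen network used in this example.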
after generating the output, the input and output signals are sampled with an adequate sampling period in order to collect the different measurements: y(k), y(k−1) and u(k−1). once the number of clusters is determined, the numerical data are presented to a kohonen network with three neurons in both the input and the output layers. at the end of the learning procedure, three data sets are obtained (figure 7: three data sets relative to the different clusters, in the (y(k), y(k−1), u(k−1)) space, with the clusters' centers), each of which is used for the identification of the corresponding model. the application of the instrumental determinant ratio test allows us to determine the three models' orders, which are respectively two, three and two. the corresponding transfer functions are given by the following expressions: f_1(z^−1) = (0.058303 z^−1 + 0.052302 z^−2) / (1 − 0.43246 z^−1 − 0.45748 z^−2), (22) f_2(z^−1) = (−0.023815 z^−1 + 0.0013593 z^−2 + 0.031481 z^−3) / (1 − 0.68802 z^−1 − 0.24025 z^−2 − 0.072128 z^−3), (23) f_3(z^−1) = (0.12995 z^−1 − 0.014359 z^−2) / (1 − 0.64023 z^−1 − 0.37958 z^−2). (24) the validation step consists in applying the following input sequence: u(k) = 1 + sin(k). (25) the validation results are illustrated by figures 8 and 9 (figure 8: real and multimodel outputs; figure 9: evolutions of the relative errors). contrary to the first example, the results obtained by the application of the simple validities method are much better than those given by the reinforced validities method.
in fact, figures 8 and 9 show that the multimodel output y_mm_s(t), deduced by fusion of the base-models' outputs weighted by the simple validities, follows the real output almost perfectly, with an error e_s(t) that is nearly null compared to the error e_r(t), which is relatively important. in order to highlight the capacity of the identified models to reproduce the operating system in different domains, let us consider another input sequence, given by: u(k) = 1.2 + 1.5 sin(2k). (26) the multimodel output is generated by application of the simple validities method. the result is given in figure 10 (figure 10: real and multimodel outputs, second validation). 3.3 interpretations referring to the obtained simulation results, it can be noticed that the application of the suggested approach allows a good modeling of the considered systems. an important interpretation can also be made about the use of the validity-calculation methods. as remarked before, in some cases the simple validities give better results, and in other cases it is preferable to use the reinforced validities. further simulations and observations were therefore worked out, leading to the conclusion that the choice of the validity-calculation method depends on the classification results, i.e. the clusters' structure and repartition. thus, it can be noted that when there are several variations within the same cluster and when an overlapping between clusters occurs, which is the case in the second simulation example (figure 7), it is worthwhile to use the simple validities method, since it takes into account the different models' outputs, as expressed by (7). in this case, no model can ideally represent the process at any time. but when the clusters present very few variations and are well separated (figure 2), the reinforced validities method is better adapted.
the application of this method, thanks to the reinforcement expression (8), promotes the contribution of the most dominant model, which best represents the process behavior. 4 evaluation of the suggested modeling strategy in order to highlight the interest and the performance of the proposed modeling strategy, the classical modeling approach, involving the identification of a global model, was carried out. then, the results given by the suggested approach were compared to those given by the classical modeling strategy. let us consider the second simulation example, described by equations (20) and (21), and the excitation signal given by figure 6. by using the instrumental determinant ratio test for the estimation of the order and the recursive least-squares method for the parametric identification, the transfer function f(z^−1) of the global model is given by the following expression: f(z^−1) = (0.03601 z^−1 + 0.0035236 z^−2) / (1 − 0.55726 z^−1 − 0.42595 z^−2). (27) by using the same numerical data, the classification and identification steps give the three models described by equations (22), (23) and (24). the input sequence given by equation (25) was applied again. figure 11 represents the evolutions of the real, the multimodel and the global model outputs (figure 11: evolutions of the real output y(t), the multimodel output y_mm_s(t) and the global model output y_g(t)). the multimodel output is obtained by fusion of the models' outputs weighted by their simple validity degrees calculated at each instant. the adopted multimodel approach offers a very satisfactory precision compared to the classical modeling approach based on the unique global model. 5 conclusion in this paper, a neural approach of multimodel representation is proposed.
this approach is applicable when dealing with complex, strongly non-linear and uncertain processes. it allows the determination of the models' base by using a kohonen network and two methods of structural and parametric identification. the different steps were detailed. the multimodel output is obtained by using the fusion technique and the adequate validity-computation method. the suggested approach has been implemented and tested for different complex systems. the simulation results, two of which were described in this paper, prove the efficiency and the precision of the proposed modeling strategy and show that the method works well with various processes, even highly complex ones. some interpretations have been made about the choice of the adequate validity-calculation method to be applied. furthermore, in order to demonstrate the performance and the relevance of the suggested approach, a comparison with the classical modeling approach has been made. bibliography [1] m. a. al-akhras, g. m. aly, r. j. green, neural network learning approach of intelligent multimodel controller, iee proceedings on control theory and applications, vol. 143, pp. 395-400, 1996. [2] i. s. baruch, r. l. beltran, j-l. olivares, r. m. garrido, a fuzzy-neural multi-model for mechanical systems identification and control, mexican international conference on artificial intelligence no. 3, vol. 2972, pp. 774-783, 2004. [3] i. s. baruch, f. thomas, j-m. flores, e. gortcheva, a multimodel recurrent neural network for systems identification and control, ieee international joint conference on neural networks, vol. 2, pp. 1291-1296, 2001. [4] r. ben abdennour, p. borne, m. ksouri, f. m'sahli, identification et commande numérique des procédés industriels, editions technip, paris, france, 2001. [5] p. borne, m. benrejeb, j. haggège, les réseaux de neurones, editions technip, paris, france, 2007. [6] f. delmotte, l.
dubois, p. borne, a general scheme for multi-model controller using trust, mathematics and computers in simulation, vol. 41, pp. 173-186, 1996. [7] a. el kamel, m. ksouri-lahmari, p. borne, contribution to multimodel analysis and control, studies in informatics and control, vol. 9, pp. 29-38, 2000. [8] t. a. johansen, b. a. foss, editorial: multiple model approaches to modelling and control, international journal of control, vol. 72, p. 575, 1999. [9] z. kardous, a. el kamel, n. ben hadj braiek, p. borne, on the quadratic stabilization in discrete multimodel control, ieee conference on control applications, vol. 2, pp. 1398-1403, 2003. [10] t. kohonen, the self-organizing map, ieee proceedings, vol. 78, pp. 1464-1480, 1990. [11] m. ksouri-lahmari, p. borne, m. benrejeb, multimodel: the construction of model bases, studies in informatics and control, vol. 3, pp. 199-210, 2004. [12] d. j. leith, w. e. leithead, analytic framework for blended multiple model systems using linear local models, international journal of control, vol. 72, pp. 605-619, 1999. [13] s. mezghani, a. el kamel, p. borne, multimodel control of discrete systems with uncertainties, international journal of studies in informatics and control, 2000. [14] k. s. narendra, j. balakrishnan, adaptive control using multiple models, ieee transactions on automatic control, vol. 42, pp. 171-187, 1997. [15] j. c. principe, l. wang, m. a. motter, local dynamic modeling with self-organizing maps and application to nonlinear system identification and control, ieee proceedings, vol. 86, pp. 2240-2258, 1998. [16] t. raissi, a. el kamel, w. byrski, p. borne, multimodel analysis and control of multivariable systems, ieee international conference on systems, man and cybernetics, vol. 1, pp. 640-645, 2001. [17] m. ronen, y. shabtai, h. guterman, hybrid model building methodology using unsupervised fuzzy clustering and supervised neural networks, biotechnology and bioengineering, vol. 77, no. 4, pp. 420-429, 2002.
[18] n. sadati, a. talasaz, robust fuzzy multimodel control using variable structure system, ieee conference on cybernetics and intelligent systems, vol. 1, pp. 497-502, 2004. [19] r. shorten, r. murray-smith, r. bjorgan, h. gollee, on the interpretation of local models in blended multiple model structures, international journal of control, vol. 72, pp. 620-628, 1999. [20] r. r. selmic, f. l. lewis, multimodel neural networks identification and failure detection of nonlinear systems, ieee conference on decision and control, vol. 4, pp. 3128-3133, 2001. [21] t. takagi, m. sugeno, fuzzy identification of systems and its applications to modelling and control, ieee transactions on systems, man and cybernetics, vol. 15, pp. 116-132, 1985. [22] s. talmoudi, r. ben abdennour, a. kamel, p. borne, a systematic determination approach of a models base for uncertain systems: experimental validation, ieee international conference on systems, man and cybernetics, vol. 6, pp. 73-81, 2002. nesrine elfelly(1), jean-yves dieulot(2), pierre borne(3): (1) université des sciences et technologies de lille (ustl), (2) ecole polytechnique de lille, (3) ecole centrale de lille (ec lille), laboratoire d'automatique, génie informatique et signal, ecole centrale de lille, cité scientifique, bp 48, 59651 villeneuve d'ascq cedex, france. e-mail: nesrine.elfelly@ed.univ-lille1.fr, jean-yves.dieulot@polytech-lille.fr, pierre.borne@ec-lille.fr. received: december 4, 2007. nesrine elfelly was born in monastir, tunisia, in 1982. she received the engineer diploma degree in computer engineering from the "ecole nationale d'ingénieur de tunis" (tunisia) in 2006. she obtained the master of automatic control, computer engineering and image processing from the "ecole centrale de lille" (france) in 2007 and the master of automatic control and signal processing from the "ecole nationale d'ingénieur de tunis" in the same year.
she is currently a phd student in automatic control and applied computing within the framework of the lagis-ustl and lara-enit cooperation. her current research interests are the applications of neural approaches for complex systems analysis and control. jean-yves dieulot graduated from the french ecole d'ingénieurs institut industriel du nord in 1990 and obtained a phd from the university of lille in 1993. he is currently associate professor at ecole polytechnique universitaire de lille, france, and with lagis (laboratory of automatic control). his main interests are in fuzzy, nonlinear and robot control. pierre borne was born in corbeil, france, in 1944. he received the master degree of physics in 1967 and the masters of electronics, of mechanics and of applied mathematics in 1968. the same year he obtained the diploma of "ingénieur idn" (french "grande ecole"). he obtained the phd in automatic control of the university of lille in 1970 and the dsc in physics of the same university in 1976. he became doctor honoris causa of the moscow institute of electronics and mathematics (russia) in 1999, of the university of waterloo (canada) in 2006 and of the polytechnic university of bucharest (romania) in 2007. he is author or coauthor of about 200 journal articles and book chapters, of 35 plenary lectures and of about 250 communications in international conferences. he has been the supervisor of 69 phd theses and is the author of 20 books. he is a fellow of ieee and has been president of the ieee/smc society in 2000 and 2001. he is presently professor "de classe exceptionnelle" at the ecole centrale de lille and director of the french "plan pluriformation" of research in automatic control. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iv (2009), no. 3, pp. 234-243. p-lingua 2.0: a software framework for cell-like p systems manuel garcía-quismondo, rosa gutiérrez-escudero, miguel a. martínez-del-amor, enrique orejuela-pinedo, i.
pérez-hurtado
research group on natural computing, dpt. of computer science and artificial intelligence, university of sevilla, avda. reina mercedes s/n, 41012 sevilla, spain
e-mail: mangarfer2@alum.us.es, {rgutierrez, mdelamor, orejuela, perezh}@us.es
received: april 5, 2009 accepted: may 30, 2009

abstract: p-lingua is a programming language for membrane computing. it was first presented in edinburgh, during the ninth workshop on membrane computing (wmc9). in this paper, the models, simulators and formats included in p-lingua version 2.0 are explained. we focus on the stochastic model, the associated simulators and the updated features. finally, we present one of the first applications based on p-lingua: a tool for describing and simulating ecosystems.
keywords: programming languages, software development, p systems, membrane computing, p-lingua

1 introduction

membrane computing (or cellular computing) is a branch of natural computing that was introduced by gh. păun [10]. the main idea is to consider biochemical processes taking place inside living cells from a computational point of view. the initial definition of this computing paradigm is very flexible and many different models have been defined. each model displays characteristic semantic constraints that determine the way in which rules are applied. hence, the need for software simulators capable of taking into account different scenarios when simulating p system computations comes to the fore. an initial approach could be to define inputs for each simulator specifically. nevertheless, this approach involves defining new input formats for each simulator, so designing simulators would take great effort. a second approach could be to standardize the simulator input, so that all simulators process inputs specified in the same format.
these two approaches raise a trade-off. on the one hand, specific simulator inputs can be defined in a more straightforward way, as the format used is closer to the features of the p system to simulate. on the other hand, although the latter approach involves analyzing different p systems and models in order to develop a standard format, there is no need to develop a new simulator every time a new p system is to be simulated, as it is possible to specify it in the standard input format. moreover, researchers would not have to devise a new input format every time they specify a p system, and would not need to change the way they specify the p systems to be simulated every time they move on to another model, as they would keep on using the standard input format. this second approach is the one considered in the p-lingua project, a programming language whose first version, presented in [3], is able to define p systems within the active membrane p system model with division rules. the authors also provide software tools for compilation, simulation and debugging tasks. from now on, we will call this version of the language and its associated tools p-lingua 1.0. as p-lingua is intended to become a standard for p system definition, it should also consider other models. in this paper, we present p-lingua 2.0 as a framework to define cell-like p system models, including several algorithms to simulate p system computations for the supported models (from now on, simulators), as well as different formats to represent p systems, with associated parsers to translate between them. this paper is structured as follows. in section 2, the models supported at this stage are enumerated. the next section introduces some algorithms used to simulate p systems, focusing on the stochastic and the probabilistic p system models.
in section 4, different file formats to represent cell-like p systems are presented, among them the p-lingua 2.0 programming language. model definitions, simulators and parsers have been encoded in a java library, plinguacore©, presented in section 6; this library is free software and it can easily be extended. command-line tools to compile files and simulate p systems have been slightly modified; these changes are presented in section 5. the next section introduces one of the first applications of p-lingua, a software tool to describe and simulate ecosystems. finally, some conclusions and future work are enumerated in section 8.

2 supported p system models

the supported models developed so far are enumerated below, but a standard mechanism for defining new cell-like models has been included in the p-lingua 2.0 framework. each model displays characteristic semantic constraints concerning the rules to be applied, such as the number of objects specified on the left-hand side, membrane creation, polarization, and so on. it is possible to define additional models by including the corresponding semantic constraints within the plinguacore java library. this mechanism has been used for all the existing models. the supported p system models in p-lingua 2.0 are transition p systems, symport/antiport p systems, p systems with active membranes with membrane division or membrane creation rules, probabilistic p systems and stochastic p systems. more details on those models can be found in [13], except for stochastic p systems, which are described in [10].

3 simulators

in p-lingua 1.0, only one simulator was supported, since there was only one p system model definition. however, as new models have been included, new simulators have been developed, providing at least one simulator for each supported model. all simulators in p-lingua 2.0 can step backwards (as can the simulator in p-lingua 1.0), but this option should be set before the simulation starts.
p-lingua 2.0 also takes into account the existence of different simulation algorithms for the same model and provides a means for selecting a simulator among the ones which are suitable for the p system, by checking its model. so far, only the stochastic p system model provides several simulation algorithms to choose from, but the plugin-oriented architecture of the plinguacore java library makes it easy to encode new simulators.

3.1 simulators for stochastic p systems

in the original definitions, p systems evolve in a non-deterministic and maximally parallel manner (that is, all the objects in every membrane that can evolve by a rule must do so [10]). when trying to simulate biological phenomena, such as living cells, the classical non-deterministic and maximally parallel approach is no longer valid. first, biochemical reactions, which are modelled by rules, occur at a specific rate (determined by the propensity of the rule); therefore they cannot be selected in an arbitrary and non-deterministic way. second, in the classical approach all time steps are equal, and this does not represent the time evolution of a real cell system. the strategies that replace the original approach are based on gillespie’s theory of stochastic kinetics [6]. a constant c is associated with each rule, which provides p systems with a stochastic extension. the constant c depends on the physical properties of the molecules involved in the reaction modelled by the rule and on other physical parameters of the system. it represents the probability per time unit at which the reaction takes place, and it is used to calculate the propensity of each rule, which determines the probability and the time needed to apply the rule. two different algorithms based on the principles stated above have been implemented and integrated in plinguacore.
multicompartmental gillespie algorithm

the gillespie algorithm [6], or ssa (stochastic simulation algorithm), was developed for a single, well-mixed and fixed volume/compartment. p systems generally contain several compartments or membranes. for that reason, an adaptation of this algorithm was presented in [10]; it can be applied in the different regions defined by the compartmentalised structure of a p system model. the next rule to be applied in each compartment, and the waiting time for this application, are computed using a local gillespie algorithm. the multicompartmental gillespie algorithm can be broadly summarized as follows. repeat until a prefixed simulation time is reached:

1. calculate, for each membrane i, 1 ≤ i ≤ m, and for each rule r_j ∈ r_{l_i}, the propensity a_j, by multiplying the stochastic constant c_j^{l_i} associated with r_j by the number of distinct possible combinations of the objects and substrings present on the left-hand side of the rule with respect to the current contents of the membranes involved in the rule.

2. compute the sum of all propensities, a = ∑_{i=1}^{m} ∑_{r_j ∈ r_{l_i}} a_j.

3. generate two random numbers r_1 and r_2 from the uniform distribution on the unit interval and select τ_i and j_i according to τ_i = (1/a) ln(1/r_1), and j_i = the smallest integer satisfying ∑_{j=1}^{j_i} a_j > r_2 · a. in this way, we choose τ_i according to an exponential distribution with parameter a.

4. the next rule to be applied is r_{j_i} and the waiting time for this rule is τ_i. as a result of the application of this rule, the state of one or two compartments may change and has to be updated.

the multicompartmental next reaction method

the gillespie algorithm is an exact numerical simulation method appropriate for systems with a small number of reactions, since it takes a time proportional to the number of reactions (i.e., the number of rules). an exact algorithm which is also efficient, the next reaction method, is presented in [5].
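the rule-selection step of the multicompartmental gillespie algorithm above can be sketched as follows. this is an illustrative python sketch, not the plinguacore implementation; the function name and the plain list of propensities are assumptions made for the example:

```python
import math
import random

def gillespie_step(propensities, rng=random.random):
    """one iteration of the gillespie selection for a single compartment:
    given the propensities a_j of the applicable rules, return the
    (0-based) index of the next rule to apply and the waiting time tau."""
    a = sum(propensities)
    r1, r2 = rng(), rng()
    tau = (1.0 / a) * math.log(1.0 / r1)   # exponential with parameter a
    threshold, acc = r2 * a, 0.0
    for j, aj in enumerate(propensities):  # smallest j with a_1+...+a_j > r2*a
        acc += aj
        if acc > threshold:
            return j, tau
    return len(propensities) - 1, tau      # guard against rounding at the end
```

in the multicompartmental version, such a local selection would be performed for every membrane on the propensities of its own rules.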
the next reaction method uses only a single random number per simulation event (instead of two) and takes a time proportional to the logarithm of the number of reactions. we have adapted this algorithm to make it compartmental. the idea of this method is to be very careful in recalculating a_j and τ_i, recalculating them only if they change. in order to do that, a data structure called the dependency graph [5] is introduced. let r : u[v]_l → u'[v']_l be a given rule with stochastic constant c and propensity a_r, and let the parent membrane of l be labelled l'. we define the following sets:

• dependson(a_r) = {(b, t) | b is an object or string whose quantity affects the value of a_r; t = l if b ∈ v and t = l' if b ∈ u}. generally, dependson(a_r) = {(b, l) | b ∈ v} ∪ {(b, l') | b ∈ u}.

• affects(r) = {(b, t) | b is an object or string whose quantity is changed when rule r is executed; t = l if b ∈ v ∨ b ∈ v' and t = l' if b ∈ u ∨ b ∈ u'}. generally, affects(r) = {(b, l) | b ∈ v ∨ b ∈ v'} ∪ {(b, l') | b ∈ u ∨ b ∈ u'}.

definition 1. given a set of rules r = r_{l_1} ∪ ··· ∪ r_{l_m}, the dependency graph is the directed graph g = (v, e), with vertex set v = r and edge set e = {(v_i, v_j) | affects(v_i) ∩ dependson(a_{v_j}) ≠ ∅}.

in this way, if there exists an edge (v_i, v_j) ∈ e and v_i is executed, then, as some objects affected by this execution are involved in the calculation of a_{v_j}, this propensity has to be recalculated. the dependency graph depends only on the rules of the system and is static, so it is built only once. the times τ_i, which represent the waiting time for each rule to be applied, are stored in an indexed priority queue.
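the dependency graph of definition 1, together with the waiting-time reuse performed by the next reaction method when a propensity changes, can be sketched in python as follows; the (object, membrane-label) pair representation and the function names are illustrative assumptions, not the plinguacore api:

```python
def build_dependency_graph(affects, depends_on):
    """edge (i, j) whenever executing rule i changes the quantity of some
    (object, membrane) pair that the propensity of rule j depends on.
    both arguments map a rule id to a set of (object, membrane) pairs."""
    rules = list(affects)
    return {i: [j for j in rules if affects[i] & depends_on[j]] for i in rules}

def rescale_tau(tau_alpha, tau_mu, a_old, a_new):
    """reuse the waiting time of a non-fired rule alpha after its propensity
    changed from a_old to a_new: a_old * (tau_alpha - tau_mu) / a_new + tau_mu."""
    return a_old * (tau_alpha - tau_mu) / a_new + tau_mu
```

a rule that shares no object with another rule's left-hand side gets no edge towards it, so its waiting time is never touched when the other rule fires.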
the indexed priority queue, discussed in detail in [5], has nice properties: finding the minimum element takes constant time; the number of nodes is the number of rules |r|; because of the indexing scheme it is possible to find any arbitrary reaction in constant time; and the operation of updating a node (needed only when τ_i changes, which we can detect using the dependency graph) takes log |r| operations. the multicompartmental next reaction method can be broadly summarized as follows:

1. build the dependency graph, calculate the propensity a_r for every rule r ∈ r and generate τ_r for every rule according to an exponential distribution with parameter a_r. all the values τ_r are stored in a priority queue. set t ← 0 (this is the global time of the system).

2. get the minimum τ_µ from the priority queue and set t ← t + τ_µ. execute the rule r_µ (this is the next rule scheduled to be executed, because its waiting time is least).

3. for each edge (µ, α) in the dependency graph, recalculate and update the propensity a_α and:
• if α ≠ µ, set τ_α ← a_{α,old} (τ_α − τ_µ) / a_{α,new} + τ_µ;
• if α = µ, generate a random number r according to an exponential distribution with parameter a_µ and set τ_µ ← τ_µ + r.
then update the node in the indexed priority queue that holds τ_α.

4. go to 2 and repeat until a prefixed simulation time is reached.

the multicompartmental gillespie algorithm and the multicompartmental next reaction method are the core of the direct stochastic simulator and of the efficient stochastic simulator, respectively. one of them, chosen at runtime, is executed when compiling and simulating a p-lingua file that starts with @model.

3.2 a simulator for probabilistic p systems

next, we describe how the simulator for probabilistic p systems implements the application of the rules to a given configuration.
(a) rules are classified into sets so that all the rules belonging to the same set have the same left-hand side.

(b) let {r_1, . . . , r_t} be one of these sets of rules. let us suppose that the common left-hand side is u[v]^α_i and that their respective probabilistic constants are c_{r_1}, . . . , c_{r_t}. in order to determine how these rules are applied to a given configuration, we proceed as follows:

– one computes the greatest number n such that u^n appears in the parent membrane of i and v^n appears in membrane i.
– n random numbers x such that 0 ≤ x < 1 are generated.
– for each k (1 ≤ k ≤ t), let n_k be the amount of generated numbers belonging to the interval [∑_{j=0}^{k−1} c_{r_j}, ∑_{j=0}^{k} c_{r_j}) (assuming that c_{r_0} = 0).
– for each k (1 ≤ k ≤ t), rule r_k is applied n_k times.

4 file formats to define p systems

together with models and simulators, new formats have been included in p-lingua 2.0. p-lingua 1.0 provided a programming language to define p systems and an xml file format [3]. both have been upgraded to allow representations of p systems with a cell-like structure. backwards compatibility is preserved, so any file which defines a p system using p-lingua 1.0 is also recognized by the p-lingua 2.0 tools. a detailed description of the syntax of the p-lingua programming language, including the new extensions added in order to support the new models, can be found in [4]. a new format, the binary format, has been included as well; its purpose is to use less disk space than the xml format. at this point, the concepts of input format and output format should be introduced. an input format is a file format such that a p system specified in a file following it can be processed by the plinguacore java library. an output format is a file format such that a file following it, specifying a p system, can be generated by the library. these concepts are similar to the concepts of source code and object code [3].
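looking back at the rule-application procedure for probabilistic p systems (section 3.2), the counting step can be sketched as follows. this is an illustrative python sketch, not plinguacore code; the function name and the list-of-constants representation are assumptions, and the constants of one block are supposed to sum to 1:

```python
import random

def apply_probabilistic_block(constants, n_applications, rng=random.random):
    """distribute n_applications among the rules of one block (rules sharing
    the same left-hand side): draw one random number in [0, 1) per application
    and count how many fall into each rule's subinterval
    [c_1 + ... + c_{k-1}, c_1 + ... + c_k)."""
    bounds, acc = [], 0.0
    for c in constants:                  # cumulative upper bounds
        acc += c
        bounds.append(acc)
    counts = [0] * len(constants)
    for _ in range(n_applications):
        x = rng()
        for k, b in enumerate(bounds):
            if x < b:
                counts[k] += 1
                break
    return counts                        # counts[k] = times rule k is applied
```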
for the p-lingua 2.0 framework, the p-lingua programming language is an input format, the binary format is an output format and xml is both an input and an output format. this means that p-lingua programs can be processed by plinguacore, binary files can be generated by plinguacore, and xml files can be both processed and generated by the library.

5 command-line tools

p-lingua 1.0 provided command-line tools for simulating p systems and compiling files which specify p systems [3]. in p-lingua 2.0, the general syntax of the command-line tools has changed but, as backwards compatibility is provided, all valid actions in p-lingua 1.0 are still valid in p-lingua 2.0 as well.

5.1 the compilation command-line tool

the general syntax of the command-line tool for compiling input files is defined as follows:

plingua [-input_format] input_file [-output_format] output_file [-v verbosity_level] [-h]

the command header plingua requests the system to compile the p system specified in one file into another file; input_file contains the program that we want to compile, and output_file is the name of the file that is generated [3]. optional arguments are in square brackets:

• the option -input_format defines the format followed by input_file, which should be an input format. at this stage, valid input formats are p-lingua and xml. if no input format is set, the p-lingua format is assumed.
• the option -output_format defines the format followed by output_file, which should be an output format. at this stage, valid output formats are xml and bin. if no output format is set, the xml format is assumed by default.
• the option -v verbosity_level is a number between 0 and 5 indicating the level of detail of the messages shown during the compilation process [3].
• the option -h displays some help information [3].
5.2 the simulation command-line tool

simulations are launched from the command line as follows:

plingua_sim input_file -o output_file [-v verbosity_level] [-h] [-to timeout] [-st steps] [-mode simulatorid] [-a] [-b]

the command header plingua_sim requests the system to simulate the p system specified in a file; input_file is an xml document in which a p system is formatted, and output_file is the name of the file where the report about the simulated computation will be saved [3]. optional arguments are in square brackets:

• the option -v verbosity_level is a number between 0 and 5 indicating the level of detail of the messages shown during the simulation process [3]. if no value is specified, it is 3 by default.
• the option -h displays some help information [3].
• the option -to sets a timeout for the simulation, defined in timeout (in milliseconds), so that when the timeout has elapsed the simulation is halted. if the simulation reaches a halting configuration before the timeout has elapsed, this option has no effect.
• the option -st sets a maximum number of steps the simulation can take, defined in steps, so that when the specified number of steps has been taken the simulation comes to a halt. if the simulation reaches a halting configuration, or the timeout elapses (in case the option -to is set), before the specified number of steps have been taken, this option has no effect.
• the option -mode sets the specific simulator used to simulate the p system, defined in simulatorid. this option reports an error in case the simulator defined by simulatorid is not a valid simulator for the p system model.
• the option -a defines whether the simulation can take alternative steps. this option reports an error if the simulator does not support alternative steps.
• the option -b defines whether the simulation can step backwards.
as every simulator supports stepping backwards, this option does not report errors.

6 the plinguacore java library

plinguacore© is a java library which performs all the functions supported by p-lingua 2.0, that is, model definitions, simulators and formats. this library reads the rules and membrane structure from a file in which a p system is defined, detects errors in the file and reports them; if the p system is defined in the p-lingua programming language, it also locates the errors in the file. the library performs simulations by using the implemented simulators, taking into account all the options defined. it reports on the simulation process by displaying the current configuration as text and reporting the elapsed time. finally, this library translates files that define a p system between formats, for instance from the p-lingua language format to the binary format. this library is free software published under the lgpl license [12], so everyone who is interested can upgrade, change and distribute it, respecting the license restrictions.

7 a tool for simulating ecosystems based on p-lingua

the bearded vulture (gypaetus barbatus) is an endangered species in europe that feeds almost exclusively on bone remains of wild and domestic ungulates. a first model of an ecosystem related to the bearded vulture in the pyrenees (ne spain) is presented in [1], using probabilistic p systems in which the inherent stochasticity and uncertainty of ecosystems are captured by probabilistic strategies. in order to validate the designed p system experimentally, the authors have developed a simulator that allows them to analyze the evolution of the ecosystem under different initial conditions. that software application is focused on a particular p system, specifically the initial model of the ecosystem presented in [1]. with the aim of improving the model, the authors are adding ingredients to it, such as new species and more complex behaviour for the animals.
the improved model, together with the results of virtual experiments made with this software application, is exhaustively described in [2]. a new gpl-licensed [11] java application with a friendly user interface, built on top of the plinguacore java library, has been developed. this application provides a flexible way to check, validate and improve computational models of ecosystems based on p systems, instead of designing new software tools each time new ingredients are added to the models. furthermore, it is possible to change the initial parameters of the modelled ecosystem in order to carry out the virtual experiments suggested by experts. these experiments will provide results that can be interpreted in terms of hypotheses. finally, some of these hypotheses will be selected by the experts in order to be checked in real experiments.

8 conclusions and future work

creating a programming language to specify p systems is an important task in order to facilitate the development of software applications for membrane computing. in [3], p-lingua was presented as a programming language to define active membrane p systems with division rules. the present paper extends that language to other models: transition p systems, symport/antiport p systems, active membrane p systems with division or creation rules, probabilistic p systems and stochastic p systems. we have developed a java library which recognizes the models, implements several simulators for each model and defines different formats to codify p systems, such as the p-lingua format or a new binary format. this library can be extended to define new models, simulators and formats. it is possible to select different algorithms to simulate a p system; for example, there are two different algorithms for stochastic p systems. the library can be used inside other software applications; in this sense, we present a tool for virtual experimentation on ecosystems.
an internet website [14], still under construction, will be available to download the applications, libraries, source code and technical reports, as well as to provide information about the progress of the p-lingua project. in addition, this site aims to be a meeting point for users and developers through the use of web tools such as forums. the syntax of the p-lingua programming language is sufficiently standard for specifying different models of cell-like p systems. however, a new version of the language is necessary in order to specify tissue-like p systems; this will be the aim of future work. although p-lingua 2.0 provides a way to simulate and compile p systems, command-line tools are usually not user-friendly, that is, they are not easy and intuitive to use. for this purpose, p-lingua 1.0 provided an integrated development environment (ide) [3], which eased the way people could use p-lingua 1.0. for p-lingua 2.0, a new ide, called plinguaplugin, is being developed. this application is integrated into the eclipse platform [13], so it makes the most of eclipse’s capabilities to provide a framework for translating, developing and testing p systems. it aims to be user-friendly and useful for p system researchers.

8.1 acknowledgements

the authors acknowledge the valuable assistance given by mario j. pérez-jiménez, whose vast experience and human quality were essential for us in taking our first steps in scientific research. the authors also acknowledge the support of the project tin2006–13425 of the ministerio de educación y ciencia of spain, cofinanced by feder funds, and the support of the project of excellence with investigador de reconocida valía of the junta de andalucía, grant p08-tic-04200.

bibliography

[1] m. cardona, m.a. colomer, m.j. pérez–jiménez, d. sanuy, a. margalida. modeling ecosystems using p systems: the bearded vulture, a case study, lecture notes in computer science, 5391, 137–156, 2009.
[2] m. cardona, m.a. colomer, a. margalida, i.
pérez–hurtado, m.j. pérez–jiménez, d. sanuy. p system based model of an ecosystem of the scavenger birds, proceedings of the 7th brainstorming week on membrane computing, vol. i, 65–80, in press.
[3] d. díaz–pernil, i. pérez–hurtado, m.j. pérez–jiménez, a. riscos–núñez. a p-lingua programming environment for membrane computing, proceedings of the 9th workshop on membrane computing, 155–172, 2008.
[4] m. garcía–quismondo, r. gutiérrez–escudero, i. pérez–hurtado, m.j. pérez–jiménez. p-lingua 2.0: new features and first applications, proceedings of the 7th brainstorming week on membrane computing, vol. i, 141–168, in press.
[5] m.a. gibson, j. bruck. efficient exact stochastic simulation of chemical systems with many species and many channels, j. phys. chem., 104, 1876–1889, 2000.
[6] d.t. gillespie. exact stochastic simulation of coupled chemical reactions, j. phys. chem., 81, 2340–2361, 1977.
[7] gh. păun. membrane computing. an introduction, springer–verlag, berlin, 2002.
[8] gh. păun. computing with membranes, journal of computer and system sciences, 61(1), 108–143, 2000.
[9] gh. păun. p systems with active membranes, journal of automata, languages and combinatorics, 6(1), 75–90, 2001.
[10] f.j. romero–campero. p systems, a computational modelling framework for systems biology, doctoral thesis, university of seville, department of computer science and artificial intelligence, 2008.
[11] the gnu general public license: http://www.gnu.org/copyleft/gpl.html
[12] the gnu lesser general public license: http://www.gnu.org/copyleft/lgpl.html
[13] the eclipse project: http://www.eclipse.org
[14] the p-lingua website: http://www.p-lingua.org

manuel garcía-quismondo fernández was born on june 11, 1985. he got his degree in ingeniería técnica en informática de sistemas from the university of sevilla in june 2007.
currently, he is about to get another degree, this time in ingeniería en informática, at the same university. since september 2008, he has held a student grant at the department of computer science and artificial intelligence. he has developed a software application called plinguaplugin and co-developed another one called plinguacore, directed by agustín riscos-núñez and ignacio pérez-hurtado.

rosa gutiérrez-escudero was born on august 16, 1984. she received her degree in computer science in june 2008 from the university of sevilla. since september 2008, she has been a phd student at the department of computer science and artificial intelligence of the university of sevilla (spain). she is also a member of the research group on natural computing at the same university. her main research interests within the membrane computing area are computer simulation and complexity theory.

miguel a. martínez-del-amor was born on july 10, 1984. he received his degree in computer science from the university of murcia (spain) in june 2008. currently, he is a phd student at the department of computer science and artificial intelligence at the university of sevilla (spain). he is also a member of the research group on natural computing at the same university, and his main research interest is to join membrane computing and high-performance computing by means of efficient computer simulations.

enrique orejuela-pinedo was born on june 7, 1979. he received his degree in biology in 2005 from the university of sevilla (spain). he has cooperated as an internal student at the department of ecology and plant biology at the university of sevilla. currently, he is a phd student at the department of computer science and artificial intelligence at the university of sevilla. he is also a member of the research group on natural computing at the same university, and his main research interests are natural computing and membrane computing, especially computational models of ecosystems.
ignacio pérez-hurtado was born on september 21, 1977. he received his degree in computer science in october 2003. he was a systems analyst in a company for three years. since september 2006, he has been a phd student at the department of computer science and artificial intelligence at the university of sevilla (spain). he is an associate professor at the same department. he is also a member of the research group on natural computing at the said university, and his main research interests within membrane computing are computer simulation, models for biological processes and complexity theory.

international journal of computers, communications & control vol. i (2006), no. 1, pp. 81-99

grigore c. moisil (1906–1973) and his school in algebraic logic

george georgescu, afrodita iorgulescu, sergiu rudeanu

abstract: we present in this paper a very concise but updated survey emphasizing the research done by gr. c. moisil and his school in algebraic logic.
keywords: n-valued łukasiewicz-moisil algebra, θ-valued łukasiewicz-moisil algebra, post algebra

mathematical logic is one of the domains in which the creative spirit of gr. c. moisil manifested itself fully. his work in logic stands out through the novelty, the variety and the depth of the subjects treated. his first works are connected to the top results of the time and bear an algebraic imprint. the young professor from jassy came to logic after a rich experience in mechanics and differential equations. van der waerden's treatise on algebra decisively influenced his entry into logic through the algebraic gate. at the same time, these works have a powerful philosophical imprint. from this vast creation, the contributions to multiple-valued logics represent the part with the most intense impact on today's research. the first system of multiple-valued logic was introduced by j. łukasiewicz in 1920. independently, e. post introduced in 1921 a different multiple-valued logic.
for łukasiewicz, the motivation was of a philosophical nature: he was looking for an interpretation of the concepts of possibility and necessity, while for post, the research was intended as a natural mathematical generalization of bivalent logic. in 1930, łukasiewicz and tarski studied a logic whose truth values are the real numbers from the interval [0, 1].

1 łukasiewicz-moisil algebras

in 1940, gr. c. moisil defined the 3-valued and the 4-valued łukasiewicz algebras and, in 1942, the n-valued łukasiewicz algebras (n ≥ 2). his goal was to algebrize łukasiewicz’s logic. boolean algebras, the algebraic models of classical logic, are particular cases of these new structures. in the description of a logical system, the implication was traditionally the principal connector. the n-valent system of łukasiewicz had as truth values the elements of the set l_n = {0, 1/(n−1), 2/(n−1), . . . , (n−2)/(n−1), 1} and was built around a new concept of implication, on which the definitions of the other connectors are based. for moisil, the basic structure is that of a lattice, to which he adds a negation (obtaining the so-called "de morgan algebra") and also some unary operations (called by moisil "chryssipian endomorphisms"), representing the "nuancing". the łukasiewicz implication was left on a secondary plane and, in the case of an arbitrary valence, was completely lost. further axiomatizations were suggested by a. monteiro, r. cignoli, c. sicoe, s. rudeanu and others. an example of a. rose from 1956 established that for n ≥ 5 the łukasiewicz implication can no longer be defined on a łukasiewicz algebra. consequently, only for n = 3 and n = 4 are the structures introduced by moisil models for łukasiewicz logic. the loss of implication has led to another type of logic, called today "moisil logic", distinct from the łukasiewicz system; the logic corresponding to n-valued łukasiewicz-moisil algebras was created by moisil in 1964. the fundamental concept of moisil logic is the nuancing.
Nowadays we feel it appropriate to call these algebras Łukasiewicz-Moisil algebras, or LM algebras for short. For complete information and references on Łukasiewicz-Moisil algebras see [25].

The work of Moisil on LM algebras covers two periods of time: a first period, during 1940-1942, when he introduced the n-valued LM algebras with negation and studied special classes of these structures, such as centered and axed LM algebras; and a second one, during 1954-1973, when he introduced the θ-valued LM algebras without negation, applied multiple-valued logics to switching theory and studied algebraic properties of LM algebras (representation, ideals, residuation).

Moisil's works traced research directions for many Romanian and foreign mathematicians. In Argentina, at Bahia Blanca, Antonio Monteiro and his school (Roberto Cignoli, Luiz Monteiro, Luiza Iturrioz, M. Abad etc.) contributed decisively to consolidating LM algebras as a domain of the algebra of logic and to disseminating them in the mathematical world. In his PhD thesis from 1969 [29], R. Cignoli makes a very deep study of n-valued Moisil algebras (the name he first gave to the n-valued Łukasiewicz algebras introduced by Moisil).

1.1 n-valued Łukasiewicz-Moisil algebras

The structure called "De Morgan algebra" was first studied by Moisil; the name was given by Antonio Monteiro [142]; a duplicate name is "quasi-Boolean algebra", given by A. Bialynicki-Birula and H. Rasiowa.

Definition 1.1. A De Morgan algebra is a structure $(A, \vee, \wedge, {}^-, 0, 1)$ such that $(A, \vee, \wedge, 0, 1)$ is a distributive lattice with 0 and 1 and the unary operation $^-$, called negation, verifies:
(DM0) $1 = 0^-$,
(DM1) $(x^-)^- = x$,
(DM2) $(x \wedge y)^- = x^- \vee y^-$.

Remark 1.2. In a De Morgan algebra we also have:
(DM3) $(x \vee y)^- = x^- \wedge y^-$.

Definition 1.3. Let $J = \{1, 2, \ldots, n-1\}$.
An n-valued Łukasiewicz-Moisil algebra (n ≥ 2), or LM_n algebra for short, is an algebra $A = (A, \vee, \wedge, {}^-, (r_j)_{j \in J}, 0, 1)$ of type $(2, 2, 1, (1)_{j \in J}, 0, 0)$ such that:
(i) $(A, \vee, \wedge, {}^-, 0, 1)$ is a De Morgan algebra;
(ii) the unary operations $r_1, r_2, \ldots, r_{n-1}$ fulfil the following axioms: for every $x, y \in A$ and every $i, j \in J$,
(L1) $r_j(x \vee y) = r_j x \vee r_j y$,
(L2) $r_j x \vee (r_j x)^- = 1$,
(L3) $r_j \circ r_i = r_i$,
(L4) $r_j(x^-) = (r_{n-j} x)^-$,
(L5) $r_1 x \le r_2 x \le \cdots \le r_{n-1} x$,
(L6) if $r_j x = r_j y$ for every $j \in J$, then $x = y$ (the determination principle).
If $A$ fulfils (i) and only (L1)-(L5), we shall say that $A$ is an LM_n pre-algebra.

Proposition 1.4. In every LM_n algebra $A$ the following properties are verified: for every $x, y \in A$ and every $j \in J$,
(L7) $r_j(x \wedge y) = r_j x \wedge r_j y$;
(L8) $r_j x \wedge (r_j x)^- = 0$;
(L9) $x \le y$ if and only if $r_j x \le r_j y$ for every $j \in J$;
(L10) $r_1 x \le x \le r_{n-1} x$;
(L11) $r_j 0 = 0$, $r_j 1 = 1$;
(L12) let $C(A)$ be the set of complemented elements of $A$, i.e. $C(A) = \{x \in A \mid \exists x' \in A,\ x \vee x' = 1,\ x \wedge x' = 0\}$, and let $K_j$ be the set of all elements of $A$ left invariant by $r_j$, $j \in J$, i.e. $K_j = \{x \in A \mid r_j x = x\}$; then: (i) $r_j x \in C(A)$ for every $j \in J$, $x \in A$, and (ii) $C(A) = K_j$ for every $j \in J$;
(L12') $(C(A), \vee, \wedge, {}^-, 0, 1)$ is a Boolean algebra, where $x^- = x'$;
(L12'') if $z \in C(A)$, then for every $x \in A$: $x \wedge z = 0 \iff x \le z^-$ and $z \vee x = 1 \iff z^- \le x$;
(L13) $x^- \vee r_{n-1} x = 1$;
(L14) $x \wedge (r_{n-1} x)^- = 0$.

Example 1.5. The algebra $L_n = L_n^{(LM_n)} = (L_n, \vee, \wedge, {}^-, (r_j)_{j \in J}, 0, 1)$, where $L_n = \{0, \frac{1}{n-1}, \frac{2}{n-1}, \ldots, \frac{n-2}{n-1}, 1\}$ and
$x \vee y = \max(x, y)$, $x \wedge y = \min(x, y)$, $x^- = 1 - x$,
$r_j\left(\frac{i}{n-1}\right) = 0$ if $j + i < n$ and $1$ if $j + i \ge n$, for $i \in \{0\} \cup J$, $j \in J$,
is an LM_n algebra, which we shall call the canonical LM_n algebra. The proper subalgebras of $L_n$ have the form $S = L_n - \bigcup \{x, x^-\}$, the union being taken over a set of pairs $\{x, x^-\}$ with $x \in L_n - \{0, 1\}$. They are LM_n algebras. The smallest subalgebra of $L_n$ (with respect to ⊆) is $C(L_n) = \{0, 1\}$, which is also a Boolean algebra, cf. (L12').
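The canonical LM_n algebra of Example 1.5 is small enough to check by machine. A minimal sketch (the names are ours), building $L_4$ and verifying axioms (L1)-(L6), including the determination principle:

```python
from fractions import Fraction
from itertools import product

n = 4
Ln = [Fraction(i, n - 1) for i in range(n)]   # canonical carrier set
J = range(1, n)

neg = lambda x: 1 - x

def r(j, x):
    """Moisil operator of the canonical LM_n algebra:
    r_j(i/(n-1)) = 0 if j + i < n, else 1."""
    i = x * (n - 1)
    return Fraction(0) if j + i < n else Fraction(1)

for j, i_ in product(J, J):
    for x, y in product(Ln, Ln):
        assert r(j, max(x, y)) == max(r(j, x), r(j, y))          # (L1)
        assert max(r(j, x), neg(r(j, x))) == 1                   # (L2)
    for x in Ln:
        assert r(j, r(i_, x)) == r(i_, x)                        # (L3)
        assert r(j, neg(x)) == neg(r(n - j, x))                  # (L4)
for x in Ln:
    assert all(r(j, x) <= r(j + 1, x) for j in range(1, n - 1))  # (L5)
for x, y in product(Ln, Ln):                                     # (L6)
    if all(r(j, x) == r(j, y) for j in J):
        assert x == y
```

The same loop passes unchanged for other small values of `n`, which is a convenient way to get a feel for how the $r_j$ cut $L_n$ into Boolean levels.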
For instance, the subalgebras of $L_3$ are $L_2$ and $L_3$; those of $L_4$ are $L_2$ and $L_4$; and those of $L_5$ are $L_2$, $L_3$, $\{0, 1/4, 3/4, 1\}$ and $L_5$.

Remark 1.6. LM_2 algebras coincide with Boolean algebras.

Proposition 1.7. In every LM_n pre-algebra, the determination principle (L6) is equivalent to each of the following conditions: for every $x, y \in L$,
(a) $x \wedge (r_j x)^- \wedge r_{j+1} y \le y$, for every $j \in J - \{n-1\}$;
(b) $x \wedge \bigwedge_{j=1}^{n-1} ((r_j x)^- \vee r_j y) \le y$.

Theorem (representation theorem of Moisil). Every LM_n algebra can be embedded in a direct product of copies of the canonical LM_n algebra $L_n$.

Corollary 1.1. Every LM_n algebra is a subdirect product of subalgebras of the canonical LM_n algebra $L_n$.

* * *

In 1968, Gr. C. Moisil introduced the θ-valued Łukasiewicz algebras, or LM_θ algebras for short (without negation), where θ is the order type of a chain with first and last element. The concept of θ-valued Łukasiewicz algebra is obtained from that of the n-valued one, on the one hand, by dropping the negation $^-$ and, on the other hand, by replacing the set $L_n$ by a totally ordered set $I$ with first and last elements and by adapting the axioms to this case; the determination principle is preserved. These structures were conceived by Moisil as models of a logic with an infinity of nuances. According to a confession made by Moisil, he had imagined LM_θ algebras (without negation) a long time before, but the care of finding a strong motivation for them delayed the announcement; the motivation was found when Moisil met Zadeh's fuzzy set theory, in which he saw a confirmation of his old ideas. In 1969, Marek and Traczyk [110] introduced the notion of generalized Łukasiewicz algebra (with negation) in an attempt to generalize the LM_n algebras to the infinite case, but their generalization is not a natural one. In his PhD thesis from 1972 [64], G.
Georgescu studied duality theory for Moisil's LM_θ algebras (without negation), the injective objects (and their characterization), and monadic and polyadic algebras. In his PhD thesis from 1981 [53], A. Filipoiu studied the LM_θ algebras (without negation) and their associated logic; he gives a representation theorem for LM_θ algebras by means of θ-valued Moisil fields. In his master's thesis, also from 1981 [12], L. Beznea studied a generalization of LM_θ algebras (without negation) obtained by eliminating the determination principle. Later on, in his PhD thesis from 1984 [21], V. Boicescu introduced and studied the n-valued LM algebras without negation, as a particular case of Moisil's LM_θ algebras (without negation). Following the inverse way, A. Iorgulescu, in her PhD thesis also from 1984 [90], introduced and studied a natural generalization of Moisil's LM_n algebras to the infinite case, called θ-valued LM algebras with negation, or LM_θ algebras with negation for short; any LM_θ algebra with negation is a Moisil LM_θ algebra without negation.

2 Connection with logic

Gr. C. Moisil invented LM algebras in order to create an algebraic structure playing the same role with respect to multiple-valued logic as Boolean algebras play with respect to classical, bivalent logic. However, as shown by the example of A. Rose, this only happens for the cases n = 3 and n = 4. The algebraic structures adequate to the infinite-valued logic of Łukasiewicz (truth-valued in the real interval [0, 1]) are the MV algebras introduced by C. C. Chang in 1958 or, equivalently, the Wajsberg algebras introduced by Font, Rodriguez and Torrens in 1984; D. Mundici proved in 1986 that MV algebras are categorically equivalent to lattice-ordered abelian groups with strong unit. R. Grigolia's MV_n algebras, introduced in 1977, and Cignoli's proper Łukasiewicz algebras, introduced in 1982, are algebraic structures corresponding to the n-valued logic of Łukasiewicz.
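Chang's MV structure on [0, 1] can be spot-checked by machine. A small sketch (names ours; a finite grid of rationals stands in for the full interval), using the truncated sum $x \oplus y = \min(1, x + y)$ and negation $x^- = 1 - x$:

```python
from fractions import Fraction

# Standard MV algebra on [0, 1]: truncated sum and involutive negation.
plus = lambda x, y: min(1, x + y)
neg = lambda x: 1 - x

# Check the characteristic MV identities on a grid of rational points.
pts = [Fraction(i, 12) for i in range(13)]
for x in pts:
    assert plus(x, neg(0)) == neg(0)        # x + 1 = 1
    assert neg(neg(x)) == x                 # involution
    for y in pts:
        lhs = plus(neg(plus(neg(x), y)), y)
        rhs = plus(neg(plus(neg(y), x)), x)
        assert lhs == rhs == max(x, y)      # (MV4): both sides compute max(x, y)
```

The last assertion makes visible why (MV4) is the key axiom: it forces $(x^- \oplus y)^- \oplus y$ to behave as the lattice join, so the order structure is definable from $\oplus$ and $^-$ alone.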
The logic corresponding to LM_n algebras was created by Moisil himself in 1964. Łukasiewicz logic has implication as its primary connector, while Moisil logic is based on the idea of nuance, expressed algebraically by the Chrysippian endomorphisms. The "engine" of the latter logic is Moisil's determination principle, according to which an n-valued sentence is determined by its Boolean nuances. The determination principle realizes a transfer from multiple-valued logic to classical logic. This determination brings Moisil logic much closer to classical logic than Łukasiewicz logic. One could say that Moisil logic is derived from classical logic by the idea of nuancing. Algebraically, this tight relationship is expressed by the fundamental adjunction between the categories of Boolean and Łukasiewicz algebras. V. Boicescu in 1971 and A. Filipoiu in 1981 introduced and studied logics appropriate to LM_θ algebras without negation (i.e. infinite-valued LM algebras). A. Filipoiu generalized Smullyan's method of analytic tableaux to θ-valued logic without negation and studied the θ-valued predicate calculus as well, with applications to systems of recording and retrieval of information. Łukasiewicz logic, Post logic and Moisil logic constitute the three directions in the classical theory of multiple-valued logic. Their corresponding algebraic models are MV algebras, Post algebras and LM algebras.

3 Connections with other structures of algebraic logic

Moisil introduced in 1941 the centered LM_3 algebras. Post algebras (cf. P. Rosenbloom (1942), G. Epstein (1960), T. Traczyk (1963)) turn out to be centered LM algebras, cf. R. Cignoli (1969) and G. Georgescu and C. Vraciu (1969). The θ-valued Post algebras were studied by T. Traczyk (1967) and G. Georgescu (1971). Gr. C. Moisil, R. Cignoli, L. Iturrioz, A. Monteiro and V. Boicescu studied LM algebras as particular cases of Heyting algebras. V. Boicescu also studied LM algebras as Stone algebras.
LM_3 algebras and LM_4 algebras are polynomially equivalent to MV_3 algebras and MV_4 algebras, respectively, since they are the algebraic counterpart of the 3-valued and the 4-valued Łukasiewicz logic, respectively. D. Mundici was the first to point out the equivalence between LM_3 algebras and MV_3 algebras, in 1989. Then A. Iorgulescu, in 1998-2000 [91], [94], pointed out the isomorphism between the categories of LM_k algebras and of MV_k algebras, for k = 3, 4, and also studied the categories LM_n and MV_n for n ≥ 5, showing that every MV_n algebra can be made into an LM_n algebra. She then studied those LM_n algebras that can be viewed as MV_n algebras.

3.1 Connections between LM_n algebras and MV_n algebras

MV algebras were introduced by C. C. Chang in 1958 [26]. A simplified list of axioms for MV algebras was given by Mangani [109], as follows:

Definition 3.1. An MV algebra is an algebra $A = (A, \oplus, {}^-, 0)$ of type (2, 1, 0) in which the following axioms are verified: for every $x, y, z \in A$,
(MV1) $(A, \oplus, 0)$ is an abelian monoid,
(MV2) $x \oplus 0^- = 0^-$,
(MV3) $(x^-)^- = x$,
(MV4) $(x^- \oplus y)^- \oplus y = (y^- \oplus x)^- \oplus x$,
where $x \cdot y = (x^- \oplus y^-)^-$.

Definition 3.2. For any $m \in \mathbb{N}$, we define: (i) $0x = 0$ and $(m+1)x = mx \oplus x$; (ii) $x^0 = 1$ and $x^{m+1} = x^m \cdot x$.

The MV_n algebras were introduced by Revaz Grigolia in 1977 [87], as follows.

Definition 3.3. An MV_n algebra (n ≥ 2) is an MV algebra $A = (A, \oplus, {}^-, 0)$ whose operations fulfil the additional axioms:
(M1) $(n-1)x \oplus x = (n-1)x$,
(M1') $x^{n-1} \cdot x = x^{n-1}$,
and, if n ≥ 4, the axioms:
(M2) $[(jx) \cdot (x^- \oplus [(j-1)x]^-)]^{n-1} = 0$,
(M2') $(n-1)[x^j \oplus (x^- \cdot [x^{j-1}]^-)] = 1$,
where $1 < j < n-1$ and $j$ does not divide $n-1$.

Corollary 3.1. MV_2 algebras coincide with Boolean algebras.

Example 3.4. The MV algebra $L_n = L_n^{(MV_n)} = (L_n, \oplus, {}^-, 0)$, where $L_n = \{0, \frac{1}{n-1}, \frac{2}{n-1}, \ldots, \frac{n-2}{n-1}, 1\}$ and, for any $x, y \in L_n$: $x \oplus y = \min(1, x + y)$, $x \cdot y = \max(0, x + y - 1)$, $x^- = 1 - x$, and $x \vee y = \max(x, y)$, $x \wedge y = \min(x, y)$, is an MV_n algebra.
We shall call it the canonical MV_n algebra. Note that $B(L_n) = \{0, 1\}$. The subalgebras of $L_n$ are of the form $S_m = \{0, \frac{k}{n-1}, \ldots, \frac{(m-2)k}{n-1}, 1\}$, where $k = \frac{n-1}{m-1}$, if $m-1$ divides $n-1$. The subalgebras $S_m$ of $L_n$ are isomorphic to $L_m = \{0, \frac{1}{m-1}, \ldots, \frac{m-2}{m-1}, 1\}$, if $m-1$ divides $n-1$, and they are MV_n algebras. Hence $L_m = (L_m, \oplus, \cdot, {}^-, 0, 1)$ (m ≤ n) is an MV_n algebra if and only if $m-1$ divides $n-1$. For instance, the subalgebras of $L_3$ are $L_2$ and $L_3$; those of $L_4$ are $L_2$ and $L_4$; and those of $L_5$ are $L_2$, $L_3$ and $L_5$.

Theorem. Every MV_n algebra is a subdirect product of subalgebras of the canonical MV_n algebra $L_n$.

D. Mundici was the first to prove, in 1989 [152], that MV_3 algebras coincide with LM_3 algebras. A. Iorgulescu proved in 1998-2000 [91], [94] that:
1) MV_4 algebras coincide with LM_4 algebras;
2) the canonical MV_n algebra coincides with the canonical LM_n algebra (n ≥ 2);
3) for n ≥ 5, any MV_n algebra is an LM_n algebra;
4) those LM_n algebras which are MV_n algebras, for every n ≥ 5, are exactly Cignoli's proper n-valued Łukasiewicz algebras.
Here are, in short, results 1-3:

* * *

To obtain the transformation of an MV_n algebra into an LM_n algebra, for any n ≥ 3, Iorgulescu used Suchoń's transformation [174]. Suchoń defines the Moisil operators $(\sigma_j)_{j \in J}$ (with $\sigma_j = r_{n-j}$) of the canonical LM_n algebra (n ≥ 3) starting from the Łukasiewicz implication → and the negation $^-$. He puts

$b_3(x) = x^- \to x$ and $b_{j+1}(x) = x^- \to b_j(x)$, $j \ge 3$. (1)

Then he defines

$\sigma_1 x = b_n(x)$ (2)

and, for $1 < j \le [n/2]$,

$\sigma_j x = \sigma_{n-1}(b_{l+1}(x))$ if $lj \ge n-1$, and $\sigma_j x = \sigma_{lj}(b_{l+1}(x))$ if $lj < n-1$, (3)

where $l = \max\{m \mid m(j-1) < n-1\}$, while

$\sigma_{n-j}(x) = (\sigma_j(x^-))^-$, for $1 \le j \le [n/2]$. (4)

Suchoń's Moisil operators verify $\sigma_1 \ge \sigma_2 \ge \ldots \ge \sigma_{n-1}$.

Remark 3.5. If we want to use Suchoń's construction, it is convenient to consider not the MV algebra $(A, \oplus, {}^-, 0)$ but the Wajsberg algebra $(A, \to, {}^-, 1)$, introduced by J. M. Font, A. J. Rodriguez and A.
Torrens in 1984; MV algebras and Wajsberg algebras are isomorphic structures:
• If $A = (A, \to, {}^-, 1)$ is a Wajsberg algebra and we define $\alpha(A) = (A, \oplus, {}^-, 0)$ by $x \oplus y = x^- \to y$, $0 = 1^-$, (5) then $\alpha(A)$ is an MV algebra.
• Conversely, if $A = (A, \oplus, {}^-, 0)$ is an MV algebra and we define $\beta(A) = (A, \to, {}^-, 1)$ by $x \to y = x^- \oplus y$, $1 = 0^-$, (6) then $\beta(A)$ is a Wajsberg algebra.
• The maps α, β are mutually inverse.

It follows immediately from (1) that

$b_3(x) = x \oplus x = 2x$ and $b_{j+1}(x) = x \oplus b_j(x) = jx$, $j \ge 3$. (7)

By using Suchoń's construction, Iorgulescu then gave the following

Definition 3.6. Let $A = (A, \oplus, {}^-, 0)$ be an MV_n algebra (n ≥ 3). Define $\Phi_S(A) = (A, \vee, \wedge, {}^-, (r_j)_{j \in J}, 0, 1)$ by
$x \vee y = x \cdot y^- \oplus y$, $x \wedge y = (x^- \vee y^-)^-$, $r_{n-1} x = (n-1)x$, (8)
$r_{n-j} x = r_1(lx)$ if $lj \ge n-1$, and $r_{n-j} x = r_{n-lj}(lx)$ if $lj < n-1$, (9)
for $1 < j \le [n/2]$, $l = \max\{m \mid m(j-1) < n-1\}$,
$r_j x = (r_{n-j}(x^-))^-$, $1 \le j \le [n/2]$. (10)

Proposition 3.7. If $L_n$ is the canonical MV_n algebra (n ≥ 3), then $\Phi_S(L_n)$ is the canonical LM_n algebra.

Theorem. If $A$ is an MV_n algebra (n ≥ 3), then $\Phi_S(A)$ is an LM_n algebra.

* * *

Proposition 3.8. 1) Given the canonical LM_n algebra (n ≥ 3) $L_n = (L_n, \vee, \wedge, {}^-, (r_j)_{j \in J}, 0, 1)$, define $\Psi(L_n) = (L_n, \oplus_n, {}^-, 0)$ by:
if $n = 2k+1$,
$x \oplus_{2k+1} y = (x \vee r_{2k} y) \wedge (y \vee r_{2k} x) \wedge (x^* \vee r_{2k-1} y) \wedge (y^* \vee r_{2k-1} x) \wedge \cdots \wedge (x^{(k-1)*} \vee r_{k+1} y) \wedge (y^{(k-1)*} \vee r_{k+1} x)$, (11)
if $n = 2k$,
$x \oplus_{2k} y = (x \vee r_{2k-1} y) \wedge (y \vee r_{2k-1} x) \wedge (x^* \vee r_{2k-2} y) \wedge (y^* \vee r_{2k-2} x) \wedge \cdots \wedge (x^{(k-1)*} \vee r_k y) \wedge (y^{(k-1)*} \vee r_k x)$, (12)
where $x^*$ is the successor of $x$ and $x^{2*} = (x^*)^*$, $x^{m*} = (x^{(m-1)*})^*$. (13)
Then $\Psi(L_n)$ is the canonical MV_n algebra.
2) The maps $\Phi_S$ from Proposition 3.7 and $\Psi$ are mutually inverse.
Since for n = 3, in the canonical LM_3 algebra $L_3$, the operation ⊕ is
$x \oplus y = (x \vee r_2 y) \wedge (y \vee r_2 x)$,
and for n = 4, in the canonical LM_4 algebra $L_4$, the operation ⊕ is defined by
$x \oplus y = (x \vee r_3 y) \wedge (y \vee r_3 x) \wedge (x^* \vee r_2 y) \wedge (y^* \vee r_2 x) = (x \vee r_3 y) \wedge (y \vee r_3 x) \wedge (x^- \vee y^- \vee r_2 x \vee r_2 y)$,
it follows that the transformation $\Psi$ is not polynomial for n ≥ 5.
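For n = 3 the equivalence of the two presentations can be verified directly. A small sketch (names ours) checking that the lattice formula for ⊕ in the canonical LM_3 algebra, from Proposition 3.8, reproduces the truncated sum of the canonical MV_3 algebra:

```python
from fractions import Fraction
from itertools import product

L3 = [Fraction(0), Fraction(1, 2), Fraction(1)]

def r(j, x):
    """Moisil operators of the canonical LM_3 algebra (n = 3)."""
    i = x * 2
    return Fraction(0) if j + i < 3 else Fraction(1)

def plus_lm(x, y):
    # x (+) y = (x v r2(y)) ^ (y v r2(x)), Proposition 3.8 with n = 3
    return min(max(x, r(2, y)), max(y, r(2, x)))

def plus_mv(x, y):
    # truncated sum of the canonical MV_3 algebra
    return min(1, x + y)

for x, y in product(L3, L3):
    assert plus_lm(x, y) == plus_mv(x, y)
```

For n = 5 and beyond no such lattice polynomial exists, which is the content of the remark that Ψ is not polynomial for n ≥ 5.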
Those LM_n algebras which are MV_n algebras (i.e. for which the transformation $\Psi$ is defined), for every n ≥ 5, are exactly Cignoli's proper n-valued Łukasiewicz algebras [34], but the proof is very technical [94].

4 Representation theorems

Numerous representation theorems have been given for LM algebras. The first is due to Moisil himself and is reminiscent of the representation theorem for Boolean algebras: every LM_n algebra can be embedded into a Cartesian power of $L_n$. In a modern vision [25], every LM algebra is a subdirect product of subalgebras of the algebra $In(I, L_2)$ of increasing functions from $I$ to $L_2$. In particular, every LM_n algebra is a subdirect product of subalgebras of $L_n$ (Cignoli). $L$ is a direct product of subalgebras of $L_n$ if and only if it is complete and atomic (Boicescu 1984). The representation by continuous functions, studied by Cignoli, Boicescu and Filipoiu, means that for every LM_θ algebra without negation $L$ there is a unique Boolean space $X$ such that $L$ is isomorphic to the algebra of all continuous functions $f : X \to In(I, L_2)$, where $In(I, L_2)$ is endowed with the topology having as basis the principal ideals and the principal filters generated by the characteristic functions of the sets $\{k \mid k > \alpha\}$, $\alpha \in \theta$. The representation of LM_θ algebras without or with negation by Moisil fields of sets is due to Filipoiu. The Stone duality was extended from Boolean algebras to LM_θ algebras without or with negation with the aid of a suitable concept called LM_θ-valued Stone space (Cignoli, Georgescu, Iorgulescu), while the Priestley duality is based on a suitable adaptation of the concept of Priestley space (Filipoiu). The representation of LM algebras as algebras of fuzzy sets was studied by D. Ponasse, J. L. Coulon and J. Coulon, S. Ribeyre and S. Rudeanu. The representation of LM_n algebras by LM_3 algebras is legitimated by the "good" properties of the latter and was studied by A. Monteiro, L.
Monteiro, F. Coppola, V. Boicescu and A. Iorgulescu.

5 Categorial aspects

The Stone and Priestley dualities are in fact equivalences of categories. Other categorial properties of LM algebras have been studied; here are a few samples. The association of $L$ with the Boolean algebra $C(L)$ of complemented elements of $L$ extends to a functor $C : LM_\theta \to B$, while the association of a Boolean algebra $B$ with the algebra $In(I, B)$ extends to a functor $T : B \to LM_\theta$. Then $C$ and $T$ are adjoint functors, $C$ is faithful and $T$ is fully faithful. This yields in particular the representation theorem of Moisil. The construction of the functors $C$ and $T$ was given by Moisil himself. The injective and projective objects have also been studied; for instance, an LM_θ algebra is injective if and only if it is a complete Post algebra (whose center is a complete Boolean algebra), cf. L. Monteiro, R. Cignoli, G. Georgescu and C. Vraciu, V. Boicescu.

6 Ideals and congruences

The study of the appropriate ideal and congruence theory for LM algebras was undertaken by Gr. C. Moisil, A. Monteiro, R. Cignoli and C. Sicoe. V. Boicescu introduced the concepts of θ-ideal and θ-congruence, and the prime spectrum. For instance, in the case of LM_n algebras without negation, the congruence lattice of $L$ is a Boolean algebra (a Stone algebra) if and only if $L$ is finite ($C(L)$ is a complete Boolean algebra).

7 Monadic and polyadic algebras

L. Monteiro and G. Georgescu studied the generalization to LM algebras of the monadic and polyadic Boolean algebras introduced by P. R. Halmos. Sample results: the representation of monadic LM algebras by functional monadic LM algebras and the semantic completeness of polyadic LM algebras. A paper of G. Georgescu, A. Iorgulescu and I. Leuştean investigates monadic MV_n algebras and closed MV_n algebras.

8 Miscellanea

Various other topics have also been studied. Thus: V.
Boicescu proved that the lattice of equational subclasses of LM_n is a finite Heyting algebra. The study of atomic algebras, the characterization of simple algebras as subalgebras of $In(I, L_2)$ and the property that LM_n algebras without negation form an equational class are also due to Boicescu, as is the study of irredundant algebras and of exactly n-valued algebras. A. Iorgulescu introduced and studied m-complete LM_θ algebras with negation, generalizing many of the properties of m-complete Boolean algebras. G. Georgescu and I. Leuştean studied probabilities on LM algebras. L. Beznea studied a generalization of LM algebras obtained by dropping the determination principle. Let us also mention M. Sularia's theory of D algebras; these structures are subdirect products of a Heyting algebra and a Brouwer algebra and represent the algebraic counterpart of a logic of problem solving. In [44], [48], the authors study Łukasiewicz BCK algebras endowed with Moisil operators. In [163], [164], C. Sanza introduced and studied (monadic) n×m-valued Łukasiewicz algebras with negation. In a very recent paper [103], I. Leuştean proposes a unifying framework for LM_n algebras, MV algebras and Post algebras; essentially, an LM_{n+1} algebra is characterized by a string of n Boolean ideals of its Boolean center. Necessary and sufficient conditions are given that such a string must satisfy in order to define an MV_{n+1} algebra or a Post algebra of order n+1. This result can be seen as a generalization of Moisil's determination principle. As an application, in the paper [75], some special Cauchy completions of MV_{n+1} algebras are characterized by using the properties of the corresponding strings of Boolean ideals. In another very recent paper [75], G. Georgescu and A. Popescu introduced the notion of n-nuanced MV algebra, by performing a Łukasiewicz-Moisil nuancing construction on top of MV algebras.
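The algebra $In(J, L_2)$ of increasing functions that recurs in the representation results above is easy to exhibit in the finite case. A minimal sketch (names ours) showing that the nuance map $x \mapsto (r_1 x, \ldots, r_{n-1} x)$ identifies the canonical LM_n algebra with the increasing 0/1 sequences of length $n-1$:

```python
from fractions import Fraction
from itertools import product

n = 5
Ln = [Fraction(i, n - 1) for i in range(n)]

def r(j, x):
    """Moisil operator of the canonical LM_n algebra."""
    return 0 if j + x * (n - 1) < n else 1

def nuances(x):
    """Boolean nuance vector (r_1 x, ..., r_{n-1} x) of x."""
    return tuple(r(j, x) for j in range(1, n))

# Increasing 0/1 sequences of length n-1: the elements of In(J, L_2).
increasing = [v for v in product((0, 1), repeat=n - 1)
              if all(a <= b for a, b in zip(v, v[1:]))]

image = [nuances(x) for x in Ln]
assert len(set(image)) == n                   # determination principle: injective
assert sorted(image) == sorted(increasing)    # bijection onto In(J, L_2)
# x is recovered from its nuances as (number of 1's) / (n-1):
assert all(Fraction(sum(nuances(x)), n - 1) == x for x in Ln)
```

This makes the determination principle tangible: an element of $L_n$ carries exactly the information of its chain of Boolean nuances, and nothing more.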
These structures extend both MV algebras and Łukasiewicz-Moisil algebras, thus unifying two important types of structures in the algebra of logic. On the logical level, n-nuanced MV algebras amalgamate two distinct approaches to many-valuedness: that of the infinitely-valued Łukasiewicz logic, more related in spirit to the fuzzy approach, and that of Moisil n-nuanced logic, which is concerned with nuances of truth rather than degrees of truth. The authors study n-nuanced MV algebras mainly from the algebraic and categorial points of view and also consider some basic model-theoretic aspects. The relationship with a suitable notion of n-nuanced ordered group via an extension of the Γ construction is also analyzed.

8.1 n-nuanced MV algebras

Usually, MV algebras are defined only in terms of ⊕, $^-$ and 0. However, in order to point out the symmetry of these structures, the authors preferred the following slightly redundant definition:

Definition 8.1. An MV algebra is a structure $(A, \oplus, \odot, {}^-, 0, 1)$ satisfying the following axioms:
(MV1') $(A, \oplus, 0)$ and $(A, \odot, 1)$ are commutative monoids,
(MV2') $x \odot 0 = 0$ and $x \oplus 1 = 1$,
(MV3') $(x^-)^- = x$,
(MV4') $(x \oplus y)^- = x^- \odot y^-$,
(MV5') $(x \odot y^-) \oplus y = (y \odot x^-) \oplus x$.

Definition 8.2. A generalized De Morgan algebra is a structure $L = (L, \oplus, \odot, {}^-, 0, 1)$, where ⊕, ⊙ are binary operations, $^-$ is a unary operation and 0, 1 are constants, such that the following conditions hold: (i) $(L, \oplus, 0)$ and $(L, \odot, 1)$ are commutative monoids; (ii) $(x \oplus y)^- = x^- \odot y^-$ and $(x^-)^- = x$ for all $x, y \in L$.

Remark 8.3. If $L$ is a generalized De Morgan algebra, then $(x \odot y)^- = x^- \oplus y^-$ for all $x, y \in L$.

Definition 8.4. An n-nuanced MV algebra (NMV_n algebra for short) is a structure $L = (L, \oplus, \odot, {}^-, r_1, \ldots, r_{n-1}, 0, 1)$ such that $(L, \oplus, \odot, {}^-, 0, 1)$ is a generalized De Morgan algebra and $r_1, \ldots, r_{n-1}$ satisfy the following axioms:
(A0) $r_i x \oplus ((r_i x)^- \odot r_i y) = r_i y \oplus ((r_i y)^- \odot r_i x)$, for $i \in \{1, \ldots, n-1\}$,
(A1) $r_i(x \oplus y) = r_i x \oplus r_i y$, $r_i(x \odot y) = r_i x \odot r_i y$, $r_i(0) = 0$, $r_i(1) = 1$, for $i \in \{1, \ldots, n-1\}$,
(A2) $r_i x \oplus (r_i x)^- = 1$, $r_i x \odot (r_i x)^- = 0$, for $i \in \{1, \ldots, n-1\}$,
(A3) $r_i \circ r_j = r_j$, for $i, j \in \{1, \ldots, n-1\}$,
(A4) $r_i(x^-) = (r_{n-i} x)^-$, for $i \in \{1, \ldots, n-1\}$,
(A5) (determination principle) if $r_i x = r_i y$ for each $i \in \{1, \ldots, n-1\}$, then $x = y$,
(A6) $r_1 x \le r_2 x \le \ldots \le r_{n-1} x$.

Remark 8.5. NMV_n algebras provide a common generalization of MV and Łukasiewicz-Moisil algebras. Indeed, if n = 2 then, because of the determination principle, $r_1$ is the identity, so an NMV_n algebra can be identified with an MV algebra; if $(L, \oplus, \odot, {}^-, 0, 1)$ is a De Morgan algebra, then $(L, \oplus, \odot, {}^-, r_1, \ldots, r_{n-1}, 0, 1)$ becomes an LM_n algebra.

Example 8.6. Let $A = (A, \oplus, \odot, {}^-, 0, 1)$ be an MV algebra. Consider the set $T(A) = \{(x_1, \ldots, x_{n-1}) \in A^{n-1} \mid x_1 \le \ldots \le x_{n-1}\}$. Since $A^{n-1}$ is an MV algebra (with operations taken componentwise from $A$) and $T(A)$ is closed under the operations 0, 1, ⊕, ⊙ (where 0 and 1 are the constant vectors), we can consider these operations on $T(A)$. We furthermore define $^-, r_1, \ldots, r_{n-1}$ by:
$(x_1, \ldots, x_{n-1})^- = (x_{n-1}^-, \ldots, x_1^-)$, $r_i(x_1, \ldots, x_{n-1}) = (x_i, \ldots, x_i)$, for $i \in \{1, \ldots, n-1\}$.
Then $(T(A), \oplus, \odot, {}^-, r_1, \ldots, r_{n-1}, 0, 1)$ is an NMV_n algebra.

Define $M(L) = \{x \in L \mid r_i x = x$ for all $i \in \{1, \ldots, n-1\}\}$. Then $M(L)$, together with the operations ⊕, ⊙, $^-$, 0, 1 induced by $L$, is an MV algebra, called the MV-center of $L$. In the MV algebra $M(L)$ we have a canonical order ≤. Let us extend this order to $L$ by: $x \le y$ iff $r_i x \le r_i y$ for each $i \in \{1, \ldots, n-1\}$. Because of the determination principle this is indeed an order and, because of (A3), it is indeed an extension of the order on $M(L)$. Moreover, the compatibility properties listed in the following proposition are obvious.

Proposition 8.7.
The following properties are true in an NMV_n algebra $L$:
(1) 0 is the least and 1 is the greatest element of $L$ w.r.t. ≤;
(2) for each $x, y \in L$, $x \le y$ iff $y^- \le x^-$;
(3) for each $x, x', y, y' \in L$, if $x \le x'$ and $y \le y'$, then $x \oplus y \le x' \oplus y'$ and $x \odot y \le x' \odot y'$;
(4) $r_1 x \le x \le r_{n-1} x$, for any $x \in L$;
(5) for $x, y \in L$, if $x \oplus y = 1$ and $x \odot y = 0$, then $x, y \in M(L)$ and $y = x^-$;
(6) $M(L) = \{x \in L \mid x \oplus x^- = 1,\ x \odot x^- = 0\}$.

9 Applications to switching theory

Whereas Boolean algebra is a suitable tool for the study of networks made up of binary devices, the study of networks involving multi-positional devices and of the so-called hazard and race phenomena has imposed the use of other algebraic tools, namely Galois fields, Łukasiewicz-Moisil algebras and the theory of discrete functions. Moisil investigated circuits involving devices such as polarized relays with unstable neutral, ordinary relays under low self-maintaining current, valves, resistances, multi-positional relays, as well as transistors and other electronic devices; see also [125]. Moisil described the operation of such devices by characteristic equations of the form $x_{k+1} = \varphi(\xi_k, x_k)$, where the variable $x$ associated with the relay contact takes values in $L_n$, where $n \le 5$ depends on the type of the relay, $\xi \in L_2$ is a variable associated with the current, and the index $k$ or $k+1$ indicates the value of the corresponding variable at time $t = k$ or $t = k+1$, respectively. The synthesis problem consists in designing a circuit made up of several relays whose operation is described by a given equation of the form

$X_{k+1} = f(A_k, X_k)$, (14)

where $X$ is the vector of the variables $x$ associated with the relays of the circuit, $A$ is the input vector and the meaning of the index $k$ or $k+1$ is the same as above.
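As a toy illustration of such characteristic equations (the device model here is ours and purely hypothetical, not one of Moisil's actual circuits), one can iterate $x_{k+1} = \varphi(\xi_k, x_k)$ for a three-positional device that steps toward 1 when the current is on and toward 0 when it is off:

```python
from fractions import Fraction

L3 = [Fraction(0), Fraction(1, 2), Fraction(1)]

def phi(xi, x):
    """Hypothetical characteristic equation x_{k+1} = phi(xi_k, x_k):
    step up toward 1 when xi = 1, step down toward 0 when xi = 0."""
    step = Fraction(1, 2)
    return min(Fraction(1), x + step) if xi == 1 else max(Fraction(0), x - step)

# Iterate the device from the rest position under an input sequence.
x = Fraction(0)
trace = [x]
for xi in [1, 1, 0, 1]:
    x = phi(xi, x)
    trace.append(x)

assert all(v in L3 for v in trace)   # the state always stays in L_3
assert trace == [Fraction(0), Fraction(1, 2), Fraction(1),
                 Fraction(1, 2), Fraction(1)]
```

The point of the sketch is only the shape of the model: state in $L_n$, binary input, and a next-state function, which is exactly the data the synthesis problem below manipulates in vector form.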
To solve this problem, Moisil noticed the crucial point that the structure of such a circuit is determined by the expression of a function $g$ which satisfies the identity

$\Xi = g(A, X)$, (15)

where $\Xi$ is the vector of the variables $\xi$ associated with the relays of the circuit. So if

$X_{k+1} = \Phi(\Xi_k, X_k)$ (16)

is the vector form of the characteristic equations of the relays of the circuit, it follows from (14) and (15) that

$f(A_k, X_k) = \Phi(\Xi_k, X_k)$ (17)

for any $k$. Therefore (15) transforms (17) into the identity

$f(A, X) = \Phi(g(A, X), X)$, (18)

and Moisil's method for solving the synthesis problem consists in solving the functional equation (18) with respect to $g$.

References

[1] A. Amroune, Représentation des algèbres de Łukasiewicz θ-valentes involutives par des structures floues, BUSEFAL (Institut de Recherche en Informatique de Toulouse), 43, 1990, 5-11.
[2] M. Abad, Three valued Łukasiewicz algebras with an additional operation, Rev. Union Mat. Argentina, 32, 1985, 107-117.
[3] M. Abad, A. Figallo, Characterization of three-valued Łukasiewicz algebras, Rep. Math. Logic, 18, 1984, 47-59.
[4] M. Abad, L. Monteiro, On three-valued Moisil algebras, Logique et Analyse, 27, 1984, 407-414.
[5] I. Bădele, V. Boicescu, Sur les extensions des algèbres de Łukasiewicz, C.R. Acad. Sci. Paris, 269, 1969, 313-315.
[6] D. Becchio, Axiomatisation d'une logique trivalente łukasiewiczienne, 1971, manuscript.
[7] D. Becchio, Nouvelle démonstration de la complétude du système de Wajsberg axiomatisant la logique trivalente de Łukasiewicz, C.R. Acad. Sci. Paris, 275, 1972, 679-681.
[8] D. Becchio, Sur les définitions des algèbres trivalentes de Łukasiewicz données par A. Monteiro, Logique et Analyse, 63-64, 1973, 339-344.
[9] D. Becchio, Algèbres de Heyting, algèbres de Brouwer et algèbres trivalentes de Łukasiewicz, Logique et Analyse, 21, 1978, 237-248.
[10] D. Becchio, Logique trivalente de Łukasiewicz, Ann. Sci. Univ. Clermont-Ferrand, 16, 1978, 38-89.
[11] D. Becchio, L.
Iturrioz, Sur une définition des algèbres de Łukasiewicz et de Post d'ordre n, Demonstratio Math., 11, 1978, 1083-1094.
[12] L. Beznea, θ-valued Moisil algebras and dual categories (Romanian), master's thesis, University of Bucharest, 1981.
[13] V. Boicescu, Sur la représentation des algèbres de Łukasiewicz θ-valentes, C.R. Acad. Sci. Paris, 270, 1970, 4-7.
[14] V. Boicescu, Sur les algèbres de Łukasiewicz, in: Logique, Automatique, Informatique, Editions de l'Académie de la R.S. Roumanie, 1971, 71-89.
[15] V. Boicescu, Sur les systèmes déductifs dans la logique θ-valente, Publ. Dép. Math. Lyon, 8, 1971, 123-133.
[16] V. Boicescu, On Łukasiewicz algebras (Romanian), in: Probleme de logică, vol. IV, Ed. Academiei R.S. România, Bucharest, 1972, 245-276.
[17] V. Boicescu, On θ-valued logics (Romanian), in: Probleme de logică, vol. V, Edit. Acad. R.S. România, 1973, 241-255.
[18] V. Boicescu, Sur une logique polyvalente, Rev. Roum. Sci. Soc., Sér. Philos. et Logique, 17, 1973, 393-405.
[19] V. Boicescu, Researches in Łukasiewicz algebras, Rev. Roum. Sci. Soc., Sér. Philos. et Logique, 20, 1976, 197-200.
[20] V. Boicescu, Extensions of homomorphisms of Łukasiewicz algebras with bounding semimorphisms, Rev. Roum. Sci. Soc., Sér. Philos. et Logique, 23, 1979, 367-370.
[21] V. Boicescu, Contributions to the study of Łukasiewicz algebras (Romanian), Ph.D. thesis, University of Bucharest, 1984.
[22] V. Boicescu, Irredundant n-valued Moisil algebras, Discrete Math., 71, 1988, 197-204.
[23] V. Boicescu, G. Georgescu, Les algèbres de Łukasiewicz centrées et axées, Rev. Roum. Math. Pures et Appl., 15, 1970, 675-681.
[24] V. Boicescu, A. Iorgulescu, Current research in the field of Łukasiewicz-Moisil algebras (Romanian), Studii Cerc. Mat., 39, 1987, 97-106.
[25] V. Boicescu, A. Filipoiu, G. Georgescu, S. Rudeanu, Łukasiewicz-Moisil Algebras, Annals of Discrete Mathematics, 49, North-Holland, 1991.
[26] C. C.
Chang, Algebraic analysis of many valued logics, Trans. Amer. Math. Soc., 88, 1958, 467-490.
[27] R. Cignoli, Boolean elements in Łukasiewicz algebras. I, Proc. Japan Acad., 41, 1965, 670-675.
[28] R. Cignoli, Un teorema de representación para álgebras de Łukasiewicz trivalentes, Rev. Union Mat. Argentina, 23, 1966, 41.
[29] R. Cignoli, Algebras de Moisil de orden n, Ph.D. thesis, Universidad Nacional del Sur, Bahia Blanca, 1969.
[30] R. Cignoli, Moisil algebras, Notas de Logica Matematica, Inst. Mat., Univ. Nacional del Sur, Bahia Blanca, no. 27, 1970.
[31] R. Cignoli, Representation of Łukasiewicz algebras and Post algebras by continuous functions, Colloq. Math., 24, 1972, 127-138.
[32] R. Cignoli, Topological representation of Łukasiewicz and Post algebras, Notas de Logica Matematica, Inst. Mat., Univ. Nacional del Sur, Bahia Blanca, no. 33, 1974.
[33] R. Cignoli, Coproducts in the categories of Kleene and three-valued Łukasiewicz algebras, Studia Logica, 39, 1979, 237-245.
[34] R. Cignoli, Proper n-valued Łukasiewicz algebras as S-algebras of Łukasiewicz n-valued propositional calculi, Studia Logica, 41, 1982, 3-16.
[35] R. Cignoli, An algebraic approach to elementary theories based on n-valued Łukasiewicz logics, Z. Math. Logik u. Grundl. Math., 30, 1984, 87-96.
[36] R. Cignoli, M. S. de Gallego, The lattice structure of 4-valued Łukasiewicz algebras, J. Symbolic Logic, 46, no. 1, 1981, 185.
[37] R. Cignoli, M. S. de Gallego, The lattice structure of some Łukasiewicz algebras, Algebra Universalis, 13, 1981, 315-328.
[38] R. Cignoli, A. Monteiro, Boolean elements in Łukasiewicz algebras. II, Proc. Japan Acad., 41, 1965, 676-680.
[39] R. Cignoli, D. Mundici, An elementary proof of Chang's completeness theorem for the infinite-valued calculus of Łukasiewicz, Studia Logica, to appear.
[40] R. Cignoli, I. M. L. D'Ottaviano, D.
mundici, algebraic foundations of many-valued reasoning, kluwer 2000, volume 7. [41] j. coulon, j.l. coulon, a propos de la représentation des algèbres de łukasiewicz et des algèbres booléiennes floues, rev. roum. math. pures et appl., 34, 1989, 403-411. [42] j. coulon, j.l. coulon, un nouveau resultat concernant la representation d’une algèbre de łukasiewicz involutive dans l’algèbre des parties floues d’une structure floue involutive, rev. roumaine math. pures et appl., 38, 1993, 319-326. [43] a. figallo, j. tolosa, algebras de łukasiewicz trivalente, univ. nac. de san juan, 1982. [44] a. figallo jr., m. figallo and a. ziliani, free (n + 1)-valued łukasiewicz bck algebras, demonstratio mathematica, xxxvii, 2, 2004, 245-254. [45] a.v. figallo, i. pascual, a. ziliani, notes on monadic n-valued łukasiewicz algebras, math. bohem., 3, 129, 2004, 255-271. [46] a.v. figallo, c. sanza , a. ziliani, functional monadic n-valued łukasiewicz algebras, accepted by mathematica bohemica. [47] a.v. figallo, i. pascual, a. ziliani, subdirectly irreducible monadic łukasiewicz-moisil algebras, manuscript. [48] a.v. figallo, łukasiewicz bck-algebras endowed with moisil operators, manuscript. [49] a.v. figallo, a. figallo jr., m. figallo, a. ziliani, łukasiewicz residuation algebras with infimum, manuscript. [50] a. filipoiu, analytic tableaux for θ -valued propositional logic, math. seminar notes, 6, 1978, 517-526. [51] a. filipoiu, representation theorems for łukasiewicz algebras, discrete math., 27, 1979, 107-110. [52] a. filipoiu, representation of łukasiewicz algebras by means of ordered stone spaces, discrete math., 30, 1980, 111-116. [53] a. filipoiu, θ -valued łukasiewicz-moisil algebras and logics (romanian), ph.d. thesis, univ. of bucharest, 1981. [54] a. filipoiu, representation theorems for θ -valued łukasiewicz algebras, discrete math., 33, 1981, 21-27. [55] a. filipoiu, some remarks on the representation theorem of moisil, discrete math., 33, 1981, 163-170. [56] j. m. 
font, a. j. rodriguez, a. torrens, wajsberg algebras, stochastica, viii, 1, 5-31, 1984. [57] g. georgescu, algébres de łukasiewicz complètes, c.r. acad. sci. paris, 269, 1969, 1181-1184. [58] g. georgescu, the centered epimorphisms and the construction of the tensor product in łuk, rev. roum. math. pures appl., 15, 1970, 693-709. [59] g. georgescu, n-valued complete łukasiewicz algebras, rev. roum. math. pures appl., 16, 1971, 41-50. 94 george georgescu, afrodita iorgulescu, sergiu rudeanu [60] g. georgescu, the θ -valued łukasiewicz algebras. i., rev. roum. math. pures appl., 16, 1971, 195-209. [61] g. georgescu, algebras de łukasiewicz de orden θ . ii., rev. roum. math. pures appl., 16, 1971, 363-369. [62] g. georgescu, les algèbres de łukasiewicz θ -valentes, in: logique, automatique, informatique, edit. acad. r.s.romania, bucharest, 1971, 99-169. [63] g. georgescu, the θ -valued łukasiewicz algebras. iii. duality theory., rev. roum. math. pures appl., 16, 1971, 1365-1390. [64] g. georgescu, algebre łukasiewicz θ -valente, ph.d. thesis, math. institute, bucharest, mai 1972. [65] g. georgescu, représentation des algèbres de łukasiewicz θ -valentes polyadiques, c. r. acad. sci. paris, a-b, 274, 1972, a944-a946. [66] g. georgescu, reprezentarea algebrelor łukasiewicz poliadice local finite de grad infinit, studii cerc. mat., 24, 1972, 1107-1117. [67] g. georgescu, some remarks on the polyadic łukasiewicz algebras, rev. roum. mat. pures appl., 22, 1977, 641-648. [68] g. georgescu, on the homogeneous-universal łukasiewicz algebras, rev. roum. mat. pures appl., 23, 1978, 29-32. [69] g. georgescu, a. iorgulescu, pseudo-mv algebras: a noncommutative extension of mv algebras, the proceedings of the fourth international symposium on economic informatics, bucharest, romania, may 1999, 961-968. [70] g. georgescu, i. leuştean, towards a probability theory based on moisil logic, soft computing, 2000, 4, no.1, 19-26. [71] g. georgescu, a. 
iorgulescu, pseudo-bl algebras: a noncommutative extension of bl algebras, abstracts of the fifth international conference fsta 2000, slovakia, february 2000, 90-92. [72] g. georgescu, a. iorgulescu, pseudo-bck algebras: an extension of bck algebras, proceedings of dmtcs’01: combinatorics, computability and logic, springer, london, 2001, 97-114. [73] g. georgescu, i. leuştean, probabilities on łukasiewicz-moisil algebras, international journal of approximate reasoning, 1998, 18, no.3-4, 201-215. [74] g. georgescu, i. leuştean, conditional probabilities on łukasiewicz-moisil algebras, analele universitatii bucuresti, 1998, 47, 55-64. [75] g. georgescu., i. leuştean, a. popescu, order convergence and distance on łukasiewicz-moisil algebras, multiple valued logic, to appear. [76] g. georgescu, a. popescu, a common generalization for algebras and łukasiewicz-moisil algebras, submitted. [77] g. georgescu, c. vraciu, le spectre maximal dšune algèbre de łukasiewicz, c.r. acad. sci. paris, 268, 1969, 928-929. [78] g. georgescu, c. vraciu, n-valued centered łukasiewicz algebras, rev. roum. math. pures appl., 14, 1969, 712-723. [79] g. georgescu, c. vraciu, sur les algèbres de łukasiewicz centrées, c.r. acad. sci. paris, 268, 1969, 9981000. [80] g. georgescu, c. vraciu, sur le spectre maximal d’une algèbre de łukasiewicz, publ. dép. math. lyon, 6, 1969, 42-54. [81] g. georgescu, c. vraciu, sur les épimorphismes centrées des algèbres de łukasiewicz, c.r. acad. sci. paris,269, 1969, 4-6. grigore c. moisil (1906 1973) and his school in algebraic logic 95 [82] g. georgescu, c. vraciu, on the characterization of centered łukasiewicz algebras, j. algebra, 16, 1970, 486-495. [83] g. georgescu, c. vraciu, monadic boolean algebras and monadic łukasiewicz algebras (romanian), studii cerc. mat., 23, 1971, 1025-1048. [84] g. georgescu, c. vraciu, la dualité des algèbres de post θ -valentes, j. algebra, 21, 1972, 74-86. [85] h. goldberg, h. leblanc, g. 
weaver, a strong completeness theorem for 3-valued logic, notre dame j. formal logic, 15,1974, 325-332. [86] m. greniewski, using three valued logics in the theory of swiching theory (romanian). i. realizarea cu circuite a funcţiilor fundamentale, comunic. acad. r.p.r., 6, 1956, 225-229. [87] r. grigolia, algebraic analysis of łukasiewicz-tarski’s n-valued logical systems, in: selected papers on łukasiewicz sentential calculi (r. wójcicki and g. malinowski, eds.), polish acad. of sciences, ossolineum, wroclaw, 1977, 81-92. [88] a. iorgulescu, on the construction of three-valued łukasiewicz-moisil algebras, discrete math., 47, 1984, 213-227. [89] a. iorgulescu, functors between categories of three-valued łukasiewicz-moisil algebras, discrete math., 49, 1984, 121-131. [90] a. iorgulescu, (1 + θ )-valued łukasiewicz-moisil algebras with negation (romanian), ph.d. thesis, univ. of bucharest, 1984. [91] a. iorgulescu, connections between mvn algebras and n-valued łukasiewicz-moisil algebras i, discrete mathematics, 181 (1-3), 1998, 155-177. [92] a. iorgulescu, connections between mvn algebras and n-valued łukasiewicz-moisil algebras ii, discrete mathematics, 202, 1999, 113-134. [93] a. iorgulescu, connections between mvn algebras and n-valued łukasiewicz-moisil algebras -iii, manuscript. [94] a. iorgulescu, connections between mvn algebras and n-valued łukasiewicz-moisil algebras iv, journal of universal computer science, vol. 6, no i , 2000, 139-154. [95] l. iturrioz, axiomas para el calculo proposicional trivalente de łukasiewicz, rev. union mat. argentina, 22, 1965, 150. [96] l. iturrioz, sur une classe particulière d’algèbres de moisil, c. r. acad. sci. paris, 267, 1968, 585-588. [97] l. iturrioz, les algèbres de heyting-brouwer et de łukasiewicz trivalentes, notre dame j. formal logic, 17, 1976, 119-126. [98] l. iturrioz, algèbres de łukasiewicz symétriques, publ. dép. math. lyon, 13, 1976, 73-96. [99] l. iturrioz, łukasiewicz and symmetrical heyting algebras, z. 
math. logik u. grund.math., 23, 1977, 131136. [100] l. iturrioz, two characteristic properties of monadic three-valued łukasiewicz algebras, rep. math. logic, 8, 1977, 63-68. [101] l. iturrioz, an axiom system for three-valued łukasiewicz propositional calculus, notre dame j. formal logic, 18, 1977, 616-620. [102] l. iturrioz, o. rueda, algèbres implicatives trivalentes de łukasiewicz libres, discrete math., 18, 1977, 35-44. [103] i. leuştean, a unifying framework for łukasiewicz-moisil algebras, mv-algebras and post algebras, submitted. 96 george georgescu, afrodita iorgulescu, sergiu rudeanu [104] j. łukasiewicz, on three-valued logic (polish), ruch filozoficzny, 5, 1920, 160-171. [105] j. łukasiewicz, philosophische bemerkungen zur mehrwertigen systemen des aussagenkalküls, c.r. séances soc. sci. lettres varsovie, cl. iii, 23, 1930, 51-77. romanian translation in: logică şi filozofie, ed. politică, bucureşti, 1966, 295-320. [106] j. łukasiewicz, die logik und das grundlagenproblem, les entretiens de zurich sur les fondements et la méthode des sciences mathématiques, 1941, 88-100. [107] j. łukasiewicz, a. tarski, untersuchungen über den aussagenkalkül, c.r. séances soc. sci. lettres varsovie, cl. iii, 23, 1930, 30-50. [108] m.g. malinowski, n-valued łukasiewicz algebras and their connection to post algebras of order (polish), zeszyty naukowe u.t., filozofia, 1972. [109] p. mangani, on certain algebras related to many-valued logics (italian), boll. un. mat. ital. (4) 8, 68-78, 1973. [110] w. marek, t. traczyk, generalized łukasiewicz algebras, bull. acad. polonaise sci. sér. math. astronom. phys., 17, 1969, 789-792. [111] gr. c. moisil, recherches sur l’algèbre de la logique, ann. sci. univ. jassy, 22, 1935, 1-117. [112] gr. c. moisil, sur le mode problématique, c.r. séances acad. sci. roumanie, 2, no. 2, 1938, 101-103. [113] gr. c. moisil, recherches sur les logiques non-chrysippiennes, ann. sci. univ. jassy, 26, 1940, 431-466. [114] gr.c. 
moisil, notes sur les logiques non-chrysippiennes, ann. sci, univ. jassy, 27, 1941, 86-98. [115] gr. c. moisil, sur les anneaux de caractéristique 2 ou 3 et leurs applications, bul. politechn. bucharest, 12, 1941, 66-90. [116] gr. c. moisil, contributions à l’étude des logiques non-chrysippiennes. i. un nouveau système d’axiomes pour les algèbres łukasiewicziennes tétravalentes, c.r. acad. sci. roumanie, 5, 1941, 289-293. [117] gr. c. moisil, contributions à l’étude des logiques non-chrysippiennes. ii. anneaux engendrés par les algèbres łukasiewicziennes centrées, c.r. acad. sci. roumanie, 6, 1942, 9-14. [118] gr. c. moisil, contributions à l’étude des logiques non-chrysippiennes. iii. anneaux engendrés par les algèbres łukasiewicziennes tétravalentes axées, c.r. acad. sci. roumanie, 7, 1942, 14-18. [119] gr. c. moisil, logique modale, disquis. math. phys., 2, 1942, 3-98. [120] gr. c. moisil, the algebra of networks with rectifiers (romanian), rev. univ. c.i. parhon şi a politehnicii bucureşti, 4-5, 1954, 9-41. [121] gr. c. moisil, utilization of three-valued logics in the theory of switching circuits. ii. the characteristic equation of a relay. iii. actual-contact circuits. iv. realization of the working functions in actual operation (romanian). comun. acad. r.p. române, 6, 1956, 231-239, 385-386, 971-973. [122] gr. c. moisil, applications of three-valued logics to the study of actual operation of relay-contact circuits (romanian), bul. mat. soc. st. fiz. r.p. române, 1(49), 1957, 147-191. [123] gr. c. moisil, utilization of three-valued logics to the theory of switching circuits. v. p-i circuits (romanian), comunic. acad. r.p. române, 8, 1958, 1127-1128. [124] gr. c. moisil, utilization of three-valued logics to the theory of switching circuits. vi. polarized relays with unstable neutral. vii. operation of ordinary relays under low self-maintaining current. viii. -twoterminals with contacts and resistances. ix. 
-two-terminals with contacts, valves and resistances. x. physical interpretation of the characteristic function of a multiterminal (romanian), comunic. acad. r.p. române, 9, 1959, 411-413, 531-532, 533-535, 665-666, 667-669. grigore c. moisil (1906 1973) and his school in algebraic logic 97 [125] gr. c. moisil, sur l’application de la logique à trois valeurs à l’étude des circuits électriques à contacts, redresseurs et résistances, rev. math. pures appl. , 4, 1959, 173-183. [126] gr. c. moisil, the algebraic theory of switching circuits (romanian), ed. tehnică bucureşti. english translation 1969, pergamon press, oxford, and editura tehnică bucureşti. [127] gr.c. moisil, sur les idéaux des algèbres łukasiewicziennes trivalentes, analele univ. c.i. parhon, seria acta logica, 3, 1960, 83-95. [128] gr.c. moisil, on predicate calculus in three-valued logics (russian), an. univ. c.i. parhon, acta logica, 4, 1961, 103-112. [129] gr.c. moisil, sur la logique à trois valeurs de łukasiewicz, an. univ. c.i. parhon, acta logica, 5, 1962, 103-117. [130] gr.c. moisil, les logiques non-chrysippiennes et leurs applications, acta philos. fennica, 16, 1963, 137152. [131] gr.c. moisil, le algèbre di łukasiewicz, an. univ. c.i. parhon, acta logica 6, 1963, 97-135. [132] gr.c. moisil, applicazioni dell’algebra alle calcolatrici moderne, atti reunione del groupement des math. d’expression latine, 26.ix-3.x. 1961, ed. cremonese, roma, 1963. [133] gr.c. moisil, the interest of the actual operation of switching circuits for the logician, an. univ. bucureşti, acta logica, 7-8, 1964, 131-139. [134] gr.c. moisil, sur les logiques de łukasiewicz à un nombre fini de valeurs, rev. roum. math. pures appl., 9, 1964, 905-920. [135] gr.c. moisil, încercări vechi şi noi de logică neclasică (old and new essays on non-classical logics), edit. ştiinţifică, bucharest, 1965. [136] gr.c. moisil, théorie structurelle des automats finis, gauthiers-villars, paris, 1967. [137] gr.c. 
moisil, łukasiewiczian algebras, computing center, university of bucharest (preprint), 311-324, 1968. [138] gr.c. moisil, essais sur les logiques non-chrysippiennnes, ed. academiei r.s.r., bucharest, 1972. [139] gr.c. moisil, ensembles flous et logiques à plusieurs valeurs, centr. rech. math., université de montréal, mai, 1973 (preprint). [140] gr.c. moisil, lecţii despre logica raţionamentului nuanţat (lectures on the logic of fuzzy reasoning), ed. ştiinţifică şi enciclopedică, bucharest, 1975. [141] gr.c. moisil, sur l’emploi des mathématiques dans les sciences de l’homme, accad. naz. lincei, contributi del centro linceo interdisciplinare di sci. mat. e loro appl., no. 17, 1976. [142] a. monteiro, matrices de morgan caracteristiques pour le calcul propositional classique, ann. acad. brasil, 52, 1960, 1-7. [143] a. monteiro, sur la définition des algèbres de łukasiewicz trivalentes, notas de logica matematica, 21, 1964. [144] a. monteiro, construction des algèbres de łukasiewicz trivalentes dans les algèbres de boole monadiques, notas de logica matematica, 11, 1964. [145] a. monteiro, r. cignoli, construccion geometrica de las algebras de łukasiewicz trivalentes libres, rev. union mat. argentina, 22 , 1965, 152-153. [146] l. monteiro, axiomes indépendentes pour les algèbres de łukasiewicz trivalentes, notas de logica matematica, 32, 1974. 98 george georgescu, afrodita iorgulescu, sergiu rudeanu [147] l. monteiro, algebras de łukasiewicz trivalentes monadicas, notas de logica matematica, 32, 1974. [148] l. monteiro, sur la construction l des algèbres de łukasiewicz trivalentes, rev. roum. math. pures appl., tome xxiii, no. 1, 1978, 77-83. [149] l. monteiro, l.g. coppola, sur une construction des algèbres de łukasiewicz trivalentes, notas de logica matematica, 17, 1964. [150] d. mundici, mv-algebras are categorically equivalent to bounded commutative bck-algebras, math. japonica, 31, no. 6, 1986, 889-894. [151] d. 
mundici, interpretation of af c?-algebras in łukasiewicz sentential calculus, j. funct. anal., 65, 1986, 15-63. [152] d. mundici, the c∗-algebras of three-valued logic, logic colloquium’88, ferro, bonotto, valentini and zanardo (editors), elsevier science publishers b.v. (north-holland), 1989, 61-77. [153] gh. nadiu, on a method for the construction of three-valued łukasiewicz algebras (romanian), studii cerc. mat., 19, 1967, 1063-1070. [154] a. petcu, the definition of the trivalent łukasiewicz algebras by three equations, rev. roumaine math. pures appl., 13, 1968, 247-250. [155] i. petrescu (i. voiculescu), injective objects in the category of de morgan algebras, rev. roumaine math. pures appl., 16, 1971, 921-926. [156] d. ponasse, algèbres floues et algèbres de łukasiewicz, rev. roumaine math. pures appl., xxiii, 1, 1978, 103-111. [157] e. post, introduction to a general theory of elementary propositions, amer. j. math., 43, 1921, 163-185. [158] e. radu, l’oeuvre de gr. c. moisil en logique mathématique, i., ii., rev. roumaine math. pures appl., 23, 1978, 463-477, 605-610. [159] j. rodriguez, a. torrens, wajsberg algebras and post algebras, studia logica, 53, 1994, 1-19. [160] p. rosenbloom, post algebras. i. postulates and general theory, amer. j. math., 64, 1942, 167-183. [161] s. rudeanu, on łukasiewicz-moisil algebras of fuzzy sets, studia logica, 52, 1993, 95-111. [162] a. sade, algèbres de łukasiewicz dans la logique trivalente, univ. beograd, publ. elektrotehn. fak. ser. mat. fiz., no. 247-273, 1969, 123-130. [163] c. sanza, notes on n×m-valued łukasiewicz algebras with negation, l. j. of the igpl, 6, 12, 2004, 499-507 () [164] c. sanza, monadic n×m-valued łukasiewicz algebras with negation, manuscript. [165] c. sicoe, strictly chrysippian elements in many-valued łukasiewicz algebra (romanian), an. univ. of bucharest, 15, 1966, 123-126. [166] c. sicoe, sur les ideaux des algèbres łukasiewicziennes polivalentes, rev. roum. math. 
pures appl., 12, 1967, 391-401. [167] c. sicoe, note asupra algebrelor łukasiewicziene polivalente, stud. şi cerc. mat., 19, 1967, 1203-1207. [168] c. sicoe, on many-valued łukasiewicz algebras, proc. japan acad., 43, 1967, 725-728. [169] c. sicoe, a characterization of łukasiewicz algebras. i. ii. proc. japan acad., 43, 1967, 729-732, 733-736. [170] c. sicoe, sur la définition des algèbres łukasiewicziennes polyvalentes, rev. roumaine math. pures appl., 13, 1968, 1027-1030. grigore c. moisil (1906 1973) and his school in algebraic logic 99 [171] w. suchon, on the non-equivalence of two definitions of the algebra of łukasiewicz, p polish acad. sci. inst. philos. sociol. bull. sect. logic, 1, no. 1, 1972, 35-37. [172] w. suchon, inéquivalence de certaines définitions des algèbres infinites de łukasiewicz, rep. math. logic, 1, 1973, 21-26. [173] w. suchon, on defining moisil’s functors in n-valued łukasiewicz propositional logic, polish acad. sci. inst. philos. sociol. bull. sect. logic, 2, 1973, 195-196. [174] w. suchoń, définition des foncteurs modaux de moisil dans le calcul n-valent des propositions de łukasiewicz avec implication et négation, reports on mathematical logic 2, 43-48, 1974. [175] w. suchon, matrix łukasiewicz algebras, rep. math. logic, 4, 1975, 91-104. [176] w. suchon, définition des foncteurs modaux de moisil dans le calcul n-valent des propositions de łukasiewicz avec implication et négation, reports on mathematical logic, 2, 1974, 43-48. [177] j. varlet, algèbres de łukasiewicz trivalentes, bull. soc. roy. sci. liège, 36, 1968, 399-408. [178] j. varlet, considérations sur les algèbres de łukasiewicz trivalentes, bull. soc. roy. sci. liège,38, 1969, 462-469. [179] m. wajsberg, axiomatization of three-valued propositional calculus (polish), c.r. séances soc. sci. lettres varsovie, cl. iii, 24, 1931, 126-145. [180] r. wójcicki, a theorem on the finiteness of the degree of maximality of the n-valued łukasiewicz logic, polish acad. sci. inst. 
george georgescu
university of bucharest
address: str. academiei no. 14, romania
e-mail: georgescu@funinf.cs.unibuc.ro

afrodita iorgulescu
academy of economic studies
address: piaţa romană nr. 6 r 70167, oficiul poştal 22, bucharest, romania
e-mail: afrodita@ase.ro

sergiu rudeanu
university of bucharest
address: str. academiei no. 14, romania
e-mail: rud@funinf.cs.unibuc.ro

international journal of computers, communications & control vol. iii (2008), no. 1, pp. 69-79

olc, on-line compiler to teach programming languages
cayetano guerra artal, maria dolores afonso suarez, idafen santana perez, ruben quesada lopez

editorial note: the program committee of icvl 2007 sent nine of the best papers presented at the conference to ijccc, with the recommendation that they be published in an extended form. after a new evaluation, our reviewers decided that five of those papers can be published in ijccc, two of them in this issue.

abstract: the advance of the internet towards web 2.0 reveals its potential in a wide range of fields. the ongoing progress of web technology and its availability for teaching and learning, together with a student profile increasingly used to managing large amounts of digital information, offer lecturers the opportunity and the challenge of putting internet-based didactic tools at students' disposal. programming is one of the essential areas taught in university studies of computer science and other engineering degrees. at present, it is a knowledge acquired through tutorial classes and practice with different programming tools. this paper presents the experience acquired in the development and use of a simple compiler accessible through a web page.
in addition, it presents a teaching proposal for its use in subjects that include programming language lessons. olc (on-line compiler) is an application which greatly lightens the student's workload at the initial stage of programming. during this initial period students have to deal neither with the complexities of installing and configuring these types of tools, nor with understanding the multiple options they present. they can therefore concentrate on the comprehension of programming structures and of the programming language being studied.
keywords: compiler, e-learning, interactivity, virtual laboratory, web 2.0, video tutorials.

1 introduction

new information and communication technologies (icts) continue to bring changes to many areas of society: a society in which information, and its use and distribution in digital format, is part of the tasks carried out every day. new generations understand their environment, as well as the use of these technologies, in a natural way. this is reflected in university students, who are increasingly used to managing this kind of information [1]. from its beginning, the internet has been a milestone in worldwide communication [2]. it provides users with a hypertext document system with rich content and hyperlinks that can reference other resources: text documents, images, animations, audio or video. at first, information was contained in html pages that were not updated very frequently. from there, the web evolved towards dynamic html pages built on content management systems (cms) with databases, and then towards web 2.0, a concept that has become a reality on the internet and has changed both the contents and the way users interact with them. users have turned their role from sharing these contents into editing and classifying them.
2 web 2.0

the term web 2.0 [3], first coined in 2004, refers to a new vision of the web, which has evolved towards user communities, interaction and social networks. amongst the technologies that bring web 2.0 into reality we find: those that use the web as a platform to host desktop applications; those that separate content and design using style sheets; those that support rss; and those that allow the management of users and communities. likewise, there exists a set of web services that can be considered indicators of the advance towards web 2.0 [4]. these services provide communication mechanisms that make it possible to present information to users actively; amongst them we find advertising services, shared photo databases, community-generated tag databases, content delivery, encyclopaedias and personal web pages.

web browsers have also experienced important progress in their function. in their beginnings, their function consisted of showing the contents of static web pages and their hyperlinks. later they went further, interpreting interfaces that display the content of one or more databases. at present they support the newest technologies, allowing not only interaction with dynamic web pages but also a wide range of web services, amongst them social networks, which promote information interchange between users.

the internet has thus become a relevant instrument for teaching and the spreading of knowledge, whatever the subject area. the presence of e-learning on the internet has made it possible to increase the quality and availability of education, and its use to create teaching applications is of great interest, since features such as interactivity can be exploited.
3 olc: learning programming

nowadays, programming is one of the essential areas taught in university studies of computer science and other engineering degrees, as well as in diplomas of computer science. at present, it is a knowledge acquired through theoretical classes and practice with different programming tools such as editors, compilers, linkers, debuggers or interpreters, complemented by teacher tutorial classes. present in all programming subjects, compilers generate the executable programs made of the instructions written by programmers. compilation is a process carried out as a succession of operations that translate the instructions written in a programming language into code in another language that the computer understands; this is how executable programs are obtained.

web 2.0 is at its peak; this fact, and the change in the function of web browsers, make it possible to use them as production tools such as text editors, spreadsheets or, as we describe in the present work, a compiler. in this way a multilanguage ide can be moved from the computer to a web page. olc (on-line compiler) is an application which greatly lightens the student's workload at the initial stage of programming. its use is especially intended for subjects where students must learn how to write programs. during this initial period they have to deal neither with the complexities of installing and configuring these types of tools, nor with understanding the multiple options they present; they can therefore concentrate on the comprehension of programming structures and of the programming language being studied. the use made of the compiler in this initial stage is restricted to creating programs from a set of instructions free of errors: the student will have written these instructions and the compiler will have helped to correct them.
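the succession of operations just described, translating source instructions into code the machine executes, can be illustrated with a toy translator. this is a sketch for illustration only, not olc's implementation: it parses an arithmetic expression and emits instructions for a simple stack machine, which then plays the role of the computer.

```python
# toy illustration of compilation (not olc's implementation):
# translate an arithmetic expression into instructions for a
# simple stack machine, then execute those instructions.
import ast

def compile_expr(source):
    """translate e.g. '2 + 3 * 4' into stack-machine instructions."""
    ops = {ast.Add: "ADD", ast.Sub: "SUB", ast.Mult: "MUL", ast.Div: "DIV"}
    code = []

    def emit(node):
        if isinstance(node, ast.BinOp):      # post-order traversal:
            emit(node.left)                  # operands first,
            emit(node.right)
            code.append((ops[type(node.op)], None))  # operator last
        elif isinstance(node, ast.Constant):
            code.append(("PUSH", node.value))
        else:
            raise SyntaxError("unsupported construct")

    emit(ast.parse(source, mode="eval").body)  # analysis phase
    return code                                # generated "target code"

def run(code):
    """the 'computer': execute the generated instructions."""
    stack = []
    for op, arg in code:
        if op == "PUSH":
            stack.append(arg)
        else:
            b, a = stack.pop(), stack.pop()
            stack.append({"ADD": a + b, "SUB": a - b,
                          "MUL": a * b, "DIV": a / b}[op])
    return stack.pop()

print(run(compile_expr("2 + 3 * 4")))  # prints 14
```

the same two-phase shape, analysis of the source followed by generation of code that the machine executes, underlies the real command-line compilers that a tool like olc wraps.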
this way, in later stages of the learning process, once students have acquired the basic knowledge of programming structures and of the programming language, they can start using compilers with more complex installation and configuration tasks and a higher number of options. the knowledge acquired in this first stage helps them to gain a better comprehension of the use and handling of compilers.

3.1 functionality

olc's interface is simple and intuitive. an interface defines the communication boundary between two entities, in this case between the student and the application. after identification, the student chooses the programming language to work with, from the language selection menu (figure 1).

figure 1: welcome screen, where the student chooses the programming language to work with.

once the programming language is chosen, the screen in figure 2 is displayed. the tab menu, located at the top of the screen, offers navigation through the editor, the files, the tutorials and other sections. the section "my files" shows a list of the user's files stored on the server for that language, sorted by last-modified date. students can manage their files using the available options; they can also upload files to the server and work with them. under the tab menu is the tool bar, whose options make it possible to work with the different files and the code. below, the screen is divided into two text areas:
• one for the code, at the top, where students write the code of their programs.
• one for the results, at the bottom, read-only, which gives information about the actions performed on the files.
local files are shown as a list with a colour code used to identify them (figure 3).
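the "my files" view described above, a list of the user's server-side files ordered by last-modified date, could be sketched as follows (the function name and return format are illustrative assumptions, not olc's actual code):

```python
# sketch of the "my files" listing: the user's files stored on the
# server for one language, newest first. names are illustrative,
# not taken from olc.
import os
import time

def list_user_files(directory):
    """return (filename, 'YYYY-mm-dd HH:MM') pairs, newest first."""
    entries = [(e.name, e.stat().st_mtime)
               for e in os.scandir(directory) if e.is_file()]
    entries.sort(key=lambda pair: pair[1], reverse=True)  # last modified first
    return [(name, time.strftime("%Y-%m-%d %H:%M", time.localtime(mtime)))
            for name, mtime in entries]
```

when rendering the list, a colour code like the one in figure 3 could then be attached per file type.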
apart from the local files, the application includes a set of video tutorials. by using them, information can be transmitted in a more efficient and engaging way for students, and a learning process can be established around these videos (figure 4). the video tutorials have a clear purpose: to assist users in learning how to manage the compiler and to follow the practical classes. they are delivered through files embedded in olc's web page. this kind of material is of growing didactic interest, since students have the possibility of viewing again those chapters of the video where doubts may have arisen, as many times as necessary [5] and from any device [6] with an internet connection. on the server we can also find a set of local files with examples. after the lessons, teachers can propose practical exercises using the different examples found in the on-line folders, and during the teaching period they can check the results of students' work, directing them in their learning process.

figure 2: olc work screen, with the tab menu, the tool bar, and two text areas: one for the code and the other for the results.
figure 3: list of local files, with a colour code used to identify them.
figure 4: multimedia tutorials.

4 teaching proposal with olc

the subject "fundamentals of computer science" is taught in the first year of various engineering degrees, amongst them the degree in "chemical engineering" of the "school of industrial engineering" of the university of las palmas de gran canaria. in this degree, the subject is taught in the second term of the first academic year.
its aim is that students acquire basic knowledge of computer science in the theoretical classes and basic knowledge of programming in the practical classes. the theoretical hours are taught in the theory classrooms and the practical hours in the laboratory, a classroom equipped with computers, one for each student, connected to the internet. programming knowledge is necessary for certain tools that students will use in the following academic years, such as programmable calculators or software like matlab. teaching consists of three hours of theoretical lessons per week, using traditional methods, and one hour of practical lessons; this amounts to fifteen practical hours during the term.
• in the first five hours of practical classes, basic knowledge of programming, the tools to use and the c language are introduced. in the first of these five hours an introductory class on olc is taught. in the following four hours the lessons are taught through the video tutorials, which show how to use olc and ask the students to work with the examples proposed by the teachers. these first exercises consist of simple programs that students have to edit, modify and compile.
• in the following five hours, students work on the modification and creation of computer programs. the modifications proposed by the teachers consist of making changes in programs that use somewhat more complex structures than those of the previous phase. a group of ten exercises can be completed, two for each practical hour, of increasing difficulty.
• in the last five hours the students work on a practical exercise proposed by themselves. teachers must approve this proposed program. this will be the final exercise to be evaluated, and it should be made with a more powerful compiler.
At this stage of the learning process, students should already have command of the compiler and its different options. The independence acquired by students after these stages using OLC, and their confidence in understanding and using other compilers, should be noticeable. The assessment process takes into account both the supervision of their work with OLC and the final proposed exercise. With this approach, using the OLC compiler, the role of teachers is greatly redefined: they can dedicate time to motivating students about the subject being studied and to directing them in their studies. Besides, the supervision of their work and the evaluation become a continuous process that can be carried out completely on-line. The role of students also changes: using this kind of application gives them more autonomy than in previous academic courses.

5 Didactic perspective

The work developed has been presented to a group of five teachers [7] whose subjects deal with programming. All these subjects are included in the first academic year of several engineering degrees. Different opinions were gathered. The features the teachers took into account in giving their viewpoint on OLC were: the professional profile their students will have in the future; the use they will make of programming, both in the academic and the professional field; and the level of knowledge they must achieve during the academic course. To a lesser extent, they considered the weight of programming in the final evaluation and the number of hours they must dedicate to the practical part of the subject.

• Teacher one. He teaches "Basic Knowledge of Computer Science" in the first course of "Chemical Engineering". To him, the teaching proposal is the right one: students find many difficulties in understanding compilers in the initial period of learning programming, he thinks.
Furthermore, sometimes even the use of a computer is complex for them, although they learn quickly. Professionally, they are unlikely to need to write computer programs. Nevertheless, in the following academic years they will surely need to understand what variables are, how to use them in small computer programs, and how to use small functions created for Matlab. According to this teacher, in their senior thesis it is very probable that they will only need to manage statistical information, and perhaps make some update or modification to a small computer program in order to obtain some of this information.

• Teacher two. She is responsible for the practical part of the subject "Fundamentals of Computer Science", which belongs to the group of training complements in the degree in Industrial Organization Engineering. In her opinion, the teaching proposal with OLC is very well structured. Maybe the level of knowledge shown in the video tutorials is quite high, since generally the students who apply for this degree do not have a strong interest in programming. The number of students who choose the subject is low, and they try to pass it without paying attention to the tools they are using. Professionally, it is not probable that they will use these programming skills in the future; that is why its weight in the final evaluation is low. Still, the use of multimedia material could be proposed for other subjects, since these multimedia tools are ideal for teaching.

• Teacher three. He is the coordinator of the subject "Programming Methodology" in the first course of "Technical Computer Science Engineering". He thinks it is a good proposal. In his department they have thought about developing a similar tool; however, they had not considered multimedia tutorials, only help files. We should take advantage of these multimedia capacities.
His students will surely use their programming knowledge both at an academic level and in their professional life.

• Teacher four. She has taught "Fundamentals of Computers" in the first year of "Computer Science Systems Engineering". She finds the teaching proposal with OLC interesting. In fact, in "Fundamentals of Computers" she uses a program developed for students to create computer programs in assembly language, so that they do not have to use the machine compiler. According to her, her students are well trained in computer science and therefore do not need to begin with such a simple tool. However, there is a fundamental aspect of this tool that should be exploited: having production tools on the web. These undoubtedly mean a great advantage, she thinks. Proposals with more functions, accessible from the web, would be very interesting for her students.

• Teacher five. She teaches "Basic Knowledge of Computer Science" in the first course of "Public Works Technical Engineering", in all its three specialities: "Hydrology", "Civil Building", and "Urban Transport and Services". She thinks that the proposal is of no interest for what she needs, bearing in mind the number of practical lessons per week and the teaching content, which do not match. She believes that in the future students will not make much use of this programming knowledge, and it is quite complex to make them understand some concepts. We are in a period in which not all the students who reach university are equally trained in computer science, and only some of them have a computer at home. Maybe the practical lessons should be concentrated, removing programming knowledge from the contents of the subject. Some students have also given their opinion about the environment.
Their profiles range from computer science engineering students in their second year at university to professionals who have already finished the bachelor's degree in "Computer Science".

• Student one. He studies the 2nd year of the degree "Industrial Engineering" (basic knowledge). The possibility of making my programs without having to install any software on my computer (I have had problems trying to install some C compilers before) saves me time and allows me to work in the same conditions regardless of the place -laboratory or home-. Furthermore, with no knowledge of Linux, I could compile my first program on that operating system.

• Student two. He studies the 2nd year of the bachelor degree "Computer Science Engineering" (learning the C programming language). The use of OLC is ideal for learning a programming language. Firstly, it hides tiresome options that add nothing for a beginner, such as path specifications or debug options. Moreover, having the video tutorials available while using the editor is much handier than making programs in a conventional programming environment, where you have to read the tutorial on a web page or on paper. Finally, the operations I can carry out with the "my files" option are the ones a beginner needs (save, compile, indent, and download the .exe file), and they can be done in an easy way.

• Student three. He has finished the bachelor degree "Computer Science Engineering" and in a few weeks will defend his senior thesis (high level of programming language knowledge). I like the idea implemented: having the compiler on the web allows access from any computer with an Internet connection, and it is useful for compiling at a specific moment with no need to download and install a compiler. For students who just make a brief immersion in programming studies it is in fact ideal, but if you study computer science at a professional level you find limitations.
Apart from this, having to download the .exe file with the result of the compilation every time is not very useful, because the characteristic execute button of an IDE is handier.

• Student four. He still has three subjects left to finish the degree "Computer Science Engineering" (high level of programming language knowledge). It was no trouble for me to install the C compiler, and this tool is not the compiler I would have liked to use at the beginning. When I began to make programs in the C language I installed the compiler on my computer at home, so I did not use OLC at all. On the other hand, I cannot execute the programs made on-line, and this is something I do not like. When you compile a program the first thing you want to check is the result, and with OLC you must download the .exe file; I prefer to compile with my IDE. The great advantage would be to compile and execute on-line.

• Student five. He has finished the bachelor degree "Computer Science Engineering" and is developing his senior thesis (high level of programming language knowledge). Advantages: easy to use; you do not have to install any software, just an Internet connection is necessary; you can work from anywhere: home, laboratories, any place with an Internet connection. Disadvantages: having to download the .exe file; just a few debug options; I do not like that my programs are on the web, for security reasons. In summary: it is interesting at the initial stage of programming, or for easy programs on computers where you do not have a compiler installed, but apart from that it does not offer much more. It could be more attractive if you could execute your programs on-line.

6 Conclusions

This work proposes an interesting initiative in the field of e-learning: to deliver interactive applications on-line for didactic use [8].
A simple compiler accessible from the web provides students with an easy way to learn in the initial stage of programming studies. OLC makes use of multimedia contents to support teachers in the teaching process; these contents show how to work with the on-line compiler and with the programming language [9]. There are further advantages, such as placing the compiler at students' and teachers' disposal [10] through the web, with the possibility of using it at any time and from any computer with an Internet connection. Thus barriers of time and space are eliminated: students have access to the compiler, the contents and the files with which they have been working. Besides, there is no need to save the work on storage devices or to install software. Remote virtual laboratories increase efficiency [11] and reduce the costs of the resources used in learning [12]. OLC cannot compete with the powerful traditional IDEs, due to its limitations for expert software development, but it undoubtedly fills the void that exists in teaching programming at the initial stage. This experience has already passed from the development phase to the experimental one. At present we are surveying the subjects that include programming language lessons and for which the use of OLC could be suitable. As with many other initiatives, promoting its acceptance and achieving its use depends on the programming teachers, who have already valued the application positively and favourably. In their opinion, introducing students to the field of programming is hard in the initial stage, not only because of the complexity of the control and data structures and the programming language to be learned, but also because of the complexity of the tools they need to use, such as compilers, bearing in mind that students only need a reduced number of functions in the initial period of their programming studies.
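As a rough illustration of the kind of service described above, the actions the editor exposes (save, compile, indent, download the .exe file) could be mapped onto server-side commands along these lines. This is a hypothetical sketch: the function name, file names and exact command lines are assumptions, not the actual OLC implementation.

```python
import shlex

# Hypothetical mapping of OLC-style editor actions onto server-side
# command lines; illustrative only, not the real OLC back end.
def build_command(action: str, filename: str) -> list[str]:
    base = filename.rsplit(".", 1)[0]
    if action == "compile":
        # compile the student's C source and produce the .exe to download
        return ["gcc", filename, "-o", base + ".exe"]
    if action == "indent":
        # tidy up the source formatting with the Unix indent tool
        return ["indent", filename]
    raise ValueError(f"unsupported action: {action}")

print(shlex.join(build_command("compile", "hello.c")))
```

The server would run such a command, then return the resulting binary to the browser for download, which is exactly the step the students above would prefer replaced by on-line execution.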
Using this kind of application with multimedia contents allows teachers to dedicate time to activities which improve the quality of teaching: transmitting further motivation about the subject being studied and the results that can be obtained, as well as directing students closely in their studies [13]. In the near future we will continue this line of work. Our new projects involve the development of other web-accessible applications [14], used like OLC, in the e-learning field. Some of these applications will count on a key feature: interactivity [15]. Thus students will develop new working methods [16] in order to ensure participation [17].

7 Acknowledgment

This article is an extended version of our paper [18], published in the Proceedings of the 2nd International Conference on Virtual Learning - ICVL 2007 (M. Vlada, G. Albeanu, D.M. Popovici, eds). We wish to thank the program committee of ICVL 2007, which selected and recommended our extended work for publication in IJCCC.

Bibliography

[1] P. Brey, "Evaluating the social and cultural implications of the internet", volume 35, issue 3 (September 2005), ACM Press.

[2] C. McHugo, T. Hall, "Enhancing ICT learning with real-life problems", EMRC, University of Limerick.

[3] T. O'Reilly, "What is Web 2.0: Design patterns and business models for the next generation of software".

[4] A. Yong-Yeol, H. Seungyeop, K. Haewoon, M. Sue, J. Hawoong, "Semantic web and Web 2.0: Analysis of topological characteristics of huge online social networking services", Proceedings of the 16th International Conference on World Wide Web WWW'07, ACM Press.

[5] T. Hartsell, S. Chi-Yin Yuen, "Video streaming in online learning", AACE Journal, 14(1), pp. 31-43, Chesapeake, VA: AACE.

[6] L. Neal, "Predictions for 2006: E-learning experts map the road ahead", volume 2006, issue 1 (January 2006), ACM Press, New York.

[7] S. Anderson, "Working together to develop a professional learning community",
HERDSA 2002.

[8] G. Lautenbach, "Engaging with e-learning: Emerging epistemologies and increased pedagogic vision in higher education", IADIS International Conference e-Learning 2007.

[9] R. Rheeder, R. Diseko, G. Lautenbach, "The design of interactivity for a web based learning environment at a higher education institution", IADIS International Conference e-Learning 2007.

[10] H. Samaras, T. Giouvanakis, D. Bousiou, K. Tarabanis, "Towards a new generation of multimedia learning research", AACE Journal, 14(1), pp. 3-30, Chesapeake, VA: AACE.

[11] E.A. Lawson, W. Stackpole, "IT education, online education: Does a virtual networking laboratory result in similar student achievement and satisfaction?", Proceedings of the 7th Conference on Information Technology Education SIGITE'06 (October 2006), ACM Press.

[12] W. Nejdl, M. Wolpers, "European e-learning: Important research issues and application scenarios", L3S Research Center and University of Hannover.

[13] K. Fraser, "A blended learning approach to teaching 'Introduction to Multimedia', the E-Bit!", AISHE Conference 2006.

[14] A. Leene, "The MicroWeb: Using content in theory and practice", Microlearning 2006, Innsbruck, 7 June 2006.

[15] P. Valente, P. Sampaio, "Defining goals for the design of interactive multimedia documents", in Proceedings of World Conference on Educational Multimedia, Hypermedia and Telecommunications 2006 (pp. 955-962), Chesapeake, VA: AACE.

[16] N. Sala, "Web based learning environment: An example", in Proceedings of 2000 (pp. 807-808), Chesapeake, VA: AACE.

[17] C.
gráinne, "making sense of today’s technology-enhanced environment for learning: rethinking student and teacher roles" iadis international conference e-learning 2007 [18] cayetano guerra artal, maria dolores afonso suarez, idafen santana perez, ruben quesada lopez, "olc, on-line compiler to teach programming languages", in (m. vlada, g. albeanu, d.m. popovici, eds) proceedings of the 2nd international conference on virtual learning icvl 2007, october 26-28, 2007, constanta, romania, pp. 283-290, 2007. cayetano guerra artal, maria dolores afonso suarez idafen santana perez, ruben quesada lopez instituto universitario de sistemas inteligentes y aplicaciones numéricas en ingeniería iusiani universidad de las palmas de gran canaria departamento de informatica y sistemas edif. de informatica y matematicas. campus universitario de tafira. 35017. las palmas. spain. e-mail: cguerra@iusiani.ulpgc.es, mafonso@dis.ulpgc.es idafensp@gmail.com, rubenql@gmail.com received: november 4, 2007 cayetano guerra artal, graduated at the university of las palmas de gran canaria in computer science. he has been teaching multimedia for seven years at the computer science and systems department of the university of las palmas de gran canaria. he received his ph.d. in computer vision. his main research fields are computer vision, computer graphics, multimedia and e-learning. olc, on-line compiler to teach programming languages 79 maria dolores afonso suarez, graduated at the university of las palmas de gran canaria in computer science. she has worked in private enterprises developing software and as associated professor at the computer science and systems department of the university of las palmas de gran canaria. her main research fields are software engineering, multimedia, and e-learning. idafen santana pérez, graduated at the school of computer science of the university of las palmas de gran canaria. he is a pre-graduated student of the faculty of computer science of this university. 
His current research interests are in e-learning applications and natural language processing.

Ruben Quesada Lopez graduated at the School of Computer Science of the University of Las Palmas de Gran Canaria. He is an undergraduate student of the Faculty of Computer Science of this university. His current research interests are in e-learning applications and natural language processing.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 871-883

Natural Language Based On-Demand Service Composition

F.-C. Pop, M. Cremene, J.-Y. Tigli, S. Lavirotte, M. Riveill, M. Vaida

Florin-Claudiu Pop, Marcel Cremene, Mircea Vaida
Technical University of Cluj-Napoca, Faculty of Electronics, Telecommunications and IT, Cluj-Napoca, Romania
E-mail: {florin.pop, marcel.cremene, mircea.vaida}@com.utcluj.ro

Michel Riveill, Jean-Yves Tigli, Stéphane Lavirotte
Université de Nice Sophia Antipolis, Sophia Antipolis Cedex, France
E-mail: {riveill, tigli, lavirott}@unice.fr

Abstract: The widespread use of web services in the ubiquitous computing era, and the impossibility of predicting a priori all possible user needs, generate the necessity for on-demand service composition. Natural language is one of the easiest ways for a user to express what he expects from a service. Two main problems need to be solved in order to create a composite service that satisfies the user: a) retrieval of the relevant services and b) orchestration/composition of the selected services in order to fulfill the user request. We solve the first problem by using semantic concepts associated with the services, and we define a conceptual distance to measure the similarity between the user request and a service configuration. Retrieved services are composed based on aspect-oriented templates called aspects of assembly.
We have tested our application in an environment for pervasive computing called the Ubiquarium, where our system composes a service according to a user request described by a sentence. The implementation is based on the WComp middleware, which enables us to use regular web services as well as web services for devices.

Keywords: natural language, service composition, on-demand, middleware, templates

1 Introduction

Background. Since Web 2.0 marked its appearance as a concept in the fall of 2004 and introduced the principle of the internet as a platform [19], the complexity and diversity of this platform have grown together with the ever more enhanced features it provides to its users. Given that software in the internet era is delivered as a service, not as a product, and that there is no release cycle for services, it is the user who is in charge of finding a service and using it. In the near future, services will become more diverse and widespread as computers become ubiquitous. In the same way that the information across the web is structured, classified and then presented to a user who sends a natural language request to a search engine, a collection of applications should be assembled, classified and deployed, using the services that are found within a given context, based on a similar unrestricted language request coming from the user.

The problem. In a context where various services with different functionality are available, it is possible to compose new services on demand, based on a user request, only when the right selection of components is used. On-demand service composition involves two operations: service retrieval and service orchestration. Service retrieval refers to identifying the specific services that are addressed by the user request, or the closest functional match to the request.
The transition from a natural language request to a list of services is a challenge that becomes even more difficult when no restrictions are imposed on the request. Service orchestration, or service assembly, is the process of linking the retrieved services into a functional flow so that the user demand is fulfilled. Both problems are the subject of this paper; service retrieval was the focus of our research, while for service orchestration we used an existing approach.

Scenario. We consider the following scenario to illustrate the purpose of dynamic service composition based on natural language requests. A handicapped person lives in an intelligent house, surrounded by intelligent devices, sensors and actuators. Each device has its own inputs and outputs and is able to process different types of data, leading to a large number of possible functional combinations of those devices. The person in the intelligent house wants to use these devices by combining their functionality (e.g. linking the output of a sensor to the input of an actuator), but his disability prevents him from physically interacting with the devices, or he simply lacks the technical knowledge. Therefore, he expresses his need using natural language (either written or spoken): "I want to use my remote control on the wheel chair to turn off the light, change the channel on TV and play some music on the media center". Each device the user addresses through his request (remote control, light, TV, media center) provides a different service, with specific actions that can be used in various configurations. Finding a way to sort these configurations by relevance to the user's request is a key requirement for the imagined scenario. Also, when one of the devices the user wants is not present in the intelligent house, or has been replaced with an updated version, the system should adapt and assemble a service that is the closest match to the user's need.

Approach.
Dynamic service composition solves the problem of adaptation to different contexts and user preferences. Also, by composing services on demand, the learning curve required for the user to work with new configurations is reduced, as the user "gets what he wants" from the application. Existing systems for dynamic service composition based on natural language requests either provide a restricted natural language interface or do not offer support for adaptation to structural and behavioral changes of the service configuration. We start with an initial set of services that are discoverable across a network. The user requests a completely new service using an unrestricted natural language sentence. In order to find specific devices to satisfy the request, we use semantic concepts and define a conceptual distance between the request and a service configuration. Concepts are leaves of a lexical tree that is generated by deriving and generalizing a notion. Once the services that match the request are identified, aspect-oriented advice is used to connect the services, so that when a service disappears from the context or a new service becomes available, the service configuration adapts.

Outline. This paper is organized as follows: the next section examines the problems a dynamic service composition system should solve in order to be usable in the modern context of web services. Section 3 describes the solution we propose, including the principles that led to it, while Section 4 focuses on the design and implementation patterns we used, along with the test results. Section 5 examines some of the existing dynamic service composition approaches. The paper ends with conclusions and further research.

2 An overview of the problem

On-demand dynamic service composition based on natural language requests raises some challenges that need to be studied before a solution can be proposed.

Service retrieval.
The first problem we approach in this paper is finding and retrieving a particular service. The large variety of web services usable for composition needs to be classified in a way that makes it machine meaningful and semantically rich, so that a search can provide the best results. One web extension, called the semantic web [3], is focused on enabling better web service interoperation. The semantic web's purpose is to bring structure to the meaningful content of web pages, creating an environment where software agents roaming from page to page can readily carry out sophisticated tasks for users. One dialect of the DAML [14] family of semantic web markup languages was proposed in [17] for the markup of web services. This so-called semantic markup of web services creates a distributed knowledge base (KB) that provides a means for agents to populate their local KBs, so that they can reason about web services to perform automatic web service discovery, execution, composition and interoperation. But the semantic markup uses a narrow, predefined vocabulary, as identified in [16], which makes possible only the retrieval of those web services for which the vocabulary is known. Queries or requests from web services, or user requests, using a vocabulary other than the predetermined one are not suitable for finding or retrieving such a web service. Therefore, a narrow vocabulary for the semantic markup of web services is not appropriate in combination with natural language requests.

Service composition. The second problem that needs to be solved is the actual composition of the retrieved services. There are three types of systems for dynamic service composition, according to [12]. Template-based systems [18,21] use a service template to compose an application. They can handle complex interactions between components and allow some level of flexibility by choosing different sets of components.
The drawback of this approach is that they cannot compose applications for which templates are not available. Interface-based systems [7] allow the user to submit a set of inputs and outputs for the application he is requesting. These systems have a higher adaptability than the template-based systems, but certain applications cannot be represented as a set of inputs and outputs (e.g. an email sending service does not output any data). Logic-based systems [20,23] extend the interface-based approach by adding extra information to the interface information using first-order logic or linear logic. A user requests an application by submitting a first-order formula representing the logic that must be satisfied by the application. They are more adaptable than the template-based systems, since they do not require service templates, and they support more varieties of services than the interface-based systems. Their main disadvantage is that they are not extensible and are not suitable for a distributed environment.

Adaptation. The last, but most important, problem that needs to be solved by a dynamic service composition system is adaptation. We distinguish two types of changes that require adaptation [2]: structure changes and behavior changes. Structural adaptation consists of modifying the retrieved services while preserving the global behavior of the application. The behavior describes the sequence of operations to be executed to fulfill the user request. Structural changes are triggered when a retrieved service disappears from the context or is replaced by another (for example, an analog TV is replaced by a digital TV). Behavior changes are related to the user, who may decide that the current service no longer satisfies his need.

3 Proposed solution

Our dynamic service composition system was inspired by the user's need to interact with the intelligent devices that surround him.
This user-machine interaction should be as natural as human interaction, through unrestricted natural language. Intelligent devices are entities that provide services to the user and have networking capabilities. Roughly, there are two types of intelligent devices: basic devices, which are simple service producers (e.g. a light that offers the illuminating service, a TV that offers the tuning service that allows changing the TV channels), and controller devices, which can consume services offered by other devices, acting as interfaces for a composite service (e.g. a mobile phone, an ultra-mobile PC, a netbook). Hybrid devices that implement both functionalities can also be imagined. The intelligent devices are connected to form an ad-hoc network inside the intelligent house, which they use to exchange information. This means that the devices can appear and disappear from the network structure on the fly: a device can auto-configure when it joins the network and then leave the network without notice. Network and device management is a task for the middleware that runs the intelligent house.

3.1 Semantic descriptions for web services

WSDL [10] is intended for the functional description of web services, and the semantic web markup is limited to a narrow vocabulary, which is not suited to natural language requests. A lexical tree [16] would add too much semantic information to a service and would not be suited to embedded devices. To overcome these limitations, we propose the use of general notions, called concepts, to describe the utility of a service. A service is not entirely identified by a single concept, but by an infinite number of concepts that are determined through the generalization of a notion. This notion serves as a semantic description for an intelligent device that offers a service. We use the television notion to describe a TV, for example.
Through generalization we find that both the television and electronic equipment concepts refer to the same device. To increase precision, a lexical analysis of the service description and the user request is also conducted by the composition system. This way, the service description suffers little or no modification due to the extra semantic information.

3.2 Linguistic processing

We consider the scenario where the user interacts with appliances and expresses his need through a sentence: "I want to use my phone to turn off the light, turn on the TV and play some music on HiFi". In order to retrieve the services required to satisfy the user need, the request goes through a linguistic processing module, responsible for:

• text segmentation, required to separate the words in the phrase (e.g. switch off the light is transformed into switch, off, the, light);
• removing stop words that are considered irrelevant (e.g. the, to, and);
• stemming (e.g. lights is transformed into light, using becomes use);
• spell-checking, to correct the misspelled words and the words "damaged" during stemming.

The output text segments for the user request in the considered scenario are: want, use, phone, switch, light, turn, tv, play, music, hifi.

3.3 The graph of concepts

The text segments, together with the service descriptions, are nodes in a graph, called the graph of concepts. The arcs in this graph connect each text segment to each service description. The weight of each arc represents the conceptual distance between the text segment and the service description. We introduced the conceptual distance to measure the relationship between two notions. Measures of semantic similarity or relatedness are found in the linguistic processing literature. According to the study published in [6], Jiang and Conrath's measure proved to be the best for practical usability.
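The linguistic processing steps of section 3.2 can be sketched as follows. This is an illustrative assumption, not the authors' implementation: the stop-word list and the crude suffix rules stand in for a real stop-word lexicon and stemmer, and spell-checking is omitted.

```python
import re

# Illustrative stop-word list (not the one used by the authors' module)
STOP_WORDS = {"i", "to", "my", "the", "on", "and", "some", "off"}

def naive_stem(word):
    # crude suffix stripping standing in for a real stemmer
    for suffix in ("ing", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word

def preprocess(request):
    segments = re.findall(r"[a-z]+", request.lower())        # segmentation
    segments = [w for w in segments if w not in STOP_WORDS]  # stop words
    return [naive_stem(w) for w in segments]                 # stemming

print(preprocess("I want to use my phone to turn off the light, "
                 "turn on the TV and play some music on HiFi"))
```

The resulting segments (want, use, phone, turn, light, turn, tv, play, music, hifi) would then become the text-segment nodes of the graph of concepts described in section 3.3.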
all compared distances are based on the information content of the lexical terms (the probability of encountering an instance). while this information may be valuable for other applications, it is of less importance for service composition, and it adds to the complexity of the implementation, therefore making it slower. our approach, the conceptual distance, is faster and less complex than jiang and conrath's measure, while the latter is more accurate. the conceptual distance is a numerical evaluation of how accurately two notions refer to the same concept. for example, the words phone and telephone describe the same concept, a communication device, therefore the conceptual distance is null. on the other hand, the words phone and electronic equipment can describe the same concept, a communication device, but one of them is more general, therefore it can address more concepts, which leads to a non-null distance between these words. figure 1 shows an example of a graph of concepts where the text segments are tv, light, hifi, phone and the service semantic descriptions are television, light, dvd, hifi, mobile phone and pda. figure 1.a. represents all the distances and figure 1.b. represents only the arcs with minimum conceptual distance.

figure 1: the graph of concepts

3.4 knowledge structure

in order to evaluate the conceptual distance we need to find a way to classify the lexical basis of the english language. we used for this purpose a specialized dictionary called wordnet [11]. wordnet groups nouns, verbs, adjectives and adverbs in sets of synonyms, called synsets. each synset describes a different concept. different senses of a word are in different synsets. most synsets are connected to other synsets via a number of semantic relations. for example, the semantic relations for nouns include:
• hypernyms: y is a hypernym of x if every x is a (kind of) y (phone is a hypernym of mobile phone);
• hyponyms: y is a hyponym of x if every y is a (kind of) x (mobile phone is a hyponym of phone);
• coordinate terms: y is a coordinate term of x if x and y share a hypernym (mobile phone is a coordinate term of cellular phone, and cellular phone is a coordinate term of mobile phone);
• holonyms: y is a holonym of x if x is a part of y (mobile phone is a holonym of transmitter);
• meronyms: y is a meronym of x if y is a part of x (transmitter is a meronym of mobile phone).
while semantic relations apply to all members of a synset because they share the same meaning, words can also be connected to other words through lexical relations, including antonyms and derivationally related forms. both nouns and verbs are organized into hierarchies, defined by hypernym or is-a relationships. for example, the hierarchy for mobile phone is:
• cellular telephone, cellular phone, cellphone, cell, mobile phone
• radiotelephone, radiophone, wireless telephone
• telephone, phone, telephone set
• electronic equipment
• equipment
the words at the same level in the hierarchy are synonyms of each other.

3.5 the concept hierarchy

the algorithm that evaluates the conceptual distance uses the wordnet lexicon to create concept hierarchies. a concept hierarchy is generated in 4 steps:
1. find the synset that contains the concept for which the hierarchy is generated. each word in the synset becomes a root for a tree in the concept hierarchy.
2. for each tree root, find the synsets that are in a relationship with the root's synset. each word in a related synset becomes a leaf for the tree, on the next level in the hierarchy, branching from the root.
3. for each word on the current level in the hierarchy, find the synsets related to the word's synset and add the words in the found synsets as leaves for the tree on the next level.
4.
repeat step 3 until the hierarchy is big enough so that the degree of generalization for the notion for which the hierarchy is built corresponds to an accepted accuracy that produces the best results. the bigger the hierarchy, the longer it takes to generate it, but the smaller the hierarchy, the more confusion can occur among concepts. the hierarchy for the notion mobile phone is shown in figure 2. the roots of the trees are part of the same synset, and each level in a tree represents the words from the synsets that are related to the word they are branching from.

figure 2: the concept hierarchy for the notion mobile phone

3.6 the conceptual distance

in order to evaluate the conceptual distance for two notions, a concept hierarchy is built for each notion. then, the conceptual distance is calculated as follows:
• the minimum difference of levels between the common node of the 2 hierarchies and the node that represents the notion the hierarchy is built for, if such a common node exists;
• the maximum number of levels a hierarchy can have, if there is no common notion among the two.
examples:
• d(mobile phone, cell) = 0
• d(radiotelephone, radiophone) = 0
• d(mobile phone, radiophone) = 1
• d(mobile phone, telephone) = 2

3.7 service retrieval

finding and retrieving the services closest to the user request reduces to identifying the couples (text segment, service description) in the graph of concepts that have a minimum sum of conceptual distances. this makes it easier to take advantage of a service that only partially matches a request. in order to find the mentioned couples, we need to apply two transformations to the graph of concepts:
• finding the sub-graph that has the minimum distance path that includes all the nodes. after this transformation each service description will be connected to 2 text segments.
• for each triplet (service description, text segment 1, text segment 2), remove the arc that has the maximum weight.
in order to find the minimum weight sub-graph, we use kruskal's algorithm [15], which calculates the minimum spanning tree (mst). the nodes containing the service descriptions that result after the two transformations are applied represent the services used to generate the composite service requested by the user. the transformed sub-graph for the graph in figure 1 contains the nodes that are connected with the thick continuous line.

3.8 service composition

we used a template-based service composition system because of its capability to handle complex interactions between components and the flexibility of choosing different sets of components. the system we used, called aspects of assembly (aoa) [9], is part of the wcomp [8] middleware for ubiquitous computing and, besides the benefits that derive from the fact that it is template-based, also offers support for auto-adaptation. these templates can be automatically selected either by the service composition system when satisfying a user request, or triggered by context changes in a self-adaptive process, and composed by a weaver with logical merging of high-level specifications. the result of the weaver is projected in terms of pure elementary modifications (pems): add and remove components, link and unlink ports. the aoa architecture consists of an extended model of aspect oriented programming (aop) for adaptation advices and of a weaving process with logical merging. an aoa template is structured as an aspect with a list of components involved in composition (called pointcut) and adaptation advice (a description of the architectural reconfigurations), which is specified using a domain specific language (dsl). we will examine some aoa templates and the composition process in detail in the next section.
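the retrieval step (sections 3.3 and 3.7) can be sketched as follows: build the bipartite graph of concepts, compute its minimum spanning tree with kruskal's algorithm, then keep only the lightest arc incident to each service. the conceptual distances below are invented for illustration; in the real system they come from the wordnet-based evaluation of section 3.6:

```python
def kruskal(nodes, edges):
    # classic kruskal mst with union-find; edges are (weight, u, v) tuples
    parent = {n: n for n in nodes}
    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]   # path halving
            n = parent[n]
        return n
    mst = []
    for w, u, v in sorted(edges):
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            mst.append((w, u, v))
    return mst

def retrieve(segments, services, dist):
    # bipartite graph of concepts: every text segment linked to every service
    nodes = [("seg", t) for t in segments] + [("svc", s) for s in services]
    edges = [(dist(t, s), ("seg", t), ("svc", s))
             for t in segments for s in services]
    mst = kruskal(nodes, edges)
    # second transformation: keep only the lightest arc at each service
    best = {}
    for w, u, v in mst:
        t, s = u[1], v[1]        # u is always a segment, v always a service
        if s not in best or w < best[s][0]:
            best[s] = (w, t)
    return {s: t for s, (w, t) in best.items()}

# invented toy distances: 0 = same notion, small = related, 6 = unrelated cap
RELATED = {("tv", "television"): 1, ("phone", "mobile phone"): 1,
           ("phone", "pda"): 2, ("tv", "dvd"): 2}

def dist(seg, svc):
    return 0 if seg == svc else RELATED.get((seg, svc), 6)

print(retrieve(["tv", "light", "hifi", "phone"],
               ["television", "light", "dvd", "hifi", "mobile phone", "pda"],
               dist))
```

with these toy distances the retrieval reproduces the matching of figure 1: each service ends up paired with its conceptually closest text segment.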
4 implementation and results

we used the wcomp [8] platform for ubiquitous computing as the middleware of our intelligent house. wcomp uses the upnp protocol to achieve device interconnectivity and interoperability. each upnp device has a software proxy that acts like a software component. using this proxy, we can treat web services for devices similarly to the ui components of a gui designer. we added some meta-data to the upnp service description of each device to serve as semantic description. using wcomp, we have simulated the following devices/services: tv set, described by the television notion; dvd recorder, described by the dvd notion; mobile phone, described by the mobile phone notion; pda, described by the pda notion; hifi, described by the hifi notion; lighting system, described by the light notion. the interactions between these components were specified using aoa templates. the following is an example of such a template, used to connect the mobile phone to the tv:

pointcut
  inputdev := /mobilephone.*/
  outputdev := /television.*/
advice keytochannel(inputdev, outputdev):
  input.^key_pressed -> ( output.set_channel ; call )

the first 3 lines describe (defining the pointcut as in aspect programming), using filters in the awk language, the components involved in the interaction: a mobile phone (inputdev) and a television (outputdev). the filters of type /instancename.*/ will find components that have their name prefixed by instancename. line 4 declares a composition schema that uses the previously described components. lines 4-5 specify the composition mechanics: call the tv's set_channel method when the mobile phone fires the key_pressed event. the service composition system implements a upnp device that offers the service of designing composite services for the user, in order to fit seamlessly with the wcomp middleware.
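the /instancename.*/ pointcut filters can be resolved against the component instances registered in the middleware with ordinary start-anchored regular expressions; a minimal sketch, with instance names invented for the example:

```python
import re

# resolve an awk-style pointcut filter /pattern/ against registered component
# instance names; re.match anchors at the start, matching the prefix semantics
# of /instancename.*/ filters. the instance names below are invented.

def match_pointcut(pattern, instances):
    rx = re.compile(pattern)
    return [name for name in instances if rx.match(name)]

instances = ["mobilephone_1", "television_livingroom",
             "television_kitchen", "pda_0"]

print(match_pointcut(r"mobilephone.*", instances))   # ['mobilephone_1']
print(match_pointcut(r"television.*", instances))
```

a filter may thus select several instances at once (here both televisions), which is what lets one template be applied to every matching component in the assembly.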
requests from the user are captured by a wcomp assembly of components, and then sent using the upnp protocol to the service designer, along with a description of the context where the devices are located. the service designer queries the devices for service descriptions (semantic meta-data), and then finds only those services that are relevant to the user request. instances of the devices that provide the named services are interconnected based on the rules described in the aoa templates.

scenario 1. "i want to use my phone to turn off the light, turn on the tv and play some music on hifi". this phrase contains many words that are irrelevant to the service composition system, but the relevant words are identical (except tv) to the service semantic descriptions. irrelevant words have the effect of increasing the time required to process the graph of concepts. all the relevant services are identified and then composed.

figure 3: the dynamically composed service for scenario 1

scenario 2. "use pda for broadcasting". this user request is challenging for any composition system because it doesn't address the tv directly, but through the abstract concept of broadcasting. due to the use of the specialized dictionary, the tv is found and then connected to the pda.

figure 4: the dynamically composed service for scenario 2

the wordnet dictionary (a free download) is installed on a local machine, thus the word search is fast. the main complexity of the algorithm is given by the conceptual distance computation. the time necessary to compute the distance between two concepts (similarity), based on the english dictionary, was about 6 ms (on a dell latitude 830 laptop, dual core 2.2 ghz cpu, 2 gb ram) with the fastest algorithm that we have found (a wordnet similarity implementation, the java package edu.sussex.nlp.jws.jiangandconrath).
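the conceptual distance of sections 3.5-3.6 can be sketched with a toy stand-in for wordnet; only the mobile phone hypernym chain from the paper is modelled, the hierarchy is built along hypernym links only, and the distance rule (minimum over the common nodes of the larger of their two levels) is our reading of section 3.6, not the authors' code:

```python
# toy stand-in for wordnet: each synset is a set of synonyms with a hypernym
# link; only the mobile-phone chain quoted in the paper is modelled.
SYNSETS = [
    {"cellular telephone", "cellular phone", "cellphone", "cell", "mobile phone"},
    {"radiotelephone", "radiophone", "wireless telephone"},
    {"telephone", "phone", "telephone set"},
    {"electronic equipment"},
    {"equipment"},
]
HYPERNYM = {0: 1, 1: 2, 2: 3, 3: 4}   # synset index -> its hypernym's index
MAX_DEPTH = len(SYNSETS)              # cap when no common concept exists

def hierarchy(notion):
    # level 0 = the notion's own synset, level k = its k-th hypernym synset
    idx = next(i for i, s in enumerate(SYNSETS) if notion in s)
    levels, k = {}, 0
    while True:
        for word in SYNSETS[idx]:
            levels.setdefault(word, k)
        if idx not in HYPERNYM:
            return levels
        idx, k = HYPERNYM[idx], k + 1

def conceptual_distance(a, b):
    ha, hb = hierarchy(a), hierarchy(b)
    common = set(ha) & set(hb)
    if not common:
        return MAX_DEPTH              # no shared concept among the two
    return min(max(ha[w], hb[w]) for w in common)

print(conceptual_distance("mobile phone", "cell"))        # 0
print(conceptual_distance("mobile phone", "radiophone"))  # 1
print(conceptual_distance("mobile phone", "telephone"))   # 2
```

this reproduces the example values listed in section 3.6; a real implementation would branch over all wordnet relations, as described in section 3.5, rather than over hypernyms alone.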
thus, if we have m user concepts and n service concepts, we need a time equal to (m × n / 2) × 6 ms. for instance, we can create the concept graph for 10 user concepts and 64 services in about 2 seconds. the aoa application is also very fast (less than 1 sec. for up to 350 components). in this section we have shown that we are able to create, in practice, new services, on-demand, using real devices, by applying the patterns available for the selected set of services. the scenarios discussed above were tested in a dedicated environment for ambient computing, called ubiquarium [24] (some of the services correspond to real devices, while others are virtual).

5 related work

composing web services on the basis of natural language requests. the solution described in [4] and [5] assumes that the user requests are expressed with a controlled subset (a narrow vocabulary) of natural language. the sentence that represents the user's request is transformed into a flow model using templates (e.g. if . . . then . . . else, when . . . do). verbs are used to identify the action and its parameters. each available service is paired with a well-defined set of keywords. owl-s annotations are used to provide operation semantics and an ontological classification of web services. the operations act as nodes of a directed acyclic graph and the relations among their iopes (inputs, outputs, preconditions and effects) establish arcs. the graph is translated into an executable service at invocation time. the example the authors use to support the proposed solution uses the phrase "if there is any cinema showing "big fish" in turin then send a sms to dario containing "let's go to the movies tonight!"". an if . . . then template is used to identify the flow model. in the next step, called context focus, the service types are identified: cinema, sms.
the verb (send) triggers the parameter retrieval stage, where iotype recognizers based on format (date, time, telephone numbers, etc.) and values (city names, etc.) are used to extract the action's parameters. this solution establishes a synergy between the semantic service descriptions and the natural language interpretation of user requests, through a common ontology and a consistent lexical vocabulary. therefore it can't be used in active environments where new components that act as black-boxes appear and disappear from the context dynamically. also, the use of a controlled subset of natural language makes it non-intuitive for the user, as he is restricted to the use of templates when expressing a request.

semantics-based dynamic service composition. papers [12] and [13] propose the cosmos model and the segsec platform for dynamic service composition. their idea is to transform the semantics of the user request into a semantic graph. nodes in the semantic graph represent operations, inputs, outputs and properties of a component, as well as their data types and concepts. arcs (labeled links) represent the relationships among the nodes. concepts, entities representing abstract ideas or actions, are used to annotate the semantics of the operations, inputs, outputs and properties of components. the user request is parsed and the components addressed by the user form a workflow. the example in [13] uses the phrase "print directions from home to restaurant". the semantic graph contains the predicate (print), the target of the action (direction) and the parameters (home, restaurant). the workflow, containing the retrieved components, is executed as soon as it satisfies the user request. this analysis takes place in a step called semantic matching and consists of a test that verifies that all the links that appear in the user request also appear in the graph that models the workflow.
the authors of [13] admit that their solution is not suited for environments where a large number of components are deployed. the platform lacks the feature of providing a solution in the case where the workflow doesn't satisfy the user's request. if the generated workflow doesn't match the user request exactly, then the dynamic service composition fails. also, the ability of the implementation to discover certain components is questionable, because it is limited to working with a narrow set of keywords and it lacks a vocabulary.

web service with associated lexical tree. the invention claimed by alcatel [16] relates to a method to mark up a web service in order to allow finding and retrieving said service via a natural language request. a lexical tree, built by deriving the service description and finding synonyms and related forms of the derived keywords, is associated with each service. finding a service based on the user request reduces to comparing the natural language query to the lexical tree of each web service. this method of retrieving a web service proves to be the most appropriate when dealing with natural language requests. the invention however doesn't exploit the full potential of this finding, as it lacks service composition.

6 conclusion

this paper proposes a new method for assembling services on demand, starting from the user request expressed in natural language. we use a semantic analysis of the user request in order to identify the services described by concepts that are related to concepts from the user request. retrieved services are then composed, based on composition patterns, called aoa (aspects of assembly). the use of patterns, which assures that the new service is always valid, compensates for the ambiguity of the natural language. another important advantage of the aoa patterns, compared to other existing pattern-based approaches, is the fact that these patterns may be superposed by composition.
thus, a large number of combinations are possible using a given set of patterns. additionally, for a given set of services, the aoa mechanism applies only the patterns that lead to valid services. the solution was implemented on the wcomp platform and tested in a dedicated ambient computing environment, called ubiquarium, using real and virtual intelligent devices/services. the service composition is user driven, by natural language (voice), and allows the user to get the service on-demand. from this point of view, our solution is less restrictive than the other solutions described in the related work section. an important advantage of our solution is the reuse of the free wordnet dictionary, which acts as an ontology. due to this, we can greatly relax the limitations on the natural language imposed by solutions where an ontology (usually restricted) must be created by the developer. the creation of a rich ontology is a very costly task, and our solution succeeds in avoiding it by reusing wordnet. this choice has another important advantage: it solves the problem of dealing with different ontologies and does not need to impose a common ontology (the only requirement is to use english). a particular aspect of our proposal is the mixed approach: semantic and pattern-based. this approach combines the advantages of both: thanks to composition patterns, it allows us to build complex composite services, which are always valid and functional. with other approaches (interface, logic, or semantic based) that do not use patterns/templates, it is very difficult to create complex architectures that are valid and work correctly. as future work, we intend to extend our solution for dynamic service adaptation (at runtime); this should be feasible because wcomp was designed for dynamic service reconfiguration in pervasive environments.
acknowledgments

this work was supported by the econet project code 18826ym and the romanian national projects pncdi ii codes 1062 and 1083, financed by uefiscsu. thanks to the other members of the rainbow team for fruitful discussions and feedback: vincent hourdin, daniel cheung-foo-wo, eric callegari.

bibliography

[1] a. v. aho, b. w. kernighan, p. j. weinberger, the awk programming language, addison-wesley, 1988.
[2] anastasopoulos, m.; klus, h.; koch, j.; niebuhr, d.; werkman, e., doami: a middleware platform facilitating (re-)configuration in ubiquitous systems, in system support for ubiquitous computing workshop, at the 8th annual conference on ubiquitous computing (ubicomp 2006), sep 2006.
[3] berners-lee, t.; hendler, j.; lassila, o., the semantic web, scientific american magazine, may 17, 2001.
[4] bosca, a.; ferrato, a.; corno, f.; congiu, i.; valetto, g., composing web services on the basis of natural language requests, ieee international conference on web services (icws'05), pp. 817-818, 2005.
[5] bosca, a.; corno, f.; valetto, g.; maglione, r., on-the-fly construction of web services compositions from natural language requests, journal of software (jsw), issn 1796-217x, vol. 1, issue 1, pp. 53-63, july 2006.
[6] e. budanitsky, g. hirst, semantic distance in wordnet: an experimental, application-oriented evaluation of five measures, 2001.
[7] chandrasekaran, s.; madden, s.; ionescu, m., ninja paths: an architecture for composing services over wide area networks, cs262 class project writeup, uc berkeley, 2000.
[8] cheung-foo-wo, d.; tigli, j.-y.; lavirotte, s.; riveill, m., wcomp: a multi-design approach for prototyping applications using heterogeneous resources, in 17th ieee intern. workshop on rapid syst. prototyping, pp. 119-125, crete, 2006.
[9] cheung-foo-wo, d.; tigli, j.-y.; lavirotte, s.; riveill, m., self-adaptation of event-driven component-oriented middleware using aspects of assembly, in 5th international workshop on middleware for pervasive and ad-hoc computing (mpac), california, usa, nov 2007.
[10] christensen, e.; curbera, f.; meredith, g.; weerawarana, s., web services description language (wsdl) 1.1, website, 2001, http://www.w3.org/tr/wsdl
[11] cognitive science laboratory, princeton university, wordnet - a lexical database for the english language, website, 2006, http://wordnet.princeton.edu/
[12] fujii, k.; suda, t., component service model with semantics (cosmos): a new component model for dynamic service composition, saint-w '04: proceedings of the 2004 symposium on applications and the internet - workshops (saint 2004 workshops), washington, dc, usa: ieee computer society, 2004.
[13] fujii, k.; suda, t., semantics-based dynamic service composition, ieee journal on selected areas in communications, vol. 23(12), pp. 2361-2372, dec 2005.
[14] hendler, j.; mcguinness, d., the darpa agent markup language, ieee intelligent systems, vol. 15, no. 6, nov./dec. 2000, pp. 72-73.
[15] kruskal, j. b., on the shortest spanning subtree of a graph and the traveling salesman problem, proc. amer. math. soc., vol. 7, 1956.
[16] larvet, p., web service with associated lexical tree, european patent, ep1835417.
[17] mcilraith, s. a.; cao son, t.; zeng, h., semantic web services, ieee intelligent systems, vol. 16, no. 2, mar./apr. 2001, pp. 46-53.
[18] molina, a. j.; koo, h.-m.; ko, i.-y., a template-based mechanism for dynamic service composition based on context prediction in ubicomp applications, in proceedings of the international workshop on intelligent web based tools (iwbt'2007), 2007.
[19] o'reilly, t., what is web 2.0, website, 2005, http://www.oreillynet.com/pub/a/oreilly/tim/news/2005/09/30/what-is-web-20.html
[20] rao, j.; kungas, p.; matskin, m., logic-based web services composition: from service description to process model, proceedings of the ieee international conference on web services, p. 446, june 06-09, 2004.
[21] sirin, e.; parsia, b.; hendler, j., template-based composition of semantic web services, in aaai fall symposium on agents and the semantic web, 2004.
[22] upnp forum, website, 2008, http://www.upnp.org/
[23] wu, d.; parsia, b.; sirin, e.; hendler, j.; nau, d., automating daml-s web services composition using shop2, in proceedings of the 2nd international semantic web conference (iswc2003), sanibel island, florida, october 2003.
[24] hourdin, v.; cheung-foo-wo, d.; lavirotte, s.; tigli, j.-y., ubiquarium informatique: une plate-forme pour l'etude des equipements informatiques mobiles en environnement simule, in proceedings of 3-eme journees francophones mobilite et ubiquite (ubimob), paris, september 2006.

int j comput commun, issn 1841-9836, vol. 7 (2012), no. 3 (september), pp. 473-481

optimal bitstream adaptation for scalable video based on two-dimensional rate and quality models

j. hou, s. wan

junhui hou, shuai wan
northwestern polytechnical university
school of electronics and information
xi'an 710129, china
e-mail: houjunhuihn@gmail.com, swan@nwpu.edu.cn

abstract: in this paper, a two-dimensional (2d) rate model is proposed considering the joint impact of the spatial (i.e., the frame size) and snr (i.e., the quantization step) resolutions on the overall rate-distortion performance. a related 2d quality model is then proposed in terms of perceptual quality. the two proposed models are then applied to scalable video to address the problem of optimal bitstream adaptation. experimental results show that the proposed rate and quality models fit the actual data very well, with high coefficients of determination and small relative root mean square errors.
moreover, given the bandwidth constraint and the required display resolution of the end users, the optimal combination of snr and spatial layers that provides the highest perceptual quality can be achieved using the proposed models.
keywords: 2d rate model, 2d perceptual quality model, scalable video, bitstream adaptation.

1 introduction

recent multimedia applications feature various resolutions designed for a variety of devices with different computational and display capabilities. these devices range from cell phones and pdas with small screens and restricted processing power to high-end workstations with high-definition displays. the related video services or applications are connected to different types of networks with various bandwidth limitations and loss characteristics. a highly attractive approach to address this vast heterogeneity is known as scalable video, which allows for spatial, temporal, and snr scalabilities [1]. in scalable video coding (svc), the video signal can be encoded into a base layer (bl) and one or more enhancement layers (els), with each enhancement layer improving the resolution (either temporally or spatially) or the fidelity of the video sequence. as a result, certain parts of the scalable bitstream can be removed for adaptation to various capabilities of end users as well as varying network conditions. at the network proxy or gateway, a bitstream adaptor is usually employed to extract the bitstream to meet particular constraints, e.g., targeted bit-rates and/or spatial or temporal resolutions. for a given set of constraints, the solution can be a variety of resolution combinations, leading to different visual qualities.
the challenging problem of bitstream adaptation is therefore how to determine the combination of the spatial resolution (i.e., the frame size (s)), temporal resolution (i.e., the frame rate (t)) and snr (signal-to-noise ratio) resolution (i.e., the quantization step (q)) to be used for bitstream extraction under a given targeted bit-rate, so as to maximize the resulting quality. many efforts have been devoted to bitstream adaptation for scalable video. a basic, content-independent extractor is provided in the reference software of the joint scalable video model (jsvm) [2]. in [3], an alternative extraction method is proposed based on rate-distortion optimization. this technique utilizes the concept of quality layers and improves the performance of the jsvm basic extractor by arranging the priority of layers based on their contributions to the global improvement in quality. a more efficient method for extraction is proposed in [4], using an accurate and efficient estimation of the quality degradation resulting from discarding an arbitrary number of network abstraction layer (nal) units from multiple layers, taking drift into account. however, the methods in [3] and [4] are executed only within a single resolution, e.g., the snr plane. in [5], an effective method is proposed to quickly solve the problem of spatial resolution selection based on an analysis of the content information. however, the peak signal-to-noise ratio (psnr) is used as the distortion criterion, which does not correlate well with the perceptual quality, especially with regard to spatial scalability. in [6], two-dimensional (2d) rate and perceptual quality models in terms of the frame rate and the quantization step are built, and the two models are then applied to optimal bitstream extraction in svc. however, the spatial resolution is not considered in [6] and the parameters in the quality model are difficult to obtain.
in [7], the video quality under different spatial, temporal and snr combinations is quantitatively and perceptually assessed, based on which an efficient adaptation algorithm is proposed. however, there is a lack of a rate model to estimate the related bit-rate. on the other hand, performance improvement can be achieved by resorting to network-related technologies, such as using a priority mechanism [8] or self optimization of networked communications: [9] presents a model for self optimization of network communications in order to improve cluster performance by shortening the data transfer time. in this paper, 2d rate and quality models are proposed for optimal bitstream adaptation for scalable video under given bandwidth constraints and required display resolutions at the end user. assuming that the frame rate is determined, the two 2d models are applied to extract the bitstream so as to achieve the optimal combination of spatial and snr resolutions. the rest of the paper is organized as follows: section 2 presents the 2d rate and perceptual quality models considering the impact of spatial and snr resolutions. their application in constrained scalable video adaptation is introduced in section 3. section 4 presents the experimental results. section 5 concludes this paper and discusses future directions.

2 two-dimensional rate and perceptual quality models

in this section, the impact of the spatial and snr resolutions on the bit-rate and the perceptual quality is analyzed, based on which a 2d rate model and a 2d perceptual quality model are respectively derived.

2.1 two-dimensional rate model

considering snr and temporal scalabilities, we have proposed an analytical 2d rate model for h.264/svc [10]. in this paper, this model is extended to the spatial domain, where a product of a power function of the quantization step q and a power function of the spatial resolution index s is used, given as

r(q, s) = c · q^α · s^γ,    (1)

where α and γ are both content-dependent model parameters.
the values of α and γ characterize how fast the bit-rate reduces with the increase of q and how fast the bit-rate increases with the refinement of the spatial resolution, respectively. usually a sequence with richer texture has larger absolute values of α and γ. here s is computed by dividing the frame size of the current spatial resolution by the frame size of the lowest spatial resolution. in order to evaluate the model accuracy, sequences with qcif, cif and 4cif resolutions were encoded into 3 dyadic spatial layers using jsvm 9.19.7 [11]. each spatial layer contained 5 quality layers. the base quality layer in a lower spatial layer was used to perform inter-layer prediction to avoid drifting at the decoder. the gop (group of pictures) size was set to 1 to avoid the effect of temporal scalability. 120 frames were encoded for each sequence. the differences of the quantization parameter (qp) between adjacent quality layers and adjacent spatial layers were set to 5 and 6, respectively, following [12]. the qp of the base quality layer of the lowest spatial level was set to 38. the model parameters were obtained by minimizing the root mean square error (rmse) between the actual and predicted bit-rates. the actual values and those predicted using (1) are plotted in figure 1. it is clear that the proposed 2d rate model fits the actual data very well. table 1 gives the used parameters and the model accuracy in terms of the rrmse (rrmse = rmse/rmax, where rmax denotes the maximum bit-rate in the actual data) and the coefficient of determination (cod), defined as

cod = 1 − [ Σᵢ (xᵢ − x̂ᵢ)² ] / [ Σᵢ (xᵢ − x̄)² ],    (2)

where xᵢ and x̂ᵢ are the actual and the predicted values of the bit-rate, respectively, and x̄ is the mean of all actual bit-rates shown in figure 1.
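as a purely numerical illustration (not the authors' code), model (1) can be evaluated directly; the soccer parameters below are taken from table 1, where we read the c row as values in multiples of 10³:

```python
# evaluating the 2d rate model (1): r(q, s) = c * q**alpha * s**gamma,
# with the "soccer" parameters from table 1 (assuming c is listed there
# in units of 10^3, i.e. c = 9.67e3 for soccer).

def rate_kbps(q, s, c, alpha, gamma):
    # q: quantization step; s: frame-size ratio relative to the lowest layer
    # (s = 1, 4, 16 for qcif, cif and 4cif with dyadic spatial scalability)
    return c * q ** alpha * s ** gamma

soccer = dict(c=9.67e3, alpha=-1.276, gamma=0.970)

for s in (1, 4, 16):                 # qcif, cif, 4cif
    print(s, round(rate_kbps(32, s, **soccer), 1))
```

as the signs of the fitted exponents dictate, the predicted bit-rate falls as the quantization step grows (α < 0) and rises with the spatial resolution (γ > 0).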
it is once again demonstrated that the proposed rate model is very accurate in prediction, with high cod and small rrmse values for all tested sequences. specifically, the average cod and rrmse are 0.9892 and 2.81%, respectively. as expected, the "city" sequence, which has the richest texture among the tested sequences, has the largest absolute values of α and γ.

figure 1: actual bit-rates and predicted values using (1). [four panels: soccer, crew, city, ice; bit-rate (kb/s) versus quantization step, with actual data at qcif/cif/4cif and predictions from (1)]

table 1: the values of parameters in (1) and model accuracy

           soccer    crew      city      ice       ave.
c×10³      9.67      13.58     10.76     4.23
α          -1.276    -1.365    -1.462    -1.048
γ          0.970     0.965     1.078     0.756
cod        0.9958    0.9860    0.9792    0.9929    0.9892
rrmse      1.71%     3.31%     3.97%     2.27%     2.81%

2.2 two-dimensional quality model

it has been widely acknowledged that quality metrics such as the psnr and the mean square error (mse) do not correlate well with the perceptual quality. on the other hand, the subjective quality can be well captured by the mean opinion score (mos) and the video quality metric (vqm) [13], at the cost of high complexity in testing and computation. trading off between complexity and consistency with human perception, the structural similarity (ssim) [14] is used as the quality measure in this paper.
the ssim measures the structural similarity as well as the luminance and contrast similarity between two images block by block. in this paper, the ssim values have been measured for different combinations of spatial and snr resolutions, where the layers of lower spatial resolutions were upsampled to 4cif using the 6-tap filters provided by the jsvm. according to empirical observations, a logarithmic function of the spatial resolution index and the quantization step is used to model the perceptual quality over different spatial and snr resolutions, expressed as

qm_ssim(q, s) = a0 + a1 ln q + a2 ln s + a3 ln q ln s,   (3)

where a0, a1, a2 and a3 are all content-dependent model parameters. the second and third terms indicate the impact of the snr and the spatial resolution on perceptual quality, respectively, while the fourth term models their joint impact. the model parameters can be derived easily by minimizing the rmse between the actual and predicted values. the actual and predicted qualities are shown in figure 2. table 2 lists the used parameters and the model accuracy in terms of the cod and rrmse. it can be concluded from figure 2 and table 2 that the proposed 2d perceptual quality model is very accurate in prediction, with high cod and small rrmse values for all tested sequences.
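since (3) is linear in its parameters, it too fits with ordinary least squares over the design matrix [1, ln q, ln s, ln q·ln s]; a sketch under the same caveats as before (ours, illustrative):

```python
import numpy as np

def fit_quality_model(q, s, ssim):
    """Fit qm_ssim(q, s) = a0 + a1*ln q + a2*ln s + a3*ln q*ln s, eq. (3)."""
    lq, ls = np.log(q), np.log(s)
    A = np.column_stack([np.ones(len(lq)), lq, ls, lq * ls])
    a, _, _, _ = np.linalg.lstsq(A, np.asarray(ssim, dtype=float), rcond=None)
    return a  # [a0, a1, a2, a3]

def quality(a, q, s):
    """Perceptual quality predicted by eq. (3)."""
    return a[0] + a[1] * np.log(q) + a[2] * np.log(s) + a[3] * np.log(q) * np.log(s)
```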
figure 2: actual qualities and predicted values using (3). [four panels: soccer, crew, city, ice; qm_ssim versus quantization step, with actual data at qcif/cif/4cif and predictions from (3)]

table 2: the values of parameters in (3) and model accuracy

           soccer    crew      city      ice       ave.
a0         0.7709    0.9112    0.7268    0.9395
a1         -0.0614   -0.0569   -0.0791   -0.0193
a2         0.2292    0.1454    0.2918    0.0739
a3         -0.0393   -0.0251   -0.0444   -0.0141
cod        0.9871    0.9842    0.9754    0.9561    0.9757
rrmse      1.39%     1.11%     2.49%     0.8%      1.45%

3 optimal bitstream adaptation for scalable video using the proposed models

the proposed models are applied to constrained bitstream adaptation for scalable video. figure 3 provides a systematic view of the adaptation problem. for each video, a single full-resolution scalable bitstream is available at a server, and the bitstream is adapted at a network proxy or gateway according to the user channel conditions and viewing preferences (i.e., the displayed spatial resolution). when a user requests the video from the server, the adaptor (at the proxy) determines an appropriate bit-rate r_t for extraction based on the channel condition.
based on r_t and the user's viewing preference (embedded in the user profile and sent to the adaptor), the adaptor determines the optimal set of spatial and snr layers to extract, so as to provide the best perceptual quality. for a given targeted bit-rate r_t and required display spatial resolution, the adaptation problem can be formulated as the following constrained optimization problem:

determine q, s to maximize qm_ssim(q, s) subject to r(q, s) ≤ r_t and u(s)|_{s<S},   (4)

where r_t and S denote the targeted bit-rate and the required display spatial resolution index, respectively. u(s)|_{s<S} indicates that up-sampling is executed if the extracted spatial resolution is less than the required display spatial resolution. assume that both the spatial resolution and the quantization step may take any effective value. by setting r(q, s) = r_t, it can be obtained that

q = (r_t / (c s^γ))^{1/α},   (5)

which describes the feasible q for a given s that satisfies the rate constraint r_t. substituting (5) into (3) yields

qm_ssim(s) = −a3 γ (ln s)²/α + (a3 ln(r_t/c) + α a2 − a1 γ) ln s/α + a0 + a1 ln(r_t/c)/α.   (6)

equation (6) is the achievable quality at different spatial resolutions under the targeted bit-rate r_t. this function has a unique maximum, which can be derived by setting its first-order derivative with respect to s to zero. this yields

s = e^{(a3 ln(r_t/c) + α a2 − a1 γ) / (2 a3 γ)}.   (7)

for any given r_t and S, the optimal spatial resolution is obtained from (7); then (5) gives the optimal quantization step and (6) the corresponding maximal quality. the parameters of the rate model, i.e., c, α and γ, can be easily derived from the bit-rates corresponding to several different (q, s) combinations using least-squares fitting. the quality model parameters, i.e., a0, a1, a2 and a3, can likewise be derived by least-squares fitting at the encoder, and then embedded in the header field of the video stream.
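the closed-form solution of (5)-(7) is straightforward to implement. the sketch below (ours, not the authors' code) evaluates it for one parameter set:

```python
import numpy as np

def optimal_adaptation(rt, c, alpha, gamma, a):
    """Closed-form solution of the constrained problem (4):
    s* from eq. (7), q* from the rate constraint (5), quality from (3)."""
    a0, a1, a2, a3 = a
    ln_s = (a3 * np.log(rt / c) + alpha * a2 - a1 * gamma) / (2.0 * a3 * gamma)
    s = np.exp(ln_s)                                      # eq. (7)
    q = (rt / (c * s ** gamma)) ** (1.0 / alpha)          # eq. (5)
    qm = a0 + a1 * np.log(q) + a2 * np.log(s) + a3 * np.log(q) * np.log(s)
    return s, q, qm
```

for instance, with the "soccer" parameters from tables 1 and 2 and r_t = 1000 kb/s this gives s* ≈ 6.6, between cif (s = 4) and 4cif (s = 16), which illustrates why the dyadic case of section 4.2 has to round s* to a supported resolution.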
based on the simulations, only several bytes are required to represent those parameters, which is negligible compared to the actual video stream payload.

figure 3: constrained scalable video adaptation.

4 experimental results

experimental results are presented in this section to evaluate the performance of the proposed extraction method. first it is assumed that the spatial resolution can take any positive value; then the practical case where the spatial resolutions are discrete is considered.

4.1 optimal solutions assuming q and s take continuous values

assume that both the spatial resolution and the quantization step can take continuous values. figure 4 shows the optimal spatial resolution, quantization step and quality as functions of the targeted bit-rate r_t. as expected, as the targeted bit-rate increases, the optimal s increases while the optimal q decreases, and the achievable best quality continuously improves. notice that the optimal s increases more rapidly for the "city" sequence than for the other sequences because of its richer texture. the up-sampling introduces a more severe quality decrease than a coarser quantization step does. therefore, under the bit-rate constraint, a larger spatial resolution with a larger quantization step is the better choice.

4.2 optimal solutions under dyadic spatial resolution scalability

the h.264/svc includes three profiles [15], i.e., the "scalable baseline" profile, the "scalable high" profile, and the "scalable high intra" profile. while the latter two profiles support full spatial scalability, the scalable baseline profile imposes some constraints to enable simplified application scenarios.
for example, dyadic spatial scalability is provided in the baseline profile, where the scaling ratio of the width and height between adjacent spatial layers is equal to 2. from a practical point of view, it is interesting to see the optimal combination of the spatial resolution and quantization step for different targeted bit-rates under this svc structure.

figure 4: optimal quantization step, spatial resolution index and the corresponding quality versus the targeted bit-rate, assuming the quantization step and the spatial resolution to be continuous. [four panels: soccer, crew, city, ice; optimal s, optimal q and 100·qm_ssim versus targeted bit-rate (kb/s)]

to obtain the optimal solution for this svc structure, we first determine the optimal s using (7), and then choose the two candidate spatial resolutions immediately above and below this value. finally, we compute the quality corresponding to these two spatial resolutions using (6) and choose the spatial resolution that leads to the better quality. the experimental results are shown in figure 5. because the spatial resolution can only increase in discrete steps, the optimal quantization step does not decrease monotonically with the bit-rate. whenever the optimal s jumps to the next higher value, the optimal q first increases to meet the rate constraint, and then decreases while the optimal s is held constant, as the rate increases.
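the selection step described above can be sketched as follows. since the constrained quality (6) is concave in ln s, evaluating every dyadic candidate is equivalent to comparing the two neighbors of the continuous optimum; the code (ours, illustrative) does the former for simplicity:

```python
import numpy as np

def dyadic_adaptation(rt, c, alpha, gamma, a, candidates=(1.0, 4.0, 16.0)):
    """Pick the dyadic spatial resolution (qcif/cif/4cif -> s in {1, 4, 16})
    that maximizes the quality while meeting the rate constraint via eq. (5)."""
    a0, a1, a2, a3 = a
    best = None
    for s in candidates:
        q = (rt / (c * s ** gamma)) ** (1.0 / alpha)      # eq. (5)
        qm = a0 + a1 * np.log(q) + a2 * np.log(s) + a3 * np.log(q) * np.log(s)
        if best is None or qm > best[2]:
            best = (s, q, qm)
    return best
```

with the "soccer" parameters and r_t = 1000 kb/s, the continuous optimum s* ≈ 6.6 rounds down: cif (s = 4) yields a better modeled quality than 4cif under the same rate budget.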
consistent with the previous results in figure 4, for the "city" sequence with richer texture the optimal s is 16 (corresponding to 4cif) already at low bit-rates, whereas for the other sequences the optimal s stays at 4 (corresponding to cif) even at high bit-rates. in practice, the svc encoder with quality scalability does not allow the quantization step to change continuously. the finest granularity in quality scalability is a decrement of the qp by 1 with each additional quality layer, which means that the quantization step reduces by a factor of 2^{−1/6} per additional layer. in practice much coarser granularity is typically used, with a decrement of the qp by 3 to 6 [11]. when we limit the values of q to be discrete in addition to allowing only dyadic spatial resolutions, a rate constraint cannot always be met exactly. however, one may still obtain the optimal q and s for any given constraints using the proposed scheme by estimating the bit-rate and quality of each combination in the finite set of feasible values for q and s.

5 conclusions and future work

in this paper, a 2d rate model and a 2d quality model have been proposed, based on which a model-driven method for optimal bitstream adaptation has been developed. experimental results have demonstrated the accuracy of the two models. using the proposed extraction method, the optimal combination of quality and spatial layers can be determined, providing the highest perceptual quality for a given bandwidth constraint and required display resolution at the end user.

figure 5: optimal quantization step, spatial resolution index and the corresponding quality versus the targeted bit-rate, assuming that q varies continuously and the spatial resolution takes qcif/cif/4cif. [four panels: soccer, crew, city, ice; optimal s, optimal q and 100·qm_ssim versus targeted bit-rate (kb/s)]

future work may include an extension of the proposed models to three dimensions, taking temporal scalability into account. moreover, the proposed models can be applied to other applications, e.g., advanced multidimensional rate control for video coding.

acknowledgement

this work was supported by the national science foundation of china (60902052, 60902081), the doctoral fund of ministry of education of china (no. 20096102120032), and the npu foundation for fundamental research (jc201038).

bibliography

[1] h. schwarz, d. marpe, t. wiegand, overview of the scalable video coding extension of the h.264/avc standard, ieee trans. circuits syst. video technol., vol. 17, no. 9, pp. 1103-1120, 2007.

[2] j. reichel, h. schwarz, m. wien, joint scalable video model 11 (jsvm 11), joint video team, doc. jvt-x202, 2007.

[3] i. amonou, n. cammas, s. kervadec, s. pateux, optimized rate distortion extraction with quality layers in the scalable extension of h.264/avc, ieee trans. circuits syst. video technol., vol. 17, no. 9, pp. 1186-1193, 2007.

[4] ehsan maani, aggelos k.
katsaggelos, optimized bit extraction using distortion modeling in the scalable extension of h.264/avc, ieee trans. image process., vol. 18, no. 9, pp. 2022-2029, 2009.

[5] yu wang, lap-pui chau, kim-hui yap, spatial resolution decision in scalable bitstream extraction for network and receiver aware adaptation, proceedings of the 2008 ieee international conference on multimedia and expo, pp. 577-580, 2008.

[6] y. wang, z. ma, y.-f. qu, modeling rate and perceptual quality of scalable video as function of quantization and frame rate and its application in scalable video adaptation, proceedings of the ieee 17th packet video workshop, pp. 1-9, 2009.

[7] guangtao zhai, jianfei cai, weisi lin, xiaokang yang, wenjun zhang, three dimensional scalable video adaptation via user-end perceptual quality assessment, ieee trans. broadcasting, vol. 54, no. 3, pp. 719-728, 2008.

[8] a. rahim, z.s. khan, f.b. muhaya, m. sher, m.k. khan, information sharing in vehicular adhoc network, int. j. of computers, communications and control, 5(5):892-899, 2010.

[9] a. rusan, c.-m. amarandei, a new model for cluster communications optimization, int. j. of computers, communications and control, 5(5):910-918, 2010.

[10] junhui hou, shuai wan, fuzheng yang, frame rate adaptive rate model for video rate control, proceedings of the 2010 ieee international conference on multimedia communication, pp. 226-229, 2010.

[11] h.264 svc reference software (jsvm 9.19.7) and manual, cvs server, jvt, 2010 [online]. available: garcon.ient.rwth-aachen.de.

[12] xiang li, peter amon, andreas hutter, andré kaup, performance analysis of inter-layer prediction in scalable video coding extension of h.264/avc, ieee trans. broadcasting, vol. 57, no. 1, pp. 66-74, 2011.

[13] m. pinson, s. wolf, a new standardized method for objectively measuring video quality, ieee transactions on broadcasting, vol. 50, no. 3, pp. 312-322, 2004.
[14] z. wang, a. c. bovik, h. r. sheikh, e. p. simoncelli, image quality assessment: from error visibility to structural similarity, ieee trans. image process., vol. 13, no. 4, pp. 600-612, 2004.

[15] t. wiegand, g. j. sullivan, j. reichel, h. schwarz, m. wien, eds., amendment 3 to itu-t rec. h.264 (2005) | iso/iec 14496-10:2005, scalable video coding, 2007.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 799-812

discussion of the analysis of self-similar teletraffic with long-range dependence (lrd) at the network layer level

g. millán, h. kaschel, g. lefranc

ginno millán, héctor kaschel, universidad de santiago de chile, departamento de ingeniería eléctrica, avda. libertador bernardo o'higgins #3363, estación central, santiago, chile. e-mail: {ginno.millan,hector.kaschel}@usach.cl

gastón lefranc, pontificia universidad católica de valparaíso, escuela de ingeniería eléctrica, avda. brasil #2147, valparaíso, chile. e-mail: glefranc@ucv.cl

abstract: traffic streams, sources, and aggregated traffic flows often exhibit long-range-dependent (lrd) properties. this paper presents the theoretical foundations justifying that the behavior of traffic in a high-speed computer network can be modeled from a self-similar perspective by limiting the scope of analysis to the network layer, since the most relevant properties of self-similar processes remain consistent for use in the formulation of traffic models when performing this specific task.

keywords: long-range dependence, network layer, network traffic, self-similar process.

1 introduction

still rooted in the genesis of the design of present-day high-speed computer networks is the trend toward scalable development primarily supporting processing applications which, although requiring a reliable transport service, are not demanding in terms of other quality of service (qos) parameters such as delay, flow rate, latency, and loss rate.
it is a reality subordinated to financial justifications, unable to reflect either the behavior or the operation of present-day network environments, most of which are characterized not only by their scalability and support of value-added services and applications with high bandwidth and availability requirements, but also by their convergence, complementarity, and interoperability. on the other hand, sustained development in the fields of optical, nanometric, and quantum technologies has allowed the evolution of computer networks, providing them with the capacity needed to satisfy simultaneously the requirements of diverse traffic and creating the scenario for the appearance of new services and applications for which this characteristic is essential: services that involve real-time traffic and which, because of their nature, place highly demanding requirements on the available bandwidth. therefore, the new high-speed networks must be capable of providing a service that not only ensures the availability of the resources, but is also provided under quality conditions that are well defined, parameterizable, adaptable, and dynamic in their assignment, because the requirements of present-day real-time applications and services cannot be satisfied by the high-level protocols if the carrier networks do not offer the necessary guarantees. it is crucial, therefore, to make quantitative analyses that evaluate the service quality offered by the new technologies, leaving aside unsubstantiated bases, arguments, and assumptions.

copyright © 2006-2010 by ccc publications

the main problem that appears at the time of making a rigorous evaluation of the performance of a communications network is that of modeling the input traffic to the network.
in fact, for many authors traffic modeling is the most critical problem in evaluating the performance of communications networks, because the success of the analyses depends directly on how representative of reality the traffic models used are. historically, traffic modeling had its origin in conventional telephony systems and has been based almost exclusively on assumptions of independence between successive arrival times and of exponentially distributed durations in the use of the resources. concretely, the acceptance of both assumptions restricts the stochastic processes to a universe of poisson or markov processes. in that respect, their usefulness to network designers as well as to systems analysts for planning capacity and predicting performance is not questioned [1]. however, in a wide range of real-world cases it has been verified that the results predicted by queueing analysis differ significantly from the actually observed performance, and this marked discrepancy has its origin in the fact that traffic processes often present lrd on many or all time scales, while poisson or markov models, which are memoryless or show only short-range dependence (srd), capture traffic correlations only over much shorter time scales. as a result, they tend to produce highly optimistic performance forecasts, owing to the use of finite-variance distributions to characterize the periods with and without packet bursts.
in view of the above arguments, the following working hypothesis is proposed: "it is completely feasible to restrict the evolution of a statistically self-similar process to a well defined application setting without altering its nature and its more important properties, in that way highlighting the validity of its postulates and giving greater plausibility to its physical interpretation." in this context, plausibility refers to the action of conferring an admissible character, and therefore one worthy of consideration, on one or several parameters of an analytical model, whose interpretations do not constitute a mere mathematical idealization. this paper presents a detailed discussion of the theoretical bases justifying that the behavior of high-speed computer network traffic in the presence of long-range dependence can be modeled from a self-similar perspective, limiting the analysis setting to the network layer level, and stressing that all the most relevant properties of self-similar processes remain consistent for use in the formulation of traffic models when this distinction is made, since the concept is needed to describe the traffic registered in present-day computer network settings.

2 bibliographic discussion

since kleinrock's publication [2], later expanded in [3], which established the mathematical theory that governs packet-switched networks, the existence of temporal dependence in the behavior of the different types of data traffic flows has become an exciting field of research with countless discoveries, among which the huge impact and influence they have on the performance of queueing systems stands out.
the latter fact accounts for both the current existence and coexistence of a wide variety of input traffic models that show structures with rather complex correlations, which are applied in cases where the models of the communications systems under study allow adequate analytic handling. in any case these models, basically markovian, neglect the temporal correlations beyond a given temporal separation, even if the latter can be increased arbitrarily at the expense of complicating the models with additional parameters. since 1993 an increasing number of published studies have documented that the data traffic pattern is well modeled by self-similar processes in a wide range of real-world and network situations [1], with their top reference point and foundational work in the research of leland, willinger, taqqu, and wilson [4], presented originally at acm sigcomm '93 and then amended and extended in [5]. in spite of the existence of some previous papers that provide informal descriptions of this behavior, such as [6]-[8], and of an exception from mandelbrot himself [9], no one had put forward the idea of self-similarity applied to the analysis of data traffic as such, and that paper shattered the illusion that a simple queueing analysis, based on the assumption that the traffic follows a poisson distribution, can adequately model all network traffic [1], showing that ethernet traffic has a self-similar or fractal nature and therefore requires new modeling and analysis statements.
in this respect, the methodology followed by the authors involves a massive compilation of traffic samples from 1989 through 1992 from different ethernet lans of the research and engineering center of bellcore in morristown, usa [5], resulting in a detailed high-resolution temporal collection totaling more than 100 million packets with 10 µs precision, grouped in four sets of measures available in [10]; the application of a rigorous and exhaustive statistical analysis based on modeling the traffic sources using hyperbolic-tail distributions, in particular pareto's, comparing the results with the behavior of the traffic flow of the real traces; and the observation of the estimated value of the hurst parameter (h) for each of the four sets of traffic samples, expressed for processing as a series of ordered pairs composed of the arrival time and the size of the ethernet packet, as well as for each level of temporal traffic aggregation considered. a complete analysis of both the statement and the methodology followed by the authors is found in [11] and [12], and their proofs in [13]. specifically, this research shows that:

• it is possible to model ethernet traffic producing results similar to those of real ethernet traffic using few parameters (parsimony), with the fundamental added value of being physically plausible.

• ethernet lan traffic can be modeled through the superposition of many sources that alternate between a state of burst transmission and one of inactivity, characterized by infinite-variance distributions; in particular this is proved using a pareto distribution.

• ethernet traffic is statistically self-similar regardless of the place and time at which it is checked.

• the degree of self-similarity, measured in terms of the hurst parameter (h), is a function of the utilization factor of the ethernet and can be used to gauge the magnitude of the traffic bursts.
• traditional traffic models are unable to capture the property of self-similarity.

these results and their deep implications, as can be noted from the preceding paragraphs, spurred a host of researchers seeking to observe that same behavior in the widest variety of communications scenarios and applications. what follows is an exhaustive literature review of research results, organized by field of application, that approaches the treatment of communications systems and applications from a self-similar perspective. it should be noted that the idea is not only to show the application of this view, but also to present results that dissent from it. the self-similar or fractal behavior of traffic in wan networks is shown in [14]-[16], pointing out the failure of poisson models to represent the strong correlations that exist at different time scales. evidence and conclusions on this behavior in www traffic are provided in [17]-[19], considering interconnection scenarios as well as traffic patterns in browsers. on the other hand, [20] and [21] point out the fractal nature of the data flows of the protocols that compose signaling system 7 in common-channel signaling networks [22], showing that the traditional methods are not adequate to interpret their behavior and that call durations are better characterized by hyperbolic-tail distributions. in another setting, [9] and [23]-[25] show that lrd is a characteristic inherent to vbr video traffic that bears no relation to the type of codec or the number of special effects contained in the recorded scenes.
specifically, vbr video traffic flows transmitted through b-isdn, atm, and internet networks are studied, showing that the tail behavior of the distribution that represents the marginal bandwidth can be described exactly if hyperbolic-tail distributions (like pareto's) are used; that the autocorrelation function of the video sequences decays hyperbolically (which is equivalent to long-range dependence) and can be modeled using self-similar processes; and, finally, that the use of models that only capture srd is inappropriate for characterizing this kind of traffic because it overestimates the performance, leading to insufficient resource assignments, which ultimately results in a poor perception by the users of the networks when difficulties appear in achieving the service quality they expect. in the field of wireless communications, [26] shows that traffic in cdpd networks has lrd behavior. using the r/s and variance-time methods, it estimates values of the hurst parameter of h = 0.8 and h = 0.9, respectively, thus ruling out the use of predictive models based on poisson arrival processes. in [27] the impact of mobility on the aggregate traffic in wireless networks in the city of bristol, uk, is investigated, together with whether voice and data applications jointly produce self-similar traffic. it is concluded that the aggregate traffic generated by mobile users using voice and data services together shows a self-similar lrd behavior that bears no relation to the penetration rate of the services. the authors warn of the drastic changes that wireless multimedia service implementations must undergo in terms of the traffic profiles used in their models in order to capture that characteristic. from another perspective, [28] shows that traffic in wireless networks with ad hoc topology is self-similar and forecastable, as a consequence of the fact that the underlying self-similar time series are, in essence, predictable.
the required data are captured using a wireless test network with ad hoc topology. the analysis of traffic and the design of wireless ip networks, describing tcp traffic as dominant in the present-day internet, is approached in [29], indicating that its statistical nature shows the same behavior over all time scales. it also presents an analysis of traffic traces showing the statistically self-similar nature of the traffic due to the www and to vbr video over these types of networks. [30] develops a new model for traffic in wireless networks that has its origin in the alternating fractal renewal processes (afrp) proposed as traffic models in [14], and in the wideband network traffic model using extended alternating fractal renewal processes (eafrp) proposed in [31]. with the incorporation of a limiting rate for the alternation between the two states, the rate-limited eafrp model is formulated, which represents an advance with respect to the wireline models traditionally used to model traffic in wireless networks, since it addresses their two main deficiencies: omission of the effects of the lrd temporal correlations, and inability to make reliable performance forecasts due to the high dependence on short-range processes. in [32] there is an extensive discussion of the problem of modeling the data traffic flowing to and from wireless networks with respect to the internet, addressing large-scale wireless communication structures. based on the methodology presented in [33] and tested extensively in [34] and [35], it is concluded that the circulating traffic flows cannot be treated using poisson models, and that their behavior is statistically self-similar. in [36] two solutions are proposed to create an interconnecting bridge between wimax and wifi links.
the former is based on maintaining a certain level of qos from one end to the other regardless of the wireless technology used, while the latter aims at reducing the complexity of the physical implementation at the expense of not providing any qos guarantee. in both cases the performance of the system is evaluated through computer simulations that consider real-time traffic with long-range dependence, manifested through a polynomial-type decay of the autocorrelation function. to model the traffic generated by the many terminals within the wlan, recourse is made to the on/off methodology presented in [14], supported by the foundations laid in [37].
discussion of the analysis of self-similar teletraffic with long-range dependence (lrd) at the network layer level 803
finally, in the same context but in another field of application, [38] deals with the use of fractal geometry in the antenna design process, while in [39] a general synthesis methodology is proposed that covers the efficient design of fractal antennas and their wimax applications. in the field of optical networks, [40] deals extensively with the use of self-similar traffic models, in particular the one proposed in [15], as the only way to represent reality faithfully and to evaluate the performance of ethernet passive optical networks. in [41] a new protocol is proposed for label switching in optical networks with optical burst switching (obs), with a time/space labeling that keeps the signaling always joined with the addressing functions. the results obtained are based on simulations under the premise of accepting the self-similarity of the traffic in networks with wavelength division multiplexing. finally, [42] reports on the implications of the reduction of the self-similarity of ip traffic by the obs assembly algorithms.
in spite of all the above arguments, which highlight the merits and advantages of parsimonious models and also provide a plausible physical interpretation of their parameters, the question arises as to the degree of prevalence of these self-similar traffic patterns, and as to the conditions under which performance analysis depends critically on considering self-similarity. in this sense it is valid to inquire not only about the origin of the data that have been analyzed with respect to the synthetic traces generated, but also about the context or setting in which these comparisons have been made, and about where the results are aimed. it is no less true that, in the light of all the research presented, it seems almost irrelevant to think of a traditional queueing analysis to represent data traffic flows in present-day high-speed networks. but this is neither categorical nor restrictive: that methodology cannot be dismissed flatly, and neither is it proper to think that the self-similar approach is the unified solution simply by arguing its ubiquity over all temporal scales, because this, analyzed from a higher perspective, and even though the presence of correlation in the traffic is not under discussion, raises the question of whether the correlation structure alone is sufficient to characterize traffic through self-similar processes. the context described in this way is very extensive, and therefore the results must be bounded.
there are several reports that evidence the lack of consensus on the field of application of self-similar models and on the impact of lrd on the performance of communications systems, and even though their number is much smaller than that of those whose results support exactly the opposite, their conclusions must be analyzed carefully because they raise, in essence, a critical and fundamental point in common: "since traditional queueing models are unable to capture the self-similarity characteristic, their validity for predicting performance would be supported if it were shown that self-similarity does not have a measurable impact on performance," and even more so if it were shown that models based on self-similar stochastic processes fail when considering the impact of the characteristic parameters that matter in each particular network case. precisely on this last point, [43] presents a detailed analysis of a fault detected in self-similar models, namely that they are incapable of reflecting the impact of the range of temporal scales of interest for performance evaluation and problem prediction, and of first-order statistics such as the marginal distribution of the process. based on traces generated by a jpeg encoder of an ntsc television channel and on the traces of [6], it is reported that:
• there is a correlation horizon such that correlations beyond it do not affect the loss rate.
• the correlation level to be considered when evaluating performance depends not only on the correlation structure of the source traffic, but also on the temporal scales belonging to the system under study.
• the scale factor considered has a considerably greater impact on the loss rate than hurst's parameter or the size of the buffers.
804 g. millán, h. kaschel, g. lefranc
• increasing the size of the buffers helps reduce the loss rate for srd traffic; for lrd traffic it does not have a considerable impact.
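the qualitative effect behind these observations (correlation inflates losses, while extra buffer helps mainly when correlations are short-lived) can be illustrated with a toy finite-buffer queue. the sketch below is ours: it uses a short-memory ar(1) input merely as a stand-in for correlated traffic, not the fluid model of [43], and all parameters are illustrative.

```python
import numpy as np

def queue_loss(arrivals, service=1.0, buffer=25.0):
    """Fraction of offered work lost in a finite-buffer fluid queue
    (discrete-time Lindley recursion with deterministic service)."""
    q, lost = 0.0, 0.0
    for a in arrivals:
        q = q + a - service
        if q > buffer:        # overflow: the excess work is dropped
            lost += q - buffer
            q = buffer
        q = max(q, 0.0)
    return lost / arrivals.sum()

rng = np.random.default_rng(42)
T, mu, sigma, phi = 200_000, 0.85, 0.5, 0.9

# uncorrelated input with mean mu and standard deviation sigma
iid = np.clip(mu + sigma * rng.standard_normal(T), 0.0, None)

# AR(1) input with the same marginal variance but slowly decaying correlation
e = rng.standard_normal(T) * np.sqrt(1.0 - phi**2)
x = np.empty(T)
x[0] = e[0]
for t in range(1, T):
    x[t] = phi * x[t - 1] + e[t]
corr = np.clip(mu + sigma * x, 0.0, None)

loss_iid, loss_corr = queue_loss(iid), queue_loss(corr)
```

with identical marginals and the same buffer, the correlated input loses markedly more work, which is the intuition the correlation-horizon argument then refines: only correlations up to a system-dependent horizon matter.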
an in-depth analysis of this work shows that the impact of the lrd is studied through a single-server model in relation to the size of the input buffer, using for that purpose a fluid model [44], [45] whose correlation decays hyperbolically up to a given cutoff lag, beyond which it drops to zero. based on the results of many simulation experiments using video traces as well as ethernet traces, for different values of hurst's parameter, cutoff lags, and buffer sizes, and a wide range of marginal distributions, the authors discovered the existence of a critical cutoff that they call the "correlation horizon," such that the loss rate is not affected if the cutoff is increased beyond it. the correlation horizon therefore separates the relevant from the irrelevant correlation coefficients with respect to the loss rate. finally, in their conclusions the authors argue that, because of the existence of a finite correlation horizon (which in the case of finite buffers is a function of their size), any model that captures the correlation structure up to that horizon will be valid to represent the system. on the contrary, if the correlation horizon is infinite, i.e., the system's time scale cannot be determined clearly, then self-similar models must be used. building on this point, [46] states that, in the case of a finite buffer, the effects of the lrd are detectable only if they make the busy periods become sufficiently long, since the tail behavior of those periods is largely determined by the characteristics of the traffic that arrives during them.
in that respect, this appraisal is based on [47], where, through the definition of the concept of a "relevant temporal scale" as the typical duration over which all the arrivals at a queue interact and collectively affect its behavior, it is deduced that for large buffers the queue can grow long, so many arrivals interact in the queue and the long-range correlation causes more losses than those predicted by models that are not capable of considering it or do not consider it. in the case of small buffers, however, where few arrivals interact, the effect of the lrd is imperceptible. the authors' standpoint on appraising the effects of the lrd in terms of its impact on the busy periods is therefore depicted through the concept of a "reset effect," which means that as the buffer in question empties, the system forgets everything. in this way, in the case of vbr video servers, which are sensitive to delay and to frame loss, the traffic intensity will not be very large, giving rise to short busy periods and a very pronounced reset effect in the regions of practical operation, an effect that is further reinforced in the case of finite buffers, due to their truncating effect. the latter is due to the fact that a busy period in which there is overflow is shorter than the corresponding one in an infinite-buffer model, or, equivalently, there can be several busy periods in the finite-buffer version before the corresponding busy period ends in the infinite-buffer version.
then, through the use of markov models, the authors finally report that when the srd is strong and the parameter h is moderate, the lrd has no impact on the buffer occupancy, and therefore their models yield good estimates; and that in the case in which the srd shows a slightly pronounced behavior and the parameter h has a high value, the truncating effect is sufficiently strong for their models to estimate the loss rate well, even though it is admitted that for a high traffic intensity and a large buffer size the estimate of the mean queue length is poor. in [48], a study with characteristics similar to the previous one, the concept of the critical time scale (cts) is introduced as follows: given the buffer size and the marginal distribution of frame sizes, the cts of a vbr video source is defined as the number of frame correlations that contribute effectively to the cell loss rate. using models of the video traces of [9], the authors state that for markovian models as well as for those with lrd, the cts is finite and decreases with the buffer size. consequently, and under the assumption that the buffer size required to multiplex a large number of vbr video sources is typically small due to the restrictions imposed by real-time applications, it is concluded that for buffer sizing scenarios in atm networks it is not necessary to capture the correlations with the lrd of a video source even if the traffic shows this behavior markedly, because for all practical cases the srd has a dominant impact.
the following can be argued with respect to this last conclusion and to the research in general:
• the result is based on an analysis that relies completely on marginal gaussian distributions, and even though this can be explained by the wish to keep the analytic treatment tractable, it is not sufficient to validate absolute generalizations of the "in all cases" type, nor to attenuate the fact of working with distributions chosen independently of the behavior of the others.
• the behavior of the models when hyperbolic-tail distributions are used, or when they are tested with other marginal distributions, is not reported.
an extended version of this research is [49], where not only are these two considerations taken up, but attention is also given to the relevance of the srd and the lrd in real-time vbr video traffic in wideband networks (particularly atm) and in integrated internet services; through a theoretical and simulation approach, the authors tackle the problem of determining admissible ranges for the cell loss probability and its relation to the buffer sizes in terms of the maximum delay. it should be noted that this research takes place within the context of validating the use of markovian and srd models for video applications, and for that purpose the authors show basically that:
• the long-range correlations have no impact on the cell loss probability.
• an adequately implemented markov model that captures the relevant range of correlation provides good performance predictions.
• capturing the lrd by itself can lead to an underestimation of the necessary network resources.
• the cts explains the strong relation existing between the cell loss probability in atm and the srd at the expense of the lrd; that is, for the applications of interest, the cts is small and more sensitive to the short-range than to the long-range traffic correlations.
• analytically simple markovian models are feasible and have the capacity to harmonize marginal distributions and correlations over a given critical time scale.
as a last example of the already mentioned lack of consensus in the field of video traffic, [50] presents a model of a vbr traffic source that uses finite-state markov chains, and states that although the original model presented in [26] is good in terms of its parsimony, it does not lend itself adequately to analytic studies. summarizing, six reports have been presented so far with a common denominator: they dissent from the blind applicability of the self-similar model and its implications for performance, taking as examples systems that involve video traffic, mainly of the vbr type. but, as expected, this is not the only field in which discrepancies appear; more acutely, and in direct relation to the stated working hypothesis, there is a number of reports that question the validity of hurst's parameter as a single descriptor to characterize the lrd of a self-similar stochastic process. before dealing with this topic, however, a last group of reports is mentioned that take a critical position toward self-similar models in areas other than video traffic. in the field of wireless networks and their associated technologies, [51] studies the behavior of the self-similarity characteristic of traffic when it passes from a wired to a wireless network through a gateway, concluding that the device can change the traffic's degree of self-similarity as a direct consequence of the reassembly and repacking operations on the self-similar input traffic, even eliminating it entirely.
in [52] the above behavior is reaffirmed by studying the influence of the ieee 802.11e mac mechanism on lrd traffic when it traverses one or several links, suggesting that the traffic transported through a wlan interface undergoes deep structural changes in its statistical model, and showing that the fractional gaussian traffic model is inadequate to describe its behavior. finally, in the field of obs networks, [53] reports the development of a burst assembly algorithm whose purpose is to reduce the degree of self-similarity of ip traffic. it is admitted that this characteristic is inherent to www traffic, but its presence causes an important disadvantage in terms of queueing performance, so it must be reduced in favor of a random srd traffic. the first paragraph of this page mentions the existence of research that questions the use of hurst's parameter as a single indicator to capture the self-similarity characteristic of a stochastic process that claims to be such, in addition to stressing the importance of this fact for proving the stated working hypothesis. in this respect, and recalling that statement, it is specified that the idea is not to validate exhaustively the self-similarity parameter, or hurst's parameter, but only to obtain an indicator of the presence of the characteristic in representative traffic series. consider the following definition: the real-valued process x(t), t ∈ R, is self-similar with h > 0 if for all a > 0 the finite-dimensional distributions of x(at), t ∈ R, are identical to the finite-dimensional distributions of a^h x(t), t ∈ R, i.e.,
{x(at), t ∈ R} =_d {a^h x(t), t ∈ R}, ∀a > 0 (2.1)
where =_d means equality of all the finite-dimensional distributions [54], [55].
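definition (2.1) can be checked numerically in the classic h = 1/2 case, standard brownian motion, for which the variances of the finite-dimensional distributions satisfy var b(at) = a^(2h) var b(t). the following sketch is illustrative only (grid sizes and seed are our own choices):

```python
import numpy as np

rng = np.random.default_rng(0)
n_paths, n_steps, dt = 20_000, 250, 1.0 / 250

# Brownian paths on a grid: cumulative sums of independent N(0, dt) increments
paths = np.cumsum(rng.standard_normal((n_paths, n_steps)) * np.sqrt(dt), axis=1)

H, a, t_idx = 0.5, 4.0, 50               # t = 0.2 and a*t = 0.8 lie on the grid
var_t = paths[:, t_idx - 1].var()        # Var B(t)  is approximately t = 0.2
var_at = paths[:, 4 * t_idx - 1].var()   # Var B(at) is approximately a*t = 0.8

ratio = var_at / var_t
expected = a ** (2 * H)                  # the scaling law predicts ratio = 4
```

the same scaling check, applied with an estimated h, is one quick way to probe whether a measured traffic trace is at least consistent with (2.1).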
the property defined by (2.1) is usually known as the scaling property, and a direct consequence of the definition is that a self-similar process preserves its distribution, and thereby its statistics, when it is subjected to a temporal scaling. from the same standpoint, the parameter h, or hurst's parameter, is known as the self-similarity parameter of the stochastic process x(t) with which it is associated. the first report that recognized and approached the need for additional parameters to characterize the variability of the traffic is [26], where, although it is accepted that h is necessary for that purpose, it is shown not to be sufficient. through a detailed statistical analysis of vbr video samples, the authors conclude that the autocorrelation of the vbr video sequences decays hyperbolically, which is equivalent to lrd. but since the lrd is related to the frequency components of the process and not to the distribution of the bandwidth requirements, if the marginal distribution is compressed, because the coefficient of variation (the ratio of the standard deviation to the mean bandwidth) tends to zero as the number of multiplexed input sources giving rise to the traffic tends to infinity, the traffic, as the number of sources increases, is confined within narrow statistical limits; and although within these frontiers the behavior continues to be long range (a result confirmed through h = 0.7), in the range in which the standard deviation is much smaller than the product of the mean of the bandwidth distribution and the number of multiplexed input sources, the traffic does not depend on h. therefore, h is necessary to characterize the variability, but it is not sufficient, so an adequate characterization of video traffic must consider at least the following four parameters: the mean of the bandwidth distribution, the number of multiplexed input sources, the standard deviation, and the coefficient of variation derived from them.
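the vanishing coefficient of variation under multiplexing can be illustrated with a small numerical sketch. exponential sources are our own stand-in for the finite-variance case; for independent sources the coefficient of variation of the aggregate shrinks like 1/sqrt(n) as the number n of multiplexed sources grows.

```python
import numpy as np

def aggregate_cov(n_sources, n_slots=50_000, rng=None):
    """Coefficient of variation (std / mean) of the slot-by-slot
    superposition of n_sources independent exponential sources."""
    rng = rng or np.random.default_rng(3)
    aggregate = rng.exponential(1.0, size=(n_slots, n_sources)).sum(axis=1)
    return aggregate.std() / aggregate.mean()

cov_1 = aggregate_cov(1)      # a single exponential source has CoV close to 1
cov_100 = aggregate_cov(100)  # CoV shrinks by roughly 1/sqrt(100), i.e. 10x
```

this is the compression of the marginal distribution referred to above: the correlation structure (and h) is untouched by the aggregation, yet the traffic is squeezed into ever narrower statistical limits.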
the authors make it clear, finally, that these results are valid only when the central limit theorem [56] applies, i.e., when the standard deviation is finite. although the above result may seem expected, because parsimony can lead to imprecision in both the interpretations and the results, the same cannot be said of the effect produced by assuming that, even if the detailed behavior of the components of a given stochastic process p1 showing some degree of self-similarity h1 is not known, a one-to-one correspondence between h1 and p1 is clearly established. in other words, processes that show clearly differentiated behaviors are possible whose correlation structures are nonetheless characterized by the same parameter h. in this respect, [57] approaches this problem by considering the asymptotic behavior of an unlimited multiplexer buffer under different self-similar input traffic models, in particular cox's infinite-server models, or m/g/∞ [58], and fractional gaussian noise models [59], [60]. the authors report obtaining two completely different behaviors for the buffer's probability tail, namely that while with the former the decay is mostly hyperbolic, with the latter it presents a weibull asymptotic behavior [61], simply showing that hurst's parameter is insufficient as a single descriptor to characterize the lrd of input traffic models [62], [63]. in [64] it is also shown how synthetic traces that have identical self-similarity parameters and means differ significantly from one another. finally, in [65] the insufficiency of hurst's parameter by itself as a precise descriptor of the long-range dependence of the traffic in an ethernet network is reported.
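the m/g/∞ input process of [57], [58] is easy to sketch: sessions arrive as a poisson stream, each holds for an i.i.d. duration, and the traffic at each slot is the number of simultaneously active sessions; heavy-tailed (pareto) holding times with 1 < alpha < 2 induce the lrd regime. the parameters below are illustrative, not taken from the cited reports.

```python
import numpy as np

def mg_inf_trace(n_slots, rate=5.0, alpha=1.5, rng=None):
    """Busy-server count of a discrete-time M/G/infinity system:
    Poisson(rate) session arrivals per slot and Pareto(alpha) holding
    times (infinite variance for 1 < alpha < 2, the LRD regime)."""
    rng = rng or np.random.default_rng(11)
    active = np.zeros(n_slots)
    for t in range(n_slots):
        for _ in range(rng.poisson(rate)):
            # Pareto(alpha) holding time of at least one slot (inverse CDF)
            hold = int(np.ceil((1.0 - rng.random()) ** (-1.0 / alpha)))
            active[t : t + hold] += 1  # session occupies slots t .. t+hold-1
    return active

trace = mg_inf_trace(20_000)
mean_load = trace.mean()  # near rate * E[holding time] in steady state
```

feeding this trace and a fractional-gaussian-noise trace with the same h into the same queue would exhibit the point of [57]: identical hurst parameters, qualitatively different buffer tail behavior.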
in this research it is convincingly shown, through a queueing analysis applied to a series of representative data from real traffic traces recorded on the ethernet network of the department of computer science of the university of california at los angeles, ca, usa, that hurst's parameter does not provide a precise prediction of queueing performance for a given lrd traffic, and that its behavior is not monotonic with respect to the presence or absence of bursts when the original series is disaggregated into smaller ones, which also implies that h does not serve to characterize the relative importance of groups within a whole. both results clearly oppose the conventional view of [6], since h cannot be used to size the traffic bursts in ethernet. questions about where, when, and under what set of circumstances the use of self-similar processes in modeling communications systems and related applications is completely valid, as well as the uncertainty about the existence of a scenario that unifies criteria, and its influence on the basic characteristics of the processes, remain unanswered. however, they seem to find an answer derived from the analysis of the research of ryu and lowen [66], [67] on the use of fractal point processes (fpp) for modeling and analyzing self-similar traffic in networks [68].
concretely, this research proposes to distinguish between self-similarity at the application level and self-similarity at the network level for the purposes of the design and administration of wideband networks, in terms of a correct provision of the qos required by the applications. its best results are the demonstrations that the self-similarity of vbr traffic can frequently be ignored when sizing buffers in atm, and that self-similar traffic at the application level can be managed effectively in the context of admission control for assigning resources with quality-of-service guarantees, because it is independent of the network conditions under which it is sent.
3 conclusions
a detailed discussion has been presented of the theoretical bases that support the position that high-speed computer network traffic shows self-similar behavior and long-range dependence. with respect to the above, and considering the arguments for and against this position, it is believed that traffic in present-day computer network settings is of a statistically self-similar nature and presents a pronounced long-range dependence (lrd). accepting these singularities as inherent to the traffic flows of present-day high-speed network settings, it is proposed that their behaviors are amenable to being modeled by limiting their applicability to the network layer level, estimating that the most relevant properties of self-similar stochastic processes are suitable for use in the formulation of traffic models when that distinction is made, since the concepts of self-similarity and long-range dependence are justified by the need to describe faithfully the real traffic processes in present-day computer network settings.
to depict the concepts with which this research deals, the following working hypothesis is proposed: "it is completely feasible to restrict the evolution of a statistically self-similar process to a well-defined application setting without altering its nature and its more important properties, in that way highlighting the validity of its postulates and giving greater plausibility to its physical interpretation." considering all the arguments given above, it is stated that it is theoretically feasible to prove it, since all its arguments are correctly founded and supported, and only its proof at the analytic and experimental levels remains as a future task.
bibliography
[1] w. stallings, redes e internet de alta velocidad. rendimiento y calidad de servicio, 2nd ed., madrid: pearson prentice hall, 2004, pp. 224-237.
[2] l. kleinrock, information flow in large communication nets, ph.d. thesis, research laboratory of electronics, massachusetts institute of technology, cambridge, ma, 1961.
[3] l. kleinrock, communication nets: stochastic message flow and delay, new york: mcgraw-hill, 1964.
[4] w.e. leland, m.s. taqqu, w. willinger, and d.v. wilson, on the self-similar nature of ethernet traffic, in proc. acm sigcomm '93, san francisco, ca, pp. 183-193.
[5] w.e. leland, m.s. taqqu, w. willinger, and d.v. wilson, on the self-similar nature of ethernet traffic (extended version), ieee/acm trans. netw., vol. 2, no. 1, pp. 1-15, feb. 1994.
[6] a. erramilli, r.p. singh, and p. pruthi, application of deterministic chaotic maps to model packet traffic in broadband networks, in proc. 7th itc specialist seminar, morristown, nj, 1990.
[7] w.e. leland and d.v. wilson, high time-resolution measurement and analysis of lan traffic: implications for lan interconnections, in proc. ieee infocom '91, bal harbour, fl, pp. 1360-1361.
[8] j. beran, r. sherman, m.s. taqqu, and w.
willinger, long-range dependence in variable-bit-rate video traffic, ieee trans. commun., vol. 43, no. 2/3/4, pp. 1566-1579, feb./mar./apr. 1995.
[9] b. mandelbrot, self-similar error clusters in communication systems and the concept of conditional stationarity, ieee trans. commun. technol., vol. 13, no. 1, pp. 71-90, mar. 1965.
[10] http://ita.ee.lbl.gov/html/contrib/bc.html.
[11] w. willinger, m.s. taqqu, r. sherman, and d.v. wilson, self-similarity through high-variability: statistical analysis of ethernet lan traffic at the source level, in proc. acm sigcomm '95, cambridge, ma, pp. 100-113.
[12] w. willinger, m.s. taqqu, r. sherman, and d.v. wilson, self-similarity through high-variability: statistical analysis of ethernet lan traffic at the source level, ieee/acm trans. netw., vol. 5, no. 1, pp. 71-86, feb. 1997.
[13] m.s. taqqu, w. willinger, and r. sherman, proof of a fundamental result in self-similar traffic modeling, acm sigcomm computer communication review, vol. 27, no. 2, pp. 5-23, apr. 1997.
[14] a. adas and a. mukherjee. (1994, dec.). on resource management and qos guarantees for long range dependent traffic. georgia inst. tech., ga. [online]. available: http://hdl.handle.net/1853/6797.
[15] s.m. klivansky and a. mukherjee. (1995, aug.). the nsfnet. georgia inst. tech., ga. [online]. available: ftp://ftp.cc.gatech.edu/pub/coc/tech_reports/95/git-cc-95-07.ps.z.
[16] v. paxson and s. floyd, wide-area traffic: the failure of poisson modeling, ieee/acm trans. netw., vol. 3, no. 3, pp. 226-244, jun. 1995.
[17] m.e. crovella and a. bestavros. (1995, oct.). explaining world wide web traffic self-similarity. boston univ., ma. [online]. available: http://www.cs.bu.edu/techreports.
[18] m.e. crovella and a. bestavros, self-similarity in world wide web traffic: evidence and possible causes, ieee/acm trans. netw., vol. 5, no. 6, pp. 835-846, dec. 1997.
[19] m. arlitt, r. friedrich, and t. jin, workload characterization of a web proxy in a cable modem environment, performance evaluation review, vol. 27, no. 2, pp. 25-36, sep. 1999.
[20] d.e. duffy, a.a. mcintosh, m. rosenstein, and w. willinger, statistical analysis of ccsn/ss7 traffic data from working ccs subnetworks, ieee j. sel. areas commun., vol. 12, no. 3, pp. 544-551, apr. 1994.
[21] p. pruthi and a. erramilli, heavy-tailed on/off source behavior and self-similar traffic, in proc. 1995 ieee international conference on communications, seattle, wa, vol. 1, pp. 445-450.
[22] g. rufa, developments in telecommunications. with a focus on ss7 network reliability, berlin, germany: springer-verlag, 2008.
[23] m.w. garrett and w. willinger, analysis, modeling and generation of self-similar vbr video traffic, computer communication review, vol. 24, no. 4, pp. 269-280, oct. 1994.
[24] b. tsybakov and n.d. georganas, on self-similar traffic in atm queues: definitions, overflow probability bound and cell delay distribution, ieee/acm trans. netw., vol. 5, no. 3, pp. 397-409, jun. 1997.
[25] l. yellanki, performance evaluation of vbr video traffic models, m.sc. thesis, dept. comput. sci., univ. saskatchewan, saskatoon, sk, canada, 1999.
[26] m. zhonghua, analysis of wireless data network traffic, m.sc. thesis, school of engineering science, simon fraser univ., burnaby, bc, canada, 2000.
[27] d.r. basgeet, j. irvine, a. munro, p. dugenie, d. kaleshi, and o. lazaro, impact of mobility on aggregate traffic in mobile multimedia system, in proc. 5th international symposium on wireless personal multimedia communications, honolulu, hi, 2002, vol. 2, pp. 333-337.
[28] q. liang, ad hoc wireless network traffic-self-similarity and forecasting, ieee commun. lett., vol. 6, no. 7, pp. 297-299, jul. 2002.
[29] t. janevski, characterization and classification of ip traffic, in traffic analysis and design of wireless ip networks, norwood, ma: artech house, inc., 2002, ch. 5, pp. 135-165.
[30] j. yu, modeling of high-speed wireline and wireless network traffic, ph.d. dissertation, elect. comput. eng. dept., drexel univ., philadelphia, pa, 2005.
[31] x. yang, impulsive self-similar processes, with applications in broadband communication system modeling, ph.d. dissertation, elect. comput. eng. dept., drexel univ., philadelphia, pa, 2001.
[32] j. ridoux, a. nucci, and d. veitch, seeing the difference in ip traffic: wireless versus wireline, in proc. 25th ieee international conference on computer communications, barcelona, spain, 2006, pp. 1-12.
[33] n. hohn, d. veitch, and p. abry, does fractal scaling at the ip level depend on tcp flow arrival processes?, in proc. 2nd acm sigcomm workshop on internet measurement, marseille, 2002, pp. 63-68.
[34] n. hohn, d. veitch, and p. abry, cluster processes, a natural language for network traffic, ieee trans. signal process., vol. 51, no. 8, pp. 2229-2244, aug. 2003.
[35] n. hohn, d. veitch, and p. abry, the impact of the flow arrival process in internet traffic, in proc. ieee int. conf. acoustics, speech, and signal processing, hong kong, 2003, vol. 6, pp. vi-37-40.
[36] r. fantacci and d. tarchi, bridging solutions for a heterogeneous wimax-wifi scenario, journal of communications and networks, vol. 8, no. 4, pp. 369-377, dec. 2006.
[37] a. erramilli, m. roughan, d. veitch, and w. willinger, self-similar traffic and network dynamics, proc. of the ieee, vol. 90, no. 5, pp. 800-819, may 2002.
[38] j.p. gianvittorio and y. rahmat-samii, fractal antennas: a novel antenna miniaturization technique, and applications, ieee antennas propagat. mag., vol. 44, no. 1, pp. 20-36, feb. 2002.
[39] r. azaro, e. zeni, m. donelli, and a. massa, fractal-based methodologies for wimax antenna synthesis, in wimax: technologies, performance analysis, and qos, s. ahson and m. ilyas, eds., boca raton, fl: crc press, 2008, ch. 2, pp. 21-39.
[40] g.
kramer, ethernet passive optical networks, new york: mcgraw-hill, 2005.
[41] a. huang, b. mukherjee, l. xie, and z. li, time-space label switching protocol (tslsp), in high-performance packet switching architectures, i. elhanany and m. hamdi, eds., germany: springer-verlag, 2007, ch. 9, pp. 197-210.
[42] m. maier, optical switching networks, new york: cambridge university press, 2008.
[43] m. grossglauser and j.-c. bolot, on the relevance of long-range dependence in network traffic, ieee/acm trans. netw., vol. 7, no. 5, pp. 629-640, oct. 1999.
[44] j.-y. le boudec and p. thiran, network calculus: a theory of deterministic queuing systems for the internet, germany: springer-verlag, 2004, pp. 3-6.
[45] a. adas, traffic models in broadband networks, ieee commun. mag., vol. 35, no. 7, pp. 82-89, jul. 1997.
[46] d.p. heyman and t.v. lakshman, what are the implications of long-range dependence for vbr-video traffic engineering?, ieee/acm trans. netw., vol. 4, no. 3, pp. 301-317, jun. 1996.
[47] k. sriram and w. whitt, characterizing superposition arrival processes in packet multiplexers for voice and data, ieee j. sel. areas commun., vol. 4, no. 6, pp. 833-846, sep. 1986.
[48] b.k. ryu and a. elwalid, the importance of long-range dependence of vbr video traffic in atm traffic engineering: myths and realities, computer communication review, vol. 26, no. 4, pp. 3-14, oct. 1996.
[49] b.k. ryu and a. elwalid, the relevance of short-range and long-range dependence of vbr video traffic to real-time traffic engineering, unpublished.
[50] k. chandra and a.r. reibman, modeling one- and two-layer variable bit rate video, ieee/acm trans. netw., vol. 7, no. 3, pp. 398-417, jun. 1999.
[51] j. yu and a. petropulu, is high-speed wireless network traffic self-similar?, in proc. ieee int. conf. acoustics, speech, and signal processing, montreal, canada, 2004, vol. 2, pp.
International Journal of Computers, Communications & Control, Vol. I (2006), No. 4, pp. 126-138

Generic Modeling and Configuration Management in Product Lifecycle Management

Souheïl Zina, Muriel Lombard, Luc Lossent, Charles Henriot

Abstract: PLM (Product Lifecycle Management) is often defined as a set of functions and procedures that allows one to manage and exploit the data defining both the products and the processes implemented for their development.
However, installing a PLM solution remains a difficult exercise, given the complexity and diversity of customer requirements as well as the transverse use of such a solution across all of a company's functions. The issues faced by both editors and integrators of PLM applications arise from the specific character of customers' projects, even though most functional needs are generic. In this paper we focus on product modeling in PLM applications, and more particularly on configuration management, which traces product evolutions throughout the lifecycle. We emphasize the links between configuration needs and multi-view modeling approaches, and we highlight problems related to the deployment of PLM applications. Our work concerns generic PLM solutions based on the concept of generic models. Such a generic model takes into account the configuration specification associated with the managed product and can be extended to cover specific needs.

Keywords: PLM, configuration management, generic modeling.

1 Introduction

Monitoring technical information is one of the main preoccupations of companies. Indeed, increasingly constraining regulations and a higher level of competition require being more rigorous and more reactive to customers' requests. Improving product quality and reducing costs and cycle times require applying technical data management rules and means. PLM (Product Lifecycle Management) appeared to answer the need of managing growing volumes of data in increasingly complex environments. PLM covers the entire product life cycle, without stopping at the design stages; today it extends to a wide scope of fields such as aerospace and defense, food, drugs, the public sector, engineering, manufacturing, etc. The term product thus designates diverse entities in very varied trades.
Managing the product life cycle with a PLM solution involves not only all the elements necessary to ensure traceability, such as modeling, document management, numerical analysis, know-how capitalization, etc., but also all the information system components that make it possible to monitor the product from its manufacture to its marketing and until its disappearance or, more likely, its recycling. In PLM applications, technical data are organized within configurations. Configuration management is used to handle product complexity and the diversity of knowledge resulting from the various business cases in the company. Indeed, the growing number of users of PLM applications, the volume of technical data and the various evolutions associated with these data require:

• controlling and checking the consistency, uniqueness and safety of the exchanged technical data,
• taking into account data evolutions and all their effects on the product and its components.

Copyright © 2006 by CCC Publications

This is why configuration management is a fundamental component of PLM applications, making it possible to control and manage data-related complexity. In the literature, several research efforts address various types of problems around PLM. This work is especially interested in problems related to data exchange and sharing [32, 26], process management [19, 12, 33] and product configuration management [20, 21, 18, 4]. In this paper, we concentrate on technical data management and particularly on configuration management. In the first part of this document, we present problems related to configuration modeling and an example of processing performed on given configurations. In the second part, we show why deploying a PLM solution in a company remains a difficult exercise, and we point out the first problems that arise.
The study carried out at Lascom¹ consisted, starting from existing PLM applications, in setting up a reverse engineering step that made it possible to formalize the concepts used in PLM applications within a UML meta-model. This fundamental stage made it possible to validate the handled concepts at the conceptual level, and then to underline the advantages of working at this modeling level for capitalization.

2 Product Modeling in PLM Applications

A technical object is a business object on which data management requirements are expressed with regard to the management and handling of complex objects. The technical data concern design, manufacture, maintenance, recycling and marketing [3]. The definition of the various business or technical objects and the links between them strongly depends on the company's needs, organization and working methods. The diversity of PLM applications (a diversity related to customer specificities), the increasing complexity of the data and the need for evolving and flexible systems (due to the evolution of needs over time) imply that there is no universal PLM application model able to meet all the conflicting needs of customers. In the literature, several research efforts address product modeling with different approaches and technologies: RDF (Resource Description Framework) [17], topic maps [16], or object-oriented approaches [9], which aim to model and implement PDM (Product Data Management) systems using UML (Unified Modeling Language).

2.1 Configuration Management in PLM Applications

Research works are interested in developing methods and tools for managing the product configuration; they deal especially with generic product modeling. In the literature, several methods based on artificial intelligence techniques propose solutions to product configuration problems [29]. These methods are based on rules and constraints [1, 31, 10].
The product configuration is based on a configuration model (often called the generic product structure); the model describes the components (also called configuration elements, or member objects of a configuration) that can be included in the product configuration. The configuration model also includes the combination rules between these components. Various tools, called configurators (e.g. EngCon [14], WeCoTin [2]), make it possible to obtain a precise product description that satisfies the needs and is validated by compliance with the constraint rules defined in the configuration model. These tools can be integrated into PLM applications to help configure the product. (¹Lascom (www.lascom.com) is an editor of life cycle management solutions concerning company products and processes.) In our work we concentrate particularly on configuration management. Like configuration tools, configuration management tools are also based on product models that make it possible to formalize the technical objects, links and constraints expressing the needs to be managed and traced. Indeed, configuration management is a management discipline that consists in applying technical and administrative rules to development, production and maintenance throughout the life cycle of a configuration item. It consists in managing the technical description of a system (and its different components), as well as managing all the modifications made during the system's evolution. Configuration management is of particular interest for products that have many variants and long lifecycles, or for one-of-a-kind complex products such as the production system itself or special machines [22]. Figure 1 presents an example configuration of a production site composition modeled in a PLM application. This configuration follows the evolution of the production lines and their adaptability to the product.
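For illustration, a configuration model of this kind (a set of candidate components plus combination rules, as checked by a rule-based configurator) can be sketched as follows. This is a minimal sketch, not the implementation of EngCon or WeCoTin; the component names and rules are hypothetical.

```python
# Illustrative sketch of a rule/constraint-based configurator check.
# A configuration model: candidate components plus combination rules,
# each rule being a named predicate over the selected component set.

def check_configuration(selected, rules):
    """Return the names of the combination rules violated by `selected`."""
    return [name for name, predicate in rules if not predicate(selected)]

# Hypothetical rules for a production-site product (not from the paper).
rules = [
    ("a building needs at least one production line",
     lambda s: "building" not in s or "line" in s),
    ("packaging and bulk output are mutually exclusive",
     lambda s: not ({"packaging", "bulk"} <= s)),
]

print(check_configuration({"building", "line", "packaging"}, rules))  # []
print(check_configuration({"building"}, rules))
```

A configuration is valid when the returned list of violations is empty; a configurator integrated in a PLM application would apply such rules before accepting a product description.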
Figure 1: Example of a production site configuration.

Configuration management consists in controlling the information describing the product structure, especially its decomposition into elementary subsets and parts, and in attaching functional and physical characteristics to this information. The standard [15] presents recommendations for using configuration management in industry; it provides the detailed process, organization and procedures of management. According to this standard, configuration management is an integral part of PLM; it provides a clear vision of the configuration state associated with products or projects, as well as their evolutions, by guaranteeing full traceability. Configuration management tools integrate functions and mechanisms allowing the audit and control of all actions carried out on the product configuration.

2.2 Product Multi-Point of View Modeling

The concepts of view and point of view have been studied in several fields related to data processing: databases, analysis and design, programming languages, etc. In the literature, several research efforts have integrated the concepts of view and point of view into product modeling [24, 23, 13, 25, 28]; these views are generally used to express the needs of the different trades with respect to the product. Figure 2 presents an example of different business views around a product.

Figure 2: Various business views associated with a product.

The notion of point of view, as classically used in the literature, has as its main goal the description of a complex entity having several facets. Points of view make it possible to structure information according to various criteria related to the trade or the product. This makes the information more representative and understandable, and thus easier to exploit. Each actor in the company handles a particular view of the product that corresponds to his specific needs: functional view, technical view, industrial view, etc.
There are two principal approaches used to take the actors' points of view into account in the product model: the multi-view and multi-model approaches. The multi-view approach is based on the development of a single model built from the different views. This unique model is accessible according to several points of view. The main advantage of using a single model is that modifications made on one sub-model are reflected in the other sub-models. Consequently, the inconsistency problems due to splitting the data between partial models are avoided [24, 23]. This approach is widely used in work on product representation in CAD (Computer-Aided Design) systems. The research of Million [23] relates to the problem of designing technical information systems in an industrial multi-actor context in order to visualize information according to various views. The suggested method, called VIM (Viewpoints Information Modeling), makes it possible to build up, by successive adjustments, a global data model starting from an initial model centered on the technical object, according to the points of view considered. The principal interest of the multi-view approach lies in the fact that there is a unique model to manage, which facilitates the management of exchanges and information sharing. However, it leads to a static product representation, because the models handled by the various actors are fixed and do not vary at the same time as the product representation [8]. The multi-model approach consists in creating a model associated with each actor's view of the product. Thus, there exist as many models as there are different points of view on the product. Each model contains the technical objects and the relations that correspond to the given point of view. Managing the relations between these various models imposes the use of a set of coherence rules that must apply to the whole set of models.
The multi-model approach makes it possible to structure the data according to models specific to each point of view on the product. These separate models can evolve independently. However, maintaining coherence and sharing information between these models are much more difficult to ensure. The coherence problem is attenuated by the use of coherence rules, but their identification and formalization remain difficult.

2.3 Configuration Models and Data Processing

To be coherent with the needs of the different actors concerned with the product, configuration models must take into account these concepts of view and point of view associated with the product. According to the application needs, Advitium™, a software package developed by the Lascom company, defines several types of configuration for a product (a given technical object). We define, for example, a design configuration, a documentary configuration, a configuration built according to the stages reached in the project, etc. These configuration types correspond to different points of view on the product. Each configuration is based on its own model and can evolve independently of the other types of configuration associated with the product. These configurations structure the data necessary for the product definition. Thus, each actor will gather, organize hierarchically and complete the technical objects according to his own needs. It is for this purpose that various structures of the product are managed, each corresponding to a particular configuration type. We defined the concept of "context" related to the configuration elements in order to take into account the context-specific use of data. Applying a context allows the definition of contextual views on the configuration [34].
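The idea of a context defining a contextual view can be sketched as tagging configuration elements with the contexts in which they are visible and filtering by one of them. This is a hypothetical sketch, not Advitium's actual mechanism; the element identifiers and context names are invented for illustration.

```python
# Hypothetical sketch of contextual views on a configuration:
# each configuration element carries the set of contexts in which it
# is visible; applying a context filters the configuration.

elements = [
    {"id": "drawing_1", "contexts": {"design", "documentation"}},
    {"id": "routing_1", "contexts": {"manufacturing"}},
    {"id": "folder_a",  "contexts": {"design", "manufacturing"}},
]

def contextual_view(elements, context):
    """Return the ids of the elements visible in the given context."""
    return [e["id"] for e in elements if context in e["contexts"]]

print(contextual_view(elements, "design"))         # ['drawing_1', 'folder_a']
print(contextual_view(elements, "manufacturing"))  # ['routing_1', 'folder_a']
```

Each context thus yields a different structure over the same underlying technical objects, which matches the idea that each actor works on the view corresponding to his needs.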
Starting from existing PLM applications, the study carried out at Lascom consisted in setting up a reverse engineering step (step 1) that made it possible to formalize the concepts (steps 2 & 3) used in Advitium™ within a UML meta-model [27] (this meta-model is not presented in this paper). The employed methodology is illustrated in Figure 3; the deployment of PLM applications (steps 4 & 5) is described in Figure 7, Section 3.1.

Figure 3: The employed methodology.

We developed tools for the retro-modeling of PLM applications; these tools allow the formalization of the configuration links between the various classes of technical objects. Our problem lies in formalizing the technical objects and the links existing between them in order to address:

• the presentation of technical data to the user,
• the optimization of the links existing in the database, based on semantic studies produced according to customer requirements,
• easy access to configuration data and the optimization of the processing applied to configuration elements.

Our tool allows, from Advitium™ relational databases, the generation of an XSD (XML Schema Definition) describing the configuration definitions. Figure 4 represents the hierarchical structure of classes in an example configuration model of a technical object (OT01_Site). An example instance of this class is given in Figure 1.

Figure 4: Example of a configuration XML schema.

In this model, configuration elements (buildings, drawings, documents, folders, etc.) are linked together and organized to form configurations. Technical objects can be related for various reasons: to express a composition link, a specialization, a membership, etc. The links between technical objects can be static (specific to a technical object instance) or dynamic (evolving with the versions of the technical objects) [4]. The majority of configuration models are recursive.
It can be noted in Figure 4 that the class "folder" is re-used at various places in the hierarchy. This leads, at the physical level, to potentially infinite hierarchical structures. Our tool, associated with Graphviz [11], allows the visualization of the configuration links graph of an existing PLM application. This representation informs us about the structure and characteristics of the configuration (depth, degree of object re-use, etc.). Figure 5 presents the graph of a product study configuration. We noted that the whole set of technical objects constituting the product structure can be represented using a DAG (Directed Acyclic Graph). Each graph consists of a set of nodes (the technical objects); these nodes are connected by links. Links have properties (link type, start and expiry dates, etc.) allowing a more precise description of the relationships among the objects and thus authorizing the traceability of configuration evolutions.

Figure 5: Example of a study configuration graph in existing PLM applications. [The figure shows a node-link graph of numbered technical objects rooted at the study bill of materials ("nomenclature d'étude").]

The nodes can carry additional information (order, descriptions, etc.), which leads to the need to define a node entity, as presented in Figure 6. In order to implement applications according to customer requirements, this model includes a set of basic concepts (Figure 6.a) independent of any application use (e.g. part, configuration, user, group, etc.).
These concepts can be specialized to complete the model according to the application needs (Figure 6.b). The derivation of this conceptual model gives part of the physical model (relational model) currently implemented in the Advitium™ software package. The data processing related to configurations can draw on techniques and methods from graph theory. Indeed, in the handling and management of configurations, certain basic user functions are commonly used:

• searching, in the configuration of a given technical object, for all the descendants of this object (to obtain the configuration elements);
• searching, in the configuration of a given technical object, for all the ancestors of this object (to obtain the use cases);
• searching with criteria on the technical objects and on their link properties (e.g. validity dates); in certain searches, the criteria on link properties must hold for all the links constituting the path between the starting and the arrival technical objects.

In order to improve the application performance related to configuration management (in particular the search and navigation functions in configurations), we studied certain technical solutions that aim at implementing hierarchical structures in relational databases. Among these solutions, we mention the "nested sets" method [5], well adapted to handling tree structures in relational databases. Other research tasks address this type of problem, which consists in managing hierarchical data structures in relational databases [7, 30].

Figure 6: A part of the generic model.

3 The Reference Model Construction

PLM applications are based on complex and evolving product structures. The issues faced by both editors and integrators arise from the specific character of customers' projects.
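The descendant and ancestor searches listed in Section 2.3 amount to graph traversals over the configuration DAG. A minimal sketch follows; the object identifiers and link types are hypothetical examples, not Advitium's data model.

```python
from collections import deque

# Illustrative sketch: a configuration stored as a DAG of typed links,
# with the two basic user functions from Section 2.3: all descendants
# of an object (its configuration elements) and all ancestors (its use cases).

# Hypothetical edges: (parent, child, link_type).
edges = [
    ("site", "building_a", "composition"),
    ("site", "building_b", "composition"),
    ("building_a", "folder_plans", "composition"),
    ("building_b", "folder_plans", "composition"),  # re-used folder: a DAG, not a tree
    ("folder_plans", "drawing_1", "membership"),
]

def _reachable(start, neighbours):
    """Breadth-first search; the `seen` set handles node re-use in the DAG."""
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for nxt in neighbours.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

def descendants(obj):
    children = {}
    for parent, child, _ in edges:
        children.setdefault(parent, []).append(child)
    return _reachable(obj, children)

def ancestors(obj):
    parents = {}
    for parent, child, _ in edges:
        parents.setdefault(child, []).append(parent)
    return _reachable(obj, parents)

print(descendants("site"))          # the configuration elements of the site
print(ancestors("folder_plans"))    # the use cases of the re-used folder
```

The third search (criteria on link properties such as validity dates) would be obtained by filtering `edges` on the link type or date fields before the traversal.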
As company needs are often specific, a PLM solution implementation requires heavy investments, mainly in development. These developments require long implementation schedules and massive resources. One explanation lies in the fact that the developments are very specific, and that PLM solution integrators put the stress on the physical level rather than on the conceptual level. However, considering the needs at the conceptual level allows PLM application editors to capitalize the knowledge related to the engineering of their products and to rationalize the working methods of the design and development teams. Indeed, knowledge capitalization helps to preserve, to share and, above all, to re-use the know-how generated through customers' projects. This re-use thus allows:

• offering better control of engineering lead times and a more flexible software offer,
• limiting the maintenance costs of PLM applications,
• facilitating the evolution of PLM applications and allowing greater user autonomy.

3.1 The Generic Models

In order to capitalize, it is recommended to study a set of reference models per sector or trade, so as to have a base of standard or generic models. In fact, it is much easier to particularize a model dedicated to a sector of activity than to reinvent it each time. Thereby, we distinguish two trade levels:

• Generic level: at this level, a person is in charge of creating generic models by studying a certain number of similar cases already encountered and modeled in reference models. These models can be enriched, if necessary, with generic specificities, even if these specificities are not used so far, since they remain useful with respect to previous business cases.
• Particular level: at this level, a person particularizes the generic level suited to the trade in order to obtain the desired particular model. This model is particularized according to the specification elements.
Thus, this person will have to particularize an existing, partially defined model, and will only have to complete this initial model so as to provide the model to be implemented in the company, consequently saving time.

Figure 7: Process of model instantiation.

Figure 7 represents the interest of having generic models per activity sector to simplify the work of PLM solution integrators or developers.

3.2 Functions Deployment

It has already been shown, with respect to database modeling approaches, that a good data organization (models that are coherent and adapted to the needs) considerably simplifies processing and thus improves the performance of the applications concerned. Indeed, a generic implementation is often too general when it is used in a specific situation; this generality often causes inefficient execution. To increase performance, this implementation must be adapted in order to preserve only the functionalities necessary in the specific situation. It is thus suitable to define elementary functions and associate them with the generic models. These functions can be combined and organized to meet the specific needs expressed during model particularization. Moreover, for a better appropriation of the PLM application by its users, a "trade translation" of the functions is also necessary.

4 Summary and Conclusions

Product life cycle management is a recent field, and the perimeter of PLM applications is in constant evolution. This evolution implies that there is no single data model able to meet all customer requirements. Our work concerns generic solutions for technical data management based on the concept of a generic model. This generic model takes into account the configuration specification associated with technical objects. Having generic models per sector or per trade facilitates the work of PLM solution integrators or developers.
This work can be done through an audit to extract the modeling invariants. It is advisable to determine the trade sectors carefully. Thus, with the objective of the appropriation of PLM tools by users, this classification proposes practically ready-made solutions for deployment in terms of technical data modeling. This dimension allows decreasing or eliminating specific developments. In these models, the terminology is also well taken into account, since it is particularized for a given industrial sector.

References

[1] M. Aldanondo, H. Fargier, and M. Véron. Configuration, configurateurs et gestion de production, Hermès Science, Traité IC2 Productique, pp. 179-209, 2001.
[2] T. Asikainen, T. Männistö, and T. Soininen. Using a configurator for modelling and configuring software product lines based on feature models, WSVMPD, 2004.
[3] A. Bernard. Modèles et approches pour la conception et la production intégrées. Productique, méthodes et outils, CPI'99, numéro spécial de la revue JESA, vol. 37, issue 2-3, 2000.
[4] T. W. Carnduff and J. S. Goonetillake. Configuration management in evolutionary engineering design using versioning and integrity constraints, Advances in Engineering Software, vol. 35, issue 3-4, pp. 161-177, 2004.
[5] J. Celko. Joe Celko's Trees and Hierarchies in SQL for Smarties. Morgan Kaufmann, 2004.
[6] P. Y. Chao and T. D. Chen. Analysis of assembly through product configuration, Computers in Industry, vol. 44, pp. 189-203, 2001.
[7] Y. Chen. On the computation of recursion in relational databases. In Effective Databases for Text & Document Management, S. A. Becker, Ed., Idea Group Publishing, Hershey, PA, pp. 263-277, 2003.
[8] I. El Khalkhali. Système intégré pour la modélisation, l'échange et le partage des données de produits. Institut National des Sciences Appliquées de Lyon, 2002.
[9] B. Eynard, T. Gallet, P. Nowak, and L. Roucoules.
UML based specifications of PDM product structure and workflow. Computers in Industry, vol. 55, issue 3, pp. 301-316, 2004.
[10] H. Fargier and L. Henocque. Configuration à base de contraintes. Information Interaction Intelligence, actes des 2èmes Assises Nationales du GdR I3, pp. 141-159.
[11] AT&T Labs, 2005. Graphviz. Available from: http://www.research.att.com/sw/tools/graphviz/.
[12] D. Grigori, F. Casati, M. Castellanos, U. Dayal, M. Sayal, and M.-C. Shan. Business process intelligence, Computers in Industry, special issue on workflow mining, vol. 53, issue 3, pp. 321-343, 2004.
[13] Y. Harani. Une approche multi-modèles pour la capitalisation des connaissances dans le domaine de la conception. PhD thesis, Institut National Polytechnique de Grenoble, 1997.
[14] O. Hollmann, T. Wagner and A. Günter. EngCon – a flexible domain-independent configuration engine. Workshop Configuration at ECAI-2000, 2000.
[15] ISO 10007:2003. Quality management systems – Guidelines for configuration management, 2003.
[16] ISO/IEC 13250:2003. Information technology – SGML applications – Topic maps (2nd edition), 2003.
[17] H.-B. Jun, D. Kiritsis, and P. Xirouchakis. Product lifecycle information modeling with RDF, International Conference on Product Lifecycle Management, pp. 44-54, 2005.
[18] E. Kasper and J. Riis. Expected and realized costs and benefits when implementing product configuration systems, International Design Conference DESIGN 2004, Dubrovnik, Croatia, 2004.
[19] A. K. A. de Medeiros, W. M. P. van der Aalst, A. J. M. M. Weijters. Workflow mining: current status and future directions. CoopIS/DOA/ODBASE, LNCS 2888, pp. 389-406, 2003.
[20] T. Männistö, R. Sulonen. Evolution of schema and individuals of configurable products, in Proc. of ECDM'99 Workshop on Evolution and Change in Data Management, Versailles, France, Springer-Verlag, November 15-18, 1999.
[21] T. Männistö. A conceptual modelling approach to product families and their evolution.
PhD thesis, Helsinki University of Technology, 2000.
[22] P. Martin, L. Lossent, L. Abt and F. Brasset. Conception de machines spéciales: méthodologie d'élaboration de cahier des charges. Mécanique & Industries, vol. 5, pp. 305-316, 2003.
[23] O. Million, M. Lombard, G. Ris. Analysis and modeling of a technical information system: a modular approach. Proceedings of IDMME'1998 Integrated Design and Manufacturing in Mechanical Engineering, UTC, Compiègne (France), vol. 4, pp. 1245-1252, May 27-29, 1998.
[24] M. Nassar. VUML: a viewpoint oriented UML extension. In the 18th IEEE International Conference on Automated Software Engineering (ASE'2003), Montreal, Canada, 2003.
[25] F. Noël. A dynamic multi-view product model to share the product behaviours among designers. International Conference on Product Lifecycle Management, pp. 113-120, 2005.
[26] P. Nowak, B. Rose, L. Saint-Marc, B. Eynard, L. Gzara, M. Lombard. Towards a design process model enabling the integration of product, process and organisation, 5th International Conference on Integrated Design and Manufacturing in Mechanical Engineering, IDMME 2004, Bath (United Kingdom), April 5-7, 2004.
[27] OMG. OMG Unified Modeling Language Specification v1.5: revisions and recommendations, version 1.5. Object Management Group document formal/03-03-01, 2003.
[28] A. Saucier. Un modèle multi-vues du produit pour le développement et l'utilisation de systèmes d'aide à la conception en ingénierie mécanique, thèse, École Normale Supérieure de Cachan, France, 1997.
[29] T. Soininen and M. Stumptner. Introduction to special issue on configuration. Artificial Intelligence for Engineering Design, Analysis and Manufacturing, vol. 17, issue 1-2, 2003.
[30] S. Subbarayan and H. R. Andersen. Linear functions for interactive configuration using join matching and CSP tree decomposition. IJCAI-2005 Configuration Workshop, 2005.
[31] M. Véron.
Modélisation et résolution du problème de configuration industrielle : utilisation des techniques de satisfaction de contraintes. PhD thesis, Institut National Polytechnique, Tarbes, November 2001. [32] L. Yesilbas, B. Rose, M. Lombard. Specification of a repository to support collaborative knowledge exchanges in IPPOP project. Computers in Industry, Vol. 57, Issue 8-9, pp. 690-710. [33] H. Zhuge. A process matching approach for flexible workflow process reuse. Information and Software Technology, Vol. 44, pp. 445-450, 2002. [34] S. Zina, M. Lombard, L. Lossent. Integration of contextual views in configuration management for PLM applications. 9th IFAC Symposium on Automated Systems Based on Human Skill and Knowledge, 2006.

Souheïl Zina 1,2, Muriel Lombard 1, Luc Lossent 1, Charles Henriot 2
E-mail: s.zina@lascom.com
1 Centre de Recherche en Automatique de Nancy, UMR 7039, Université Henri Poincaré, Nancy I, Faculté des Sciences, B.P. 239, 54506 Vandoeuvre-lès-Nancy, France
2 Lascom, Burospace Antélia, 4 route de Gisy, 91571 Bièvres Cedex, France
Received: November 11, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 4, pp. 349-362

Generalized Modus Ponens Using Fodor's Implication and T-norm Product with Threshold

Ion Iancu
University of Craiova, Department of Mathematics and Computer Science
13 A. I. Cuza Street, 200585, Romania
E-mail: i_iancu@yahoo.com

Abstract: Using generalized modus ponens reasoning, we examine the values of the inferred conclusion depending on the correspondence between the premise of the rule and the observed fact. The conclusion is obtained using Fodor's implication to represent a fuzzy if-then rule with a single input and a single output, and the t-norm with threshold generated by the t-norm product as the compositional operator. A comparison with the case when the standard t-norm product is used is also made.
Some comments and an example are presented in order to show how the obtained results can be used.
Keywords: t-norm, t-conorm, negation, implication, fuzzy number, generalized modus ponens rule

1 Introduction

The database of a rule-based system may contain imprecisions, which appear in the description of the rules given by the expert. The imprecision makes it difficult to represent rules that are expressed, in general, in natural language. Another difficulty is the use of these rules in approximate reasoning when the observed facts do not match the condition of the rule. In order to obtain an imprecise conclusion from imprecise premises, Zadeh extended the traditional modus ponens rule, obtaining the generalized modus ponens (GMP). GMP inference has been investigated in many papers: [2], [3], [4], [5], [7], [9], [14], [15], [27], [28], [29], [30], [33], [34], [35]. We have also analyzed this type of inference in [19], [22], [24], [25], [26]. The proposition "X is A" can be understood as "the quantity X satisfies the predicate A" or "the variable X takes its values in the set A". The semantic content of the proposition "X is A" can be represented by π_X = µ_A, where π_X is the possibility distribution restricting the possible values of X and µ_A is the membership function of the set A. Because the majority of practical applications work with trapezoidal or triangular distributions, and these representations are still the subject of various recent papers ([1], [13] and [16], for instance), we will work with membership functions represented by trapezoidal fuzzy numbers. Such a number N = (a, b, α, β) is defined as

µ_N(x) = 0 for x < a − α
µ_N(x) = (x − a + α)/α for x ∈ [a − α, a]
µ_N(x) = 1 for x ∈ [a, b]
µ_N(x) = (b + β − x)/β for x ∈ [b, b + β]
µ_N(x) = 0 for x > b + β

Let X and Y be two variables whose domains are U and V, respectively.
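The piecewise trapezoidal membership function above translates directly into code; a minimal sketch (the function name `trapezoidal` and the closure style are ours, with the parameter order (a, b, α, β) taken from the definition in the text):

```python
def trapezoidal(a, b, alpha, beta):
    """Membership function of the trapezoidal fuzzy number N = (a, b, alpha, beta)."""
    def mu(x):
        if x < a - alpha or x > b + beta:   # outside the support
            return 0.0
        if x < a:                           # rising edge on [a - alpha, a]
            return (x - a + alpha) / alpha
        if x <= b:                          # core [a, b]
            return 1.0
        return (b + beta - x) / beta        # falling edge on [b, b + beta]
    return mu

mu = trapezoidal(2.0, 4.0, 1.0, 2.0)
# mu(1.5) -> 0.5 on the rising edge, mu(3.0) -> 1.0 in the core, mu(5.0) -> 0.5 on the falling edge
```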
A causal link from X to Y is represented by a conditional possibility distribution [35, 36] π_{Y/X} which restricts the possible values of Y for a given value of X. For the rule "if X is A then Y is B" we have

∀u ∈ U, ∀v ∈ V: π_{Y/X}(v, u) = µ_A(u) → µ_B(v)

where → is an implication operator and µ_A and µ_B are the possibility distributions of the propositions "X is A" and "Y is B", respectively. If µ_A′ is the possibility distribution of the proposition "X is A′", then from the rule "if X is A then Y is B" and the fact "X is A′" the generalized modus ponens rule computes the possibility distribution µ_B′ of the conclusion "Y is B′" as

µ_B′(v) = sup_{u∈U} T(µ_A′(u), π_{Y/X}(v, u)),

where T is a t-norm.

2 Basic concepts

The main concepts used in GMP are presented below, using the terminology of [8], [17] and [32].
Definition 1. A function T : [0, 1]² → [0, 1] is a t-norm iff it is commutative, associative, non-decreasing and T(x, 1) = x ∀x ∈ [0, 1].
Definition 2. A function S : [0, 1]² → [0, 1] is a t-conorm iff it is commutative, associative, non-decreasing and S(x, 0) = x ∀x ∈ [0, 1].
Definition 3. A function n : [0, 1] → [0, 1] is a strong negation iff it is an involutive and continuous decreasing function from [0, 1] to itself.
In order to represent a rule, the notion of fuzzy implication is used. We recall an axiomatic approach (formulated by Fodor in [10, 11, 12]) to the definition of fuzzy implication.
Definition 4.
An implication is a function I : [0, 1]² → [0, 1] satisfying the following conditions:
I1: if x ≤ z then I(x, y) ≥ I(z, y) for all x, y, z ∈ [0, 1]
I2: if y ≤ z then I(x, y) ≤ I(x, z) for all x, y, z ∈ [0, 1]
I3: I(0, y) = 1 (falsity implies anything) for all y ∈ [0, 1]
I4: I(x, 1) = 1 (anything implies tautology) for all x ∈ [0, 1]
I5: I(1, 0) = 0 (booleanity)
The following properties could be important in some applications:
I6: I(1, x) = x (tautology cannot justify anything) for all x ∈ [0, 1]
I7: I(x, I(y, z)) = I(y, I(x, z)) (exchange principle) for all x, y, z ∈ [0, 1]
I8: x ≤ y if and only if I(x, y) = 1 (implication defines ordering) for all x, y ∈ [0, 1]
I9: I(x, 0) = n(x) for all x ∈ [0, 1], where n is a strong negation
I10: I(x, y) ≥ y for all x, y ∈ [0, 1]
I11: I(x, x) = 1 (identity principle) for all x ∈ [0, 1]
I12: I(x, y) = I(n(y), n(x)) for all x, y ∈ [0, 1] and a strong negation n
I13: I is a continuous function.
The most important families of implications are given by
Definition 5. An S-implication associated with a t-conorm S and a strong negation n is defined by I_{S,n}(x, y) = S(n(x), y) ∀x, y ∈ [0, 1]. An R-implication associated with a t-norm T is defined by I_T(x, y) = sup{z ∈ [0, 1] | T(x, z) ≤ y} ∀x, y ∈ [0, 1]. A QL-implication is defined by I_{T,S,n}(x, y) = S(n(x), T(x, y)) ∀x, y ∈ [0, 1].
One of the most important implications is Fodor's implication

I_F(x, y) = 1 if x ≤ y, and max(1 − x, y) otherwise,

which is [5] an R-implication for T = min₀, an S-implication for S = max₀ and a QL-implication for T = min₀ and S = max₀, where

min₀(x, y) = 0 if x + y ≤ 1, and min(x, y) if x + y > 1
max₀(x, y) = 1 if x + y ≥ 1, and max(x, y) if x + y < 1

and n(x) = 1 − x. Besides, Fodor's implication verifies the properties I1-I12. An important class of t-norms (t-conorms) is given by the t-norms (t-conorms) with thresholds, obtained from standard t-norms (t-conorms); the number of thresholds is an integer n ≥ 1.
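Fodor's implication and the two nilpotent operators min₀ and max₀ above can be sketched directly; the final check confirms, on a sampled grid, that I_F coincides with the S-implication S(n(x), y) for S = max₀ and n(x) = 1 − x, as stated in the text (function names are ours, not from the paper):

```python
def fodor(x, y):
    """Fodor's implication I_F."""
    return 1.0 if x <= y else max(1.0 - x, y)

def min0(x, y):
    """Nilpotent minimum: 0 if x + y <= 1, else min(x, y)."""
    return 0.0 if x + y <= 1.0 else min(x, y)

def max0(x, y):
    """Nilpotent maximum: 1 if x + y >= 1, else max(x, y)."""
    return 1.0 if x + y >= 1.0 else max(x, y)

# I_F as an S-implication: I_F(x, y) = max0(1 - x, y).
# A dyadic grid (i/16) keeps the floating-point arithmetic exact.
grid = [i / 16.0 for i in range(17)]
assert all(fodor(x, y) == max0(1.0 - x, y) for x in grid for y in grid)
```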
The first examples of operators with one threshold were given by Pacholczyk in [31]. Various families of such t-operators can be found in [18, 20, 21, 23], where the advantage of using them to represent uncertain knowledge is justified. In this paper we analyze the results obtained by reasoning with imprecise knowledge using a t-norm with threshold as the composition operator. Finally, we compare these results with those obtained using the corresponding standard operators. We consider the following t-norm with a single threshold k ∈ (0, 1) [31]:

T_k(x, y) = (k/(1 − k)) T(((1 − k)/k) x, ((1 − k)/k) y) if x ≤ k and y ≤ k
T_k(x, y) = min(x, y) if x > k or y > k

obtained from the t-norm T(x, y). We will work with the t-norm generated by T_P(x, y) = xy, which is one of the most used; it results in

T_k(x, y) = ((1 − k)/k) xy if x ≤ k and y ≤ k
T_k(x, y) = min(x, y) if x > k or y > k

3 Main results

Taking into account the following reasons, we shall work with rules having a single input and a single output:
a) A rule with a multiple consequent can be treated as a set of rules with a single conclusion; for instance, the rule "if antecedent then C_1 and C_2 and ... and C_n" is equivalent to the rules "if antecedent then C_1", "if antecedent then C_2", ..., "if antecedent then C_n".
b) A rule with a multiple premise can be broken up into simple rules [6] when the rules are represented with any S-implication or any R-implication and the observations are normalized fuzzy sets.
Our aim is to obtain the conclusion "Y is B′" from the rule "if X is A then Y is B" and the fact "X is A′", where the fuzzy sets A, A′, B and B′ are represented by trapezoidal possibility distributions. The set B′ is computed as

µ_B′(v) = sup_{u∈U} T_k(µ_A′(u), I_F(µ_A(u), µ_B(v))),

analyzing five cases, depending on the relation between µ_A and µ_A′.
Theorem 6. If the premise contains the observation, i.e.
µ_A′(u) ≤ µ_A(u) ∀u ∈ U, then
µ_B′(v) = µ_B(v) if µ_B(v) ≥ 0.5
µ_B′(v) ∈ [µ_B(v), 1 − µ_B(v)) if µ_B(v) < 0.5.
Proof.
i1) Value on the set U_1 = {u ∈ U : µ_A(u) ≤ µ_B(v)}. Because I_F(µ_A(u), µ_B(v)) = 1, we have
µ_B′(v) = sup_{u∈U_1} T_k(µ_A′(u), 1) = sup_{u∈U_1} µ_A′(u) ≤ µ_B(v).
i2) Value on the set U_2 = {u ∈ U : µ_A(u) > µ_B(v) ≥ 0.5} ∪ {u ∈ U : µ_A(u) > 1 − µ_B(v) > 0.5}. We have I_F(µ_A(u), µ_B(v)) = µ_B(v). If k < µ_B(v), then
µ_B′(v) = sup_{u∈U_2} T_k(µ_A′(u), µ_B(v)) = sup_{u∈U_2} min(µ_A′(u), µ_B(v)) = µ_B(v).
For k ≥ µ_B(v) and U_2^1 = {u ∈ U_2 : µ_A′(u) ≤ k} we have
µ_B′(v) = sup_{u∈U_2^1} T_k(µ_A′(u), µ_B(v)) = sup_{u∈U_2^1} ((1 − k)/k) µ_A′(u) µ_B(v) ≤ (1 − k) µ_B(v) < µ_B(v).
For k ≥ µ_B(v) and U_2^2 = {u ∈ U_2 : µ_A′(u) > k} we obtain
µ_B′(v) = sup_{u∈U_2^2} T_k(µ_A′(u), µ_B(v)) = sup_{u∈U_2^2} min(µ_A′(u), µ_B(v)) = µ_B(v).
i3) Value on the set U_3 = {u ∈ U : µ_B(v) < µ_A(u) ≤ 1 − µ_B(v)}. In this case I_F(µ_A(u), µ_B(v)) = 1 − µ_A(u) and therefore
µ_B′(v) = sup_{u∈U_3} T_k(µ_A′(u), 1 − µ_A(u)).
For k < µ_B(v) we have 1 − µ_A(u) ≥ µ_B(v) > k and T_k ≡ min. It results in
µ_B′(v) = sup_{u∈U_3} min(µ_A′(u), 1 − µ_A(u)) < 1 − µ_B(v).
For µ_B(v) ≤ k ≤ 1 − µ_B(v) we analyze the cases:
i3.1) Value on the set U_3^1 = {u ∈ U_3 : µ_B(v) ≤ µ_A(u) < 1 − k}. Because k < 1 − µ_A(u), we obtain
µ_B′(v) = sup_{u∈U_3^1} T_k(µ_A′(u), 1 − µ_A(u)) = sup_{u∈U_3^1} min(µ_A′(u), 1 − µ_A(u)) < min(1 − k, 1 − µ_B(v)) = 1 − k.
i3.2) Value on the set U_3^2 = {u ∈ U_3 : µ_B(v) < 1 − k ≤ µ_A(u) ≤ 1 − µ_B(v)}. In this case 1 − µ_A(u) ≤ k and we study three possibilities, depending on µ_A′(u).
i3.2.1) On the set U_3^{2,1} = {u ∈ U_3^2 : µ_A′(u) = 0} we obtain µ_B′(v) = 0.
i3.2.2) On the set U_3^{2,2} = {u ∈ U_3^2 : µ_A′(u) ∈ (0, k]} we have
µ_B′(v) = sup_{u∈U_3^{2,2}} ((1 − k)/k) µ_A′(u)(1 − µ_A(u)) < k(1 − k).
i3.2.3) On the set U_3^{2,3} = {u ∈ U_3^2 : µ_A′(u) > k} we get
µ_B′(v) = sup_{u∈U_3^{2,3}} min(µ_A′(u), 1 − µ_A(u)) < 1 − µ_B(v).
For k > 1 − µ_B(v) we consider the set U_4 = {u ∈ U : 1 − k < µ_B(v) < µ_A(u) ≤ 1 − µ_B(v)} and we work with the subsets of U_4 for which µ_A′(u) = 0, µ_A′(u) ∈ (0, k] and µ_A′(u) > k, respectively; we obtain the corresponding results µ_B′(v) = 0, µ_B′(v) < k(1 − k) and µ_B′(v) < 1 − µ_B(v). Synthesizing the previous results, one obtains the conclusion formulated in the theorem.
Theorem 7. If the premise and the observation coincide, i.e. µ_A(u) = µ_A′(u) ∀u ∈ U, then
µ_B′(v) = µ_B(v) if k > 0.5 and µ_B(v) ≥ 1 − k,
µ_B′(v) ∈ [µ_B(v), 1 − k) if k > 0.5 and µ_B(v) < 1 − k,
µ_B′(v) = max(0.5, µ_B(v)) if k ≤ 0.5.
Proof. In this case one repeats the proof of Theorem 6, taking into account the equality µ_A(u) = µ_A′(u) ∀u ∈ U. It results in:
1) if 0.5 < k ≤ µ_B(v) then µ_B′(v) = µ_B(v)
2) if k ≤ 0.5 ≤ µ_B(v) then µ_B′(v) = µ_B(v)
3) if k ≤ µ_B(v) < 0.5 then µ_B′(v) = 0.5
4) if µ_B(v) ≤ k ≤ 0.5 then µ_B′(v) = 0.5
5) if 0.5 ≤ µ_B(v) < k then µ_B′(v) = µ_B(v)
6) if µ_B(v) ≤ 0.5 < k then µ_B′(v) = µ_B(v) if µ_B(v) ≥ 1 − k, and µ_B′(v) ∈ [µ_B(v), 1 − k) if µ_B(v) < 1 − k,
from which we get the conclusion.
Theorem 8. If the observation contains the premise, i.e. µ_A(u) ≤ µ_A′(u) ∀u ∈ U, then
µ_B′(v) ≥ max(µ_B(v), ((1 − k)/k) µ_B(v)(1 − µ_B(v))) if µ_B(v) ≤ min(0.5, k)
µ_B′(v) ≥ µ_B(v) otherwise.
Proof.
i1) Value on the set U_1 = {u ∈ U : µ_A(u) ≤ µ_B(v)}. Because I_F(µ_A(u), µ_B(v)) = 1, we have
µ_B′(v) = sup_{u∈U_1} min(µ_A′(u), 1) = sup_{u∈U_1} µ_A′(u) ≥ µ_B(v).
i2) Value on the set U_2 = {u ∈ U : 0.5 ≤ µ_B(v) < µ_A(u)} ∪ {u ∈ U : µ_A(u) > 1 − µ_B(v) > 0.5}. In this case I_F(µ_A(u), µ_B(v)) = µ_B(v) and
i2.1) for k < µ_B(v) we obtain
µ_B′(v) = sup_{u∈U_2} T_k(µ_A′(u), µ_B(v)) = sup_{u∈U_2} min(µ_A′(u), µ_B(v)) = µ_B(v)
i2.2) for k ≥ µ_B(v) we consider two subsets of U_2:
i2.2.1) on the subset U_2^1 = {u ∈ U_2 : µ_A′(u) ≤ k} we have
µ_B′(v) = sup_{u∈U_2^1} T_k(µ_A′(u), µ_B(v)) = sup_{u∈U_2^1} ((1 − k)/k) µ_A′(u) µ_B(v) ≤ (1 − k) µ_B(v) < µ_B(v)
i2.2.2) on the subset U_2^2 = {u ∈ U_2 : µ_A′(u) > k} we have
µ_B′(v) = sup_{u∈U_2^2} T_k(µ_A′(u), µ_B(v)) = sup_{u∈U_2^2} min(µ_A′(u), µ_B(v)) = µ_B(v).
i3) Value on the set U_3 = {u ∈ U : µ_B(v) < µ_A(u) ≤ 1 − µ_B(v)}. In this case I_F(µ_A(u), µ_B(v)) = 1 − µ_A(u) and we analyze the following cases.
i) If k < µ_B(v), then
µ_B′(v) = sup_{u∈U_3} T_k(µ_A′(u), 1 − µ_A(u)) = sup_{u∈U_3} min(µ_A′(u), 1 − µ_A(u)) < 1 − µ_B(v).
ii) If k ≥ µ_B(v), we consider the following subcases:
ii1) µ_B(v) ≤ k ≤ 1 − µ_B(v):
ii1.1) on the set U_3^1 = {u ∈ U_3 : µ_B(v) ≤ µ_A(u) < 1 − k} we have
µ_B′(v) = sup_{u∈U_3^1} min(µ_A′(u), 1 − µ_A(u)) < 1 − µ_B(v)
ii1.2) on the set U_3^2 = {u ∈ U_3 : 1 − k ≤ µ_A(u) ≤ 1 − µ_B(v)} we consider two subsets:
• U_3^{2,1} = {u ∈ U_3^2 : µ_A′(u) ≤ k}, for which we obtain
µ_B′(v) = sup_{u∈U_3^{2,1}} ((1 − k)/k) µ_A′(u)(1 − µ_A(u)) ≥ sup_{u∈U_3^{2,1}} ((1 − k)/k) µ_A(u)(1 − µ_A(u)) ≥ max((1 − k)², ((1 − k)/k) µ_B(v)(1 − µ_B(v))) ≥ ((1 − k)/k) µ_B(v)(1 − µ_B(v))
• U_3^{2,2} = {u ∈ U_3^2 : µ_A′(u) > k}, for which we have
µ_B′(v) = sup_{u∈U_3^{2,2}} min(µ_A′(u), 1 − µ_A(u)) < 1 − µ_B(v).
ii2) k > 1 − µ_B(v), which defines the set U_4 = {u ∈ U : 1 − k < µ_B(v) < µ_A(u) ≤ 1 − µ_B(v)}:
• for µ_A′(u) ≤ k we obtain
µ_B′(v) = sup_{u∈U_4} ((1 − k)/k) µ_A′(u)(1 − µ_A(u)) ≥ sup_{u∈U_4} ((1 − k)/k) µ_A(u)(1 − µ_A(u)) ≥ ((1 − k)/k) µ_B(v)(1 − µ_B(v))
• for µ_A′(u) > k it results in
µ_B′(v) = sup_{u∈U_4} min(µ_A′(u), 1 − µ_A(u)) < 1 − µ_B(v).
Finally we obtain the conclusion formulated in the theorem.
Theorem 9.
If there is a partial overlapping between the sets A and A′, then
µ_B′(v) = 1 if core(A′) ∩ (U − A_{µ_B(v)}) ≠ ∅, and
µ_B′(v) ≥ µ_B(v) otherwise,
where A_α denotes the α-cut of A.
Proof.
i1) The case core(A′) ∩ (U − A_{µ_B(v)}) ≠ ∅. On the set U_1 = {u ∈ U : µ_A(u) ≤ µ_B(v)} we have I_F(µ_A(u), µ_B(v)) = 1 and therefore
µ_B′(v) = sup_{u∈U_1} T_k(µ_A′(u), 1) = 1.
i2) The case core(A′) ∩ (U − A_{µ_B(v)}) = ∅. On the set U_2 = {u ∈ U : µ_A(u) > µ_B(v) ≥ 0.5} we have I_F(µ_A(u), µ_B(v)) = µ_B(v) and therefore
µ_B′(v) = sup_{u∈U_2} T_k(µ_A′(u), µ_B(v)) ≥ T_k(1, µ_B(v)) = µ_B(v).
If µ_B(v) < 0.5 we analyze three cases. Let Ũ = {u ∈ U : µ_A(u) = µ_B(v)}; card(Ũ) = 2 if 0 < µ_B(v) < 1.
i2.1) The case Ũ ∩ supp(A′) = ∅ and core(A′) ∩ core(A) ≠ ∅. On the set U_3 = {u ∈ U : µ_A(u) ≥ 1 − µ_B(v) > 0.5} it results in
µ_B′(v) = sup_{u∈U_3} T_k(µ_A′(u), µ_B(v)) ≥ T_k(1, µ_B(v)) = µ_B(v).
i2.2) The case Ũ ∩ supp(A′) = ∅ and core(A′) ∩ core(A) = ∅. We consider the set U_4 = {u ∈ U : µ_B(v) < µ_A(u) ≤ 1 − µ_B(v)}; on the set U_5 = U_3 ∪ U_4 we have
µ_B′(v) = sup_{u∈U_5} T_k(µ_A′(u), I_F(µ_A(u), µ_B(v))) ≥ sup_{u∈U_5} T_k(µ_A′(u), µ_B(v)) ≥ T_k(1, µ_B(v)) = µ_B(v).
i2.3) The case Ũ ∩ supp(A′) ≠ ∅. On the set U_5 we obtain µ_B′(v) ≥ µ_B(v), as in the previous case.
It results that, in case i2), µ_B′(v) ≥ µ_B(v). The same result is obtained for µ_B(v) ∈ {0, 1}.
We consider the negation with threshold k ∈ (0, 1) [31]

n_k(x) = 1 − ((1 − k)/k) x if x ≤ k
n_k(x) = (k/(1 − k))(1 − x) if x ≥ k

obtained from the standard negation n(x) = 1 − x.
Theorem 10. If the premise and the observation are contradictory, i.e. µ_A′(u) = n_k(µ_A(u)) ∀u ∈ U, then µ_B′(v) = 1 ∀v ∈ V.
Proof. On the set U_1 = {u ∈ U : µ_A(u) ≤ µ_B(v)} we have
µ_B′(v) = sup_{u∈U_1} T_k(µ_A′(u), 1) = sup_{u∈U_1} min(µ_A′(u), 1) = sup_{u∈U_1} µ_A′(u) = sup_{u∈U_1} n_k(µ_A(u)) = 1
because there is u ∈ U with µ_A(u) = 0.
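The two threshold operators used above can be sketched as follows. This is a simple illustration, with k = 0.75 chosen arbitrarily; the factory names `t_product_threshold` and `negation_threshold` are ours:

```python
def t_product_threshold(k):
    """Product t-norm with threshold k in (0, 1): T_k(x, y)."""
    def t_k(x, y):
        if x <= k and y <= k:
            return (1.0 - k) / k * x * y   # rescaled product below the threshold
        return min(x, y)                   # min above the threshold
    return t_k

def negation_threshold(k):
    """Threshold negation n_k obtained from n(x) = 1 - x."""
    def n_k(x):
        if x <= k:
            return 1.0 - (1.0 - k) / k * x
        return k / (1.0 - k) * (1.0 - x)
    return n_k

t_k = t_product_threshold(0.75)
n_k = negation_threshold(0.75)
assert t_k(0.3, 1.0) == 0.3                # T_k(x, 1) = x (the second argument exceeds k)
assert n_k(0.0) == 1.0 and n_k(1.0) == 0.0 # boundary values of a negation
assert abs(n_k(0.75) - 0.75) < 1e-12       # the two branches of n_k meet at x = k
```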
4 Interpretation and utilization of results

In this section we compare the results given by the standard operators (the t-norm product T_P(x, y) = xy and the negation n(x) = 1 − x) with those obtained with the corresponding threshold operators, and we indicate some possibilities for their utilization in a fuzzy reasoning system. An example of working with these results is also presented. In the case of the standard operators T_P and n, according to [24] we have:
Theorem 11. If the premise contains the observation, i.e. µ_A′(u) ≤ µ_A(u) ∀u ∈ U, then
µ_B′(v) = µ_B(v) if µ_B(v) ≥ 0.5 or 0.25 ≤ µ_B(v) < 0.5
µ_B′(v) < 0.25 if µ_B(v) < 0.25.
Theorem 12. If the premise and the observation coincide, i.e. µ_A(u) = µ_A′(u) ∀u ∈ U, then µ_B′(v) = max(µ_B(v), 0.25).
Theorem 13. If the observation contains the premise, i.e. µ_A(u) ≤ µ_A′(u) ∀u ∈ U, then µ_B′(v) ≥ µ_B(v) ∀v ∈ V.
Theorem 14. If there is a partial overlapping between the sets A and A′, then µ_B′(v) = 1 if core(A′) ∩ (U − A_{µ_B(v)}) ≠ ∅, and µ_B′(v) ≥ µ_B(v) otherwise, where A_α denotes the α-cut of A.
Theorem 15. If the premise and the observation are contradictory, i.e. µ_A′(u) = 1 − µ_A(u) ∀u ∈ U, then µ_B′(v) = 1 ∀v ∈ V.
If the observation is more precise than the premise of the rule, then it gives more information than the premise. However, it does not seem reasonable to think that the generalized modus ponens allows us to obtain a conclusion more precise than that of the rule. The result of the inference is valid if µ_B′(v) = µ_B(v) ∀v ∈ V. Sometimes the deduction operation allows a reinforcement of the conclusion, as specified in [28], [19] and [25]:
Rule: if the tomato is red then the tomato is ripe.
Observation: this tomato is very red.
If we know that the maturity degree increases with respect to color, we can infer "this tomato is very ripe".
On the other hand, in the example
Rule: if the melon is ripe then it is sweet.
Observation: the melon is very ripe.
we do not infer that "the melon is very sweet", because it can be so ripe that it is rotten. These examples show that if the expert has no supplementary information about the connection between the variation of the premise and that of the conclusion, he must be satisfied with the conclusion µ_B′(v) = µ_B(v). Theorem 6 gives a valid result if we choose µ_B′(v) = µ_B(v) for µ_B(v) < 0.5. By contrast, the corresponding Theorem 11 for the standard t-norm T_P does not allow a valid result to be obtained if µ_B(v) < 0.25. When the observation and the premise of the rule coincide, the desired behavior of the fuzzy deduction is to obtain an identical conclusion; a different conclusion indicates the appearance of an uncertainty in the conclusion. Both Theorems 7 and 12 give an uncertain conclusion, but we can choose k > 0.5 in Theorem 7 and obtain a better result, because the uncertainty is smaller than in the result of Theorem 12. If the observation contains the premise, because max(((1 − k)/k) µ_B(v)(1 − µ_B(v)), µ_B(v)) ≥ µ_B(v), Theorem 8 gives a better result than Theorem 13. In this case the inferred conclusion B′ is a superset of B; we can choose the first superset. If there is a partial overlapping between the premise and the observation, or the premise and the observation are contradictory, then the two t-norms give the same results for the inferred conclusion. The value µ_B′(v) = 1 obtained in these cases represents an indeterminate conclusion, all elements v ∈ V having a possibility equal to 1. In the case of "partial overlapping" we propose a "mediation" between the two possible values µ_B′(v) = 1 and µ_B′(v) ≥ µ_B(v): if B_1, B_2, ..., B_k are the supersets of B with µ_{B_k}(v) ≥ µ_{B_{k−1}}(v) ≥ ... ≥ µ_{B_1}(v), we can choose B′ = B_{[k/2]}, where [x] is the greatest integer which is smaller than or equal to x.
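The comparison between the coincidence cases (Theorems 7 and 12) can be checked by brute force on a sampled universe. In the sketch below, the trapezoid parameters, the grid and k = 0.9 are arbitrary choices of ours; at a point v with µ_B(v) = 0, the standard product t-norm yields the uncertainty floor 0.25 of Theorem 12, while T_k with k > 0.5 stays below 1 − k = 0.1, as Theorem 7 predicts:

```python
def trap(a, b, alpha, beta):
    """Trapezoidal membership function (a, b, alpha, beta)."""
    def mu(x):
        if x < a - alpha or x > b + beta: return 0.0
        if x < a: return (x - a + alpha) / alpha
        if x <= b: return 1.0
        return (b + beta - x) / beta
    return mu

def fodor(x, y):
    return 1.0 if x <= y else max(1.0 - x, y)

def gmp(mu_obs, mu_prem, b_val, tnorm, us):
    """mu_B'(v) = sup_u T(mu_A'(u), I_F(mu_A(u), mu_B(v))) on a sampled universe."""
    return max(tnorm(mu_obs(u), fodor(mu_prem(u), b_val)) for u in us)

k = 0.9
t_p = lambda x, y: x * y
t_k = lambda x, y: (1.0 - k) / k * x * y if x <= k and y <= k else min(x, y)

mu_a = trap(4.0, 6.0, 2.0, 2.0)        # premise = observation (coincidence case)
us = [i / 10.0 for i in range(101)]    # sampled universe [0, 10]
b_val = 0.0                            # a point v where mu_B(v) = 0

print(gmp(mu_a, mu_a, b_val, t_p, us)) # ~0.25: the uncertainty floor of Theorem 12
print(gmp(mu_a, mu_a, b_val, t_k, us)) # below 1 - k = 0.1: Theorem 7 with k > 0.5
```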
Theorem 10 gives the expected result, which represents one of the basic properties of GMP reasoning. The results of Theorems 6-10 can be used in a fuzzy inference system as in the following example. A customer is interested in buying a computer. The quality of the computer depends on its price, as specified by the rules:
Rule 1: if the price is very low then the quality is below average.
Rule 2: if the price is very very high then the quality is very good.
Rule 3: if the price is middle then the quality is good.
The variable price has values in the following set of linguistic terms L_P = {very very low, very low, low, middle, high, very high, very very high} and the variable quality has values in the set L_Q = {poor, below average, average, above average, good, very good}. We consider the universes of discourse [0, 2200] for price and [0, 10] for quality. The linguistic terms are represented by trapezoidal fuzzy numbers (a, b, α, β); these fuzzy numbers are depicted in Figures 1 and 2.
We consider the observations:
Observation 1: the price is very very low.
Observation 2: the price is very high.
Observation 3: the price is high.
Theorems 6-10, used together with the comments from this section, give the following results:
1) the conclusion obtained from Rule 1 and Observation 1 is "the quality is below average"; this result is obtained with Theorem 6;
2) Theorem 8 is applied to Rule 2 and Observation 2 and gives the conclusion "the quality is good";
3) using Theorem 9 for Rule 3 and Observation 3, one obtains the conclusion "the quality is average".
Figure 1: Fuzzy sets for the linguistic terms from the list L_P
Figure 2: Fuzzy sets for the linguistic terms from the list L_Q
As can be observed from this example, our results allow us to obtain the inferred conclusion by a very simple calculus in comparison with the standard formula used in GMP.

5 Summary and conclusions

The results obtained in this paper explain how the generalized modus ponens rule works with Fodor's implication and the t-norm product with threshold. Combining these results with the approximations proposed in the previous section, we obtain a fast answer for the value of the conclusion inferred by GMP reasoning. We worked with the t-norm product because it is one of the most used in practical applications. As follows from the previous sections, one obtains better results in the case of the t-norm with threshold. In a future paper we will analyze the results given by other t-norms with threshold and other implications.

Bibliography

[1] T. Allahviranloo, M.
Adabitabar Firozja. Note on "Trapezoidal approximation of fuzzy numbers". Fuzzy Sets and Systems 7, 2007, pp. 755-756. [2] B. Bouchon. Inferences with imprecisions and uncertainties in expert systems. In: A. Kandel (ed.), Fuzzy Expert Systems, Addison-Wesley, New York, 1987. [3] J. C. Buisson, H. Farreny, H. Prade. Dealing with imprecision and uncertainty in the expert system DIABETO III. In: Proc. 2nd Int. Conf. on Artificial Intelligence, Marseille, 1986, pp. 705-721. [4] J. F. Baldwin. A new approach to approximate reasoning using fuzzy logic. Fuzzy Sets and Systems 2, 1979, pp. 309-325. [5] E. Czogala, J. Leski. On equivalence of approximate reasoning results using different interpolations of fuzzy if-then rules. Fuzzy Sets and Systems 11, 2001, pp. 279-296. [6] K. Demirli, I. B. Turksen. Rule break up with compositional rule of inference. IEEE International Conference on Fuzzy Systems, San Diego, 1992, pp. 949-956. [7] D. Dubois, H. Prade. Operations in a fuzzy-valued logic. Information and Control 43, 1979, pp. 224-240. [8] D. Dubois. Modèles mathématiques de l'imprécis et de l'incertain en vue d'applications aux techniques d'aide à la décision. Thèse d'État, Univ. Scientifique et Médicale, Grenoble, 1983. [9] D. Dubois, H. Prade. Fuzzy logics and the generalized modus ponens revisited. Cybernetics and Systems 15, 1984, pp. 293-331. [10] J. Fodor. On fuzzy implication operators. Fuzzy Sets and Systems 42, 1991, pp. 293-300. [11] J. Fodor, M. Roubens. Fuzzy Preference Modelling and Multicriteria Decision Support. Kluwer, Dordrecht, 1994. [12] J. Fodor. Contrapositive symmetry of fuzzy implications. Fuzzy Sets and Systems 69, 1995, pp. 141-156. [13] J. Fodor, B. Bede. Recent advances in fuzzy arithmetics. International Journal of Computers, Communications and Control, Vol. 1, Suppl. 5, 2006, pp. 199-207. [14] S. Fukami, M. Mizumoto, K. Tanaka. Some considerations on fuzzy conditional inference. Fuzzy Sets and Systems 4, 1980, pp. 243-273. [15] B. R.
Gaines. Foundations of fuzzy reasoning. Int. J. Man-Machine Studies 8, 1976, pp. 623-668. [16] P. Grzegorzewski, E. Mrowka. Trapezoidal approximations of fuzzy numbers revisited. Fuzzy Sets and Systems 7, 2007, pp. 757-768. [17] M. M. Gupta, J. Qi. Theory of t-norms and fuzzy inference methods. Fuzzy Sets and Systems 40, 1991, pp. 431-450. [18] I. Iancu. T-norms with threshold. Fuzzy Sets and Systems (Int. J. of Soft Computing and Intelligence) 85, 1997, pp. 83-92. [19] I. Iancu. Propagation of uncertainty and imprecision in knowledge-based systems. Fuzzy Sets and Systems 94, 1998, pp. 29-43. [20] I. Iancu. On a family of t-operators. Annals of the Univ. of Craiova, Mathematics and Computer Science Series, Vol. XXVI, 1999, pp. 84-92. [21] I. Iancu. Fuzzy connectives with applications in uncertainty management. Proc. of the 3rd Annual Meeting of the Romanian Society of Mathematical Sciences, 1999, pp. 40-47. [22] I. Iancu. On a representation of an uncertain body of evidence. Annals of the Univ. of Craiova, Mathematics and Computer Science Series, Vol. XXX, 2003, pp. 100-108. [23] I. Iancu. Operators with n-thresholds for uncertainty management. J. Appl. Math. & Computing, Vol. 19, No. 1-2, 2005, pp. 1-17, Springer, Berlin/Heidelberg. [24] A. Giurca, I. Iancu. Approximate reasoning using Fodor's implication. Computational Intelligence, Theory and Applications, Int. Conf. 9th Fuzzy Days in Dortmund, Springer, Berlin, 2006, pp. 513-520. [25] I. Iancu. Generalized modus ponens using Fodor's implication and a parametric t-norm. WSEAS Transactions on Systems 6 (Vol. 7), 2008, pp. 738-747. [26] I. Iancu. Generalized modus ponens reasoning for rules with partial overlapping between premise and observation. European Computing Conference, Malta, September 2008, pp. 37-43. [27] E. H.
Mamdani. Application of fuzzy logic to approximate reasoning using linguistic systems. IEEE Trans. on Computers 26, 1977, pp. 1182-1191. [28] M. Mizumoto, H.-J. Zimmermann. Comparison of fuzzy reasoning methods. Fuzzy Sets and Systems 8, 1982, pp. 253-283. [29] M. Mizumoto. Fuzzy reasoning under new compositional rules of inference. Kybernetes 12, 1985, pp. 107-117. [30] A. Nafarieh, J. M. Keller. A new approach to inference in approximate reasoning. Fuzzy Sets and Systems 41, 1991, pp. 17-37. [31] D. Pacholczyk. Introduction d'un seuil dans le calcul de l'incertitude en logique floue. BUSEFAL 32, 1987, pp. 11-18. [32] I. J. Rudas, J. Fodor. Information aggregation in intelligent systems using generalized operators. International Journal of Computers, Communications and Control, Vol. 1, No. 1, 2006, pp. 47-57. [33] Y. Tsukamoto. An approach to fuzzy reasoning method. In: M. M. Gupta, R. K. Ragade and R. R. Yager (eds.), Advances in Fuzzy Set Theory and Applications, North-Holland, Amsterdam, 1979, pp. 137-149. [34] R. R. Yager. An approach to inference in approximate reasoning. Int. J. Man-Machine Studies 13, 1980, pp. 323-338. [35] L. A. Zadeh. Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets and Systems 1, 1978, pp. 3-28. [36] L. A. Zadeh. A theory of approximate reasoning. In: J. E. Hayes, D. Michie and L. I. Mikulich (eds.), Machine Intelligence, John Wiley & Sons, New York, 1979, pp. 149-194.

Ion Iancu received his degree in mathematics from the University of Craiova and his Ph.D. in artificial intelligence from the University of Bucharest. Currently, he is a professor of computer science and artificial intelligence at the University of Craiova. His main topics of interest include fuzzy sets, modeling of uncertain and imprecise knowledge, approximate reasoning, and soft computing. He has published a number of research articles in these areas in various journals and proceedings, and he is the author of eight books.
International Journal of Computers, Communications & Control, Vol. I (2006), No. 2, pp. 85-94

Coordinated Control of Mobile Robots Based on Artificial Vision

Carlos M. Soria, Ricardo Carelli, Rafael Kelly, Juan M. Ibarra Zannatha

Abstract: This work presents a control strategy for the coordination of multiple robots, based on artificial vision to measure the relative posture between them, in order to reach and maintain a specified formation. Given a leader robot that moves along an unknown trajectory with unknown velocity, a controller is designed that keeps the follower robots behind the leader at a certain distance, using visual information about the position of the leader robot. The control system is proved to be asymptotically stable at the equilibrium point, which corresponds to the accomplishment of the navigation objective. Experimental results with two robots, a leader and a follower, are included to show the performance of the vision-based control system.
Keywords: mobile robots, coordinated robots, vision-based control, artificial vision.

1 Introduction

During the last years, efforts have been made to give autonomy to single mobile robots by using different sensors, actuators and advanced control algorithms. This was mainly motivated by the necessity to develop complex tasks in an autonomous way, as demanded by service or production applications. Robots have thus become highly sophisticated systems. In some applications, a valid alternative (or even the mandatory solution) is the use of multiple simple robots which, operating in a coordinated way, can develop complex tasks ([1]; [2]; [7]). This alternative offers additional advantages in terms of flexibility in operating the group of robots and failure tolerance due to redundancy in the available mobile robots [6]. As the number of robots increases, the task of controlling the system becomes more complex. Control strategies can be classified as centralized and decentralized.
In a centralized control system, all planning and control functions are carried out in a single control unit. Each mobile robot has a few simple sensors, the actuators and the communication system with the single control unit. Every motion and every conflict between robots is handled by the control unit. The system is therefore vulnerable to any failure that may occur in the control unit. With the decentralized control approach, on the other hand, each robot is equipped with multiple sensors and a controller, thus becoming capable of recognizing the environment and taking its own control actions ([3]; [5]). The function of the control center, if available, is to assign tasks to each robot and to govern the information flow in the system. The coordinated control of robots allows a team of robots to perform missions that are not easy or not viable for a single robot. The guidance of a mobile robot requires its localization in the environment [4]. Precise localization is needed when multiple robots share a common environment. Odometric sensors, sonar sensors, gyros, lasers, vision and their fusion are commonly used for robot localization and environment modeling. Vision sensors are increasingly applied because of their ever-growing capability to capture information. This work proposes a strategy based on capturing vision information with single cameras mounted on the follower robots, to obtain the relative posture of a leader robot and to guide a specified robot formation. The controller design is then obtained using nonlinear control theory, and the overall control system is proved to be asymptotically stable. The paper is organized as follows. The concept of robot formation is briefly discussed in Section 2 and the robot kinematics model in Section 3. The visual measurement of the posture of a leader robot is presented in Section 4. The controller design and the stability analysis are given in Section 5. Some experimental results are discussed in Section 6.
finally, section 7 presents some concluding comments. copyright c© 2006 by ccc publications 2 robots formation in this work the following formation strategy is considered. one of the robots is defined as the leader robot, and moves along a trajectory unknown to the other, follower robots. the leader has a pattern mounted on its back, which is observed by the follower robots to obtain information about their relative posture to the leader. this information is used to control their position to reach the specified formation. the visual information is obtained by a forward-looking camera mounted on each follower robot. figure 1 shows typical triangle and convoy configurations that can be defined for three robots by using this strategy. figure 1: a) triangle formation. b) convoy formation. 3 mobile robot model in this work the unicycle-like mobile robot is considered, which is described by the following kinematic equations:

ẋ = v cos ϕ
ẏ = v sin ϕ   (1)
ϕ̇ = ω

where (x, y) are the cartesian coordinates of the robot position and ϕ is the robot heading or orientation angle; v and ω are the robot's linear and angular velocities. the non-holonomic restriction for model (1) is

ẏ cos ϕ − ẋ sin ϕ = 0   (2)

which states that the velocity is tangent to any feasible trajectory of the robot. the reference point of the robot is assumed to be the middle point between the two driven wheels, fig. 2. v1 and v2 denote the linear speeds of the left and the right wheels, respectively. the linear and angular velocities of the robot can be expressed as v = (v1 + v2)/2 and ω = (v1 − v2)/l respectively, where l represents the distance between the two driven wheels. figure 2: geometric description of the mobile robot.
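as a minimal illustration, the kinematic model (1) together with the wheel-speed relations above can be integrated numerically with a forward euler step. this is a sketch only: the wheel base l and the step dt are illustrative values, not taken from the paper.

```python
import math

def unicycle_step(x, y, phi, v1, v2, L=0.3, dt=0.1):
    """one euler step of the unicycle model (1): wheel speeds v1 (left)
    and v2 (right) give v = (v1 + v2)/2 and omega = (v1 - v2)/L, as in
    the text; L [m] and dt [s] are illustrative values."""
    v = (v1 + v2) / 2.0
    omega = (v1 - v2) / L
    x += v * math.cos(phi) * dt
    y += v * math.sin(phi) * dt
    phi += omega * dt
    return x, y, phi

# equal wheel speeds -> straight-line motion along the current heading
x, y, phi = unicycle_step(0.0, 0.0, 0.0, 0.5, 0.5)
```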
4 visual measurement of the leader posture the control objective is that the follower robots follow the leading robot, which evolves with unknown motion in the working area. the robots have been equipped with a forward-looking fixed vision camera. this camera captures the image of a pattern mounted on the leading vehicle that features four marks on a square of known side length e [m]. in order to ease the calculations, and without loss of generality, the height of the horizontal median of this square is made to coincide with the height of the camera image's center. the projected pattern on the camera image will appear with a projection distortion, as represented in fig. 3. the positions of the pattern's marks on the image are expressed in pixels as (xi, yi), with i = a, b, c, d. these variables are considered as the image features. figure 3: image of the pattern's marks. from these image features, as measured by the vision system, it is possible to compute the posture of the leading or target vehicle (xt, zt, ϕt), measured in a coordinate system associated to the camera (xc, zc). fig. 4 shows a diagram with the horizontal projection of the vision system, showing the posture of the leader or target vehicle in the camera's coordinates of a follower robot, from which the following expressions are obtained:

xt = (xr + xl)/2
zt = (zr + zl)/2   (3)
ϕt = cos⁻¹((xr − xl)/e)

by resorting now to the model of reverse perspective, expressions are obtained to compute the leading vehicle's posture (3) as a function of the measurements supplied by the vision system, fig. 4:

xl = ((zl − f)/f) xa,  xr = ((zr − f)/f) xb   (4)
zl = f (e/hl + 1),  zr = f (e/hr + 1)

using the variables from (3) and (4), and assuming that the camera is mounted on the robot's center, the relative posture of the follower robot with respect to the leading robot (ϕ, θ, d) can then be calculated. the relative posture is defined as in figs. 4 and 5.
these variables are calculated by the following expressions:

φ = tan⁻¹((zr − zl)/(xr − xl))
ϕ = tan⁻¹(xt/zt)   (5)
θ = ϕ + φ
d = √(xt² + zt²)

figure 4: location of the leader with respect to the camera. figure 5: relative position between the leader and a follower robot. 5 servo visual control of the mobile robot 5.1 controller figure 6 defines the control structure to be used in this work, where h represents the relationship between the variables defining the relative posture of the follower and the leading robot, and the image features ξ. figure 6: control structure. the control objective is defined as follows: assuming that a leading robot moves along an unknown trajectory, with unknown velocity as well, make the follower robot keep a desired distance dd to the leader while pointing at it (that is, ϕd = 0), using only visual information, fig. 5. more specifically, the control objective can be expressed as

lim t→∞ e(t) = lim t→∞ (dd − d) = 0   (6)
lim t→∞ ϕ̃(t) = lim t→∞ (ϕd − ϕ) = 0

the evolution of the posture of the follower robot relative to the leader is stated by the time derivatives of the two error variables. the variation of the distance error is given by the difference between the projection of the leader's velocity and the follower robot's velocity on the line connecting both vehicles, that is:

ė = −vt cos θ + v cos ϕ̃   (7)

likewise, the variation of the angle error has three terms: the angular velocity of the follower robot, and the rotational effects of the linear velocities of both robots, which can be expressed as:
ϕ̃̇ = ω + (vt sin θ)/d + (v sin ϕ̃)/d   (8)

for the system dynamics expressed by (7) and (8), the following nonlinear controller is proposed to achieve the control objective given in (6):

v = (1/cos ϕ̃)(vt cos θ − f(e))   (9)
ω = −f(ϕ̃) − (vt sin θ)/d − (v sin ϕ̃)/d

in (9), f(e), f(ϕ̃) ∈ ω, with ω the set of functions that meet the following definition: ω = { f : ℜ → ℜ / f(0) = 0 and x f(x) > 0 ∀x ≠ 0 }. in particular, the following functions are considered: f(e) = ke tanh(λe e) and f(ϕ̃) = kϕ̃ tanh(λϕ̃ ϕ̃). these functions prevent the control actions from becoming saturated. the variables used by this controller (θ, ϕ, d), as given by (5), are calculated from the image captured by the vision system. by combining (7), (8) and (9), the closed-loop system is obtained:

ė = −f(e)   (10)
ϕ̃̇ = −f(ϕ̃)

5.2 stability analysis considering the system (10), with its single equilibrium point at the origin, the following lyapunov candidate function

v = e²/2 + ϕ̃²/2   (11)

has a time-derivative on the system trajectories given by

v̇ = −e f(e) − ϕ̃ f(ϕ̃)   (12)

which is negative definite since f ∈ ω. the asymptotic stability of the equilibrium then follows, that is: e(t) → 0, ϕ̃(t) → 0 as t → ∞. it should be noted that the controller (9) requires knowing the linear velocity vt of the leader vehicle. this variable has to be estimated from the available visual information. by approximating the derivative of the position error in (7) by the discrete difference between successive positions, and considering a 0.1 s sampling period, the target velocity can be approximated as follows:

v̂t = ((dk − dk−1)/0.1 + v cos ϕ̃)/cos θ

6 experiments in order to evaluate the performance of the proposed coordination control algorithm, experiments were carried out with two pioneer 2dx mobile robots (fig. 7). each robot has its own control system.
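as a minimal sketch, the measurement equations (3)-(5) and the control law (9) can be put together in code. the focal length f and pattern side e below are illustrative values (the paper does not list them), while the gains are those later reported for the experiments; xa, xb are the horizontal image coordinates of the marks and hl, hr the projected heights of the pattern's left and right edges.

```python
import math

def leader_posture(xa, xb, hl, hr, f=0.006, e=0.30):
    """reverse perspective, eqs. (3)-(5); f and e are illustrative."""
    # eq. (4): depths and lateral offsets of the pattern's left/right edges
    zl = f * (e / hl + 1.0)
    zr = f * (e / hr + 1.0)
    xl = (zl - f) / f * xa
    xr = (zr - f) / f * xb
    # eq. (3): target position; eq. (5): relative angles and distance
    xt = (xr + xl) / 2.0
    zt = (zr + zl) / 2.0
    phi = math.atan2(zr - zl, xr - xl)   # pattern orientation
    varphi = math.atan2(xt, zt)          # bearing to the target
    return varphi + phi, varphi, math.hypot(xt, zt)   # theta, varphi, d

def follower_control(e_d, phi_err, theta, d, vt,
                     ke=200.0, lam_e=0.005, kphi=10.0, lam_phi=0.1):
    """control law (9) with the saturating gains f(x) = k*tanh(lam*x);
    e_d = dd - d is the distance error, phi_err = phi_d - phi the
    pointing error (phi_d = 0 in the paper)."""
    f_e = ke * math.tanh(lam_e * e_d)
    f_phi = kphi * math.tanh(lam_phi * phi_err)
    v = (vt * math.cos(theta) - f_e) / math.cos(phi_err)
    w = -f_phi - vt * math.sin(theta) / d - v * math.sin(phi_err) / d
    return v, w
```

with symmetric image features (hl = hr, xa = −xb) the pattern is centered and parallel, so all angles vanish and d reduces to the common depth; with zero errors at the desired distance, the follower simply matches the leader's velocity (v = vt, ω = 0).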
the vision system includes a pxc200 frame grabber that captures the images from a sony ev-d30 camera mounted on the follower robot. these images are transmitted from the follower robot to a pentium ii-400 mhz pc, in charge of processing the images and of calculating the corresponding control actions. figure 8(a) shows the image captured by the camera; this image is processed to obtain the image shown in fig. 8(b). from this image, the centroids of the four projected pattern's marks are calculated and used to compute the variables (θ, ϕ, d) needed by the controller. finally, the computed control actions are sent by a transmitter to the follower robot. for the experiments, the following parameter values were used: ke = 200, λe = 0.005, kϕ̃ = 10 and λϕ̃ = 0.1. the follower robot has to follow the leading robot keeping a desired distance of dd = 0.50 m and ϕd = 0. figure 9 shows the evolution of the distance between both robots. figure 10 shows the evolution of the angles ϕ and θ. from these figures, the accomplishment of the control objective can be verified. fig. 11 depicts the control actions that are calculated and sent to the follower robot. the estimation of the leader robot's velocity is shown in fig. 12, and the trajectory of the follower robot is depicted in fig. 13. figure 7: robots used in the experiment. figure 8: (a) image captured by the robot's camera. (b) image processed to find the centroid of each mark. figure 9: evolution of the distance to the objective. figure 10: evolution of angles ϕ and θ. figure 11: calculated control actions. figure 12: estimated velocity of the objective. figure 13: trajectory followed by the follower robot. 7 conclusions in this work, a non-linear vision-based controller for coordinated motion of mobile robots following a leader robot has been presented.
the nonlinear controller has been designed with state-dependent control gains that avoid the saturation of the control actions. by using the lyapunov method, it has been proven that the resulting control system is asymptotically stable. the experiments have demonstrated that the proposed control system accomplishes the control objective with good performance, leading to a specified formation in which the follower robots follow a leader robot by using visual information. 8 acknowledgment this work was partially supported by anpcyt and conicet, argentina. the authors thank the science and technology for development program (cyted) for promoting the research cooperation between their groups. references [1] r. alami, multirobot cooperation in the martha project, ieee robotics and automation magazine, vol. 5, no. 1, pp. 36-47, 1998. [2] r. c. arkin, integrating behavioural, perceptual and world knowledge in reactive navigation, journal of robotics and autonomous systems, vol. 6, no. 1, pp. 36-47, 1990. [3] h. asama, operation of cooperative multiple robots using communication in a decentralised robotic system, proc. conf. from perception to action, switzerland, 5-7 sept., 1994. [4] j. borenstein, navigating mobile robots: systems and techniques, a k peters, wellesley, ma, usa, 1996. [5] r. a. brooks, a robust layered control system for a mobile robot, ieee journal of robotics and automation, vol. 2, no. 1, pp. 14-23, 1986. [6] y. ishida, functional complement by co-operation of multiple autonomous robots, proc. ieee int. conf. on robotics and automation, pp. 2476-2481, 1994. [7] m. j. mataric, learning in multi-robot systems, lecture notes in artificial intelligence – adaptation and learning in multi-agent systems, vol. 10, 1996. carlos m. soria, ricardo carelli universidad nacional de san juan instituto de automática av.
san martín oeste 1109 5400, san juan, argentina e-mail: csoria@inaut.unsj.edu.ar, rcarelli@inaut.unsj.edu.ar rafael kelly centro de investigación científica y de educación superior de ensenada ensenada, baja california, 22800 méxico e-mail: rkelly@cicese.mx juan m. ibarra zannatha laboratorio de robótica del departamento de control automático centro de investigación y de estudios avanzados av. ipn n 2508, lindavista, 07360 méxico, df e-mail: jibarra@ctrl.cinvestav.mx international journal of computers, communications & control vol. ii (2007), no. 2, pp. 132-142 predictive control of a wastewater treatment process sergiu caraman, mihaela sbarciog, marian barbu abstract: the paper deals with the design of a predictive controller for a wastewater treatment process. in the considered process, the wastewater is treated in order to obtain an effluent whose substrate concentration lies within the standard limits established by law (below 20 mg/l). this goal is achieved by controlling the concentration of dissolved oxygen to a certain value. the predictive controller uses a neural network as internal model of the process and alters the dilution rate in order to fulfill the control objective. this control strategy offers various possibilities for adjusting the control law by means of the following parameters: the prediction horizon, the control horizon, and the weights of the error and of the command. the predictive control structure has been tested in three functioning regimes, considered essential due to the frequency of their occurrence in current practice. keywords: predictive control, wastewater treatment, neural network, bioreactor 1 introduction the issue of wastewater treatment belongs to a larger area, namely environment protection. environment protection in general, and biological wastewater treatment in particular, is essential for the life of human communities and has lately received a lot of attention from specialized international organizations.
in this context, european laws envisage a series of specific orientations for treating and maintaining the water quality within legal limits (e.g. the surface water directives 75/440/eec and 79/869/eec, the drinking water directives 80/778/eec of 15 july 1980 and 98/83/eec of 3 november 1998, the urban wastewater treatment directive 91/271/eec, etc.). in addition, wastewater treatment processes are very complex, non-linear and characterized by many uncertainties with respect to the influent parameters and the structure and coefficients of the model. moreover, many wastewater treatment plants do not have measurement and control equipment. therefore, there is a need to design control strategies for the good operation of the process, strategies that may consider various types of models. process modelling: there has been a long transition between adopting the procedure of wastewater treatment using activated sludge and setting up the theoretical framework that closely describes it. the delay was mainly caused by the conflicting hypotheses related to the explanation of the process mechanisms and by their difficult translation into mathematical models [3, 7]. in 1983, the international water association (iwa) formed a working group intended to promote and facilitate practical methods of designing and operating biological wastewater treatment systems. as a result, the activated sludge model 1 (asm1) was presented in 1987 (see [8]). the model uses 13 state variables and describes the elimination of organic carbon and nitrogen. the same working group later extended the model by adding the biological process of phosphorus elimination, and named this model the activated sludge model 2 (asm2) [9]. two further improved versions, named asm2d and asm3, appeared afterwards [10]. the major shortcoming of asm1 is its complexity, which makes it difficult to use in a control system.
a simplified alternative of the asm1 model was obtained by taking into consideration the variables that are significant on a medium time-scale (a few hours to several days). the variables with a slow evolution were considered constant and those with a fast evolution were neglected [13]. these simplifications allowed the usage of the asm1 model in designing control laws. copyright © 2006-2007 by ccc publications lately, iwa established two major research areas:

• modelling of different industrial wastewater treatment processes: cellulose and paper industry, agricultural farms, spun glass industry, etc. the research teams try to model each process, depending on the substances involved in it. unlike domestic wastewater treatment, which is carried out naturally by the microorganisms, industrial wastewater treatment is done by cultivating microorganisms, sometimes genetically modified ones, which consume a certain organic substrate [17, 5].
• conditioning the excess of activated sludge in order to use it in other industrial activities, especially as a fertilizer in agriculture [16, 20].

process control: wastewater treatment systems are complex, non-linear, multivariable processes (with multiple inputs and outputs), with equivocal information about the influent's characteristics and the model's structure and parameters. two approaches can be distinguished in choosing the control structure for such a process: the first one is process-driven and the second one is model-based. the first approach deals with the separate control of the most important variables. within this category, the well-known problem of controlling the dissolved oxygen level is one of the most important issues for a good operation of wastewater treatment plants. a good level of dissolved oxygen allows the optimal growth of the microorganisms used in the process [12].
recently, the control of the nitrogen and phosphorus levels has also received a lot of attention [21]. the second approach has been improved a number of times. these improvements are related to the type of mathematical model used, as is the case for state estimators. using simplified models allowed the application of advanced control techniques (e.g. exact linearization, adaptive control, robust control techniques, etc.) [19]. however, when using more complex models, such as the asm1 model, the issue of automatic control becomes very complicated and the established results are less numerous. for the asm1 model, classic control techniques are usually used (pi, pid controllers), arranged hierarchically in a three-level structure [2]: at the higher level, a stable trajectory for the process is calculated for a certain period of time; the medium level deals with the trajectory optimization for the dissolved oxygen, the flow of the recycled activated sludge and the recycled inflow for nitrogen removal; at the lower level, the control of the dissolved oxygen concentration is achieved, based on the medium-level reference. a well-suited approach for this type of process is control based on artificial intelligence strategies. intelligent control exploits the knowledge and experience accumulated from managing the process and embeds it in control structures such as expert, fuzzy and neuro-fuzzy systems [1, 15, 18]. the present study considers a simplified model of the biological wastewater treatment plant [19]. the process is controlled using a model-based predictive control (mpc) strategy. the predictive controller uses a neural network as internal model of the process. this offers various possibilities for adjusting the control law by means of the following parameters: the prediction horizon, the control horizon, and the weights of the error and of the command.
the control purpose is to maintain the substrate concentration below an admissible limit, which is indirectly achieved by controlling the dissolved oxygen concentration, considering the dilution rate d as control input. the predictive control structure has been tested during several functioning regimes, which are essential due to their frequent occurrence in current practice. the paper is structured as follows: the second section describes the process components and the mathematical model of the plant, the third section introduces theoretical considerations about the control structure used in the paper, while the fourth section refers to the neural network used as internal model of the predictive controller. the fifth section presents the simulation results of the proposed control structure and the last section is dedicated to conclusions. 2 the model of the wastewater treatment process the mathematical model considered in this paper has been proposed in [19]. the model is based on the following assumptions:

• the system runs in steady-state regime (fin = fout = f, d = f/v);
• the recycled sludge is proportional to the process flow f: fr = r·f, where r is the recycled sludge rate;
• the flow of the sludge removed from the bioreactor (sludge that is in excess) is proportional to the process flow f: fβ = β·f, where β is the removed sludge rate;
• there is no substrate or dissolved oxygen in the recycled sludge flow of the bioreactor;
• the output flow of the aerated tank is equal to the sum of the output flow of the clarifier tank (settler) and the recycled sludge flow.
figure 1 presents the schematic representation of the wastewater treatment process: the influent enters an aerated bioreactor, whose output passes to a settler that separates the clear effluent from the settled sludge, which is partly recycled and partly removed as waste sludge. figure 1: the schematic representation of the wastewater treatment process. the aeration tank is a biological reactor containing a mixture of liquid and suspended solids, where a microorganism population is grown in order to remove the organic substrate from the mixture. the clarifier tank is a gravity settlement tank where the sludge and the clear effluent are separated. a part of the removed sludge is recycled back to the aeration tank and the other part is removed [14]. under these conditions the process model is given by the following mass balance equations:

dx/dt = µ(t)x(t) − d(t)(1 + r)x(t) + r d(t)xr(t)   (1)
ds/dt = −(µ(t)/y)x(t) − d(t)(1 + r)s(t) + d(t)sin   (2)
d(do)/dt = −k0(µ(t)/y)x(t) − d(t)(1 + r)do(t) + αw(domax − do(t)) + d(t)doin   (3)
dxr/dt = d(t)(1 + r)x(t) − d(t)(β + r)xr(t)   (4)
µ(t) = µmax · s(t)/(ks + s(t)) · do(t)/(kdo + do(t))   (5)

where x(t) is the biomass, s(t) the substrate, do(t) the dissolved oxygen, domax the maximum dissolved oxygen, xr(t) the recycled biomass, d(t) the dilution rate, sin and doin the substrate and dissolved oxygen concentrations in the influent, y the biomass yield factor, µ the biomass growth rate, µmax the maximum specific growth rate, ks and kdo saturation constants, α the oxygen transfer rate, w the aeration rate, k0 a model constant, and r and β the ratios of recycled and waste flow to the influent. the model coefficients have the following values: y = 0.65; β = 0.2; α = 0.018; kdo = 2 mg/l; k0 = 0.5; µmax = 0.15 h−1; ks = 100 mg/l; domax = 10 mg/l; r = 0.6. figure 2: the systemic diagram of the wastewater treatment process. the systemic diagram of the process is given in figure 2. figure 3 illustrates the open loop response of the system for a step input d = 0.1 h−1 (w = 80 h−1).
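a forward euler sketch of the mass-balance model (1)-(5), using the coefficient values listed above; the step size dt is an illustrative choice, not from the paper.

```python
def wwtp_step(X, S, DO, Xr, D, W, dt=0.01,
              Y=0.65, beta=0.2, alpha=0.018, KDO=2.0, k0=0.5,
              mu_max=0.15, Ks=100.0, DOmax=10.0, r=0.6,
              Sin=200.0, DOin=0.5):
    """one euler step of equations (1)-(5); coefficient values are
    those listed in the text, dt [h] is illustrative."""
    mu = mu_max * S / (Ks + S) * DO / (KDO + DO)   # growth rate, eq. (5)
    dX = mu * X - D * (1 + r) * X + r * D * Xr              # eq. (1)
    dS = -mu / Y * X - D * (1 + r) * S + D * Sin            # eq. (2)
    dDO = (-k0 * mu / Y * X - D * (1 + r) * DO
           + alpha * W * (DOmax - DO) + D * DOin)           # eq. (3)
    dXr = D * (1 + r) * X - D * (beta + r) * Xr             # eq. (4)
    return X + dX * dt, S + dS * dt, DO + dDO * dt, Xr + dXr * dt

# open-loop response to the step input d = 0.1 1/h, w = 80 1/h
X, S, DO, Xr = 200.0, 88.0, 5.0, 320.0
for _ in range(10000):   # 100 h with dt = 0.01 h
    X, S, DO, Xr = wwtp_step(X, S, DO, Xr, D=0.1, W=80.0)
```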
the initial conditions considered in this simulation are: x(0) = 200 mg/l, s(0) = 88 mg/l, do(0) = 5 mg/l, xr(0) = 320 mg/l, doin = 0.5 mg/l, sin = 200 mg/l. figure 3: open loop system response (evolution of x, s, do and xr over time). during the normal functioning of the wastewater treatment process, three regimes have been identified: rain (d = 1/20 h−1, w = 80 h−1), normal (d = 1/35 h−1, w = 60 h−1) and drought (d = 1/50 h−1, w = 20 h−1). the first case is characterized by maximum values of the aeration and dilution rates, the second regime considers medium values for w and d, and the third case is characterized by small values of the same parameters. in this study special attention has been paid to the predictive controller, such that it provides good performance for all three functioning regimes. 3 predictive control predictive control algorithms belong to the class of model-based control strategies, using a process model to incorporate the predicted future behavior of the process into the controller design procedure [6]. independent of the type of model used and of the cost index minimized, the principle of mpc is the same. at each sampling instant t [4]:

• use the process model to predict the future output of the process over the prediction horizon n2, {y(t + k|t), k = 1 . . . n2}, based on past inputs and outputs and postulated future inputs;
• minimize the cost index, taking into account possible constraints on inputs, outputs and states, in order to determine the optimal control sequence {u(t + k|t), k = 0 . . . nu − 1}, where nu is the control horizon;
• use the receding-horizon control mechanism, which introduces feedback into the optimization problem by applying to the process only the first optimal control action and discarding the subsequent ones.
as is straightforward from the description above (the model is the central element of the entire strategy), the success of the mpc strategy is highly dependent on a reliable process model, that is, a model which approximates well the process dynamics. a lot of research has been carried out up to now in the area of mpc based on linear models; however, many real-life processes are characterized by complex non-linearities, which makes the necessity of having a non-linear model of the process straightforward. therefore, emphasis is placed nowadays on using nonlinear models in the framework of predictive control, which would lead to improved control performance. the concept of the predictive control algorithm used in this work is illustrated in figure 4. figure 4: predictive control structure (the setpoint w is compared with the model output, and the step response coefficients of the model are used to minimize the cost j with respect to u). at each sampling instant t, the increment of the control input ∆u(t) is calculated by minimizing the cost function

j = ∑k=1..n2 δ²(k)[w(t + k|t) − y(t + k|t)]² + ∑k=0..nu−1 λ²(k)[∆u(t + k|t)]²   (6)

where w(t + k|t) is the setpoint prediction, and δ(k) and λ(k) are respectively the weighting coefficients of the prediction errors and of the control input increments. in order to calculate the output prediction, the step response of the model must be determined. to this end, it is necessary to admit that the model can be linearized around the current operating point. then

y(t + k|t) = yfree(t + k|t) + yforced(t + k|t)   (7)

where {yfree(t + k|t), k = 1 . . . n2} is the model output produced by the control input sequence {u(t + k|t) = u(t − 1), k = 0 . . . n2 − 1} and

yforced(t + k|t) = ∑i=1..k gi ∆u(t + k − i|t)   (8)

with {gi, i = 1 . . . n2} the unit step response coefficients. in matrix notation, equation (7) becomes

y = g·u + yfree   (9)

where

y = [y(t + 1|t) . . . y(t + n2|t)]ᵀ   (10)
u = [∆u(t|t) . . . ∆u(t + nu − 1|t)]ᵀ   (11)
yfree = [yfree(t + 1|t) . . . yfree(t + n2|t)]ᵀ   (12)

g =
[ g1    0      . . .  0
  g2    g1     . . .  0
  ...   ...    ...    ...
  gn2   gn2−1  . . .  gn2−nu+1 ]   (13)

using the model (9) in (6), a quadratic relation with respect to u is obtained:

j = [∆·w − ∆(g·u + yfree)]ᵀ[∆·w − ∆(g·u + yfree)] + (λ·u)ᵀ(λ·u)   (14)

where

w = [w(t + 1|t) . . . w(t + n2|t)]ᵀ   (15)
∆ = diag[δ(1) . . . δ(n2)]   (16)
λ = diag[λ(0) . . . λ(nu − 1)]   (17)

only the first component of the vector u, ∆u(t|t) = ∆u(t), is applied. at the next sampling instant the whole procedure is repeated. 4 the internal model of the process artificial neural networks form an important class of nonlinear systems, with many applications in modeling and control. as mathematically proven (see [11]), any static continuous nonlinear function can be approximated arbitrarily well over a compact interval by a multilayer neural network with one or more hidden layers. in this contribution a feedforward neural network is used to model the behavior of the wastewater treatment process. the proposed neural network has three layers: the first one contains 15 neurons, the second one 7 neurons and the output layer 4 neurons. to appropriately capture the interconnections between all variables, up to four time-delayed values of the inputs and states were supplied to the network. the neural model predicts x(t), s(t), xr(t) and do(t) as functions of: d(t − 1), d(t − 2), d(t − 3), w(t − 1), w(t − 2), w(t − 3), x(t − 1), x(t − 2), x(t − 3), x(t − 4), s(t − 1), s(t − 2), s(t − 3), xr(t − 1), xr(t − 2), xr(t − 3), xr(t − 4), do(t − 1), do(t − 2), do(t − 3). the data used to train the neural network was obtained by integrating the differential equations (1)-(4), considering randomly varying dilution rates in the interval [0, 0.1] and randomly varying aeration rates in the interval [0, 100]. before training, the data was scaled to the interval [0, 1]. in the same manner, a second data set was generated and used to validate the accuracy of the model.
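returning to the cost (14): being quadratic in u, its unconstrained minimizer has the closed form u = (gᵀ∆ᵀ∆g + λᵀλ)⁻¹ gᵀ∆ᵀ∆ (w − yfree), of which only the first component ∆u(t) is applied. a sketch using numpy (the function name and argument layout are illustrative):

```python
import numpy as np

def dmc_increment(g, w, y_free, Nu=1, delta=None, lam=None):
    """first control increment delta_u(t) minimizing cost (14),
    with y = G u + y_free as in (9); g holds the unit step-response
    coefficients g_1..g_N2. defaults mirror the paper's choices
    (Nu = 1, unit error weights, zero move weights)."""
    g = np.asarray(g, dtype=float)
    N2 = len(g)
    G = np.zeros((N2, Nu))
    for j in range(Nu):
        G[j:, j] = g[:N2 - j]        # lower-triangular toeplitz, eq. (13)
    D = np.eye(N2) if delta is None else np.diag(delta)
    L = np.zeros((Nu, Nu)) if lam is None else np.diag(lam)
    H = G.T @ D.T @ D @ G + L.T @ L
    rhs = G.T @ D.T @ D @ (np.asarray(w, float) - np.asarray(y_free, float))
    return np.linalg.solve(H, rhs)[0]

# unit-gain model (g_i = 1), zero free response, unit setpoint over n2 = 5
du = dmc_increment([1.0, 1.0, 1.0, 1.0, 1.0], [1.0] * 5, [0.0] * 5)  # du == 1.0
```

in closed loop this computation is repeated at every sampling instant with an updated free response, which is where the receding-horizon feedback comes from.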
as figure 5 shows, there is hardly any difference between the measured values from the process and the ones predicted by the neural network for the dissolved oxygen and substrate concentrations. there is a noticeable shift between the biomass calculated from the differential equations of the process and the biomass predicted by the neural network, but this is not going to affect the performance of the predictive controller, since the neural network is used to predict the dissolved oxygen level. figure 5: validation of the neural network model: process continuous line, model dash-dotted line. 5 simulation results figure 6 illustrates the control principle. the predictive controller calculates the dilution rate, which forces the dissolved oxygen concentration to follow the setpoint. controlling the concentration of dissolved oxygen has a beneficial effect on the substrate concentration, which is brought within the limits imposed by the law (below 20 mg/l). various configurations of the predictive controller parameters can be chosen in order to fulfill the control requirements. in these simulations a fast control was pursued, which can generally be achieved with a small prediction horizon, and less attention was paid to the magnitude of the control input variations. the controller parameters were: n2 = 5, nu = 1, ∆ = i5, λ = 0. the diagram of figure 6 shows the air flow rate (containing do), the consumed do and the measured do (where the designed controller intervenes), and notes that: 1) if the water stays in the tank long enough, the do concentration will increase, because the oxygen is no longer consumed (treated water is equivalent to the substrate being biologically degraded); 2) if the water does not stay in the tank long enough, the do will decrease, thus the water is not treated (the explanation is that aerobic metabolism reactions are taking place).
figure 6: the principle of the dissolved oxygen concentration control. the results of the dissolved oxygen concentration control are presented in figures 7, 8 and 9. figure 7 considers a constant dissolved oxygen setpoint (7.5), while figure 8 shows the case when the dissolved oxygen setpoint is variable. figure 7: simulation results for a constant do setpoint (7.5). figure 8: simulation results for a variable do setpoint. in both cases, the aeration rate w varies such that the process covers all three functioning regimes (rain, normal and drought). figure 9 considers a constant dissolved oxygen setpoint but a variable concentration of the substrate in the influent. at time t = 150 h, sin was changed from 200 mg/l to 300 mg/l and was kept at the new value until t = 250 h, when it was changed to 150 mg/l. the controller adjusts the dilution rate and do is brought back to the setpoint value. figure 9: variable substrate concentration in the influent. 6 conclusions wastewater treatment is a complex process, which requires control for good operation. this paper introduces a predictive controller for such a system and evaluates the control performance. the success of the mpc strategy is highly dependent on a reliable process model, that is, a model which approximates well the process dynamics. taking into account the complexity of the wastewater treatment process, a neural network has been chosen as internal model for the predictive controller. the simulation results show a good performance of the control loop.
The controller manipulates the dilution rate and forces the dissolved oxygen concentration to follow the imposed setpoint. This has a beneficial effect on the substrate concentration, which is maintained within the limits established by law. The control is effective for the various operational regimes defined by the aeration rate. Moreover, it is able to reject disturbances that might appear in the substrate concentration of the inflow.

References

[1] Z. Boger, Application of neural networks to water and wastewater treatment plant operation, ISA Trans., Vol. 31, pp. 25-33, 1992.
[2] M. A. Brdys and Y. Zhang, Robust hierarchical optimising control of municipal wastewater treatment plants, Preprints of the 9th IFAC/IFORS/IMACS/IFIP Symposium Large Scale Systems: Theory & Applications, LSS'2001, Bucharest, Romania, pp. 540-547, July 18-20, 2001.
[3] A. M. Buswell and H. L. Long, Microbiology and theory of activated sludge, Journal of American Pollution Wks. Assn., Vol. 10, 1923.
[4] E. F. Camacho and C. Bordons, Model Predictive Control, Springer-Verlag, London, 1999.
[5] E. Choi et al., High strength nitrogen removal from nightsoil and piggery wastes, Proc. of the 6th IWA Speciality Symposium on Strong Nitrogenous and Agro-Wastewater, Seoul, Korea, 11-13 June, 2003.
[6] R. De Keyser, Model Based Predictive Control, invited chapter in "UNESCO Encyclopedia of Life Support Systems (EoLSS)", EoLSS Publishers Co. Ltd., Oxford (www.eolss.net), 2003.
[7] B. L. Goodman and A. J. Englande, A unified model of the activated sludge process, Journal of Water Pollution Control Fed., Vol. 46, pp. 312-332, 1974.
[8] M. Henze et al., Activated Sludge Model No. 1, IAWQ Scientific and Technical Report No. 1, IAWQ, London, Great Britain, 1987.
[9] M. Henze et al., Activated Sludge Model No. 2, IAWQ Scientific and Technical Report No. 3, IAWQ, London, Great Britain, 1995.
[10] M. Henze et al., Activated Sludge Models ASM1, ASM2, ASM2d and ASM3, IWA Publishing, London, Great Britain, 2000.
[11] K. Hornik, M. Stinchcombe and H. White, Multilayer feedforward networks are universal approximators, Neural Networks, Vol. 2, pp. 359-366, 1989.
[12] P. Ingildsen, Realising Full-Scale Control in Wastewater Treatment Systems Using In Situ Nutrient Sensors, Ph.D. thesis, Dept. of Industrial Electrical Engineering and Automation, Lund University, Sweden, 2002.
[13] U. Jeppsson, Modelling Aspects of Wastewater Treatment Processes, Ph.D. thesis, Dept. of Industrial Electrical Eng. and Automation, Lund University, Sweden, 1996.
[14] M. R. Katebi, M. A. Johnson and J. Wilke, Control and Instrumentation for Wastewater Treatment Plant, Springer-Verlag, London, 1999.
[15] R. E. King and A. Stathaki, A multi-layer perceptron for the control of a wastewater treatment plant, 12th Mediterranean Conference on Control and Automation MED2004, Kusadasi, Turkey, Proc. CD-ROM, June 6-9, 2004.
[16] H. Kroiss, What is the potential for utilizing the resources in sludge, Water Science and Technology, Vol. 49, pp. 1-10, 2003.
[17] G. Langergraber et al., Monitoring of a paper mill wastewater treatment plant using UV/VIS spectroscopy, Water Science and Technology, Vol. 49, pp. 9-14, 2003.
[18] S. A. Manesis, D. J. Sapidis and R. E. King, Intelligent control of wastewater treatment plants, Artificial Intelligence in Engineering, Vol. 12, pp. 275-281, 1998.
[19] F. Nejjari, A. Benhammou, B. Dahhou and G. Roux, Non-linear multivariable adaptive control of an activated sludge wastewater treatment process, Int. J. Adapt. Control Signal Process., Vol. 13, pp. 347-365, 1999.
[20] J. T. Novak and C. Park, Chemical conditioning of sludge, Proc. of Int. Conf. on Wastewater Sludge as a Resource Biosolids 2003, Trondheim, Norway, 23-25 June, 2003.
[21] S. Yagi, H. Kohara, Y. Nakamura and S. Shiba, Fuzzy control of a wastewater treatment plant for nutrients removal, Proc. of the Int. Conf. on Artif. Intell.
in Engineering & Technology, Sabah, Malaysia, June 17-18, 2002.

Sergiu Caraman & Marian Barbu
"Dunarea de Jos" University of Galati, Romania
Department of Automatic Control and Electronics
Domneasca Street, No. 47, 800008-Galati, Romania
sergiu.caraman, marian.barbu@ugal.ro

Mihaela Sbarciog
Ghent University, Belgium
Department of Electrical Energy, Systems and Automation
Technologiepark 913, Zwijnaarde-Ghent, 9052, Belgium
mihaela@autoctrl.ugent.be

Received: February 19, 2007

Sergiu Caraman (born on September 30, 1955) received the license degree in 1980 from the Polytechnical University of Bucharest and the PhD degree in 1996 from "Dunarea de Jos" University of Galati, Romania, both in control systems. Presently he is a professor in the Faculty of Electrical Engineering and Electronics, "Dunarea de Jos" University. His research interests include topics such as modelling and control of biotechnological processes and intelligent control.

Mihaela Sbarciog (born on May 21, 1976) graduated from the Faculty of Naval and Electrical Engineering of "Dunarea de Jos" University of Galati in 1999. Presently she is a research assistant at the Faculty of Engineering, Ghent University. Her main research field is the analysis and control of biochemical reaction systems.

Marian Barbu (born on March 22, 1978) received the license degree in 2001 and the PhD degree in 2006 from "Dunarea de Jos" University of Galati, both in control systems. Presently he is a lecturer in the Faculty of Electrical Engineering and Electronics, "Dunarea de Jos" University. His research interests include topics such as modelling and control of wastewater treatment processes, robust and intelligent control.

International Journal of Computers, Communications & Control
Vol. I (2006), No. 1, pp.
33-39

Symbolic Computations Based on Grid Services

Dana Petcu, Cosmin Bonchiş, Cornel Izbaşa

Abstract: The widespread adoption of the current grid technologies is still impeded by a number of problems, one of which is the difficulty of developing and implementing grid-enabled applications. In another dimension, symbolic computation, which aims to automate the steps of mathematical problem solving, has become in recent years a basis for advanced applications in many areas of computer science. In this context we have recently analyzed and developed grid extensions of known tools for symbolic computations. In this paper we present a case study of a web service-based grid application for symbolic computations.

Keywords: grid computing, web services, mathematical software

1 Introduction

It is widely recognized that grid computing is the computer buzzword of the decade. Many of the greatest challenges for software systems lie in how to enable automated software components deployed by different organizations to dynamically discover one another, communicate, coordinate their actions, and form sound, robust and effective compound applications and services. The grid promises to solve these problems. The grid's underlying problems are multi-disciplinary and cover a wide range of issues, from service discovery, management, robust management of dependencies and system-to-system communication to security, legal or ethical frameworks, methodologies, verification, testing, and finally deployment of infrastructure in a shared and accessible environment. After the forerunner first-generation grid systems, the second generation proposed the vision of computing resources shared like content on the web over the internet. The associated layered architecture abstracts fundamental system components, their purpose, and their interaction with each other. Moreover, the current, third-generation grid is aligned with web services.
Compared to conventional distributed computing environments, the usual grid environment focuses on the user: users working on their home machines see the illusion of a single computer, with access to all kinds of data and physical resources. Moreover, the specific machines used to execute an application are chosen from the user's point of view, maximizing the performance of that application, regardless of the effect on the system as a whole. The two basic approaches to the computational solution of mathematical problems are numerical and symbolic. For a long time, the numerical approach had the advantage of being capable of solving a substantially larger set of problems. Developments in symbolic computing lagged relative to numerical computing, mainly due to the inadequacy of available computational resources: most importantly computer memory, but also processor power. Continuous growth in the capabilities of computer hardware has led to an increasing interest in symbolic calculations. A transition from numerical modeling to analytical modeling can be observed today in various spheres of science and technology; one motivation is the desire to construct more accurate and faithful simulations. Currently, software tools for symbolic computations, known as computer algebra systems (CAS), allow users to study computational problems on the basis of their mathematical formulations and to focus on the problems themselves instead of spending time transforming the problems into numerically solvable forms. There are three ways in which a CAS can interact with the grid, detailed in the next section: the CAS uses a grid service to improve its own services, the CAS uses the grid infrastructure to improve its response time, or the CAS is presented to a client application as a grid service. We have studied the first two cases in [11] and [10], respectively. In this paper we look at the third approach. The paper is organized as follows.
Section 2 presents the state of the art in the field. Section 3 underlines the benefits of using grid services for symbolic computations. In the context of current grid and CAS technologies, a case study is presented in Section 4. Further work directions are identified in Section 5.

2 The Grid and the CAS

2.1 Approaches for CAS-grid interaction

A grid extension of a CAS system can take one or more of the following approaches:

Copyright © 2006 by CCC Publications

Dana Petcu, Cosmin Bonchiş, Cornel Izbaşa

Ability to accept services from the grid: the CAS must be able to augment its facilities with external modules; in particular, it should be able to explore computational grid facilities, to connect to a specific grid service, to use it, and to translate its results for the CAS interface. This approach is taken by NetSolve/GridSolve [1], Geodise [4], MathGridLink [14], and our Maple2G [9].

Ability to communicate and cooperate over the grid: several CAS kernels must be able to cooperate within a grid when solving problems; in order to have the same CAS on different computational nodes, a grid version must be available; in the case of different CASs, appropriate interfaces between them must be developed and implemented, or a common language for inter-communication must be adopted. This approach is taken by Maple2G [10].

Being a source of grid or web services: the CAS or some of its facilities must be reachable as grid or web services and allowed to be activated by remote users under appropriate security and licensing conditions; furthermore, deployment of the services must be possible in an easy way from inside the CAS. This approach is taken by the several projects described below. More details about the first two approaches can be found in [12]. Details about the third approach follow.
2.2 Web service-based CAS extensions

In number theory there exist a number of successful internet projects [6] aiming, among other things, at finding large prime numbers, factoring large numbers, computing digits of π, finding collisions of known encryption algorithms, etc. A CAS web-wrapper component that can be used by multiple systems was reported in [15]. Another online system, OGB (for Gröbner basis computations), has recently been deployed [5]. MapleNet [7] offers a software platform to enhance mathematics and related courses over the web. The client machine must be able to run Java applets. A publisher machine is responsible for creating and editing the content of the web pages and, when complete, uploading them to the server. The server is the machine to which clients connect to access web pages and the applets associated with them. The server also responds to publishing requests from the publisher machine for the transfer of content between the publisher and the server. It manages concurrent Maple instances as required to serve client connections for mathematical computation and display services. The server can also provide some additional services, including user authentication, logging information, and database access. webMathematica [16] offers access to Mathematica applications through a web browser or other web clients. Mathematica can be seen as a development environment for webMathematica sites. Standard Java technologies are used: Java Servlet and JavaServer Pages. webMathematica allows a site to deliver HTML pages enhanced by the addition of Mathematica commands. When a request is made for one of these pages, the Mathematica commands are evaluated and the computed result is inserted into the page. Input can come from HTML forms, applets, JavaScript, and web-enabled applications. It is also possible to send data files to a server for processing.
Output can use different formats such as HTML, images, Mathematica notebooks, MathML, SVG, XML, PostScript, and PDF. The MONET project [8], funded by the European Commission, was a two-year (April 2002-March 2004) investigation into mathematical web services, aiming to demonstrate the applicability of the semantic web to the world of mathematical software. The principal objective was to develop a framework for the description and provision of web-based mathematical services. The key to such a framework is the ability to discover services dynamically based on published descriptions of both their mathematical and non-mathematical attributes. Such discovery and subsequent interaction are mediated by software agents capable of recognizing the criteria which should determine how particular kinds of problems are solved, and of extracting them from the user's problem description. A symbolic solver wrapper tool architecture was designed to provide an environment that encapsulates CASs and exposes their functionalities through deployed symbolic services. All symbolic services run as independent web services, each reachable at its own unique URL; all of them are enclosed within the symbolic server and are managed by the wrapper tool's symbolic solver environment. Each symbolic service is assigned several instances, such as a service core Java class, source code implementing the service with a mathematical solving software (a CAS), and an MSDL file. The principal information about each service is provided by the service configuration file, which contains three parts: the service's MSDL, the service interface to the mathematical solving system, and the actual service implementation. The technologies used for the symbolic solver services implementation are Java, Axis, Tomcat, SOAP, WSDL, JSP, and MSDL.
Maple was chosen as an example solving engine for the first implementation, and Axiom was used to validate the architecture and to demonstrate the ability to adopt a different solving engine without major changes.

2.3 Grid service-based CAS extensions

There exist a number of grid-oriented projects that involve CASs. The project Grid Enabled Numerical and Symbolic Services, GENSS [3] (March 2004-February 2006), within the UK e-Science programme, addressed the combination of grid computing and mathematical web services, and their extension to deliver mathematical problem analysis, the code and the resources to compute the answers, using a common open agent-based framework. The main research focus lay on matchmaking techniques for advertisement and discovery of mathematical services. The project involved the design and implementation of an ontology for symbolic mathematical problems, used to support service specification and registration of services. The ontology has been extended based on work undertaken in MONET [8]. The Grid Execution Management for Legacy Code Architecture, GEMLCA [2], is a recent solution to deploy existing legacy code applications written in any programming language as grid services without modifying or even requiring access to the legacy code (source or binary). The access point for a client to the GEMLCA architecture is the front-end layer, composed of a number of grid services offering interfaces in order to deploy, query, submit, check the status of, and get the results back from computational jobs. The front-end layer is described in WSDL and can be used by any grid service client to bind to and make use of its functionality through SOAP. In order to access a legacy code program, the user executes the GEMLCA grid service client, which creates a legacy code instance with the help of a legacy code factory.
Following this, the system submits the job to the compute server through Globus Toolkit version 3 using a particular job manager. A specific XML format, LCID (Legacy Code Interface Description file), must be used.

3 Benefits of using symbolic-computing services based on Globus-WSRF

The recent version 4 of the Globus Toolkit, the de facto standard for grid technologies, is based on standard web services technologies such as SOAP and WSDL. It is written according to the WSRF (Web Services Resource Framework) specification. The basic requirements addressed by WSRF are the abilities to create, address, inspect, discover, and manage stateful resources. Grid services extend web services (usually stateless services) by providing these extra functionalities. The WSRF approach is more flexible than the previous ones implemented in the Globus Toolkit (e.g. the OGSI implementation), allowing many-to-many mappings between web services (the message processor) and any associated stateful resource (the stateful service instance).

3.1 Stateful services

The WSRF approach simplifies the development of grid-service wrappers for CASs. The CAS can now take the role of the stateful service instance. If we go back to the case of the web wrapper of a CAS, we can identify several problems solved by the grid environment. Successive related requests to the service hiding the CAS would require keeping the service instance in a command-waiting cycle, without releasing the connections. If the connection is interrupted and the client later comes back to the system, it must start over as a new incomer. If a stateful service is used, the latest state of the CAS can be registered. Even though the connection was closed, the client can come back and resume the computation at any time before the service instance is destroyed.
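A minimal sketch of this stateful-resource idea, in illustrative Python rather than the actual Globus Toolkit API (all class and method names here are hypothetical): a factory creates a keyed resource holding the session state, and a stateless instance service operates on whichever resource the client's endpoint reference names, so a returning client resumes its old session.

```python
import itertools

class ResourceHome:
    """Holds stateful resources: here, a per-client CAS session state."""
    def __init__(self):
        self._resources = {}
        self._keys = itertools.count(1)

    def create(self):
        key = next(self._keys)
        self._resources[key] = {"history": []}
        return key

    def get(self, key):
        return self._resources[key]

class FactoryService:
    """Creates a resource and hands back an endpoint reference (EPR)."""
    def __init__(self, home):
        self.home = home

    def create_resource(self):
        return {"address": "http://example.org/InstanceService",  # illustrative
                "key": self.home.create()}

class InstanceService:
    """Stateless message processor; the state lives in the resource."""
    def __init__(self, home):
        self.home = home

    def perform(self, epr, command):
        state = self.home.get(epr["key"])
        state["history"].append(command)   # session state survives between calls
        return "evaluated: " + command

home = ResourceHome()
epr = FactoryService(home).create_resource()
service = InstanceService(home)
service.perform(epr, "x := 1;")
# even after a disconnect, presenting the same EPR resumes the same session:
service.perform(epr, "x + 1;")
print(home.get(epr["key"])["history"])   # → ['x := 1;', 'x + 1;']
```

The many-to-many mapping mentioned above corresponds to the fact that any number of InstanceService objects can serve any number of resources held by the ResourceHome.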
3.2 Service instances on remote grid nodes

Using an appropriate scheduler, the CAS service instance can run on a different grid hardware resource than the one hosting the container for web-based grid services, which is the one primarily addressed by the service client. This approach solves the server overload problem of a classic client-server architecture.

3.3 Service discovery

The user is confronted with thousands of packages available to perform all kinds of mathematical computations. A standard way to categorize, explore, discover, invoke and compose them is needed. Grid computing has awakened high expectations for its potential as a discovery accelerator. Grid WSRF-based services are described in the standard WSDL format, which can easily be inspected by any potential client.

3.4 Security standards

Globus's GSI offers two message-level protection schemes and one transport-level scheme [13]. The GSI secure message scheme, which provides message-level security, can be used in the case of proprietary CAS usage. The Globus Toolkit implements three authentication methods: X.509 certificates, username-password, and anonymous authentication. The first two authentication methods are recommended in the case of proprietary CAS usage. GSI also supports authorization on both the server side and the client side. The server has six possible authorization modes: none, self, gridmap, identity authorization, host authorization, and SAML callout authorization. Depending on the chosen authorization mode, the server decides whether it accepts or declines an incoming request. Identity authorization or SAML callout authorization is recommended in the case of proprietary CAS usage.

4 A case study: grid-based services using Maple

We proceed with a practical example of how a CAS can be made available as a grid-WSRF service; for this purpose Maple became our CAS of choice.
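A grid wrapper of this kind ultimately reduces to a command-evaluation loop between the service and the CAS kernel over a socket. A minimal stand-in can be sketched as follows, with Python's eval playing the role of the Maple kernel; all names and the single-process socket pair are illustrative simplifications, not the paper's Java implementation.

```python
import socket
import threading

def cas_worker(conn):
    # Stand-in for the CAS thread: accept command strings on the socket,
    # evaluate them, and send the textual result back.
    with conn:
        while True:
            data = conn.recv(4096)
            if not data:
                break
            command = data.decode().strip()
            try:
                result = str(eval(command))  # eval() replaces the Maple kernel here
            except Exception as exc:
                result = "error: %s" % exc
            conn.sendall(result.encode())

def perform(conn, command):
    # Client side of the instance service: send one command, read the reply.
    conn.sendall(command.encode())
    return conn.recv(4096).decode()

if __name__ == "__main__":
    server_end, client_end = socket.socketpair()
    threading.Thread(target=cas_worker, args=(server_end,), daemon=True).start()
    print(perform(client_end, "2**2 * 5**2 * 7"))  # prints 700
    client_end.close()
```

In the real system the worker would hand each string to the Maple kernel instead of eval, and the socket would connect two processes rather than a socketpair within one.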
The main reason is that, despite its robustness and ease of use, we were not able to locate efforts to link Maple with the grid, except ours, namely Maple2G. Furthermore, it is well known that Maple outperforms other CASs in solving selected classes of problems, e.g. systems of nonlinear equations or inequalities. Finally, Maple already has a built-in socket library for communicating over the internet, and a library for parsing XML. These capabilities match our goal very well, as they suffice to make Maple a client for a grid computational service. Maple2G (Maple-to-Grid) was recently built as a grid wrapper for Maple. Maple2G consists of two parts, a CAS-dependent one and a grid-dependent one. Therefore, any change in the CAS or in the grid is reflected in only one part of the proposed system. Furthermore, the CAS-dependent part is relatively simple and easy to port to support another CAS or legacy software. Maple2G covers all three approaches described in Section 2.1. We describe here the newest component, the one that presents Maple as a grid service. It is based on the WSRF implementation from Globus Toolkit 4.

[Figure 1: The four elements of the application (the client, the factory, the instance, and the resource) and their interaction in time.]

At the server side, the application has two main components: the factory service and the instance service. A resource is created by the factory service and used by the instance service. The interaction initiated by the client follows these steps (Figure 1): 1. the client requests the service instance and resource creation; 2. the factory service creates the resource; 3. the key of the resource is identified by the factory; 4. the factory creates the service instance; 5. the service reference is returned to the factory; 6. the EndpointReference (instance and resource identifiers) is sent to the client; 7.
the client contacts the service instance and requests an operation involving the resource; 8. the service instance serves the request using the resource; 9. the service instance sends the result to the client. The client can reiterate steps 7-9 using the EndpointReference. The application components were written in Java. The factory service is activated once the Globus service container is activated on the resource where the service package was deployed. The instance service launches Maple as a concurrent thread on the same hardware resource on which it runs. The communication between the Java-based instance service and Maple is done via sockets. At start, the Maple thread reads a temporary file that specifies the socket channel to be opened, and then enters the cycle in which it accepts any string coming on that communication channel, evaluates it as a Maple command, and sends the result via the socket connection to the instance service. For example, after the maplefactoryservice activation, mapleclientperform can send Maple commands to the instance service in the form of strings:

$ mapleclientcreate http://194.102.62.15:8080/wsrf/services/maplefactoryservice >ref
$ mapleclientperform ref "ifactor(123456789056789098765098765432100);"
(2)^2*(5)^2*(7)*(43)*(113)*(331)*(683)*(12067378391)*(65837)*(202087)
$ mapleclientperform ref "p:=12*x^6+84*x^4-54*x^5-270*x^3+168*x^2-216*x+96:"
$ mapleclientperform ref "solve(p);"
1/2, 4, i, -i, 2*i, -2*i

We have also built a web interface, depicted in Figure 2. We used AJAX (Asynchronous JavaScript and XML) for sending and receiving data to and from the server. After user authentication, the interface can be used in a few simple steps: 1. start the Globus container on the server (Figure 2.a); 2. choose the proper service (mathfactoryservice) from the list of available services (Figure 2.b); 3. start the mathfactoryservice, which will launch the Maple instance (Figure 2.c); 4. fill in and execute a Maple command (Figure 2.d and e); 5.
the Maple instance executes the command and returns a MathML document as the answer. The MathML document is transformed into an SVG file, which is sent back to the interface (Figure 2.f).

5 Conclusions and further work

At this stage Maple2G exists as a demonstrator system with some of the functionalities described above. In the near future it will be further developed to include facilities existing in other systems, in order for it to become more robust. Currently, if access to the Maple service is granted, any Maple commands can be used. Restricted access to a subset of commands (e.g. no access to shell commands revealing the host characteristics or establishing socket connections) should be implemented. Specialized services based on Maple should be developed and deployed as grid services. Experiments on wide-area grids will help guide further development of the system. Deployment of grid services based on CASs other than Maple, using the same code, must also be taken into consideration.

Acknowledgment

This work was partially supported by the European project SCIEnce (FP6-2004-Infrastructure-5-026123) and the Romanian project CompGrid (CNCSIS-2004-949).

[Figure 2: Using a web interface to access symbolic computing services.]

References

[1] S. Agrawal, J. Dongarra, K. Seymour, S. Vadhiyar, "NetSolve: past, present, and future – a look at a grid enabled server", in Making the Global Infrastructure a Reality, Wiley, Berman F. (ed.), pp. 613-622, 2003.
[2] T. Delaitre, A. Goyeneche, P. Kacsuk, T. Kiss, G. Z. Terstyanszky, S. C. Winter, "GEMLCA: grid execution management for legacy code architecture design", in Procs. of the 30th Euromicro Conference, Special Session on Advances in Web Computing, August 2004, Rennes, France, pp. 305-315, 2004.
[3] GENSS, available online at: http://genss.cs.bath.ac.uk/index.htm.
[4] Geodise, available online at: http://www.geodise.org/.
[5] M.
Gettrick, "OGB: Online Gröbner Bases", http://grobner.it.nuigalway.ie, 2004.
[6] Internet-based distributed computing, available online at: http://www.aspenleaf.com/distributed/apmath.html.
[7] MapleNet, available online at: http://www.maplesoft.com/maplenet/.
[8] MONET, available online at: http://monet.nag.co.uk.
[9] D. Petcu, D. Dubu, "An extension of Maple for grid and cluster computing", Procs. ICCC 2004, I. Dziţac, T. Maghiar, C. Popescu (eds.), Oradea, Băile Felix Spa, May 27-29, 2004, Ed. Metropolis, pp. 355-360, 2004.
[10] D. Petcu, D. Dubu, M. Paprzycki, "Grid-based parallel Maple", LNCS 3241, Procs. PVMMPI 2004, Budapest, Hungary, September 19-22, 2004, D. Kranzmüller, P. Kacsuk, J. Dongarra (eds.), Springer, pp. 215-223, 2004.
[11] D. Petcu, M. Paprzycki, D. Dubu, "Design and implementation of a grid extension of Maple", Scientific Programming, Vol. 13, No. 2, IOS Press, pp. 137-149, 2005.
[12] D. Petcu, D. Ţepeneu, M. Paprzycki, T. Ida, "Symbolic computations on grids", Chapter 27, Engineering the Grid: Status and Perspective, B. Di Martino, J. Dongarra, A. Hoisie, L. Yang, and H. Zima (eds.), 2006.
[13] B. Sotomayor, L. Childers, Globus Toolkit 4: Programming Java Services, Morgan Kaufmann, 2005.
[14] D. Ţepeneu, T. Ida, "MathGridLink: connecting Mathematica to the grid", in Procs. IMS'04, Banff, Alberta, Canada, 2004.
[15] A. Weber, W. Küchlin, B. Eggers, V. Simonis, "Parallel computer algebra software as a web component", available online at: http://www.cs.ucsb.edu/conferences/java98/papers/algebra.pdf, 1998.
[16] webMathematica, available online at: http://www.wolfram.com/products/webmathematica/.
Dana Petcu
Institute e-Austria Timişoara and Western University of Timişoara
Computer Science Department
Address: B-dul Vasile Pârvan 4, 300223 Timişoara, Romania
E-mail: petcu@info.uvt.ro

Cosmin Bonchiş
Institute e-Austria Timişoara and Vasile Goldiş University of Arad
Computer Science Department

Cornel Izbaşa
Institute e-Austria Timişoara and Western University of Timişoara
Computer Science Department

International Journal of Computers, Communications & Control
Vol. I (2006), No. 2, pp. 5-5

Preface

Gastón Lefranc Hernández

It is an honor for me to introduce a list of papers selected from the XVI Congress of ACCA (Asociación Chilena de Control Automático, Chilean Association of Automatic Control), held in Santiago de Chile at the end of 2004, in its 30 years of existence. Since 1974, ACCA has organized a congress every two years, tutorial courses for continuing education, seminars, workshops, expositions of systems and equipment, and our magazine Automatica and Innovation. All of these activities have had results: an important influence in all areas, consolidating ACCA as a meeting point for people from industry, from private and public institutions, from the academic world, from suppliers of systems and equipment, and professionals from all over Latin America. In 1976, ACCA had its first program committee to select papers, and its first proceedings. This congress was organized by ACCA, the IEEE Chilean Chapter on Control, Robotics and Cybernetics, IFAC and its technical committees, and the Universidad de Las Américas, Chile. ACCA has been an NMO of IFAC since 1984. The ACCA congress had keynote speakers for the plenary sessions: Dr. François Vernadat from France, Dr. Florin Filip from Romania, Dr. Shimon Nof from the USA and Dr. Philippe Dupont from France. A round table was organized to discuss productivity, a topic of interest to professionals in Chile and Latin America.
The 46 accepted papers, selected by the international program committee, were presented in technical sessions where new ideas, critical comments, and the beginnings of cooperation leading to future projects took place. The papers come from Argentina, Brazil, Chile, Cuba and México. The international program committee selected 10 papers as the best of this congress, to be published in the "International Journal of Computers, Communications & Control". The papers are in the areas of artificial intelligence, vision, manufacturing, pattern recognition and robotics. ACCA thanks Dr. Florin Filip for making it possible to present the activities in Chile and Latin America, oriented to improving and solving our own problems. It is a pleasure to introduce you to this special issue dedicated to ACCA in its 30 years.

Gastón Lefranc Hernández
Chairman of the International Program Committee, XVI Congress of ACCA
Guest Editor of this issue

Copyright © 2006 by CCC Publications

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844
Vol. V (2010), No. 2, pp. 144-147

Acad. Solomon Marcus at 85 Years (A Selective Bio-Bibliography)

F. G. Filip (Editor in Chief of IJCCC), I. Dzitac (Associate Editor in Chief of IJCCC)

Professor Solomon Marcus is one of the seniors of Romanian science and culture, with a tremendously diverse and intensive activity, with numerous, fundamental, and many times pioneering contributions to mathematics, mathematical linguistics, formal language theory, semiotics, education, and other areas, with an impressive national and international impact and recognition, for several decades very actively involved in scientific and cultural life, and still as active as ever at his age. That is why the short presentation below is by no means complete; it is only a way to pay our homage to him at his 85th birthday anniversary.
We are honored by, and thank, Professor Solomon Marcus for having had the kindness to publish, in the first issue of this journal, an article occasioned by the Moisil centenary¹.

1 Biography and general data

Born on March 1, 1925, in Bacău, Romania. Elementary school and high school in Bacău; ranked first at the school-leaving examination ("bacalaureat") in 1944. Faculty of Science, Mathematics, University of Bucharest, 1945-1949. Assistant professor since 1950, lecturer since 1955, associate professor since 1964, and professor since 1966; professor emeritus since 1991, when he retired. All positions were held in the Faculty of Mathematics, University of Bucharest. PhD in mathematics in 1956 (with a thesis on "monotonous functions of two variables"), State Doctor in Sciences in 1968, both at the University of Bucharest. Corresponding member of the Romanian Academy since April 1993, and full member of the Romanian Academy (academician) since December 2001.

¹Grigore C. Moisil: A Life Becoming a Myth, International Journal of Computers, Communications & Control, 1(1), 73-79, 2006

Research and teaching in the fields of mathematical analysis, theoretical computer science, measure theory, general topology, linguistics, history and philosophy of mathematics, poetics, semiotics, and applications of mathematics to natural and social sciences. He has published over 50 books, as single author or in collaboration, and over 400 research articles, as well as several hundred articles on various cultural topics. He has edited over 40 collective volumes. More than 250 invited lectures at various universities in Europe, the Americas, and Asia.
In his papers and books he has solved many open problems in mathematical analysis, has formulated a still larger number of problems and research topics, and has introduced new ideas, notions and models, which were taken up and continued by a large number of researchers. Just one example: the contextual grammars he introduced in 1967-68 are now called Marcus contextual grammars and have been the subject of many papers and PhD theses, as well as of two monographs. He has had many students and collaborators, being a truly "contagious" professor and researcher, and is cited and followed by thousands of authors, from Romania and abroad. His books have been reviewed and commented on by many personalities, among whom we mention a few: Emanuel Vasiliu, Mike Holcombe, Sheng Yu, Yehoshua Bar-Hillel, Maurice Gross, L. Nebesky, Michel Janot, I.I. Revzin, Ferenc Kiefer, H.P. Edmundson, Ju.A. Sreider, Jacob L. Mey, Carlo Tagliavini, Gr.C. Moisil, Matei Călinescu, Nicolae Manolescu, Sorin Stati, K. Sgallova, P. Sgall, Gabriela Melinescu, Jean-Marie Klinkenberg, Virgil Nemoianu, Adrian Marino, Miron Nicolescu, Alexandru Ivasiuc, Wladimir Krysinski, Mircea Mihăieş, Şerban Cioculescu, Eugen Simion, Constantin Negoiţă, Tudor Octavian, Dinu Flamând, Stelian Tănase, Alin Teodorescu, Mircea Scarlat, Smaranda Vultur, Cristian Calude, Gheorghe Păun, Tudorel Urian, Paul Cernat, Irina Mavrodin, Rosa Del Conte, Jean-Pierre Descles, Umberto Eco, Marius Iosifescu, Roman Jakobson, Gheorghe Mihoc, Octav Onicescu, Cesare Segre, Gheorghe Tomozei, Iorgu Iordan, Constanţa Buzea, Cornel Ungureanu, Bogdan Suceavă, Basarab Nicolescu, Alexandru T. Balaban, Mihai Zamfir, and many others. Member of the editorial boards of over three dozen professional journals. Among the numerous honors received, we recall: Doctor Honoris Causa of the universities of Bacău, Constanţa, and Craiova. Vice-president of the International Association for Semiotic Studies (1989-1999). Honorary president of the Balkan Semiotic Society (since 2001).
Member of the Executive Committee of the International Association for Semiotic Studies. Honorary member of the International Association for Visual Semiotics. Honorary member of the Toronto Semiotic Circle, of the Como (Italy) Semiotic Circle, and of the Hungarian Semiotic Society. Honorary member of the Romanian Mathematical Society. Permanent invited professor, Sao Paulo Catholic University. Member of the Romanian National Mathematical Committee. Member of the leading council of the Romanian Linguistic Society. Member of the Romanian Union of Writers and member (1990-1995) of the leading council of this union. Vice-president of the Romanian Semiotic Society. Member of the Romanian Committee for the Club of Rome. Honorary member of the Society Eratosthene, Switzerland. He has also received many prizes, in Romania and abroad. On various occasions, special issues of journals and collective volumes were devoted to him. Here are some of them: the International Journal of Computer Mathematics (Great Britain) and Revue Roumaine de Mathématiques Pures et Appliquées devoted some of their 1985 issues to his 60th anniversary, while two collective volumes were dedicated to his 70th anniversary, in 1995: Mathematical Aspects of Natural and Formal Languages, World Scientific, Singapore, 1994, and Mathematical Linguistics and Related Topics, Ed. Academiei, Bucharest, 1995. In the year 2000, two volumes were dedicated to his 75th anniversary: Finite vs Infinite. An Eternal Dilemma, Springer, London, and Recent Topics in Mathematical and Computational Linguistics, Ed. Academiei, Bucharest. In the year 2005, volume 64, numbers 1-4 of Fundamenta Informaticae were published as a "special issue in honor of the 80th birthday of Professor Solomon Marcus", with papers by 70 authors from 16 countries.

2 Books, authored or co-authored

1. Lingvistica matematică. Modele matematice în lingvistică. Ed.
Didactică şi Pedagogică, Bucureşti, 1963.
2. Gramatici şi automate finite. Ed. Academiei, Bucureşti, 1964.
3. Analiza matematică. Vol. I. Ed. Didactică şi Pedagogică, Bucureşti, 1st edition 1962, 2nd edition 1963, 3rd edition 1966, 4th edition 1971, 5th edition 1980 (in collab. with M. Nicolescu and N. Dinculeanu).
4. Lingvistica matematică (2nd edition, revised and completed with 4 new chapters). Ed. Didactică şi Pedagogică, Bucureşti, 1966.
5. Introducere în lingvistica matematică. Ed. Ştiinţifică, Bucureşti, 1966 (in collab. with E. Nicolau and S. Stati).
6. Noţiuni de analiză matematică. Originea, evoluţia şi semnificaţia lor. Ed. Ştiinţifică, Bucureşti, 1967.
7. Limbaj, logică, filozofie. Ed. Ştiinţifică, Bucureşti, 1968 (in collab. with Al. Boboc, Gh. Enescu, C. Popa, and S. Stati).
8. Analiza matematică. Vol. II. Ed. Didactică şi Pedagogică, Bucureşti, 1st edition 1966, 2nd edition 1971, 3rd edition 1980 (in collab. with M. Nicolescu and N. Dinculeanu).
9. Introduction mathématique à la linguistique structurale. Dunod, Paris, 1967.
10. Algebraic Linguistics; Analytical Models. Academic Press, New York, 1967.
11. Poetica matematică. Ed. Academiei, Bucureşti, 1970.
12. Teoretiko-mnozestvennye modeli jazykov. Ed. Nauka, Moscow, 1970 (translation of the first five chapters of book 10 and of the last chapter of book 9).
13. Algebraicke modely jazyka. Ed. Academia, Prague, 1969 (Czech translation of book 4 and of a part of book 2).
14. Introduzione alla linguistica matematica. Casa Editrice Riccardo Patron, Bologna, 1970 (revised and completed translation of book 5; in collab. with E. Nicolau and S. Stati).
15. Mathematische Poetik. Ed. Academiei, Bucureşti – Athenäum-Verlag, Frankfurt am Main, 1973 (revised and completed translation of book 11).
16. Matematicka poetika. Ed. Nolit, Belgrade, 1974 (revised and completed Serbo-Croatian translation of book 11).
17. Din gândirea matematică românească. Ed. Ştiinţifică şi Enciclopedică, Bucureşti, 1975.
18. Semiotica folclorului. Abordare lingvistico-matematică. Ed. Academiei, Bucureşti, 1975 (co-author).
19. Matematicka analyza ctena podruhe. Ed. Academia, Prague, 1976 (revised and completed Czech translation of book 6).
20. A nyelvi szepseg matematikaja. Ed. Gondolat, Budapest, 1977.
21. Metode distribuţionale algebrice în lingvistică. Ed. Academiei, Bucureşti, 1977 (co-author).
22. La sémiotique formelle du folklore. Approche linguistico-mathématique. Ed. Klincksieck, Paris – Ed. Academiei, Bucureşti, 1978 (revised and completed translation of book 18; co-author).
23. Introduccion en la linguistica matematica. Ed. Teide, Barcelona, 1978 (revised and completed Spanish translation of book 5).
24. Semne despre semne. Ed. Ştiinţifică şi Enciclopedică, Bucureşti, 1979.
25. Contextual Ambiguities in Natural & Artificial Languages. Vol. 1, Communication and Cognition, Ghent, Belgium, 1981 (revised and completed translation of a part of book 21).
26. Snmeia gia ta snmeia. Ed. Pneumatikos, Athens, 1981 (Greek translation of book 24).
27. Metode matematice în problematica dezvoltării. Ed. Academiei, Bucureşti, 1982 (co-author).
28. Gândirea algoritmică. Ed. Tehnică, Bucureşti, 1982.
29. Semiotica matematică a artelor vizuale. Ed. Ştiinţifică şi Enciclopedică, Bucureşti, 1982 (coordinator and co-author).
30. Simion Stoilow. Ed. Ştiinţifică şi Enciclopedică, Bucureşti, 1983 (in collab. with Cabiria Andreian Cazacu).
31. Paradoxul. Ed. Albatros, Bucureşti, 1984.
32. Timpul. Ed. Albatros, Bucureşti, 1985.
33. Artă şi ştiinţă. Ed. Eminescu, Bucureşti, 1986.
34. Analiza matematică. Vol. II, Univ. Bucureşti, 1986 (co-author).
35. To paradocso. Ed. Pneumatikos, Athens, 1986 (Greek version of book 31).
36. Şocul matematicii. Ed. Albatros, Bucureşti, 1987.
37. Moduri de gândire. Colecţia "Ştiinţa pentru toţi", Ed. Ştiinţifică şi Enciclopedică, Bucureşti, 1987.
38. Provocarea ştiinţei. Seria "Idei contemporane", Ed. Politică, Bucureşti, 1988.
39. Invenţie şi descoperire. Ed. Cartea Românească, 1989.
40. Analiza matematică. Materiale pentru perfecţionarea profesorilor de liceu III. Universitatea din Bucureşti, Facultatea de Matematică, Bucureşti, 1989 (co-author).
41. Dicţionar de analiză matematică. Ed. Ştiinţifică şi Enciclopedică, Bucureşti, 1989 (co-author).
42. Controverse în ştiinţă şi inginerie. Ed. Tehnică, Bucureşti, 1991.
43. Language, Logic, Cognition and Communication; A Semiotic, Computational and Historical Approach. Report 9/96, Grup de Recerca en Linguistica Matematica i Enginyeria del Llenguatge, Reports Universitat Rovira i Virgili, Tarragona, Spain, 1996.
44. Matematica, manual pentru clasa a IX-a, licee teoretice specializarea filologie (in collab. with Mihaela Singer). Ed. Sigma, Bucureşti, 1999; 2nd edition, 2000.
45. Matematica, manual pentru clasa a XII-a (in collab. with P. Alexandrescu, M. Radulescu, S. Radulescu). Ed. Paralela 45, Bucureşti – Piteşti, 2002.
46. Jocul ca libertate. Ed. Scripta (Colecţia Ludică), Bucureşti, 2003.
47. Mathematics in Romania. CUB Press 22, Baia Mare, 2004.
48. Întâlnirea extremelor. Scriitori în orizontul ştiinţei. Ed. Paralela 45, Bucureşti – Piteşti, 2005.
49. Paradigme universale. Ed. Paralela 45, Piteşti – Bucureşti, 2005.
50. Paradigme universale II. Pornind de la un zâmbet. Ed. Paralela 45, Piteşti – Bucureşti, 2006.
51. Paradigme universale III. Jocul. Ed. Paralela 45, Piteşti – Bucureşti, 2007.
52. Words and Languages Everywhere. Polimetrica Academic Publisher, Milano, 2007.
53. Solomon Marcus (protagonist): Educaţia în spectacol. Spandugino Publ. House, Bucureşti, 2010.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 2, pp. 238-250

Solving Problems in a Distributed Way in Membrane Computing: dP Systems

Gh. Păun, M.J.
Pérez-Jiménez

Gheorghe Păun
Institute of Mathematics of the Romanian Academy, PO Box 1-764, 014700 Bucureşti, Romania, and
Department of Computer Science and Artificial Intelligence, University of Sevilla, Avda. Reina Mercedes s/n, 41012 Sevilla, Spain
E-mail: gpaun@us.es

Mario J. Pérez-Jiménez
Department of Computer Science and Artificial Intelligence, University of Sevilla, Avda. Reina Mercedes s/n, 41012 Sevilla, Spain
E-mail: marper@us.es

Abstract: Although P systems are distributed parallel computing devices, no explicit way of handling the input in a distributed way has been considered in this framework so far. This note proposes a distributed architecture (based on cell-like P systems, with their skin membranes communicating through channels as in tissue-like P systems, according to specified rules of the antiport type), where parts of a problem can be introduced as inputs in various components and then processed in parallel. The respective devices are called dP systems, with the case of accepting strings called dP automata. The communication complexity can be evaluated in various ways: statically (counting the communication rules in a dP system which solves a given problem), or dynamically (counting the number of communication steps, the number of communication rules used in a computation, or the number of objects communicated). For each measure, two notions of "parallelizability" can be introduced. Besides (informal) definitions, some illustrations of these ideas are provided for dP automata: each regular language is "weakly parallelizable" (i.e., it can be recognized in this framework using a constant number of communication steps), and there are languages of various types with respect to the Chomsky hierarchy which are "efficiently parallelizable" (they are parallelizable and, moreover, are accepted faster by a dP automaton than by a single P automaton). Several suggestions for further research are made.
Keywords: membrane computing, P system, distributed computing, communication complexity, Chomsky hierarchy.

1 Introduction

P systems are by definition distributed parallel computing devices [11], [12], [17], and they can solve computationally hard problems in feasible time [13]; this efficiency, however, is achieved by a trade-off between space and time, based on the possibility of generating an exponential workspace in linear time by means of biologically inspired operations, such as membrane division and membrane creation. However, no class of P systems has been proposed where a hard problem can be solved in a distributed parallel way after splitting the problem into parts and introducing these subproblems into components of a P system which can work on the subproblems in parallel and produce the solution to the initial problem by interacting/communicating with each other (as in standard distributed computer science). In particular, no communication complexity, in the sense of [2], [9], [16], has been considered for P systems, in spite of the fact that computation (time) complexity is very well developed [13], and space complexity has also recently been investigated [14]. Some proposals towards a communication complexity of P systems were made in [1], but mainly related to the communication effort in terms of symport/antiport rules used in the so-called evolution-communication P systems of [5]. (Note that in communication complexity theory the focus is not on the time efficiency of solving a problem; rather, the parties involved in the computation receive portions of the input, in general distributed in a balanced manner, "as fair as possible" (this distribution introduces an inherent difficulty in handling the input), and then mainly the complexity of the communication the parties need in order to handle this input is investigated.)
This note tries to fill this gap, proposing a rather natural framework for solving problems in a distributed way, using a class of P systems which mixes ingredients already existing in various much-investigated types of P systems. Namely, we consider P systems with inputs, in two variants: (i) as in P automata [6], [10], where a string of symbols is recognized if those symbols are brought into the system from the environment and the computation eventually halts (it is important to note that the string is "read" during the computation, not before it), and (ii) in the usual manner of complexity investigations [13], where an instance of a decision problem is introduced in a P system in the form of a multiset of symbols (this operation takes no time; the computation starts after having the code of the problem inside), and the system decides that instance at the end of a computation which sends to the environment one of the special objects yes or no. Several such systems, regardless of their type, are put together in a complex system which we call a dP system (from "distributed P system"); the component systems communicate through channels linking their skin membranes, by antiport rules as in tissue-like P systems. When accepting strings by dP systems with P automata as components, the device is called a dP automaton. Such an architecture has already been used, with specific ingredients, for instance in investigations related to eco-systems, where "local environments" need to be delimited, with communication possibilities linking them; details can be found in the recent paper [4]. The way to use a dP system is obvious: a problem Q is split into parts q1, q2, ..., qn, which are introduced into the n components of the dP system (as in P automata or as in decision P systems); these n systems work separately on their subproblems and communicate with each other according to the skin-to-skin rules.
The solution to the problem Q is provided by the whole system (by halting, in the case of accepting strings; by sending out one of the objects yes or no; etc.). As in communication complexity [9], we require the problem to be distributed in a balanced way among the components of the dP system, i.e., in "as equal as possible" parts (an almost balanced way of distributing the input among two processors is also considered in [9], where no partner takes more than two thirds of the input, but this does not seem natural to extend to the general case of n processors). Several possibilities exist for defining the communication complexity of a computation. We follow here the ideas of [1] and introduce three measures: the number of steps of the computation in which a communication rule is used (such a step is called a communication step), the number of communication rules used during a computation, and the number of objects transferred among components (by communication rules) during a computation. All three measures are dynamically defined; we can also consider a static parameter, as in the descriptional complexity of Chomsky languages (see a survey in [8]), namely the number of communication rules in a dP system. A problem is said to be "weakly parallelizable" with respect to a given (dynamical) communication complexity measure if it can be split in a balanced way, introduced in the dP system, and solved using a number of communication steps bounded by a constant given in advance; a problem is "efficiently parallelizable" if it is weakly parallelizable and can be solved by a dP system in a more efficient way than by a single P system. More precise definitions are given in the next sections of the paper. Various possibilities exist, depending on the type of systems (communicating systems, e.g., based on symport/antiport rules, systems with active membranes, catalytic systems, etc.)
and the type of problem we consider (accepting strings, decision problems, numerical problems, etc.). In this note we only sketch the general formal framework and give an illustration for the case of accepting strings as in P automata. We show here only that all regular languages are weakly parallelizable (a single communication step suffices, hence weak parallelizability holds with respect to all three dynamical measures), and that there are regular, context-free non-regular, and context-sensitive non-context-free languages which are efficiently parallelizable with respect to the first two dynamical measures mentioned above (in view of the results in [9], there are linear languages which are not efficiently parallelizable with respect to the number of objects/bits communicated among components). If we use extended systems (a terminal alphabet of objects is available) and the communication channels among the components of a dP automaton are controlled, e.g., by states, as in [7], or created during the computation, as in [3], then the power of our devices increases considerably: all recursively enumerable languages are weakly parallelizable in this framework.
Many research problems remain to be explored, starting with precise definitions for given classes of P systems, continuing with the study of the usefulness of this strategy for solving computationally hard problems (which problems are weakly/efficiently parallelizable, and what is the speed-up obtained for them?), and ending with a communication complexity theory of dP systems, taking into account all the measures of complexity mentioned above (for the number of objects communicated among components, which corresponds to the number of bits considered in [9], we can transfer here the general results from communication complexity; note, however, that many papers in this area deal with 2-party protocols, while in our framework we want an n-party set-up, and that we are also interested in the time efficiency of the distributed and parallel way of solving a problem).

2 dP systems – a preliminary formalization

The reader is assumed familiar with the basics of membrane computing, e.g., from [11], [12], and of formal language theory, e.g., from [15]; hence we pass directly to introducing our proposal of a distributed P system. The general idea is captured in the following notion. A dP scheme (of degree n ≥ 1) is a construct Δ = (O, Π1, ..., Πn, R), where:

1. O is an alphabet of objects;
2. Π1, ..., Πn are cell-like P systems with O as the alphabet of objects and the skin membranes labeled with s1, ..., sn, respectively;
3. R is a finite set of rules of the form (si, u/v, sj), where 1 ≤ i, j ≤ n, i ≠ j, and u, v ∈ O*, with uv ≠ λ; |uv| is called the weight of the rule (si, u/v, sj).

The systems Π1, ..., Πn are called components of the scheme Δ, and the rules in R are called inter-component communication rules. Each component can take an input, work on it, communicate with other components (by means of rules in R), and provide the answer to the problem at the end of a halting computation.
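The dP-scheme definition can be mirrored in a small data structure. The following Python sketch is illustrative only: the class names, the encoding of multisets as tuples of object names, and the `validate` helper are our assumptions, not part of the paper.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class CommRule:
    """Inter-component communication rule (s_i, u/v, s_j) of a dP scheme."""
    src: int    # index i of the component with skin s_i
    u: tuple    # multiset u sent from component i, as a tuple of objects
    v: tuple    # multiset v received from component j
    dst: int    # index j of the component with skin s_j

    @property
    def weight(self) -> int:
        return len(self.u) + len(self.v)   # |uv|, the weight of the rule

@dataclass
class DPScheme:
    objects: set      # alphabet O
    components: list  # the cell-like P systems Pi_1, ..., Pi_n (opaque here)
    rules: list       # the finite set R of inter-component rules

    def validate(self) -> None:
        n = len(self.components)
        for r in self.rules:
            assert 0 <= r.src < n and 0 <= r.dst < n and r.src != r.dst
            assert r.u or r.v                       # uv != lambda
            assert set(r.u) | set(r.v) <= self.objects

# A degree-2 scheme with the single rule (s1, c1/c2, s2), of weight 2
scheme = DPScheme({"a", "b", "c", "d", "c1", "c2", "#"},
                  ["Pi1", "Pi2"],
                  [CommRule(0, ("c1",), ("c2",), 1)])
scheme.validate()
print(scheme.rules[0].weight)   # 2
```

The components are kept opaque here on purpose: the definition constrains only the shared alphabet, the skin labels, and the shape of the rules in R.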
(A delicate issue can appear in the case of components which can send objects to the environment and bring objects in from the environment; this happens, for instance, for symport/antiport P systems. In this case we have to decide whether the components can exchange objects by means of the environment, or whether the only permitted communication is by means of the rules in R. For instance, a "local environment" for each component can be considered, disjoint from the "local environments" of the other components, thus preventing the interaction of components by means of rules other than those in R. Actually, the rules in R themselves can be defined between these "local environments", which is a variant worth exploring. We point out here that the need for a "local environment" has also appeared in the applications of membrane computing to eco-system investigations; see [4] and its references.) Now, we can particularize this notion in various ways, depending on the type of the systems Πi, 1 ≤ i ≤ n, and the type of problems we want to solve. For instance, we can define dP systems with active membranes, as dP schemes as above with the components being P systems with active membranes, each of them having a membrane designated as the input membrane. Given a decision problem (consider, e.g., SAT for n variables and m clauses), we can split a given instance of it into parts which are encoded as multisets and introduced in the components of the dP system. For example, we can introduce the code of each separate clause in a separate component of the dP system. The components start to work, each one deciding its clause, and in the end they communicate the result to each other; if the components find that all m clauses are satisfied, then the whole SAT formula is satisfied.
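The clause-per-component idea can be illustrated outside the P-system formalism. The sketch below is our own illustration, not the paper's construction: clauses of a CNF instance are distributed over n "components", each component checks only its own clauses, and the results are combined by conjunction; the loop over candidate assignments stands in for the shared information the components would have to agree on.

```python
from itertools import product

def split_clauses(clauses, n):
    """Distribute clauses over n components, round-robin."""
    return [clauses[i::n] for i in range(n)]

def component_accepts(part, assignment):
    # A clause is a list of literals; literal x > 0 means variable x,
    # x < 0 means its negation.
    return all(any((lit > 0) == assignment[abs(lit)] for lit in clause)
               for clause in part)

def distributed_sat(clauses, num_vars, n=3):
    parts = split_clauses(clauses, n)
    # The components must agree on a single assignment; this shared
    # information is what inter-component communication has to carry.
    for bits in product([False, True], repeat=num_vars):
        assignment = dict(enumerate(bits, start=1))
        if all(component_accepts(p, assignment) for p in parts):
            return True
    return False

# (x1 or x2) and (not x1 or x3) and (not x2 or not x3)
print(distributed_sat([[1, 2], [-1, 3], [-2, -3]], 3))  # True
```

Each component's check is local and cheap; the expensive part is precisely the agreement on a common assignment, which is the communication cost discussed next.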
Intuitively, this is faster than deciding the formula by means of a single P system with active membranes, but a crucial aspect has been neglected above: in order to say that the formula is satisfied, all m clauses should be satisfied by the same truth-assignment, and this supposes that the m components also communicate to each other which assignment turns their clauses true. That is, besides the usual time complexity of solving the problem, we now have to consider the cost of communication among the components, and the trade-off between these two parameters should be estimated. Another interesting case, which will be briefly investigated in the subsequent section, is that of accepting strings in the sense of P automata [6], [10]; we will come back to this case shortly. On the other hand, we have several possibilities for estimating "the cost of communication", and we adapt here the ideas from [1]. Let us consider a dP system Δ, and let δ: w0 ⟹ w1 ⟹ ... ⟹ wh be a halting computation in Δ, with w0 being the initial configuration. Then, for each i = 0, 1, ..., h-1 we can write:

ComN(wi ⟹ wi+1) = 1 if a communication rule is used in this transition, and 0 otherwise;
ComR(wi ⟹ wi+1) = the number of communication rules used in this transition;
ComW(wi ⟹ wi+1) = the total weight of the communication rules used in this transition.

These parameters can then be extended in the natural way to computations, results of computations, systems, and problems/languages. We consider below the case of accepting strings (by L(Δ) we denote the language of strings accepted by Δ). For ComX ∈ {ComN, ComR, ComW} we define:

ComX(δ) = Σ_{i=0}^{h-1} ComX(wi ⟹ wi+1), for δ: w0 ⟹ w1 ⟹ ... ⟹ wh a halting computation;
ComX(w, Δ) = min{ComX(δ) | δ is a computation in Δ which accepts the string w};
ComX(Δ) = max{ComX(w, Δ) | w ∈ L(Δ)};
ComX(L) = min{ComX(Δ) | L = L(Δ)}.
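The three dynamical measures are easy to compute once a halting computation is recorded. In the sketch below (data layout is our assumption), a computation is a list of steps, each step being the list of communication rules used in that transition, with each rule represented just by its weight |uv|.

```python
def com_n(steps):
    """Number of communication steps: transitions using >= 1 comm rule."""
    return sum(1 for rules in steps if rules)

def com_r(steps):
    """Total number of communication-rule applications."""
    return sum(len(rules) for rules in steps)

def com_w(steps):
    """Total weight of the communication rules used."""
    return sum(sum(rules) for rules in steps)

# A 5-step computation: communication rules (given by their weights)
# are used only in the second and fourth transitions.
trace = [[], [2], [], [2, 3], []]
print(com_n(trace), com_r(trace), com_w(trace))  # 2 3 7
```

Since every rule used has weight at least 1, ComN(δ) ≤ ComR(δ) ≤ ComW(δ) holds on any such trace, matching the inequality noted for these measures.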
Similar definitions can be considered for more general decidability problems than accepting strings, and then complexity classes can be defined. We do not enter into details for this general case here; in the next section we will briefly consider the specific case of dP automata and of languages. The approach sketched above should be investigated in more detail. What is the speed-up for a given problem or class of problems? Clearly, ComN(α) ≤ ComR(α) ≤ ComW(α), for all valid α. Moreover, in one communication step one can use arbitrarily many communication rules, which can therefore move arbitrarily many objects from one component to another. Anyway, independently of the communication cost, presumably only a linear speed-up can be obtained by splitting the problem into a given number of parts. Are there problems which nevertheless cannot be solved in this framework faster than by using a single P system (with active membranes), provided that the communication cost is bounded (e.g., using communication rules in R only a constant number of times)? What is the communication complexity of a given problem or class of problems? Finding suggestive examples can be a first step in approaching such issues. A case study will be considered in the next section, not for dP systems with active membranes (which, we believe, deserve a separate and detailed examination), but for a distributed version of P automata.
3 dP automata

We now consider the distributed version of P automata [6], [10], which are symport/antiport P systems that accept strings: the sequence of objects (because we work with strings and symbol objects, we use the terms "object" and "symbol" interchangeably) imported by the system from the environment during a halting computation is the string accepted by that computation (if several objects are brought into the system at the same time, then any permutation of them is considered as a substring of the accepted string; a variant, considered in [6], is to associate a symbol to each multiset and to build a string from such "marks" attached to the imported multisets). The accepted string can be introduced in the system symbol by symbol, in the first steps of the computation (if the string is of length k, then it is introduced in the system in the first k steps of the computation; the P automaton is then called initial), or in arbitrary steps. Of course, the initial mode is more restrictive, but we do not enter into details here. As a kind of mixture of the ideas in [6] and [10] for defining the accepted language, we can consider extended P automata, that is, with a distinguished alphabet of objects T whose elements are taken into account when building the accepted string (the other objects taken by the system from the environment are ignored). Here, however, we work with non-extended P automata. A dP automaton is a construct Δ = (O, E, Π1, ..., Πn, R), where (O, Π1, ..., Πn, R) is a dP scheme, E ⊆ O (the objects available in arbitrarily many copies in the environment), and Πi = (O, μi, wi,1, ..., wi,ki, E, Ri,1, ..., Ri,ki) is a symport/antiport P system of degree ki (without an output membrane), with the skin membrane labeled (i,1) = si, for all i = 1, 2, ..., n.
A halting computation with respect to Δ accepts the string x = x1x2...xn over O if the components Π1, ..., Πn, starting from their initial configurations, using the symport/antiport rules as well as the inter-component communication rules in the non-deterministically maximally parallel way, bring from the environment the substrings x1, ..., xn, respectively, and eventually halt. The dP automaton is synchronized: a universal clock exists for all components, marking the time in the same way for the whole dP automaton. It is also important to note that we work here in the non-extended case, where all input symbols are recorded in the string. In this way, at most context-sensitive languages can be recognized. The three complexity measures ComN, ComR, ComW defined in the previous section can be directly introduced for dP automata (indeed, they were formulated above for this case). With respect to them, we can consider two levels of parallelizability. A language L ⊆ V* is said to be (n, m)-weakly ComX parallelizable, for some n ≥ 2, m ≥ 1, and X ∈ {N, R, W}, if there is a dP automaton Δ with n components and a finite subset FΔ of L such that each string x ∈ L − FΔ can be written as x = x1x2...xn, with ||xi| − |xj|| ≤ 1 for all 1 ≤ i, j ≤ n, each component Πi of Δ takes as input the string xi, 1 ≤ i ≤ n, and the string x is accepted by Δ by a halting computation δ such that ComX(δ) ≤ m. A language L is said to be weakly ComX parallelizable if it is (n, m)-weakly ComX parallelizable for some n ≥ 2, m ≥ 1. Two conditions are important here: (i) the string is distributed in equal parts, modulo one symbol, to the components of the dP automaton, and (ii) the communication complexity, in the sense of the measure ComX, is bounded by the constant m. We have said nothing so far about the length of the computation; that is why we also introduce a stronger version of parallelizability.
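The balanced-distribution requirement (consecutive parts whose lengths differ by at most one) can be sketched directly; the helper name below is ours, for illustration.

```python
def balanced_split(x, n):
    """Cut x into n consecutive parts x1...xn with ||xi| - |xj|| <= 1."""
    q, r = divmod(len(x), n)
    parts, pos = [], 0
    for i in range(n):
        size = q + (1 if i < r else 0)  # the first r parts get one extra symbol
        parts.append(x[pos:pos + size])
        pos += size
    return parts

parts = balanced_split("acacbdbd", 2)
print(parts)                                        # ['acac', 'bdbd']
assert max(map(len, parts)) - min(map(len, parts)) <= 1
```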
A language L ⊆ V* is said to be (n, m, k)-efficiently ComX parallelizable, for some n ≥ 2, m ≥ 1, k ≥ 2, and X ∈ {N, R, W}, if it is (n, m)-weakly ComX parallelizable and there is a dP automaton Δ such that

lim_{x∈L, |x|→∞} timeΠ(x) / timeΔ(x) ≥ k,

for all P automata Π such that L = L(Π) (timeΓ(x) denotes here the smallest number of steps needed by the device Γ to accept the string x). A language L is said to be efficiently ComX parallelizable if it is (n, m, k)-efficiently ComX parallelizable for some n ≥ 2, m ≥ 1, k ≥ 2. Note that in the case of dP automata the duration of a computation may also depend on the way the string is split into substrings and introduced in the components of the system; naturally, one of the most efficient distributions of the string and a shortest computation are chosen. Of course, the larger the constant k, the better. Moreover, while timeΔ(x) is simply given by the construction of a suitable dP automaton Δ, timeΠ(x) must be estimated with respect to all P automata Π. An example is worth considering in order to illustrate this definition. Let us examine the dP system in Figure 1; the alphabet of objects is O = {a, b, c, d, c1, c2, #}, and E = {a, b}. Clearly, component Π1 (on the left-hand side of the figure) can only bring objects a, c inside, and component Π2 (on the right-hand side of the figure) can only bring objects b, d inside. In each step, only one of a, c, alternately, enters Π1, and only one of b, d, alternately, enters Π2 (note that we do not need the objects c, d to be present initially in the environment, while one copy each of a and b is sufficient). The computation of each component can stop only by "hiding" the "carrier objects" c, d inside an inner membrane, and this means releasing c1 in Π1 and c2 in Π2.
If these objects are not released at the same time in the two components, so that the exchange rule (s1, c1/c2, s2) can be used, then, because of the maximal parallelism, the object c1 must enter membrane (1,3) and the object c2 must enter membrane (2,3); in each case the trap-object # is released, and the computation never stops: the object # oscillates forever across membrane (1,2) in Π1 and across membrane (2,2) in Π2. Consequently, the two strings accepted by the two components of ∆ must have the same length, that is, the language accepted by the system is L(∆) = {(ac)^s (bd)^s | s ≥ 0}. Note the crucial role played here by the fact that the system is synchronized, and that a computation which accepts a string x_s = (ac)^s (bd)^s, hence of length 4s, lasts 2s + 2 steps (2s steps for bringing objects inside, one step when objects c, d are introduced in an inner membrane, and one inter-component communication step), with one of these steps being a communication between components. Obviously, if we recognize a string x_s = (ac)^s (bd)^s as above by means of a usual symport/antiport P system, then, because no two symbols of the string can be interchanged, no two adjacent symbols can be introduced in the system at the same step, hence the computation lasts at least as many steps as the length of the string, that is, 4s. This shows that our language is not only (2,r)-weakly ComX parallelizable, but also (2,r,2)-efficiently ComX parallelizable, for (r, X) ∈ {(1,N), (1,R), (2,W)}. This conclusion is worth formulating as a theorem.

244 Gh. Păun, M.J. Pérez-Jiménez

[Figure 1: an example of a DP automaton. Component Π1 has membranes s1, (1,2), (1,3), with c1 and # inside (1,2), and rules (c,out; a,in), (a,out; c,in), (c1,out; c,in), (#,out; c1,in), (#,in), (#,out); component Π2 has membranes s2, (2,2), (2,3), with c2 and # inside (2,2), and rules (d,out; b,in), (b,out; d,in), (c2,out; d,in), (#,out; c2,in), (#,in), (#,out); the inter-component rule is (s1, c1/c2, s2).]

Theorem 1. The language L = {(ac)^s (bd)^s | s ≥ 0} is efficiently ComX parallelizable, for all X ∈ {N, R, W}.
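The membership condition and the step counts used in the argument above can be restated in a few lines of code (a sketch; the helper names are ours, and the step counts simply encode the 2s + 2 versus 4s analysis from the text):

```python
def in_language(x):
    """Membership test for L = {(ac)^s (bd)^s | s >= 0}."""
    if len(x) % 4 != 0:
        return False
    s = len(x) // 4
    return x == "ac" * s + "bd" * s

def dp_steps(s):
    # 2s steps to bring the symbols in, one step hiding c/d in an
    # inner membrane, one inter-component communication step
    return 2 * s + 2

def sequential_steps(s):
    # a usual symport/antiport P system reads at most one symbol per step
    return 4 * s

assert in_language("acacbdbd") and not in_language("acbdacbd")
# the speed-up ratio 4s / (2s + 2) approaches 2 as s grows
assert sequential_steps(1000) / dp_steps(1000) > 1.99
```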
Note that this language is not regular (but it is linear, hence also context-free). The previous construction can be extended to DP automata with three components: Π1 inputs the string (ac)^s, Π2 inputs (bd)^s, and Π3 inputs (ac)^s; then Π1 produces the object c1, Π2 produces two copies of c2, and Π3 produces the object c3. Now, c1 is exchanged for one copy of c2 from Π2 and c3 for the other copy, otherwise the computation never stops. The recognized language is {(ac)^s (bd)^s (ac)^s | s ≥ 0}. This language is not context-free, hence we have:

Theorem 2. There are context-sensitive non-context-free languages which are efficiently ComX parallelizable, for all X ∈ {N, R, W}.

The previous two theorems show that distribution, in the form of DP systems, is useful from the time complexity point of view, although only one communication step is performed and only one communication rule is used at that step. Moreover, the proofs of the two theorems show that, in general, languages consisting of strings with two well-related halves (but not containing "too much" information in each half of the string, besides the length) are weakly parallelizable, and, if no two adjacent symbols of the strings can be interchanged, then these languages are efficiently parallelizable. We have said nothing above about regular languages; this is the subject of the next section.

4 All regular languages are weakly parallelizable

The assertion in the title of this section corresponds to Theorem 2.3.5.1 in [9], which states that for each regular language there is a constant k which bounds its (2-party) communication complexity. The version of this result in terms of weak ComX parallelizability is shown by the following construction. Consider a non-deterministic finite automaton A = (Q, T, q0, F, P) (set of states, alphabet, initial state, final states, set of transition rules, written in the form qa → q′, for q, q′ ∈ Q, a ∈ T).
Without any loss of generality, we may assume that all states of Q are reachable from the initial state (for each q ∈ Q there is x ∈ T* such that q0 x ⇒* q with respect to the transition rules in P). We construct the following DP automaton:

∆ = (O, E, Π1, Π2, R), where:
O = Q ∪ T ∪ {d} ∪ {(q, q′) | q, q′ ∈ Q} ∪ {⟨q, qf⟩ | q ∈ Q, qf ∈ F} ∪ {⟨q⟩ | q ∈ Q},
E = O − {d},
Π1 = (O, [s1 [1,2 ]1,2 ]s1, q0, λ, E, Rs1, R1,2),
Rs1 = {(q, out; q′a, in) | qa → q′ ∈ P} ∪ {(q, out; ⟨q′⟩a, in) | qa → q′ ∈ P},
R1,2 = {(⟨q⟩, in), (⟨q⟩, out) | q ∈ Q},
Π2 = (O, [s2 ]s2, d, E, Rs2),
Rs2 = {(d, out; (q, q′)a, in) | qa → q′ ∈ P, q ∈ Q}
   ∪ {((q, q′), out; (q, q′′)a, in) | q′a → q′′ ∈ P, q ∈ Q}
   ∪ {((q, q′), out; ⟨q, qf⟩a, in) | q′a → qf ∈ P, q ∈ Q, qf ∈ F},
R = {(s1, ⟨q⟩/⟨q, qf⟩, s2) | q ∈ Q, qf ∈ F}.

The first component analyzes a prefix of a string in L(A); the second component analyzes a suffix of a string in L(A), first guessing a state q ∈ Q from which the automaton starts its work. At some moment, the first component stops bringing objects inside by taking from the environment a symbol ⟨q′⟩ for some q′ ∈ Q, reached after parsing the prefix of the string in L(A). This object will pass repeatedly across the inner membrane of Π1. The second component can stop either if a state q′ is reached in the automaton A for which no rule q′a → q′′ exists in P (and then ∆ never stops, because its first component never stops), or after reaching a state in F, hence introducing an object of the form ⟨q, qf⟩ for some qf ∈ F. Note that q is the state chosen initially and always stored in the first position of the objects (q1, q2) used by Π2. The computation can halt only by using a communication rule from R, and this is possible only if q = q′, that is, the first component has reached the state of A from which the second component started its work. Consequently, the concatenation of the two strings introduced in the system by the two components is a string from L(A). Thus, the language L(A) is weakly parallelizable.
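The state-guessing idea behind this construction (though not the symport/antiport machinery itself) can be sketched directly, assuming the NFA's transition rules are given as (state, symbol, state) triples:

```python
# Component 1 parses a prefix from the initial state; component 2 guesses a
# start state q and parses the suffix to a final state; the string is
# accepted iff component 1 ends exactly in the guessed state q.

def parses(P, q, w):
    """States reachable from q after reading w with transition rules P."""
    states = {q}
    for a in w:
        states = {q2 for q1 in states for (s, b, q2) in P if s == q1 and b == a}
    return states

def dp_accepts(Q, q0, F, P, x1, x2):
    for q in Q:                              # guess made by component 2
        if q in parses(P, q0, x1):           # component 1 reaches q on the prefix
            if parses(P, q, x2) & set(F):    # component 2 reaches a final state
                return True
    return False

# NFA for (ab)*: transitions as (state, symbol, state)
P = [("q0", "a", "q1"), ("q1", "b", "q0")]
assert dp_accepts({"q0", "q1"}, "q0", {"q0"}, P, "ab", "ab")
assert not dp_accepts({"q0", "q1"}, "q0", {"q0"}, P, "ab", "ba")
```

The single exchange of ⟨q′⟩ for ⟨q, qf⟩ in the membrane construction plays exactly the role of the equality check between the reached and the guessed state.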
Now, consider a regular language such that no two adjacent symbols in a string can be permuted (take an arbitrary regular language L over an alphabet V and a morphism h : V* → (V ∪ {c})*, where c is a symbol not in V, such that h(a) = ac for each a ∈ V). Then, clearly, if the two strings accepted by the two components of the DP automaton ∆ are of equal length (note that the strings of h(L) are of even length), the time needed by ∆ to accept the whole string is (about) half of the time needed by any P automaton Π which accepts the same language. This proves that the language h(L) is efficiently parallelizable, hence we can state:

Theorem 3. Each regular language is weakly ComX parallelizable, and there are efficiently ComX parallelizable regular languages, for all X ∈ {N, R, W}.

Of course, faster DP automata can be constructed if we use more than two components. However, it is not clear whether DP automata with n + 1 components are always faster than DP automata with n components; this might depend on the structure of the considered language (remember that the distribution of the input string to the components of the DP automaton must be balanced). More specifically, we expect that there are (n,m)-weakly parallelizable languages which are not, e.g., (n+1,m)-weakly parallelizable; similar results are expected for efficiently parallelizable languages. A natural question is how much the result in Theorem 3 can be extended. For instance, is a similar result true for the linear languages, or for bigger families of languages? According to Theorem 2.3.5.4 in [9], this is not true for the measures ComR and ComW: the recognition of context-free languages (actually, the language L_R at page 78 of [9] is linear) already has the highest communication complexity (in 2-party protocols), a linear one with respect to the length of the string.
Thus, the number of communication rules used by a DP automaton during a computation cannot be bounded by a constant. The case of measure ComN remains to be settled: is it possible to have computations with a bounded number of communication steps, but with these steps using an unbounded number of rules? We conjecture that even in this case, languages of the form {x mi(x) | x ∈ {a, b}*}, where mi(x) is the mirror image of x (such a language is minimally linear, i.e., it can be generated by a linear grammar with only one nonterminal), are not weakly ComN parallelizable. Many other questions can be raised in this framework. For instance, we can consider families of languages: (n,m)-weakly ComX parallelizable, weakly ComX parallelizable, (n,m,k)-efficiently ComX parallelizable, and efficiently ComX parallelizable. Which are their properties: interrelationships and relationships with families in the Chomsky hierarchy, closure and decidability properties, hierarchies on various parameters, characterizations and representations, etc.? Then, there is another possibility of interest, suggested already above: the static complexity measure defined as the cardinality of R, the set of communication rules. There is a substantial theory of descriptional complexity of (mainly context-free) grammars and languages, see [8], which suggests a lot of research questions starting from ComS(∆) = card(R) (with "S" coming from "static") and extended to languages in the natural way (ComS(L) = min{ComS(∆) | L = L(∆)}): hierarchies, decidability of various problems, the effect of operations with languages on their complexity, etc.

5 The power of controlling the communication

In the previous sections the communication rules were used like any other rule of the system, non-deterministically choosing the rules to be applied, with the communication rules competing for objects with the inner rules of the components, and observing the restriction of maximal parallelism.
However, we can distinguish the two types of rules, "internal evolution rules" (transition rules, symport/antiport rules, rules with active membranes, etc.) and communication rules. Then, as in [1], we can apply the rules according to a priority relation, with priority for evolution rules, or with priority for communication rules. Moreover, we can place various types of controls on the communication channel itself. For instance, because the communication rules are antiport rules, we can associate promoters or inhibitors with them, as used in many places in membrane computing. A still more natural regulation mechanism is to associate states with the channels, as in [7]. In this case, the communication rules associated with a pair (i, j) of components Πi, Πj are of the form (q, u/v, q′), where q, q′ are elements of a given finite set Q of states; initially, the channel is assumed to be in a given state q0. A rule as above is applied only if the channel is in state q and the antiport rule (i, u/v, j) can be applied; after exchanging the multisets u, v between the two components Πi, Πj, the state of the channel is changed to q′. An important decision must be made concerning the parallelism. In [7], the channel rules are used in the sequential mode, but we can also consider two types of parallelism: (i) choose a rule and use it as many times as made possible by the objects in the two components, or (ii) apply at the same time all rules of the form (q, u/v, q′) for various u and v (but with the same q and q′), in the non-deterministically maximally parallel way. In the result discussed below, either of these two possibilities works, and the result is somewhat surprising:

Theorem 4. Any recursively enumerable language L is (2,2)-weakly ComN and ComR parallelizable and has ComS(L) ≤ 2, with respect to extended DP automata with channel states.
We do not formally prove this assertion here, but only describe the (rather complex, if we cover all details) construction of a suitable DP automaton. Take a recursively enumerable language L ⊆ T+, for some T = {a1, a2, ..., an}. For each string w ∈ T+, let val_{n+1}(w) be the value of w when considered as a number in base n + 1, using the digits a1, a2, ..., an interpreted as 1, 2, ..., n, without also using the digit zero. We extend the notation to languages in the natural way: val_{n+1}(L) = {val_{n+1}(w) | w ∈ L}. Clearly, L is recursively enumerable if and only if val_{n+1}(L) is recursively enumerable, and the passage from strings w to numbers val_{n+1}(w) can be done in terms of P automata (extended symport/antiport P systems are universal, hence they can simulate any Turing machine; this is one of the places where we need to work with extended systems, as we need copies of a and b (see below) to express the values of strings, and such symbols should be taken from the environment without being included in the accepted strings). Construct now a DP automaton ∆ with two components, Π1 and Π2, working as follows. The component Π1 receives as input a string w1 ∈ T* and Π2 receives as input a string w2 ∈ T*, such that w1w2 must be checked for membership in the language L. Without loss of generality, we may assume that |w1| ∈ {|w2|, |w2| + 1} (we can choose a balanced distribution of the two halves of the string). In the beginning, the state of the channel between the two components is q0. Both components start to receive the input symbols, one in each time unit; the component Π1 transforms the string w1 into val_{n+1}(w1) copies of a symbol a, and Π2 transforms the string w2 into val_{n+1}(w2) copies of a symbol b. When this computation is completed in Π1, a special symbol, t, is introduced. For this symbol, we provide the communication rule (q0, t/λ, q1), whose role is to change the state of the channel. We also consider the rule (q1, a/λ, q2).
Using it in the maximally parallel way, all symbols a from Π1 are moved to Π2 in one communication step. Because we have taken w1 to be at least as long as w2, and we also need two steps for "opening" the channel and for moving the symbols a across it, we are sure that at this moment Π2 contains, besides the val_{n+1}(w1) copies of a, the val_{n+1}(w2) copies of b. The second component now takes these copies of a and b and computes val_{n+1}(w1w2), for instance as the number of copies of an object c. After that, Π2 checks whether or not val_{n+1}(w1w2) ∈ val_{n+1}(L). If the computation halts, then the string w1w2 is accepted: it belongs to the language L. Note that the DP automaton ∆ contains two communication rules (hence ComS(L) ≤ 2) and that each computation contains two communication steps (hence ComN(L) ≤ 2), with only one rule used in each step (hence ComR(L) ≤ 2). These observations complete the proof of the theorem. Of course, ComW(∆) = ∞. (Similarly, if we define ComR taking into account the multiplicity of using the rules, then ComR can also be considered infinite; hence the assertion in the theorem remains to be stated only for the measure ComN.) Instead of changing channel states as above, we can assume that the channel itself switches from "virtual" to "actual", as in population P systems [3]: the channel is created by the object t produced by Π1, and then used for moving a from Π1 to Π2 by a usual communication rule (which, by definition, is used in the maximally parallel way). Anyway, the conclusion of this discussion is that the results we obtain crucially depend on the ingredients we use when building our DP systems (as well as on the chosen definitions for complexity measures and types of parallelizability).
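The valuation val_{n+1} at the heart of this construction, and the base-arithmetic identity that lets Π2 combine the two halves, can be sketched as follows (helper names are ours; symbols are single characters for simplicity):

```python
def val(w, alphabet):
    """val_{n+1}(w): read w as a number in base n+1, with the i-th
    alphabet symbol standing for digit i+1 (the digit 0 is never used)."""
    n = len(alphabet)
    digit = {a: i + 1 for i, a in enumerate(alphabet)}
    v = 0
    for a in w:
        v = v * (n + 1) + digit[a]
    return v

# with T = {x, y} we work in base 3, with digits x -> 1, y -> 2
assert val("xy", ["x", "y"]) == 1 * 3 + 2

# the two-component composition used by the DP automaton:
# val(w1 w2) = val(w1) * (n+1)^|w2| + val(w2)
w1, w2, n = "xyx", "yy", 2
assert val(w1 + w2, ["x", "y"]) == \
    val(w1, ["x", "y"]) * (n + 1) ** len(w2) + val(w2, ["x", "y"])
```

Because the digit 0 is never used, distinct strings receive distinct values, so membership of w1w2 in L reduces exactly to membership of the combined count in val_{n+1}(L).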
6 Closing remarks

The paper proposes a rather natural way (using existing ingredients of membrane computing, bringing no new, purposely invented machinery into the stage) of solving problems in a "standard" distributed manner (i.e., splitting problems into parts, introducing them into various component "computers", and constructing the solution through the cooperation of these components) in the framework of membrane computing. So-called DP schemes/systems were defined, and two notions of parallelizability were proposed and briefly investigated for the case of DP automata (accepting strings). A lot of problems and research topics were suggested. The reader can also imagine further problems, for instance, transferring to this area notions and questions from communication complexity theory [9], considering other types of P systems (what about spiking neural P systems, where we have only one type of object and no antiport-like rules for communicating among components?), maybe using unsynchronized P systems, non-linearly balanced input, and so on. We are convinced that DP systems are worth investigating.

Note. During the recent Brainstorming Week on Membrane Computing, February 1-5, 2010, Sevilla, Spain, several comments about the definitions and the results of this paper were made, especially by Erzsébet Csuhaj-Varjú, György Vaszil, Rudolf Freund, and Marian Kögler. Several continuations of this paper are now in preparation; the interested reader is requested to check the bibliography from [17], in particular, the Brainstorming proceedings volume.

Acknowledgements

This work is supported by Proyecto de Excelencia con Investigador de Reconocida Valía, de la Junta de Andalucía, grant P08-TIC 04200. Useful discussions with Henry Adorna are gratefully acknowledged.

Bibliography

[1] H. Adorna, Gh. Păun, M.J. Pérez-Jiménez: On communication complexity in evolution-communication P systems. Manuscript, 2009.

[2] L. Babai, P. Frankl, J.
Simon: Complexity classes in communication complexity. Proc. 27th Annual Symp. Found. Computer Sci., 1986, 337-347.

[3] F. Bernardini, M. Gheorghe: Population P systems. J. Universal Computer Sci., 10, 5 (2004), 509-539.

[4] M. Cardona, M.A. Colomer, A. Margalida, I. Pérez-Hurtado, M.J. Pérez-Jiménez, D. Sanuy: A P system based model of an ecosystem of some scavenger birds. Membrane Computing. Proc. WMC10, Curtea de Argeş, 2009 (Gh. Păun et al., eds.), LNCS 5957, Springer, 2010, 182-195.

[5] M. Cavaliere: Evolution-communication P systems. Membrane Computing. Proc. WMC 2002, Curtea de Argeş (Gh. Păun et al., eds.), LNCS 2597, Springer, Berlin, 2003, 134-145.

[6] E. Csuhaj-Varjú: P automata. Membrane Computing. Proc. WMC5, Milano, 2004 (G. Mauri et al., eds.), LNCS 3365, Springer, Berlin, 2005, 19-35.

[7] R. Freund, Gh. Păun, M.J. Pérez-Jiménez: Tissue-like P systems with channel-states. Theoretical Computer Sci., 330, 1 (2005), 101-116.

[8] J. Gruska: Descriptional complexity of context-free languages. Proc. Symp. on Mathematical Foundations of Computer Science, MFCS, High Tatras, 1973, 71-83.

[9] J. Hromkovič: Communication Complexity and Parallel Computing: The Application of Communication Complexity in Parallel Computing. Springer, Berlin, 1997.

[10] M. Oswald: P Automata. PhD thesis, TU Vienna, 2003.

[11] Gh. Păun: Membrane Computing. An Introduction. Springer, Berlin, 2002.

[12] Gh. Păun, G. Rozenberg, A. Salomaa, eds.: Handbook of Membrane Computing. Oxford University Press, 2010.

[13] M.J. Pérez-Jiménez: A computational complexity theory in membrane computing. Membrane Computing. Proc. WMC10, Curtea de Argeş, 2009 (Gh. Păun et al., eds.), LNCS 5957, Springer, 2010, 125-148.

[14] A.E. Porreca, A. Leporati, G. Mauri, C. Zandron: Introducing a space complexity measure for P systems. Intern. J. Computers, Communications and Control, 4, 3 (2009), 301-310.

[15] G. Rozenberg, A.
Salomaa, eds.: Handbook of Formal Languages. 3 volumes, Springer, Berlin, 1998.

[16] A.C. Yao: Some complexity questions related to distributed computing. ACM Symposium on Theory of Computing, 1979, 209-213.

[17] The P Systems Website: www.ppage.psystems.eu.

Gheorghe Păun graduated from the Faculty of Mathematics, University of Bucharest, in 1974 and received his Ph.D. from the same university in 1977. Since 1990 he has been a senior researcher at the Institute of Mathematics of the Romanian Academy. He has repeatedly visited numerous universities in Europe, Asia, and North America. His main research areas are formal language theory and its applications, computational linguistics, DNA computing, and membrane computing; this last research area was initiated by him in 1998, and the respective models are now called P systems (see http://ppage.psystems.eu). He has published a large number of research papers, has lectured at many universities, and has given numerous invited talks at recognized international conferences. He has published eleven monographs in mathematics and computer science, has (co)edited over seventy collective volumes and special issues of journals, and has also published many popular science books, books on recreational mathematics (games), and fiction books. He is a member of the editorial boards of more than a dozen international journals and was/is involved in the program/steering/organizing committees of many recognized conferences and workshops. In 1997 he was elected a member of the Romanian Academy, and since 2006 he has been a member of Academia Europaea. He has also received other honors, in Romania and abroad. He is an ISI Highly Cited Researcher (see http://isihighlycited.com/).

Mario J. Pérez-Jiménez received his degree in mathematics from the University of Barcelona and his doctoral degree in mathematics from the University of Sevilla. In the past, he was an associate professor at the University of Barcelona.
Currently, he is full professor of computer science and artificial intelligence at the University of Sevilla, where he is the head of the Research Group on Natural Computing. His main research fields are computational complexity theory, natural computing, membrane computing, bioinformatics, and computational modelling for systems biology and population biology. He has published twelve books in computer science and mathematics, and over 200 scientific papers in international journals (collaborating with many researchers worldwide). He is the main researcher in various European, Spanish and Andalusian research grants. He has been an independent expert for the evaluation of NEST (New and Emergent Science and Technology) proposals under the Sixth Framework Programme of the European Community, and since May 2006 he has been a European Science Foundation peer reviewer.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 961-967

Secure Data Retention of Call Detail Records

F. Vancea, C. Vancea, D. Popescu, D. Zmaranda, G. Gabor

Florin Vancea, Codruţa Vancea, Daniela Elena Popescu, Doina Zmaranda, Gianina Gabor
University of Oradea, Romania
410087 Oradea, St. Universităţii 1
E-mail: {fvancea,cvancea,depopescu,zdoina,gianina}@uoradea.ro

Abstract: In today's world communication relies heavily on electronic means, both for voice and other native data. All these communication sessions leave behind journaling information by the very nature of the underlying services. This information is both sensitive with respect to users' rights and important for law enforcement purposes, so proper storage and retrieval techniques have to be taken into consideration. The paper discusses such techniques in relation with recent EU recommendations and suggests some methods for achieving good performance while preserving the required security levels.

Keywords: call detail records, secure retention, row chaining.
1 Introduction

During the last years we have seen an increasing interest in the secure storage and retrieval of journaling archives of various transactions. The possibility of fraud or inappropriate usage of resources has led to laws and regulations regarding the storage of transaction history for phone calls, SMS, e-mail traffic and other forms of communication. Specifically, the European Parliament and the Council of the European Union have adopted a directive (2006/24/EC) [1] which gives guidelines for the retention of data collected from publicly available communication services or public communication networks, in order to ensure that the data is available for the investigation, detection and prosecution of serious crime. Usually, the devices that perform the actual communication function generate logging data that can identify the peers, time attributes and other relevant details of the transaction. These logs have traditionally been used for billing and eventually for statistical purposes by the network operators themselves. Since this information was stored anyway by the operators, the law enforcement agencies could make use of it for specific purposes within a frame more or less regulated by each country's laws. Currently, many states require the voice and data operators to store these logs for a well-defined period of time and to present them in a timely manner when legally requested by various law-enforcement entities. In order for this information to be useful for investigation, and furthermore for prosecution, there is an implicit requirement that the information contained in the logs cannot be tampered with. Even if the operators should not (or actually may not) capture and store the actual content of the communication, there are also increasing concerns about the privacy of the service users related to their communication partners.
Wholesale collection of user communication actions and habits can be viewed as a breach of privacy, so strong protection against casual browsing or unauthorized intentional usage of this information should be provided. When combining the long-term storage and efficient retrieval requirements with privacy, non-repudiation, integrity, performance and cost requirements, the task of data retention becomes non-trivial.

Copyright © 2006-2010 by CCC Publications

2 System structure, main requirements and challenges

The communication network operator has to manage during normal operation several pieces of information about each communication session. At the bare minimum this information is used to establish communication channels and to ensure correct traffic flow, but it may also be stored and used for billing purposes. We will call such information about one user session a call detail record (CDR), as this is the name commonly used by voice service providers. For simplicity we will also name the communication session a call, but all reasoning below also applies to data sessions (e-mail, Internet sessions). The records originate in the network equipment (central switch, access infrastructure), which has a limited ability to store data, so periodically the records are transferred into database systems where they are processed (e.g. for billing) and finally discarded. If CDR retention is desired, the records are also stored for a rather extended period of time in some sort of semi-persistent storage, until lawfully required through a query or until the retention period expires.

[Figure 1: CDR data flow and data retention. Records flow from the network equipment to the billing process, after which the data is discarded; in parallel they enter the CDR retention system, which serves legitimate queries.]

For the purpose of this discussion we will not consider the billing path, because this is part of the existing business model of the operator and is already well established and less likely to accept any change.
User data may be exposed there, but this is an old threat, which is supposedly already handled by the operator's procedures. Furthermore, the lifecycle of user-sensitive data there is relatively short (one month or at most a couple of months) and the impact of integrity loss is at most financial to the operator, but not deeply legal to the user. The CDR data can be attacked:

• before entering the retention system;
• while stored in the retention system;
• during query;
• after supposed deletion.

We will consider that the attacks performed before entry into the retention system have a low risk, because the data stays in transit between network equipment and storage for a short period of time, the path is internal to the operator (thus reasonably secure) and, more important, the attacks are likely to take place only a reasonably long time after the call is placed (otherwise the attacker would have refrained from making the call at all, or would have attacked the network infrastructure directly to obtain an untraceable call). The attacks on CDR data during the query may be:

• impersonating authorized entities;
• query alteration;
• result alteration;
• result disclosure.

All of those can be stopped by proper signing and encryption of the request and the result. The attacks after supposed deletion are disclosure attacks and would attempt to extract protected information from media used to store CDR data in the retention system. Encrypting the data is of course the main protection method, but ciphertext-extensive attacks may exist which exploit the very large amount of encrypted data. To avoid these attacks, write-once media should be destroyed properly and rewritable media should be properly erased after the retention time expires.
We will focus on data stored in the retention system, because the CDR data stays there a rather long time (at least 6 months, according to the EU recommendation) and the attacks may be either against the confidentiality or against the integrity of the data. The CDR retention system has to take steps to prevent at least the following attack types:

• unauthorized disclosure of call detail, either for a particular call, a particular originator or a particular termination;
• alteration of call detail (originator, termination, start/end time, other details);
• complete removal of one call or of all calls matching a particular originator, termination or time interval;
• complete denial of service against the retention system.

When evaluating the potential solutions we should also consider the size of the problem. One reasonably large operator may easily originate or terminate in excess of 10M calls per day (probably at least one order of magnitude more in the case of data services). At a minimum, a CDR will contain the originating subscriber number or identifier (at least 10 characters), the termination or forwarding subscriber number (at least 10 characters), the starting timestamp and the duration (about 6 characters). To cover all situations (international dial numbers), additional space has to be reserved. In the case of mobile services additional location data is required, and data services may have longer identification tokens. We therefore estimate the raw CDR to require at least 32 bytes of storage, leading to a minimum figure of 320 MB of raw data to be stored for each day of traffic. This extends to about 10 GB monthly and 120 GB yearly. The above figures may appear small by today's standards in memory and storage capacity, but they are absolute minimum estimates of raw data.
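These figures follow from simple arithmetic (the 30-day month and 182-day half-year are our rounding choices; the tenfold overhead for encryption and indexing is the paper's own extrapolation):

```python
calls_per_day = 10_000_000   # one reasonably large operator
bytes_per_cdr = 32           # minimal raw record size estimated in the text

daily = calls_per_day * bytes_per_cdr
assert daily == 320_000_000              # 320 MB of raw data per day

monthly = daily * 30
yearly = daily * 365
assert round(monthly / 1e9, 1) == 9.6    # about 10 GB monthly
assert round(yearly / 1e9) == 117        # about 120 GB yearly

# with a tenfold overhead for protection and indexing, six months of data:
six_months = daily * 182 * 10
assert round(six_months / 1e12, 2) == 0.58   # the 500 GB order of magnitude
```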
In a working system there should be protection, which is obviously introduced by encryption, and a working system should offer some sort of direct random access to the data, preferably optimized like an indexed database query. Both features have storage overhead, which we consider to be tenfold, extrapolating from previous experience. In the end the retention system will probably have to deal securely with about 500 GB of data for a 6-month retention period, on the assumption of 10M calls per day. The above estimate may still appear small by comparison to a large business platform, but we should keep in mind that the CDR retention system is a non-profitable investment for the operator, so one cannot expect it to use significant resources in terms of purchased hardware or software. Fortunately, the criteria for common queries are limited: by originator identifier (directly or indirectly mapped through user identity), by termination identifier and by time interval.

3 Potential solutions

According to the problem description, we should figure out how to store the data and how to efficiently retrieve the required CDR according to the typical query criteria.

3.1 Direct WORM storage

One obvious approach is to use WORM media, given that the write-once feature will provide tampering protection. This approach will not automatically solve the confidentiality issue, so some additional measures have to be taken. The main disadvantage of passive WORM media is capacity. Considering the above estimates, plain passive WORM media is not sufficient unless the time span for data written on one support is short. The time span should also be as short as possible, because data is protected only after writing, and data in transit will be vulnerable to integrity attacks. Short time spans on one support will translate into many media units for the entire retention period, making the query process difficult and time-consuming.
the speed of passive worm media is also limited, affecting query performance. a better-performing approach is to use special active worm devices. these are special storage units presenting the worm feature at the access port (ethernet, scsi) but backed by conventional magnetic storage. the worm feature is protected by a dedicated interface and firmware, and the whole device is tamper-proof. this kind of device offers good protection but is usually expensive, and its main use case is append-to-log. our goal is to map database-like search capabilities over the protected data, and the restricted append-to-log primitive is not friendly to common database engines and their storage management. the lesser version of active worm is a hybrid software system that achieves the write-once feature by modification of the operating system block drivers or by a custom operating system [2]. the protection it provides may be reasonable, but far weaker than that offered by true passive optical media. this hybrid alternative fails to be database-friendly, too. all the solutions based mainly on worm devices amount to periodically storing the cdr logs and sequentially scanning them for each query. this is not very efficient, especially considering that all the data that is sequentially scanned will be encrypted for confidentiality purposes.
3.2 database backed by worm storage
a slightly better solution would be to use a true database for query purposes and a passive worm reference for result validation. the cdr data would both be stored in the database and consolidated at a reasonably short interval to worm storage (passive media should be adequate). the initial query would be performed over the regular database, and the integrity of the database itself might be checked against hash checks stored on worm media. the solution is fragile and may not provide either the required level of integrity or the desired level of performance. some dbms (e.g.
oracle) offer the possibility to split the storage segments over read-write and read-only media. in such a simple scenario, the retention time window would be split into several fragments, out of which only the last one resides on read-write storage. of course, management of the splitting would be rather complicated, and there are certain limitations in place which would restrict the reasonable unsafe fragment to at least one week. the solution has the disadvantage of poor query performance for time frames located on read-only media, but the main disadvantage is that the unsafe fragment cannot be protected even if conventional row-level encryption is applied. little can be done to actually prevent the rows in the read-write partition from being maliciously created, changed or deleted.
secure data retention of call detail records 965
security mechanisms at database level do exist, but we are looking for an additional layer of trust, which would rule out any database authority from tampering with the data. by using encryption at row level one can prevent row changing and unauthorized row creation [3]. efficient query is possible by index-scan over plaintext time attributes or by index-scan over pre-encrypted source or target fields. unfortunately, row deletion can always be performed at some authority level without even being detected.
figure 2: row content is protected but rows can be deleted (each row holds symmetrically encrypted source, target, start time and duration fields plus a row-level signature)
3.3 database with cross-linked rows backed by worm storage
an improved solution is to use the results of our previous work, namely row chaining [4] [5]. besides symmetric encryption for confidentiality protection and a row-level signature for row integrity and authenticity protection, we use a chaining scheme that links each row to the previous one.
the scheme can detect row deletion unless a whole block of rows at the end is collectively deleted. this could be solved by building the chain both ways, at the cost of revisiting previous records; given the volume and the dynamics of the data, we consider that a single chain provides enough protection.
figure 3: rows protected by chaining cannot be deleted (each encrypted, signed row also carries a chaining value s' linking it to the previous row)
using the chaining scheme has the great advantage of relieving us from consulting the worm storage on each query. the cdr data can now reside entirely in plain read-write database storage, as its integrity is now protected by chains and signatures. when a query is presented, the results are quickly determined at regular database speed, and then the result set is validated by walking the chain one or two steps for each record. the worm storage is still required in case tampering is detected in any form, and as a backup to avoid denial-of-service attacks (complete corruption of the database). the above scheme has, however, a flaw and can be improved. one can maliciously alter the database indexes to skip certain records, and those records will never be returned in the result set, even if they are present in the database and the storage structure is still cryptographically intact. all checks will succeed, even for queries returning rows adjacent to the masked ones. the solution is to build the chains not on subsequent rows as they are added to the database, but on rows belonging to the same source or the same target. this means that the database will contain a chain of rows for each source and a chain of rows for each target. there is a new efficiency issue here, because we need two chaining fields, one for the source and one for the target.
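the chaining idea can be sketched with a keyed hash over each row plus the previous row's signature; the key handling, field layout and choice of hmac below are illustrative assumptions, not the exact scheme of [4] [5]:

```python
import hashlib
import hmac
import json

KEY = b"demo-signing-key"  # hypothetical; in practice held by a trusted signer

def sign_row(row: dict, prev_sig: str) -> str:
    """chain a row to the previous row: the signature covers the row fields
    and the previous signature, so deleting an interior row breaks
    verification of its successor."""
    payload = json.dumps(row, sort_keys=True).encode() + prev_sig.encode()
    return hmac.new(KEY, payload, hashlib.sha256).hexdigest()

def verify_chain(rows: list[dict], sigs: list[str]) -> bool:
    prev = ""
    for row, sig in zip(rows, sigs):
        if not hmac.compare_digest(sign_row(row, prev), sig):
            return False
        prev = sig
    return True

# toy rows for one source; field names are invented for the example
rows = [{"source": "a", "target": t, "start": s, "dur": 60}
        for t, s in [("b", 100), ("c", 200), ("d", 300)]]
sigs, prev = [], ""
for r in rows:
    prev = sign_row(r, prev)
    sigs.append(prev)

assert verify_chain(rows, sigs)
# deleting the middle row is detected:
assert not verify_chain(rows[:1] + rows[2:], sigs[:1] + sigs[2:])
```

per-source and per-target chains would be built the same way, with two chaining fields per row, so that a masked row breaks verification of its neighbours in both chains.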
the issue of protecting the last record in the chain also becomes more sensitive, because the open ends of the chains live longer. however, undetected deletion of a row requires that it be at the open end of both its source and its target chain, which is unlikely to hold for long in regular traffic.
4 aggregation attack and countermeasures
so far we have assumed that encrypting the source and target fields offers appropriate protection against disclosure of the actual participants in a call. unfortunately, in order to benefit from indexed search, a particular source or target identifier always has to be encrypted with the same key, yielding the same encrypted value. the key does not have to be the same for all encryptions (it should actually be varied over the source-target population), but once an identifier is encrypted in the system it should keep the same ciphertext representation. this invariance property, required for indexing, opens a path for aggregation attacks. say the attacker has access to the database (either online or in archived form) and is looking for source a, but does not know the key. actually, it does not need to. if the attacker knows that a has placed calls at some particular moments in time, it can aggregate over the data looking for that particular call pattern. once it finds it, all past and future cdrs involving a (at least as source) are accessible. the required number of such fixed points is not very high, as was previously proven for aggregation attacks on blinded statistical data. the success of the attack depends on the traffic volume around the known time points and on the number of time points available. the attacker may even optimize its chances by explicitly placing calls to a at low-traffic time intervals or, if the attacker is a itself, by making calls at such well-chosen moments.
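the attack can be illustrated with a toy data set (all tokens and timestamps below are invented): the attacker intersects, for each known call time, the set of ciphertext source tokens active at that moment, and a's recurring ciphertext is quickly isolated:

```python
# illustrative aggregation attack against deterministic field encryption.
# cdr entries are (encrypted_source_token, start_time); the attacker does
# not know the key, only that target "a" placed calls at certain moments.
cdr = [
    ("tok1", 100), ("tok2", 100), ("tok3", 100),
    ("tok2", 500), ("tok4", 500),
    ("tok2", 900), ("tok5", 900),
]
known_call_times = [100, 500, 900]  # moments the attacker knows a called

candidates = None
for t in known_call_times:
    seen = {src for src, start in cdr if start == t}
    # intersect candidate sets; a's token is in every one of them
    candidates = seen if candidates is None else candidates & seen

print(candidates)  # {'tok2'} - a's ciphertext token is isolated
```

with real traffic the per-timestamp sets are larger, but as the text notes, a handful of fixed points at low-traffic moments already shrinks the intersection to one token.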
to provide some level of protection against this kind of attack we have to dilute the resolution the attacker has in the time domain when building a candidate set based on a given timestamp. when the attacker has a timestamp, it can effectively build the set of sources or destinations that placed or received a call at that moment in time. if the timestamp is, say, 11:00:00 am, the set will likely be large (high traffic at busy hours), but if the timestamp is 03:00:00 am the set will be considerably smaller. the time resolution dilution has to be bought by sacrificing performance at some level. we store in the start time field a down-rounded timestamp, and in an additional encrypted field the offset of the actual start time. this reduces the resolution of time searches and entails some sequential processing to achieve exact query results. however, since the query typically also specifies the source or the target, the performance penalty will be small (it will use the source or target indexes). on the other hand, the attacker has only the degraded start time to work with. to further refine the protection, the rounding should be made according to traffic patterns: lower-traffic intervals should be larger, and higher-traffic intervals may be smaller. this helps reduce the performance degradation mentioned before, and can be achieved by a hamming-like algorithm applied once in a while to determine the optimal rounding intervals. even when using the rounding method to protect against data aggregation, special care should be taken with respect to the data storage pattern. all records have to be added incrementally by starting timestamp (or possibly end timestamp), because that is the way they come from the network device logs.
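a minimal sketch of the dilution step, assuming hypothetical fixed bucket boundaries (the text proposes deriving them from traffic statistics instead, wider at night and narrower at busy hours):

```python
# time-resolution dilution: store a down-rounded start time in clear
# (usable for indexing) and keep the exact offset only in an encrypted
# field. boundaries below are invented seconds-of-day values.
BOUNDARIES = [0, 5 * 3600, 7 * 3600, 22 * 3600, 24 * 3600]

def dilute(start_sec: int) -> tuple[int, int]:
    """return (rounded_start, offset); only the offset would be encrypted."""
    for lo, hi in zip(BOUNDARIES, BOUNDARIES[1:]):
        if lo <= start_sec < hi:
            return lo, start_sec - lo
    raise ValueError("timestamp outside day range")

rounded, offset = dilute(3 * 3600 + 123)  # a call at 03:02:03 am
print(rounded, offset)  # 0 10923 - the stored start is just "night bucket"
```

an attacker reading the clear field only learns the bucket; exact-time queries decrypt the offset for the (typically small) candidate set already narrowed by the source or target index.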
if the rounding algorithm performs, for example, the rounding at 23:00, 01:00 and 05:00, the attacker may place a call shortly after 01:00 am and look for records after, but in close proximity to, the start time switch from 23:00 to 01:00. the solution is to randomly shift the rounding boundaries each day by at least 10% of the theoretical interval size.
5 conclusions and future work
designing and implementing a system which provides secure data retention of call detail records is not a trivial task. there are several inter-related issues, ranging from storage methods to data protection by encryption, and from efficient query to safe implementation. unfortunately, we have to note that in its current form, secure data retention has little driving force. the operators that manage the data have little to gain by properly implementing such a system, and proper implementation is not necessarily cheap. the laws mandating the presence of such systems are rather vague on the required security and performance levels. the potential direct users have no direct means to encourage proper (read: fast) implementations, and the real owners of the call data (the subscribers) also have little means to encourage proper (read: secure) implementations. we have, however, tried to outline within the limited size of this paper some of the requirements and challenges related to such a system, together with means to achieve a safe system with good performance. of course, each link in the chain makes its own important contribution to the overall strength, and many details are left uncovered. some directions may be: the proper protocol to be implemented between the retention system and the authorized partner requesting a query, a proper and efficient protocol for destruction of cdr data at the end of the retention period, implementation details on the chaining and on the time dilution algorithms, and general implementation notes regarding secure split-role administration.
some of our future work may be related to these directions.
bibliography
[1] ***, directive 2006/24/ec of the european parliament and of the council of 15 march 2006, official journal of the european union, l105/54, 13.04.2006
[2] y. wang, y. zheng, fast and secure worm storage systems, proceedings of the ieee security in storage workshop (sisw), pages 11-19, 2003
[3] b. schneier, applied cryptography, john wiley & sons, 1996
[4] f. vancea, c. vancea, practical security issues for a real case application, int. j. of computers, communications and control 3(s), pages 11-16, 2008
[5] f. vancea, c. vancea, protecting data integrity with chained rows and public key cryptography. comments on a real case, proceedings of rsee 2008
international journal of computers, communications & control vol. ii (2007), no. 1, pp. 56-65
advancing electronic assessment
nikolaos doukas, antonios andreatos
abstract: a computer-aided assessment system is presented that has been designed to produce and deliver tests to the hellenic air force academy students and assess their performance. the system is called e-xaminer and is intended for use in both undergraduate courses and distance learning post-graduate programs of the academy. the e-xaminer uses meta-language concepts to automatically generate tests, based on parametrically designed questions. tests intended for different students may entail differences in the arithmetic parameters. additionally, different tests may be composed from different but equivalent and randomly chosen sub-questions. the system may also present each student with a scrambled sequence of the same questions, as a counter-measure against cheating. examinations are delivered via a web-based interface; an automatically generated program marks the answers submitted by each student.
e-xaminer allows the implementation of question parameterisation and counter-cheating measures, so that electronic tests become significantly different from, and more powerful than, traditional ones. sample problems are presented which show the additional features of the e-xaminer, intended to facilitate the work of the course organiser in issuing and marking the tests, as well as in combating cheating. this paper focuses on some new, advanced types of questions enabled by electronic assessment; it then compares paper-and-pencil exams to electronic exams; results from a small student poll on the electronic exams are also presented. finally, the directions for planned future work are outlined.
keywords: e-xaminer, computer-assisted assessment, domain specific languages, hafa.
1 introduction
1.1 the revolution of e-learning
e-learning may be defined as the technology and the academic services related to the teaching and learning processes [1]. it is a broad term covering the whole range of earlier educational applications, such as computer based training (cbt) and web based training (wbt), as well as more recent technologies such as lms, virtual classrooms or labs and digital collaboration [2]. historically, the rise of e-learning may be located in the decade 1990-2000, a period characterised by a boom in information & communication technologies (ict) and the invention and evolution of the web. for better exploitation of the above technologies, standards are being developed which will allow the interoperability of various platforms and the re-use of educational material. the first such standards appeared in 2001. the above evolution, along with other reasons, not only led to the foundation of many open universities around the world [3], but also pushed many traditional universities to offer distance learning courses [1].
it is estimated that, as far as continuing education in higher education institutions is concerned, distance learning will grow ten times faster than on-campus learning over the next ten years [4]. according to burns, “up to 45 percent of college and university enrolment is from adult learners, many of whom sign up for distance learning classes rather than on-campus classes. revenues for continuing education rose 67% at responding institutions since the previous survey in 2004. the trend is expected to grow distance learning 10 times faster than campus classes over the next decade. the growth in distance learning is driven by the growth of interactive marketing” [4]. the web has been considered a means of education and knowledge since its early days. today it is widely used for educational purposes due to its worldwide spread and penetration, and because it supports many ways of representing information [5].
copyright c© 2006-2007 by ccc publications, selected paper from icvl 2006
the recent proliferation of distance learning (delivered by open universities worldwide, as well as by traditional institutions) has encouraged many higher education organisations to develop software for supporting this type of learning. a well-known such package is claroline [6], which is freeware and supports 32 languages. with the expansion of e-learning, the need for electronic examinations has become more imperative. a significant number of computer aided assessment (caa) systems has appeared as a result of the work of both academic institutions and commercial companies. examples of such systems are given in [6], [7], [8], [9], [10], [11] and [12]. these systems have already been extensively tested and are being widely used.
1.2 the merits of electronic testing
in ref.
[13], the public-private coalition known as the ‘partnership for 21st century skills’ gives a vision of how students should be prepared to face the challenges of the 21st century. within this report, the benefit of using technology to give immediate feedback on student assessments is underlined. electronic testing has been accused of putting non-technology students at a disadvantage when they are forced to use a keyboard to type their answers, rather than writing them on paper [14]. in response, public and private sector experts state in [13] that 21st century literacy is much more than the basic computer skills required for typing an answer. it is pointed out that the new tools for learning, including computers, telecommunications and audio or video based media, are critical enablers of learning in a number of realms, even for subjects that have nothing to do with technology. they hence conclude that there is a need for assessment tools that measure those essential skills that will never be captured by traditional tests. while the value of traditional testing (like portfolio assessment) as a means for classroom-level assessment of individual students is not questioned, electronic tests provide rapid feedback on student performance that can be compared across classrooms or schools [13]. furthermore, computer based training can enable the evolution of novel concepts that instructors would never have contemplated delivering a few years ago [15]. the authors present a training platform which automatically designs courses that not only adapt to the students' capabilities and previous knowledge, but also dynamically adjust the contents according to the students' performance during the progress of the course. the statistical results presented show that computer delivered courses are extremely efficient in promoting learning.
an automated computer-assisted assessment (caa) system has already been presented [12], [16], which was designed for the needs of the hellenic air force academy (hafa). this system is called e-xaminer, a title which emphasises the fact that it extends e-learning into the field of examinations. the principal reasons which dictated the development of a new system from scratch, in order to cover the needs of hafa, were the following: (1) the limited number of types of questions supported by existing systems. (2) the intention of hafa to experiment with cheating countermeasures, so that the final system could eventually be used in its distance learning program [17]. (3) most existing systems do not support the greek language, while teaching in hafa is done exclusively in greek. this fact creates some special requirements that needed to be met. in ref. [16] we have presented the nine different problem types and partial credit possibilities supported by the e-xaminer so far, and we have briefly discussed the main advantages of caa. in ref. [12] we have presented the dsl approach adopted by the e-xaminer, cheating countermeasures and some experimental results obtained from the pilot application of the e-xaminer.
1.3 making the difference
multiple choice tests suffer from the drawback that examinees may acquire free marks by taking lucky guesses [18]; a way to exploit electronic exams in order to overcome this problem is also proposed there. given the evolution of web technologies since then, the need to make electronic exams significantly different is currently more pronounced. another challenge for the design of electronic examination systems is to make exams more pleasant for students. students are naturally reluctant to sit exams, since exams are a source of stress for them. on the contrary, they are keen to spend long periods of time in front of the computer screen, surfing the internet.
the question is hence posed: if exams are delivered using a friendly web interface, can they become a less stressful experience? this paper focuses on the implementation of test parameterisation and counter-cheating measures in the e-xaminer, which distinguish e-xams from paper-and-pencil exams and make them a better tool for promoting learning; it also lists some empirical results and presents a comparison between conventional paper-and-pencil exams and electronic exams given by means of the e-xaminer (and therefore referred to as ‘e-xams’). the way in which questions are parameterised is presented first. examples of parametric questions are shown. counter-measures against exam-time cheating are listed, and the way in which these have been implemented in the new system is described. the new system has been used for mid-term tests of students of all four years of the academy, for all specialisations. statistical results from these tests are given. finally, conclusions are drawn and the directions for the planned future work are presented.
2 design of the system
2.1 system architecture
the architecture of the e-xaminer is depicted in figure 1. the system operates as follows. the instructor sets examination questions and model answers using a domain specific language (dsl) [19]. these files are given as input to the core of the e-xaminer, which produces two programs: the examination agent and the marking agent. the examination agent is installed on an appropriate http server and is the only part of the system that is ever exposed to a public terminal. the examinees open the ‘e-xam’ pages and answer the test questions by filling in forms, using their favorite web browser. student answers are stored and finally passed as input to the marking agent, which runs on the instructor's terminal (e.g. a laptop computer only temporarily connected to the student network). a marking report and statistics are produced.
figure 1: architecture of the e-xaminer
2.2 the examination procedure
figure 2 depicts the examination procedure. the correspondence to the system architecture described earlier is straightforward. it is interesting to note, however, that it is possible for the instructor to reiterate the process with a revised marking scheme. a marking (grading) policy is implicitly determined by the instructor and is included in his/her marking scheme. the marking scheme may include alternative answers that carry different marks. for example, the correct answer to the question “what is the maximum value that can be represented in an n-bit unsigned binary number” is 2^n − 1. the instructor may however decide to grant students partial credit for the (wrong) answer 2^n. this iterative procedure, which would have been far more tedious for a paper-and-pencil test, represents a significant advantage of e-assessment. as already pointed out, the main drawback of existing systems, as far as their use by hafa is concerned, was the fact that they supported only a limited number of question types; furthermore, these questions were often fixed (the same for all students). it was required that the new system support parameterisation. additionally, the whole process needed to be automated. by using the e-xaminer, the entire class is examined simultaneously, but each student sees a different worksheet [12].
figure 2: flowchart of the examination procedure
2.3 question parameterisation
apart from the innovative types of questions, a key feature of the e-xaminer is question parameterisation. it was required that the system be able to receive as input from the course instructor a skeleton question and produce a series of different questions from it. the process of defining and designing such skeleton questions is the question parameterisation incorporated in the e-xaminer [12].
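the partial-credit idea described above (full marks for 2^n − 1, partial credit for the common mistake 2^n) can be sketched as follows; the scheme format and function names are invented for illustration and are not the e-xaminer's actual marking dsl:

```python
# a marking scheme maps accepted answers to credit; re-marking with a
# revised scheme needs no new exam sitting, only a re-run over stored answers
def make_marker(scheme: dict[str, float]):
    def mark(answer: str) -> float:
        return scheme.get(answer.strip(), 0.0)
    return mark

n = 8
scheme = {str(2**n - 1): 1.0,   # correct answer: 2^n - 1
          str(2**n): 0.5}       # common mistake 2^n earns partial credit
mark = make_marker(scheme)
print(mark("255"), mark("256"), mark("17"))  # 1.0 0.5 0.0

# the instructor later revises the scheme and re-marks the stored answers:
stricter = make_marker({str(2**n - 1): 1.0})
print(stricter("256"))  # 0.0
```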
in order to implement this parameterisation, our system employs the concept of domain specific languages, or dsl [19]. a skeleton question can hence be defined by its textual parts, a series of parameters that it requires and a rule which is used for assembling the final question. an additional parameterisation for the entire test is question order scrambling, i.e., the randomisation of the order in which the questions in the test of a particular student are arranged. this randomisation may be extended to the arrangement of sub-questions within a particular question. two examples of parameterised questions which were used during the past academic year (2005-6) are given below. the first question was designed to examine a chapter on binary arithmetic. each student was given six different random numbers and was required to perform a number of operations on the numbers assigned to them. these operations were selected at random for each student from a set of predefined ones (binary to decimal conversion, decimal to binary conversion, addition, subtraction and negation). the setup of this question eliminated any chance that a student might copy from a colleague sitting next to them. the second question was designed as part of a course on computer networks [20]. the skeleton of the question was the following: “a company owns a set of class b ip addresses. the company has x lans with an increase rate of y% during the next 10 years. each lan connects w computers, with an increase rate of z% during the next 10 years. which is the best way of sharing the class b host bits between subnets and computers?”. all students sitting this examination had to answer this question, but each student had a different set of values for the x, y, z and w parameters. these values were automatically assigned by the e-xaminer.
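a minimal sketch of how such a skeleton question might be instantiated per student; the template handling, value ranges and per-student seeding below are assumptions for illustration, not the e-xaminer's actual dsl:

```python
import random

# skeleton question: fixed text, named parameters, and a rule (here a
# seeded random draw) that assembles one variant per student
SKELETON = ("a company owns a set of class b ip addresses. the company has "
            "{x} lans with an increase rate of {y}% during the next 10 years. "
            "each lan connects {w} computers, with an increase rate of {z}% "
            "during the next 10 years. which is the best way of sharing the "
            "class b host bits between subnets and computers?")

def instantiate(seed: int) -> str:
    rng = random.Random(seed)  # per-student seed -> reproducible worksheet
    params = {"x": rng.randint(10, 50), "y": rng.choice([10, 20, 30]),
              "w": rng.randint(20, 200), "z": rng.choice([10, 20, 30])}
    return SKELETON.format(**params)

q = instantiate(seed=7)
assert instantiate(seed=7) == q  # same seed reproduces the same worksheet
print(q[:60])
```

seeding by student identifier keeps each worksheet different across the class yet reproducible at marking time.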
tests delivered during the pilot application phase were just automated versions of what a paper-and-pencil test would have looked like. in order to better exploit the potential of the caa web interface, the decision was made to include various innovative features in the tests. the first such feature was the electronic processing of an (electronic) document, linked to the e-xam by a simple hyperlink. the question read as follows [20]: “given the specification of http 1.1 [rfc 2616] in electronic format, answer the following questions:
1. what is the signalling mechanism used to terminate a persistent connection?
2. who may terminate such a connection? (a) the client, (b) the server, (c) both of them
3. what does an error code 415 signify?
4. what encryption services are provided by http?”
rfc 2616 is a 176-page document describing the http standard. students are obviously not asked to memorise this text, and it is impractical to copy the entire document for each student for the purpose of the exam. they are however expected to be familiar with such documents and able to extract information from them using various text/word processing tools (such as ‘search’ or ‘find’). testing of this skill is only possible via electronic exams. these questions may further be combined with parameterisation, in order to produce a different error code for each student. the next planned test requires the students to use some special services available on the internet, get the answer in their browser and copy it into their ‘e-xam’. telecommunication engineering students in hafa are required to be able to write, assemble and run machine language programs. in another type of question, therefore, examinees were required to assemble their programme and submit the assembler listing file. this is another example showing how e-assessment can test skills that would have been impossible to examine with paper-and-pencil exams.
2.4 facilitating the process of in-classroom examinations
an additional requirement for the system was that it should shield both instructors and students from having to comprehend any complicated software engineering concepts. as far as students are concerned, the target was met by virtue of the fact that a plain web interface was used for delivering the tests. students were pleasantly surprised to notice that each one of them had a different set of numbers. this made them adopt an attitude such as “i will mind my own business and finish my questions in time”, rather than the more traditional “i will try to see if one of my colleagues sitting next to me can help me with the exam questions”. hence learning was promoted, and student and instructor psychology was boosted. in order to maximise the benefits for course instructors as well, the concept of domain specific languages was employed. instructors compose the skeleton examination questions and their model (correct) solutions in html format (figures 1, 2). they are not required to be familiar with (low-level) html coding [12]. they can use their favourite html editor to complete the task, as current editors have a simple user interface (similar to a word processor such as microsoft word). the html files are processed by the e-xaminer core and two sets of programs are produced: the examination agent and the marking agent (figure 1). the examination agent is an active page combining html and perlscript. the marking agent is a perl script which processes the answers submitted by students. the outputs are the marked tests and statistics of the class performance.
3 contribution to the virtual classroom
the fact that questions are parameterised has several positive implications for the virtual classroom. firstly, tests can become personalised (a different set of questions for each student) without compromising test fairness.
additionally, tests can be given more often than usual (the customary midterm exam may be replaced by weekly exams), since the effort required from the instructor is minimal. both of the above achievements are recommended practices against examination-time cheating and in favour of promoting learning [21], [22]. it has additionally been observed that the web interface makes the idea of frequent exams more agreeable than usual for students. marking papers is a fairer process, since, even at early stages, the system has caught grading errors made by humans. as mentioned earlier, electronic tests in particular have been accused of putting some students at a disadvantage [14]. the e-xaminer experience has shown that the e-test marks correlate strongly with both the oral mark awarded by the instructor and the mark attained in the paper-and-pencil semester final exam. furthermore, hafa, being a technically oriented institution, demands of its students (who, as officers of the hellenic air force, are going to handle electronic equipment worth millions of euros) that they be familiar and at ease with technology. accusations have been extended to state that “test-takers are unable to underline text, scratch out eliminated choices and work out math problems, all commonly-used strategies” [14]. current web technology allows not only underlining of on-screen texts, but also much more elaborate formatting, by incorporating a simple html editor (a practice used by many web-based email services). as far as scratching is concerned, we have provided special scratch-pad areas next to each question (implemented as html textareas). we have also encouraged our students to get better help by using electronic dictionaries and calculators, as well as the old paper notebook sitting next to their keyboard.
4 pilot application and statistics
the e-xaminer is being used in testing digital electronics, computer science, microprocessors and computer network courses taught in hafa since the beginning of the past academic year (2005-6). the caa tests were addressed to students of all disciplines taught in hafa (pilots, aircraft, telecommunications and civil engineers, as well as air-traffic controllers). during the first six months all student answers were marked by both the e-xaminer and the instructor. during this first period, automatically assigned marks deviated by up to 10% from the instructor marks. this deviation declined steadily as the system evolved and staff acquired experience in assigning better model answers. currently, only a limited number of tests, selected at random, are being manually graded, and the deviation between the two sets of marks is less than 1%. the grading resolution (that is, the minimum grade associated with a question) was set to 0.5% during recent e-xams. after the initial 6-month deployment period, the e-xaminer has been systematically catching human marking errors in up to 10% of student papers. instructors have very often exploited the fact that the system is automatic and have revised their model answers (along with their marking scheme) so as to better assess class performance.
4.1 comparing paper exams to electronic exams
in this subsection electronic exams delivered by the e-xaminer are compared to traditional paper-and-pencil exams. rather than answering the question 'which one is better?', we present our experience from the use of the e-xaminer during the current (2006-7) and the past (2005-6) academic years. each approach has its own merits that need to be considered by course instructors before making a choice. the aim of this section is to support such a choice. the advantages of paper-and-pencil exams include:
• they can be easily adapted to any subject.
• they may be used by digitally illiterate students.
• no technological infrastructure is necessary.
• no computer-related background skills are necessary.
their major disadvantages are:
• the grading process is usually tedious and time-consuming.
• it is difficult to embed multimedia in them (other than figures).
on the other hand, the advantages of electronic assessment are:
• tests may be generated automatically from a pool of questions and problems.
• grading is easy and automatic, hence tests may be delivered more frequently.
• it is easy to embed multimedia (such as designs, circuits, video and sound clips for courses such as signal processing, electronics and telecoms).
• students with bad, illegible handwriting can produce a well-formed document.
• results and statistics are generated automatically and immediately, so students obtain rapid feedback on their performance.
• counter-cheating methods may be employed.
• supplementary programs may be used (such as dictionaries, assemblers, compilers, matlab etc.).
• exams may be easily stored and retrieved, and results may be further processed with other computer programs such as excel and spss.
• finally, it should be noted that e-xams are environmentally friendly, in that little paper need ever be used.
the disadvantages of electronic assessment are [16]:
• it might be difficult to adapt it to some types of questions (such as math formulae, where special equation editors are needed for typing the formulae and complex software for interpreting them).
• it might be difficult to adapt it to some subjects.
• it requires additional equipment (pcs, lan, software etc.) and is hence more complicated to administer, as well as vulnerable to power or system failures.
an instructor must take all the above factors into account before making a decision.
4.2 student opinions
it is important to consider the opinions of users in evaluating any e-learning system [22], [23]. the authors of ref.
[22] define such an evaluation procedure based on hypothesis testing. the hypotheses that needed to be tested in the case of the e-xaminer were: (1) "e-xams are perceived as equally difficult to the corresponding paper ones", (2) "e-xams promote learning by making tests a more agreeable experience for students" and (3) "e-xams promote learning by helping students accept their test marks as a fair assessment of their performance". in order to investigate these hypotheses, three extra questions were added at the end of the most recent tests. the questions were:
1. do you consider the electronic test to be more difficult than what you would expect from a paper test?
2. do you believe that the automatic assessment you will get for this test will be fairer than the one you would have got if the test had been marked by your instructor?
3. do you prefer this test to a traditional paper one?
hafa students answered by an overwhelming majority (over 90%) that the electronic test was equally difficult and preferable to the traditional test, while they expected their automatically assigned marks to better reflect their performance. answers to these questions are still being collected, as more classes sit electronic exams, and will be presented in future publications once the required statistical significance is attained.
5 summary and conclusions
the implementation of parametric questions and counter-cheating measures of the 'e-xaminer', our caa system, was outlined. this implementation was shown to offer advantages to university course organisers, in that it facilitated their work and eliminated much of the tediousness involved in the grading process. the new system was also shown to promote learning by students, by making exam-taking a frequent, fair and agreeable procedure. some contributions were also made to theoretical issues concerning the electronic classroom and its acceptance in general.
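the hypothesis-testing approach of [22] can be illustrated with a one-sample test on the proportion of positive answers. this is a generic statistical sketch with made-up counts: neither the function nor the numbers come from the paper or from the real survey data.

```python
import math

def proportion_z_test(yes: int, n: int, p0: float = 0.5) -> tuple[float, float]:
    """one-sample z-test for a proportion against the null value p0.

    returns the z statistic and a one-sided p-value (normal approximation,
    adequate when n*p0 and n*(1-p0) are both at least 5).
    """
    p_hat = yes / n
    se = math.sqrt(p0 * (1 - p0) / n)
    z = (p_hat - p0) / se
    p_value = 0.5 * math.erfc(z / math.sqrt(2))  # P(Z >= z), one-sided
    return z, p_value

# illustrative counts only (not the actual survey data): 46 of 50 students
# preferred the e-xam to a paper test; the null hypothesis is "no preference".
z, p = proportion_z_test(46, 50)
```

a small p-value would reject the "no preference" null hypothesis, which is the kind of statistical significance the authors say they are waiting to accumulate.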
additionally, e-xaminer was shown to offer a number of advantages in the fight against examination-time cheating. statistics from the pilot application were given, where it was shown that e-xaminer is capable of effectively marking student answers. innovative types of exam questions were used, which could not have been set without an e-assessment platform. initial statistical results were presented on the acceptance of the system by students. these results are part of a study on the usefulness of this system in promoting learning. during the next academic year we plan to further develop the system in order to:
• include some additional types of questions/problems.
• support multimedia in both questions and answers. this is a clear advantage of caa over paper-and-pencil exams.
• include a timer and enforce automatic submission upon time-out.
• strengthen security and fight cheating attempts by adding an authentication module that will monitor users' logins and logouts [12].
references
[1] the univ. of iowa in cooperation with hbo systems inc., e-learning assessment summary report, available online at: www.hbosystems.com, 2004, retrieved on march 31, 2006.
[2] f. pantano-rokou, educational design for e-learning: models, meaning and impact on learning (in greek), open education, vol. 1, pp. 45-68, 2005.
[3] d. vergidis, a. lionarakis, a. lykourgiotis, b. makrakis and ch. matralis, open and distance learning, vol. 1, institution and operation, hellenic open university, patra (in greek; title of book translated by authors), 1998.
[4] e. burns, continuing education drives distance-learning enrollment, available online at: www.clickz.com/stats/sectors/education/article.php/3605321, retrieved on may 25, 2006.
[5] ch. fidas, ch. tranoris, v. kapsalis and n.
avouris, system design for synchronous support and monitoring in web-based educational systems (in greek), proceedings of the 3rd international conference on open and distance learning, propombos, athens, vol. a, pp. 577-585, 2005.
[6] claroline documentation, http://www.claroline.net/documentation.htm, retrieved on june 21, 2006.
[7] blackboard documentation, http://library.blackboard.com/docs/as/bb_academic_suite_brochure_single.pdf, retrieved on june 21, 2006.
[8] univ. of loughborough caa site, http://www.lboro.ac.uk/service/pd/caa/index.htm, retrieved on june 21, 2006.
[9] quia site, http://www.quia.com/company/quia-presentation.pdf, retrieved on june 21, 2006.
[10] test assessments project, http://www.scribestudio.com/home/inaction/flash/ss_in_action.jsp?cm=testsassessments_project, retrieved on june 21, 2006.
[11] web ct site, http://www.webct.com, retrieved on july 10, 2006.
[12] n. t. doukas and a. s. andreatos, implementation of a computer aided assessment system based on the domain specific language approach, wseas transactions on advances in engineering education, vol. 3(5), pp. 382-388, 2006.
[13] j. salpeter, 21st century skills: will our students be prepared?, http://www.techlearning.com, oct. 2003, retrieved on march 22, 2006.
[14] fairtest site, http://www.fairtest.org/facts/computer.htm, retrieved on june 23, 2006.
[15] a. d. styliadis, i. d. karamitsos and d. i. zachariou, personalized e-learning implementation: the gis case, international journal of computers, communications and control, vol. i, no. 1, pp. 59-67, 2006.
[16] a. andreatos and n. doukas, e-xaminer: electronic examination system, proceedings of the 3rd wseas / iasme international conference on engineering education, vouliagmeni, athens, greece, july 2006.
[17] a. andreatos, distance e-learning for the hellenic air force, proceedings of eden'03, rhodes, greece, pp. 428-433, 2003.
[18] m.
bush, alternative marking schemes for on-line multiple choice tests, 7th annual conference on the teaching of computing, belfast, cti computing, 1999.
[19] d. spinellis, notable design patterns for domain specific languages, journal of systems and software, 56 (1), pp. 91-99, feb. 2001.
[20] j. f. kurose and k. w. ross, computer networking: a top-down approach featuring the internet, 3rd ed., addison-wesley, 2005.
[21] a. angeletou et al., assessment techniques for e-learning process, proceedings of the 3rd international conference on open & distance learning, vol. b, pp. 47-54, patra, greece, 2005.
[22] d. spinellis, p. zaharias and a. vrechopoulos, coping with plagiarism and grading load: randomized programming assignments and reflective grading, computer applications in engineering education, to appear in 2007.
[23] r. guidorzi and m. l. giovannini, e-learning tools in higher education: users' opinions, proceedings of eden'03, rhodes, greece, pp. 201-206, 2003.
nikolaos doukas, antonios andreatos, dept. of aeronautical sciences, div. of computer engineering and informatics, hellenic air force academy, dekeleia, attica, tga-1010 greece, e-mail: nikos@doukas.net.gr, andreatos@hafa.gr
received: november 18, 2006
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. iv (2009), no. 2, pp.
178-184
optimization for date redistributed system with applications
mădălina văleanu, smaranda cosma, dan cosma, grigor moldovan, dana vasilescu
mădălina văleanu, university of medicine and pharmacy "iuliu hatieganu", medical informatics and biostatistics department, cluj-napoca, e-mail: mvaleanu@umfcluj.ro
smaranda cosma, babes-bolyai university, business department, faculty of business, cluj-napoca, e-mail: smaranda.cosma@tbs.ubbcluj.ro
dan cosma, university of medicine and pharmacy "iuliu hatieganu", department of pediatric surgery and orthopedics, cluj-napoca, e-mail: dcosma@umfcluj.ro
grigor moldovan, babes-bolyai university, computer systems department, cluj-napoca, e-mail: moldovan@cs.ubbcluj.ro
dana vasilescu, university of medicine and pharmacy "iuliu hatieganu", department of pediatric surgery and orthopedics, cluj-napoca, e-mail: dana.vasilescu@umfcluj.ro
abstract: in this paper we intend to define a strategy for managing databases with mobile structures, taking into account their redistribution in the nodes of a computer network. the minimal cost of the redistribution is highlighted and some applications for medical and business databases are presented.
keywords: distributed database, costs, medical databases, economical databases, wireless network
copyright © 2006-2009 by ccc publications
1 introduction
the paper presents a generalization, and an extension to mobile environments, of the problem expressed in paper [1]. it also presents some applications of the problem. let us consider the mobile databases (tables) bi, i = 1, . . ., n, distributed in r nodes of a network of computer stations with their own memories si, i = 1, . . ., r. hence, we have: b = {b1, b2, . . ., bn} and s = {s1, s2, . . ., sr}, where b is a distributed database. we identify the nodes, that is, the stations of the computer network under consideration, with the same symbols as the memory supports si in s.
the architectures of the fixed computer networks with nodes s = {s1, s2, . . ., sr} can differ, but at the horizontal level they are in general modeled by an associated graph. particular computer network architectures exist, such as the hypercube or generalizations of the hypercube [2] [3], which provide simple and efficient routing, but whose complexity in the number of nodes, together with their restrictions, makes them inefficient in the end. modeling a computer network by a graph will in general give a static network with fixed and well defined geographical locations. in our previous paper [1] we supposed that the subbases bi of the database b have a well defined location and that this location is kept fixed while a distributed application runs. at present, mobile databases, characterized by an allocation in permanent change, are known. consequently, dynamism characterizes all their aspects. they are searched for (selected), accessed and processed from a mobile environment made up of laptops, mobile phones etc. they become ever stronger elements of data processing. they are connected by means of wireless stations to special fixed points (nodes) belonging to a computer network. the traditional transaction model migrates towards a mobile transaction model. the following presents a strategy for processing this kind of databases. the databases bi, i = 1, . . ., n, and other soft resources (programs) shall be stored in fixed hosts, denoted fh and identified with si (fixed host computers), in a well established network s = {s1, s2, . . ., sr}. the mobile environment will inherit the properties of the distributed environment. the mobile environment, with its ever increasing storage space, will be able to take over the mobile databases for data processing, by duplicating procedures, and the results will return to a fixed host. the fixed hosts are those that permanently preserve the data subbases bi.
the databases bi could be kept in mobile hosts, but any update (modification) should end with their replication to a fixed host, marked with a new version number. in the fixed network it is necessary to update all duplicated subbases bi, using the most recent version numbers allotted, in order to maintain the consistency of the distributed database b. the update of the databases should occur immediately after the alteration of their version number. the update of duplicated data subbases bi can also be achieved with the help of a token ("jeton") function periodically passing through the nodes s of the network in consideration. between the mobile hosts, denoted mh, and the fixed hosts, denoted fh, belonging to a computer network, there is a fixed interface, called a mobile support station, denoted mss, or base station. the connection between a mobile support station (mss) and a mobile host (mh) is wireless. each mss allotted to a node si controls a cell of mobile hosts identified by {i1, i2, . . ., ik}. a mobile host can disconnect from one mss and possibly reconnect to another mss while a distributed application is running. the disconnection and connection of mobile stations occurs frequently. a disconnection establishes a new distribution of the subbases in the network nodes. figure 1 presents the general architecture of a mobile platform. in the design of a mobile database, in each node si we shall highlight within bi a dynamic, often modified component that is frequently accessed from other nodes of the network, denoted bi, and other components, denoted mi(s), associated to mobile units, quite often accessed locally by the mobile units but where the modification of data is less frequent.
figure 1: mobile platform
we have bi = (bi; mi(1), mi(2), . . ., mi(k)), where we used for the indices ip the notation i(p), p = 1, . . ., k.
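the version-number discipline just described, where a mobile update only becomes official once replicated to a fixed host with a newer version, can be sketched as follows. the class names and the accept/reject rule are illustrative assumptions, not the paper's protocol specification.

```python
# sketch of versioned replication: a mobile host (mh) updates a copy of a
# subbase, and the fixed host (fh) accepts a replica only if it carries a
# newer version number, keeping the distributed database consistent.

class FixedHost:
    def __init__(self):
        self.subbases = {}            # name -> (version, data)

    def replicate(self, name, version, data):
        """accept a replica only if it carries a newer version number."""
        current_version, _ = self.subbases.get(name, (0, None))
        if version > current_version:
            self.subbases[name] = (version, data)
            return True
        return False                  # stale replica, ignored

class MobileHost:
    def __init__(self, fh: FixedHost, name: str):
        self.fh, self.name = fh, name
        self.version, self.data = fh.subbases.get(name, (0, {}))

    def update(self, key, value):
        """every modification bumps the version and replicates back to the fh."""
        self.data[key] = value
        self.version += 1
        return self.fh.replicate(self.name, self.version, dict(self.data))
```

a disconnected mobile host that reconnects with an old version number is simply rejected, which is one simple way to realise the consistency requirement stated above.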
we can consider that we made a decomposition of the subbases bi by selection and/or projection operations, so that by further union operations we get the updated subbase bi. at a given moment, in the computer network stations we witness a certain distribution, respectively a grouping, of the databases bi, i = 1, . . ., n. in general, if n ≥ r, then more subbases bi will exist in a memory si. for reasons of simplification, in our study we will suppose that in each station si of the r stations we have d subbases, hence n = d·r. we also suppose that we have removed, by means of a conveniently selected strategy, the duplications of the subbases bi; therefore the bi are distinct. a distributed application, supposing programs running in the network under consideration, leads to the access, from the nodes si, of the subbases bj in a defined succession, until the needed result is obtained. we note the successively accessed data subbases (some of them accessed more often in a distributed application) in vectorial form, as follows: bl = (bm1, bm2, . . ., bms), with l = (m1, m2, . . ., ms), mk ∈ {1, 2, . . ., n}. remember that mk identifies the place in the succession of the accesses performed on the subbases in b. in general, in the case of an access from network node si to a subbase found in sj, a so-called penalty (for instance: time, cost) should also be considered, noted pij for all i, j = 1, . . ., r, composed of a fixed penalty pfij, established in a fixed computer network, and possibly of a penalty due to the specific manner of working with mobile databases, noted pmij. we obtain pij = pfij + pmij.
2 grouping data subbases on the network fixed nodes
let the distributed database be b = {b1, b2, . . ., bn}. the indices of the subbases build the set in = {1, . . ., n}. the reorganisation of these data is defined by a permutation of indices

$$\sigma = \begin{pmatrix} 1 & 2 & \ldots & k & \ldots & n \\ i_1 & i_2 & \ldots & i_k & \ldots & i_n \end{pmatrix}$$

where ik ∈ in, k = 1, . . ., n, are distinct. the permutation σ is also written as σ = (σ(1), σ(2), . . ., σ(n)). if we have two permutations σ and τ, their product στ is obtained by the composition of the two functions, so that στ = (σ(τ(1)), σ(τ(2)), . . ., σ(τ(n))). let us mark with supp(σ) the support of the permutation σ, i.e. the set of the elements i ∈ {1, 2, . . ., n} having the property σ(i) ≠ i. a permutation σ is called cyclic of length m, m ≥ 2, if elements i1, i2, . . ., im ∈ supp(σ) exist such that σ(i1) = i2, σ(i2) = i3, . . ., σ(im−1) = im, σ(im) = i1. it is known that any permutation different from the identity can be written as a product of distinct cycles [4]. let us consider that n = d·r, i.e. on any station si of the r-station computer network we have d data subbases. let us take the following reorganisation (distribution) of the n data subbases on the r workstations, successively:

$$b_{\sigma^{-1}(1)},\ b_{\sigma^{-1}(2)},\ \ldots,\ b_{\sigma^{-1}(d)} \quad \text{in } s_1$$
$$\ldots$$
$$b_{\sigma^{-1}(1+(i-1)d)},\ b_{\sigma^{-1}(2+(i-1)d)},\ \ldots,\ b_{\sigma^{-1}(id)} \quad \text{in } s_i$$
$$\ldots$$
$$b_{\sigma^{-1}(1+(r-1)d)},\ b_{\sigma^{-1}(2+(r-1)d)},\ \ldots,\ b_{\sigma^{-1}(rd)} \quad \text{in } s_r$$

note: σ is a bijective application, σ : {1, 2, . . ., n} → {1, 2, . . ., n}, where σ(k) = ik, respectively σ−1(ik) = k, k = 1, . . ., n. consider the sequence of successive accesses ld = (m1, m2, . . ., ms) of the subbases in b; after the reorganisation, the subbase bmk resides in the station with index rσ(mk). with these notations, we define the cost of a distributed application by the relationship

$$c(S, B, L_d, \sigma) = \sum_{k=1}^{s-1} \left( p_{r_{\sigma(m_k)},\, r_{\sigma(m_{k+1})}} + q_{r_{\sigma(m_k)}} \right)$$

where qrσ(mk) represents the cost of the activities in station srσ(mk). let us note with aij the number of times that (mk, mk+1) = (i, j), k = 1, . . ., s − 1, i, j ∈ {1, 2, . . ., n}, and with ci how many times mk = i, k = 1, . . ., s, i ∈ {1, 2, . . ., n}.
then

$$c(S, B, L_d, \sigma) = \sum_{i=1}^{n} \sum_{j=1}^{n} a_{ij}\, p_{r_{\sigma(i)},\, r_{\sigma(j)}} + \sum_{i=1}^{n} c_i\, q_{r_{\sigma(i)}} = \sum_{i=1}^{n} \sum_{j=1}^{n} a_{\sigma^{-1}(i),\, \sigma^{-1}(j)}\, p_{r_i,\, r_j} + \sum_{i=1}^{n} c_{\sigma^{-1}(i)}\, q_{r_i}$$

hence the cost of a distributed application will be

$$c(S, B, L_d, \sigma) = \sum_{i=1}^{n} \sum_{j=1}^{n} a_{\sigma^{-1}(i),\, \sigma^{-1}(j)}\, p^{*}_{ij} + \sum_{i=1}^{n} c_{\sigma^{-1}(i)}\, q^{*}_{i}$$

where p*ij = p_{r_i, r_j} and q*i = q_{r_i}. a way to determine these sums can be seen in [5].
notes:
1. in practice we can consider that pij is symmetrical, hence p*ij will also be symmetrical, i.e. p*ij = p*ji. the penalties pij can be determined, for instance, with the help of statistical data collected over repeated runs of the programs of the distributed application d.
2. if the permutation σ is decomposed into a product of cyclic permutations, the formula for the application cost can be simplified accordingly.
3 applications
3.1 medical databases
the medical field is one of the most important fields of social concern. it is defined by a high level of dynamism. nowadays the healthcare systems of all countries face the many challenges of the 21st century, which significantly influence the financial aspects of the organizations' activities. in these new conditions competition has increased and has pressed on the costs of acquiring and maintaining high-quality technology and capital outlay. today, the data needed for research are registered in medical documents, in written form, but not in a unique form, and the work necessary to retrieve and process such data is enormous. it is for this reason that general interest databases are required, for doctors, patients, researchers and health units. such databases are characterised by the fact that they include an enormous volume of data distributed among the nodes of a computer network. the medical field activities can be divided into primary level care and secondary level care activities, and the division can go even further.
primary care refers to family medicine, that is, to the first contact and consultation point for the patient. secondary care is the service provided by a specialist who, in general, does not have the first contact with the patient. usually, a doctor providing secondary care services treats patients previously consulted by a family doctor. to establish a correct diagnosis and a proper treatment it is necessary to handle the databases in question, which often means returning to their redistribution in the nodes of the computer network. a concrete example is given in paper [6]. in such an application it is equally important to find an answer to a query to the databases in as short a time as possible. a valuable component of a health care computer-based system is its capacity to work with information on patients stored in various locations: the national health authority, the social insurance system, providers of primary and secondary care services. in order to find solutions, a distributed approach should be taken into consideration. [7] [8] health care information, such as medical records, x-rays and lab test results, is more and more kept and processed by computers. that is why standards for exchanging such data among computers in an unambiguous manner are required. in this way, uniform health care-related information shall also be available. a standard represents a conceptual lexis common to all the stakeholders in the healthcare network. standards of this kind are already available for many subfields of medicine.
3.2 economical databases
the economic applications of distributed databases are diverse and numerous, and their importance is also considerable. it is enough to take as an example a system in the field of international trade involving the handling of very large volumes of data relative to the varied goods distributed among shop networks.
the management information system of a company becomes decisive for international corporations of the learning-company type that extend their activities more and more, using varied international market penetration strategies. e-commerce develops and covers larger and larger scopes. that is why it is necessary to establish standards related to marketed products in this field too. the queries put to distributed databases of increasing size should be made more time-efficient. it is certain that in the near future the optimisation of the costs (query types) of the databases used will have to be considered. marketing research identifies, collects, processes, analyses, interprets and communicates information relevant for a specific marketing situation in order to make a decision. all the data are organised in distributed databases in several locations. marketing research mainly aims at reducing risk and uncertainty in the conception and grounding of marketing-related decisions and at implementing and controlling the practical application of these decisions. [7]
4 conclusions
the fundamental problem, with respect to the distributed application d under consideration, consists in the determination of a permutation σ, in the set p of possible permutations of the elements {1, 2, . . ., n} (the indices of the data subbases in b), so that the cost of running the programs of the distributed application d is minimal, i.e. min{c(s, b, ld, σ); σ ∈ p}. the problem is obviously combinatorial; its solution is important when the distributed application d is used repeatedly. the problem is solved once and the advantage remains operational all along the use of the respective distributed application. the applications are multiple. we described here some applications in the medical and economic domains that can use distributed databases and where the problem of minimizing the cost (the time needed to transfer information from one point of the network to another) is very important.
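the cost function and the minimisation problem above can be made concrete with a small sketch. this is a naive exhaustive search, feasible only for tiny n (the problem is combinatorial, as noted above); the 0-based station numbering, the placement rule and all names are simplifying assumptions for illustration, not the authors' algorithm.

```python
from itertools import permutations

def station(position: int, d: int) -> int:
    """station holding the subbase placed at a given position (d subbases per station)."""
    return position // d

def application_cost(accesses, sigma, p, q, d):
    """cost c(S, B, L_d, sigma): penalties between consecutively accessed
    stations, plus the activity cost in the accessing station.

    accesses -- the sequence L_d = (m_1, ..., m_s) of subbase indices (0-based)
    sigma    -- sigma[k] = position of subbase k after reorganisation
    p[i][j]  -- penalty of an access from station i to station j
    q[i]     -- cost of the activities in station i
    """
    cost = 0.0
    for a, b in zip(accesses, accesses[1:]):
        si, sj = station(sigma[a], d), station(sigma[b], d)
        cost += p[si][sj] + q[si]
    return cost

def best_reorganisation(accesses, p, q, d, n):
    """exhaustively find min{c(S, B, L_d, sigma); sigma in P} -- O(n!) time."""
    return min(
        (application_cost(accesses, sigma, p, q, d), sigma)
        for sigma in permutations(range(n))
    )
```

for realistic n the factorial search space is intractable, which is why heuristic approaches such as the evolutionary tuning of [5] are of interest.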
bibliography
[1] g. moldovan, m. văleanu, the performance optimization for date redistributing system in computer network, international journal of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii, suppl. issue, pp. 116-118, 2008.
[2] g. moldovan, m. văleanu, redistributing databases in a computer network, analele univ. bucureşti, ser. math.-info., 56, 2006.
[3] g. moldovan, i. dzitac, sisteme distribuite: modele matematice, ed. univ. agora, 2006.
[4] l. aspinall, data base re-organisation algorithms, ibm, uksc-0029, 1972.
[5] a. gog, h. grebla, evolutionary tuning for distributed database performance, the 4th international symposium on parallel and distributed computing (ispdc), lille, france, ieee computer society, 2005, pp. 275-281.
[6] s. cosma, d. cosma, a. negrusa, m. văleanu, g. moldovan, d. vasilescu, implementation of the communication system for clubfoot, wseas transactions on communications, issn 1109-2742, issue 9, vol. 7, sept. 2008, pp. 932-941.
[7] d.e. vasilescu, d. cosma, m. văleanu, i. negreanu, d. vasilescu, the results of the early conservative orthopedic treatment in the congenital talipes equinovarus, applied medical informatics, vol. 15, pp. 34-43, 2004.
[8] d. cosma, s. cosma, m. văleanu, d.e. vasilescu, g. moldovan, web based guideline for clubfoot: patient orientated materials, journal of international business and economics, issn 1544-8037, 8-1, 2008.
int j comput commun, issn 1841-9836 vol. 7 (2012), no. 4 (november), pp. 785-790
clustering-based energy-efficient broadcast tree in wireless networks
j. yu, h. jiang, g. wang, q. guo
jiguo yu, honglu jiang, school of computer science, qufu normal university, rizhao, 276826, p.r. china, e-mail: jiguoyu@sina.com; jianghonglu88@163.com
guanghui wang, school of mathematics, shandong university, jinan, 250100, p.r.
china, e-mail: ghwang@sdu.edu.cn
qiang guo, key laboratory for computer networks of shandong province, shandong computer science center, jinan, 250014, p.r. china, e-mail: guoq@keylab.net
abstract: the characteristics of wireless networks present formidable challenges to the study of the broadcasting problem. a crucial issue in wireless networks is energy consumption, because of the nonlinear attenuation properties of radio signals. another crucial issue is the trade-off between reaching more nodes in a single hop by using higher power versus reaching fewer nodes in that single hop by using lower power. given a wireless network with a specified source node that broadcasts messages to all other nodes in the network, the minimum energy broadcast (meb) problem is np-hard. in this paper, we propose a hybrid approach cbeeb (clustering-based energy-efficient broadcast) for the meb problem based on clustering. theoretical analysis indicates the efficiency and effectiveness of cbeeb. simulation results show that cbeeb has better performance compared with the existing heuristic approaches.
keywords: wireless network, energy-efficient, broadcast, clustering
1 introduction
nodes in wireless networks are usually powered by batteries with limited capacity. therefore, energy efficiency is one of the most important design issues for wireless networks. the energy consumption in transmission can be further reduced by using one or more intermediate nodes instead of transmitting directly. broadcasting in wireless networks is different from that in wired networks, since all nodes that are within the transmission range of the sender can receive the transmission without any additional cost at the sender. this characteristic of wireless transmission is known as the wireless multicast advantage (wma) [1]. this allows us to seek a power-optimal range assignment for the energy-efficient broadcast problem.
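the wireless multicast advantage can be made concrete with a small cost sketch: a transmitting node pays only for its farthest child in the broadcast tree, and every nearer child hears the same transmission for free. the tree encoding and function names are illustrative assumptions, not the cbeeb implementation.

```python
import math

def tree_energy(positions, children, alpha=2.0):
    """total broadcast energy: sum over transmitting nodes u of
    max over children v of d(u, v)^alpha.

    positions -- node -> (x, y) coordinates
    children  -- node -> list of child nodes in the broadcast tree
    leaf nodes transmit nothing, so they contribute zero energy.
    """
    total = 0.0
    for u, kids in children.items():
        if not kids:
            continue                              # leaf: no transmission cost
        total += max(math.dist(positions[u], positions[v]) ** alpha
                     for v in kids)               # pay only for the farthest child
    return total
```

minimising this quantity over all broadcast trees rooted at the source is exactly the meb problem defined later in the paper; the sketch only evaluates a given tree.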
the minimum energy broadcast (meb) problem is to find a broadcast scheme with minimum energy consumption. the problem is also known as the minimum power broadcast (mpb) problem or the minimum energy consumption broadcast subgraph (mecbs) problem [2]. we use energy and power interchangeably throughout this paper. in this paper, we study the problem of broadcasting in wireless networks, where every node is equipped with omni-directional antennas. we propose a hybrid algorithm based on clustering and prove its correctness. theoretical analysis shows the effectiveness of our approach. we compare it with the broadcast incremental power (bip) [1] algorithm and the best heuristic approaches known in [2-4] by simulation.
copyright © 2006-2012 by ccc publications
2 related work
most previously developed models for broadcasting and multicasting problems were link-based models, which do not properly reflect the properties of the all-wireless network environment. to solve the mpb problem, wieselthier et al. first proposed the node-based approach (bip), which is more suitable for the wireless environment than the link-based algorithms [1]. it was subsequently shown in [5] that the bip algorithm has an approximation ratio between 13/3 and 12. das et al. gave an improvement procedure called r-shrink for the meb problem [6]. cagalj et al. [7] proposed another improvement procedure called embedded wireless multicast advantage (ewma). kang et al. [8] generalized the ewma into another heuristic called largest expanding sweep search (less). in [9], an ant-colony based approach for the problem was presented. an algorithm called cm using the ant colony optimization approach was also presented in [3]. furthermore, in [4] a simple simulated annealing algorithm with metropolis chains of dynamic length was presented. the algorithm is in fact a probabilistic version of the 1-shrink tree-improvement algorithm described in [9]. kang et al.
[10] gave a perturbation-based iterated local optimization method that uses less as the local search. in [11], montemanni et al. presented a simulated annealing approach based on r-shrink; they used a sweep procedure to improve the solution obtained through simulated annealing. al-shihabi et al. [12] developed a hybrid approach combining nested partitioning with lp relaxation and the r-shrink method. in [13], wolf et al. proposed an evolutionary local search, which uses a modified r-shrink as the local search and a random increase in the transmission power of some nodes as the mutation operator. recently, wu et al. [14] developed a generational genetic algorithm that uses a permutation encoding to represent a broadcast scheme. hashemi et al. presented a simulated annealing algorithm using a special node selection mechanism in its neighborhood structure [2]. in [15], singh et al. proposed a hybrid approach to the meb problem combining a genetic algorithm with a local search heuristic, which is a modified version of the r-shrink improvement procedure [6].
3 preliminaries and model
we consider static multi-hop ad hoc wireless networks with omni-directional transmitters, where each node can adjust its transmitting power based on the distance to the receiving node. in the most common power attenuation model, the received signal power varies as d^(−α), where d is the distance from the transmitter and α is an environment-dependent constant, typically between 2 and 5. therefore, the transmitter power required to support direct communication is proportional to d^α. to represent a wireless ad hoc network, let g = (v,e) be a directed graph, where v denotes the set of nodes and e denotes the set of edges, with a special node s ∈ v that broadcasts messages to all other nodes of v. the transmission energy required by a node in an arborescence is determined by the longest among its outgoing edges in that arborescence. leaf nodes do not relay messages to any other node.
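the cost rule just stated (a transmitting node pays for its longest outgoing edge, leaf nodes pay nothing) can be sketched as follows; the coordinates and tree below are a hypothetical instance, with α = 2 assumed:

```python
import math

def tree_energy(pos, children, alpha=2.0):
    """Total energy of a broadcast arborescence: each transmitting
    node pays for its longest outgoing edge (WMA); leaves pay nothing."""
    total = 0.0
    for u, kids in children.items():
        if kids:  # leaf nodes have no children and contribute zero
            total += max(math.dist(pos[u], pos[v]) for v in kids) ** alpha
    return total

# hypothetical 4-node arborescence rooted at 's'
pos = {"s": (0, 0), "a": (1, 0), "b": (0, 2), "c": (1, 2)}
children = {"s": ["a", "b"], "b": ["c"], "a": [], "c": []}
# s pays max(1, 2)^2 = 4, b pays 1^2 = 1, the leaves pay 0 -> total 5
```

note that reaching node a costs s nothing extra here, since the transmission that reaches b already covers a; this is exactly the wma saving that the meb objective exploits.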
therefore, the transmission energy required by leaf nodes is zero. the total transmission energy required for the broadcast is computed by adding up the energy required by each node in the arborescence.
definition 1. min-energy broadcast problem (meb): let g = (v,e) be a directed graph; find a broadcast tree t ⊆ e rooted at s with the minimum energy cost Σ_{u∈v} max_{(u,v)∈t} d(u,v)^α, where d(u,v) is the distance between the nodes u and v.
clustering-based energy-efficient broadcast tree in wireless networks
4 hybrid approach – cbeeb
we now propose a hybrid approach, cbeeb (clustering-based energy-efficient broadcast), to the meb problem, which consists of a clustering algorithm and the ibip (improved bip) algorithm. a special node s broadcasts messages to all other nodes of v, and each node except s has one unique identifier id. for convenience, the notation and terminology used in the rest of this paper are summarized in table 1.
definition 2. priority of nodes: for two nodes i, j ∈ v\{s}, i.prio > j.prio ⇔ d(i) > d(j), or d(i) = d(j) and id(i) > id(j).
table 1. notation and terminology
id(i): the unique identifier of node i
i.prio: the priority of node i
d(i): the degree of node i
n1(i): the set of one-hop neighbors of node i
i.ch: the clusterhead elected by node i
i.type: the type of node i (clusterhead or leaf node)
the clustering algorithm is as follows:
step 1. each node except s collects the information of all its one-hop neighbors n1(i). such information can be acquired by each node exchanging a message including its id and priority with its one-hop neighbors. then each node sorts the priorities of all nodes within its one-hop neighborhood.
step 2. a node i decides to become a clusterhead if either of the following criteria is satisfied: (1) node i has the highest priority in its one-hop neighborhood; (2) node i is a one-hop neighbor of some node b, and i has the highest priority in the one-hop neighborhood of b.
step 3.
all the clusterheads are added to the set d. assign to each clusterhead v ∈ d the transmission range r_v = max_{v′∈n1(v)} d(v,v′). then, all the cluster members are the leaf nodes of the broadcast tree. figure 1 gives an example of the clustering algorithm in a twenty-node network.
figure 1. an example of clustering in a twenty-node network. (figure 1a: a random network, n = 20; figure 1b: red nodes are the clusterheads elected by the clustering algorithm; figure 1c: the cluster members are connected to the clusterheads.)
we now give the ibip (improved bip) algorithm. the ibip algorithm runs on the clusterhead set d; the cluster members elected by the clustering algorithm constitute the leaf nodes of the broadcast tree.
step 1. initially, the tree consists of only the source node s. we begin by determining the node that s can reach with the minimum expenditure of power.
step 2. determine which new clusterhead in d can be added to the tree at minimum additional power. suppose node i is already in the tree and node j is not yet in the tree (i, j ∈ d). if p_{i,j} < r_i^α, then we add j to the tree, where p_{i,j} is the power required to support the transmission between nodes i and j, and r_i is the transmission range of node i assigned by the clustering algorithm. otherwise let p′_{i,j} = p_{i,j} − max{r_i^α, p(i)}, where p′_{i,j} represents the incremental power associated with adding j to the set of nodes to which node i is already transmitting. the pair {i,j} that yields the minimum value of p′_{i,j} is selected, and node i transmits at a power level sufficient to reach node j. thus, one new node is added to the tree.
step 3.
this process continues until all the clusterheads are included in the tree. the total power required to maintain this tree is the sum of the transmitted powers at each of the transmitting nodes. figure 2 gives an example of the ibip algorithm.
figure 2. an example of the ibip algorithm in a nine-clusterhead network with source node s. (figure 2a: step 1, s → 9; figure 2b: step 2, s → 6; figure 2c: step 3, 6 → 7; figure 2d: the final tree.)
5 algorithm analysis and simulation
theorem 3. the set of clusterheads elected by the clustering algorithm covers all network nodes.
proof: each node either has the highest priority among its one-hop neighbors or has a neighbor with the highest priority among its one-hop neighbors. by the definition of a dominating set, a node either becomes a dominator itself or elects as a dominator the neighbor with the highest priority among its one-hop neighbors. thus the set elected by the algorithm is a dominating set of the network. therefore, we conclude that the set of clusterheads elected by the clustering algorithm covers all network nodes. □
theorem 4. the cbeeb algorithm has time complexity o(m²), where m ≤ n/(d_min + 1) is the number of clusterheads elected by the clustering algorithm and d_min is the smallest node degree in the network.
proof: the clustering algorithm adopts a distributed clustering strategy; thus its time complexity for the entire network equals that of a single node, o(1). the ibip algorithm runs on the set of clusterheads, and the number of clusterheads is at most n/(d_min + 1). moreover, the ibip algorithm is based on prim’s algorithm, so its time complexity is o((n/(d_min + 1))²). therefore, the total time complexity of the cbeeb algorithm is o(1 + m²), that is, o(m²). □
theorem 5.
the overhead complexity of control messages of the cbeeb algorithm is o(n + m³), where m ≤ n/(d_min + 1) is the number of clusterheads elected by the clustering algorithm and d_min is the smallest node degree in the network.
proof: at the beginning of the clustering algorithm, each node broadcasts a message to collect the information of its one-hop neighbors; therefore, the overhead complexity of these control messages is o(n). the ibip algorithm runs on the set of clusterheads elected by the clustering algorithm and is based on prim’s algorithm. because the power p′_{i,j} needs to be updated at each step of the algorithm, the message complexity is o(m³) when a straightforward implementation is used [1]. therefore, the overhead complexity of control messages of the cbeeb algorithm is o(n + m³). □
in comparison, the time complexity of the bip algorithm is o(n²) and its message complexity is o(n³) [1]; since m is typically much smaller than n, cbeeb is clearly more efficient.
we now compare the performance of the cbeeb algorithm with six other broadcast algorithms, blimst [1], bip [1], cm [3], sa [4], esa [2] and ga [15], by simulation. the simulations are carried out in an ideal network generated over a 100 × 100 m² area without consideration of packet loss. the number of network nodes varies from 25 to 250 in increments of 25.
figure 3. the backbone topology construction process using cbeeb. (figure 3a: the original network topology with 150 nodes; figure 3b: the topology after running the clustering algorithm; figure 3c: the topology after running ibip.)
figure 4. energy cost comparison among the seven algorithms.
figure 3 illustrates the backbone topology construction process using cbeeb. in figure 4, we compare the energy cost of our algorithm with the six other algorithms. the simulation results show that our algorithm further improves the energy cost compared with the blimst and bip algorithms.
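the clustering rule (steps 1-2) and the greedy tree growth of ibip described above can be sketched together as follows. this is a compact illustration under assumed euclidean coordinates, not the authors' pseudocode: priorities follow definition 2, but the tree growth uses the plain bip incremental-power rule and omits the r_i^α shortcut of ibip step 2; the function names and the toy instance are hypothetical.

```python
import math

def cbeeb_sketch(pos, adj, source, alpha=2.0):
    """Clusterhead election by (degree, id) priority (definition 2),
    then a BIP-style greedy tree over the heads. Simplified sketch:
    the r_i^alpha shortcut of IBIP step 2 is omitted."""
    prio = {i: (len(adj[i]), i) for i in adj}       # (d(i), id(i))
    # a node joins the head set if it tops the closed one-hop
    # neighborhood of some node b (covers both criteria of step 2)
    heads = {max(adj[b] | {b}, key=prio.__getitem__)
             for b in adj if b != source}
    heads.add(source)                               # the source roots the tree
    power = {i: 0.0 for i in heads}                 # current tx power p(i)
    in_tree, parent = {source}, {}
    while in_tree != heads:
        # pick the cheapest incremental power p'_{i,j} = p_{i,j} - p(i)
        i, j, _ = min(((i, j, math.dist(pos[i], pos[j]) ** alpha - power[i])
                       for i in in_tree for j in heads - in_tree),
                      key=lambda t: t[2])
        power[i] = max(power[i], math.dist(pos[i], pos[j]) ** alpha)
        parent[j] = i
        in_tree.add(j)
    return heads, parent, sum(power.values())

# hypothetical unit-disk instance: neighbors within range 1.5
pos = {0: (0.0, 0.0), 1: (1.0, 0.0), 2: (2.0, 0.0), 3: (1.0, 1.0)}
adj = {i: {j for j in pos if j != i and math.dist(pos[i], pos[j]) <= 1.5}
       for i in pos}
heads, parent, cost = cbeeb_sketch(pos, adj, source=0)
```

in this instance node 3 dominates every closed neighborhood (degree 3, largest id), so the tree is grown only over {0, 3} and the cluster members hang off the heads as leaves, mirroring figures 1 and 3.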
cm, sa, esa and ga can also improve the energy cost over the bip algorithm. moreover, we can see that cbeeb performs better than the ga algorithm, especially when the number of nodes is large.
6 conclusions
in this paper, we propose a hybrid approach for the meb problem in wireless networks. theoretical and experimental results show its superiority over the existing approaches. as future work, we will develop fully distributed algorithms for the meb problem.
acknowledgments
this work was partially supported by the national natural science foundation of china under contracts (11101243, 60373012), the natural science foundation of shandong province under contracts (zr2009gm009, zr2009am013), the stpu of shandong province under contract (j10lg09) and the tkp of shandong province under contract (2009gg10001014).
bibliography
[1] j. e. wieselthier, g. d. nguyen, a. ephremides, on the construction of energy-efficient broadcast and multicast trees in wireless networks, in proc. of infocom 2000, 585-594, 2000.
[2] s. m. hashemi, m. rezapour, a. moradi, two new algorithms for the min-power broadcast problem in static ad hoc networks, applied mathematics and computation, vol. 190, pp. 1657-1668, 2007.
[3] a. k. das, r. j. marks, m. el-sharkawi, p. arabshahi, a. gray, a cluster-merge algorithm for solving the minimum power broadcast problem in large scale wireless networks, in proc. of the milcom conference, 13-16, 2003.
[4] a. k. das, r. j. marks, m. el-sharkawi, p. arabshahi, a. gray, the minimum power broadcast problem in wireless networks: a simulated annealing approach, in proc. of wcnc, 2057-2062, 2005.
[5] p. j. wan, g. calinescu, x. y. li, o. frieder, minimum-energy broadcast routing in static ad hoc wireless networks, in proc. of infocom, 1162-1171, 2001.
[6] a. k. das, r. j. marks, m. el-sharkawi, p. arabshahi, a. gray, r-shrink: a heuristic for improving minimum power broadcast trees in wireless networks, in proc. of globecom, 523-527, 2003.
[7] m. cagalj, j.
p. hubaux, c. enz, minimum-energy broadcast in all-wireless networks: np-completeness and distribution issues, in proc. of mobicom’02, 172-182, 2002.
[8] i. kang, r. poovendran, broadcast with heterogeneous node capability, in proc. of globecom, 4114-4119, 2004.
[9] a. k. das, r. j. marks, m. el-sharkawi, p. arabshahi, a. gray, the minimum power broadcast problem in wireless networks: an ant colony system approach, in proc. of the ieee cas workshop on wireless communications and networking, 5-6, 2002.
[10] i. kang, r. poovendran, iterated local optimization for minimum energy broadcast, in proc. of wiopt, 332-341, 2005.
[11] r. montemanni, l. m. gambardella, a. k. das, the minimum power broadcast problem in wireless networks: a simulated annealing approach, in proc. of wcnc, 2057-2062, 2005.
[12] s. al-shihabi, p. merz, s. wolf, nested partitioning for the minimum energy broadcast problem, in proc. of lion 2, lncs 5313, 1-11, 2008.
[13] s. wolf, p. merz, evolutionary local search for the minimum energy broadcast problem, in proc. of evocop’08, lncs 4972, 61-72, 2008.
[14] x. wu, x. wang, r. liu, solving the minimum power broadcast problem in wireless ad hoc networks using a genetic algorithm, in proc. of cnsr, 203-207, 2008.
[15] a. singh, w. n. bhukya, a hybrid genetic algorithm for the minimum energy broadcast problem in wireless ad hoc networks, applied soft computing, vol. 11, pp. 667-674, 2011.
international journal of computers, communications & control vol. i (2006), no. 3, pp. 41-59
formation control of mobile robots
dang binh nguyen, khac duc do
abstract: a constructive method is presented to design cooperative controllers that force a group of n mobile robots to achieve a particular formation in terms of shape and orientation while avoiding collisions between themselves. the control development is based on new local potential functions, which attain the minimum value when the desired formation is achieved and are equal to infinity when a collision occurs.
the proposed controller development is also extended to formation control of nonholonomic mobile robots.
keywords: formation control, mobile robot, local potential function, nonholonomic mobile robot.
1 introduction
over the last few years, formation control of multiple vehicles has received a lot of attention from the control community. applications of vehicle formation control include the coordination of multiple robots, unmanned air/ocean vehicles, satellites, aircraft and spacecraft [1]-[28]. for example, a cooperative mobile sensor network, where each mobile robot serves as a mobile sensor, is expected to outperform a single large vehicle with multiple sensors, or a collection of independent vehicles, when the objective is to climb the gradient of an environmental field. the single, heavily equipped vehicle may require considerable power to operate its sensor payload, lacks robustness to vehicle failure, and cannot adapt the configuration or resolution of the sensor array. an independent vehicle with a single sensor may need to perform costly maneuvers to effectively climb a gradient, for instance, wandering significantly to collect rich enough data, much like the "run and tumble" behavior of flagellated bacteria. in military missions, a group of autonomous vehicles is required to keep a specified formation for area coverage and reconnaissance. in automated highway systems, the throughput of the transportation network can be greatly increased if vehicles can form platoons at a desired velocity while keeping a specified distance between vehicles. research on formation control also helps people to better understand some biological social behaviors, such as swarming of insects and flocking of birds. in the literature, there have been roughly three approaches to formation control of multiple vehicles: leader-following, behavioral and virtual structure. each approach has its own advantages and disadvantages.
in the leader-following approach, some vehicles are considered as leaders, whilst the rest of the robots in the group act as followers [1], [2], [3], [4]. the leaders track predefined reference trajectories, and the followers track transformed versions of the states of their nearest neighbors according to given schemes. an advantage of the leader-following approach is that it is easy to understand and implement. in addition, the formation can still be maintained even if the leader is perturbed by some disturbances. however, a disadvantage is that there is no explicit feedback to the formation, that is, no explicit feedback from the followers to the leader. if a follower is perturbed, the formation cannot be maintained. furthermore, the leader is a single point of failure for the formation. in the behavioral approach [5]-[14], a few desired behaviors such as collision/obstacle avoidance and goal/target seeking are prescribed for each vehicle, and the formation control is calculated from a weighting of the relative importance of each behavior. the advantages of this approach are that it is natural to derive control strategies when vehicles have multiple competing objectives, and that explicit feedback is included through communication between neighbors. the disadvantages are that the group behavior cannot be explicitly defined, and that it is difficult to analyze the approach mathematically and guarantee group stability. in the virtual structure approach, the entire formation is treated as a single entity [15], [16], [17], [18]. when the structure moves, it traces out desired trajectories for each robot in the group to track.
copyright © 2006 by ccc publications
some similar ideas based on the perceptive reference frame, the virtual leader, and the formation reference point are given in [14], [17], [19].
the advantages of the virtual structure approach are that it is fairly easy to prescribe the coordinated behavior for the group, and that the formation can be maintained very well during manoeuvres, i.e. the virtual structure can evolve as a whole in a given direction with some given orientation and maintain a rigid geometric relationship among multiple vehicles. however, requiring the formation to act as a virtual structure limits the class of potential applications: when the formation shape is time-varying or needs to be frequently reconfigured, this approach may not be the optimal choice. the virtual structure and leader-following approaches require that the full state of the leader or virtual structure be communicated to each member of the formation. in contrast, the behavior-based approach is decentralized and may be implemented with significantly less communication. formation feedback has recently been introduced in the literature [18], [20], [21], [22]. in [18], a coordination architecture for spacecraft formation control is introduced that incorporates the leader-following, behavioral, and virtual structure approaches to the multi-robot coordination problem. this architecture can be extended to include formation feedback. in [20], a lyapunov formation function is used to define a formation error for a class of robots (double integrator dynamics) so that a constrained motion control problem of multiple systems is converted into a stabilization problem for one single system. the error feedback is incorporated into the virtual leader through parameterized trajectories. in terms of the information from the robots in the group used for feedback in the control design for each robot, there are two main approaches to solve the problem of motion planning/control of a group of mobile robots: centralization and decentralization. in the centralized approach, see for example [18], a single controller and collision-free trajectories are constructed in a workspace.
the centralized approach has the drawback of computational complexity but guarantees a complete solution. the decentralized approach, see for example [23], requires less computational effort and offers an easy way to scale the size of the robot group. this approach usually involves a combination of robot-based local potential fields [14], [24], [25]. the main problem with the decentralized approach is that it is extremely difficult, if not impossible, to predict and control the critical points. basically, the closed loop system under a controller designed by the decentralized approach has multiple equilibrium points. it is rather difficult to design a controller such that, for a group of many robots, all the equilibrium points except the desired one are unstable/saddle points. moreover, even when the formation control system is designed in a centralized manner, the tuning constants in several aforementioned papers (e.g. [26], [27], [28], [29]), which are crucial to guarantee that only the desired equilibrium points are asymptotically stable and that the other critical points are unstable, are extremely difficult to obtain for practical implementation. in most of the above papers, point-robots with simple (single or double integrator) dynamics (e.g. [14], [24], [29]) or fully actuated vehicles [19] (which can be converted to a double integrator dynamics via feedback linearization) were investigated. vehicles with nonholonomic constraints were also considered (e.g. [5]). however, the nonholonomic kinematics are transformed to a double integrator dynamics by controlling the hand position instead of the inertial position of the vehicles; consequently, the vehicle heading is not controlled. in addition, in the tracking control of single nonholonomic mobile robots (e.g. [30], [31], [32]), the tracking errors are often converted into special forms to deal with the nonholonomic constraints using several non-trivial coordinate transformations.
if these techniques are migrated to formation control of a group of nonholonomic mobile robots, it is extremely difficult to incorporate collision avoidance between the robots. the above problems motivate the contribution of this paper. in this paper, we propose a constructive method to design a cooperative formation control system for a group of N mobile robots. simple point-mass robots are first considered to clarify the design philosophy. the proposed technique is then extended to mobile robots with nonholonomic constraints (nonholonomic mobile robots). new local potential functions are constructed to design gradient-based cooperative controllers that achieve almost global asymptotic convergence of a group of mobile robots to a particular formation in terms of both shape and orientation, and guarantee no collisions between the robots. a formal proof of the results is given.
2 problem statement
we consider a group of N simple point-mass mobile robots, each of which has the following dynamics
q̇_i = u_i, i = 1, ..., N   (1)
where q_i ∈ R^n and u_i ∈ R^n are the state and control input of robot i. we assume that N > 1 and n > 1. the assumption that each robot is represented as a point is not as restrictive as it may seem, since various shapes can be mapped to single points through a series of transformations [26], [27], [28]. our task is to design the control input u_i for each robot i that forces the group of N robots to stabilize with respect to their group members in configurations that make a particular formation specified by a desired vector l(η) = [l_12^T(η), l_23^T(η), ..., l_{N−1,N}^T(η)]^T, where η ∈ R^m is the formation parameter vector specifying the formation change, while avoiding collisions between the robots. the parameter vector η is used to specify rotation, expansion and contraction of the formation such that, when η converges to its desired value η_f, the desired shape of the formation is achieved.
in addition, all the robots are required to align their velocity vectors to a desired bounded velocity u_d ∈ R^n and to move in the directions specified by the desired formation velocity vector. the control objective is formally stated as follows:
control objective: assume that at the initial time t_0 each robot starts at a different location, and that each robot has a different desired location, i.e. there exist strictly positive constants ε_1, ε_2 and ε_3 such that
‖q_i(t_0) − q_j(t_0)‖ ≥ ε_1, ‖l_ij(η)‖ ≥ ε_2, ‖∂l_ij(η)/∂η‖ ≤ ε_3, ∀ i, j ∈ {1, 2, ..., N}, ∀ η ∈ R^m.   (2)
design the control input u_i for each robot i, and an update law for the formation parameter vector η, such that each robot (almost) globally asymptotically approaches its desired location to form the desired formation, and such that the robots’ velocities converge to the desired (bounded) velocity u_d while avoiding collisions with all other robots in the group, i.e.
lim_{t→∞}(q_i(t) − q_j(t) − l_ij(η(t))) = 0, lim_{t→∞}(η(t) − η_f) = 0, lim_{t→∞}(u_i(t) − u_d) = 0, ‖q_i(t) − q_j(t)‖ > ε_4, ∀ i, j ∈ {1, 2, ..., N}, ∀ t ≥ t_0 ≥ 0   (3)
where ε_4 is a strictly positive constant and η_f is a vector of constants that determines the desired formation. the desired formation can be represented by a labeled directed graph ([29], [34]) as in the following definition.
definition 1. the formation graph g = {v, e, l} is a directed labeled graph consisting of:
- a set of vertices (nodes) v = {ϑ_1, ..., ϑ_N}, indexed by the mobile robots in the group,
- a set of edges e = {(ϑ_i, ϑ_j) ∈ v × v}, containing ordered pairs of vertices that represent inter-robot position constraints, and
- a set of labels l = {γ_ij | γ_ij = ‖q_i − q_j − l_ij‖², ∀ (ϑ_i, ϑ_j) ∈ e}, with l_ij = q_if − q_jf ∈ R^n, indexed by the edges in e.
indeed, when the control objective is achieved, the edge labels become ‖q_i − q_j − l_ij‖² = 0, ∀ (ϑ_i, ϑ_j) ∈ e, i.e. the relative displacement between robots i and j is l_ij.
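the formation-graph labels of definition 1 can be sketched directly from their defining formula; the square formation and edge set below are hypothetical:

```python
def edge_labels(q, q_f, edges):
    """Formation-graph labels of definition 1: for each edge (i, j),
    gamma_ij = ||q_i - q_j - l_ij||^2 with l_ij = q_if - q_jf.
    All labels vanish exactly when the formation is achieved."""
    def sub(a, b):
        return tuple(x - y for x, y in zip(a, b))
    labels = {}
    for i, j in edges:
        l_ij = sub(q_f[i], q_f[j])              # desired offset l_ij
        err = sub(sub(q[i], q[j]), l_ij)        # q_i - q_j - l_ij
        labels[(i, j)] = sum(e * e for e in err)
    return labels

# hypothetical square formation and edge set
q_f = {1: (0.0, 0.0), 2: (1.0, 0.0), 3: (1.0, 1.0), 4: (0.0, 1.0)}
edges = [(1, 2), (2, 3), (3, 4)]
shifted = {i: (x + 5.0, y - 2.0) for i, (x, y) in q_f.items()}
```

note that the labels depend only on relative positions, so any translated copy of the desired configuration (such as `shifted` above) also makes every label vanish; this is why the desired formation velocity u_d can move the whole group without disturbing the formation.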
3 control design
we consider the following local potential function
ϕ_i = γ_i + δ β_i   (4)
where δ is a positive tuning constant, and the functions γ_i and β_i are the goal and related collision avoidance functions for robot i, specified as follows:
- the goal function γ_i is essentially the sum of all distances from robot i to its adjacent group members N_i. a simple choice of this function is
γ_i = Σ_{j∈N_i} γ_ij,  γ_ij = ½ ‖q_i − q_j − l_ij‖².   (5)
- the related collision function β_i should be chosen such that it is equal to infinity whenever any robot comes in contact with robot i, i.e. a collision occurs, and attains its minimum value when robot i is at its desired location with respect to the other group members belonging to N_i, which are adjacent to robot i. this function is chosen as follows:
β_i = Σ_{j∈N_i} ( β_ij^k / β_ijl^{2k} + 1 / β_ij^k )   (6)
where k is a positive constant to be chosen later, and β_ij and β_ijl are the collision and desired collision functions chosen as
β_ij = ½ ‖q_i − q_j‖²,  β_ijl = ½ ‖l_ij‖².   (7)
it is noted from (7) that β_ij = β_ji and β_ijl = β_jil.
remark 1.
1. the above choice of the potential function ϕ_i given in (4), with its components specified in (5)-(6), has the following properties: 1) it attains its minimum value when robot i is at the desired location with respect to the other group members belonging to N_i, which are adjacent to robot i, i.e. q_i − q_j − l_ij = 0, j ∈ N_i; and 2) it is equal to infinity whenever one or more robots come in contact with robot i, i.e. when a collision occurs.
2. the potential function (4) is different from the ones proposed in [14] and [33] in the sense that the ones in [14] and [33] are centralized and do not put a penalty on the relative distance between the robots, i.e. they do not include the goal function γ_i.
therefore, the controllers developed in [14] and [33] do not guarantee that the formation converges to a specified configuration, but only to some configuration that locally minimizes the potential functions (the potential functions in [14] and [33] are nonconvex).
3. our potential function (4) is also different from the navigation functions proposed in [26] and [29] in the sense that our potential function is a sum of collision avoidance functions, while the navigation functions in [26] and [29] are products of collision avoidance functions. this feature makes our potential function "more decentralized". furthermore, our potential function is equal to infinity when a collision occurs, while those in [14], [26] and [29] are equal to a finite constant. however, those in [26] and [29] also cover obstacle and workspace boundary avoidance. although these issues are not included in this paper for clarity, considering them is possible and is the subject of future work.
4. our potential function does not have problems like local minima and non-reachable goals, as listed in [24].
to design the control input u_i, we differentiate both sides of (4) along the solutions of (1) to obtain
ϕ̇_i = Σ_{j∈N_i} [ω_ij^T(u_i − u_j) − ψ_ij^T η̇] = Σ_{j∈N_i} [ω_ij^T(u_i − u_d − (u_j − u_d)) − ψ_ij^T η̇] = Σ_{j∈N_i} ω_ij^T(u_i − u_d) − Σ_{j∈N_i} ω_ij^T(u_j − u_d) − Σ_{j∈N_i} ψ_ij^T η̇   (8)
where
ω_ij = q_i − q_j − l_ij + δk (1/β_ijl^{2k} − 1/β_ij^{2k}) β_ij^{k−1} (q_i − q_j),
ψ_ij = [(q_i − q_j − l_ij + (2δk β_ij^k / β_ijl^{2k+1}) l_ij)^T ∂l_ij/∂η]^T.   (9)
from (8), we simply choose the control u_i for robot i and the update law for η as follows:
u_i = −C Σ_{j∈N_i} ω_ij + u_d,  η̇ = −Γ(η − η_f)   (10)
where C ∈ R^{n×n} and Γ ∈ R^{m×m} are symmetric positive definite matrices. substituting (10) into (8) yields
ϕ̇_i = −(Σ_{j∈N_i} ω_ij)^T C (Σ_{j∈N_i} ω_ij) − Σ_{j∈N_i} ω_ij^T(u_j − u_d) + Σ_{j∈N_i} ψ_ij^T Γ(η − η_f).   (11)
substituting (10) into (1) results in the closed loop system
q̇_i = −C Σ_{j∈N_i} ω_ij + u_d, i = 1, ..., N.   (12)
since the desired formation is specified in terms of relative distances between the robots, we write the closed loop inter-robot dynamics from (12) as
q̇_ij = −C ( Σ_{a∈N_i} ω_ia − Σ_{b∈N_j} ω_jb ), (i, j) ∈ {1, ..., N}, i ≠ j   (13)
where q_ij = q_i − q_j. we now state the main result in the following theorem.
theorem 1. under the assumptions stated in the control objective, the control for each robot i given in (10), with an appropriate choice of the tuning constants δ and k, solves the control objective.
proof. see appendix.
4 simulations
we carry out a simulation example in two-dimensional space to illustrate the results. the number of robots is N = 4. the initial positions of the robots are chosen randomly in the circle of radius 0.5 centered at the origin. the design constants are chosen as C = diag(0.4, 0.4), k = 0.5, δ = 0.1. it is noted that this choice satisfies the conditions in the proof of theorem 1. we run two simulations, with u_d = [1, 0.2]^T (linear formation motion, meaning that each robot moves on a rectilinear line to form the desired formation) and u_d = [sin(0.5t), cos(0.5t)]^T (circular formation motion, meaning that each robot moves on a circle to form the desired formation). for clarity, we take the formation parameter η as a scalar to implement formation expansion. the desired formation is depicted in figure 1. these simulations are motivated by gradient climbing missions in which the mobile sensor network (each mobile robot serves as a mobile sensor) seeks out local maxima or minima in the environmental field. the network can adapt its configuration in response to the sensed environment in order to optimize its gradient climb.
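as a numerical illustration of the control design above, the gradient term (9) and the closed loop (12) can be sketched as below. this is a simplified sketch with the formation parameter η held fixed (no update law) and C = 0.4·I, δ = 0.1, k = 0.5 as in the simulations; the start positions, the square formation and the step size are hypothetical.

```python
import math

def omega(qi, qj, lij, delta=0.1, k=0.5):
    """Gradient term (9) of the potential: goal part plus barrier part
    (eta held fixed, so the psi term is not needed)."""
    dx, dy = qi[0] - qj[0], qi[1] - qj[1]
    b = 0.5 * (dx * dx + dy * dy)                 # beta_ij, eq. (7)
    bl = 0.5 * (lij[0] ** 2 + lij[1] ** 2)        # beta_ijl, eq. (7)
    c = delta * k * (bl ** (-2 * k) - b ** (-2 * k)) * b ** (k - 1)
    return (dx - lij[0] + c * dx, dy - lij[1] + c * dy)

def simulate(q, q_des, ud=(0.0, 0.0), gain=0.4, dt=0.01, steps=4000):
    """Euler integration of the closed loop (12): each robot follows
    u_i = -gain * sum_j omega_ij + u_d over an all-to-all graph."""
    q = dict(q)
    for _ in range(steps):
        u = {}
        for i in q:
            sx = sy = 0.0
            for j in q:
                if j != i:
                    lij = (q_des[i][0] - q_des[j][0],
                           q_des[i][1] - q_des[j][1])
                    ox, oy = omega(q[i], q[j], lij)
                    sx += ox
                    sy += oy
            u[i] = (-gain * sx + ud[0], -gain * sy + ud[1])
        q = {i: (q[i][0] + dt * u[i][0], q[i][1] + dt * u[i][1]) for i in q}
    return q

# hypothetical run: four robots start near the origin and must form a
# 2 x 2 square (only relative positions matter; u_d shifts the group)
q0 = {1: (0.3, 0.0), 2: (0.0, 0.4), 3: (-0.3, 0.0), 4: (0.0, -0.4)}
q_des = {1: (0.0, 0.0), 2: (2.0, 0.0), 3: (2.0, 2.0), 4: (0.0, 2.0)}
qf = simulate(q0, q_des)
```

since ω_ij vanishes exactly when q_i − q_j = l_ij (both the goal gradient and the barrier gradient are zero there), the desired formation is an equilibrium of the sketch, and the barrier part pushes robots apart at the start, just as observed in the reported simulations.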
for example, gradients in temperature fields (among others) can be estimated from the data collected by the mobile robots; these are of interest for enabling gradient climbing to locate and track features such as fronts and eddies. such gradients can be used to compute the desired reference velocity vector u_d in the simulations in this section. in the first 4.5 seconds (for the linear formation motion case) and 15 seconds (for the circular formation motion case), η is set to zero; it is then updated to η_f = 3 for the rest of the simulation time. the update gain is chosen as Γ = 2 (scalar).
figure 1: desired formation for simulation.
figures 2 and 3 plot the simulation results for the linear formation motion and circular formation motion cases, respectively. for clarity, we only plot the control u_1 = [u_x1, u_y1]^T and the distances from robot 1 to the other members of the group, i.e. ‖q_12‖, ‖q_13‖ and ‖q_14‖. it is seen from these figures that the desired formation shapes are nicely achieved and that there are no collisions between any robots; see the bottom right plots in figures 2 and 3, where the distances from robot 1 to the other members of the group are plotted. clearly, these distances are always larger than zero. it is also seen from figures 2 and 3 that at the beginning all the robots rapidly move away from each other to avoid collisions, since they start quite close to each other.
5 extension to formation control of nonholonomic mobile robots
control of single nonholonomic mobile robots has received considerable attention, and is complicated by the fact that such robots have fewer controls than outputs to be controlled; see for example [30], [31], [32] and the references therein. indeed, control of a group of nonholonomic mobile robots is yet more complicated due to the nonholonomic (non-integrable) constraints.
however, in this section we show that the control method developed in section 3 can be readily extended to force a group of n nonholonomic mobile robots of unicycle type to move in such a way that a desired formation is achieved. for clarity, we consider only the kinematic model of the nonholonomic mobile robots. designing the control system at the dynamic level, even without requiring the robot velocities to be measured, can be carried out using one more "backstepping" step [35] and our proposed exponential observer in [31]. figure 2: linear formation motion: simulation results (robot trajectories, control u1 = [ux1 uy1]t, and distances ‖q12‖, ‖q13‖, ‖q14‖ from robot 1 to the other robots). figure 3: circular formation motion: simulation results (robot trajectories, control u1, and distances from robot 1 to the other robots). consider the kinematic model of the unicycle mobile robot i, whose two driving wheels are actuated and whose third wheel is passive (see figure 4), given by ẋi = (ri/2)(cos(θi)ω1i + cos(θi)ω2i), ẏi = (ri/2)(sin(θi)ω1i + sin(θi)ω2i), θ̇i = (ri/(2bi))(ω1i − ω2i) (14) where (xi, yi) denote the coordinates of the middle point p0i between the left and right driving wheels, and θi denotes the heading of robot i, coordinated in the earth-fixed frame oxy, see figure 4; ω1i and ω2i denote the angular velocities of the wheels of robot i. moreover, ri and bi are defined in figure 4. the task now is to design the control inputs ω1i and ω2i to achieve the control objective stated in section 3. we require an additional assumption on the desired formation velocity vector ud, namely that limt→∞ ||ud(t)|| ≠ 0, i.e.
we do not consider the stabilization/regulation problem. for convenience, we convert the angular velocities of the wheels to the linear and angular velocities (vi and ri) of robot i by the following relationship: [vi ri]t = ((1/ri)[1 bi; 1 −bi])−1 [ω1i ω2i]t. (15) with (15), we can write (14) as ẋi = vi cos(θi), ẏi = vi sin(θi), θ̇i = ri. (16) figure 4: geometric description of a nonholonomic mobile robot (surge and sway axes, actuated wheels of radius ri, passive wheel, half-axle length bi, middle point p0i). indeed, the kinematic model (14) or (16) possesses the following nonholonomic constraint: ẋi sin(θi) − ẏi cos(θi) = 0. (17) moreover, we will consider the linear and angular velocities (vi and ri) of robot i as the control inputs. after these inputs are designed, ω1i and ω2i are calculated from (15). 5.1 control design the control design consists of two steps. at the first step, we consider the control vi and the yaw angle θi as a virtual control to steer the robot position (xi, yi) to its desired location. at the second step, the control ri will be designed to force the yaw angle θi to converge to its virtual control. step 1. define θei = θi − αθi (18) where αθi is a virtual control of θi. with (18), we can write (16) as q̇i = ui + λθei (19) where qi = [xi yi]t, ui = [cos(αθi) sin(αθi)]t vi, λθei = [(cos(θei) − 1)cos(αθi) − sin(θei)sin(αθi), sin(θei)cos(αθi) + (cos(θei) − 1)sin(αθi)]t vi. (20) it is seen that (19) is almost of the same form as (1). however, the problem is that the controls vi and αθi are not solvable directly from the control ui if ui is not designed properly. we therefore present briefly how ui is designed to tackle this problem. consider the following potential function (of the same form as (4)) ϕi = γi + δ βi (21) where δ, γi and βi are defined in section 3, see (6) and (7).
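the kinematic model (14), the wheel-to-body conversion (15) and the constraint (17) can be checked numerically. the sketch below uses a hypothetical wheel radius and half-axle length (numeric values for ri and bi are not given in the paper):

```python
import numpy as np

# Sketch of the unicycle kinematics (14)/(16) and the wheel-to-body
# velocity conversion (15); r_wheel and b (half axle length) are
# hypothetical values, not taken from the paper.
r_wheel, b = 0.05, 0.10

def wheels_from_body(v, yaw_rate):
    """Invert (15): body velocities (v_i, r_i) -> wheel speeds (w1, w2)."""
    M = (1.0 / r_wheel) * np.array([[1.0, b], [1.0, -b]])
    return M @ np.array([v, yaw_rate])

def body_from_wheels(w1, w2):
    """(15): wheel speeds -> body velocities (v_i, r_i)."""
    v = r_wheel * (w1 + w2) / 2.0
    yaw_rate = r_wheel * (w1 - w2) / (2.0 * b)
    return v, yaw_rate

# round trip between (14) and (16) coordinates is exact
w1, w2 = wheels_from_body(0.8, 0.3)
assert np.allclose(body_from_wheels(w1, w2), (0.8, 0.3))

# integrate (14) and monitor the nonholonomic constraint (17)
x, y, th = 0.0, 0.0, 0.3
dt = 1e-3
max_residual = 0.0
for _ in range(5000):
    w1, w2 = wheels_from_body(v=0.5, yaw_rate=0.4)
    xdot = r_wheel / 2.0 * (np.cos(th) * w1 + np.cos(th) * w2)
    ydot = r_wheel / 2.0 * (np.sin(th) * w1 + np.sin(th) * w2)
    thdot = r_wheel / (2.0 * b) * (w1 - w2)
    max_residual = max(max_residual, abs(xdot * np.sin(th) - ydot * np.cos(th)))
    x, y, th = x + dt * xdot, y + dt * ydot, th + dt * thdot

print(f"max constraint residual: {max_residual:.2e}")
```

the residual of (17) stays at floating-point noise level along the whole trajectory, confirming that the wheel-level model (14) respects the same nonholonomic constraint as (16).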
differentiating both sides of (21) along the solutions of (19) gives ϕ̇i = ∑j∈ni [ωtij(ui + λθei − u∗d − (uj + λθej − u∗d)) − ψtij η̇] (22) where ωij and ψij are defined in (9), and u∗d = (1 + ∑ni=1 ||∑j∈ni ωij||2)1/2 ud. it is noted that we use u∗d instead of ud in (22) to overcome the nonholonomic problem of the mobile robot under investigation. indeed, limt→∞ ∑j∈ni ωij(t) = 0 implies that limt→∞ u∗d(t) = ud. from (22), we choose the control ui and the update law for η as ui = −c||ud|| ∑j∈ni ωij + u∗d, η̇ = −γ(η − ηf) (23) where c and γ are diagonal positive definite matrices. again, ||ud|| is included in the control ui to overcome the nonholonomic problem. defining θd = arctan(udy/udx), then from the first equations of (23) and (20), we have cos(αθi)vi = −c1||ud|| ∑j∈ni ωxij + (1 + ∑ni=1 ||∑j∈ni ωij||2)1/2 ||ud|| cos(θd), sin(αθi)vi = −c2||ud|| ∑j∈ni ωyij + (1 + ∑ni=1 ||∑j∈ni ωij||2)1/2 ||ud|| sin(θd) (24) where ωxij and ωyij are defined by ωij = [ωxij ωyij]t, and c1 and c2 are defined by c = diag(c1, c2). we now need to solve (24) for vi and αθi. to do this, multiplying both sides of the first and second equations of (24) by cos(θd) and sin(θd), respectively, and then adding them together results in cos(αθi − θd)vi = −c1||ud|| ∑j∈ni ωxij cos(θd) − c2||ud|| ∑j∈ni ωyij sin(θd) + (1 + ∑ni=1 ||∑j∈ni ωij||2)1/2 ||ud||. (25) on the other hand, multiplying both sides of the first and second equations of (24) by sin(θd) and cos(θd), respectively, and then subtracting them from each other results in sin(αθi − θd)vi = c1||ud|| ∑j∈ni ωxij sin(θd) − c2||ud|| ∑j∈ni ωyij cos(θd). (26) from (25) and (26), we have αθi = θd + arctan[(c1 ∑j∈ni ωxij sin(θd) − c2 ∑j∈ni ωyij cos(θd)) / (−c1 ∑j∈ni ωxij cos(θd) − c2 ∑j∈ni ωyij sin(θd) + (1 + ∑ni=1 ||∑j∈ni ωij||2)1/2)]. (27) it is seen that (27) is well defined if the positive constants c1 and c2 are chosen such that c1 + c2 < 1.
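this well-definedness claim can be checked numerically: whenever c1 + c2 < 1, the denominator of the arctan argument stays strictly positive, because the term (1 + ∑i ||∑j∈ni ωij||2)1/2 dominates c1|ωxij| + c2|ωyij|. a sketch (the ω values here are random test data, not outputs of the actual controller):

```python
import numpy as np

# Numerical check of the well-definedness condition for the virtual
# heading (27): with c1 + c2 < 1 the denominator of the arctan argument
# stays strictly positive, so alpha_theta_i is always defined.
rng = np.random.default_rng(1)
c1, c2 = 0.4, 0.5            # c1 + c2 = 0.9 < 1

def alpha_theta(sum_w_i, all_sums, theta_d):
    """Virtual heading (27) for robot i.

    sum_w_i  : sum_{j in N_i} w_ij (2-vector) for this robot
    all_sums : the same sums for every robot, entering the sqrt term
    """
    wx, wy = sum_w_i
    s = np.sqrt(1.0 + sum(np.dot(w, w) for w in all_sums))
    num = c1 * wx * np.sin(theta_d) - c2 * wy * np.cos(theta_d)
    den = -c1 * wx * np.cos(theta_d) - c2 * wy * np.sin(theta_d) + s
    assert den > 0.0, "denominator must be positive when c1 + c2 < 1"
    return theta_d + np.arctan(num / den)

for _ in range(2000):
    all_sums = [rng.normal(scale=5.0, size=2) for _ in range(4)]
    theta_d = rng.uniform(-np.pi, np.pi)
    for w in all_sums:
        assert np.isfinite(alpha_theta(w, all_sums, theta_d))
print("all 8000 random samples gave a well-defined virtual heading")
```

the positivity follows since the sqrt term is at least ||∑j∈ni ωij|| while the subtracted terms are bounded by (c1 + c2)||∑j∈ni ωij||.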
(28) the control vi is found by solving (24) as vi = cos(αθi)||ud||(−c1 ∑j∈ni ωxij + (1 + ∑ni=1 ||∑j∈ni ωij||2)1/2 cos(θd)) + sin(αθi)||ud||(−c2 ∑j∈ni ωyij + (1 + ∑ni=1 ||∑j∈ni ωij||2)1/2 sin(θd)). (29) substituting (23) into (22) results in ϕ̇i = −||ud|| ∑j∈ni ωtij c ∑j∈ni ωij + ∑j∈ni [ωtij(λθei − (uj + λθej − u∗d)) + ψtij γ(η − ηf)]. (30) step 2. to design the control ri, we differentiate both sides of (18) along the solutions of the third equation of (16) and choose the control ri as ri = −diθei + α̇θi − ∑j∈ni ωtij λθei/θei (31) where di is a positive constant, and the term ∑j∈ni ωtij λθei/θei is included to cancel the cross term ∑j∈ni ωtij λθei in (30). this results in θ̇ei = −diθei − ∑j∈ni ωtij λθei/θei. (32) note that λθei/θei is well defined since sin(θei)/θei = ∫01 cos(θei λ)dλ and (cos(θei) − 1)/θei = −∫01 sin(θei λ)dλ are smooth functions. 5.2 stability analysis we consider the following function ϕtot = log(1 + ∑ni=1(ϕi + θ2ei)) + (1/2)(η − ηf)t γ(η − ηf) (33) whose derivative along the solutions of (30), (32) and the second equation of (23) satisfies ϕ̇tot = −(2||ud|| ∑ni=1 ∑j∈ni ωtij c ∑j∈ni ωij + ∑ni=1 di θ2ei)/(1 + ∑ni=1(ϕi + θ2ei)) + (∑ni=1 ∑j∈ni ψtij γ(η − ηf))/(1 + ∑ni=1(ϕi + θ2ei)) − (η − ηf)t γ(η − ηf) (34) where we have used −∑ni=1 ∑j∈ni ωtij(uj + λθej − u∗d) = ∑ni=1 ∑j∈ni ωtij(ui + λθei − u∗d) = −||ud|| ∑ni=1 ∑j∈ni ωtij c ∑j∈ni ωij + ∑ni=1 ∑j∈ni ωtij λθei. (35) the rest of the stability analysis can be carried out along the same lines as in the proof of theorem 1, since (34) is of the same form as (40) and limt→∞ ||ud(t)|| ≠ 0 by assumption. finally, note that limt→∞ θei(t) = 0 and limt→∞ ∑j∈ni ωij(t) = 0 imply that limt→∞(θi(t) − θd) = 0, i.e. the yaw angles of all robots converge to the desired angle θd = arctan(udy/udx). 5.3 simulation results we now perform a simulation to illustrate the results in the previous subsection.
the number of robots, initial conditions of the robot positions, control gains, desired formation velocity and desired formation shape are the same as in section 4. the robot heading angles are initialized randomly. for clarity, we only simulate the circular formation motion, and we do not include simulation results on the formation expansion as in section 4, i.e. the formation parameter η is set to zero for the whole simulation time. the other design constants are chosen as di = 5. simulation results are plotted in figure 5. again, it is seen that the robots are forced to move to nicely achieve the desired formation and no collisions between the robots occur. moreover, the yaw angle of all robots converges to the desired value θd, see the top-right plot in figure 5, where the yaw angle errors are plotted. a close look at figure 5 shows that the main difference between the simulation results in this subsection and those in section 4 is that the robots take a longer time to approach the desired formation. this is because we use the heading angles θi as virtual controls to steer the robots in order to overcome the nonholonomic constraint. 6 summary and conclusions this paper has contributed a method for constructing local potential functions, based on which gradient-like cooperative controllers were designed for a group of mobile robots, both with and without nonholonomic constraints, to perform certain formation missions. a formal analysis of the convergence and feasibility of the control solutions has also been provided. in the near future, it is of interest to combine the proposed control design method with control design schemes for single underactuated robots with second-order nonholonomic constraints, such as underactuated ships [37], to achieve a desired formation for a group of underactuated systems. 7 appendix: proof of theorem 1 we prove theorem 1 in two steps.
at the first step, we show that there are no collisions between any robots and that the solutions of the closed-loop system exist. at the second step, we prove that the equilibrium point of the inter-robot closed-loop dynamics (13), at which qij − lij = 0, is asymptotically stable. finally, we show that all other equilibria of (13) are either unstable or saddle points. step 1. proof of no collision and existence of solutions: we consider the following common potential function ϕ given by ϕ = ∑ni=1 ϕi. (36) figure 5: mobile robot circular formation motion: simulation results (robot trajectories, heading angle errors, controls v1 and r1, and distances ‖q12‖, ‖q13‖, ‖q14‖ from robot 1 to the other robots). the derivative of ϕ along the solutions of (11) is ϕ̇ = −∑ni=1 ∑j∈ni ωtij c ∑j∈ni ωij − ∑ni=1 ∑j∈ni ωtij(uj − ud) + ∑ni=1 ∑j∈ni ψtij γ(η − ηf). (37) since lij = −lji and ωij = −ωji, we have ∑ni=1 ∑j∈ni ωtij(uj − ud) = −∑ni=1 ∑j∈ni ωtij(ui − ud). (38) substituting (38) into (37) gives ϕ̇ = −2 ∑ni=1 ∑j∈ni ωtij c ∑j∈ni ωij + ∑ni=1 ∑j∈ni ψtij γ(η − ηf). (39) we now consider the following total function ϕtot = log(1 + ϕ) + 0.5||η − ηf||2 whose derivative along the solutions of (39) and the second equation of (10) satisfies ϕ̇tot = −(2/(1 + ϕ)) ∑ni=1 ∑j∈ni ωtij c ∑j∈ni ωij + (1/(1 + ϕ)) ∑ni=1 ∑j∈ni ψtij γ(η − ηf) − (η − ηf)t γ(η − ηf) (40) which implies that ϕ̇tot ≤ −(2/(1 + ϕ)) ∑ni=1 ∑j∈ni ωtij c ∑j∈ni ωij + (λmax(γ)/(4ε(1 + ϕ)2)) ‖∑ni=1 ∑j∈ni ψtij‖2 − (λmin(γ) − ε λmax(γ))||η − ηf||2 (41) where ε is a positive constant, and λmin(γ) and λmax(γ) denote the minimum and maximum eigenvalues of γ, respectively.
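the cancellation step (38) rests only on the antisymmetry ωij = −ωji; a quick numerical check with random antisymmetric interaction terms and arbitrary controls (test data only, not the actual closed-loop signals):

```python
import numpy as np

# Numerical check of identity (38): with w_ij = -w_ji (which follows from
# l_ij = -l_ji), sum_i sum_{j in N_i} w_ij^T (u_j - u_d) equals
# -sum_i sum_{j in N_i} w_ij^T (u_i - u_d) for an all-to-all graph.
rng = np.random.default_rng(2)
n = 5
u = rng.normal(size=(n, 2))                  # arbitrary controls u_i
u_d = rng.normal(size=2)

w = rng.normal(size=(n, n, 2))
w = w - np.transpose(w, (1, 0, 2))           # enforce w_ij = -w_ji
for i in range(n):
    w[i, i] = 0.0

lhs = sum(w[i, j] @ (u[j] - u_d) for i in range(n) for j in range(n) if j != i)
rhs = -sum(w[i, j] @ (u[i] - u_d) for i in range(n) for j in range(n) if j != i)
print(f"lhs = {lhs:.6f}, rhs = {rhs:.6f}")
assert abs(lhs - rhs) < 1e-10
```

the identity follows by swapping the summation indices i and j and using ωji = −ωij, which is exactly what the check exercises.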
from (9) and the definition of the function ϕ, it can be readily shown that there exists a positive constant ωmax such that (1/(1 + ϕ)2) ‖∑ni=1 ∑j∈ni ψtij‖2 ≤ ωmax. (42) with (42) in mind, picking ε = λmin(γ)/λmax(γ) we can write (41) as ϕ̇tot ≤ (λ2max(γ)/(4λmin(γ))) ωmax ≜ ϖmax. (43) integrating both sides of (43) results in ϕtot(t) ≤ ϕtot(t0) + ϖmax(t − t0), (44) where ϕtot(t) and ϕtot(t0) are (from the definition of ϕtot) ϕtot(t) = log[1 + ∑ni=1(γi(t) + δ ∑j∈ni(βkij(t)/β2kijl + 1/βkij(t)))] + (1/2)||η(t) − ηf||2, ϕtot(t0) = log[1 + ∑ni=1(γi(t0) + δ ∑j∈ni(βkij(t0)/β2kijl + 1/βkij(t0)))] + (1/2)||η(t0) − ηf||2. (45) the right-hand side of (44) cannot escape to infinity in finite time, since βijl > 0 and βij(t0) > 0 (see the definitions of βijl and βij given in (7)). therefore the left-hand side of (44) cannot escape to infinity for any finite t ∈ [t0, ∞). this implies that βij(t) cannot be zero for all t ∈ [t0, ∞), i.e. no collisions can occur for all t ∈ [t0, ∞). on the other hand, it follows from the second equation of (10) that ||η(t) − ηf|| ≤ ||η(t0) − ηf|| e−λmin(γ)(t−t0) (46) which means that the desired formation shape is achieved exponentially. using (42) and (46), we can write (41) as ϕ̇tot ≤ λmax(γ) √ωmax ||η(t0) − ηf|| e−λmin(γ)(t−t0). (47) integrating both sides of (47) from t0 to t results in ϕtot(t) ≤ ϕtot(t0) + λmax(γ) √ωmax ||η(t0) − ηf||/λmin(γ). (48) it is seen that the right-hand side of (48) is bounded. therefore the left-hand side of (48) must also be bounded. this implies that βij(t) must be larger than a strictly positive constant for all t ∈ [t0, ∞), which in turn means that there exists a strictly positive constant ε4 such that the last inequality of (3) holds.
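the exponential bound (46) follows directly from the update law η̇ = −γ(η − ηf); a numeric sketch with illustrative values of γ and ηf (not the ones used in section 4):

```python
import numpy as np

# Numeric check of the exponential bound (46): integrating the update law
# eta' = -Gamma (eta - eta_f) and verifying
# ||eta(t) - eta_f|| <= ||eta(t0) - eta_f|| * exp(-lambda_min(Gamma) * t).
# Gamma and eta values here are illustrative, not from the paper.
Gamma = np.diag([2.0, 3.0])
eta_f = np.array([3.0, 1.0])
eta = np.array([0.0, 0.0])
e0 = np.linalg.norm(eta - eta_f)
lam_min = np.min(np.diag(Gamma))

dt, ok = 1e-4, True
for k in range(1, 50_001):
    eta = eta + dt * (-Gamma @ (eta - eta_f))
    t = k * dt
    ok &= np.linalg.norm(eta - eta_f) <= e0 * np.exp(-lam_min * t) + 1e-9
print("bound (46) holds at every step:", bool(ok))
```

the forward-euler factor (1 − dt·λ)^k is bounded above by e^(−λt), so the discrete trajectory satisfies the continuous-time bound at every step.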
to prove that the solutions of the closed-loop system (12) exist, we consider the function w = 0.5 ∑ni=1 ||qi||2 whose derivative along the solutions of (12), after some simple manipulation, satisfies ẇ ≤ ρ1(1 + 1/min(βij))w + ρ2, where ρ1 and ρ2 are some positive constants. this implies that the solutions of (12) exist, since βij(t) is larger than a strictly positive constant for all t ∈ [t0, ∞). furthermore, applying barbalat’s lemma found in [36] to (41) gives limt→∞ (1/(1 + ϕ(t))) ∑ni=1 ∑j∈ni ωtij(t) c ∑j∈ni ωij(t) = 0 (49) which implies that either {limt→∞ ∑j∈ni ωij(t) = 0 and limt→∞ ϕ(t) = χ1} or {limt→∞ ∑j∈ni ωij(t) = χ2 and limt→∞ ϕ(t) = ∞} (50) where χ1 and χ2 are some constants. from the definitions of ωij and ϕ, the second limit set in (50) cannot be true. therefore, the first limit set in (50) implies that limt→∞ ∑j∈ni ωij(t) = 0. step 2. behavior near equilibrium points. at the steady state, the equilibrium points are found by solving the following equations ∑j∈ni ωij = ∑j∈ni (qij − lij + δk(1/β2kijl − 1/β2kij)βk−1ij qij) = 0, i = 1, ..., n. (51) it is directly verified that q̄ = l̄, where q̄ and l̄ are the stack vectors of qij and lij, respectively, i.e. q̄ = [qt12, qt13, ..., qtn−1,n]t and l̄ = [lt12, lt13, ..., ltn−1,n]t, is one root of (51). in addition, there are other roots, denoted by q̄c = [qt12c, qt13c, ..., qtn−1,nc]t, of (51) different from l̄, satisfying ∑j∈ni ωij|q̄=q̄c = ∑j∈ni (qijc − lij + δk(1/β2kijl − 1/β2kijc)βk−1ijc qijc) = 0, i = 1, ..., n (52) where βijc = 0.5||qic − qjc||2. in the following, we will show that the equilibrium point q̄ = l̄ is asymptotically stable, and that the equilibrium point(s) q̄ = q̄c is (are) unstable or saddle points. we now write the closed-loop system of the inter-robot dynamics (13) as ˙̄q = −c̄ f(q̄, l̄).
(53) where c̄ = diag(c, ..., c) (e diagonal blocks), with e the number of edges of the formation graph, and f(q̄, l̄) = [∑a∈n1 ωt1a − ∑b∈n2 ωt2b, ∑a∈n1 ωt1a − ∑b∈n3 ωt3b, ..., ∑a∈ni ωtia − ∑b∈nj ωtjb, ..., ∑a∈nn−1 ωtn−1,a − ∑b∈nn ωtnb]t. (54) since (52) holds for all i = 1, ..., n, at the steady state we have ∑a∈ni ωia − ∑b∈nj ωjb = 0, ∀(i, j) ∈ {1, ..., n}, i ≠ j. therefore the equilibrium points q̄ = l̄ and q̄ = q̄c are also equilibrium points of (53). the general gradient of f(q̄, l̄) with respect to q̄ is the block matrix ∂f(q̄, l̄)/∂q̄ = [∂ξij/∂qcd], whose block rows and block columns are indexed by the edges (i, j) and (c, d), with ξij = ∑a∈ni ωia − ∑b∈nj ωjb, (i, j) ∈ {1, ..., n}, i ≠ j. (55) it can be checked that ∂ξij/∂qij = nIn×n + 2δk(1/β2kijl − 1/β2kij)βk−1ij In×n + 2δk((k − 1)(1/β2kijl − 1/β2kij)βk−2ij + 2k/βk+2ij) qij qtij ≜ hij, and ∂ξij/∂qcd = sδk(1/β2kcdl − 1/β2kcd)βk−1cd In×n + sδk((k − 1)(1/β2kcdl − 1/β2kcd)βk−2cd + 2k/βk+2cd) qcd qtcd, (56) where (c, d) ∈ {1, ..., n}, (c, d) ≠ (i, j), c ≠ d, and s = 1 or s = −1 depending on the values of c, d, i and j. however, we do not need to specify the sign of s for our next task. we now investigate the properties of the equilibrium points q̄ = l̄ and q̄ = q̄c based on the general gradient ∂f(q̄, l̄)/∂q̄ evaluated at those points. step 2.1. proof of q̄ = l̄ being the asymptotically stable equilibrium point: at the equilibrium point q̄ = l̄, we have ∂ξij/∂qij|q̄=l̄ = nIn×n + (4δk2/βk+2ijl) lij ltij, ∂ξij/∂qcd|q̄=l̄ = s(2δk2/βk+2cdl) lcd ltcd, (57) where βcdl = 0.5||lcd||2. with (57), for any ξ ∈ rne we have ξt (∂f(q̄, l̄)/∂q̄)|q̄=l̄ ξ ≥ (n − 4δk2ne max(l2ija)/min(βk+2ijl)) ξtξ, (i, j) ∈ {1, ..., n}, i ≠ j (58) where lija is the ath element of lij.
therefore, for any given constant k, if we choose the tuning constant δ such that n − 4δk2ne max(l2ija)/min(βk+2ijl) > 0, i.e. δ < n min(βk+2ijl)/(4k2ne max(l2ija)), (i, j) ∈ {1, ..., n}, i ≠ j, (59) then the matrix ∂f(q̄, l̄)/∂q̄|q̄=l̄ is positive definite, which in turn implies that the equilibrium point q̄ = l̄ is asymptotically stable. step 2.2. proof of q̄ = q̄c being the unstable/saddle equilibrium point(s): the idea is to consider the block matrices on the main diagonal of the matrix ∂f(q̄, l̄)/∂q̄|q̄=q̄c and show that there exists at least one block matrix whose determinant is negative. define hijc = ∂ξij/∂qij|q̄=q̄c and let φa and φb be the ath and bth elements of qijc, (a, b) ∈ {1, ..., n}, a ≠ b. we form the matrices habijc from the matrix hijc as follows: habijc = [h11 h12; h21 h22], h11 = n + 2δkπijcβk−1ijc + 2δk((k − 1)πijcβk−2ijc + 2k/βk+2ijc)φ2a, h12 = h21 = 2δk((k − 1)πijcβk−2ijc + 2k/βk+2ijc)φaφb, h22 = n + 2δkπijcβk−1ijc + 2δk((k − 1)πijcβk−2ijc + 2k/βk+2ijc)φ2b, (60) where πijc = 1/β2kijl − 1/β2kijc. the determinant of habijc is given by det(habijc) = (n + 2δkπijcβk−1ijc) ∆abijc (61) where ∆abijc = n + 2δkπijcβk−1ijc + 2δk((k − 1)πijcβk−2ijc + 2k/βk+2ijc)(φ2a + φ2b). (62) let us consider the sum ∑n−1a=1 ∑nb=a+1 ∆abijc = n(n − 1)n + 2δk(n − 1)(2(k − 1) + n)βk−1ijc/β2kijl + 2δk(n − 1)(2(k + 1) − n)/βk+1ijc. (63) since n > 1, picking k > n/2 − 1 ensures that ∑n−1a=1 ∑nb=a+1 ∆abijc > 0. therefore, there exists at least one pair (a, b) ∈ {1, ..., n}, denoted by (a∗, b∗), such that ∆a∗b∗ijc > 0. now, for all (i, j) ∈ {1, ..., n}, i ≠ j, let us consider the sum ∑n−1i=1 ∑nj=i+1 (det(ha∗b∗ijc)/∆a∗b∗ijc) βijc = ∑n−1i=1 ∑nj=i+1 (nβijc + 2δkπijcβkijc).
(64) on the other hand, multiplying both sides of f(q̄c, l̄) = 0 with q̄tc results in q̄tc f(q̄c, l̄) = 0, which is expanded to ∑n−1i=1 ∑nj=i+1 (nqtijc(qijc − lij) + 2δknπijcβkijc) = 0. (65) substituting (65) into (64) results in ∑n−1i=1 ∑nj=i+1 (det(ha∗b∗ijc)/∆a∗b∗ijc) βijc = ∑n−1i=1 ∑nj=i+1 (n − 2)βijc + ∑n−1i=1 ∑nj=i+1 qtijc lij. (66) the term ∑n−1i=1 ∑nj=i+1 qtijc lij is strictly negative, since at the point where qij = lij (the point f in figure 6) all attractive and repulsive forces are equal to zero, while at the point where qij = qijc (the point c in figure 6) the sum of attractive and repulsive forces is equal to zero (see section 2 for a discussion of a simple case). therefore the point qij = 0 (the point o in figure 6) must be located between the points qij = lij and qij = qijc, see figure 6. furthermore, if we write (65) as 2 ∑n−1i=1 ∑nj=i+1 (βijc + δk(βkijc/β2kijl − 1/βkijc)) = ∑n−1i=1 ∑nj=i+1 qtijc lij (67) figure 6: illustration of the location of the critical points: f is the point qij = lij where all attractive and repulsive forces are zero, c is the point qij = qijc where the sum of attractive and repulsive forces is zero, and o is the origin qij = 0. we can see that decreasing δ results in a decrease in βijc, since βijl is a bounded constant and the right-hand side of (67) is negative. therefore, choosing a sufficiently small δ ensures that the right-hand side of (64) is strictly negative, since βijc = 0.5||qijc||2. that is, ∑n−1i=1 ∑nj=i+1 (det(ha∗b∗ijc)/∆a∗b∗ijc) βijc < 0 (68) which implies that there exists at least one pair (i, j) ∈ {1, ..., n}, denoted by (i∗, j∗), such that det(ha∗b∗i∗j∗c) < 0. (69) this inequality implies that at least one eigenvalue of the matrix ∂f(q̄, l̄)/∂q̄|q̄=q̄c is negative. this in turn guarantees that q̄c is an unstable/saddle equilibrium point of (53). proof of theorem 1 is completed. references [1] p. k. c.
wang, navigation strategies for multiple autonomous mobile robots moving in formation, j. robot. syst., vol. 8, no. 2, pp. 177-195, 1991. [2] p. k. c. wang and f. y. hadaegh, coordination and control of multiple microspacecraft moving in formation, j. astronautical sci., vol. 44, no. 3, pp. 315-355, 1996. [3] j. p. desai, j. ostrowski, and v. kumar, controlling formations of multiple mobile robots, proc. ieee int. conf. robotics and automation, leuven, belgium, pp. 2864-2869, 1998. [4] m. mesbahi and f. y. hadaegh, formation flying control of multiple spacecraft via graphs, matrix inequalities, and switching, aiaa j. guidance, control, dynam., vol. 24, no. 2, pp. 369-377, 2000. [5] r. t. jonathan, r. w. beard and b. j. young, a decentralized approach to formation maneuvers, ieee trans. robot. and automat., vol. 19, pp. 933-941, 2003. [6] t. d. barfoot and c. m. clark, motion planning for formations of mobile robots, robot. auton. syst., vol. 46, pp. 65-78, 2004. [7] h. yamaguchi, adaptive formation control for distributed autonomous mobile robot groups, proc. ieee int. conf. robotics and automation, albuquerque, nm, pp. 2300-2305, 1997. [8] t. balch and r. c. arkin, behavior-based formation control for multirobot teams, ieee trans. robot. automat., vol. 14, pp. 926-939, 1998. [9] m. schneider-fontan and m. j. mataric, territorial multirobot task division, ieee trans. robot. automat., vol. 14, pp. 815-822, 1998. [10] q. chen and j. y. s. luh, coordination and control of a group of small mobile robots, proc. ieee int. conf. robotics and automation, pp. 2315-2320, 1994. [11] m. veloso, p. stone, and k. han, the cmunited-97 robotic soccer team: perception and multiagent control, robot. auton. syst., vol. 29, pp. 133-143, 1999. [12] l. e. parker, alliance: an architecture for fault-tolerant multirobot cooperation, ieee trans. robot. automat., vol. 14, pp. 220-240, 1998. [13] k. sugihara and i.
suzuki, distributed algorithms for formation of geometric patterns with many mobile robots, j. robot. syst., vol. 13, no. 3, pp. 127-139, 1996. [14] n. e. leonard and e. fiorelli, virtual leaders, artificial potentials and coordinated control of groups, proc. ieee conf. decision and control, orlando, fl, pp. 2968-2973, 2001. [15] w. kang, n. xi, and a. sparks, formation control of autonomous agents in 3d workspace, proc. ieee int. conf. robotics and automation, san francisco, ca, pp. 1755-1760, 2000. [16] m. a. lewis and k.-h. tan, high precision formation control of mobile robots using virtual structures, auton. robots, vol. 4, pp. 387-403, 1997. [17] w. kang and h.-h. yeh, coordinated attitude control of multisatellite systems, int. j. robust nonlinear control, vol. 12, pp. 185-205, 2002. [18] r. w. beard, j. lawton, and f. y. hadaegh, a coordination architecture for formation control, ieee trans. control syst. technol., vol. 9, pp. 777-790, 2002. [19] r. skjetne, s. moi and t. i. fossen, nonlinear formation control of marine craft, proc. ieee conf. on decision and control, las vegas, nv, pp. 1699-1704, 2002. [20] p. ogren, m. egerstedt and x. hu, a control lyapunov function approach to multiagent coordination, ieee trans. robot. autom., vol. 18, pp. 847-851. [21] w. ren and r. w. beard, formation feedback control for multiple spacecraft via virtual structures, iee proceedings-control theory application, vol. 151, pp. 357-368, 2004. [22] e. w. justh and p. s. krishnaprasad, equilibria and steering laws for planar formations, syst. contr. letters, vol. 52, pp. 25-38, 2004. [23] d. m. stipanovica, g. inalhana, r. teo and c. j. tomlina, decentralized overlapping control of a formation of unmanned aerial vehicles, automatica, vol. 40, pp. 1285-1296, 2004. [24] s. s. ge and y. j. cui, new potential functions for mobile robot path planning, ieee trans. robot. automat., vol. 16, pp. 615-620, 2000. [25] s. s. ge and y. j.
cui, dynamic motion planning for mobile robots using potential field method, auton. robots, vol. 13, pp. 207-222, 2000. [26] e. rimon and d. e. koditschek, exact robot navigation using artificial potential functions, ieee trans. robot. and automat., vol. 8, no. 5, pp. 501-518, 1992. [27] e. rimon and d. e. koditschek, robot navigation functions on manifolds with boundary, advances in applied mathematics, vol. 11, pp. 412-442, 1990. [28] h. g. tanner, s. g. loizou and k. j. kyriakopoulos, nonholonomic navigation and control of multiple mobile robot manipulators, ieee trans. robot. automat., vol. 19, no. 1, pp. 53-64, 2003. [29] h. g. tanner and a. kumar, towards decentralization of multi-robot navigation functions, ieee int. conf. robotics and automation, barcelona, spain, pp. 4143-4148, 2005. [30] w. e. dixon, d. m. dawson, e. zergeroglu and a. behal, nonlinear control of wheeled mobile robots, springer, london, 2001. [31] k. d. do, z. p. jiang and j. pan, a global output-feedback controller for simultaneous tracking and stabilization of mobile robots, ieee trans. robot. automat., vol. 20, pp. 589-584, 2004. [32] t. fukao, h. nakagawa, and n. adachi, adaptive tracking control of nonholonomic mobile robot, ieee trans. robot. automat., vol. 16, pp. 609-615, 2000. [33] h. g. tanner, a. jadbabaie and g. j. pappas, stable flocking of mobile agents, part i: fixed topology, proc. ieee conf. decision and control, hawaii, pp. 2010-2015, 2003. [34] c. godsil and g. royle, algebraic graph theory, new york: springer graduate texts in mathematics, 207, 2001. [35] m. krstic, i. kanellakopoulos and p. v. kokotovic, nonlinear and adaptive control design, new york: wiley, 1995. [36] h. khalil, nonlinear systems, prentice hall, 2002. [37] k. d. do and j. pan, global path-tracking of underactuated ships with non-zero off-diagonal terms, automatica, vol. 41, pp. 87-95, 2005.
dang binh nguyen, thai nguyen university of technology, rector’s office, 3-2 street, thai nguyen city, vietnam, e-mail: nguyen.d.binh.tnut@gmail.com. khac duc do, school of mechanical engineering, the university of western australia, 35 stirling highway, crawley, wa 6009, australia, e-mail: duc@mech.uwa.edu.au. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 4, pp. 395-411. mccls: certificateless signature scheme for emergency mobile wireless cyber-physical systems. zhong xu, xue liu, guoqing zhang, wenbo he. abstract: a mobile ad hoc network is a self-configurable and self-organizing wireless network of mobile devices without fixed infrastructure support, which makes it a good candidate as the underlying communication network for cyber-physical systems in emergency conditions such as earthquakes, floods, and battlefields. in these scenarios, efficient communication schemes with security support are especially desired. two cryptographic approaches, public key cryptography and identity-based cryptography, face the costly and complex key management problem and the “key escrow" problem, respectively, in real-life deployment. recently, certificateless public key cryptography (cl-pkc) was introduced to address these problems in previous approaches. however, the efficiency of the schemes based on cl-pkc is not high and can be improved further. in this paper, we present an improved certificateless signature scheme (mccls) based on bilinear pairings. first, we theoretically compare the efficiency of mccls with that of existing certificateless signature schemes (cls). second, an empirical study is conducted to compare the traditional aodv with the mccls scheme based on aodv (mcdv) in their efficiency and effectiveness against the two most common attacks (i.e. the redirection attack and the rushing attack).
results from the theoretical analysis show that the new mccls scheme is more efficient than existing cls solutions, and results from the empirical studies show that mcdv is able to resist the two common attacks without causing substantial degradation of the network performance. keywords: certificateless signature, manets, cyber-physical systems, security. 1 introduction a salient feature of cyber-physical systems (cps) is that they integrate computing, monitoring, and communication capabilities, and constantly interact with the physical environment. as a result, cyber-physical systems must be dependable, safe, secure and efficient [16]. many emergency applications proposed for cps, such as those for earthquakes, floods and battlefields [10], will be implemented on networked environments where computing devices are connected through wireless links. for many applications, such as military applications, fixed infrastructure may not be available in the environment or may even be destroyed [9]. it is important to solve the connectivity problems with self-configurable and self-organizing characteristics. a possible solution for the lack of communication means is the deployment of mobile ad hoc networks (manets). while manets provide great flexibility for establishing communications, they are particularly prone to the security threats of eavesdropping, interception and routing attacks. some of these problems may be solved or mitigated with the use of cryptographic schemes [7]. in the recent literature, many papers make specific proposals on how to use public key infrastructure (pki) [27, 23, 4, 15] and identity-based public key cryptography (id-pkc) [20, 13, 8, 25] cryptographic techniques to secure manets. the traditional pki signature scheme uses a centralized certificate authority to issue a digital certificate that binds a user to the corresponding public key. the requirement of a certificate authority inevitably leads to complex certificate management problems in practice.
copyright © 2006-2008 by ccc publications. id-pkc, which was introduced by shamir [20], was developed from traditional pki to simplify the certificate management process. in an id-pkc based scheme, a user’s public key is derived directly from certain aspects of his identity, such as an email address, which is assumed to be publicly known. the private key is generated by a trusted third party, the private key generator (pkg). however, a new inherent problem is brought in by this approach, namely the “key escrow" problem, since the private key of the user is known to the pkg. as a result, the pkg is able to impersonate any user of its choice, or to decrypt messages. in order to solve the costly and complex key management problems in pki and the “key escrow" problem in id-pkc, respectively, al-riyami and paterson [1] proposed the first certificateless public key cryptography (cl-pkc) scheme. in a certificateless signature (cls) scheme, the key generation center (kgc) only provides the user with a partial private key, which is related to the user’s identity and the master private key known only by the kgc. the user then generates the remaining part of the private key and the corresponding public key. as a result, the kgc does not know the user’s private key, because part of it is generated by the user itself, thereby solving the “key escrow" problem of id-pkc based schemes. however, cls schemes are usually computationally intensive, and hence they are not readily applicable in practical applications. in this paper, we present the mccls scheme, a new cls scheme for mobile wireless cyber-physical systems. compared with existing cls schemes, the mccls scheme only requires one pairing operation in the verification phase, and none in the signing phase.
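the key-generation flow described above can be sketched schematically. the toy code below uses a plain discrete-log group with tiny illustrative parameters instead of bilinear pairings, so it is not the mccls construction itself (and is nowhere near cryptographically secure); it only illustrates how the kgc-issued partial private key and the user-chosen secret combine so that the kgc never learns the full private key:

```python
import hashlib

# Schematic of the certificateless key-generation flow (NOT the actual
# MCCLS algorithms): a toy Schnorr-style group stands in for the
# pairing-based setting. All parameters are illustrative.
p, q, g = 1019, 509, 4          # small prime-order subgroup: ord(g) = q

def h1(identity: str) -> int:
    """Hash an identity string to a group exponent (schematic H1)."""
    return int.from_bytes(hashlib.sha256(identity.encode()).digest(), "big") % q

# --- KGC side: master key pair; s is known only to the KGC ---
s = 123                          # master secret key
P0 = pow(g, s, p)                # published master public key

def partial_private_key(identity: str) -> int:
    """KGC issues d_ID = s * H1(ID) mod q, tied to identity and master key."""
    return (s * h1(identity)) % q

# --- user side: the user picks a secret the KGC never sees ---
x = 77                           # user's secret value
PK = pow(g, x, p)                # user's public key, published by the user
d_id = partial_private_key("alice@example.org")
full_private_key = (d_id, x)     # KGC knows d_id but not x: no key escrow

# anyone can check the partial key against the identity and P0,
# without any certificate: g^{d_ID} == P0^{H1(ID)}
assert pow(g, d_id, p) == pow(P0, h1("alice@example.org"), p)
print("partial key verifies against identity and master public key")
```

in a real cl-pkc scheme the partial key is a pairing-based value and verification uses the bilinear map; the structural point illustrated here, that no certificate is needed to bind the identity to the master public key, is the same.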
Since the pairing operation is the most time-consuming computation in pairing-based cryptosystems, our MCCLS scheme has less computation overhead and therefore is more efficient than the schemes proposed previously in [1, 12, 14, 26]. We also provide a detailed security proof for the MCCLS scheme based on the computational Diffie-Hellman problem (CDHP) [6]. An empirical study is then conducted, using the QualNet simulation software [19], to compare MCCLS-based AODV (MCDV) with traditional AODV in terms of efficiency and effectiveness against the two most common attacks, the redirection attack and the rushing attack. Results show that our scheme is efficient in terms of computation overhead and that it can resist the redirection attack [18] and the rushing attack [11]. The remainder of the paper is organized as follows. Section 2 provides a brief description of related work. Section 3 introduces the preliminaries and the background on the security model and the attack model. Section 4 presents our efficient MCCLS scheme. Section 5 analyzes the security of the MCCLS scheme in detail. Section 6 evaluates the performance of the MCCLS scheme under the redirection attack and the rushing attack. Finally, Section 7 concludes the paper with a summary and directions for future work.

2 Related Work

Cyber-physical systems (CPS) are physical and engineered systems whose operations are integrated, monitored, and controlled by a computational core [17]. CPS integrate communication and computation with the physical process [2]. Since cyber-physical systems constantly interact with the physical environment, they must be dependable, safe, secure and efficient [16]. CPS is a new and active research area. The position papers published at the NSF Workshop on Cyber-Physical Systems [16] present a good overview of the different aspects of CPS research. Though security is an important research issue for CPS, little work has been done [3] so far on the security of CPS.
Since many emergency applications proposed for CPS will be implemented on mobile ad hoc networks (MANETs), it is natural to ask whether security schemes proposed for MANETs are practical for CPS. To overcome the security problems that MANETs face due to their infrastructure-less nature, new methods are needed; one of them is a lightweight and efficient key management scheme. Recently, in order to solve the key management problem in public key cryptography and the "key escrow" problem in identity-based cryptography, Al-Riyami and Paterson [1] proposed the first certificateless signature (CLS) scheme, but failed to provide a security proof. Later, Huang et al. [12] found that this CLS scheme was insecure against a Type I forger, and proposed a modified CLS scheme with security proved under the random oracle model [5]. However, that scheme requires more pairing operations than the original scheme proposed in [1]. In [14], Li et al. therefore proposed another CLS scheme, but with the formal security analysis omitted. Another shortcoming of their scheme is that the verification algorithm requires four quite expensive pairing operations. Zhang et al. [26] presented a CLS scheme with a formal security analysis, but it still needs four pairing operations in the verification phase. Following that, Yap et al. [22] proposed a new CLS scheme which requires no pairing operation in the signing phase but two pairing operations in the verification phase. Since pairing operations are costly and time-consuming, schemes that use more of them are difficult to apply in emergency cyber-physical systems, because CPS must constantly interact with the physical environment under stringent timing requirements.
In this paper, we present the MCCLS scheme, which is more efficient and hence a good alternative for use in cyber-physical systems. A good security protocol must be resilient against security attacks. In the following, we briefly introduce the two most commonly studied attacks; later in this paper, we prove that the proposed MCCLS scheme is resilient against both of them.

The redirection attack [18] is one of the many possible attacks in MANETs. In this attack, a malicious node sends a forged route reply (RREP) packet to a source node, altering control message fields with falsified values. When a source node receives multiple RREPs, it compares the destination sequence numbers contained in the RREP packets, regards the largest one as the most recent routing information, and selects the route through which that RREP packet was sent. If the attacker sends the source node an RREP with a destination sequence number higher than that of the real destination node, the data traffic will be directed toward the attacker, which then drops all data packets it receives instead of forwarding them to the next node on the routing path. Consequently, the source and destination nodes lose communication with each other.

The rushing attack [11] usually targets reactive routing protocols. Every node in the network forwards only the first route discovery packet it receives and drops the rest. Malicious nodes can "rush" route request packets toward the destination. As a result, the nodes that receive these "rushed" request packets forward them and discard other route requests that arrive later. The resulting routes then include the malicious nodes, placing the attacker in an advantageous position.

3 Preliminaries

In this section, we present some mathematical background on the bilinear pairing, which is commonly used in CLS schemes to realize signature and verification [1, 12, 14].
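The two routing attacks introduced above both exploit simple local rules of AODV-style route discovery. The following toy sketch is our own illustration, not the paper's code; all field names, identifiers and numbers are hypothetical.

```python
# Redirection attack: AODV treats the RREP with the highest destination
# sequence number as the freshest route, so an inflated forged value wins.
def select_route(rreps):
    """Pick the reply with the largest destination sequence number."""
    return max(rreps, key=lambda rep: rep["dest_seq"])

honest = {"via": "D", "dest_seq": 42}        # reply from the real destination
forged = {"via": "M", "dest_seq": 10_000}    # attacker inflates the counter
hijacked = select_route([honest, forged])["via"]   # traffic now flows via "M"

# Rushing attack: each node forwards only the first copy of a route request
# it sees, so a request "rushed" ahead suppresses the legitimate one.
seen = set()

def forward_first(rreq_id, via):
    """Forward a route request only if this is the first copy seen."""
    if rreq_id in seen:
        return None          # duplicate RREQs are silently dropped
    seen.add(rreq_id)
    return via               # this neighbour ends up on the route

first = forward_first("rreq-7", via="M")   # rushed copy arrives first
late = forward_first("rreq-7", via="B")    # honest copy is discarded
```

In both cases the malicious node ends up on the routing path and can then drop all data packets, which is exactly the behaviour measured in Section 6.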
We define two cyclic groups G1 and G2 of the same prime order p, where G1 is an additive group and G2 is a multiplicative group. Let e be a computable bilinear map e : G1 × G1 → G2 satisfying the following conditions:

1. Bilinearity: for any P, Q, R ∈ G1, we have e(P + Q, R) = e(P, R)e(Q, R). For a, b ∈ Z*p and P, Q ∈ G1, we have e(aP, bQ) = e(P, Q)^{ab}, and in particular e(aP, bP) = e(P, abP) = e(abP, P).
2. Non-degeneracy: there exist P, Q ∈ G1 such that e(P, Q) ≠ 1.
3. Computability: there is an efficient algorithm to compute e(P, Q) for all P, Q ∈ G1.

The map e can be derived from either the Weil or the Tate pairing on an elliptic curve over a finite field. An efficiently computable bilinear map e provides an algorithm for solving the decision Diffie-Hellman problem (DDHP) [6]: given (P, aP, bP, cP) ∈ G1 with a, b, c ∈ Z*p, decide whether c ≡ ab (mod p), which can be done by checking e(aP, bP) = e(P, cP). Thus, with a bilinear pairing, the decision Diffie-Hellman (DDH) problem is easy, while the computational Diffie-Hellman (CDH) problem [22] remains hard: for a, b ∈ Z*p, given (P, aP, bP), computing abP is infeasible.

3.1 Certificateless Scheme

A certificateless signature (CLS) scheme usually consists of five polynomial-time algorithms [1]:

• Setup. The KGC runs a probabilistic algorithm to initialize the system. It receives a security parameter k and returns a randomly chosen master key and a list of public parameters param.
• Extract Partial Private Key. The KGC takes the master key and an identity ID ∈ {0,1}* as inputs, and outputs a partial private key d_ID.
• Generate Key Pair. The user takes the list of public parameters param as input, and outputs a private key s_ID and a public key P_ID.
• CL-Sign. The user takes the public parameters param, the full private key (d_ID, s_ID), and a message M, and produces a signature σ on M.
• CL-Verify.
Anyone may take {param, ID, P_ID}, a message M and a signature σ as inputs, and output true if and only if σ is a valid signature on M, or the symbol ⊥ to indicate failure.

Note that once the user has received the public parameters, such as the public key of the KGC, the user chooses a secret value to generate his key pair, consisting of the user's private key and the user's public key. Thus the user's full private key is composed of the partial private key generated by the KGC and the private key generated by the user himself. Neither the KGC nor the user can generate the full private key alone, thereby solving the "key escrow" problem.

3.2 Adversarial Model

As defined in [1, 22], there are two types of adversaries, Type I and Type II, with different capabilities. In CLS, a Type I adversary A_I acts as a third party who tries to impersonate a user. It is not allowed to know the KGC's master private key. However, A_I can replace the public key P_ID with a value of its choice, owing to the fact that the public key is generated by the user. This means the adversary is able to fool a verifier into accepting a signature produced under the adversary's public key. A Type II adversary A_II represents a malicious KGC who knows the master private key; that is, A_II can compute partial private keys by itself. But A_II does not know the user's private key s_ID and cannot replace the user's public key P_ID.

Definition 1. A CLS scheme is secure against existential forgery on adaptive chosen message and ID attacks by an adversary A, of Type I or Type II, if no polynomial-time algorithm has a non-negligible advantage against a challenger C in the following game [1]:

1. The challenger C takes a security parameter k and runs the Setup algorithm. C gives A the system parameters param. If A is of Type I, C keeps the master private key to itself; otherwise, C gives the master private key to A.
2.
A can request C to answer the following types of queries:

• Partial Key Extraction (Type I adversary only). C returns A's partial private key d_ID, obtained by running the Extract Partial Private Key algorithm.
• Secret Value and Public Key Extraction. C returns A's private key s_ID and the associated public key P_ID, obtained by running the Extract Partial Private Key and Generate Key Pair algorithms. In the case of a Type I adversary, C returns ⊥ if the user's public key P_ID has been replaced.
• Public Key Replacement (Type I adversary only). A can replace the associated public key P_ID with a new public key P'_ID chosen by itself.
• Sign. C returns a valid signature σ using the CL-Sign algorithm, regardless of whether the public key P_ID has been replaced or not.

3. Eventually, A outputs a forgery (ID*, M*, σ*). A wins the game if Verify(param, P_ID*, M*, σ*) = true and the output fulfills the following conditions:

• CL-Sign(ID*, M*) has never been queried.
• If A is of Type I, ID* has not been submitted to Partial Key Extraction.
• If A is of Type II, ID* has not been submitted to Secret Value and Public Key Extraction.

4 MCCLS Scheme

The MCCLS scheme is motivated by the identity-based signature of [24]. Our verification algorithm requires only one pairing operation, hence the MCCLS scheme outperforms the other existing CLS schemes in terms of efficiency. Besides, message signing in the MCCLS scheme is fast, as it involves no pairing computation. The MCCLS scheme comprises the following five stages.

• Setup. Given a cyclic group G1 of prime order p with an admissible pairing e and a generator P, the KGC picks s ∈ Z*p and sets P_pub = sP. It then chooses two hash functions H1 : {0,1}* → G1 and H2 : {0,1}* × G1 → Z*p. The public system parameter list is (P, P_pub, H1, H2), and the master private key is msk = s.
• Extract Partial Private Key.
Given an identity ID, the KGC computes Q_ID = H1(ID) and d_ID = sH1(ID), and outputs d_ID as the partial private key corresponding to Q_ID = H1(ID).
• Generate Key Pair. The user generates a secret value x ∈ Z*p; the public key is P_ID = xP_pub and the user's private key is s_ID = x.
• Sign. Given the user's full private key (d_ID, s_ID) and a message M, the user picks a random r ∈ Z*p and outputs a signature σ = (V, S, R), where S = (1/s_ID) d_ID, R = (r − s_ID)P and V = H2(M, R, P_ID) · rP.
• Verification. Given a signature (V, S, R) on a message M for the identity ID, any verifier computes h = H2(M, R, P_ID) and then checks whether (P_pub, V − hR, S/h, Q_ID) is a valid Diffie-Hellman tuple, that is, whether e(P_pub, Q_ID) = e(V − hR, S/h). If so, the signature is accepted; otherwise it is rejected.

5 Analysis of the MCCLS Scheme

In this section, we analyze the correctness, performance and security of the MCCLS scheme.

5.1 Correctness

The correctness of the MCCLS scheme can be verified as follows, writing h = H2(M, R, P_ID) and s_ID = x:

e(V − hR, S/h) = e(hrP − h(r − x)P, S/h) = e(xhP, d_ID/(xh)) = e(P_pub, Q_ID).

Note that e(P_pub, Q_ID) is independent of the message, and only needs to be computed once and for all, so the MCCLS scheme is more efficient than previous schemes.

5.2 Performance

The MCCLS scheme requires only two scalar multiplications in the signing phase, and two scalar multiplications and one pairing operation in the verification phase. Pairing operations are expensive compared with scalar multiplication and exponentiation. Table 1 compares the existing schemes and the MCCLS scheme with respect to the efficiency of the sign and verification algorithms and the length of the public key. It shows that the MCCLS scheme requires the fewest pairing operations and has the same public key length as the other CLS schemes.
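As a sanity check on the Sign and Verification equations above, here is a minimal, self-contained sketch of the MCCLS flow over a toy bilinear group. The "pairing" e(aP, bP) = g^(ab) over G1 = (Z_n, +) is genuinely bilinear but trivially insecure, because it is computable only by exposing discrete logarithms; all constants (n, p, g, s, x, r) and the hash constructions are our own illustrative choices, not the paper's.

```python
import hashlib

# Toy bilinear group: G1 = (Z_n, +) with generator P = 1, G2 = order-n
# subgroup of Z_2039* generated by g = 4. Insecure by design; for
# illustrating the MCCLS algebra only.
n = 1019            # prime order of G1 and G2
p = 2 * n + 1       # 2039, also prime, so Z_p* has an order-n subgroup
g = 4               # generator of that subgroup
P = 1               # generator of the additive toy group G1

def e(A, B):
    """Toy pairing: e(aP, bP) = g^(ab) mod p (bilinear, not secure)."""
    return pow(g, (A * B) % n, p)

def H1(identity):                        # H1 : {0,1}* -> G1 \ {0}
    return int.from_bytes(hashlib.sha256(identity).digest(), "big") % (n - 1) + 1

def H2(m, R, Pid):                       # H2 : {0,1}* x G1 -> Z_n*
    data = m + R.to_bytes(2, "big") + Pid.to_bytes(2, "big")
    return int.from_bytes(hashlib.sha256(data).digest(), "big") % (n - 1) + 1

# Setup: master key s, P_pub = sP.
s = 123
Ppub = (s * P) % n

# Extract Partial Private Key: d_ID = s * H1(ID).
ID = b"alice@example.org"
Qid = H1(ID)
d_id = (s * Qid) % n

# Generate Key Pair: secret value x, public key P_ID = x * P_pub.
x = 77
Pid = (x * Ppub) % n

def sign(m, r):
    S = (pow(x, -1, n) * d_id) % n       # S = (1/s_ID) d_ID
    R = (r - x) % n                      # R = (r - s_ID)P
    V = (H2(m, R, Pid) * r) % n          # V = H2(M, R, P_ID) * rP
    return V, S, R

def verify(m, V, S, R):
    h = H2(m, R, Pid)
    left = e(Ppub, Qid)                                    # message-independent
    right = e((V - h * R) % n, (S * pow(h, -1, n)) % n)    # e(V - hR, S/h)
    return left == right

V, S, R = sign(b"route update", 555)
ok = verify(b"route update", V, S, R)    # True: matches the 5.1 derivation
```

The check reduces, exactly as in Section 5.1, to e(xhP, d_ID/(xh)) = e(P_pub, Q_ID); note that the left-hand pairing never touches the message, which is the source of the "compute once and for all" saving.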
Table 1: Comparison of the CLS schemes

         AP [1]    LCS [14]  ZWXF [26]  YHG [22]  MCCLS
Sign     1P+3S     2S        3S         2S        2S
Verify   4P+1E     4P+2S     4P         2P+3S     1P+3S
PKLen    2 points  2 points  1 point    1 point   1 point

PKLen: public key length; S: scalar multiplication; P: pairing operation; E: exponentiation.

5.3 Security Proof

In this section we discuss the security of the MCCLS scheme under the security model of Section 3. The main theorems concerning the security of our scheme are:

Theorem 2. Our certificateless signature scheme is existentially unforgeable against a Type I adversary A_I in the random oracle model, under the assumption that the CDH problem in G1 is infeasible.

Proof. Suppose there exists an adversary A_I which has an advantage in attacking the MCCLS scheme. We build a challenger C that uses A_I to solve the CDH problem. C receives an instance (P, aP, bP) of the CDHP; its goal is to compute abP. In the Setup phase, C sets P as the generator of the group and sets P_pub = aP, where a is the master key, which is unknown to A_I. In order to avoid collisions, C maintains a list L = (ID_i, d_ID_i, s_ID_i, P_ID_i) throughout the game; the list is initially empty. C then answers oracle queries with the following procedures [26]:

• H1 queries. Suppose A_I makes at most q_H1 queries to the H1 oracle. C randomly chooses j ∈ [1, q_H1]. When an identity ID_i is submitted to the H1 oracle, with i ∈ [1, q_H1]: if i = j, C assumes ID_i = ID* and saves an entry (ID_i, Q_i, y_i) in a list L1, where Q_i = bP and y_i = ⊥ (indicating failure); otherwise, C generates a random number y_i, sets Q_i = y_i P, and saves (ID_i, Q_i, y_i) in L1.
• Partial Key Extraction (ID_i) queries. When A_I queries ID_i: if ID_i = ID*, C aborts and halts the simulation.
Otherwise C consults L and proceeds as follows:
– If the list L contains (ID_i, d_ID_i, s_ID_i, P_ID_i), C checks whether d_ID_i = ⊥. If d_ID_i ≠ ⊥, C returns d_ID_i to A_I. If d_ID_i = ⊥ and ID_i ≠ ID*, C answers with d_ID_i = y_i P_pub = y_i(aP) as the partial private key, returns d_ID_i to A_I and adds it to L.
– If the list L does not contain (ID_i, d_ID_i, s_ID_i, P_ID_i), C sets d_ID_i = y_i P_pub = y_i(aP), sets (s_ID_i, P_ID_i) = ⊥, and adds (ID_i, d_ID_i, s_ID_i, P_ID_i) to the list L.
• Public Key Extraction (ID_i) queries. When A_I queries ID_i, C consults L and proceeds as follows:
– If the list L contains (ID_i, d_ID_i, s_ID_i, P_ID_i), C checks whether P_ID_i = ⊥. If P_ID_i ≠ ⊥, C returns P_ID_i to A_I. Otherwise, C picks a random x_i ∈ Z*p, sets P_ID_i = x_i P_pub and s_ID_i = x_i, returns P_ID_i to A_I and adds (s_ID_i, P_ID_i) to L.
– If the list L does not contain (ID_i, d_ID_i, s_ID_i, P_ID_i), C picks a random x_i ∈ Z*p, sets P_ID_i = x_i P_pub and s_ID_i = x_i, returns P_ID_i to A_I and adds (s_ID_i, P_ID_i) to L.
• Secret Value Extraction (ID_i) queries. When A_I queries ID_i: if ID_i = ID*, C aborts and halts the simulation. Otherwise C consults L and proceeds as follows:
– If the list L contains (ID_i, d_ID_i, s_ID_i, P_ID_i), C checks whether d_ID_i = ⊥. If d_ID_i = ⊥, C executes a Partial Key Extraction query to obtain d_ID_i. If P_ID_i = ⊥, C makes a Public Key Extraction query to obtain s_ID_i = x_i and P_ID_i = x_i P_pub. C then saves the values and adds the full private key (d_ID_i, s_ID_i) to the list L.
– If the list L does not contain (ID_i, d_ID_i, s_ID_i, P_ID_i), C executes a Partial Key Extraction query to obtain d_ID_i and a Public Key Extraction query to obtain (s_ID_i, P_ID_i). C then saves the values and adds the full private key (d_ID_i, s_ID_i) to the list L.
• Public Key Replacement (ID_i, P'_ID) queries. When A_I makes a query (ID_i, P'_ID_i), C consults L and proceeds as follows:
– If the list L contains (ID_i, d_ID_i, s_ID_i, P_ID_i), C sets P_ID_i = P'_ID_i and s_ID_i = ⊥.
– If the list L does not contain (ID_i, d_ID_i, s_ID_i, P_ID_i), C sets d_ID_i = ⊥, P_ID_i = P'_ID_i and s_ID_i = ⊥, and then adds the entry to the list L.
• H2 queries. When A_I makes a query (M, R, P_ID_i), C first checks whether an entry (M, R, P_ID_i, h_j) has already been defined in a list L2. If so, C returns h_j to A_I. Otherwise, C picks a random h_j ∈ Z*p as the hash value, returns h_j, and adds the entry to L2.
• Sign (ID_i, M_j) queries. When A_I asks for a signature by user ID_i on a message M_j, C looks up (ID_i, d_ID_i, s_ID_i, P_ID_i). If d_ID_i is not found, C runs a Partial Key Extraction query; if (P_ID_i, s_ID_i) is not found, C runs a Public Key Extraction query. Note that if ID_i ≠ ID*, C is able to generate a signature on any message using the corresponding full private key (d_ID_i, s_ID_i). If ID_i = ID*, assume that P_ID_i is the current public key and that the corresponding private key s_ID_i = x, with x ∈ Z*p, is additionally submitted by A_I; this is necessary because, when the public key has been replaced earlier by A_I, C cannot know the corresponding private key, and the signing oracle's answer might otherwise be incorrect. On receiving a Sign query, C does the following:

1. Choose a random r_j ∈ Z*p and look up h_j in the list L2; if it is not found, C runs an H2 query to get h_j.
2. Compute V_j = h_j(x + a/r_j)P, S_j = r_j Q_i = r_j bP, and R_j = xP.
3. Return the signature σ = (V_j, S_j, R_j).

The σ returned to A_I appears to be a valid signature, since

e(V_j − h_j R_j, S_j/h_j) = e(h_j(x + a/r_j)P − h_j xP, r_j bP/h_j) = e((h_j a/r_j)P, r_j bP/h_j) = e(aP, bP) = e(P_pub, Q_ID).

Finally, A_I outputs a valid forgery (ID_j, M_j, R_j, S_j, V_j). If ID_j ≠ ID*, C outputs failure and aborts the simulation. Otherwise, since (P_pub, V_j − h_j R_j, S_j/h_j, Q_i) is a valid Diffie-Hellman tuple, we can compute r_j as r_j = a h_j/(v_j − h_j x) [21]. Substituting r_j into S_j, we have S_j = (a h_j/(v_j − h_j x)) Q_i = (a h_j/(v_j − h_j x)) bP, so that abP = S_j(v_j − h_j x)/h_j.
Thus abP = S_j(v_j − h_j x)/h_j is the answer to our CDHP instance: if A_I can break our scheme, then the attacker solves the CDH problem.

Theorem 3. Our certificateless signature scheme is existentially unforgeable against a Type II adversary A_II in the random oracle model, under the assumption that the CDH problem in G1 is infeasible.

Proof. Suppose there exists an adversary A_II which has an advantage in attacking the MCCLS scheme. We build a challenger C that uses A_II to solve the CDH problem. C receives an instance (P, aP, bP) of the CDHP; its goal is to compute abP. In the Setup phase, C sets P as the generator of the group and sets P_pub = sP, where s is the master key, which is known to A_II. In order to avoid collisions, C maintains a list L = (ID_i, s_ID_i, P_ID_i) throughout the game; the list is initially empty. C then answers oracle queries with the following procedures:

• H1 queries. Suppose A_II makes at most q_H1 queries to the H1 oracle. C randomly chooses j ∈ [1, q_H1]. When an identity ID_i is submitted to the H1 oracle, with i ∈ [1, q_H1]: if i = j, C assumes ID_i = ID* and saves an entry (ID_i, Q_i, y_i) in a list L1, where Q_i = aP and y_i = ⊥ (indicating failure); otherwise, C generates a random number y_i, sets Q_i = y_i P, and saves (ID_i, Q_i, y_i) in L1.
• Public Key Extraction (ID_i) queries. When A_II queries ID_i, C consults L and proceeds as follows: if the list L contains (ID_i, s_ID_i, P_ID_i), C checks whether P_ID_i = ⊥; if P_ID_i ≠ ⊥, C returns P_ID_i to A_II. Otherwise, C picks a random x_i ∈ Z*p, sets P_ID_i = bP_pub and s_ID_i = x_i, returns P_ID_i to A_II and adds (s_ID_i, P_ID_i) to L. If the list L does not contain (ID_i, s_ID_i, P_ID_i), C picks a random x_i ∈ Z*p, sets P_ID_i = bP_pub and s_ID_i = x_i, returns P_ID_i to A_II and adds (s_ID_i, P_ID_i) to L.
• Secret Value Extraction (ID_i) queries.
When A_II queries ID_i: if ID_i = ID*, C aborts and halts the simulation. Otherwise C consults L and proceeds as follows: if the list L contains (ID_i, s_ID_i, P_ID_i), C checks whether P_ID_i = ⊥; if P_ID_i = ⊥, C makes a Public Key Extraction query to obtain (s_ID_i = x_i, P_ID_i = x_i P_pub), then saves the values and adds the user's private key s_ID_i to the list L. If the list L does not contain (ID_i, s_ID_i, P_ID_i), C executes a Public Key Extraction query to obtain (s_ID_i, P_ID_i), then saves the values and adds the user's private key s_ID_i to the list L.
• H2 queries. When A_II makes a query (M, R, P_ID_i), C first checks whether an entry (M, R, P_ID_i, h_j) has already been defined in a list L2. If so, C returns h_j to A_II. Otherwise, C picks a random h_j ∈ Z*p as the hash value, returns h_j, and adds the entry to L2.
• Sign (ID_i, M_j) queries. When A_II asks for a signature by user ID_i on a message M_j, C looks up (ID_i, s_ID_i, P_ID_i); if (P_ID_i, s_ID_i) is not found, C runs a Public Key Extraction query. On receiving a Sign query, C does the following:

1. Choose a random r_j ∈ Z*p and look up h_j in the list L2; if it is not found, C runs an H2 query to get h_j.
2. Compute V_j = ((s h_j + b h_j)/(r_j x_i))P, S_j = r_j x_i Q_i = r_j x_i aP, and R_j = bP/(r_j x_i).
3. Return the signature σ = (V_j, S_j, R_j).

The σ returned to A_II appears to be a valid signature, since

e(V_j − h_j R_j, S_j/h_j) = e(((s h_j + b h_j)/(r_j x_i))P − h_j bP/(r_j x_i), r_j x_i aP/h_j) = e((s h_j/(r_j x_i))P, r_j x_i aP/h_j) = e(sP, aP) = e(P_pub, Q_ID).

Finally, A_II outputs a valid forgery (ID_j, M_j, R_j, S_j, V_j). If ID_j ≠ ID*, C outputs failure and aborts the simulation. Otherwise, since (P_pub, V_j − h_j R_j, S_j/h_j, Q_i) is a valid Diffie-Hellman tuple, we can compute r_j as r_j = (s h_j + b h_j)/(x_i v_j). Substituting r_j into S_j, we have S_j = ((s h_j + b h_j)/(x_i v_j)) x_i Q_i = (s h_j Q_i + b h_j aP)/v_j, and hence abP = (v_j S_j − s h_j Q_i)/h_j.

Thus abP = (v_j S_j − s h_j Q_i)/h_j is the answer to our CDHP instance: if A_II can break our scheme, then the attacker solves the CDH problem.
Table 2: General parameters

Parameter                    Value
Transmitter range            250 m
Bandwidth                    2 Mb/s
Simulation time              600 s
Environment                  900 m × 900 m
Traffic type                 CBR (constant bit rate)
Packet rate                  4 packets/s
Packet size                  512 bytes
Node maximum speed           0, 5, 10, 15, 20 m/s
Pause time                   0 s
Attack nodes                 1, 2 and 4 redirection; 1, 2 and 4 rushing
Queuing policy at routers    first-in-first-out

6 Evaluation and Analysis

In this section, an efficient MCCLS-based scheme named MCDV, built on the Ad hoc On-demand Distance Vector (AODV) routing protocol, is proposed. We run simulations in QualNet [19] in order to compare the original AODV protocol, which has no security mechanisms, with MCDV, which extends AODV with CLS-based routing authentication. We also evaluate the performance of the two schemes under redirection attacks and rushing attacks with 1, 2 and 4 attacking nodes, as this is more realistic for real emergency applications. Our implementation retains most of the AODV mechanisms, such as route discovery, reverse path setup, forwarding path setup and route maintenance. In our experiments, 20 nodes move around a rectangular area of 900 m × 900 m according to the random waypoint mobility model. The nodes spread randomly over the network; each node travels from a random location to a random destination. We vary the node speed from 0 m/s to 20 m/s and set the pause time to 0 s. Table 2 lists the values of the common parameters used in all experiments; other parameters are given in the description of each specific experiment. The performance of MCDV is compared using the following metrics:

• Packet delivery ratio: the ratio of the number of packets received by the destination to the number of packets sent by the source.
• RREQ ratio: the ratio of the total number of RREQ packets initiated, forwarded and retried to the total number of data packets sent as source and forwarded.
It indicates the number of RREQ packets transmitted through the network.
• End-to-end delay: the average time experienced by each packet when traveling from the source to the destination.
• Throughput: the ratio of the total bytes sent by all source nodes to the total time.
• Packet drop ratio: the ratio of the number of packets discarded by attacking nodes to the total number of packets sent by all sources.

Effects of various metrics on different protocols: the experiments in this section study the relative performance of MCDV and AODV; the results are shown in Fig. 1. The packet delivery ratio and the RREQ ratio are shown in Fig. 1(a) and Fig. 1(b), respectively. We can see that MCDV works well in this experiment: the packet delivery ratio and RREQ ratio of AODV are very similar to those of MCDV, so MCDV causes no substantial degradation of network performance.

[Figure 1: Effects of various metrics on different protocols. Panels compare AODV and MCDV on (a) packet delivery ratio, (b) RREQ ratio, (c) end-to-end delay (s), (d) throughput (bit/s).]

As the node speed increases, the number of data packets reaching the destination decreases and the number of RREQ packets transmitted through the network increases. The end-to-end delay of the MCDV scheme is shown in Fig. 1(c). Our scheme has a slightly higher delay than AODV, due to the exchange of packets during the authentication phase of the security process.
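The four metrics above can be computed directly from trace counters. The helper names and sample counts below are our own illustration; only the 600 s duration, 4 packets/s rate and 512-byte packet size come from Table 2.

```python
# Evaluation metrics from Section 6, as plain ratio computations.

def packet_delivery_ratio(received, sent):
    """Packets received at the destination over packets sent by the source."""
    return received / sent

def rreq_ratio(rreq_initiated, rreq_forwarded, rreq_retried,
               data_sent, data_forwarded):
    """Total RREQ activity over total data-packet activity."""
    return (rreq_initiated + rreq_forwarded + rreq_retried) / (data_sent + data_forwarded)

def throughput_bps(total_bytes, duration_s):
    """Total bytes sent by all sources over the total time, in bit/s."""
    return total_bytes * 8 / duration_s

def packet_drop_ratio(dropped_by_attackers, total_sent):
    """Packets discarded by attacking nodes over all packets sent."""
    return dropped_by_attackers / total_sent

# Example: one 600 s CBR source at 4 packets/s (Table 2); the count of
# 2280 received packets is a hypothetical trace value.
sent = 4 * 600                                          # 2400 packets offered
pdr = packet_delivery_ratio(received=2280, sent=sent)   # 0.95
```

Expressing the metrics this way makes the later comparisons reproducible from raw QualNet trace counts rather than from the plotted curves.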
The results show that MCDV has an end-to-end delay similar to AODV at relatively low speeds; however, when the maximum node speed is higher than 15 m/s, AODV outperforms the MCDV scheme. More specifically, our scheme performs authentication operations that AODV does not. We only measure delays for data packets that survive to reach their destination. As shown in Fig. 1(d), the throughput of MCDV also holds up well: the effect on network throughput is very small (around 0.16%). However, in other realistic scenarios, such as disaster scenarios, battlefield scenarios or very high-speed scenarios, the effect on network throughput may be larger.

Effects of multiple attackers with redirection attacks: we simulated AODV and MCDV under redirection attacks, varying the node speed from 0 m/s to 20 m/s while setting the number of attacking nodes to 1, 2 and 4, respectively. We first study the packet delivery ratio and the packet drop ratio. From the results in Fig. 2(a), we can see that the packet delivery ratio drops as speed increases when the AODV routing protocol is used under redirection attacks. Meanwhile, we observe that at the same node speed, the higher the number of attackers, the lower the packet delivery ratio in AODV. With 4 attackers, the packet delivery ratio declines dramatically, to 43%, as the node speed increases to 15 m/s.

[Figure 2: Effects of multiple attackers with redirection attacks. Panels compare AODV and MCDV with 1, 2 and 4 attackers on (a) packet delivery ratio, (b) RREQ ratio, (c) end-to-end delay, (d) throughput.]

In contrast, the packet delivery ratio of MCDV stays between 94% and 98% even when the number of attackers increases to 4, only slightly lower than the normal packet delivery ratio seen in Fig. 1(a). This is because our routing scheme retains most of AODV's mechanisms, and the extra cost of the sign and verification phases is very low. As we would expect from Fig. 2(b), MCDV is able to detect all redirection attacks, and its packet drop ratio is zero. On the contrary, with AODV the packet drop ratio at a given speed rises as the number of attacking nodes increases; in particular, the highest packet drop ratio of AODV is almost 25%, at a speed of 15 m/s. MCDV can detect all the attacks because each node verifies the signature when it receives a packet: only if the packet passes verification does the receiving node update its routing table entry according to the information carried in the packet; otherwise, the node drops the packet. Readers may note that in Fig.
2(c), for the same node speed, the end-to-end delay of mcdv under redirection attacks is slightly higher than that of aodv under the same attacks. this is simply due to our definition of end-to-end delay: the time a packet takes to travel from the source to the destination. given the same network size and the same numbers of senders and receivers, as one or more attackers are added to the network, the number of nodes available to forward packets decreases, making the average end-to-end delay decrease. fig. 2(d) shows the throughput in the network. under aodv, at a given speed, the more attackers there are, the lower the throughput; and as the speed goes up, the throughput decreases further. when the speed is 15m/s, the throughput of aodv drops to 76% of that of the original protocol. in contrast, our scheme follows a trend very similar to the original aodv protocol: at a speed of 15m/s with 4 redirection attackers, the largest effect on throughput is around 0.9%.

[figure 3: effects of multiple attackers with rushing attacks — (a) packet delivery ratio, (b) rreq ratio, (c) end-to-end delay, (d) throughput, each under multiple rushing attacks]

effects of multiple attackers with rushing attacks: in this section, we compare the same metrics for aodv and mcdv under 1, 2 and 4 rushing attackers, respectively. fig. 3(a) shows that, under aodv, the higher the node speed, the lower the packet delivery ratio; moreover, the packet delivery ratio declines dramatically to 24% when the number of attackers increases to 4 at a speed of 20m/s. on the other hand, the lowest packet delivery ratio of mcdv remains at 95% when the nodes move at 15m/s. fig. 3(b) shows that, at a given speed, the more attacker nodes there are, the higher the packet dropped ratio under aodv. in contrast, mcdv detects all the rushing attacks, so its packet dropped ratio is zero. these results indicate that the aodv protocol performs worse under rushing attacks than under redirection attacks. this is because we set the transmit distance to 740m to simulate the rushing attacks; in this situation, the malicious nodes can readily gain access to the forwarding group and discard all data packets. as the number of attackers grows, the packet delivery ratio decreases and the packet dropped ratio rises. in contrast, mcdv maintains a high packet delivery ratio and a packet dropped ratio of zero, owing to its low computation overhead and efficient implementation of signing and verification. fig. 3(c) shows the end-to-end delay under rushing attacks. the mcdv end-to-end delay is slightly higher than the aodv end-to-end delay; the explanation is similar to the one given for redirection attacks.
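the verify-before-update rule that lets mcdv reject forged routing packets — a node accepts a routing update only after the packet's signature verifies, and drops the packet otherwise — can be sketched as below. this is an illustrative sketch only: the packet fields, the `handle_route_packet` name, and the toy `verify_signature` stand-in are our assumptions, not the paper's actual mccls implementation.

```python
# sketch of the verify-before-update rule used by mcdv (hypothetical names):
# a node accepts a routing update only if the packet's signature verifies;
# otherwise the packet is dropped and the routing table is left untouched.

def verify_signature(packet, public_key):
    # stand-in for the mccls pairing-based verification; here we just
    # compare against a precomputed tag so the control flow is testable.
    return packet.get("sig") == ("tag", packet["dest"], packet["seq"], public_key)

def handle_route_packet(routing_table, packet, public_key):
    """return True if the packet was accepted (and the table possibly updated)."""
    if not verify_signature(packet, public_key):
        return False                      # drop: possible redirection attack
    entry = routing_table.get(packet["dest"])
    # aodv-style freshness check: accept only strictly newer sequence numbers
    if entry is None or packet["seq"] > entry["seq"]:
        routing_table[packet["dest"]] = {"next_hop": packet["src"],
                                         "seq": packet["seq"]}
    return True

table = {}
good = {"dest": "D", "src": "B", "seq": 7, "sig": ("tag", "D", 7, "pk")}
forged = {"dest": "D", "src": "M", "seq": 99, "sig": "bogus"}
handle_route_packet(table, good, "pk")     # accepted, route installed
handle_route_packet(table, forged, "pk")   # rejected, table unchanged
```

the point of the sketch is the control flow: the routing table is only touched after a successful verification, which is why the forged update with the "fresher" sequence number has no effect.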
the difference is that when a node turns into an attacker, the probability of that attacker being selected into the forwarding group increases, and the average end-to-end delay decreases. fig. 3(d) shows the throughput of the two protocols under rushing attacks. although the mechanisms of redirection and rushing attacks differ, they affect the throughput in a similar way; the throughput drops more severely under rushing attacks than under redirection attacks. in particular, the lowest throughput drops to almost 63% under 4 rushing attackers when the nodes move at 20m/s. in contrast to aodv under rushing attacks, our scheme achieves throughput very close to that of the original protocol.

7 conclusion

an efficient certificateless signature scheme named mccls is proposed in this paper. the scheme, designed for emergency mobile wireless cyber-physical systems, is based on the bilinear diffie-hellman assumption in the random oracle model. since mccls requires only one pairing operation in the verification phase, and none in the signing phase, it is theoretically more efficient than existing certificateless signature schemes. we also present simulations of mcdv, which is based on the mccls scheme, and compare its performance under the two most common attacks (redirection and rushing attacks) with the typical aodv protocol, which provides no protection mechanism. the results show that mcdv can completely resist both kinds of attacks without causing substantial degradation of network performance. in the future, we will further investigate security schemes in the wider physical environment, seeking schemes that either prevent a more comprehensive range of external attacks or resist internal attacks from compromised nodes.

8 acknowledgment

this work was supported in part by an nserc discovery grant 341823-07 and a national study-abroad scholarship of p.r. china under grant no. [2007] 3020.
part of this work has been published in preliminary form in the proceedings of the first international workshop on cyber-physical systems, in conjunction with icdcs 2008, beijing, china.

bibliography

[1] s. s. al-riyami and k. g. paterson. certificateless public key cryptography. in advances in cryptology – asiacrypt 2003: international conference on the theory and application of cryptology. lncs, springer-verlag, 2003.
[2] e. a. lee. cyber-physical systems: are computing foundations adequate? technical report, uc berkeley, 2006.
[3] m. anand, e. cronin, and m. sherr. security challenges in next generation cyber physical systems. technical report, university of pennsylvania, 2007. http://www.truststc.org/scada/papers/paper33.pdf.
[4] m. bechler, h.-j. hof, d. kraft, f. pahlke, and l. wolf. a cluster-based security architecture for ad hoc networks. in infocom 2004: twenty-third annual joint conference of the ieee computer and communications societies, volume 4, pages 2393–2403, 7-11 march 2004.
[5] m. bellare and p. rogaway. random oracles are practical: a paradigm for designing efficient protocols. in 1st acm conference on computer and communications security, pages 62–73, 1993.
[6] d. boneh and m. franklin. identity-based encryption from the weil pairing. in advances in cryptology – crypto 2001: 21st annual international cryptology conference, volume 2139 of lncs, pages 213–229, 2001.
[7] v. daza, j. herranz, p. morillo, and c. ràfols. cryptographic techniques for mobile ad-hoc networks. comput. networks, 51(18):4938–4950, 2007.
[8] h. deng, a. mukherjee, and d. p. agrawal. threshold and identity-based key management and authentication for wireless ad hoc networks. in international conference on information technology: coding and computing (itcc 2004), pages 107–111, april 2004.
[9] b. d. noble and j. flinn.
wireless, self-organizing cyber-physical systems. technical report, university of michigan, 2006. http://varma.ece.cmu.edu/cps/position-papers/noble-flinn.pdf.
[10] w. he, y. huang, k. nahrstedt, and w. c. lee. smock: a self-contained public key management scheme for mission-critical wireless ad hoc networks. in percom ’07: proceedings of the fifth ieee international conference on pervasive computing and communications, pages 201–210, washington, dc, usa, 2007. ieee computer society.
[11] y.-c. hu, a. perrig, and d. b. johnson. rushing attacks and defense in wireless ad hoc network routing protocols. in proc. of the acm workshop on wireless security (wise 2003), pages 30–40, 2003.
[12] x. huang, w. susilo, y. mu, and f. zhang. on the security of certificateless signature schemes from asiacrypt 2003. in international conference on cryptology and network security (cans), lncs, volume 4, 2005.
[13] a. khalili, j. katz, and w. arbaugh. toward secure key distribution in truly ad hoc networks. in proc. ieee workshop on security and assurance in ad hoc networks, pages 342–346, jan 2003.
[14] x. li, k. chen, and l. sun. certificateless signature and proxy signature schemes from bilinear pairings. lithuanian mathematical journal, 45(1), 2005.
[15] h. luo, j. kong, p. zerfos, s. lu, and l. zhang. ursa: ubiquitous and robust access control for mobile ad hoc networks. ieee/acm transactions on networking, 12(6):1049–1063, 2004.
[16] national science foundation. cyber-physical systems. technical report, nsf workshop on cyber-physical systems, 2006. http://varma.ece.cmu.edu/cps/.
[17] national science foundation. computer systems research. technical report, nsf, 2007. http://www.nsf.gov/pubs/2007/nsf07504/nsf07504.htm.
[18] k. sanzgiri, b. dahill, b. levine, c. shields, and e. belding-royer. a secure routing protocol for ad hoc networks. in proceedings of the 10th ieee international conference on network protocols, pages 78–87, 12-15 nov. 2002.
[19] scalable network technologies.
qualnet simulator. http://www.scalable-networks.com/.
[20] a. shamir. identity-based cryptosystems and signature schemes. in advances in cryptology: proceedings of crypto ’84, 1984.
[21] s. xu, y. mu, and w. susilo. online/offline signatures and multisignatures for aodv and dsr routing security. in 11th australasian conference on information security and privacy, acisp 2006. lncs, 2006.
[22] w.-s. yap, s.-h. heng, and b.-m. goi. an efficient certificateless signature scheme. in euc workshops, volume 4097 of lecture notes in computer science, pages 322–331, 2006.
[23] s. yi and r. kravets. moca: mobile certificate authority for wireless ad hoc networks. in proc. second ann. pki research workshop (pki ’03), apr 2003.
[24] h. yoon, j. h. cheon, and y. kim. batch verifications with id-based signatures. in icisc: international conference on information security and cryptology. lncs, 2004.
[25] y. zhang, w. liu, w. lou, y. fang, and y. kwon. ac-pki: anonymous and certificateless public-key infrastructure for mobile ad hoc networks. in 2005 ieee international conference on communications (icc 2005), pages 3515–3519, may 2005.
[26] z. zhang, d. s. wong, j. xu, and d. feng. certificateless public-key signature: security model and efficient construction. in applied cryptography and network security, 4th international conference, acns 2006, singapore, june 6-9, 2006, proceedings, volume 3989 of lecture notes in computer science, pages 293–308, 2006.
[27] l. zhou and z. haas. securing ad hoc networks. ieee network, 13(6):24–30, nov/dec 1999.

zhong xu (1,2), xue liu (1)
(1) mcgill university, school of computer science, 3480 university street, montreal, quebec, canada, h3a 2a7. e-mail: {zhongxu,xueliu}@cs.mcgill.ca
guoqing zhang (2)
(2) northwestern polytechnical university, college of automation, xi’an, shaanxi, china. e-mail: gqzhang@cs.mcgill.ca
wenbo he
university of illinois at urbana-champaign, dept. of computer science, urbana, il, usa.
e-mail: wenbohe@uiuc.edu

zhong xu received the b.e. degree in automation from xi’an technological university in 2001 and the m.e. degree in computer science from xidian university in 2005. he is currently a joint ph.d. student at mcgill university, montreal, canada and northwestern polytechnical university, xi’an, china. from august 2001 to august 2002, he was an assistant lecturer at xi’an technological university, china. his research interests include security of ad hoc networks, information security, embedded systems and cyber-physical systems.

dr. xue (steve) liu is an assistant professor in the school of computer science at mcgill university. he is also affiliated with the centre for intelligent machines (cim). xue obtained his ph.d. in computer science from the university of illinois at urbana-champaign in 2006. he obtained his b.s. degree in mathematics and m.s. degree in automatic control, both from tsinghua university, china. he worked briefly at hewlett-packard labs and the ibm t. j. watson research center. he received the ray ozzie fellowship, the saburo muroga fellowship, the mavis memorial fund award, and the c. w. gear outstanding graduate award from the university of illinois at urbana-champaign. he has filed 5 patents and published more than 50 research papers in international journals and major peer-reviewed conference proceedings.

guoqing zhang received the b.e. degree and m.e. degree in automation, both from northwestern polytechnical university. he is currently a ph.d. student at northwestern polytechnical university, xi’an, china. his research interests include vehicular ad-hoc networks, information security and embedded systems.

wenbo he is currently a ph.d. student in the department of computer science at the university of illinois at urbana-champaign, where she is advised by professor klara nahrstedt.
she received the mavis memorial fund scholarship award from the college of engineering of uiuc in 2006, and the c. w. gear outstanding graduate award from the department of computer science in 2007. she is also a recipient of a vodafone fellowship for 2005-2008. wenbo received the m.s. degree in electrical and computer engineering from the university of illinois at urbana-champaign in 2000. she received the m.eng. degree in automatic control theory from tsinghua university, beijing, china, in 1998, and the b.e. degree in automatic control from harbin engineering university, heilongjiang, china, in 1995. from august 2001 to january 2005, she was a software engineer with cisco systems inc., champaign, il. her research interests include pervasive and mobile computing, and network security and privacy.

international journal of computers, communications & control vol. i (2006), no. 4, pp. 45-52

the fast fourier and hilbert-huang transforms: a comparison

denis donnelly

abstract: the conversion of time domain data via the fast fourier (fft) and hilbert-huang (hht) transforms is compared. the fft treats amplitude vs. time information globally as it transforms the data to an amplitude vs. frequency description. the hht is not constrained by the assumptions of stationarity and linearity required for the fft, and generates both amplitude and frequency information as a function of time. the behavior and flexibility of these two transforms are examined for a number of different time domain signal types.

keywords: fast fourier transform, hilbert-huang transform, data analysis

1 introduction

a common approach in spectrum analysis for extracting frequency information from time series data is to use the fast fourier transform (fft) [don, 05]. a more recent method [hua, 98], [hua, 05], which generates amplitude and frequency vs. time spectra, is the hilbert-huang transform (hht). these two approaches are fundamentally different.
the fft assumes stationarity and linearity of the data and relies on globally defined orthogonal basis states. the hht does not require the same assumptions of the data. in order to represent nonlinear and nonstationary data, global basis states must be replaced with adaptive, locally determined ones, a process the first stage of the hht performs. the resulting basis states are, in general, not strictly orthogonal. after a very brief review of the fft, the hht is described. the comparative behavior of the two transforms is then explored: various data sets with different signal characteristics are examined, and finally the chirp of a bat is analyzed.

2 the fast fourier transform

the fast fourier transform (fft) provides an efficient algorithm for converting data from the time domain into the frequency domain. typically, the data to be transformed consist of N uniformly spaced points x_j = x(t_j), where N = 2^n with n an integer, and t_j = jΔt with j ranging from 0 to N − 1. the discrete fourier transform can be expressed in several ways. a commonly used form is the following (with i = √−1):

$$X_k = \sum_{j=0}^{N-1} x_j \exp\left(-2\pi i \frac{jk}{N}\right) \qquad (1)$$

where k = −N/2, ..., −1, 0, 1, ..., N/2 − 1, and where x_j represent the time domain data and X_k their representation in the frequency domain. the algorithm for the fft conversion process (cooley-tukey or any of several other comparable algorithms) makes the fft widely applicable, as it reduces the number of computations from the order of N² to N log N, an enormous reduction in computation time. the frequency data are typically displayed in one of two ways: an amplitude spectrum or a power spectrum. the amplitude spectrum is typically expressed by the relation

$$A_k = \frac{2}{N}\,|X_k| \qquad (2)$$

whereas the power spectrum is typically expressed by the relation

$$P_k = \frac{1}{N}\,|X_k|^2 \qquad (3)$$

where k = 0, 1, ..., N/2.
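equations (1)–(3) can be checked numerically. the short sketch below uses numpy's fft (which implements the same dft as eq. (1), up to index ordering) on a unit-amplitude sine; the amplitude spectrum of eq. (2) should then peak at the sine's frequency with height 1. the signal parameters are illustrative, not taken from the paper.

```python
import numpy as np

# amplitude spectrum A_k = (2/N)|X_k| (eq. 2) for a unit-amplitude sine:
# the peak of A_k should sit at the sine's frequency with height ~1.
N = 1024                          # number of samples, N = 2^n
dt = 1.0 / N                      # one-second record, so bin k = k hertz
t = np.arange(N) * dt
x = np.sin(2 * np.pi * 50 * t)    # 50 hz sine, amplitude 1

X = np.fft.fft(x)                 # X_k of eq. (1)
A = 2.0 / N * np.abs(X[:N // 2])  # eq. (2), one-sided
P = np.abs(X[:N // 2]) ** 2 / N   # eq. (3)

peak = int(np.argmax(A))          # should land on bin 50
```

because 50 hz fits an integer number of cycles into the record, the energy lands in a single bin and A recovers the amplitude exactly; a non-integer number of cycles would spread the peak (leakage), which is part of the "rough appearance" discussed in the examples below.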
3 the hilbert-huang transform

the hilbert-huang transform is carried out in two stages: 1) the empirical mode decomposition (emd) process, which deconstructs the signal into a set of intrinsic mode functions (imfs), and 2) the extraction of frequency vs. time information from each of the imfs in combination with its hilbert transform (ht). a brief summary of the process for discretely sampled signals follows.

the emd process deconstructs the original signal into a set of imfs. each imf, extracted from the signal by a series of siftings, has two fundamental properties: 1) the number of extrema and the number of zero crossings differ, at most, by one, and 2) the mean value of the envelopes defined by the local maxima and local minima is zero. unlike the harmonic functions of a fourier series, these oscillatory functions may vary in both amplitude and frequency over time. in this decomposition process, the first imf contains the highest frequencies associated with the original signal; each subsequent imf contains lower-frequency components.

amplitudes and frequencies are extracted from these imfs in the second stage of the hht process. the instantaneous amplitude and angular frequency associated with each imf depend on the amplitude and phase of the complex number that the imf and its hilbert transform (ht) define. the real part of the complex number is the imf; the imaginary part is the imf's ht. the instantaneous amplitude is the modulus of this complex number, and the instantaneous angular frequency associated with that imf is the derivative of the unwrapped phase. the entire process is repeated for each imf to extract the complete frequency versus time information from the original data. the computation of the ht is essentially a convolution of an imf, x(t), with 1/t. the effect of this convolution is to emphasize the local properties of x(t); this locality preserves the time structure of the signal's amplitude and frequency.
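the first of the two defining imf properties above can be checked by simple counting. the sketch below is illustrative (the function names are ours): it counts extrema and zero crossings and compares them; the envelope-mean test is omitted since it needs the spline construction described next.

```python
import numpy as np

def count_extrema(x):
    """count interior local maxima and minima of a sampled signal."""
    d = np.diff(x)
    # sign changes of the first difference mark interior extrema
    return int(np.sum(np.diff(np.sign(d)) != 0))

def count_zero_crossings(x):
    return int(np.sum(np.diff(np.sign(x)) != 0))

def looks_like_imf(x):
    # first imf property: extrema and zero crossings differ by at most one
    return abs(count_extrema(x) - count_zero_crossings(x)) <= 1

t = np.linspace(0, 1, 2000, endpoint=False)
mono = np.sin(2 * np.pi * 5 * t)                    # a single 5 hz tone
mixed = np.sin(2 * np.pi * 5 * t) + 0.3 * np.sin(2 * np.pi * 40 * t)
```

the single tone satisfies the count criterion (10 extrema, 10 zero crossings), while the two-tone mixture does not — its 40 hz ripple adds many extrema without a matching number of zero crossings, which is exactly why sifting is needed to separate the components.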
the emd process is designed to deconstruct the complete signal into a set of imfs, each of which is extracted from a starting data set via a sifting process. this sifting process is repeated until the criteria listed above are satisfied and the difference between successive siftings is suitably small (there is some discussion as to what the terminating difference between siftings should be; it depends to some extent on the data set being examined). the process of extracting imfs terminates when the residual contains no significant frequency information. this sifting process comprises several steps. given a discretely sampled signal y(t):

1. determine the locations of all maxima, y_max(t), and minima, y_min(t), of y(t).
2. fit a cubic spline through the y_max(t) and another through the y_min(t).
3. calculate the mean of the spline curves at each point: m(t) = (y_max(t) + y_min(t))/2.
4. remove the trend, m(t): let d(t) = y(t) − m(t).
5. is d(t) an imf? if d(t) meets the criteria defining an imf, let c_i(t) = d(t) and advance i by 1; extract the residual r(t) = y(t) − d(t). if d(t) does not meet the criteria, further sifting is required: repeat steps 1 through 5, substituting d(t) for y(t).
6. repeat steps 1 through 5 until the residual no longer contains any useful frequency information.

the original signal is, of course, equal to the sum of its parts. if we have n imfs and a final residual r_n(t),

$$y(t) = \sum_{i=1}^{n} c_i(t) + r_n(t) \qquad (4)$$

the second stage of the hht process extracts the amplitude and frequency information from each imf. the steps for amplitude and frequency extraction from a given discrete imf are as follows:

1. compute the imf's discrete fourier transform (dft) using the series expression (1) for the transform.
2. compute the ht.
use the real and imaginary parts of step 1's dft as coefficients (with M = N/2):

$$y_j = \frac{1}{N}\sum_{k_1=0}^{M}\left(\mathrm{Re}(X_{k_1})\sin\frac{2\pi k_1 j}{N} + \mathrm{Im}(X_{k_1})\cos\frac{2\pi k_1 j}{N}\right) - \frac{1}{N}\sum_{k_2=M+1}^{N-1}\left(\mathrm{Re}(X_{k_2})\sin\frac{2\pi k_2 j}{N} + \mathrm{Im}(X_{k_2})\cos\frac{2\pi k_2 j}{N}\right) \qquad (5)$$

3. form the complex number z_j = x_j + i y_j and extract the phase φ_j = tan⁻¹(y_j / x_j).
4. unwrap the phase so that it becomes a monotonically increasing function.
5. determine the frequency: take the derivative of the phase, f_j = (1/2π) dφ_j/dt.
6. determine the amplitude: a_j = √(x_j² + y_j²).

4 examples

the first examples treat cases where the time domain signal already has the properties of an imf, so that no sifting is required. the examples are chosen to demonstrate the responses of the fft and hht to changes in the frequency and amplitude of the time domain signal. the first signal considered consists of two cycles of a sine wave with frequency equal to one, followed after a brief pause by four cycles of a sine wave with frequency equal to two. fig. 1 shows both the signal and its hht. the oscillations in the hht are typical and vary only slightly with the amplitude of the original signal. the low-frequency end of the fft for this example is shown in fig. 2. while it does show the major peaks at frequencies one and two, it has a rather rough appearance. this complex structure arises because, in effect, the fft has to create a fourier series that fits both the two frequencies and the portions where the signal is zero. the second signal is a fixed-frequency sine wave with amplitude increasing over time. the hht (fig. 3) is not daunted by this amplitude variation; the fft, on the other hand, yields an amplitude consistent with an average of the peak amplitudes of the time domain data. the third signal, a chirp, has a constant amplitude and a variable frequency which increases over time by one order of magnitude.
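the second-stage extraction (steps 3–6 above) amounts, in practice, to forming the analytic signal and differentiating its unwrapped phase. a minimal numpy sketch, using the standard one-sided-spectrum construction of the analytic signal rather than the explicit sums of eq. (5); the test signal is ours:

```python
import numpy as np

def analytic_signal(x):
    """x + i*HT(x) via the one-sided spectrum construction (even-length x)."""
    N = len(x)
    X = np.fft.fft(x)
    h = np.zeros(N)
    h[0] = 1.0
    h[N // 2] = 1.0
    h[1:N // 2] = 2.0           # double the positive frequencies
    return np.fft.ifft(X * h)

def inst_amp_freq(x, dt):
    """steps 3-6 above: amplitude and frequency from the analytic signal."""
    z = analytic_signal(x)                        # z_j = x_j + i*y_j (step 3)
    amp = np.abs(z)                               # a_j              (step 6)
    phase = np.unwrap(np.angle(z))                # unwrapped phi_j  (step 4)
    freq = np.gradient(phase, dt) / (2 * np.pi)   # f_j              (step 5)
    return amp, freq

dt = 0.001
t = np.arange(0, 2, dt)                    # 2000 samples
x = 3.0 * np.sin(2 * np.pi * 7 * t)        # 7 hz tone, amplitude 3
amp, freq = inst_amp_freq(x, dt)
core = slice(100, -100)                    # stay away from edge effects
```

for this single tone (which is itself an imf), the instantaneous amplitude sits at 3 and the instantaneous frequency at 7 hz across the interior of the record, matching the constant-tone behavior described in the text.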
the hht provides an accurate representation of the frequency as long as the signal is sampled sufficiently densely. if the number of samples is too small, the oscillations in the frequency curve grow and can take negative values. fig. 4 shows the frequency response; here n = 2048 and there are more than twenty computed sample points in the last (highest frequency) cycle. the fft (fig. 5) is nearly constant over the frequency span but drops off near the ends.

an example which clearly contrasts the behavior of the fft and hht, and which demonstrates which process is more correct physically, is that of a sum of two sine waves with frequencies close enough to readily show a beat pattern (f1 = 1, f2 = 1.08). the fft of such a pattern is shown in fig. 6: the two original frequencies are clearly present in the amplitude spectrum. the hht, on the other hand, generates a single frequency of 1.04, the average of the two sine wave frequencies. (when dealing with signals that show a beat pattern, artifacts are likely to appear in the frequency representation near the nodal points of the signal.) the hht amplitude function follows the beat pattern as an envelope. the description of the beat signal as a sum of two individual frequency signals is mathematically equivalent to a single signal at the average of the two constituent frequencies, multiplied by an amplitude-modulating function determined by their frequency difference; but the two representations are not equivalent physically. the hht provides the physically correct frequency description: the instantaneous frequency is the average frequency. this is an important notion that is worth emphasizing. we are used to talking about the harmonic content of signals, which is a mathematically correct description, but it may not be the best representation of what is actually taking place.
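the mathematical equivalence invoked above is easy to confirm numerically: two unit sines at f1 = 1 and f2 = 1.08 are identical, point for point, to a single carrier at the average frequency 1.04 modulated by a slow cosine envelope. this is a check of the trigonometric identity only, not of the hht itself:

```python
import numpy as np

f1, f2 = 1.0, 1.08
t = np.linspace(0.0, 25.0, 5001)   # two full beat periods (beat period 12.5)

two_tones = np.sin(2 * np.pi * f1 * t) + np.sin(2 * np.pi * f2 * t)
# one carrier at the average frequency 1.04, modulated by a slow cosine:
modulated = 2.0 * np.cos(np.pi * (f2 - f1) * t) * np.sin(np.pi * (f1 + f2) * t)

max_err = float(np.max(np.abs(two_tones - modulated)))   # rounding error only
```

the two arrays agree to machine precision, which is the sense in which the hht's single frequency of 1.04 plus a beat envelope describes the same signal as the fft's pair of peaks.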
it is interesting to observe what happens to the amplitude spectrum of the fft as a smaller and smaller number of cycles of the pattern is subjected to analysis. if the number of cycles is small enough that the signal does not go beyond the first node of the envelope, then the fft shows only one peak, at the average value of the two frequencies. the hht cannot treat functions such as a single cycle of a square wave, as there are no maxima and minima to fit, while the fft can readily analyze such a signal. if a small-amplitude sine wave is made to ride on top of a square wave, for example, the hht will recognize the sine wave away from the transition region; near the step transition, the frequency response shows a large spike.

it is useful to know how differing frequencies are distributed during the emd process. an estimate can be obtained by taking the ffts of the first and second imfs as the difference between two frequencies is increased. from this, we can observe the behavior of the sorting process. for example, a factor of two or greater in the frequencies is needed to shift the lower frequency signal well into the next imf (see table 1). for example, if the emd process is applied to a signal comprising frequencies 1 and 0.5, the first imf yields an average frequency value of 1 with fluctuations ranging approximately from 0.96 to 1.05; the second imf yields an average value of 0.5 with fluctuations ranging from 0.49 to 0.51.

table 1. peak amplitudes of the two frequency components in the first two imfs (frequency 1 = 1):

frequency 2 | peak amplitude, frequency 1 | peak amplitude, frequency 2, imf1 | peak amplitude, frequency 2, imf2
0.95        | 526                         | 489                               |
0.9         | 521                         | 482                               |
0.8         | 516                         | 444                               | 14
0.7         | 513                         | 362                               | 55
0.6         | 510                         | 188                               | 178
0.5         | 507                         | 34                                | 327
0.4         | 508                         |                                   | 409

for a final application, we look at the chirp of a bat: 400 data points sampled at 7 intervals. the bat's chirp consists of frequencies decreasing over time; consequently, an fft of the entire signal would not yield useful information. instead, to use the fft, a sliding window approach is used [don, 06].
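the sliding-window fft just mentioned can be sketched in a few lines: step a short window along the record, take the fft of each windowed segment, and report the dominant bin. the sketch is illustrative — the chirp standing in for the bat call, its sample rate, and the hop size are our choices, not the paper's data.

```python
import numpy as np

def sliding_fft_peaks(x, fs, win=64, hop=16):
    """dominant frequency in each short window of the signal.

    returns (times, freqs): window-center times and the frequency of the
    largest one-sided fft bin in each window.
    """
    times, freqs = [], []
    for start in range(0, len(x) - win + 1, hop):
        seg = x[start:start + win] * np.hanning(win)   # taper each window
        spec = np.abs(np.fft.rfft(seg))
        k = int(np.argmax(spec[1:])) + 1               # skip the dc bin
        times.append((start + win / 2) / fs)
        freqs.append(k * fs / win)
    return np.array(times), np.array(freqs)

# a downward chirp standing in for the bat call: 400 samples matches the
# record length quoted in the text, but the sample rate here is made up.
fs = 1000.0
t = np.arange(400) / fs
x = np.sin(2 * np.pi * (300.0 - 250.0 * t) * t)   # freq sweeps 300 -> 100 hz
times, freqs = sliding_fft_peaks(x, fs)
```

each window is short enough that the frequency is roughly constant within it, so the per-window peak traces the falling chirp frequency — the same idea as the 64-point subsets used on the bat data.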
for example, look at 64-point subsets of the entire data set. in this time range, the inherent frequency structure is revealed; as this short window slides along, the changing frequencies can be tracked. the hht of the chirp shows information of interest in the first and second imfs. results are shown in fig. 7. the second harmonic had too small an amplitude for the hht process to recognize it. both analysis forms would benefit from a shorter sampling interval. in the figure, the fit to the fundamental is proportional to exp(−const·√t); the fit to the first harmonic is simply two times that of the fundamental. because of the complexity of the signals in the figure, the fft results for the sliding window technique are not shown, but they approximate the fitted lines shown in the figure. there is a weak second harmonic which can be extracted from the data when using the fft process; however, that signal is too weak for the hht to recognize.

5 final comments

the combination of sifting associated with the emd process, then taking the ht, unwrapping the phase, and taking the derivative for each imf, is computationally much more demanding than taking the fft. however, the hht not only provides instantaneous frequency and amplitude descriptions, it results in representations that are more meaningful physically.

acknowledgments: i thank curtis condon, ken white and al feng of the beckman institute of the university of illinois for the bat chirp data and for permission to use it in this article. i thank edwin rogers for helpful comments.

[figure 1: hht frequency representation (solid line) of the time domain signal (dashed).]
[figure 2: fft amplitude spectrum (solid line) of the (dotted) signal shown in fig. 1. only the lower frequencies are displayed for emphasis.]
[figure 3: hht amplitude spectrum (solid line) of the (dotted) signal.]
[figure 4: hht frequency representation of a chirp signal. the hht follows the increase in chirp frequency as it scales from one to ten. the amplitude-one chirp signal is also shown.]
[figure 5: fft of a chirp signal with a frequency ranging from one to ten. only the lower frequencies are displayed for emphasis.]
[figure 6: the fft of a signal consisting of the sum of two sine waves with frequencies 1 and 1.08. only a portion of the fft is shown for emphasis.]
[figure 7: frequency vs. time of a bat chirp as determined via hht. the fluctuating curves represent the hht results. the solid curve corresponds to the fundamental. the dotted curve is the first harmonic. the two smooth curves are fits to the data.]

references

[1] denis donnelly and bert rust, "the fast fourier transform for experimentalists: part i. concepts," computing in science & eng., vol. 7, no. 2, pp. 80-88, 2005.
[2] denis donnelly, "the fast fourier transform for experimentalists: part vi. chirp of a bat," computing in science & eng., vol. 8, no. 2, pp. 72-78, 2006.
[3] norden e. huang et al., "the empirical mode decomposition and the hilbert spectrum for nonlinear and non-stationary time series analysis," proc. r. soc. lond. a., vol. 454, pp. 903-995, 1998.
[4] norden e. huang and samuel shen, eds., the hilbert-huang transform and its applications. new jersey: world scientific, 2005.

denis donnelly
siena college, department of physics
loudonville, ny 12211
e-mail: donnelly@siena.edu
received: november 8, 2006

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. v (2010), no. 5, pp.
642-653

tense θ-valued moisil propositional logic

carmen chiriţă
university of bucharest, faculty of mathematics and computer science
romania, 010014 bucharest, 4 academiei
e-mail: stama@funinf.cs.unibuc.ro

abstract: in this paper we study the tense θ-valued moisil propositional calculus, a logical system obtained from the θ-valued moisil propositional logic by adding two tense operators. the main result is a completeness theorem for the tense θ-valued moisil propositional logic. the proof of this theorem is based on the representation theorem for tense θ-valued łukasiewicz-moisil algebras, developed in a previous paper.

keywords: łukasiewicz-moisil algebras, tense moisil logic.

1 introduction

the first contribution to the algebraic logic of the finite-valued łukasiewicz propositional calculus is moisil's paper [18], in which n-valued łukasiewicz algebras (today called łukasiewicz-moisil algebras) were introduced. according to an example given by a. rose (1957), for n ≥ 5 the łukasiewicz implication cannot be defined in an n-valued łukasiewicz-moisil algebra. hence, moisil discovered a new many-valued logical system (today called moisil logic), whose algebraic models are the n-valued łukasiewicz-moisil algebras. in 1969, moisil defined the θ-valued łukasiewicz algebras, where θ is the order type of a bounded chain. these structures extend part of the definition of n-valued łukasiewicz algebras, but differ from them by admitting many negation operations ([3], [10], [16], [23]). the logic corresponding to the θ-valued łukasiewicz-moisil algebras was developed by boicescu [1] and filipoiu [10] (see also [2]); this logical system is called the θ-valued moisil propositional logic. the chrysippian endomorphisms of θ-valued łukasiewicz-moisil algebras are reflected in the syntax of the θ-valued moisil propositional logic by chrysippian operations.
this paper is devoted to the tense θ-valued moisil propositional calculus, a logical system obtained from the θ-valued moisil propositional calculus by adding the tense operators g and h. the algebraic basis of this logic consists of tense θ-valued łukasiewicz-moisil algebras (tense lmθ-algebras), algebraic structures studied in our paper [7]. we extend some of the results of [8], where a tense n-valued propositional logic was studied. the tense θ-valued moisil propositional calculus unifies two logical systems: classical tense logic and the θ-valued moisil logic. the connection between these logics is realized by axioms that express the behaviour of the tense operators with respect to the chrysippian operations. the paper is organized as follows. in section 2 we recall some definitions and basic facts on θ-valued łukasiewicz-moisil algebras and θ-valued moisil logic, with emphasis on the connectives →k and ↔k and their algebraic counterparts. section 3 deals with tense θ-valued łukasiewicz-moisil algebras (tense lmθ-algebras), algebraic structures obtained from θ-valued łukasiewicz-moisil algebras by adding the two tense operators g and h. section 4 contains the syntactical construction of the tense θ-valued moisil propositional calculus; we establish some properties regarding the inferential structure of this logical system. the lindenbaum-tarski algebra associated with the tense θ-valued moisil propositional calculus is studied in section 5, where we show that it carries the structure of a tense lmθ-algebra. the syntactical properties of the tense θ-valued moisil logic are reflected in this tense lmθ-algebra, so we use the algebraic framework to obtain results for the logical system. in section 6 we define the interpretations of the tense θ-valued moisil propositional calculus and the k-tautologies of this logic.
our main result is the completeness theorem proved in this section (theorem 26). its proof uses the representation theorem of tense lmθ-algebras applied to the lindenbaum-tarski algebra constructed in the previous section.

2 θ-valued moisil logic and θ-valued łukasiewicz-moisil algebras

throughout this paper, let (i, ≤) be a totally ordered set of order type θ, with first and last element denoted by 0 and 1 respectively. we also fix an element k ∈ i throughout this paper. in this section, we recall the θ-valued moisil logic mθ described in [2]. the axiomatization of the θ-valued moisil propositional calculus uses the system of axioms of the θ-valued calculus introduced by boicescu [4] and filipoiu [10]. the basic results are taken from filipoiu [10] (see also [2]). the alphabet of mθ has the following primitive symbols: an infinite set v of propositional variables; the logical connectives ∨, ∧, φi, φ̄i for all i ∈ i; and the parentheses (, ). the set prop(v) of propositions of mθ is defined by canonical induction. for each i ∈ i, we shall use the following abbreviations: p →i q = φ̄ip ∨ φiq and p ↔i q = (p →i q) ∧ (q →i p). the θ-valued propositional calculus has the following k-axioms:
(2.1) p →k (q →k p),
(2.2) (p →k (q →k r)) →k ((p →k q) →k (p →k r)),
(2.3) p ∧ q →k p,
(2.4) p ∧ q →k q,
(2.5) (p →k q) →k ((p →k r) →k (p →k q ∧ r)),
(2.6) p →k p ∨ q,
(2.7) q →k p ∨ q,
(2.8) (p →k q) →k ((r →k q) →k (p ∨ r →k q)),
(2.9) φi(p ∧ q) ↔k φip ∧ φiq, for every i ∈ i,
(2.10) φ̄i(p ∨ q) ↔k φ̄ip ∧ φ̄iq, for every i ∈ i,
(2.11) φjp ↔k φiφjp, for every i, j ∈ i,
(2.12) φ̄jp ↔k φ̄iφjp, for every i, j ∈ i,
(2.13) φ̄jp ↔k φiφ̄jp, for every i, j ∈ i,
(2.14) φjp ↔k φ̄iφ̄jp, for every i, j ∈ i,
(2.15) φip →k φjp, for every i, j ∈ i, i ≤ j.
the notion of formal proof in mθ is defined in terms of the above k-axioms and the k-modus ponens inference rule: from p and p →k q, infer q. for briefness, we will say "modus ponens" (m.p) instead of "k-modus ponens" from now on.
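to make these connectives concrete, the following sketch (hypothetical: it uses the finite chain i = {0, 1, 2} and the algebra of increasing functions i → {0, 1}, i.e. the construction d(l2) recalled later in example 4) checks two of the k-axioms mechanically; `nphi` plays the role of the barred operator φ̄i:

```python
from itertools import product

# hypothetical finite model: the chain I = {0, 1, 2} and the LM algebra of
# increasing functions I -> {0, 1} (the algebra D(L2) of Example 4 below)
n = 3
I = range(n)

def increasing(f):
    return all(f[i] <= f[j] for i in I for j in I if i <= j)

L = [f for f in product((0, 1), repeat=n) if increasing(f)]

def phi(i, f):   # chrysippian endomorphism: (phi_i f)(j) = f(i)
    return tuple(f[i] for _ in I)

def nphi(i, f):  # the barred operator: (phibar_i f)(j) = 1 - f(i)
    return tuple(1 - f[i] for _ in I)

def meet(f, g): return tuple(min(a, b) for a, b in zip(f, g))
def join(f, g): return tuple(max(a, b) for a, b in zip(f, g))

def imp(k, f, g):  # f ->_k g  =  phibar_k f  v  phi_k g
    return join(nphi(k, f), phi(k, g))

k = 1
# axiom (2.1): p ->_k (q ->_k p) takes value 1 at index k for all p, q
assert all(imp(k, p, imp(k, q, p))[k] == 1 for p in L for q in L)
# axiom (2.10): phibar_i(p v q) = phibar_i p ^ phibar_i q (de Morgan)
assert all(nphi(i, join(p, q)) == meet(nphi(i, p), nphi(i, q))
           for i in I for p in L for q in L)
print("k-axioms (2.1) and (2.10) verified in D(L2)")
```

the exhaustive loops are feasible because d(l2) has only four elements for this chain; the same check works for any finite i.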
we shall denote by ⊢k p the fact that p is a k-theorem. we recall some k-theorems of mθ which will be used in our proofs.

proposition 1. ([2], p. 491, example 3.12) the following propositions are k-theorems of mθ:
(2.16) ⊢k p →k p,
(2.17) ⊢k p ↔k φkp,
(2.18) ⊢k (φip ∨ φ̄ip), for every i ∈ i,
(2.19) ⊢k (φj(p ∨ q) ↔k φjp ∨ φjq), j ∈ i,
(2.20) ⊢k (φ̄j(p ∧ q) ↔k φ̄jp ∨ φ̄jq), j ∈ i,
(2.21) ⊢k ((p →k q) →k (φ̄kq →k φ̄kp)),
(2.22) p / φjp, j ≥ k,
(2.23) (φkp →k φkq) / (p →k q).

proposition 2. the following propositions are k-theorems of mθ:
(2.24) ⊢k p →k (q →k (p ∧ q)),
(2.25) ⊢k (p ∧ q →k r) →k (p →k (q →k r)),
(2.26) ⊢k (p →k (q →k r)) →k ((p ∧ q) →k r),
(2.27) ⊢k (p →k q) →k ((q →k r) →k (p →k r)),
(2.28) ⊢k (p →k q) →k ((r →k t) →k (p ∧ r →k q ∧ t)).

proof: we shall establish only the k-theorems (2.24), (2.25) and (2.28).
(2.24) we shall use (2.5), (2.16), (2.1), modus ponens and the deduction theorem (see [2], p. 495, proposition 3.17).
{p, q} ⊢k (p →k p) →k ((p →k q) →k (p →k p ∧ q)) (2.5)
{p, q} ⊢k p →k p (2.16)
{p, q} ⊢k (p →k q) →k (p →k p ∧ q) (m.p)
{p, q} ⊢k q →k (p →k q) (2.1)
{p, q} ⊢k q
{p, q} ⊢k p →k q (m.p)
{p, q} ⊢k p →k (p ∧ q) (m.p)
{p, q} ⊢k p
{p, q} ⊢k p ∧ q (m.p)
{p} ⊢k q →k (p ∧ q) (deduction theorem)
⊢k p →k (q →k (p ∧ q)) (deduction theorem)
(2.25) we shall apply (2.24), modus ponens and the deduction theorem.
{p ∧ q →k r, p, q} ⊢k p →k (q →k (p ∧ q)) (2.24)
{p ∧ q →k r, p, q} ⊢k p
{p ∧ q →k r, p, q} ⊢k q →k (p ∧ q) (m.p)
{p ∧ q →k r, p, q} ⊢k q
{p ∧ q →k r, p, q} ⊢k p ∧ q (m.p)
{p ∧ q →k r, p, q} ⊢k p ∧ q →k r
{p ∧ q →k r, p, q} ⊢k r (m.p)
{p ∧ q →k r, p} ⊢k q →k r (deduction theorem)
{p ∧ q →k r} ⊢k p →k (q →k r) (deduction theorem)
⊢k (p ∧ q →k r) →k (p →k (q →k r)) (deduction theorem)
(2.28) we shall use the k-axioms (2.3), (2.4), modus ponens, k-theorem (2.24) and the deduction theorem.
{p →k q, r →k t, p ∧ r} ⊢k p ∧ r
{p →k q, r →k t, p ∧ r} ⊢k p ∧ r →k p (2.3)
{p →k q, r →k t, p ∧ r} ⊢k p (m.p)
{p →k q, r →k t, p ∧ r} ⊢k p →k q
{p →k q, r →k t, p ∧ r} ⊢k q (m.p)
{p →k q, r →k t, p ∧ r} ⊢k p ∧ r →k r (2.4)
{p →k q, r →k t, p ∧ r} ⊢k r (m.p)
{p →k q, r →k t, p ∧ r} ⊢k r →k t
{p →k q, r →k t, p ∧ r} ⊢k t (m.p)
{p →k q, r →k t, p ∧ r} ⊢k q →k (t →k (q ∧ t)) (2.24)
{p →k q, r →k t, p ∧ r} ⊢k t →k (q ∧ t) (m.p)
{p →k q, r →k t, p ∧ r} ⊢k q ∧ t (m.p)
applying the deduction theorem three times, we obtain that ⊢k (p →k q) →k ((r →k t) →k (p ∧ r →k q ∧ t)). the rest of the proof is straightforward. □

the θ-valued łukasiewicz-moisil algebras constitute the algebraic counterpart of the θ-valued moisil logic. the lindenbaum-tarski algebra of the θ-valued moisil propositional calculus is a θ-valued łukasiewicz-moisil algebra (see [2], p. 500, theorem 3.30). we shall recall the definition of θ-valued łukasiewicz-moisil algebras.

definition 3. a θ-valued łukasiewicz-moisil algebra (lmθ-algebra) is an algebra l = (l, ∧, ∨, {φi}i∈i, {φ̄i}i∈i, 0l, 1l) of type (2, 2, {1}i∈i, {1}i∈i, 0, 0) such that for all x, y ∈ l:
(2.29) (l, ∧, ∨, 0l, 1l) is a bounded distributive lattice,
(2.30) φi is a bounded distributive lattice endomorphism for all i ∈ i,
(2.31) φix ∧ φ̄ix = 0l and φix ∨ φ̄ix = 1l for all i ∈ i,
(2.32) φi ◦ φj = φj for all i, j ∈ i,
(2.33) if i ≤ j then φi ≤ φj for all i, j ∈ i,
(2.34) if φix = φiy for all i ∈ i, then x = y (this is known as moisil’s determination principle).

let l = (l, ∧, ∨, {φi}i∈i, {φ̄i}i∈i, 0l, 1l) be an lmθ-algebra. we say that l is complete if the lattice (l, ∧, ∨, 0l, 1l) is complete. l is completely chrysippian if for every family {xk}k∈k (xk ∈ l for all k ∈ k) such that ∧k∈k xk and ∨k∈k xk exist, the following properties hold: φi(∧k∈k xk) = ∧k∈k φixk and φi(∨k∈k xk) = ∨k∈k φixk (∀i ∈ i).

example 4. let b = (b, ∧, ∨, −, 0b, 1b) be a boolean algebra.
the set d(b) = b[i] = {f|f : i → b, i ≤ j ⇒ f(i) ≤ f(j)} of all increasing functions from i to b can be made into a lmθ-algebra d(b) = (d(b), ∧, ∨, {φi}i∈i, {φi}i∈i, 0d(b), 1d(b)) where 0d(b), 1d(b) : i → b are defined by 0d(b)(i) = 0b and 1d(b)(i) = 1b for every i ∈ i, the operations of the lattice (d(b), ∧, ∨, 0d(b), 1d(b)) are defined pointwise (cf. [2], p.6, example 1.10) and (φif)(j) = f(i), (φif)(j) = (f(i))− (∀j ∈ i) (∀i ∈ i). let l = (l, ∧, ∨, {φi}i∈i, {φi}i∈i, 0l, 1l) be an lmθ-algebra. for each j ∈ i we consider the binary operation →j on l defined by (2.35) a →j b = φ̄ja∨φjb = (φja∧φ̄jb)− for all a, b ∈ l. this implication is associated to ∧ (like for boolean algebras), but like for boolean algebras also, there exists the following implication: a ;j b = φ̄ja ∧ φjb, associated to ∨. the notion of morphism of lmθ-algebras is defined as usual ( [2]). of course, a morphism of lmθ-algebras preserves the operation →j. 3 tense θ-valued łukasiewicz-moisil algebras in this section we shall recall some definitions and basic results on tense θ-valued łukasiewiczmoisil algebras from [7]. definition 5. a tense lmθ-algebra is a triple at = (a, g, h), where a = (a, ∧, ∨, {φi}i∈i, {φi}i∈i, 0a, 1a) is an lmθ-algebra and g, h : a → a are two unary operations on a such that for all x, y ∈ a, (3.1) g(1a) = 1a, h(1a) = 1a, (3.2) g(x ∧ y) = g(x) ∧ g(y), h(x ∧ y) = h(x) ∧ h(y), (3.3) g ◦ φi = φi ◦ g, h ◦ φi = φi ◦ h, for any i ∈ i, (3.4) g(x) ∨ y = 1a iff x ∨ h(y) = 1a. definition 6. let (a, g, h) be a tense lmθ-algebra. for any i ∈ i, let us consider the unary operations pi, fi defined by pix = φihφix and fix = φigφix, for any x ∈ a. proposition 7. let a = (a, ∧, ∨, {φi}i∈i, {φi}i∈i, 0a, 1a) be an lmθ-algebra and g, h be two unary operations on a that satisfy conditions (3.1), (3.2) and (3.3). then, the condition (3.4) is equivalent with (3.4′) φi ≤ g ◦ pi and φi ≤ h ◦ fi for all i ∈ i. 
thus, if we replace in definition 5 the axiom (3.4) with the condition (3.4′), we obtain an equivalent definition of tense lmθ-algebra. proposition 8. let a = (a, ∧, ∨, {φi}i∈i, {φi}i∈i, 0a, 1a) be an lmθ-algebra and g, h be two unary operations on a that satisfy conditions (3.1) and (3.3). then, the condition (3.2) is equivalent to (3.2′) g(a →k b) ≤ g(a) →k g(b); h(a →k b) ≤ h(a) →k h(b) for all k ∈ i where→k is defined by (2.35). tense θ-valued moisil propositional logic 647 thus, if in definition 5 we replace the axiom (3.2) by (3.2’), we obtain an equivalent definition for tense lmθ-algebra. definition 9. a frame is a pair (x, r), where x is a nonempty set and r is a binary relation on x. let (x, r) be a frame and l = (l, ∧, ∨, {φi}i∈i, {φ̄i}i∈i, 0l, 1l) be a complete and completely chrysippian lmθ-algebra. lx has a canonical structure of lmθ-algebra. let’s us define for all p ∈ lx and x ∈ x: g∗(p)(x) = ∧ {p(y)|y ∈ x, xry}, h∗(p)(x) = ∧ {p(y)|y ∈ x, yrx}. proposition 10. for any frame (x, r), (lx, g∗, h∗) is a tense lmθ-algebra. let (b, g, h) be a tense boolean algebra. we define on d(b) the unary operations d(g) and d(h) by: d(g)(f) = g ◦ f, d(h)(f) = h ◦ f for all f ∈ d(b). lemma 11. if (b, g, h) is a tense boolean algebra then (d(b), d(g), d(h)) is a tense lmθalgebra. theorem 12. (the representation theorem for tense lmθ-algebras) for every tense lmθ-algebra (a, g, h) there exist a frame (x, r) and an injective morphism of tense lmθ-algebras α : a → (d(l2))x, where l2 = {0, 1}, the standard boolean algebra. 4 tense θ-valued moisil logic (the syntax) in this section we introduce the tense θ-valued moisil propositional calculus tmθ, a logical system obtained from the θ-valued propositional calculus (see [2]) by adding the two tense operators g and h. we define the notion of k-theorem and k-deduction then we establish some syntactical properties of tmθ. 
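before turning to the syntax of tmθ, the construction of proposition 10 can be checked directly on a small finite example. the sketch below is hypothetical (a three-world frame and the three-element chain standing in for a complete, completely chrysippian l); it verifies axioms (3.1), (3.2) and the adjunction (3.4) for g∗ and h∗:

```python
from itertools import product

# hypothetical finite frame (X, R); the three-element chain stands in for
# a complete, completely chrysippian L, and G*, H* are as in Proposition 10
X = [0, 1, 2]
R = {(0, 1), (1, 2), (0, 2)}
chain = [0.0, 0.5, 1.0]

def G(p):   # (G* p)(x) = inf{ p(y) : x R y }, with empty infimum = 1
    return {x: min([p[y] for y in X if (x, y) in R] or [1.0]) for x in X}

def H(p):   # (H* p)(x) = inf{ p(y) : y R x }
    return {x: min([p[y] for y in X if (y, x) in R] or [1.0]) for x in X}

def meet(p, q): return {x: min(p[x], q[x]) for x in X}
def join(p, q): return {x: max(p[x], q[x]) for x in X}
top = {x: 1.0 for x in X}

assert G(top) == top and H(top) == top                       # axiom (3.1)
for pv in product(chain, repeat=len(X)):
    for qv in product(chain, repeat=len(X)):
        p, q = dict(zip(X, pv)), dict(zip(X, qv))
        assert G(meet(p, q)) == meet(G(p), G(q))             # axiom (3.2)
        # adjunction (3.4): G(x) v y = 1  iff  x v H(y) = 1
        assert (join(G(p), q) == top) == (join(p, H(q)) == top)
print("(L^X, G*, H*) satisfies (3.1), (3.2) and (3.4) on this frame")
```

the loops are exhaustive over all 3^3 functions p, q : x → chain, so the three axioms are genuinely verified for this particular frame, not merely spot-checked.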
the alphabet of tmθ has the following primitive symbols: an infinite set v of propositional variables; the logical connectives ∨, ∧, φi, φi for all i ∈ i; the tense operators g and h and parantheses (, ). the set e of propositions of tmθ is defined by canonical induction. definition 13. we shall use the following abbreviations: for all α, β ∈ e and i ∈ i, we define α →i β = φiα ∨ φiβ; α ↔i β = (α →i β) ∧ (β →i α); fiα = φigφiα; piα = φihφiα. definition 14. we call a k-axiom of tense θ-valued moisil propositional calculus a proposition of one of the following forms: (4.1) the k-axioms of θ-valued moisil propositional calculus ((2.1)-(2.15) in section 2); (4.2) g(α →k β) →k (gα →k gβ); h(α →k β) →k (hα →k hβ); (4.3) gφiα ↔k φigα; hφiα ↔k φihα, for all i ∈ i; (4.4) φiα →k gpiα; φiα →k hfiα, for all i ∈ i. the notion of formal k-proof in tmθ is defined in terms of the above axioms and the following inference rules: α, α →k β β (modus ponens); α gα α hα (temporal generalizations) definition 15. we say that a proposition α is a k-theorem of tmθ if there exists a k-proof of it. we will denote by ⊢k α the fact that α is a k-theorem of tmθ. definition 16. let γ ⊆ e and α ∈ e. we say that α is a k-deduction from γ and write γ ⊢k α if there exist n ∈ n = {0, 1, 2, ...} and α1, ..., αn ∈ γ such that ⊢k n∧ i=1 αi →k α. 648 c. chiriţă we remark that the logical structure of tmθ (k-theorems and k-deduction) combines the logical stuctures of two logical systems: the θ-valued moisil logic and tense classical logic. further we shall prove some syntactical properties. lemma 17. let γ ⊆ e and α ∈ e. then γ ⊢k α iff there exist n ∈ n and α1, ..., αn ∈ γ such that ⊢k α1 →k (α2 →k ...(αn →k α)...). proof: by definition 16 and k-theorems (2.25) and (2.26). 2 lemma 18. let γ ⊆ e and α ∈ e. then γ ⊢k α iff there exists γ ′ ⊆ γ, γ ′ finite, such that γ ′ ⊢k α. proof: by definition 16 and lemma 17. 2 proposition 19. let γ, σ ⊆ e and α, β ∈ e. 
the following properties hold: (i) if ⊢k α then γ ⊢k α; (ii) if γ ⊆ σ and γ ⊢k α then σ ⊢k α; (iii) if α ∈ γ then γ ⊢k α; (iv) {α} ⊢k β iff ⊢k α →k β; (v) if γ ⊢k α and {α} ⊢k β then γ ⊢k β; (vi) if γ ⊢k α and γ ⊢k α →k β then γ ⊢k β; (vii) γ ⊢k α ∧ β iff γ ⊢k α and γ ⊢k β. proof: (i) using definition 16 for n = 0. (ii) by applying definition 16. (iii) using k-theorem (2.16) and definition 16. (iv) we assume that ⊢k α →k β. then, by definition 16, we obtain that {α} ⊢k β. conversely, if {α} ⊢k β then there exists n ∈ n such that ⊢k (α ∧ ... ∧ α︸ ︷︷ ︸ n ) →k β. by using k-axioms (2.4) and (2.5), we get that ⊢k (α ∧ ... ∧ α︸ ︷︷ ︸ n ) ↔k α, so ⊢k α →k β. (v) we suppose that γ ⊢k α and {α} ⊢k β. then there exist n ∈ n and α1, .., αn ∈ γ such that ⊢k n∧ i=1 αi →k α. using (iv), it follows that ⊢k α →k β and by applying k-theorem (2.27) and modus ponens, we obtain that ⊢k n∧ i=1 αi →k β, so γ ⊢k β. (vi) let γ ⊢k α and γ ⊢k α →k β. by applying lemma 18, there exist γ1, γ2 ⊆ γ such that γ1 ⊢k α and γ2 ⊢k α →k β. by (ii), it follows that γ1 ∪ γ2 ⊢k α and γ1 ∪ γ2 ⊢k α →k β. if we consider γ1 ∪ γ2 = {γ1, ..., γn}, we obtain that ⊢k n∧ i=1 γi →k α and ⊢k n∧ i=1 γi →k (α →k →k β). by applying k-axiom (2.2) and modus ponens, we get that ⊢k n∧ i=1 γi →k β, so γ ⊢k β. (vii) we assume that γ ⊢k α ∧ β. by using k-axioms (2.3) and (2.4) and applying (i) and (vi), we obtain that γ ⊢k α and γ ⊢k β. conversely, we assume that γ ⊢k α and γ ⊢k β. by using k-theorem (2.24) and (i), we obtain that γ ⊢k α →k (β →k α ∧ β). by applying twice (vi), we get γ ⊢k α ∧ β. 2 theorem 20. (the deduction theorem) let γ ⊆ e and α, β ∈ e. then γ ∪ {α} ⊢k β iff γ ⊢k α →k β. proof: we assume that γ ∪ {α} ⊢k β. then there exist n ∈ n and α1, ..., αn ∈ γ such that ⊢k ( n∧ i=1 αi ∧ α) →k β. by applying k-theorem (2.25) and modus ponens, it follows that tense θ-valued moisil propositional logic 649 ⊢k n∧ i=1 αi →k (α →k β). using definition 16, we obtain that γ ⊢k α →k β. 
conversely, we suppose that γ ⊢k α →k β. thus, by proposition 4.1 (ii), we get γ ∪ {α} ⊢k α →k β. also, by proposition 4.1 (iii), we have that γ ∪ {α} ⊢k α, hence by applying proposition 4.1 (vi), it results that γ ∪ {α} ⊢k β. 2 proposition 21. in tmθ, the following properties hold: (4.5) if ⊢k α ↔k β, then ⊢k gα ↔k gβ, (4.6) ⊢k g(α ∧ β) ↔k (gα ∧ gβ). proof: (4.5) by using k-axioms (2.3), (2.4), k-theorem (2.24) and modus ponens, we obtain that: ⊢k α ↔k β iff ⊢k α →k β and ⊢k β →k α. applying the temporal generalization rule g, we get that ⊢k g(α →k β) and ⊢k g(β →k α). then, by k-axiom (4.2) and modus ponens, it follows that ⊢k gα →k gβ and ⊢k gβ →k gα, hence ⊢k gα ↔k gβ. (4.6) we shall prove that ⊢k g(α ∧ β) →k (gα ∧ gβ) and ⊢k (gα ∧ gβ) →k g(α ∧ β). by applying proposition 21 (4.5) for k-axioms (2.3), (2.4), we obtain that ⊢k g(α ∧ β) →k gα and ⊢k g(α ∧ β) →k gβ. using k-axiom (2.5) and modus ponens, it results that ⊢k g(α ∧ β) →k (gα ∧ gβ). by k-teorem (2.24) and the temporal generalization rule g, we obtain that ⊢k g(α →k (β →k α∧β)). applying k-axiom (4.2), modus ponens and k-theorem (2.27), it follows that ⊢k gα →k (gβ →k g(α∧β)). using k-theorem (2.26) and modus ponens, we get that ⊢k (gα ∧ gβ) →k g(α ∧ β). thus ⊢k g(α ∧ β) ↔k (gα ∧ gβ). 2 we remark that there exists a similar proposition concerning h. 5 the k-lindenbaum-tarski algebra of tense θ-valued moisil logic in this section we shall prove that the k-lindenbaum-tarski algebra of tmθ is a tense θvalued łukasiewicz-moisil algebra. therefore, the tense θ-valued łukasiewicz-moisil algebras constitute the algebraic structures of tmθ and the properties of tense lmθ-algebras reflect the syntactical properties of tmθ. we consider the binary relation ∼k on the set of all propositions e, defined by: α ∼k β iff ⊢k φiα ↔k φiβ for all i ∈ i. lemma 22. ∼k is an equivalence relation on e. for any proposition α ∈ e, we denote by [α]k the equivalence class of α. 
we can define the following operations on the set e/∼k : [α]k∨[β]k = [α∨β]k; [α]k∧[β]k = [α∧β]k; φi[α]k = [φiα]k; φi[α]k = [φiα]k for all i ∈ i; g([α]k) = [gα]k; h([α]k) = [hα]k; 0k = [φkα]k, 1k = [φkα]k, where α is a k-theorem of tmθ. proposition 23. (e/∼k, ∧, ∨, {φi}i∈i, {φi}i∈i, 0k, 1k, g, h), the k-lindenbaum-tarski algebra of tmθ, is a tense lmθ-algebra. proof: by ( [2], p.500, theorem 3.30), we have that (e/∼k, ∧, ∨, {φi}i∈i, {φi}i∈i, 0k, 1k) is an lmθ-algebra. what is left to prove is that the operations g and h are well defined and the conditions (3.1)-(3.4) are satisfied. due to the symmetrical position of g and h we shall only include the proofs for g. let α, β ∈ e such that α ∼k β. thus, ⊢k φiα ↔k φiβ for all i ∈ i. applying proposition 21 (4.5), we obtain that ⊢k gφiα ↔k gφiβ for all i ∈ i. using k-axiom (4.3), it follows that ⊢k φigα ↔k φigβ for all i ∈ i, so gα ∼k gβ. 650 c. chiriţă (3.1) we have to prove that g([φkα]k) = [φkα]k i.e. by definition of ∼k that ⊢k φigφkα ↔k φiφkα for every α such that ⊢k α and for all i ∈ i. let α ∈ e such that ⊢k α and i ∈ i. by k-theorem (2.22), we obtain that ⊢k φkα and by applying the temporal generalization rule g, we obtain that ⊢k gφkα. using k-axiom (2.1) and modus ponens, it results that ⊢k φkα →k gφkα and ⊢k gφkα →k φkα. thus, we get that (i) ⊢k φkα ↔k gφkα. using k-axiom (2.11), we have that (ii) ⊢k φiφkα ↔k φkα and by using proposition 21(4.5), we obtain that (iii) ⊢k gφiφkα ↔k gφkα. using k-axiom (4.3) and the conditions (i),(ii), (iii), it results that ⊢k φigφkα ↔k φiφkα. (3.2) let α, β ∈ e. we must prove that g([α]k ∧ [β]k) = g([α]k) ∧ g([β]k) i.e. g(α ∧ β) ∼k gα ∧ gβ which is equivalent with ⊢k φig(α ∧ β) ↔k φi(gα ∧ gβ) for all i ∈ i. let i ∈ i. by using proposition 21(4.6) for α = φiα and β = φiβ, we obtain that (i) ⊢k g(φiα ∧ ∧φiβ) ↔k (gφiα ∧ gφiβ). by using k-axiom (2.9) and proposition 21(4.5), we get that (ii) ⊢k gφi(α ∧ β) ↔k g(φiα ∧ φiβ). 
by conditions (i) and (ii), we obtain that (a) ⊢k gφi(α∧β) ↔k (gφiα∧gφiβ). by k-axiom (4.3), we have: ⊢k gφiα ↔k φigα and ⊢k gφiβ ↔k φigβ. applying k-theorem (2.28), it follows that (b) ⊢k (gφiα ∧ gφiβ) ↔k↔k (φigα∧φigβ). by conditions (a), (b) and k-axiom (4.3), we obtain that ⊢k φig(α∧ ∧β) ↔k φi(gα ∧ gβ). (3.3) we have to prove that ⊢k φjgφiα ↔k φjφigα for all i, j ∈ i. let i, j ∈ i. by kaxiom (2.11), we obtain that (a) ⊢k φjφigα ↔k φigα. using k-axiom (4.3), we have that (b) ⊢k φjgφiα ↔k gφjφiα. by k-axioms (2.11) and proposition 21(4.5), it follows that (c) ⊢k gφjφiα ↔k gφiα. by (a), (b), (c) and k-axiom (4.3), we get that ⊢k φjgφiα ↔k φjφigα. (3.4) since by proposition 7, the condition (3.4) is equivalent with (3.4’), we shall prove that [φiα]k ≤ [gpiα]k for all i ∈ i, i.e. ⊢k φjφiα →k φjgpiα for all i, j ∈ i. let i, j ∈ i. by k-axiom (2.13), we have that ⊢k piα ↔k φjpiα. applying proposition 21 (4.5), it follows that ⊢k gpiα ↔k gφjpiα. using k-axiom (4.3), it results that (1) ⊢k gpiα ↔k φjgpiα. also, by k-axiom (2.11), we have that (2) ⊢k φiα ↔k φjφiα. by (1), (2) and k-axiom (4.4), we get that ⊢k φjφiα →k φjgpiα. 2 6 semantics and completeness theorem of tense θ-valued moisil logic this section concernes with the semantics of tmθ, which combines the properties of kripke semantics for t and the algebraic semantics for mθ. we establish a completeness theorem for tmθ by using the representation theorem of tense θ-valued łukasiewicz-moisil algebras [7]. definition 24. let (x, r) be a frame. a valuation of tmθ is a function v : e × x → l[i]2 such that for all α, β ∈ e and x ∈ x, the following equalities hold: v(α →k β, x) = v(α, x) →k v(β, x); v(α ∧ β, x) = v(α, x) ∧ v(β, x); v(α ∨ β, x) = v(α, x) ∨ v(β, x); v(φiα, x) = φiv(α, x) for any i ∈ i; v(φiα, x) = φiv(α, x) for any i ∈ i; v(gp, x) = ∧ {v(p, y)|xry}; v(hp, x) = ∧ {v(p, y)|yrx}. 
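the tense clauses of definition 24 can be exercised mechanically. the following sketch (hypothetical frame, with a two-element index set i = {0, 1} so that d(l2) has three elements) checks that every instance of k-axiom (4.2) receives value 1 at k under every valuation, matching the completeness argument of section 6:

```python
from itertools import product

# hypothetical frame; D lists the increasing functions I -> {0,1} for I={0,1},
# i.e. the elements of D(L2); a valuation picks one per world and variable
X = [0, 1, 2]
R = {(0, 1), (0, 2), (1, 2)}
D = [(0, 0), (0, 1), (1, 1)]
k = 1                                      # the fixed index k

def phi(i, f):  return tuple(f[i] for _ in f)
def nphi(i, f): return tuple(1 - f[i] for _ in f)
def meet(f, g): return tuple(min(a, b) for a, b in zip(f, g))
def join(f, g): return tuple(max(a, b) for a, b in zip(f, g))
def imp(f, g):  return join(nphi(k, f), phi(k, g))     # the connective ->_k

def vG(v):  # v(G p, x) = inf{ v(p, y) : x R y }, as in Definition 24
    out = {}
    for x in X:
        acc = (1, 1)                       # empty infimum is the top element
        for y in X:
            if (x, y) in R:
                acc = meet(acc, v[y])
        out[x] = acc
    return out

# axiom (4.2): G(p ->_k q) ->_k (G p ->_k G q) has value 1 at k at every world
for pv in product(D, repeat=len(X)):
    for qv in product(D, repeat=len(X)):
        p, q = dict(zip(X, pv)), dict(zip(X, qv))
        Gpq = vG({x: imp(p[x], q[x]) for x in X})
        Gp, Gq = vG(p), vG(q)
        assert all(imp(Gpq[x], imp(Gp[x], Gq[x]))[k] == 1 for x in X)
print("axiom (4.2) is a k-tautology on this frame for k =", k)
```

of course this only confirms the axiom on one frame; the soundness half of theorem 26 is what guarantees it for all frames and valuations.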
the first five conditions of the previous definition reflect "the many-valued past" of tmθ (see [2], p.487) and the last two conditions correspond to "the tense past" of tmθ (see [5], p.93). definition 25. we say that a proposition α is a k-tautology and we write |=k α if for every frame (x, r), for any valuation v : e × x → l[i] 2 and for all x ∈ x, we have v(α, x)(k) = 1. tense θ-valued moisil propositional logic 651 the following result establishes the equivalence between the k-theorems and the k-tautologies of tmθ. the proof of the main implication is based on the representation theorem for tense θvalued łukasiewicz-moisil algebras (theorem 12). theorem 26. (completeness theorem). for any proposition α of tmθ, we have: ⊢k α iff |=k α. proof: (⇒). we shall prove by induction on the definition of ⊢k α that for every frame (x, r) and for any valuation v : e × x → l[i] 2 , we have v(α, x)(k) = 1, for all x ∈ x. let (x, r) be a frame, v : e × x → l[i] 2 be a valuation and x ∈ x. • we suppose that α is a k-axiom. (a) let α be g(p →k q) →k (gp →k gq) with p, q ∈ e. it is known that a →k (b →k c) = = (a ∧ b) →k c ( [7], p.6, proposition 2.1 (l)). we have: v(α, x)(k) = = v(g(p →k q) →k (gp →k gq), x)(k) = [v(g(p →k q), x) →k (v(gp, x) →k v(gq, x))](k) = = [(v(g(p →k q), x) ∧ v(gp, x)) →k v(gq, x)](k) = [∧ xry ((v(p, y) →k v(q, y)) ∧ v(p, y)) →k →k ∧ xry v(q, y)](k) = [φk ∧ xry ((v(p, y) →k v(q, y)) ∧ v(p, y)) ∨ φk ∧ xry v(q, y)](k) = = [( ∧ xry (v(p, y) →k v(q, y))∧v(p, y))(k)]−∨(∧ xry v(q, y))(k) = [ ∧ xry ((v(p, y)(k))−∨v(q, y)(k))∧ ∧v(p, y)(k)]− ∨ ( ∧ xry v(q, y)(k)) = [ ∧ xry (v(q, y)(k) ∧ v(p, y)(k))]− ∨ ( ∧ xry v(q, y)(k)). since v(q, y)(k), v(p, y)(k) ∈ l2 and v(q, y)(k) ∧ v(p, y)(k) ≤ v(q, y)(k), we obtain that∧ xry (v(q, y)(k) ∧ v(p, y)(k)) ≤ ∧ xry v(q, y)(k). since in a boolean algebra we have a ≤ b iff ā ∨ b = 1, we get that [ ∧ xry (v(q, y)(k) ∧ v(p, y)(k))]− ∨ ( ∧ xry v(q, y)(k)) = 1. (b) let α be gφip ↔k φigp with p ∈ e and i ∈ i. 
then v(α, x)(k) = = v(gφip ↔k φigp, x)(k) = v((gφip →k φigp) ∧ (φigp →k gφip), x)(k) = = [(v(gφip, x) →k v(φigp, x)) ∧ (v(φigp, x) →k v(gφip, x))](k). since l[i]2 is complete and completely chrysippian, it follows that v(gφip, x) = ∧ xry φiv(p, y) = φi( ∧ xry v(p, y)) = = v(φigp, x). we know that a →k a = 1 ( [7], p.6, proposition 2.1 (f)), hence v(α, x)(k) = 1. (c) let α be φip →k gpip with i ∈ i. we have: v(α, x)(k) = v(φip →k gpip, x)(k) = = (v(φip, x) →k v(gpip, x))(k) = (φiv(p, x) →k ∧ xry v(pip, y))(k) = = (φiv(p, x) →k ∧ xry ∨ zry φiv(p, z))(k) = φk(φiv(p, x))(k) ∨ φk( ∧ xry ∨ zry φiv(p, z))(k) = = [v(p, x)(i)]− ∨ ∧ xry ∨ zry v(p, z)(i). let y ∈ x such that xry. then v(p, x)(i) ≤ ∨ zry v(p, z)(i), hence v(p, x)(i) ≤ ∧ xry ∨ zry v(p, z)(i). we obtain that [v(p, x)(i)]− ∨ ∧ xry ∨ zry v(p, z)(i) = 1. • we assume that α was obtained by applying the modus ponens rule. we have that v(β, x)(k) = 1 and v(β →k α, x)(k) = 1. but v(β →k α, x)(k) = (v(β, x) →k v(α, x))(k) = = (φkv(β, x) ∨ φkv(α, x))(k) = φk(v(β, x))(k) ∨ φk(v(α, x))(k) = [v(β, x)(k)] − ∨ v(α, x)(k). we deduce that v(α, x)(k) = 1. 652 c. chiriţă • we suppose that α = gβ such that ⊢k β. we have that v(β, x)(k) = 1, for every x ∈ x. then v(gβ, x)(k) = ( ∧ xry v(β, y))(k) = ∧ xry v(β, y)(k) = 1. (⇐). we shall prove that if ̸⊢k α then ̸|=k α. assume that ̸⊢k α, so [α]k ̸= 1k. by using proposition 23, we have that the k-lindenbaum-tarski algebra (e/∼k, g, h) of tmθ is a tense lmθ-algebra. applying the representation theorem for tense lmθ-algebras (theorem 12), there exist a frame (x, r) and an injective morphism of tense lmθ-algebras d : (e/∼k, g, h) →→ (d(l2)x, g∗, h∗). let us consider the function v : e × x → l[i]2 defined by v(α, x) = d([α]k)(x), for all α ∈ e and x ∈ x. it is straightforward to prove that v is a valuation. since d is injective and [α]k ̸= 1k, we obtain that d([α]k) ̸= 1d(l2)x, hence there exists a ∈ x such that v(α, a) = d([α]k)(a) ̸= 1d(l2). thus α is not a k-tautology. 
□

7 concluding remarks

the tense θ-valued moisil propositional calculus tmθ can be viewed as a common generalization of the θ-valued moisil propositional logic mθ and the classical tense logic t. tmθ combines the logical structures of these logical systems and its semantics is inspired by the semantics of t and mθ. the main result of this paper is a completeness theorem for tmθ. its proof is derived from the representation theorem of tense θ-valued łukasiewicz-moisil algebras [7]. an open problem is to obtain a proof of the representation theorem for tense θ-valued łukasiewicz-moisil algebras by using theorem 26. the next step in the study of the tense aspects of moisil logic is to define the tense θ-valued predicate logic (the syntax and the semantics) and the algebras corresponding to this logic (polyadic tense θ-valued łukasiewicz-moisil algebras). we hope to prove a completeness theorem for the tense θ-valued moisil predicate logic and a representation theorem for the corresponding algebras. the tense logics corresponding to the lmθ-algebras with negations [16] will be the subject of another paper.

bibliography
[1] v. boicescu, sur les systèmes déductifs dans la logique θ-valente, publ. dép. math. lyon, 8, 123-133, 1971.
[2] v. boicescu, a. filipoiu, g. georgescu and s. rudeanu, łukasiewicz-moisil algebras, north-holland, 1991.
[3] v. boicescu, contributions to the study of łukasiewicz-moisil algebras (romanian), ph.d. thesis, university of bucharest, 1984.
[4] v. boicescu, sur une logique polyvalente, rev. roumaine sci. soc., sér. philos. et logique, 17, 393-405, 1973.
[5] j. p. burgess, basic tense logic, in: dov gabbay and f. guenthner, eds., handbook of philosophical logic, chapter ii.2, reidel, 89-134, 1984.
[6] r. cignoli, i.m.l. d’ottaviano and d. mundici, algebraic foundations of many-valued reasoning, kluwer, 2000.
[7] c. chiriţă, tense θ-valued łukasiewicz-moisil algebras, to appear in journal of multiple-valued logic and soft computing.
[8] d. diaconescu, g. georgescu, tense operators on mv-algebras and łukasiewicz-moisil algebras, fundamenta informaticae, xx, 1-30, 2007.
[9] i. dziţac, l. andrei, 65 years from birth of prof. gheorghe s. nadiu (1941-1998), international journal of computers, communications & control, vol. i, no. 3, pp. 93-98, 2006.
[10] a. filipoiu, θ-valued łukasiewicz-moisil algebras and logics (romanian), ph.d. thesis, university of bucharest, 1981.
[11] g. georgescu, a. iorgulescu, s. rudeanu, grigore c. moisil (1906-1973) and his school in algebraic logic, int. journal of computers, communications & control, vol. i, no. 1, pp. 81-99, 2006.
[12] g. georgescu, a. iorgulescu, s. rudeanu, some romanian researches in algebra of logic, in: grigore c. moisil and his followers, editura academiei romane, 86-120, 2007.
[13] g. georgescu, a. iorgulescu, i. leuştean, monadic and closure mv-algebras, multiple-valued logic, 3, 235-257, 1998.
[14] r. goldblatt, logics of time and computation, csli lecture notes no. 7, 1992.
[15] p. hájek, metamathematics of fuzzy logic, kluwer acad. publ., dordrecht, 1998.
[16] a. iorgulescu, 1 + θ-valued łukasiewicz-moisil algebras with negation (romanian), ph.d. thesis, university of bucharest, 1984.
[17] j. łukasiewicz, on three-valued logic, ruch filozoficzny (polish), 5, 60-171, 1920.
[18] gr. c. moisil, recherches sur les logiques non-chrysippiennes, ann. sci. univ. jassy, 26, 431-466, 1940.
[19] gr. c. moisil, notes sur les logiques non-chrysippiennes, ann. sci. univ. jassy, 27, 86-98, 1941.
[20] gr. c. moisil, logique modale, disquis. math. phys., 2, 1942.
[21] gr. c. moisil, łukasiewiczian algebras, computing center, university of bucharest (preprint), 311-324, 1968.
[22] gr. c. moisil, essais sur les logiques non-chrysippiennes, ed. academiei, bucharest, 1972.
[23] gh. s. nadiu, cercetări asupra logicilor necryssipiene (research on non-chrysippian logics), ph.d. thesis (supervisor: grigore c. moisil), 1972.
[24] h. rasiowa, an algebraic approach to non-classical logics, north-holland publ., amsterdam, polish scientific publ., warszawa, 1974.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. vi (2011), no. 1 (march), pp. 150-157

intelligent management of the cryptographic keys

gabriela moise, otilia cangea
petroleum-gas university of ploiesti
romania, 100680 ploiesti, 39 bvd. bucuresti
e-mail: {gmoise,ocangea}@upg-ploiesti.ro

abstract: with the continuous development of computer networks, new problems have been posed in the process of keys management in cryptographic systems. the main element in the cryptographic technologies is the keys management, as the cryptographic algorithms are known, while the keys have to be either secret (for unauthorized users that do not need them), or public (for users that need them). with an efficient cryptographic keys management system and the existing encryption techniques, a proper security system may be implemented in the informational systems of organizations. the process of cryptographic keys management consists of the following operations: keys generation, distribution, update, revocation, storage, backup/recovery, import and export, usage control, expiration, and destruction. the cryptographic keys management techniques depend on the type of the keys, i.e. symmetric or public. nowadays, the efforts of the researchers in cryptographic keys management are focused on the standardization and interoperability of keys management. in this paper, the authors analyze the existing keys management systems and the standards available for keys management techniques, emphasizing the advantages and disadvantages of different systems. they also propose a cryptographic keys management model based on the ideas and principles of the interrap architecture (a conceptual model developed by jörg müller for intelligent agents).
also, there are incorporated some intelligent techniques to manage emergency situations, such as the loss of keys or their improper usage.
keywords: cryptographic key management, intelligent agents, key management model.

1 introduction

key management is the core of a cryptographic system. the processes related to key management consist of the generation, distribution, update, revocation, storage, backup/recovery, import and export, usage control, expiration, and destruction of the cryptographic keys. practically, the security of the information is assured by keeping the private cryptographic keys secret. key management consists of a set of protocols that enable to establish and maintain the keying relationships between the entities of a network [6]. the concept of "keying relationship" is defined in [6] as the state wherein parties of the cryptosystems share keying material. according to the type of cryptographic algorithm used in a cryptosystem, there are two situations: key management used in a symmetric cryptosystem and key management used in an asymmetric cryptosystem. in the former case, the sender and the receiver share the same secret key, or two keys between which the transformation is computationally feasible; in the latter case, two transformations are involved: one to generate the public key and the other to generate the private key [6], [11]. the techniques used to distribute confidential keys are: key layering, key translation center, and symmetric key certificate techniques. key layering comprises the following techniques: master keys, key-encrypting keys and data keys. a key translation center (ktc) consists of a trusted server, which allows two entities to establish a secure communication using long-term keys.
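the key layering idea can be sketched in a few lines. the sketch below is purely illustrative: the `wrap` function (xor with a sha-256-derived keystream) is a toy stand-in for a real key-wrapping cipher, not a secure construction, and the three-layer hierarchy (master key, key-encrypting key, data key) is our own minimal example:

```python
import hashlib
import os

# toy "wrap" standing in for a real key-wrapping cipher: XOR with a
# SHA-256-derived keystream -- an illustration of key layering only,
# NOT a secure construction
def wrap(wrapping_key: bytes, key: bytes) -> bytes:
    stream = hashlib.sha256(wrapping_key).digest()[:len(key)]
    return bytes(a ^ b for a, b in zip(key, stream))

unwrap = wrap  # XOR with the same keystream is its own inverse

# key layering: master key -> key-encrypting key (KEK) -> data key
master_key = os.urandom(32)
kek = os.urandom(32)
data_key = os.urandom(32)

stored_kek = wrap(master_key, kek)        # KEK protected by the master key
stored_data_key = wrap(kek, data_key)     # data key protected by the KEK

# recovery walks the hierarchy top-down: master key unlocks the KEK,
# which in turn unlocks the data key
recovered_kek = unwrap(master_key, stored_kek)
assert unwrap(recovered_kek, stored_data_key) == data_key
print("data key recovered through the key layering hierarchy")
```

the point of the layering is that only the master key needs long-term protection; wrapped keys can sit in ordinary storage and be re-wrapped when a layer is rotated.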
techniques used to distribute public keys are: point-to-point delivery over a trusted channel, direct access to a trusted public file (public-key registry), use of an on-line trusted server, use of an off-line server and certificates, and use of systems implicitly guaranteeing the authenticity of public parameters [6]. the advantages of keys management in the public-key situation are: the use of a simple key management, no requirement for an on-line trusted server, and the enhanced functionality of the system. in this paper, the problem of cryptographic key management in large distributed systems is studied, more specifically, the problems of keys distribution and generation. the main concepts used in this paper are the security domain and the keys graph. the concept of security domain is defined in [7] as "a collection of systems (servers, devices, and so on) that share a common set of keys and are attached to an administered network". in this paper the concept is used in the sense of a collection of entities (to allow an abstract interpretation) which share a private key. the concept of key graph was introduced in [10] as an arrangement of the keys into a hierarchy, where a key server manages all keys. a particular keys hierarchy is the keys tree, which enables the definition of a key management scheme. in this paper an intelligent key management model suited to the structure of the network is proposed. the new ideas introduced in this paper are: combining the behaviour-based agent architecture with the key distribution problem, and distribution based on the crt (chinese remainder theorem [13]).
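to show how the crt can support key distribution, here is a minimal sketch of a crt-based group-key broadcast. everything in it is hypothetical (our own illustrative scheme, not necessarily the construction of [13]): each member holds a secret pairwise-coprime modulus and a secret pad, and the server broadcasts a single integer from which each member recovers the group key:

```python
from functools import reduce
import secrets

# CRT-based broadcast of a group key (illustrative sketch): member i holds a
# secret pairwise-coprime modulus m_i and a secret pad s_i; the server
# broadcasts one integer X with X = (K xor s_i) mod m_i for every member i
def crt(residues, moduli):
    """solve x = r_i (mod m_i) for pairwise-coprime moduli."""
    M = reduce(lambda a, b: a * b, moduli)
    x = 0
    for r, m in zip(residues, moduli):
        Mi = M // m
        x += r * Mi * pow(Mi, -1, m)   # Mi * Mi^{-1} = 1 (mod m)
    return x % M

members = {                            # hypothetical per-member secrets:
    "sd1": (2**61 - 1, 0x1234ABCD),    # distinct Mersenne primes, so the
    "sd2": (2**89 - 1, 0x0F0F0F0F),    # moduli are pairwise coprime and
    "sd3": (2**107 - 1, 0xDEADBEEF),   # larger than any 32-bit key
}
group_key = secrets.randbits(32)

moduli = [m for m, _ in members.values()]
residues = [(group_key ^ s) % m for m, s in members.values()]
X = crt(residues, moduli)              # the single broadcast value

# every member recovers the same group key from X and its own secrets
for m, s in members.values():
    assert (X % m) ^ s == group_key
print("group key distributed with one CRT broadcast to", len(members), "domains")
```

revoking a member then amounts to recomputing X without that member's congruence, so the revoked modulus no longer yields the new key.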
The paper is structured as follows:
• Formalism of the key management problem, which studies the existing key management systems and introduces the concept of the security domain graph;
• Backgrounds of the intelligent key management model, referring to the Chinese Remainder Theorem as an important computational tool used to build the key management model;
• Intelligent key management, which proposes a cryptographic key management model that may be used in an SDG-type architecture, based on the principles of the InteRRaP architecture, introducing intelligent agents responsible for key management in the cryptographic system;
• Conclusions, which emphasize the importance of key management and the advantages offered by the proposed model.

2 Formalism of the key management problem

To formalize the problem of key management in a computer communication system, we define the security domain and a partial order between security domains. A security domain is a set of entities (users, data, hardware devices, etc.) that share the same secret key. Thus, an equivalence relation between two entities e1, e2 can be established as follows: e1 ≡ e2 if e1 and e2 share the same secret key. This equivalence relation partitions the entities into equivalence classes, called security domains. Let us consider n security domains SD = {SD1, SD2, ..., SDn}; the set SD contains all the entities of the computer communication system. On the set of security domains a partial order can be defined, using the operator ≺. The relation SDj ≺ SDi means that the entities of the security domain SDi have a security clearance higher than or equal to that of the entities of the security domain SDj; for example, the entities of SDi can decrypt messages received from entities that belong to SDj.
We also say that the security domain SDi dominates the security domain SDj. In this way one obtains a partially ordered set (SD, ≺), or poset for short. Messages (data, plain texts) from the security domain SDi are encrypted with the cryptographic key SKi, and data from the security domain SDj are encrypted with the cryptographic key SKj. If SDj ≺ SDi, the entities of the security domain SDi have the right to decrypt ciphertexts encrypted with SKj; in contrast, the entities of the security domain SDj cannot decrypt messages received from the entities of SDi. Also, if the relations SDk ≺ SDj and SDj ≺ SDi hold between three security domains, then the entities of SDj can decrypt ciphertexts received from the entities of SDk, and the entities of SDi can decrypt ciphertexts received from the entities of SDj; consequently, the entities of SDi can decrypt ciphertexts received directly from the entities of SDk. In this manner, domain hierarchies are generated. A poset can be represented by a Hasse diagram; such a diagram is called a security class privilege graph (SCPG) in [1], while in this paper the term security domain graph (SDG) is used. An example of an SDG is shown in Figure 1 (a: a security domain tree (SDT); b: a general security domain graph (GSDG)).

Figure 1: Security domain graph

The representations in Figure 2 have a similar meaning.

Figure 2: Graphical representation of a partial order between two security domains

The simplest key management model assumes the existence of a key server: if an entity needs to decrypt a message, it has to request the proper key over a secure channel, or each security domain has to store the secret keys of all its successors.
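As an illustration of the dominance relation, here is a minimal sketch (the Python representation of the SDG as a mapping from each domain to the domains it directly dominates is our own assumption, not part of the paper):

```python
def dominates(children, sd_i, sd_j):
    """True if sd_i dominates sd_j (sd_j < sd_i), using the transitivity
    argued in the text: follow edges from sd_i down to its successors.
    `children` maps a domain to the set of domains directly below it.
    A domain dominates itself ('higher or equal' clearance)."""
    if sd_i == sd_j:
        return True
    stack, seen = [sd_i], set()
    while stack:
        d = stack.pop()
        if d == sd_j:
            return True
        for c in children.get(d, ()):
            if c not in seen:
                seen.add(c)
                stack.append(c)
    return False
```

With the graph of Figure 1 in mind, a domain with two parents (e.g. SD3 below SD1 and SD2) is dominated by both of them, while siblings do not dominate each other.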
These models are not really feasible, because the growing dimensions of networks require large storage spaces and many secure channels. Defining a key management model properly adjusted to the security domain graph is therefore quite a challenge.

3 Backgrounds of the intelligent key management model

Many cryptographic key management architectures have been proposed. To implement the Data Encryption Standard, IBM proposed a key management scheme for DES in the 1970s. That architecture consists of cryptographic systems connected via a communications network; each cryptographic system has a cryptographic facility, a cryptographic key data set, a cryptographic facility access program, and application programs. A solution based on control vectors is proposed in [5]; the scheme uses control vectors that facilitate the implementation of the key owner's management policy and rules. This technique enables key distribution in different environments: peer-to-peer distribution, key distribution centers, and key translation centers. A list of key management architectures can be found at [8]. Akl and Taylor proposed the first cryptographic key assignment scheme to solve access control problems in hierarchies (the AT scheme). According to the AT scheme [1], each security class (a security class contains data and users with the same rights) has an associated secret key and a public parameter. For a relation SCj ≺ SCi, SCi uses the public parameters tj, ti and its secret key ki to derive the secret key kj. The secret key ki is computed as ki = k0^ti (mod m), where k0 is the secret key of the central authority and m is the product of two large secret prime numbers. ti is a public parameter with the following property: SCj ≺ SCi if tj is a multiple of ti.
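The AT-scheme derivation follows directly from ki = k0^ti (mod m): since tj is a multiple of ti, kj = k0^tj = ki^(tj/ti) (mod m). A toy sketch (the tiny numbers in the test are illustrative only; in practice m is a product of two large secret primes):

```python
def at_key(k0, t, m):
    """Secret key of a class with public parameter t: k_t = k0^t mod m."""
    return pow(k0, t, m)

def derive_child_key(ki, ti, tj, m):
    """A class holding k_i (public parameter t_i) derives k_j for a dominated
    class whose parameter t_j is a multiple of t_i:
    k_j = k_i^(t_j / t_i) mod m."""
    assert tj % ti == 0, "t_j must be a multiple of t_i"
    return pow(ki, tj // ti, m)
```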
The public parameters are generated as ti = ∏ pk over the classes SCk with SCi ⊁ SCk, where the pk are distinct prime numbers associated with the security classes. The major inconvenience of the method is that the values of ti grow and eventually become impractical (Figure 3).

Figure 3: The diagram of generating the t parameter in the AT scheme

Another problem arises when the arrangement of the security domains follows a general SDG, that is, when a security domain has more than one parent. In order to build the key management model, the authors studied the Chinese Remainder Theorem, an ancient but important algorithm of modular arithmetic. It makes it possible to solve, in considerable generality, simultaneous congruences with respect to different moduli. Here is the statement of the problem that the Chinese Remainder Theorem solves.

Theorem 1 (Chinese Remainder Theorem [12]). Let m1, m2, ..., mk ∈ Z with gcd(mi, mj) = 1 for all i ≠ j, i, j = 1, ..., k. Let m be the product m = m1 × m2 × ... × mk, and let a1, a2, ..., ak ∈ Z. Consider the system:

x ≡ a1 (mod m1)
x ≡ a2 (mod m2)
...
x ≡ ak (mod mk)

Then there exists exactly one solution x ∈ Zm of the system. The solution may be obtained with the following algorithm:

Step 1. For i = 1 to k, calculate zi = m1 × m2 × ... × mi−1 × mi+1 × ... × mk = m / mi.
Step 2. For i = 1 to k, calculate yi = zi^−1 (mod mi).
Step 3. Calculate x = a1 × y1 × z1 + ... + ak × yk × zk (mod m), and return x.

4 Intelligent key management

This paper proposes a cryptographic key management model that may be used in an SDG-type architecture. Each entity of the security domains has an associated intelligent agent (SDKMA) responsible for cryptographic key management in the system (Figure 4), where SDi = Ei1 ∪ Ei2 ∪ Ei3 ..., each Eij uses Ki, and each SDi is organized according to the schema shown in Figure 5.

Figure 4: Security domain graph and intelligent key management agents
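The three-step algorithm of Theorem 1 can be sketched directly (a minimal Python transcription; `pow(z, -1, m)` computes the modular inverse of Step 2 and requires Python 3.8+):

```python
def crt(residues, moduli):
    """Solve x = a_i (mod m_i) for pairwise-coprime moduli, following
    the three-step algorithm of Theorem 1; returns the unique x in Z_m."""
    m = 1
    for mi in moduli:
        m *= mi
    x = 0
    for ai, mi in zip(residues, moduli):
        zi = m // mi            # step 1: product of the other moduli
        yi = pow(zi, -1, mi)    # step 2: inverse of z_i modulo m_i
        x += ai * yi * zi       # step 3: accumulate the terms
    return x % m
```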
Figure 5: The entity relations in a security domain

Each SDKMA is structured according to the InteRRaP intelligent agent architecture defined by Müller [9]. InteRRaP is a layered BDI (belief-desire-intention) model with three layers: the behaviour-based layer, the local planning layer, and the cooperative planning layer. The behaviour-based layer contains the reactivity and procedural knowledge used in routine tasks, the local planning layer provides the reasoning needed to carry out local tasks and to produce goal-oriented behaviour, and the cooperative planning layer enables and facilitates collaborative work with other agents. The structure of the SDKMA is presented in Figure 6: the behaviour-based layer acts in emergency situations (key renewal, entity deletion, entity addition, key destruction); the local planning layer manages the cryptographic keys within the security domain; the cooperative planning layer manages the cryptographic keys between security domains.

Figure 6: The structure of the SDKMA

The world KB contains the procedures and functions used in emergency situations (structure of the network and of the security domains, arrivals and departures of entities), the planning KB contains the procedures and functions responsible for key management within the security domain, and the social KB contains the procedures and functions used for key management between security domains. These knowledge bases are self-updating; therefore, the key management model has the property of flexibility: modifications in the structure of the network cause corresponding modifications in the key management model. Key management within a security domain can be realized according to a key server schema: for each security domain an entity is selected at random, and its SDKMA plays the role of key server. The server stores the key and the associated list of the entities of the security domain.
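As a rough illustration only (the event names and the dispatch rule below are our own assumptions, not taken from the paper), the three-layer split of an SDKMA might be sketched as:

```python
class SDKMA:
    """Toy security-domain key management agent with the three InteRRaP-style
    layers described in the text; each handled event is routed to one layer."""
    EMERGENCIES = {"renew_keys", "delete_entity", "add_entity", "destroy_keys"}

    def __init__(self):
        self.log = []

    def handle(self, event, scope="domain"):
        if event in self.EMERGENCIES:
            layer = "behaviour"      # behaviour-based layer: reactive emergencies
        elif scope == "inter-domain":
            layer = "cooperative"    # cooperative planning: keys between domains
        else:
            layer = "local"          # local planning: keys within one domain
        self.log.append((layer, event))
        return self.log[-1]
```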
Entities request the key from the server; upon authentication, the server sends the keying material if the entity is authorized. Key management between security domains can be realized according to the AT scheme if the node corresponding to the security domain has a single parent; the arrangement into security domains offers the advantage that the values of the parameters t stay reasonably small, and solutions to optimize the AT scheme so as to compute smaller values of the parameters t have been proposed [4]. If the relations between security domains follow the scheme presented in Figure 7, one has the following situation: a node has more than one parent and more than two ancestors.

Figure 7: Security domain SD3 with two parents and two ancestors

If an entity of SD3 broadcasts a message M, then the entities of SD1 and SD2 can decrypt the ciphertext C, so these entities have to know a single decryption key. The asymmetric key generator either has to send each security domain a private key, or each SDKMA can compute the private key (in its cooperative planning layer) using the Chinese Remainder Theorem. For the two domains SD1 and SD2, large secret prime numbers n1 and n2, respectively, are selected. The secret key is the solution (in Zn1×n2) of the system:

x ≡ a1 (mod n1)
x ≡ a2 (mod n2),

where a1, a2 are public parameters. To calculate the solution of the system, the Lagrange method can be used:

Step 1. Find the cofactors u12, u21, with u12 × n1 + u21 × n2 = 1, using the extended Euclidean algorithm.
Step 2. Calculate l1 = u21 × n2 and l2 = u12 × n1.
Step 3. Calculate the solution x = a1 × l1 + a2 × l2 (mod n1 × n2).

If one entity of SD4 and one entity of SD5 broadcast two encrypted messages C4 and C5 at the same time, the SDKMA (cooperative planning layer) of each entity of SD3 manages a waiting queue and decrypts the messages using randomized priorities.
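The three Lagrange-method steps above can be transcribed directly; a minimal sketch (the extended-Euclid helper is ours, and the key l1 ≡ 1 (mod n1), l1 ≡ 0 (mod n2) property follows from u12·n1 + u21·n2 = 1):

```python
def lagrange_key(a1, a2, n1, n2):
    """Two-congruence CRT via the Lagrange-style method in the text:
    returns x in Z_{n1*n2} with x = a1 (mod n1) and x = a2 (mod n2)."""
    def ext_gcd(a, b):
        # returns (g, x, y) with a*x + b*y == g == gcd(a, b)
        if b == 0:
            return a, 1, 0
        g, x, y = ext_gcd(b, a % b)
        return g, y, x - (a // b) * y

    g, u12, u21 = ext_gcd(n1, n2)       # step 1: u12*n1 + u21*n2 == 1
    assert g == 1, "moduli must be coprime"
    l1 = u21 * n2                        # step 2: l1 = 1 mod n1, 0 mod n2
    l2 = u12 * n1                        #         l2 = 0 mod n1, 1 mod n2
    return (a1 * l1 + a2 * l2) % (n1 * n2)   # step 3: combine
```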
The advantages offered by the proposed model are greater security, because the intelligent agents can change at any time the parameters used in key generation, and greater efficiency, through the use of a security-domain-based structure of the network. The dynamism of the network can be easily accommodated, because the knowledge bases of each SDKMA layer are updated with current information at any moment. Also, the key-generation procedures and functions may contain different algorithms, and the SDKMA can change the key distribution algorithms at any time according to the dimensions of the network or the required security level.

5 Conclusions

Starting from an analysis of the existing key management systems and of the standards available for key management techniques, this paper proposes an intelligent cryptographic key management model between security domains, of SDKMA type, based on the ideas and principles of the InteRRaP architecture, emphasizing its advantages regarding security, efficiency, and feasibility. As a future direction, one may consider designing a fuzzy model of the cryptographic key management system, but only if that approach brings important improvements to the procedures and functions used to generate the keys. Nevertheless, the most secure option is to hold the keys in secure hardware and perform all processing there [3], a goal that is impossible to achieve in large-scale networks.

Bibliography

[1] Akl, S.G., Taylor, P.D., Cryptographic solution to a problem of access control in a hierarchy, ACM Transactions on Computer Systems, 1(3), 1983.
[2] Hassen, R.H., Bouabdallah, A., Bettahar, H., Challal, Y., Key management for content access control in a hierarchy, Computer Networks, 51:3197-3219, 2007.
[3] Lin, J.C., Huang, K.H., Lai, L., Lee, H.C., Secure and efficient group key management with shared key derivation, Computer Standards and Interfaces, 31, 2009.
[4] MacKinnon, S.
, Taylor, P., Meijer, H., Akl, S., An optimal algorithm for assigning cryptographic keys to control access in a hierarchy, IEEE Transactions on Computers, C-34(9), 1985.
[5] Matyas, S.M., Le, A.V., Abraham, D.G., A key-management scheme based on control vectors, IBM Systems Journal, 30(2), 1991.
[6] Menezes, A., van Oorschot, P., Vanstone, S., Handbook of Applied Cryptography, CRC Press, 1996.
[7] Michener, J.R., Acar, T., Security domains: key management in large-scale systems, IEEE Software, 2000.
[8] Savard, J.J.G., A Cryptographic Compendium, http://www.quadibloc.com/crypto/jscrypt.htm, accessed on December 5th, 2009.
[9] Müller, J.P., The Design of Intelligent Agents: A Layered Approach, Lecture Notes in Computer Science / Lecture Notes in Artificial Intelligence, 1177, Springer-Verlag, 1996.
[10] Wong, C.K., Gouda, M., Lam, S., Secure group communications using key graphs, Proceedings of ACM SIGCOMM'98, 1998.
[11] Key management in cryptography, http://www.netlab.tkk.fi/opetus/s38153/k2003/lectures/g33keymgmt.ppt, accessed on December 10, 2009.
[12] Chinese Remainder Theorem, http://www.math.tamu.edu/~jon.pitts/courses/2005c/470/supplements/chinese.pdf, accessed on December 10, 2009.
[13] Zhou, J., Ou, O.H., Key tree and Chinese Remainder Theorem based group key distribution scheme, Proceedings of the 9th International Conference on Algorithms and Architectures for Parallel Processing, ISBN 978-3-642-03094-9, 2009.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 3, pp. 288-298

Optimization of Queries with Conjunction of Predicates

Nicoleta Livia Tudor

Abstract: A way to optimize access to the objects of a relational database is through the optimization of queries.
This article presents an approach to the cost model used in the optimization of select-project-join (SPJ) queries with a conjunction of predicates and proposes a join optimization algorithm named System RO-H (System Rank-Ordering Heuristic). The System RO-H algorithm is a System R dynamic programming algorithm that extends optimal linear join subplans using a rank-ordering heuristic method: predicates are chosen in ascending order according to the H-metric, where the H-metric depends on the selectivity and the cost per tuple of the predicate, using an expression with heuristic constants. The algorithm finds an optimal plan in the space of linear left-deep join trees. It saves not a single plan but multiple optimal plans for every subset of relations, one for each distinct order, termed an interesting order. In order to build an optimal execution plan for a set S of i relations, the optimal plan for each subset of S consisting of i−1 relations is extended, using the lemma based on the H-metric for predicates. Optimal plans for subsets are stored and reused, and the optimization algorithm chooses a plan of least cost from the execution space.
Keywords: optimal join subplans, cost function, query tree, join optimization algorithm, H-metric, heuristic method

1 Introduction

This article proposes a select-project-join query optimization algorithm based on a heuristic function and suggests an approach to the cost model used in the optimization of queries with a conjunction of predicates.
The paper is organized as follows:
• the section Prior work presents the improved alternatives of join optimization algorithms;
• the section System rank-ordering heuristic algorithm presents an improved System R dynamic programming algorithm based on the use of a heuristic function, together with the mathematical formulation of the algorithm;
• the section Performance evaluation presents the performance results and implementation details.

Copyright © 2006-2007 by CCC Publications

2 Prior work

Many improved alternatives are known for optimizing queries with user-defined predicates.

System R dynamic programming algorithm. [1] describes the System R dynamic programming algorithm, which finds an optimal plan in the space of linear (left-deep) join trees [2]. The algorithm proceeds by building optimal execution plans for increasingly larger subsets of the set of all relations in the join. In order to build an optimal plan for a set S of i+1 relations, the optimal plan for each subset of S consisting of i relations is extended, and the cheapest of the extended plans is chosen. The System R algorithm saves not a single plan but multiple optimal plans for every subset S, one for each distinct order, termed an interesting order. The enumeration complexity of the algorithm is O(n·2^(n−1)) for relations R1, R2, ..., Rn. The optimization algorithm for the space of bushy join trees is similar to the System R algorithm, except that both inputs of a join operator can be intermediate results; the number of optimal subplans that must be stored for a join of n tables is 2^n times the number of interesting orders, and the complexity is O(3^n). The LDL algorithm was used in the LDL project at MCC [3] and subsequently in the Papyrus project at HP Laboratories [4].
The LDL algorithm treats expensive predicates and relations alike and may produce plans significantly worse than those produced by the traditional optimization algorithm, in which all selections are evaluated as early as possible. The LDL algorithm cannot consider all plans in the space of unconstrained linear execution trees; Hellerstein [5] shows that it fails to consider plans that evaluate expensive predicates on the operands of a join prior to taking the join. In order to optimize a query consisting of a join of n relations and k expensive predicates, the dynamic programming algorithm needs to construct 2^(n+k) optimal subplans. The predicate migration algorithm improves on the LDL approach: given a linear join tree, it chooses a way of interleaving the joins and the user-defined predicates, and it is integrated with a System R style optimizer [5]. The algorithm places the user-defined predicates in their optimal positions relative to the join nodes. This approach has serious drawbacks that limit its applicability [6]: it cannot guarantee an optimal plan, because it uses a heuristic to estimate the ranks of the join predicates that influence the choice of the plan. The naive optimization algorithm for the space of unconstrained linear join trees behaves exactly like the System R algorithm; however, the total number of stored plans per distinct set of relations increases to 2^k, and the number of plans that need to be stored increases to 2^(k+n), where k is the number of user-defined predicates. The complexity of this algorithm is exponential in both the number of user-defined predicates and the number of relations in the query. Optimization algorithms with complete rank-ordering [7] exploit the ability to order the execution of predicates (called ranks or rank-order) that are applied prior to the application of any other operators.
When the join methods are regular, the algorithm restricts the sequence in which the user-defined predicates may be applied and reduces the enumeration complexity from exponential to polynomial in the number of user-defined predicates. The optimization algorithm with pruning [7] compares and prunes plans that have different tags. The "UDP-pushdown" rule provides a sufficient condition for a predicate to be pushed down; it can be used to pin the selections as soon as they become evaluable and helps avoid constructing plans in which the predicates are pulled up. The "UDP-pullover" rule makes it possible to avoid generating alternative plans that push down user-defined predicates and are suboptimal. The conservative local heuristic algorithm can choose among plans that result from applying a sequence of UDP-pushdown and UDP-pullover rules. The two plans picked by the conservative local heuristic complement each other, and the heuristic can guard against choosing a plan that results from greedily pushing down a predicate, as the pull-rank algorithm does. Thus, the conservative local heuristic can find optimal plans that pull-rank and other global heuristics fail to find because of their greedy approach, while incurring only low computational overhead; as a lemma in [7] states, the conservative local heuristic algorithm produces an optimal plan in several important special cases. In the next section, we present a different improved System R dynamic programming algorithm for optimizing select-project-join queries with a conjunction of predicates, based on a heuristic method; we implement the optimization algorithm by extending a System R style optimizer.

3 System rank-ordering heuristic algorithm

In this section, we discuss the approach proposed for optimizing select-project-join (SPJ) queries with a conjunction of predicates, using the System RO-H (System Rank-Ordering Heuristic) algorithm, based on a heuristic method.
This algorithm extends optimal linear join subplans by choosing predicates in ascending order according to the H-metric (heuristic metric). The H-metric takes the minimum between the rank of the predicate and the ratio between selectivity minus one and the cost per tuple. To define the heuristic method for extending optimal linear join subplans, the following notions shall be considered.

3.1 Regular join methods

Let us consider a class of SPJ queries on relations R1, R2, ..., Rn, n ∈ N*, and an implementation of the join operator with a conjunction of k predicates p1, ..., pk, k ∈ N*.

Definition 1. A join method is called regular if the cost f(r1, r2) of joining two relations of sizes r1 and r2 depends on the sizes of the relations as follows:

f(r1, r2) = a + b·r1 + c·r2 + d·r1·r2,

where the constants a, b, c, d are independent of the sizes of the relations r1 and r2 [7].

If the join operators satisfy the regularity assumption, we can restrict enumeration to execution trees in which all predicates are ordered by the rank-order heuristic.

3.2 Tags for plan representation

The following definition states formally how a tag is associated with a join tree [7]:

Definition 2. Let T be an unconstrained linear join tree that consists of a join among a set R of relations and the evaluation of a set U ⊆ S of user-defined predicates, where S is the set of all user-defined predicates in the query that can be evaluated over the subexpression of the query consisting of the join among the relations in R. Then the tag associated with the tree T is the ordered set of predicates S − U, sorted by rank order.

Figure 1 illustrates the execution plans that need to be considered when there are three relations R1, R2, R3 and two selection predicates p1, p2 on R1. T and T' are possible plans for R1 ⋈ R2 ⋈ R3 (each with a different tag); the tags of T and T' are <> and <p1>, respectively.
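Definition 1 can be written down directly; a minimal sketch (the constant values in the test are illustrative only, not taken from the paper — e.g. a nested-loop-like method would have its cost dominated by the d·r1·r2 term):

```python
def regular_join_cost(r1, r2, a, b, c, d):
    """Cost of a regular join method per Definition 1:
    f(r1, r2) = a + b*r1 + c*r2 + d*r1*r2,
    where a, b, c, d do not depend on the relation sizes r1, r2."""
    return a + b * r1 + c * r2 + d * r1 * r2
```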
Figure 1: The execution plans

3.3 Predicate order in optimal plans

Let us consider a class of SPJ queries on relations R1, R2, ..., Rn and an implementation of the join operator with a conjunction of k predicates p1, ..., pk. The problem of join ordering, addressed in [8], [9], [10] and [11], uses the notion of a rank. The rank of a predicate pi (i = 1, ..., k) is rank(pi) = costpertuple(pi) / (1 − selectivity(pi)). Chaudhuri and Shim refer to such execution trees as rank-ordered [7], as defined below:

Definition 3. The user-defined predicates in an unconstrained execution tree T are rank-ordered if, for any two user-defined predicates p and p' in T such that rank(p) < rank(p'), either p precedes p' in the tree T, or p is not evaluable in the tree T' obtained by exchanging the positions of p and p' in T.

Hellerstein et al. [12] consider expensive predicates, i.e., predicates for which the computation needed to evaluate whether the predicate is true or false dominates the overall cost [12]. In that context, it is shown that predicates should be ranked in ascending order according to the metric (selectivity − 1) / costpertuple. Hellerstein et al. assume a processor that is perfect in its branch prediction: it predicts that the branch to the next iteration of the query will be taken when the selectivity ≤ 0.5, and not taken when the selectivity > 0.5. In the System Rank-Ordering Heuristic algorithm, we refer to execution trees as rank-ordered according to the H-metric, based on a heuristic method.

Definition 4. We call the H-metric (heuristic metric) of a predicate pi, i = 1, ..., k, having selectivity si, the pair

( si, (c1·si + c2·(1 − si)) / costpertuple(pi) ),

where c1 and c2 are heuristic constants defined as follows: c1 = 0, c2 = −1 if 0.5 < si ≤ 1, and c1 = −1, c2 = 0 if 0 ≤ si ≤ 0.5.

Observations:

1.
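Definition 4 translates into a few lines of code; a minimal sketch:

```python
def h_metric(selectivity, cost_per_tuple):
    """H-metric of a predicate as the pair (selectivity, heuristic score),
    per Definition 4: c1 = 0, c2 = -1 when selectivity > 0.5,
    and c1 = -1, c2 = 0 when selectivity <= 0.5."""
    s = selectivity
    if s > 0.5:
        c1, c2 = 0.0, -1.0
    else:
        c1, c2 = -1.0, 0.0
    score = (c1 * s + c2 * (1.0 - s)) / cost_per_tuple
    return (s, score)
```

Sorting predicates ascending by this pair realizes the "ascending order according to the H-metric" used throughout the algorithm.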
The utility of the H-metric is that we avoid generating a large number of intermediate-quality plans that improve on the currently computed best cost without being optimal.
2. The heuristic method is not guaranteed to find the optimal solution, but we will demonstrate that it finds good solutions.

Lemma 5. Let T be an unconstrained linear join tree that consists of a join among a set of n relations, and let si and sj be the selectivities of pi and pj, respectively. The plan of the execution tree T cannot be optimal if si ≥ sj and

(c1·si + c2·(1 − si)) / costpertuple(pi) > (c1·sj + c2·(1 − sj)) / costpertuple(pj),

where c1 and c2 are heuristic constants defined as follows: c1 = 0, c2 = −1 if 0.5 < si ≤ 1, and c1 = −1, c2 = 0 if 0 ≤ si ≤ 0.5.

Proof: Let T be an unconstrained linear join tree that consists of a join among a set R of n relations and two predicates p1, p2 with s1 ≥ s2, let τ be a subexpression of the execution tree T, and let us refer to this subexpression by τ(r), where the parameter r refers to the input relation of τ. Then:

τ(r) = τ(σp1(R1)).

We can relate the execution trees in Figure 2, and the following correspondence holds:

T = σp2(τ).

The following parameters shall be defined:
• cost(p1) = cost of predicate p1 per tuple
• |R1| = number of tuples in relation R1
• cost(τ) = cost of execution tree τ
• size(τ) = size of the output of execution tree τ
• si = selectivity of the predicate pi
• τ0 = τ(R1)

Figure 2: τ, a subexpression of the execution tree T

We now estimate the cost of the execution tree T as follows:

cost(T) = cost(p1)·|R1| + cost(τ) + cost(p2)·size(τ),

where size(τ) = size(τ0)·s1, so

cost(T) = cost(p1)·|R1| + cost(τ) + cost(p2)·size(τ0)·s1.

The cost of an expression τ(R1) can be represented as the sum of the following three components:

1.
Cost of evaluating the predicates in the expression τ(R1): the sum of all such costs is ∑i costi·sizei(R1), where costi is the cost of applying the i-th predicate per tuple and sizei(R1) is the size of the relation preceding the i-th predicate application.
2. Cost of evaluating the join nodes in the expression τ(R1) that are ancestors of R1: the sum of all such costs is ∑j cost(join)j(R1).
3. Cost of evaluating all other operators, which are not affected by the input relation R1: we denote this cost by cost0.

Then the cost of the expression τ(R1) can be computed as follows:

cost(τ(R1)) = ∑i costi·sizei(R1) + ∑j cost(join)j(R1) + cost0

cost(T) = cost(p1)·|R1| + ∑i costi·sizei(R1) + ∑j cost(join)j(R1) + cost0 + cost(p2)·size(τ0)·s1

If s1 ≥ s2 and cost(p1) = cost(p2), then

(c1·s1 + c2·(1 − s1)) / costpertuple(p1) > (c1·s2 + c2·(1 − s2)) / costpertuple(p2).

If we exchange p1 with p2, letting T' be the join tree that consists of the join among the set R of n relations and the two predicates p1, p2 with s1 ≥ s2, then

cost(T') = cost(p2)·|R1| + ∑i costi·sizei(R1) + ∑j cost(join)j(R1) + cost0 + cost(p1)·size(τ0)·s2

and cost(T) > cost(T'), so the plan of the execution tree T cannot be optimal. □

A corollary of this lemma is that whenever two predicates appear as consecutive conjuncts anywhere in an optimal plan, the one with the lower selectivity must appear first if both have the same H-metric. Lemma 5 is used in the System Rank-Ordering Heuristic algorithm below.

3.4 System rank-ordering heuristic algorithm

The System Rank-Ordering Heuristic algorithm finds an optimal plan in the space of linear (left-deep) join trees. The cost function assigns a real number to any given plan in the execution space and satisfies the principle of optimality [13]: an optimal plan for a set of relations must be an extension of an optimal plan for some subset of that set. The optimization algorithm chooses a plan of least cost from the execution space.

Definition 6.
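The swap argument in the proof can be checked numerically; a minimal sketch with illustrative numbers (equal per-tuple predicate costs and s1 > s2, as in the lemma's hypothesis — the specific values are our own):

```python
def tree_cost(cp_first, cp_second, s_first, n_tuples, tau_cost, tau0_size):
    """cost(T) as estimated in the proof of Lemma 5: the first predicate is
    applied to all |R1| tuples, then the subexpression tau is evaluated,
    and the second predicate sees size(tau0) * s_first tuples."""
    return cp_first * n_tuples + tau_cost + cp_second * tau0_size * s_first
```

With s1 = 0.9 and s2 = 0.6 and equal per-tuple costs, evaluating p1 first leaves more tuples for the second predicate, so cost(T) > cost(T') and T cannot be optimal, as the proof concludes.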
The System Rank-Ordering Heuristic algorithm for optimizing SPJ queries with a conjunction of predicates is a System R dynamic programming algorithm that extends optimal linear join subplans using a rank-ordering heuristic method as follows:
• a predicate is chosen in ascending order according to the H-metric;
• the H-metric depends on the selectivity and the cost per tuple of the predicate, using an expression with heuristic constants.

The System Rank-Ordering Heuristic algorithm saves not a single plan but multiple optimal plans for every subset, one for each distinct order, termed an interesting order [14]. In order to build an optimal execution plan for a set S of i relations, the optimal plan for each subset of S consisting of i−1 relations is extended, using Lemma 5 based on the H-metric for the k predicates. Optimal plans for subsets are stored in the optplan() array and reused. The System Rank-Ordering Heuristic algorithm is presented as follows:

procedure system rank-ordering heuristic
    for i = 2 to n do
        for all s from {r1, ..., rn} with |s| = i
            bestplan = a plan with infinite cost
            for all rj, sj, where s = union({rj}, sj), intersect({rj}, sj) = null
                for all p from optplan(sj, t) with all different tags t
                    nr1 = |p|; nr2 = |rj|
                    r1 = array of the predicates evaluable on p
                    r2 = array of the predicates evaluable on rj
                    euristicorder(r1); euristicorder(r2)
                        // ascending ordering of the predicates on p and rj
                        // according to the h-metric
                    for i = 0 to nr1
                        for j = 0 to nr2
                            p' = extendjoinplan(p, rj, r1[i], r2[j])
                            if cost(p') < cost(bestplan[tag(p')])
                                bestplan[tag(p')] = p'
                            endif
                        repeat
                    repeat
                repeat
            repeat
            optplan(s) = bestplan
        repeat
    repeat
    finalplan = a plan with infinite cost
    for all plans p from optplan({r1, ..., rn})
        if complete_cost(p) < cost(finalplan)
            finalplan = completed plan of p
        endif
    repeat
    return(finalplan)
end

function extendjoinplan(p, rj, r1[i], r2[j])
    let r1[i] be a predicate applied to the plan p for (sj, t)
    r2[j] = a predicate for
the relation rj p’ = the join plan between sj, t and rj, for the subset of predicates (r1[i], r2[j]) extend optplan(sj, t) with the plan p’ return procedure euristicorder(r1) for i = 0 to nr1 1 for j = i + 1 to nr1 // si = selectivity for r1[i] // cost (pi) = cost of the predicate r1[i] if r1[i] > r1[j] and (c1*si + c2*(1-si)) / cost(pi) > (c1*sj + c2*(1-sj)) / cost(pj) change r1[i] with r1[j] endif repeat repeat return observation: arrays r1 and r2 are useful for ascending ordering of the predicates according to the h-metric. when the join methods are regular, the system rank-ordering heuristic algorithm enables us to further restrict the sequence in which the predicates may be applied and reduces the complexity of enumeration in the number of predicates. 4 performance evaluation in this section we present the results of performance evaluations on our implementation. optimization of queries with conjunction of predicates 295 4.1 experimental framework we used an experimental framework similar to that in [15] and [16] and [7]. we performed experiments using an amd athlon(tm)xp 1600+ machine with 256 mb of ram and running windows xp professional version 2002. the algorithms were run on queries consisting of equijoins. relation cardinalities ranged from a hundred to a thousand tuples, and the numbers of unique values in join columns varied from 25o/o to 100o/o of the corresponding relation cardinality. the selectivities of predicates were randomly chosen from 10−4 to 1.0 and the cost per tuple of predicates was represented by the number of i/o accesses and selected randomly from 1 to 1000. we considered nested-loop, merge-scan, and simple and hybrid hash joins as join methods [17]. in our experiments, only the cost for number of i/o accesses was accounted for. for our experiments, we generated 3 join (join among four relations) queries, 5 join queries, and 7 join queries. we performed two sets of experiments. 
in the first set, we varied the number of selection predicates that apply to one relation. in the second set, we varied the distribution of the selection predicates among multiple relations in the query, i.e., we kept the number of selection predicates fixed, but varied how these predicates are distributed among the relations in a query.

4.2 candidate algorithms

for each query instance, we ran the following optimization algorithms:
• the system r dynamic programming algorithm: the system r style optimization algorithm that evaluates all predicates as early as possible;
• the optimization algorithm with complete rank-ordering: it compares plans that have the same tag over the same set of relations;
• the opt-rank-conservative algorithm: this algorithm uses a conservative local heuristic with complete rank-ordering;
• the system rank-ordering heuristic algorithm, which extends optimal linear join subplans using a rank-ordering heuristic method.

4.3 effect of the number of predicates

in this set of experiments, the number of predicates was varied from 1 to 5 and the number of joins was varied from 3 to 7 (7 joins for 8 relations). the results presented for each data point represent an average over 100 queries. these queries were generated by randomly choosing one relation on which all the predicates apply and then randomly picking the costs and selectivities of the predicates as well. table 1 shows the average number of enumerated plans for the algorithms: system r dynamic programming, optimization with complete rank-ordering, opt-rank-conservative and the system rank-ordering heuristic algorithm. figures 3, 4 and 5 show a comparison of the performances (average number of enumerated plans) of the four algorithms.
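the rank-ordering heuristic shared by the last two candidate algorithms sorts the evaluable predicates in ascending order of the h-metric, which combines selectivity and cost per tuple through the heuristic constants c1 and c2. a minimal python sketch of this ordering step is given below; the predicate names, selectivities, costs and the concrete values of c1 and c2 are all illustrative assumptions, not values taken from the paper:

```python
# hypothetical predicates: (name, selectivity, cost_per_tuple)
predicates = [
    ("p1", 0.9, 10.0),
    ("p2", 0.1, 2.0),
    ("p3", 0.5, 5.0),
]

# heuristic constants of the h-metric; these concrete values are assumptions
C1, C2 = 1.0, 0.5

def h_metric(pred):
    _, s, cost = pred
    # h-metric expression from the paper: (c1*s + c2*(1-s)) / cost per tuple
    return (C1 * s + C2 * (1 - s)) / cost

# euristicorder: ascending ordering of the predicates by the h-metric
ordered = sorted(predicates, key=h_metric)
```

with these illustrative numbers, p1 gets the smallest h-metric (high cost per tuple dominates) and is scheduled first, which is the ordering decision the heuristic makes cheaply instead of enumerating every predicate permutation.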
the results obtained for queries with 3, 5 or 7 joins show a similar trend:
• the number of enumerations needed by the system r dynamic programming algorithm is independent of the number of predicates;
• the optimization algorithm with complete rank-ordering generated more plans than the opt-rank-conservative algorithm and the system rank-ordering heuristic algorithm;
• the average number of enumerated plans for the system rank-ordering heuristic algorithm grows approximately linearly;
• the gap in the number of enumerated plans between the complete rank-ordering algorithm and the system rank-ordering heuristic algorithm increases significantly as the number of predicates grows.

[table 1: worst-case estimates for enumerated plans (3-join query); figure 3: 3-join query; figure 4: 5-join query; figure 5: performance on a varying number of predicates]

5 conclusion

this article presents an approach to the cost model used in the optimization of select-project-join (spj) queries with a conjunction of predicates and proposes a join optimization algorithm named system ro-h (system rank-ordering heuristic). the system ro-h algorithm is a system r dynamic programming algorithm that extends optimal linear join subplans using a rank-ordering heuristic method. the comparison of the performances of the algorithms shows that our proposed system rank-ordering heuristic techniques are extremely effective and are guaranteed to generate optimal plans.

bibliography

[1] s. ganguly, w. hasan, r. krishnamurthy, query optimization for parallel execution, in proceedings of the acm sigmod international conference on management of data, acm press, new york, pp. 9-18, 1992.
[2] p. g. selinger, m. m. astrahan, r. a. lorie, t. g. price, access path selection in a relational database management system, in proceedings of the acm sigmod international conference on management of data, boston, ma, acm press, new york, pp. 23-34, 1979.
[3] d. chimenti, r. gamboa, r. krishnamurthy, towards an open architecture for ldl, in proceedings of the 15th international conference on very large data bases (vldb), netherlands, morgan kaufmann publishers inc., san francisco, ca, pp. 195-203, 1989.
[4] s. chaudhuri, k. shim, query optimization in the presence of foreign functions, in proceedings of the 19th international conference on very large data bases, ireland, morgan kaufmann publishers inc., san francisco, ca, pp. 526-541, 1993.
[5] j. m. hellerstein, practical predicate placement, in proceedings of the acm sigmod international conference on management of data, minneapolis, acm press, new york, pp. 325-335, 1994.
[6] j. m. hellerstein, m. stonebraker, predicate migration: optimizing queries with expensive predicates, in proceedings of the acm sigmod international conference on management of data, washington, acm press, new york, pp. 267-276, 1993.
[7] s. chaudhuri, k. shim, optimization of queries with user-defined predicates, acm transactions on database systems, vol. 24, no. 2, pp. 177-228, 1999.
[8] j. m. hellerstein, m. stonebraker, predicate migration: optimizing queries with expensive predicates, in proceedings of the acm sigmod international conference on management of data, washington, dc, acm press, new york, ny, pp. 267-276, 1993.
[9] r. krishnamurthy, h. boral, c. zaniolo, optimization of nonrecursive queries, in proceedings of the 12th international conference on very large data bases, kyoto, japan, vldb endowment, berkeley, ca, pp. 128-137, 1986.
[10] c. l. monma, j. sidney, sequencing with series-parallel precedence constraints, math. oper. res. 4, pp. 215-224, 1979.
[11] k. y. whang, r. krishnamurthy, query optimization in a memory-resident domain relational calculus database system, acm trans. database syst. 15, pp. 67-95, 1990.
[12] j. m. hellerstein, m. stonebraker, predicate migration: optimizing queries with expensive predicates, in proceedings of the acm sigmod conference, 1993.
[13] t. h. cormen, c. e. leiserson, r. l. rivest, introduction to algorithms, mit press, cambridge, ma, 1990.
[14] p. g. selinger, m. m. astrahan, r. a. lorie, t. g. price, access path selection in a relational database management system, in proceedings of the acm sigmod international conference on management of data, boston, ma, acm press, new york, pp. 23-34, 1979.
[15] y. e. ioannidis, y. c. kang, randomized algorithms for optimizing large join queries, in proceedings of the 1990 acm sigmod international conference on management of data, acm press, new york, ny, pp. 312-321, 1990.
[16] s. chaudhuri, k. shim, including group-by in query optimization, in proceedings of the 20th international conference on very large data bases, vldb endowment, berkeley, ca, 1994.
[17] l. d. shapiro, join processing in database systems with large main memories, acm trans. database syst. 11, pp. 239-264, 1986.

tudor nicoleta liviana
petroleum-gas university of ploiesti
department of computer science
39 bucuresti avenue, 100680, ploiesti, romania
e-mail: tudorl@upg-ploiesti.ro

received: january 25, 2007
revised: june 13, 2007

nicoleta liviana tudor (born on july 15, 1968) graduated from the faculty of mathematics of the university of bucharest in 1992. she has 15 years of teaching experience in the field of computer science, both at the petroleum-gas university of ploiesti and at other educational institutions in romania. since 2006 she has been a lecturer in the computer science department at the petroleum-gas university of ploiesti, romania. she is a phd student at the petroleum-gas university of ploiesti, control engineering and computers department. her main research fields are middle-tier business objects, xml web services, data processing and relational databases.
she has authored 2 books and more than 14 research papers in the area of relational databases and data structures, published in international journals and in the proceedings of prestigious international conferences.

international journal of computers, communications & control vol. ii (2007), no. 3, pp. 209-216

60 years from the birth of academician f.g. filip

ioan dziţac, mişu-jan manolescu, horea oros, emma văleanu

“to whom it may concern ... dr. filip has shown himself to be a very innovative and productive researcher whose papers are equivalent in quality and cutting-edge findings to those from the best research organizations of the world. witness his publications in automatica, computers in industry and other leading journals in the field. the establishment of dr. filip’s stature as an internationally recognized researcher in his field and the acceptance of his work are shown by the large number of international conferences where he has served as session chairman and/or program committee member and the many invited seminars he has presented in other countries...” (1)

(1) prof. theodore j. williams, professor of engineering and director, purdue lab. for applied industrial control, purdue university; former president of the instrument society of america; former president of the american federation of information processing societies; chairman of the ifac/ifip task force on “architecture for enterprise integration” (august 1994)

copyright © 2006-2007 by ccc publications

1 introduction

born in bucharest on the 25th of july 1947, academician florin gheorghe filip, editor-in-chief of our journal (ijccc), turns 60; this is why the operative/executive editorial staff of ijccc wants to dedicate this short biographical sketch to him, in order to pay him homage.
his life has been lived under the sign of creation in science, of generosity in his relationships with his co-workers, of innovative courage in the coordination of the projects he has worked on, of stimulating energy in the communities he has worked in for longer periods or just collaborated with occasionally, and under the sign of modesty in his relations with the people with whom he has been in contact. most of the people who know him through their work acknowledge his good and stimulating effect on their work or career. to show him respect, many similar journals have dedicated special issues to him in this period. not willing to compete with other journals, we dedicate to him only this short biographical sketch; however, in no. 4/2007 of ijccc we are going to publish some scientific articles whose authors expressed the wish to dedicate them to academician filip on his 60th birthday. we present in this paper a short digest of the biography and work of f.g. filip.

2 opera omnia

prof. f. g. filip is the author/coauthor of 6 monographs published by ed. tehnica (http://www.edituratehnica.ro/) and the editor/coordinator of 9 contribution volumes published by ed. tehnica, the romanian academy publishing house (www.ear.ro), ed. expert and elsevier. he has also published more than 200 scientific papers in several international journals (computers in industry, ifac j. automatica, ifac j. control engineering practice, large scale systems, j. of human systems management, systems analysis, modeling and simulation, studies in informatics and control etc.) and in contribution volumes printed by the publishing houses pergamon press, north holland, chapman and hall, springer, kluwer, ed. tehnica and ed. academiei române. more than 50 articles by f.g. filip are registered in the thomson isi web of knowledge database and 22 in the scopus and elsevier databases.

3 academic qualifications, scientific titles and professional positions

f.g.
filip graduated from the faculty of automatics, university politehnica of bucharest, obtaining the engineer diploma in 1970, and received his ph.d. (1982) in automatics (electrical processes automation field). later on, he participated in several professional training and development courses: “complex informatics systems, tele-processing systems” (sweden, at the universities of goeteborg, uppsala, malmoe and stockholm, 1974); “research and technology management” (germany, fraunhofer iitb karlsruhe, 1995); “decisional systems” (germany, fraunhofer first, berlin, 1998) etc. f.g. filip has been a corresponding member of the romanian academy since 1991 and a full member since 1999. since 2000, acad. filip has been vice-president of the romanian academy. the last professional titles obtained by f.g. filip are first degree researcher (1990) and university professor (1998). the jobs and positions he has held are the following: engineer (1970-1976); scientific researcher (1976-1979); main researcher iii (1979-1990); head of research laboratory (1985-1991); main researcher i (1990); director, through competition, at ici (1991-1997); scientific director responsible with international cooperation (1997-2001); president of the scientific committee (1995-2003); vice-president of the romanian academy (2000-now).

4 awards, diplomas and distinctions

f.g. filip received the following awards, diplomas and distinctions:
1. the “grigore moisil” award and the diploma “man of the year 1999” granted by inforec (the association of economical informatics).
2. doctor honoris causa of "lucian blaga" university, sibiu (2000).
3. the “copy ro for informatics” award for the year 2003 for the monograph “computer aided decision: base methods and techniques” (ed. tehnica, bucharest, 2002, in romanian).
4. honorary professor of “dunarea de jos” university, galati (from 2003).
5.
excellency diploma “the most prominent romanian personality of the decade in it&c” granted by aries (the romanian association for the electronic and software industry).
6. honorary member of the romanian technical sciences academy astr (www.astr.ro) (from 2006).
7. honorary member of the moldavian academy of science (from 2006).
8. decorated with the national order “loyal service” in the rank of “great cross” (granted by the romanian president on 30.11.2000).
9. honorary citizen of campeni city (2002).

5 research activities

research projects coordination regarding:
• the elaboration of dimensioning methods for equipment configurations;
• participation in projects regarding informatics systems in industry;
• research regarding the design of discrete automata systems;
research projects coordination regarding:
• hierarchical leadership methods and hierarchical computing methods;
• real-time hierarchical informatics systems;
applicative projects coordination regarding:
• implementation and design of real-time informatics systems and decision support systems (dss);
• contribution to the design of the dispecer ssd (exported in 1996);
research leadership in conceiving dss with combined knowledge (numerical models and artificial intelligence elements);
coordination of ici teams participating in projects/networks of excellence (“network of excellence”, noe) or international working groups (wg):
• for the ec-peco program: esatt+ (“european science and technology transfer network+”);
• for the inco program: a) noe ametmas (cp 96-26), b) inids (“information dissemination in european rtd”, ic-1030);
• for the esprit program: orbit (“object-oriented decision assistance for continuous operations scheduling and coordination”, 24487), noe icims (9251), sie (simulation europe) wg (code: 8467);
general coordination of the research activities at ici.
coordination of the accomplishment of the rnc (national computer network for research) project, the first romanian network connected to the internet (1993).
coordination of the projects for introducing informatics in cultural institutions (libraries, museums).
founding (in 1989) and coordination of the international journal “studies in informatics and control” (sic) (http://www.ici.ro/ici/revista/sic.html) (registered since 1995 in the ieee inspec database, indexed in isi web of knowledge, the first romanian journal to have an on-line version, from 1992).
founding (in 2006) and coordination of the “international journal of computers, communications and control” (ijccc) (http://www.journal.univagora.ro).
participation in some ici international projects: a) interbit (“interbalkan net of technology”, ist 1999-14022), b) mobitech (“european sme challenge in mobile telecom technologies”, ips-1999-50125), c) idealist 5fp, d) prolearn (as “associated partner”).
coordination of the following romanian academy sections:
• technical sciences
• geological sciences
• sociological, juridical and economical sciences
• informatics science and technology
managerial coordination of the romanian academy library (bar) and supervision of the construction activities at the romanian academy library.
coordination of two fundamental projects of the romanian academy:
• informational society - knowledge society (2001-2002), http://www.academiaromana.ro/pro_pri/pag_com01socinf_prpri.htm
• inter-disciplinary program for preventing major risk phenomena at a national scale (2004-2005), http://www.icmpp.ro/institute/p_fundam_ar.htm
coordination of four international projects of the romanian academy:
• forsociety “laying the foundations for an era-net on foresight and society” (contract era-sct-2003-003231)
• rointera “romanian research community integration in the european research area” (contract inco-ct-2003-510469) (www.rointera.ro)
• romob “romanian mobility center” (contract fp6 513461)
• forsociety “transnational foresight era net” (http://195.251.117.130/forsociety/partners/index.html)
coordinator of projects in the ec-peco, esprit, copernicus and ist programs of the eu; responsible for the romanian teams from ici in 10 projects/networks of excellence/workgroups (1993-2000; see above) and in 2 projects of the romanian academy in fp6 (2003-): a) rointera (inco-ct-2003-510469), b) forsociety (eranet 011832).
evaluator: 1992 (in the first team with invited experts from central and eastern europe), 1996, 1999, 2004, 2005.
member in:
• the program committee of the ist (“information society technologies”) research program of the eu (1999-), as named national representative;
• the consultative committee of the european commission in the priority domain ist of fp6 (istag, information society technologies advisory group) (invited directly by the european commission) (2003-2004);
nato science for peace:
• associate member of the sub-committee on computer networking (2000-2001) (selected by nato);
cdi national program:
• member of the consultative board for research, development and innovation (1992-) and general secretary (1998-); member of the 5th committee “informatica” of the consultative board (from 1991), president of the 5th committee “informatica” (1993-1994) and vice-president (from 1995);
romanian academy grants (gar):
• vice-president of the gar committee (1995-2007);
romanian academy priority and fundamental programs:
• coordinator of the project “informational society - knowledge based society”, http://www.academiaromana.ro/pro_pri/pag_com01socinf_prpri.htm (2001-2002);
• leader of the program “major risk phenomena and processes at national scale”, http://www.icmpp.ro/institute/contributii_stiintifice.htm (2004-2005).

6 didactical activities

doctorate guidance at:
• upb (university politehnica of bucharest), faculty of automatics, specialization “automatics” (from 1993), 8 finalized doctorates, of which one was made in
association with the franche-comté university, france);
• the romanian academy, specialization “computer sciences” (from 2002).
professor of “applied informatics” at “valahia” university, targoviste (1998-2001); courses:
• “artificial intelligence”
• “computer aided decision”
• “computer aided enterprise engineering” (master level)
visiting professor at:
• university politehnica of bucharest, faculty of mechanics (mechatronics department); course: “expert systems” (1998)
• hyperion university of bucharest (1999-2003); courses: a) "computer aided decisions" and b) "artificial intelligence"
• agora university (2006-now): “decision support systems”
master courses at:
• “lucian blaga” university, sibiu; course: “industrial management” (1998)
• upb-cpru; courses: "enterprise re-engineering" (1998) and "computer aided decision" (2005, 2006)
• university of bucharest, faculty of letters; course: “computer aided decision” (2004)
• ase bucharest; the modules “computer aided decision” of the course “project management” (2000)
• ecole centrale de lille (france, 2006)

7 member of the editorial staff of scientific journals/conferences

member of international program committees (ipc) of more than 50 international conferences and congresses in europe, the usa, south america, asia and africa. member of the editorial boards of the following journals:
1. sams (systems analysis, modeling and simulation) (taylor and francis) (1993-2004)
2. international j. of critical infrastructures (interscience publishers: www.interscience.com) (2004)
3. computer journal of moldova, chisinau (1993); information technologies and control (issn 1312-2622), sofia (from 1998)
4. information technologies and control, sofia (2003)
5. romjist (romanian journal of information science and technology) of the romanian academy
6. studies in informatics and control (sic) (founder and chief editor, from 1989)
7.
international journal of computers, communications and control (http://www.journal.univagora.ro/), chief editor (from 2006)
8. control engineering and applied informatics (http://www.upg-ploiesti.ro/srait/publicatii.html)
9. romanian journal of informatics and automatics (http://www.ici.ro/ici/revista/ria.html): founder
10. romanian journal of automatics (http://www.ipa.ro/web3-8re.html)
invited to conferences and seminars at universities and research institutes in: england (1984), austria (1996), brazil (1995), chile (2004), china (1983, 1988, 2006), france (1992, 2006), germany (1991), kuwait (1986), the republic of moldova (1995, 2006), sweden (1970), tunisia (1998).

8 member of professional organizations

• ifac (“international federation of automatic control”): vice-president of the technical committee (tc) 5.4 “large-scale complex systems” (http://www.academiaromana.ro/ifac/ifac_tc54.htm) (1998-2002), president of tc 5.4 (2002-2005), confirmed for a new mandate (2005-2008)
• founding member of sie (simulation europe) (1993-1997)
• honorary member of atic (association for communication and informatics technology, from 1996) (www.atic.ro)
• executive president of the romanian forum for the informational society committee of the romanian academy (from 1997), http://www.academiaromana.ro/forum_info/fpsc.htm
• srait (society of automatics and technical informatics): founding member (1992), vice-president (1992-2001), now member of the director committee (http://www.upg-ploiesti.ro/srait/comitet.html)
• member of the ifac/ifip (“international federation of information processing”) task force “architectures for enterprise integration” (from 1995)
• honorary member of pitch (“association for the promotion of international technological cooperation for humanistic ends”) (2000-2006)

bibliography

[1] corina pascu, f. g. filip (eds.) (2005). visions on information society future in an enlarged europe, ed. academiei, bucharest.
[2] filip, f.g.
(2004). decision support systems, ed. tehnica, bucharest.
[3] filip, f.g. (2002). computer aided decision, ed. tehnica, bucharest.
[4] filip, f.g., dumitrache, i., iliescu, s.s. (eds.) (2002). large scale systems: theory and applications 2001 (lss’01), elsevier science ltd. (pergamon), oxford.
[5] filip, f.g. (coordinator) (2001). informational society - knowledge society, ed. expert, bucuresti.
[6] filip, f.g., boldur, b. (1999). industrial informatics: new paradigms and applications, ed. tehnica.
[7] filip, f.g. (1998). optimization models with sparse matrices and relatively constant parameters. systems analysis modeling and simulation, 33, 407-430.
[8] filip, f.g., alexandru, a. and socol, i. (1997). technology management and international cooperation: several success stories. j. human systems management, 16, 223-229.
[9] filip, f.g., neagu, g. and donciulescu, d. (1996). dssfm: from technology to decision making. in proceedings of the 14th ifac world congress, san francisco, vol. b, 367-372.
[10] filip, f.g. (1995). it culture dissemination in romania: experiments and achievements. in a. inzelt and r. coenen (eds.), knowledge technology transfer and forecasting, kluwer academic, dordrecht, 149-160.
[11] filip, f.g. (1995). towards more humanized dss. in l.m. camarinha-matos and h. afsarmanesh (eds.), balanced automation systems: architectures and design methods, chapman & hall, london, 230-240.
[12] filip, f.g. (1994). evolutions in systems analysis, modeling and simulation in computer based industrial automation. systems analysis, modeling and simulation, 15, 135-149.
[13] filip, f.g. and neagu, g. (1993). cim in continuous and discrete manufacturing: object-oriented generic modeling. ifac j. control eng. practice, 1(5), 815-825.
[14] filip, f.g. (1992). system analysis and expert systems techniques for operative decision-making. in a. sydow (ed.), computational systems analysis: topics and trends, elsevier, amsterdam, 285-304.
[15] filip, f.g. (1988). operative decision-making in the process industry. in preprints, 12th imacs world congress, paris, vol. 4, 523-525.
[16] guran, m. and filip, f.g. (1986). hierarchical and real time systems with distributed data processing, ed. tehnica, bucuresti.
[17] filip, f.g. and donciulescu, d.a. (1983). on on-line direct dynamic coordination method in process industry. ifac j. automatica, 19(1) (pergamon press, oxford), 317-320.
[18] filip, f.g., neagu, g. and donciulescu, d. (1983). job shop scheduling optimization in real time production control. computers in industry, 4(4) (north holland, amsterdam), 395-403.
[19] filip, f.g., donciulescu, d., muratcea, m., oraseanu, l. and predoiu, d. (1980). hierarchical control of large-scale chemical and metallurgical systems. in a. titli and m. g. singh (eds.), large scale systems: theory and application, pergamon press, oxford, 563-575.
[20] http://www.ici.ro/ici/revista/sic.html
[21] http://www.journal.univagora.ro/
[22] http://www.ici.ro/ici/homepage/filipf.html
[23] http://www.academiaromana.ro/cv2006/birou_vice02filip.htm
[24] http://romania-on-line.net/whoswho/filipfloring.htm

ioan dziţac, executive editor of ijccc, agora university, piata tineretului 8, 410526 oradea, romania, idzitac@univagora.ro
mişu-jan manolescu, managing editor of ijccc, agora university, piata tineretului 8, 410526 oradea, romania, rectorat@univagora.ro
horea oros, technical editor and editorial secretary of ijccc, university of oradea, universitatii st. 1, 410087 oradea, romania, horos@uoradea.ro
emma văleanu, webmaster and editorial secretary of ijccc, agora university, piata tineretului 8, 410526 oradea, romania, evaleanu@univagora.ro

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 813-818

software solution for monitoring street traffic and generating optimum routes using graph theory algorithms

m. moise, m. zingale, a.i.
condea

maria moise, marilena zingale, alexandru ioan condea
romanian american university, romania
e-mail: maria.moise@rau.ro, zingale@un.org, zander.aq@gmail.com

abstract: nowadays, big cities are facing traffic jams, generated by the great number of automobiles relative to the limited infrastructure capacity. drivers are faced with these problems: increased time spent between areas of interest, a higher risk of having an accident and, of course, stress. in order to solve the urban traffic-jam problem, a number of solutions have been developed. one of these is tomtom, which offers, free of charge, the possibility to generate navigation indications for a route. unfortunately, the traffic monitoring service is limited to a few countries, and some countries are not in its coverage area. at this time there is no complete method to calculate the optimum route from one destination to another that takes street traffic into account. the only way to get relevant information is the drivers’ personal experience and the news on tv/radio. thus, the choice of route is up to the driver/client and, as a consequence, this method is not a scientific one, being certified only empirically. in this context, the paper presents a software solution which determines the optimum route, taking street traffic into account, thus contributing to a substantial reduction of the time spent in traffic by drivers. the information needed by the application regarding the state of the street traffic can be supplied by agents that check all available information sources (news bulletins, radio, police announcements) and by mobile agents that patrol the streets. the aim of this paper is to present a solution for determining the optimum route choice for cars. the solution is composed of two applications: the first is mapmaker, which designs a street map.
the second is bestroute, which can add traffic coefficients to streets and calculate the optimum route. in order to choose the best route, two criteria are used: the minimum distance and the street traffic coefficients. the data regarding the street map and the traffic situation is taken from a mysql database; the optimum route from destination a to destination b is calculated using a modified dijkstra algorithm.
keywords: optimum routes, graph theory, dijkstra algorithm.

1 introduction

today most big cities are facing traffic jams that are generated by the exponential increase in automobiles versus the original limited traffic flow design. in order to solve urban traffic jams, a series of solutions have been proposed and used worldwide. one of these is tomtom, which offers, free of charge, the possibility to generate a navigation path on its website. unfortunately, the traffic monitoring service is limited to a few countries, and romania is not on the supported list.
copyright © 2006-2010 by ccc publications
in romania such a solution has not been developed yet; existing solutions in bucharest are limited to the synchronization of traffic lights and the existence of a reduced number of intelligent traffic lights. at this time, a complete solution to calculate the optimum route from one destination to another, with the traffic taken into account, does not exist. the only information available is personal experience and/or the news on tv or radio. thus, calculating the optimum route is up to the driver (client) and, as a consequence, it is certified only empirically. in this context, the paper presents a software solution which determines the optimum route, taking street traffic into account, thus contributing to a substantial reduction of the time drivers spend in traffic.
the information needed regarding the state of street traffic can be acquired either from agents that check all available information sources or from agents that patrol the streets. 2 solution description the solution for determining the optimum route is composed of two applications. the first is mapmaker, a map designer with the ability to use a background satellite image for easier plotting. for example, it could use a selection of satellite images of bucharest covering the doamna ghica area and the adjacent neighborhoods. after the background image is selected, the map is designed node by node with the mouse. streets are made by connecting the desired nodes. the resulting map can be saved both on the local hard disk and in an external mysql database. the second application is bestroute, used to set traffic coefficients and calculate optimal routes. the necessary data is taken from a mysql database. the application has two modes of usage: • the client mode, which is used to calculate the optimal route based on the selection of two nodes. the result is shown both graphically and in text mode; • the admin mode, which is used for altering the traffic coefficients of a street. in this mode, one can add, modify and delete user accounts. this mode also provides a set of reports about customers or streets. the bestroute application uses elements and algorithms from graph theory. a modified dijkstra algorithm is used to calculate the minimum-distance path between two nodes of the graph. 
the features of the dijkstra algorithm are the following: a) a list of distances, a list of previous nodes, a list of visited nodes and a current node are established; b) the values in the list of distances are initialized with an infinite value, except the home node, which is set to the value "0"; c) all values in the list of visited nodes are set to "false"; d) all values in the list of previous nodes are initialized with the value "-1"; e) the home node is set as the current node; f) the current node is marked as visited; g) the distances are updated based on the nodes that can be reached directly from the current node; h) the current node is updated to the unvisited node which can be reached by the shortest path from the home node; i) points f) to h) are repeated until all nodes are visited. the formalization of the dijkstra algorithm consists of: step 1. initialization of the initial node with value 0: w(x1) = 0; step 2. setting up the set a comprising the initial node: a = {x1}; step 3. analysis of the nodes outside the set a, namely: software solution for monitoring street traffic and generating optimum routes using graph theory algorithms 815 i. if the nodes can be reached by direct arcs from nodes in a, then for these nodes we calculate: w(xi) = min{w(xj) + v(xj, xi) : xj ∈ a, ∃(xj, xi)} for minimization problems (2.1), and only the node for which the minimum value is obtained is added to the set a; then step 4 is started. step 4. analysis of the set a: i. if xn ∈ a, then its value represents the value of the optimum path from x1 to xn; in order to find this path we start backwards from the final node xn and find the nodes xk1, xk2, ..., xkr which form the path searched for, where xk1 = xn, xkr = x1, and each index ki+1 is the one for which w(xki+1) + v(xki+1, xki) = w(xki); stop. ii. if xn ∉ a, the algorithm is resumed from step 3. 
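as a sketch, the procedure above can be written in python (the adjacency-list format and node names are our own illustrative choices, not the paper's implementation):

```python
import heapq

def dijkstra(graph, source):
    """graph: {node: [(neighbor, weight), ...]}; returns (dist, prev)."""
    dist = {v: float("inf") for v in graph}  # list of distances, init +infinity
    prev = {v: -1 for v in graph}            # list of previous nodes, init -1
    dist[source] = 0                         # home node gets distance 0
    visited = set()
    heap = [(0, source)]
    while heap:
        d, u = heapq.heappop(heap)           # nearest node not yet visited
        if u in visited:
            continue
        visited.add(u)                       # mark current node as visited
        for v, w in graph[u]:                # relax the edges leaving u
            if d + w < dist[v]:
                dist[v] = d + w
                prev[v] = u
                heapq.heappush(heap, (dist[v], v))
    return dist, prev

def path_to(prev, target):
    """walk backwards through the previous-node list, as in step 4."""
    path = []
    while target != -1:
        path.append(target)
        target = prev[target]
    return path[::-1]
```

a traffic-aware variant such as bestroute's would multiply each edge weight by its traffic coefficient before running the same procedure.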
the bestroute application uses data from public sources and from its own agents. on execution, the program queries the mysql database, retrieves the map and its traffic coefficients, then shows a color-coded visual representation of the traffic map. upon requesting a route from address a to address b, a modified dijkstra algorithm is used. it is also possible to select whether one wants the fastest route (traffic-wise) or the shortest route (geographically). the application can also generate reports on customers (useful for the administrator) and on the state of the streets. the reports generated by the application can be exported to excel worksheets and/or pdf documents. for generating reports, the microsoft report viewer component within the .net framework was used, and the reports were created using the report wizard plus manual tweaks for a better presentation of the data. 3 solution output presentation 3.1 mapmaker fig. 1 shows the main screen. figure 1: main screen of the mapmaker module. figure 2: the background image menu. by clicking on file we can: • create a new map (new); • save the existing map (save); • open a saved map (open); • save the map to the database (save to db); • open a saved map from the database (open from db). selecting a background image from the hard disk is done by clicking on background image (fig. 2). to design the nodes, we use the buttons from fig. 3, which enable us to: • add a new node to the map (nod nou); • show details about a selected street (detalii muchie); • delete a selected node (sterge); • clear the map scale (clear scara). figure 3: buttons for creating, modifying and deleting nodes and clearing the map scale. mapmaker also uses a system which automatically determines the scale of the map. it is only necessary to introduce the distance between two nodes the first time. 
the calculation is done using the ratio between the initially introduced distance and the actual pixel distance between the two points. 3.2 bestroute the application's main screen is illustrated in fig. 4. figure 4: options of the file menu. figure 5: options of the reports menu. by clicking on file we can download map and user data from the server or quit. by clicking on clienti while in administrator mode one can add new users, modify or delete existing ones, or set whether a specific user is restricted. by clicking on reports (fig. 5) one can request reports on clients, restricted clients and street traffic. on opening the application, the system displays the map in fig. 6; selecting the start node is illustrated in fig. 7. generating a non-optimized route is shown in fig. 8; after selecting the destination node the application calculates the shortest path available. the optimized route provided by the application is illustrated in fig. 9. figure 6: initial state of the map on opening the application. figure 7: selecting the start node. figure 8: the non-optimized route (shown in red). figure 9: the optimized route, which avoids crowded streets. if the optimize option is checked, the application will generate an optimum route that avoids heavy traffic areas. a time estimate is also offered. for calculating the optimal route, the following distance modifiers based on traffic coefficients were used: 1, 1.25, 1.66, 2.50, 5 and 999 for closed streets. to calculate the estimated time, the following maximum speeds were assumed: for excellent traffic conditions a speed of 50 kph, reducing the speed by 10 kph down to a minimum of 10 kph for very bad traffic. thus, a relatively accurate time can be calculated for the generated route. for the best estimation possible, an interval of plus/minus 20% is assumed. 
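the pairing of the quoted coefficients with the quoted speed ladder below is our assumption; the paper does not state it explicitly. a sketch of the resulting cost and time estimate:

```python
# distance modifiers quoted in the paper, paired here (our assumption)
# with the stated speed ladder: 50 kph down to 10 kph in 10 kph steps.
COEFF_TO_KPH = {1.0: 50, 1.25: 40, 1.66: 30, 2.50: 20, 5.0: 10}
CLOSED = 999  # modifier for closed streets

def route_cost(segments):
    """segments: [(length_km, coeff)]; returns
    (weighted_length, est_minutes, low_minutes, high_minutes)."""
    weighted = 0.0
    minutes = 0.0
    for length_km, coeff in segments:
        if coeff == CLOSED:
            return None  # street closed: route unusable
        weighted += length_km * coeff          # distance seen by the optimizer
        minutes += length_km / COEFF_TO_KPH[coeff] * 60
    return weighted, minutes, minutes * 0.8, minutes * 1.2  # +/- 20% interval
```

`route_cost` returns none for a route through a closed street, mirroring the 999 modifier.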
for both generating options, a street-by-street text solution is also supplied. 4 conclusions the use of such an application by drivers helps avoid the stress and fatigue generated by traffic jams, thus reducing the risk of car crashes. the application also provides information on routes unknown to drivers, thus leading to a better awareness of the city. implementing the application for a taxi or courier company offers an edge over competitors by generating shorter delivery or response times. on the whole, large-scale use of the application in partnership with the town hall and police may lead to a general fluidization of traffic, equally improving the environment. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 727-734 towards low delay sub-stream scheduling wu guofu, dou qiang, wu jiqing, ban dongsong, wenhua dou national university of defense technology, school of computer science, changsha, hunan, p.r. china e-mail: {gfwu,qdou,jqwu,dsban,whdou}@nudt.edu.cn abstract: peer-to-peer streaming is an effective and promising way to distribute media content. in mesh-based systems, the pull method is the conventional way of scheduling, but it often suffers from long transmission delay. in this paper, we present a novel sub-stream-oriented low delay scheduling strategy under the push-pull hybrid framework. 
first, the sub-stream scheduling problem is transformed into the matching problem of a weighted bipartite graph. then we present a minimum delay, maximum matching algorithm. not only is the maximum matching maintained, but the transmission delay of each sub-stream is also kept as low as possible. simulation results show that our method can greatly reduce the transmission delay. keywords: p2p streaming, scheduling, sub-stream, weighted bipartite graph, matching 1 introduction the emerging peer-to-peer (p2p) systems have appeared to be the most promising driving force for video streaming over the internet. by distributing the workload to a large number of low-cost computing hosts such as pcs and workstations, one can eliminate the need for a costly centralized server and at the same time improve the system's scalability. there are already commercial products, e.g., pplive [1] and coolstreaming [3]. any p2p streaming system consists of two distinct but related components: (i) an overlay construction mechanism, and (ii) a content scheduling mechanism. to improve the performance of p2p streaming systems, many studies focus on overlay construction. however, the content scheduling mechanism also has a great impact on performance. a carefully designed scheduling mechanism can achieve a better tradeoff among maximum streaming rate, minimum transmission delay and control overhead. overlay construction falls into two categories: tree(s) and mesh. in the mesh overlay, the pull method, which is very similar to that of the bittorrent protocol [2], is widely used. a recent result [5] shows that unstructured mesh overlay protocols significantly outperform the traditional multi-tree approaches in many aspects. several media content scheduling mechanisms for mesh-based systems have been proposed. coolstreaming/donet [3] and chainsaw [4] propose pull-based scheduling frameworks. 
jm li [6] designs a scheduling algorithm that manipulates the order of data blocks to improve the transmission efficiency of p2p streaming systems. m zhang [7] defines priorities for different blocks according to their rarity and emergency properties, and tries to maximize the average priority sum of each node. based on the traffic from each neighbor, a node in gridmedia [8] subscribes to the pushed packets from its neighbors at the end of each time interval. pulsar [9] combines push-based operations along a structured overlay with the flexibility of pull operations. lstreaming [10] uses sub-streams in the push-pull streaming system. this paper presents an effective content scheduling strategy which can largely reduce the content transmission delay while retaining the main advantage of the pure pull method. first, the original stream is divided into k sub-streams, each with the same rate. a node requests one sub-stream instead of one content block from its neighbors. then the sub-stream scheduling problem is transformed into the matching problem of a weighted bipartite graph. the well-known hungarian algorithm, which solves the maximum matching problem, is improved. not only is the maximum matching preserved by the new improved algorithm, but the transmission delay of each sub-stream is also kept as low as possible. the main difference between our scheme and lstreaming is the choice of available neighbors. the rest of the paper is organized as follows: section 2 describes our motivation. in section 3, we present our algorithm for the sub-stream scheduling problem. simulation results are shown in section 4. finally, we conclude our work in section 5. 2 motivation 2.1 delay analysis of the pull-based scheduling strategy the main drawback of the pull-based scheduling strategy is that content blocks suffer from long delay. 
now we give a quantitative analysis showing why this strategy results in long transmission delay. in such a system, the media content is divided into equal-size blocks, each of which has a unique sequence number. every node (including the source node) periodically broadcasts to all of its neighbors a bit vector called the buffer map (bm), which represents the availability of useful blocks in its buffer pool. according to these announcements, each node decides which blocks to ask for from which neighbor, and periodically sends requests to its neighbors for the desired blocks. when a neighbor receives the request, it puts the desired blocks into its output queue, waiting to send them out. for the efficiency of information exchange, bm and request packets are only sent periodically, so that dozens of notifications can be mapped into a single packet. we denote the interval between two buffer map packets or two request packets as t. figure 1 shows a typical process in which a content block goes from one node to a neighbor. at time t1, a fresh block arrives at node a. because bm packets are only sent at the beginning of each time slot, the useful information waits until time t2, when a new time slot begins. after a period of time td1, at time t3, the bm packet reaches node b. the time td1 contains two components: propagation latency and transmit latency. the propagation latency depends on the length of the links between node a and node b, while the transmit latency depends on the length of the packet and the available bandwidth. the width of the shadow represents the transmit latency. because the length of bm and request packets is small, we ignore their latency. at time t3, node b makes a decision whether or not to request the fresh block from node a. for the same reason, node b sends the request packet at time t4. at time t5, node a receives the request packet, and then puts the fresh block into the output queue. after a queue delay tw, the fresh block is sent out. 
because the length of a content block is large, we cannot ignore the transmit latency of the packet. at time t7, the fresh block is received by node b. now we can compute the one-hop delay, which is the interval between times t1 and t7. apparently, tp and tr are uniform random variables over the time slot length t, and they are independent; their average values are both t/2. we suppose the packet with the largest waiting time is sent first; blocks that reached node a before time t1 within the same time slot will be sent before the fresh block, so the queue delay tw is t − tp. thus, the average latency t1-delay for one packet over one hop can be computed as t1-delay = e[tp + td1 + tr + td2 + tw + td3] = 3t/2 + 3td + l/u (1), where td is the average end-to-end delay, l is the length of a content block, and u is the average upload bandwidth of nodes. figure 1: one hop latency. 2.2 using sub-streams to reduce the one-hop latency t1-delay we can reduce the latency by adopting a shorter cycle t′, but this brings in more information packets; meanwhile, the three end-to-end delays cannot be avoided. a naive solution is that when node a receives the fresh content block, it directly sends it to node b. in this case, the one-hop latency is only td + l/u. but node b may receive repeated blocks from different neighbors under blind packet scheduling. an efficient way is for node a to send specific sub-stream(s) to node b, while other neighbors send other sub-stream(s) to node b. different sub-streams do not intersect. node b decides which sub-stream to subscribe to from which neighbor, and requests sub-stream(s) instead of content block(s). theoretically, node b requests sub-stream(s) only once, and then just waits to receive the media content. figure 2 shows the creation of sub-streams. figure 2: content blocks in different sub-streams. figure 3: the information peer p collects. 
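plugging illustrative numbers into equation (1) shows that the slot term 3t/2 dominates the one-hop delay (the values below are examples, not the paper's settings):

```python
def one_hop_delay(T, td, L, u):
    """average one-hop delay of pull scheduling, eq. (1): 3T/2 + 3*td + L/u."""
    return 1.5 * T + 3 * td + L / u

# illustrative numbers: 1 s slots, 50 ms links, 1 KB blocks, 512 kbps upload
T, td = 1.0, 0.05
L, u = 8 * 1024, 512_000           # bits, bits per second
print(one_hop_delay(T, td, L, u))  # the slot term (1.5 s) dominates
```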
the original media content is divided into blocks of equal size, and each block has a unique sequence number, counting from 1. suppose we divide the original stream into k sub-streams; then the content blocks each sub-stream contains are described in figure 2. the sub-stream to which a content block with sequence number m belongs is determined by the following function f(m): f(m) = m mod k if m mod k ≠ 0, and f(m) = k otherwise (2). 3 sub-stream scheduling in our framework, participating peers adopt a gossip protocol to self-organize into an unstructured mesh overlay. in this section, we study the problem of sub-stream scheduling, that is to say, from which neighbor a node should subscribe to which sub-stream. we suppose that in the initial period the node uses the pull-based method to get content blocks; after that, the push-based method takes over. 3.1 exchanging information peers periodically broadcast sub-stream information and available upload bandwidth to their neighbors. the period can be long, because other events may trigger a node to broadcast the information or send it to chosen neighbors. when a node joins the system, all of its neighbors actively send information packets to it. the information includes the sub-streams the neighbor can provide {sij}, the latency of each sub-stream from the source to the neighbor {hij}, and the available upload bandwidth of the neighbor {ci}. sij = 1 indicates that neighbor i can provide sub-stream j; otherwise sij = 0. hij denotes the number of hops of sub-stream j from the source to neighbor i; if sij = 0 then hij = +∞. ci is not the real bandwidth; it denotes a multiple of one sub-stream's rate. for example, if neighbor i's available upload bandwidth can support transmitting two sub-streams to the local node, then ci = 2. after a short period, the local node has information from all of its neighbors. suppose the original stream is divided into 4 sub-streams. 
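equation (2) translates directly into a small helper:

```python
def substream_of(m, k):
    """sub-stream of block m (blocks count from 1), per equation (2):
    m mod k when that is nonzero, otherwise k."""
    r = m % k
    return r if r != 0 else k
```

with k = 4, blocks 1 through 8 map to sub-streams 1, 2, 3, 4, 1, 2, 3, 4.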
figure 3 shows the information that peer p collects from all of its neighbors. the information on the left side of every connection shows the sub-streams each neighbor can provide and their latency from the stream source node, while the number on the right side shows how many sub-streams the neighbor's available upload bandwidth can support. for example, in figure 3, neighbor p1 can provide sub-streams 1, 2 and 3, with corresponding latencies of 3, 4 and 4 hops respectively; meanwhile, the available bandwidth between p1 and p can support 2 sub-streams. 3.2 constructing the weighted bipartite graph more than one neighbor can provide the same sub-stream, so we must decide which neighbor is the most suitable provider for each sub-stream. here we transform the sub-stream scheduling problem into the matching problem of a weighted bipartite graph. first, we construct the weighted bipartite graph according to the information collected from the neighbors. the weighted bipartite graph can be expressed by a quadruple g = (x, y, e, w), where x denotes the set of sub-stream providers, y the set of sub-streams, e the set of connections between x and y, and w the set of weights on the connections. there is at most one connection between any two elements of x and y respectively. so if a neighbor's upload bandwidth can support several sub-streams, we must express this in the bipartite graph. here, we allow identical elements to coexist in the set x to handle this situation: if neighbor i's available upload bandwidth can support c sub-streams, then we put c identical copies of neighbor i into the set x. when all neighbors have been added, the construction of the set x is complete. next we build the connections between x and y: if an element x ∈ x can provide sub-stream y ∈ y, then a connection e = (x, y) is added to the connection set e. 
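the duplication of providers by capacity can be sketched as follows (the dictionary layout for neighbor information is our own choice; the test uses the figure-3 numbers quoted for p1):

```python
def build_bipartite(neighbors):
    """neighbors: {name: (capacity c_i, {substream j: hops h_ij})}.
    duplicates each neighbor c_i times in X, as the paper prescribes,
    and returns the quadruple (X, Y, E, W)."""
    X, E, W = [], [], {}
    for name, (cap, hops) in sorted(neighbors.items()):
        for copy in range(cap):
            x = (name, copy)          # identical provider copies, made distinct
            X.append(x)
            for sub, h in hops.items():
                E.append((x, sub))
                W[(x, sub)] = h       # weight = hops from the stream source
    Y = sorted({sub for _, sub in E})
    return X, Y, E, W
```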
the weight of connection e is the number of hops of sub-stream y from the stream source to provider x. we use the example in figure 3 to illustrate the construction of the weighted bipartite graph. neighbor p1's available upload bandwidth can support 2 sub-streams, so there are 2 p1 elements in the set x, as shown in figure 4. neighbor p1 can provide sub-streams 1, 2 and 3, with corresponding latencies of 3, 4 and 4 hops; accordingly, in the weighted bipartite graph, both copies of provider p1 have connections with sub-streams 1, 2 and 3, with weights 3, 4 and 4 respectively. the whole weighted bipartite graph transformed from figure 3 is shown in figure 4; the number on each edge is its weight. 3.3 minimum delay, maximum matching algorithm (mdmma) after the weighted bipartite graph has been constructed, we try to find the best matching. here we mainly consider two targets: first, more sub-streams should be transmitted in parallel, because this makes use of the peers' upload bandwidth as much as possible, speeding up the transmission of the media content; second, the number of hops of each sub-stream from the source to the node should be as small as possible, because this leads to lower delay. existing matching algorithms for bipartite graphs address only one of the two targets. in this paper, we enhance the hungarian algorithm, the well-known algorithm for solving the maximum matching problem of a bipartite graph. the improved hungarian algorithm first ensures that the matching is a maximum matching, and then tries to find the lowest-latency matching. the algorithm is described in detail in table 1. the main changes occur in steps 1.2 and 1.4. in step 1.2, when there are several candidate "unchecked" providers, we choose the provider whose connected edge has the minimum weight. 
in step 1.4, to find the augmenting chain, we start from the vertex whose connected edge has the minimum weight, making sure that the edge with the lowest weight is put into the matching. although the running time of the algorithm is o(n3), n is usually less than 30, so a peer can find the appropriate neighbors quickly. table 1: min delay max matching algorithm. input: g = (x, y, e, w); output: m. step 0. set m = ∅, and leave all vertices in g unlabeled. step 1.1. let s = {x : x ∈ x and there exists y ∈ y with (x, y) ∈ m}. if x\s = ∅, finish; else label every element x ∈ x\s with "-1" and "unchecked". step 1.2. if every x ∈ x is checked, finish; else choose the "unchecked" xi whose connected edge has the minimum weight. step 1.3. if all y ∈ {y : y ∈ y and (xi, y) ∈ e} have been labeled, then label xi "checked" and go to step 1.2. step 1.4. let p = {y : y ∈ y, (xi, y) ∈ e and y is not labeled}, and label all y ∈ p with "i". let q = {y : y ∈ y and there exists x ∈ x with (x, y) ∈ m}. if p\q ≠ ∅, choose the yj ∈ p\q whose connected edge has the minimum weight and go to step 2; else, for every yj ∈ p, label the xp with (xp, yj) ∈ m with "j" and "unchecked", label xi "checked", and go to step 1.2. step 2. find the augmenting chain c starting from yj until a vertex x with label "-1" is found; set m = m ⊕ c, cancel the labels of all vertices in g, and go to step 1.1. figure 4: weighted bipartite graph. figure 5: min delay, max matching. the improved hungarian algorithm is heuristic: it ensures that the matching is a maximum matching, but it cannot theoretically guarantee that the matching has minimum delay. applying the min delay max matching algorithm to the weighted bipartite graph in figure 4, we obtain the matching shown in figure 5. according to this result, neighbor p1 provides sub-streams 1 and 2 to peer p, and neighbor p3 provides sub-streams 3 and 4. 
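one possible reading of table 1 is kuhn's augmenting-path maximum matching with the lighter edges tried first; the sketch below follows that heuristic and is not the authors' exact procedure (the set y is implicit in e here):

```python
def mdmma(X, Y, E, W):
    """heuristic min-delay maximum matching: kuhn's augmenting-path
    algorithm, scanning lighter edges first so each sub-stream tends to
    come from the provider with the fewest hops."""
    adj = {x: sorted((W[(x, y)], y) for (xx, y) in E if xx == x) for x in X}
    match = {}  # sub-stream y -> chosen provider x

    def augment(x, seen):
        for _, y in adj[x]:
            if y in seen:
                continue
            seen.add(y)
            if y not in match or augment(match[y], seen):
                match[y] = x  # (re)assign y along the augmenting chain
                return True
        return False

    # providers whose best edge is lightest get first pick
    for x in sorted(X, key=lambda v: adj[v][0][0] if adj[v] else float("inf")):
        augment(x, set())
    return match
```

like the paper's algorithm, this keeps the matching maximum while merely biasing, not guaranteeing, low total delay.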
3.4 broadcasting new sub-stream information when a node has decided which sub-stream to subscribe to from which neighbor, it sends a sub-stream request to the appointed neighbor. if the neighbor rejects the request, the node cuts the matching connection in the weighted bipartite graph and finds another provider with the lowest latency. it requests again until it subscribes to the sub-stream successfully or there is no usable connection. then it broadcasts the subscribed sub-streams to its neighbors; the hop count of each sub-stream in these information packets is the accepted hop count incremented by 1. when a neighbor accepts the request, it sends out the content blocks belonging to the desired sub-stream in the push window. if some sub-stream cannot find a proper provider, the node should find more new neighbors. 3.5 other discussion monitoring a neighbor's available upload bandwidth. to monitor a neighbor's upload bandwidth, we measure the interval between consecutive packets of the same sub-stream. if the interval remains unchanged (or changes only slightly), we suppose the bandwidth of the neighbor is affluent; if the interval changes greatly, we suppose it is deficient. the rationale behind this conclusion is that if the bandwidth is affluent, there will be no queue at the neighbor and little burstiness will occur, so the interval stays unchanged; if the bandwidth is deficient, bursts will often happen, leading to changing intervals. requesting missing blocks from neighbors with affluent bandwidth. packets in transmission may be lost. when the missing content blocks enter the pull window, we request them explicitly from a neighbor with affluent available upload bandwidth. sub-stream re-scheduling. due to network dynamics, neighbors' upload capacity may fluctuate. when a neighbor is no longer competent as a sub-stream provider, a new provider should be chosen; we choose the appropriate provider from the neighbors with affluent bandwidth. 
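the interval-based bandwidth monitor can be sketched as a simple jitter test (the 25% tolerance threshold is our assumption; the paper gives no number):

```python
def bandwidth_state(arrival_times, tolerance=0.25):
    """classify a neighbor as 'affluent' or 'deficient' from the
    inter-packet intervals of one sub-stream: near-constant intervals
    suggest spare bandwidth, large swings suggest queuing bursts."""
    gaps = [b - a for a, b in zip(arrival_times, arrival_times[1:])]
    mean = sum(gaps) / len(gaps)
    spread = max(gaps) - min(gaps)
    return "affluent" if spread <= tolerance * mean else "deficient"
```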
the neighbor who can provide the sub-stream with fewer hops has higher priority. 4 performance evaluation 4.1 simulation settings we use the random model of gt-itm [11] to generate the underlying topology with 5000 routers. the transmission delay of each link is set to a uniform random variable within [10ms, 100ms]. we randomly choose 2000 routers and connect one peer to each. a peer randomly selects 10 to 15 other nodes as its neighbors to construct the mesh overlay. we set the playback rate of the original stream to 512 kbps and divide the stream into 32 sub-streams, each with a rate of 16 kbps. the size of each content block is 1 kbyte. the upload capacity of the video source is 5 mbps. all peers are assumed to be dsl users with three types of available upload bandwidth: 1 mbps, 512 kbps and 256 kbps; these three types represent 15%, 60% and 35% of the total peers respectively. we suppose each peer's download capacity is infinite. we run the simulation in matlab 7.0. 4.2 simulation results here we compare our algorithm mdmma with the pure pull-based method. we mainly care about the transmission delay of the stream. figure 6 shows the cumulative distribution function of the content block transmission delay of the two methods in the steady state. figure 6: content block transmission delay cdf. we can see that our algorithm mdmma reduces the transmission delay greatly. for example, for 90% of the nodes to get the content blocks, it costs less than 50 seconds with our method, while with the pure pull-based method it costs more than 80 seconds. we conclude that our proposed method is much better than the pure pull-based method in delay performance. 5 conclusion a p2p streaming system consists of two components: (i) an overlay construction mechanism, and (ii) a content scheduling mechanism. studies show that mesh-based systems perform better than tree-based systems, especially under high node churn. 
in this paper, we present a new scheduling mechanism to improve the performance of mesh-based p2p streaming systems. our contribution is twofold: first, we transform the sub-stream scheduling problem into the matching problem of a weighted bipartite graph; second, we present a minimum delay, maximum matching algorithm. not only is the maximum matching maintained, but the transmission delay of each sub-stream is also kept as low as possible. simulation shows that our method can reduce the transmission delay greatly. bibliography [1] pplive, http://www.pplive.com/. [2] bittorrent, http://bitconjuer.com/. [3] x. zhang, j. liu, et al., "coolstreaming/donet: a data-driven overlay network for efficient media streaming", in proc. of infocom 2005, usa, pp. 2102-2111, mar. 2005. [4] v. pai, k. kumar, et al., "chainsaw: eliminating trees from overlay multicast", peer-to-peer systems, pp. 127-140, nov. 2005. [5] n. magharei, r. rejaie, and y. guo, "mesh or multiple-tree: a comparative study of p2p live streaming services", in proc. of infocom 2007, usa, pp. 1424-1432, may 2007. [6] jm. li, c.k. yeo, and b.s. lee, "peer-to-peer streaming scheduling to improve real-time latency", in proc. of multimedia and expo, china, pp. 36-39, jul. 2007. [7] m. zhang, y.q. xiong, et al., "optimizing the throughput of data-driven peer-to-peer streaming", ieee transactions on parallel and distributed systems, vol. 20, no. 1, pp. 97-110, may 2008. [8] m. zhang, j.g. luo, et al., "a peer-to-peer network for live media streaming using a push-pull approach", in proc. of the 13th annual acm international conference on multimedia, singapore, pp. 287-290, 2005. [9] t. locher, r. meier, et al., "push-to-pull peer-to-peer live streaming", in proc. of disc 07, germany, pp. 388-402, 2007. [10] z.j. li, y. yu, et al., "towards low redundancy push-pull p2p live streaming", in proc. of acm sigcomm 2008 demo, usa, aug. 2008. 
[11] e.w. zegura, k.l. calvert, and s. bhattacharjee, "how to model an internetwork", in proc. of infocom 1996, usa, pp. 594-602, 1996. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 2, pp. 149-160 a neural approach of multimodel representation of complex processes nesrine elfelly, jean-yves dieulot, pierre borne abstract: the multimodel approach was recently developed to deal with the issues of complex process modeling and control. despite its success in different fields, it is still faced with some design problems, in particular the determination of the models and of an adequate method for computing validities. in this paper, we propose a neural approach to derive different models describing the process under different operating conditions. the implementation of this approach requires two main steps. the first step consists in exciting the system with a rich (e.g. pseudo-random) signal and collecting measurements. these measurements are classified using an adequate kohonen self-organizing neural network. the second step is a parametric identification of the base models, using the classification results for order and parameter estimation. the suggested approach is implemented and tested on two processes and compared to the classical modeling approach. the obtained results turn out to be satisfactory and show good precision. they also allow us to draw some interpretations about the adequate validity-computation method based on the classification results. keywords: complex processes, modeling, multimodel approach, kohonen map 1 introduction nowadays, technological developments increase the complexity of systems. this complexity can be caused by nonlinearity, instability, a wide operating domain, variations of system parameters or external perturbations. as a result, it is often difficult or even impossible to propose a simple model which could capture the whole process complexity using physical laws. 
Consequently, it is very useful to focus on advanced and practical approaches in order to handle this complexity. The multimodel approach has recently been developed and applied in several science and engineering domains. It was proposed as an efficient and powerful method to cope with modeling and control difficulties when complex nonlinear and/or uncertain processes are concerned. The multimodel approach supposes the definition of a set of models: it then becomes possible to replace the unique model by a set of simpler models, thus forming a so-called models' base. Each model of this base describes the behavior of the considered process at a specific operating point. The objective of the multimodel approach is to decrease the process complexity by studying it under certain specific conditions. Several researchers have been interested in multimodel analysis and control approaches [7, 8, 19] and many applications have been proposed in different contexts. In spite of its success in many fields (academic, biomedical, ...), the multimodel approach remains confronted with several difficulties, such as the calculation of the models' validities, the adequate technique of fusion or switching between models, as well as the determination of the models' base. Indeed, in 1985, Takagi and Sugeno [21] suggested a fuzzy process representation by the contribution of local models. This approach has been applied in many fields of activity but often faced problems related to the lack of information about the system structure or the uncertainty of its parameters. Besides, it can sometimes lead to a large number of models, which generates a high computational burden when designing the control algorithm. This has led several researchers to develop other approaches in order to cope with these difficulties. For uncertain complex systems with bounded parameters, some approaches were developed for the determination of the models' base [9, 11].
An extension to uncertain discrete systems has been proposed by Mezghani [13]. The case of multivariable systems was addressed by Raissi [16]. The approaches previously mentioned require knowledge of the parameters' variation limits, which is generally not available, in particular for uncertain systems whose parameter variation domains are unknown or ill-known. Other related studies [12, 19] suggest that the system be described by a set of local models, often defined by using a reference model and some linearization method. In another context, both fuzzy logic and neural networks have been applied to multimodel control [1, 18]. One idea was to use neural approaches for complex systems modeling. These methods have the advantage of requiring very little information about the considered process and are useful for uncertain systems. Within this context, very few studies [2, 3, 15, 17, 20, 22] have been proposed, and they do not address strongly nonlinear systems. In this paper, a neural approach for the determination of the models' base of uncertain complex systems is proposed, in particular for those which exhibit strong nonlinearities. The proposed approach requires little a priori knowledge about the considered system; input/output information alone can be sufficient. In the following section, the different steps of the proposed modeling approach are detailed. Two simulation examples and some interpretations are then presented. The evaluation of the suggested modeling strategy is the topic of the last section, which is followed by a conclusion.

2 Complex systems' modeling: neural approach

In this section, a models'-base determination approach for uncertain complex systems is described. This approach requires neither the definition of a global model nor the knowledge of parameter variation domains; only input/output information is needed.
The suggested approach allows the generation of the base-models' structure and parameters. Its application requires, first, classifying the numerical data by exploiting a Kohonen map and determining the number of models. Secondly, a structural and parametric identification of the different base-models is carried out by using the classification results. Then the validity index of each model is computed. Finally, the multimodel output is obtained by the fusion of the models' outputs weighted by their validity indexes.

2.1 Classification of the numerical data by using a Kohonen map

As a first step, the output or input/output measurements collected from the considered process have to be classified in order to identify operating clusters from which the models' base will be deduced. This classification is carried out by using a self-organizing Kohonen map.

Self-organizing Kohonen map methodology

The self-organizing map (SOM) is a specific kind of neural network. In its original form, the SOM was invented by the founder of the Neural Networks Research Centre, Professor Teuvo Kohonen, in 1981-82. The special property of the SOM is that it effectively creates spatially organized internal representations of the various features of input signals and their abstractions [10]. The schematic representation of this network is given in Figure 1. Neurons in the target layer are ordered and correspond to cells of a two-dimensional map. Every neuron of the input layer is connected to every neuron of the output layer. The classification strategy consists in applying the Kohonen rule. This rule is characterized by unsupervised competitive learning, where a competition takes place before the modification of the network weights: only the neurons which win the competition are allowed to change their weights.
The Kohonen rule works as follows (Figure 1: Kohonen map): weights are initialized to random values; an input vector is presented to the network; the distance between the input vector and the weights connecting the inputs to each output neuron is computed; the neuron corresponding to the smallest distance, i.e. the one nearest to the input vector, wins the competition, and the weights connecting the inputs to this neuron are modified accordingly [5]. This procedure is repeated several times until the weights stabilize. At the end of the learning stage, the representative vectors of the different clusters and their centers are obtained.

Determination of the operating-system clusters

The determination of the operating-system clusters requires, firstly, that the considered system be excited; secondly, the number of clusters has to be determined; the third step consists in classifying the data by using a Kohonen neural network. The first step consists in applying an input signal and then collecting (output or input/output) measurements that will be used for classification. The excitation signal must be rich enough and persistently exciting, with well-chosen parameters, in order to allow a full excitation of the operating dynamics and to take into consideration the nonlinear aspect of the considered process. For the second step, the method proposed by Talmoudi has been adopted for the determination of the adequate number of clusters, which corresponds to the number of models [22]. Within this scope, a Kohonen network with a large number N of neurons in the output layer is considered. At the end of the learning procedure, if the repartition of the classes is not good, the clusters i whose number of elements nc_i verifies relation (1) have to be removed [22]:

nc_i <= (1/2) (N_H / N),   (1)

where N_H represents the number of considered measurements.
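The classification step described above can be sketched in a few lines. The sketch below is a minimal illustration only: it uses plain winner-take-all competitive learning (the neighborhood update of a full Kohonen map is omitted), and all function and parameter names are hypothetical.

```python
import numpy as np

def kohonen_classify(data, n_out, epochs=50, lr=0.2, seed=0):
    """Winner-take-all competitive learning: a minimal stand-in for a
    Kohonen map (the SOM neighborhood function is deliberately omitted)."""
    rng = np.random.default_rng(seed)
    # One weight vector per output neuron, initialized randomly in the data range.
    w = rng.uniform(data.min(), data.max(), size=(n_out, data.shape[1]))
    for _ in range(epochs):
        for x in rng.permutation(data):
            i = np.argmin(np.linalg.norm(w - x, axis=1))  # winning neuron
            w[i] += lr * (x - w[i])                       # Kohonen rule update
    labels = np.array([np.argmin(np.linalg.norm(w - x, axis=1)) for x in data])
    return w, labels

def valid_clusters(labels, n_out):
    """Relation (1): keep cluster i only if nc_i > (1/2) * N_H / N."""
    n_h = len(labels)
    counts = np.bincount(labels, minlength=n_out)
    return [i for i in range(n_out) if counts[i] > 0.5 * n_h / n_out]
```

Clusters failing the test of relation (1) would be removed and the training restarted with a different number of output neurons, as described in the text.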
Otherwise, the number of neurons in the output layer is increased and the training is restarted. The same procedure is repeated until a satisfactory number of clusters is obtained. Afterwards, the data classification is tackled by using a Kohonen network for which the number of neurons in the input layer equals the number of system variables to be considered and the number of neurons in the output layer equals the number of clusters determined with the help of the method previously described. The classification results are then exploited for the identification of the different base-models.

2.2 Parametric identification of the base-models

In this section, the orders of the models are estimated in a first step. The chosen method is the so-called instrumental determinants' ratio test. This method is mainly based on conditions concerning a matrix called the "information matrix", which contains the input/output measurements [4]. This matrix is described as follows:

Q_m = (1/N_H) Sum_{k=1}^{N_H} [u(k), u(k+1), u(k-1), u(k+2), ..., u(k-m+1), u(k+m)]^T [y(k+1), u(k+1), ..., y(k+m), u(k+m)],   (2)

where N_H is the number of observations. The instrumental determinants' ratio (RDI) is given by:

RDI(m) = | det(Q_m) / det(Q_{m+1}) |.   (3)

For every value of m, the order-determination procedure consists in building the matrices Q_m and Q_{m+1} and in evaluating the ratio RDI(m); the retained order m is the value for which the ratio RDI(m) increases sharply for the first time. As a second step, the parametric identification issue consists in calculating the values of the parameters of the corresponding model equation, given several experimental measurements which describe the dynamic behavior of the system. As previously mentioned, the data classification gives a certain repartition of clusters. For each cluster, input/output measurements are collected.
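The order test above can be sketched numerically. Note that the exact layout of the instrument vector in (2) is partly an assumption here (it is taken as the 2m inputs u(k), u(k+1), u(k-1), ..., u(k-m+1), u(k+m), which makes Q_m square); the function names are illustrative.

```python
import numpy as np

def information_matrix(u, y, m):
    """Assumed form of Q_m in (2): averaged outer product of an instrument
    vector of 2m inputs with the regressor [y(k+1), u(k+1), ..., y(k+m), u(k+m)]."""
    n = len(u)
    q = np.zeros((2 * m, 2 * m))
    count = 0
    for k in range(m - 1, n - m):
        iv, reg = [], []
        for i in range(1, m + 1):
            iv += [u[k - i + 1], u[k + i]]   # u(k), u(k+1), u(k-1), u(k+2), ...
            reg += [y[k + i], u[k + i]]      # y(k+1), u(k+1), ..., y(k+m), u(k+m)
        q += np.outer(iv, reg)
        count += 1
    return q / count

def rdi(u, y, m):
    """Instrumental determinants' ratio (3)."""
    return abs(np.linalg.det(information_matrix(u, y, m))
               / np.linalg.det(information_matrix(u, y, m + 1)))
```

For noise-free data generated by an m-th order model, Q_{m+1} becomes (nearly) singular, so RDI(m) jumps sharply at the true order, which is the detection criterion used in the text.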
These measurements allow the identification of the corresponding model. To this end, the recursive least-squares (RLS) method [4] was applied to achieve the parameter estimation.

2.3 Computation of validities

The validity coefficient is a number belonging to the interval [0, 1]. It represents the relevance degree of each base-model, calculated at each instant. In the literature, several methods have been proposed to deal with the validity issue. In our study, the residues' approach was adopted for the calculation of the validities. This method is based on a distance measurement between the process and the considered model. For example, the residue can be given by the following expression:

r_i = |y - y_i|,  i = 1, ..., n,   (4)

where n is the number of base-models, y is the process output and y_i is the output of the model M_i. If this residue value is equal to zero, the corresponding model M_i perfectly represents the process at that time. On the contrary, a non-null value reflects the fact that the model M_i represents the system only partially. The normalized residues are given by:

r'_i = r_i / Sum_{j=1}^{n} r_j.   (5)

Within the context of the residues' approach, several methods have been proposed for the calculation of validities [6, 13, 14]. Only two methods will be considered: the simple and the reinforced validities. The validities are given by:

v_i = 1 - r'_i.   (6)

The simple and reinforced validities are defined by using the following formulas.

Simple validities: the normalized simple validities are defined so that their sum equals 1 at each time:

v_i^simp = v_i / (n - 1).   (7)

Reinforced validities: for this type of validity, the reinforcement expression is introduced as:

v'_i^renf = v_i Prod_{j=1, j!=i}^{n} (1 - v_j).   (8)

The normalized reinforced validities can then be written as follows:

v_i^renf = v'_i^renf / Sum_{j=1}^{n} v'_j^renf.   (9)
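The validity computations of equations (4)-(9) translate directly into code; a minimal sketch for a single time instant (the function name is illustrative):

```python
import numpy as np

def validities(y, y_models):
    """Residues' approach, equations (4)-(9): returns the normalized
    simple and reinforced validities of each base-model at one instant."""
    y_models = np.asarray(y_models, dtype=float)
    r = np.abs(y - y_models)                      # residues (4)
    rp = r / r.sum()                              # normalized residues (5)
    v = 1.0 - rp                                  # validities (6)
    n = len(v)
    v_simp = v / (n - 1)                          # simple validities (7)
    v_renf = np.array([v[i] * np.prod(np.delete(1.0 - v, i))
                       for i in range(n)])        # reinforcement (8)
    return v_simp, v_renf / v_renf.sum()          # normalization (9)
```

Both returned vectors sum to one, and the reinforced validities concentrate the weight on the model with the smallest residue, which is exactly the behavior discussed in the interpretation section.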
2.4 Computation of the multimodel output

The multimodel output is calculated by a fusion of the models' outputs weighted by their respective validity indexes, as illustrated by the following expression:

y_mm(k) = Sum_{i=1}^{n} y_i(k) v_i(k),   (10)

where v_i(k) can be a simple or a reinforced validity, for which Sum_{i=1}^{n} v_i(k) = 1.

3 Simulation examples

In order to underline the interest and the performance of the proposed approach, some simulation examples are carried out.

3.1 Example 1: second-order discrete system

The considered system is a complex discrete system whose evolution is described by the following equation:

y(k) = -a_1(k) y(k-1) - a_2(k) y(k-2) + b_1(k) u(k-1) + b_2(k) u(k-2).   (11)

The variation laws of the different parameters of the process are given by:

a_1(k) = 0.04 sin(0.035k) - 0.8,  a_2(k) = 0.005 sin(0.03k) + 0.1,   (12)
b_1(k) = 0.02 sin(0.03k) + 0.5,  b_2(k) = 0.01 sin(0.035k) + 0.2.   (13)

First, the system is excited by a uniform random signal u(k). Then the measurements y(k) and y(k-1) are collected at different instants. The adequate number of clusters, determined by using the method described in section 2.1, is equal to three. The numerical data are fed into a Kohonen network with two neurons in the input layer and three neurons in the output layer. The classification results are given in Figure 2. From each of the three data sets relative to the various clusters, the orders and the parameters of the transfer functions of the three base-models are estimated. Figure 3 shows the evolutions of RDI_i(m) for the three models.
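The time-varying system (11)-(13) of example 1 can be simulated directly; the sketch below excites it with a uniform random input, as in the paper (zero initial conditions are an assumption, and the function name is illustrative):

```python
import numpy as np

def simulate_example1(u):
    """Simulates the time-varying second-order system (11)-(13)."""
    n = len(u)
    y = np.zeros(n)  # assumed zero initial conditions
    for k in range(2, n):
        a1 = 0.04 * np.sin(0.035 * k) - 0.8   # (12)
        a2 = 0.005 * np.sin(0.03 * k) + 0.1
        b1 = 0.02 * np.sin(0.03 * k) + 0.5    # (13)
        b2 = 0.01 * np.sin(0.035 * k) + 0.2
        y[k] = -a1 * y[k-1] - a2 * y[k-2] + b1 * u[k-1] + b2 * u[k-2]  # (11)
    return y
```

The pairs (y(k-1), y(k)) produced this way are the two-dimensional data that are fed to the Kohonen network for classification.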
Figure 2: classification results (output data y(k) versus y(k-1), with the clusters' centers). Figure 3: evolutions of the RDI for the three base-models (m_1 = m_2 = m_3 = 2).

It appears clearly that the estimated orders of the three models are equal to two. This result is expected, since the considered system is a second-order one. The application of the recursive least-squares method yields the following transfer functions:

F_1(z^-1) = (0.33549 z^-1 - 0.047793 z^-2) / (1 - 0.83697 z^-1 - 0.039754 z^-2),   (14)
F_2(z^-1) = (0.41086 z^-1 - 0.021659 z^-2) / (1 - 0.8633 z^-1 - 0.055068 z^-2),   (15)
F_3(z^-1) = (0.36985 z^-1 - 0.0079934 z^-2) / (1 - 0.9234 z^-1 - 0.060727 z^-2).   (16)

In order to evaluate the obtained global model, a validation step is carried out in which inputs different from those used for classification are fed into the system; the real and the multimodel outputs are then compared. Let us consider the following input sequence:

u(k) = 1 + sin(0.08k).   (17)

The validation results are given in Figures 4 and 5, where y is the real output of the system and y_mmr, y_mms are the multimodel outputs obtained by using the methods of reinforced and simple validities respectively:

y_mmr(k) = Sum_{i=1}^{n} y_i(k) v_i^renf(k),  y_mms(k) = Sum_{i=1}^{n} y_i(k) v_i^simp(k).   (18)

e_r and e_s denote the relative errors between the real and the multimodel outputs:

e_r(k) = |(y(k) - y_mmr(k)) / y(k)|,  e_s(k) = |(y(k) - y_mms(k)) / y(k)|.   (19)
Figure 4: real and multimodel outputs. Figure 5: evolutions of the relative errors.

It can be seen that the multimodel output obtained by the fusion of the base-models' outputs weighted by the reinforced validities follows the real output with a negligible error e_r(t). This error is larger when the simple validities method is applied (e_s(t)). This allows us to conclude that, for this kind of system, the reinforced validities method is more appropriate than the simple validities one.

3.2 Example 2: second-order continuous system with input/output-dependent parameters

As a second simulation example, we consider the system whose evolution is described by the following equations:

a_0 y + a_1 y' + y'' = u + b u',   (20)
a_0(y) = 0.3 + sigm(y - 2),  a_1(y) = sat(y^2),  b(u) = sat(u),   (21)

where sigm is the sigmoid function and sat is the saturation function. The considered system is complex and strongly nonlinear, with parameters that are functions of both the input and the output, which makes the modeling task difficult. The chosen excitation signal (Figure 6) is a sine curve distorted by a uniform random signal, since this input is richer than a simple random signal and allows the complex and nonlinear aspects of the system to be taken into account. It is worth noting that the signal parameters (frequency, amplitude) need to be adjusted in order to obtain good results.

Figure 6: evolution of the excitation signal.

Moreover, in order to describe the system dynamics, the number of variables used for classification is increased: not only the output data y(k) and y(k-1) are considered, but also the input data u(k-1).
After generating the output, the input and output signals are sampled with an adequate sampling period in order to collect the different measurements: y(k), y(k-1) and u(k-1). Once the number of clusters is determined, the numerical data are presented to a Kohonen network with three neurons in both the input and the output layers. At the end of the learning procedure, three data sets (Figure 7) are obtained, each of which is used for the identification of the corresponding model.

Figure 7: three data sets relative to the different clusters (y(k), y(k-1) and u(k-1), with the clusters' centers).

The application of the instrumental determinants' ratio test allows us to determine the three models' orders, which are respectively two, three and two. The corresponding transfer functions are given by the following expressions:

F_1(z^-1) = (0.058303 z^-1 + 0.052302 z^-2) / (1 - 0.43246 z^-1 - 0.45748 z^-2),   (22)
F_2(z^-1) = (-0.023815 z^-1 + 0.0013593 z^-2 + 0.031481 z^-3) / (1 - 0.68802 z^-1 - 0.24025 z^-2 - 0.072128 z^-3),   (23)
F_3(z^-1) = (0.12995 z^-1 - 0.014359 z^-2) / (1 - 0.64023 z^-1 - 0.37958 z^-2).   (24)

The validation step uses the following input sequence:

u(k) = 1 + sin(k).   (25)

The validation results are illustrated in Figures 8 and 9.

Figure 8: real and multimodel outputs. Figure 9: evolutions of the relative errors.

Contrary to the first example, the results obtained by the application of the simple validities method are much better than those given by the reinforced validities method.
In fact, Figures 8 and 9 show that the multimodel output y_mms(t), deduced by fusion of the base-models' outputs weighted by the simple validities, follows the real output almost perfectly, with an error e_s(t) nearly null compared to the error e_r(t), which is relatively large. In order to highlight the capacity of the identified models to reproduce the system behavior in different operating domains, let us consider another input sequence given by:

u(k) = 1.2 + 1.5 sin(2k).   (26)

The multimodel output is generated by application of the simple validities method. The result is given in Figure 10.

Figure 10: real and multimodel outputs (second validation).

3.3 Interpretations

Referring to the obtained simulation results, it can be noticed that the application of the suggested approach allows a good modeling of the considered systems. We can also make an important interpretation about the use of the validity-computation methods. As remarked before, in some cases the simple validities give better results, while in other cases it is preferable to use the reinforced validities. Further simulations and observations led to the conclusion that the choice of the validity-computation method depends on the classification results, i.e. the structure and repartition of the clusters. Thus, when there are several variations within the same cluster and when an overlapping between clusters occurs, which is the case in the second simulation example (Figure 7), it is worthwhile to use the simple validities method, since it takes the different models' outputs into account according to expression (7); in this case, no single model can represent the process ideally at all times. But when the clusters present very few variations and are well separated (Figure 2), the reinforced validities method is better adapted.
The application of this method, thanks to the reinforcement expression (8), promotes the contribution of the most dominant model, i.e. the one which best represents the process behavior.

4 Evaluation of the suggested modeling strategy

In order to highlight the interest and the performance of the proposed modeling strategy, the classical modeling approach, involving the identification of a single global model, was carried out, and the results given by the suggested approach were compared to those given by the classical strategy. Let us consider the second simulation example, described by equations (20) and (21), and the excitation signal given in Figure 6. By using the instrumental determinants' ratio test for the estimation of the order and the recursive least-squares method for the parametric identification, the transfer function F(z^-1) of the global model is given by the following expression:

F(z^-1) = (0.03601 z^-1 + 0.0035236 z^-2) / (1 - 0.55726 z^-1 - 0.42595 z^-2).   (27)

By using the same numerical data, the classification and identification steps give the three models described by equations (22), (23) and (24). The input sequence given by equation (25) was applied again. Figure 11 represents the evolutions of the real, the multimodel and the global-model outputs; the multimodel output is obtained by fusion of the models' outputs weighted by their simple validity degrees calculated at each instant.

Figure 11: evolutions of the real, multimodel and global-model outputs.

The adopted multimodel approach offers a very satisfactory precision compared to the classical modeling approach based on a single global model.

5 Conclusion

In this paper, a neural approach of multimodel representation is proposed.
This approach is applicable when dealing with complex, strongly nonlinear and uncertain processes. It allows the determination of the models' base by using a Kohonen network and two methods of structural and parametric identification. The different steps have been detailed. The multimodel output is obtained by using the fusion technique and the adequate validity-computation method. The suggested approach has been implemented and tested on different complex systems. Simulation results, two of which were described in this paper, prove the efficiency and the precision of the proposed modeling strategy and show that the method works well with various processes, even highly complex ones. Some interpretations have been made about the choice of the adequate validity-computation method to be applied. Furthermore, in order to demonstrate the performance and the relevance of the suggested approach, a comparison with the classical modeling approach has been made.

Bibliography

[1] M. A. Al-Akhras, G. M. Aly, R. J. Green, Neural network learning approach of intelligent multimodel controller, IEE Proceedings on Control Theory and Applications, Vol. 143, pp. 395-400, 1996.
[2] I. S. Baruch, R. L. Beltran, J-L. Olivares, R. M. Garrido, A fuzzy-neural multi-model for mechanical systems identification and control, Mexican International Conference on Artificial Intelligence No. 3, Vol. 2972, pp. 774-783, 2004.
[3] I. S. Baruch, F. Thomas, J-M. Flores, E. Gortcheva, A multimodel recurrent neural network for systems identification and control, IEEE International Joint Conference on Neural Networks, Vol. 2, pp. 1291-1296, 2001.
[4] R. Ben Abdennour, P. Borne, M. Ksouri, F. M'sahli, Identification et commande numérique des procédés industriels, Editions Technip, Paris, France, 2001.
[5] P. Borne, M. Benrejeb, J. Haggège, Les réseaux de neurones, Editions Technip, Paris, France, 2007.
[6] F. Delmotte, L.
Dubois, P. Borne, A general scheme for multi-model controller using trust, Mathematics and Computers in Simulation, Vol. 41, pp. 173-186, 1996.
[7] A. El Kamel, M. Ksouri-Lahmari, P. Borne, Contribution to multimodel analysis and control, Studies in Informatics and Control, Vol. 9, pp. 29-38, 2000.
[8] T. A. Johansen, B. A. Foss, Editorial: Multiple model approaches to modelling and control, International Journal of Control, Vol. 72, p. 575, 1999.
[9] Z. Kardous, A. El Kamel, N. Ben Hadj Braiek, P. Borne, On the quadratic stabilization in discrete multimodel control, IEEE Conference on Control Applications, Vol. 2, pp. 1398-1403, 2003.
[10] T. Kohonen, The self-organizing map, IEEE Proceedings, Vol. 78, pp. 1464-1480, 1990.
[11] M. Ksouri-Lahmari, P. Borne, M. Benrejeb, Multimodel: the construction of model bases, Studies in Informatics and Control, Vol. 3, pp. 199-210, 2004.
[12] D. J. Leith, W. E. Leithead, Analytic framework for blended multiple model systems using linear local models, International Journal of Control, Vol. 72, pp. 605-619, 1999.
[13] S. Mezghani, A. El Kamel, P. Borne, Multimodel control of discrete systems with uncertainties, International Journal of Studies in Informatics and Control, 2000.
[14] K. S. Narendra, J. Balakrishnan, Adaptive control using multiple models, IEEE Transactions on Automatic Control, Vol. 42, pp. 171-187, 1997.
[15] J. C. Principe, L. Wang, M. A. Motter, Local dynamic modeling with self-organizing maps and application to nonlinear system identification and control, IEEE Proceedings, Vol. 86, pp. 2240-2258, 1998.
[16] T. Raissi, A. El Kamel, W. Byrski, P. Borne, Multimodel analysis and control of multivariable systems, IEEE International Conference on Systems, Man and Cybernetics, Vol. 1, pp. 640-645, 2001.
[17] M. Ronen, Y. Shabtai, H. Guterman, Hybrid model building methodology using unsupervised fuzzy clustering and supervised neural networks, Biotechnology and Bioengineering, Vol. 77, No. 4, pp. 420-429, 2002.
[18] N. Sadati, A. Talasaz, Robust fuzzy multimodel control using variable structure system, IEEE Conference on Cybernetics and Intelligent Systems, Vol. 1, pp. 497-502, 2004.
[19] R. Shorten, R. Murray-Smith, R. Bjorgan, H. Gollee, On the interpretation of local models in blended multiple model structures, International Journal of Control, Vol. 72, pp. 620-628, 1999.
[20] R. R. Selmic, F. L. Lewis, Multimodel neural networks identification and failure detection of nonlinear systems, IEEE Conference on Decision and Control, Vol. 4, pp. 3128-3133, 2001.
[21] T. Takagi, M. Sugeno, Fuzzy identification of systems and its applications to modelling and control, IEEE Transactions on Systems, Man and Cybernetics, Vol. 15, pp. 116-132, 1985.
[22] S. Talmoudi, R. Ben Abdennour, A. Kamel, P. Borne, A systematic determination approach of a models' base for uncertain systems: experimental validation, IEEE International Conference on Systems, Man and Cybernetics, Vol. 6, pp. 73-81, 2002.

Nesrine Elfelly (1), Jean-Yves Dieulot (2), Pierre Borne (3)
(1) Université des Sciences et Technologies de Lille (USTL), (2) Ecole Polytechnique de Lille, (3) Ecole Centrale de Lille (EC Lille)
Laboratoire d'Automatique, Génie Informatique et Signal, Ecole Centrale de Lille, Cité Scientifique, BP 48, 59651 Villeneuve d'Ascq Cedex, France
E-mail: nesrine.elfelly@ed.univ-lille1.fr, jean-yves.dieulot@polytech-lille.fr, pierre.borne@ec-lille.fr

Received: December 4, 2007

Nesrine Elfelly was born in Monastir, Tunisia, in 1982. She received the engineer diploma degree in computer engineering from the Ecole Nationale d'Ingénieurs de Tunis (Tunisia) in 2006. She obtained the master in automatic control, computer engineering and image processing from the Ecole Centrale de Lille (France) in 2007 and the master in automatic control and signal processing from the Ecole Nationale d'Ingénieurs de Tunis in the same year.
She is currently a PhD student in automatic control and applied computing within the framework of the LAGIS-USTL and LARA-ENIT cooperation. Her current research interests are the applications of neural approaches to complex systems analysis and control.

Jean-Yves Dieulot graduated from the French école d'ingénieurs Institut Industriel du Nord in 1990 and obtained a PhD from the University of Lille in 1993. He is currently associate professor at Ecole Polytechnique Universitaire de Lille, France, and with LAGIS (Laboratory of Automatic Control). His main interests are in fuzzy, nonlinear and robot control.

Pierre Borne was born in Corbeil, France, in 1944. He received the master degree of physics in 1967 and the masters of electronics, of mechanics and of applied mathematics in 1968. The same year he obtained the diploma of "Ingénieur IDN" (French "Grande Ecole"). He obtained the PhD in automatic control of the University of Lille in 1970 and the DSc in physics of the same university in 1976. He became doctor honoris causa of the Moscow Institute of Electronics and Mathematics (Russia) in 1999, of the University of Waterloo (Canada) in 2006 and of the Polytechnic University of Bucharest (Romania) in 2007. He is author or coauthor of about 200 journal articles and book chapters, of 35 plenary lectures and of about 250 communications in international conferences. He has been the supervisor of 69 PhD theses and is the author of 20 books. He is a Fellow of the IEEE and was president of the IEEE/SMC society in 2000 and 2001. He is presently professor "de classe exceptionnelle" at the Ecole Centrale de Lille and director of the French "plan pluriformation" of research in automatic control.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 4, pp. 388-402

A Toolbox for Input-Output System Inversion

Antonio Visioli, Aurelio Piazzi

Abstract: In this paper a Matlab-based toolbox for the input-output system inversion of linear systems is presented.
Different methods, based either on analytical or on numerical approaches, are implemented. The toolbox can be exploited in the design of a feedforward action for control systems in different contexts, in order to improve the performance in set-point regulation. The use of pre-actuation and post-actuation times can easily be analyzed, as well as the role played by the choice of the desired output function.

Keywords: CACSD, input-output inversion, feedforward, set-point regulation, optimization.

1 Introduction

It is well known that a (properly designed) feedback controller provides robustness to a control system with respect to parameter variations and makes it possible to compensate for external disturbances. On the other hand, a high performance in the set-point following task can be achieved by adopting a suitable feedforward action. Indeed, the proper design of a control system consists of suitably combining feedback and feedforward control. Different techniques have been developed for the synthesis of a feedforward controller for a linear system (see e.g. [1, 2, 3]). When set-point regulation is of concern, a (noncausal) system-inversion approach has proven to be effective in this context [4]-[13]. Basically, the approach consists of selecting a desired output function, in order to achieve a transition from a current output value y0 to a new one y1, and then determining the corresponding input function by applying a stable inversion procedure. The calculated input function is then adopted as the reference command input to the (closed-loop) system, instead of the typical step signal. Actually, while many software packages are available for the synthesis of feedback controllers (for example, via root-locus techniques or Bode plots), they are not available for the synthesis of a system-inversion-based feedforward action.
Indeed, the availability of a computer-aided control systems design tool makes the applicability of these (somewhat complex) techniques much easier, and it can be exploited to understand more deeply the role of the command function in the context of set-point regulation. In this paper a Matlab-based toolbox for the input-output system inversion of a linear system is presented. Different techniques are considered, related both to a noncausal and to a causal approach. The toolbox allows the evaluation of the role of the pre-actuation and post-actuation times, as well as of the choice of the desired output function. It can be adopted as a useful tool in different fields, such as robust control, process control and the control of mechatronic systems. The paper is organized as follows. In Section 2 the input-output system inversion approach is briefly reviewed and the different methodologies implemented in the toolbox are presented. The functions implemented in the toolbox are described in Section 3 and application examples are shown in Section 4. Conclusions are drawn in Section 5.

Notation. C^i denotes the set of scalar real functions that are continuous up to the i-th derivative, and BC^i denotes the subset of C^i of the scalar real functions that are bounded. The i-th order differential operator is D^i.

2 Input-output system inversion

Consider a general asymptotically stable nonminimum-phase linear system Sigma described by the following rational transfer function (note that this might represent the transfer function of a feedback control system):

H(s) = k1 b(s)/a(s) = k1 (s^m + b_{m-1} s^{m-1} + ... + b_0) / (s^n + a_{n-1} s^{n-1} + ... + a_0),  k1 != 0,   (1)

where it is assumed that the polynomials a(s) and b(s) are coprime (no pole-zero cancellations occur) and that Sigma has no purely imaginary zeros.
The input and output of $\Sigma$ are $u \in \mathbb{R}$ and $y \in \mathbb{R}$ respectively, and the relative order (or relative degree) of $\Sigma$ is $\rho := n - m$. The set of all cause/effect pairs associated with $\Sigma$ is denoted by
$$B := \{(u(\cdot), y(\cdot)) \in PC \times PC : D^n y + a_{n-1}D^{n-1}y + \cdots + a_0 y = k_1(D^m u + b_{m-1}D^{m-1}u + \cdots + b_0 u)\} \qquad (2)$$
where $PC$ denotes the set of piecewise continuous functions defined over $(-\infty, +\infty)$, i.e. the real field $\mathbb{R}$. In the framework of the behavioral approach, $B$ is the behavior set of $\Sigma$, which can be rigorously introduced by means of the so-called weak solutions of the differential equation associated with $\Sigma$ [14]. The following proposition [14] is useful in the development of the subsequent analysis.

Proposition 1. Consider any pair $(u(\cdot), y(\cdot)) \in B$. Then $u(\cdot) \in C^l(\mathbb{R})$ if and only if $y(\cdot) \in C^{\rho+l}(\mathbb{R})$, with $l$ a nonnegative integer.

The considered regulation problem consists of obtaining an output transition from a previous value $y_0$ to a new value $y_1$. Without loss of generality, in the following we consider $y_0 = 0$. Define $y_d(\cdot) \in BC^k$ with $y_d(t) = 0$ for $t < 0$ as the desired output function for the transition. From a practical point of view, a transition time $\tau$ has to be defined, i.e. the desired output function is defined as
$$y_d(t) := \begin{cases} 0 & t < 0 \\ y_{01}(t) & 0 \le t \le \tau \\ y_1 & t > \tau. \end{cases} \qquad (3)$$
Then, the following stable input-output inversion (SIOI) problem can be formulated.

SIOI problem. Determine an input function $u_d(\cdot) \in BC^{k-\rho}$ such that
$$(u_d(\cdot), y_d(\cdot)) \in B. \qquad (4)$$
The general solution to the SIOI problem can be derived as follows [15]. First, express the inverse of the transfer function (1) as
$$h^{-1}(s) = \frac{1}{k_1}\frac{a(s)}{b(s)} = \xi_\rho s^\rho + \xi_{\rho-1}s^{\rho-1} + \cdots + \xi_0 + h_0(s) \qquad (5)$$
where $h_0(s)$ is a strictly proper rational transfer function representing the zero dynamics of $\Sigma$.
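As an illustrative sketch (not part of the toolbox itself), the decomposition in (5) amounts to a polynomial long division of $a(s)$ by $k_1 b(s)$; using the example system (19) introduced later in Section 4, for which $\rho = 1$:

```python
import numpy as np

# Example system (19): k1*b(s) = 4(1-s)(s+1) = -4s^2 + 4,
# a(s) = (s+2)(s^2+2s+2) = s^3 + 4s^2 + 6s + 4
a = [1.0, 4.0, 6.0, 4.0]
k1b = [-4.0, 0.0, 4.0]

# Long division realizes Eq. (5): the quotient gives the polynomial part
# xi_rho*s^rho + ... + xi_0, and remainder/(k1*b) is the strictly proper h0(s)
xi, rem = np.polydiv(a, k1b)
```

Here the quotient is $-0.25s - 1$ (so $\xi_1 = -0.25$, $\xi_0 = -1$) and the remainder $7s + 8$, i.e. $h_0(s) = (7s+8)/(-4s^2+4)$.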
By using the partial fraction expansion, $h_0(s)$ can be decomposed as
$$h_0(s) = h_0^-(s) + h_0^+(s) = \frac{d(s)}{b^-(s)} + \frac{e(s)}{b^+(s)} \qquad (6)$$
where $b^-(s)$ and $b^+(s)$ are the monic polynomials containing the roots of $b(s)$ with negative and positive real parts, respectively. Define $\eta_0^-(t)$ and $\eta_0^+(t)$ as the analytic extensions of $\mathcal{L}^{-1}[h_0^-(s)]$ and $\mathcal{L}^{-1}[h_0^+(s)]$ over the space of the Bohl functions, for which $\eta_0^-(t)1(t) = \mathcal{L}^{-1}[h_0^-(s)]$ and $\eta_0^+(t)1(t) = \mathcal{L}^{-1}[h_0^+(s)]$ respectively. Then, the solution of the SIOI problem is derived as
$$u_d(t) = \xi_\rho D^\rho y_d(t) + \cdots + \xi_1 D y_d(t) + \xi_0 y_d(t) + \int_0^t \eta^-(t-v)\,y_d(v)\,dv - \int_t^{+\infty} \eta^+(t-v)\,y_d(v)\,dv. \qquad (7)$$
It is worth noting that, in general, $u_d(t)$ is defined over the time interval $(-\infty, +\infty)$ and therefore, in order to use it in practice, it has to be truncated. Thus, the input function exhibits a pre-actuation time interval (associated with the unstable zeros) and a post-actuation time interval (associated with the stable zeros) (see for example [16]), denoted as $t_p$ and $t_f$ respectively. They can be calculated with arbitrary precision by selecting two arbitrarily small parameters $\varepsilon_0$ and $\varepsilon_1$ and by subsequently determining
$$t_0 := \max\{t' \in \mathbb{R} : |u_d(t)| \le \varepsilon_0 \;\; \forall t \in (-\infty, t']\} \qquad (8)$$
and
$$t_1 := \min\{t' \in \mathbb{R} : |u_d(t) - y_1/h(0)| \le \varepsilon_1 \;\; \forall t \in [t', \infty)\}. \qquad (9)$$
Then, one fixes
$$t_p = \min\{0, t_0\}, \qquad t_f = \max\{\tau, t_1\}. \qquad (10)$$
Hence, the actual input function to be applied to the system is given by
$$\tilde{u}_d(t) := \begin{cases} 0 & t < t_p \\ u_d(t) & t_p \le t \le t_f \\ y_1/h(0) & t > t_f. \end{cases} \qquad (11)$$
Alternatively, the pre-actuation and post-actuation time intervals can be calculated as [4]
$$t_p = -\frac{10}{d_{RHP}}, \qquad t_f = \frac{10}{d_{LHP}} \qquad (12)$$
where $d_{RHP}$ and $d_{LHP}$ are the minimum distances of the right and left half-plane zeros, respectively, from the imaginary axis of the complex plane. It is worth noting that the pre-actuation time is zero when there are no unstable zeros and the post-actuation time is zero when there are no stable zeros.
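The zero-distance rule (12) can be computed from the numerator polynomial alone. The following Python sketch (the helper name is ours, not a toolbox function) reproduces the values $t_p = -10$ s and $t_f = 10$ s obtained for the example system (19) in Section 4:

```python
import numpy as np

def actuation_times(num_coeffs):
    """Pre/post-actuation times from the zero-distance rule of Eq. (12)."""
    zeros = np.roots(num_coeffs)
    rhp = [z.real for z in zeros if z.real > 0]   # unstable zeros
    lhp = [-z.real for z in zeros if z.real < 0]  # stable zeros
    tp = -10.0 / min(rhp) if rhp else 0.0
    tf = 10.0 / min(lhp) if lhp else 0.0
    return tp, tf

# Numerator of system (19): 4(1-s)(s+1) = -4s^2 + 4, zeros at s = 1 and s = -1
tp, tf = actuation_times([-4.0, 0.0, 4.0])
```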
In general, the integrals in expression (7) cannot be solved analytically and therefore a numerical solution has to be determined (in this context the Cavalieri-Simpson rule can be conveniently exploited to find an accurate solution with a small computational time [15]). A notable exception occurs when the selected desired output function is a polynomial function [17], i.e.
$$y_d(t;\tau) = \begin{cases} 0 & t < 0 \\[1mm] y_1 \dfrac{(2k+1)!}{k!\,\tau^{2k+1}} \displaystyle\sum_{i=0}^{k} \dfrac{(-1)^{k-i}\,\tau^i\,t^{2k-i+1}}{i!\,(k-i)!\,(2k-i+1)} & 0 \le t \le \tau \\[1mm] y_1 & t > \tau. \end{cases} \qquad (13)$$
Note that the function $y_d(t;\tau)$, parameterized by the transition time $\tau$, is a $C^k$ function over $(-\infty, +\infty)$ and is strictly increasing in the interval $[0, \tau]$, so that neither overshooting nor undershooting appears in this output planning for set-point regulation. In this case an analytical solution can be found [18], and this fact can be exploited to speed up the computation and, most of all, to avoid numerical problems.

A very interesting application of the analytical stable inversion procedure is the improvement of the set-point following performance of proportional-integral-derivative (PID) controllers. Specifically, if a PID controller is employed for a first-order-plus-dead-time (FOPDT) process, described by the transfer function
$$p(s) = \frac{K}{Ts+1}e^{-Ls}, \qquad (14)$$
or for an integrator-plus-dead-time (IPDT) process, described by the transfer function
$$p(s) = \frac{K}{s}e^{-Ls}, \qquad (15)$$
then a closed-form solution of the stable input-output inversion applied to the closed-loop system can be exploited (a rational closed-loop transfer function is obtained by adopting a Padé approximation) [13]. Indeed, the actual command signal to be applied for a given process and a given PID controller is determined by substituting the actual values of the process and PID parameters into the resulting closed-form expression.
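The Padé approximation mentioned above replaces the dead time $e^{-Ls}$ with a rational function. As a minimal sketch, assuming a first-order approximant (the paper does not state which order the toolbox adopts), one can verify on the imaginary axis that the approximant is all-pass and close to the exact dead-time term at low frequency:

```python
import cmath

def pade1(L, s):
    # First-order Pade approximation of the dead time exp(-L*s):
    # (1 - L*s/2) / (1 + L*s/2), an all-pass rational function
    return (1.0 - L * s / 2.0) / (1.0 + L * s / 2.0)

s = 0.1j                      # a low-frequency point on the imaginary axis
approx = pade1(5.0, s)        # L = 5, as in the examples of Section 4
exact = cmath.exp(-5.0 * s)
err = abs(approx - exact)     # small phase mismatch only
```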
In this context the PID transfer function is expressed as
$$c(s) = K_p\left(1 + \frac{1}{T_i s} + T_d s\right)\frac{1}{T_f s + 1}, \qquad (16)$$
where $K_p$ is the proportional gain, $T_i$ is the integral time constant, $T_d$ is the derivative time constant and $T_f$ is the time constant of the filter adopted to render the controller transfer function proper.

A polynomial output function can also be usefully exploited to determine a causal input-output inversion despite the presence of unstable zeros [9]. In particular, the order of the polynomial function is selected so as to satisfy boundary conditions such that $y_d(\cdot) \in BC^k$ with $k \ge \rho$ and, at the same time, to have a number of free coefficients equal to the number of unstable zeros of the system $\Sigma$. Then, the free parameters are determined in order to annihilate the unstable modes in the input function obtained by the inversion procedure. In this way there is no need for a pre-actuation time interval and the resulting inversion is causal. It can therefore be employed when a preview time is not available in a given application. However, this is paid for by the possible presence of undershoots and overshoots in the resulting output function. Note that the approach can be easily extended to also avoid the presence of a post-actuation time interval.

3 Toolbox Description

The designed MATLAB-based toolbox implements the methods described in Section 2. It requires the Control System Toolbox and the Symbolic Math Toolbox of MATLAB. The following main functions are available.

[time,command,preaction,postaction]=numdyninvcs(sys,yd,tau,st)

This function determines the input command function of a system that causes a desired output function by means of an input-output inversion scheme based on the use of the Cavalieri-Simpson rule for the determination of the integrals in (7) [15].
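The advantage of the Cavalieri-Simpson rule over the plain rectangular rule (used by the toolbox variants described next) can be illustrated on a simple test integral; this Python sketch (the helper names are ours) shows the accuracy gap at the same number of samples:

```python
def rectangular(f, a, b, n):
    # Left-rectangular (Riemann) rule with n subintervals
    h = (b - a) / n
    return h * sum(f(a + i * h) for i in range(n))

def simpson(f, a, b, n):
    # Composite Cavalieri-Simpson rule; n must be even
    h = (b - a) / n
    s = f(a) + f(b)
    s += 4.0 * sum(f(a + i * h) for i in range(1, n, 2))
    s += 2.0 * sum(f(a + i * h) for i in range(2, n, 2))
    return s * h / 3.0

f = lambda t: t ** 3   # test integrand; exact integral over [0, 1] is 1/4
err_rect = abs(rectangular(f, 0.0, 1.0, 100) - 0.25)
err_simp = abs(simpson(f, 0.0, 1.0, 100) - 0.25)
```

With the same 100 samples, the Simpson error is essentially at machine precision while the rectangular error is of the order of the step size, which is why the rectangular-rule functions require a much smaller sampling time.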
In particular, the meaning of the parameters is the following:
• sys is the transfer function of the system expressed in symbolic form (with s as a symbolic variable);
• yd is the desired output function (for $t \in [0, \tau]$) expressed as an array of numerical values from 0 to $\tau$, corresponding to time instants equally spaced by the sampling time;
• tau is the transition time;
• st is the sampling time;
• time is the output time vector; it starts from the pre-actuation time $t_p$, but for convenience the zero time is shifted to $t_p$;
• command is the determined input function, expressed as a numerical array corresponding to the time array time;
• preaction is the pre-actuation time calculated with formula (12);
• postaction is the post-actuation time calculated with formula (12).

[time,command,preaction,postaction]=outdyninvcs(sys,yd,tau,st)

This function operates basically as numdyninvcs, with the difference that the desired output function yd is expressed as a symbolic expression with symbolic variable t.

[time,command,preaction,postaction]=numdyninv(sys,yd,tau,st)

This function is very similar to numdyninvcs, but it performs the numerical integration by applying a rectangular rule. In order to obtain an accurate result, a small value of the sampling time has to be selected; this might result in a high computational time.

[time,command,preaction,postaction]=outdyninv(sys,yd,tau,st)

This function is very similar to outdyninvcs, but it performs the numerical integration by applying a rectangular rule. Also in this case, in order to obtain an accurate result, a small value of the sampling time has to be selected, which might result in a high computational time.

[time,command,preaction,postaction]=dyninv(sys,y1,tau,threshold0,threshold1,st)

This function solves the input-output inversion problem when the desired output function is a polynomial function (13).
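The polynomial transition function (13) used by dyninv can be evaluated directly; a minimal Python sketch (our helper, not a toolbox function), which for k = 1 reproduces the cubic reported in Section 4:

```python
from math import factorial

def yd(t, tau, y1, k=1):
    """Polynomial transition function of Eq. (13), C^k over the whole axis."""
    if t <= 0.0:
        return 0.0
    if t >= tau:
        return y1
    c = y1 * factorial(2 * k + 1) / (factorial(k) * tau ** (2 * k + 1))
    return c * sum((-1) ** (k - i) * tau ** i * t ** (2 * k - i + 1)
                   / (factorial(i) * factorial(k - i) * (2 * k - i + 1))
                   for i in range(k + 1))

half = yd(1.5, 3.0, 1.0)   # midpoint of a transition with tau = 3, y1 = 1
```

For k = 1 this is exactly $y_1(-2t^3/\tau^3 + 3t^2/\tau^2)$, which passes through $y_1/2$ at mid-transition and is strictly increasing on $[0, \tau]$.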
The order of the polynomial is automatically selected, according to Proposition 1, in order to obtain a continuous input function, i.e. such that $u_d(\cdot) \in BC^0$. Since the input function is determined analytically, the pre-actuation and post-actuation time intervals are conveniently determined by adopting formulae (10). The function parameters that differ from those already described have the following meaning:
• y1 is the desired new output steady-state value (it is assumed, without loss of generality, that the current input and output steady-state values are zero);
• threshold0 is the parameter $\varepsilon_0$ in formula (8), which is adopted to calculate the pre-actuation time;
• threshold1 is the parameter $\varepsilon_1$ in formula (9), which is adopted to calculate the post-actuation time.

[time,command,taunum,preaction,postaction]=optdyninv(sys,y1,constraints,threshold0,threshold1,tc)

This function solves the minimum-time inversion problem, which consists of finding the minimum output transition time subject to constraints posed on the input function and its derivatives up to an arbitrary order $l$. Formally, the optimisation problem is posed as follows [18]:
$$\min_{\tau \in \mathbb{R}^+} \tau \qquad (17)$$
such that, for $i = 0, 1, \ldots, l$,
$$|D^i u_d(t;\tau)| \le u_m^{(i)} \quad \forall t \in (-\infty, +\infty) \qquad (18)$$
where the positive values $u_m^{(i)}$, $i = 0, 1, \ldots, l$, are given bounds of the problem. Note that the problem admits a solution if $u_m^{(0)} > 1/|h(0)|$ and $u_m^{(i)} > 0$, $i = 1, \ldots, l$. The optimisation problem is solved by applying a simple bisection algorithm in conjunction with a gridding of the time axis [17].
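The bisection idea can be sketched as follows. For illustration only, the feasibility test here bounds the derivatives of the cubic output function (13) with k = 1, whose peak values $1.5\,y_1/\tau$ and $6\,y_1/\tau^2$ are known in closed form; the toolbox instead constrains the derivatives of the input $u_d$, which is what requires the gridding step:

```python
def feasible(tau, y1, d1_max, d2_max):
    # Peak |y'| = 1.5*y1/tau and peak |y''| = 6*y1/tau^2 for the cubic
    # transition; both peaks decrease with tau, so feasibility is monotone.
    return 1.5 * y1 / tau <= d1_max and 6.0 * y1 / tau ** 2 <= d2_max

def min_transition_time(y1, d1_max, d2_max, lo=1e-6, hi=1e6, tol=1e-9):
    """Bisection on tau exploiting the monotone feasibility predicate."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if feasible(mid, y1, d1_max, d2_max):
            hi = mid      # feasible: try a shorter transition
        else:
            lo = mid      # infeasible: the transition must be longer
    return hi

tau_a = min_transition_time(1.0, 1.5, 6.0)   # both constraints active at tau = 1
tau_b = min_transition_time(1.0, 1.5, 1.5)   # second-derivative bound dominates
```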
With respect to the function dyninv, there are the following different parameters:
• constraints is the array (of $l+1$ elements) of the constraints on the input derivatives up to the $l$th order; note that the user-chosen dimension of the array automatically determines the order of the constrained derivatives and therefore the order of the output polynomial function (which is determined as $l - 1 + \rho$, so that $u_d(\cdot) \in BC^{l-1}$);
• taunum is the resulting optimal transition time.

It is worth noting that, if a rigorous determination of the transition time is sought, the posed optimisation problem should be approached with the tools of global optimisation. In this context the presented input-output inversion toolbox can be easily integrated with the b4m toolbox, which handles interval arithmetic, a well-known effective tool for global optimisation [19].

[time,command,out,postaction]=causaldyninv(sys,y1,tau,st,hbc,pa)

This function implements the causal approach proposed in [9]. The resulting pre-actuation time is always zero despite the possible presence of unstable zeros. In particular, the function deals with the following parameters:
• hbc is the order $h$ of the boundary conditions to be satisfied by the polynomial output function at times $t = 0$ and $t = \tau$, so that $y_d(\cdot) \in BC^{2h+1}$. Note that $h \ge \rho$ is required in order to ensure that the input function is at least continuous, i.e. $u_d(\cdot) \in BC^0$;
• pa is a string that, if set to 'nopostaction', also avoids the use of a post-actuation time even if the system has stable zeros. In other words, in this case the system attains an equilibrium point at $t = \tau$. If the parameter is not given, or if it is set to another value, then a post-actuation time is present and it is determined by means of formula (12).
[time,command,preaction,postaction]=piddyninvfopdt(k,t,l,kp,ti,td,tf,tau,st)

This function determines the input command function to a unity-feedback closed-loop system in which a process described by a FOPDT transfer function is controlled by a PID controller. The following parameters are adopted:
• k, t, l are the process gain, time constant and dead time respectively (see (14));
• kp, ti, td, tf are the PID parameters (see (16), where the meaning of the different parameters is obvious).

[time,command,preaction,postaction]=piddyninvipdt(k,t,l,kp,ti,td,tf,tau,st)

This function determines the input command function to a unity-feedback closed-loop system in which a process described by an IPDT transfer function is controlled by a PID controller. The meaning of the parameters can be straightforwardly derived by considering those of the previous function together with expression (15).

4 Application Examples

In order to better understand the usage of the input-output inversion toolbox, a few examples are given. Consider the system
$$h(s) = 4\,\frac{(1-s)(s+1)}{(s+2)(s^2+2s+2)}. \qquad (19)$$
To insert the system in symbolic form into the MATLAB workspace, the following two commands can be applied:
syms s
h=4*(1-s)*(s+1)/(s+2)/(s^2+2*s+2)
Then, suppose that an output transition from 0 to $y_1 = 1$ is required to be performed in $\tau = 3$ by means of the following output function:
$$y(t) = \frac{4796646617206209}{562949953421312}\,t^{63/25} - \frac{4658008624178539}{562949953421312}\,t^{127/50} + \frac{6313836048447483}{5316911983139663491615228241121378304}\,t^{39}. \qquad (20)$$
The output function can be inserted into the MATLAB workspace (denote the variable as yt) either in symbolic form or as an array whose elements are the values of $y(t)$ for $t = 0, t_s, 2t_s, \ldots, \tau$, where $t_s$ is the sampling time (in this case $t_s = 0.001$ has been selected).
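Since, after the post-actuation interval, the command input (11) settles at $y_1/h(0)$, the final value of the command in this example can be checked by evaluating the DC gain of (19); a quick sketch:

```python
def h(s):
    # The example system (19)
    return 4.0 * (1.0 - s) * (s + 1.0) / ((s + 2.0) * (s * s + 2.0 * s + 2.0))

dc_gain = h(0.0)           # h(0) = 4*1*1/(2*2) = 1
u_steady = 1.0 / dc_gain   # steady-state command for y1 = 1, cf. Eq. (11)
```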
In the first case the command to be adopted is:
[time,command,preaction,postaction]=outdyninvcs(h,yt,3,0.001)
while in the second case it is:
[time,command,preaction,postaction]=numdyninvcs(h,yt,3,0.001)
In both cases the pre-actuation time results in $t_p = -10$ s and the post-actuation time in $t_f = 10$ s (according to expressions (12)). The resulting input function and the output function obtained by applying it to the actual system are reported in Figure 1 (note that the result is the same in both cases).

Now select a polynomial output function (13) to perform again an output transition from 0 to $y_1 = 1$, and select the parameters $\varepsilon_0 = \varepsilon_1 = 10^{-3}$. In this case the MATLAB command to be adopted is:
[time,command,preaction,postaction]=dyninv(h,1,3,0.001,0.001,0.001)
The resulting pre-actuation and post-actuation times preaction and postaction (determined by means of formula (10)) are $t_p = -6.256$ s and $t_f = 3.547$ s respectively. The determined input and the corresponding system output are plotted in Figure 2. Note that the resulting output function is the cubic polynomial
$$y_d(t;\tau) = y_1\left(-\frac{2}{\tau^3}t^3 + \frac{3}{\tau^2}t^2\right), \qquad t \in [0, \tau],$$
since $k = \rho = 1$ in formula (13).

Consider now the minimum-time problem (17)-(18) and set the limits on the input derivatives as $u_m^{(0)} = 2$, $u_m^{(1)} = 10$ and $u_m^{(2)} = 20$. This means that the following MATLAB command has to be executed:
limits=[2 10 20]
Then, the following function has to be employed (note that the sampling time is 0.001 s as before, and again $\varepsilon_0 = \varepsilon_1 = 10^{-3}$):
[time,command,taunum,pre,post]=optdyninv(h,1,limits,0.001,0.001,0.001)
The resulting optimal transition time taunum is equal to 0.875 s, while the pre-actuation and post-actuation times are $t_p = -7.111$ s and $t_f = 4.402$ s. The determined command input and the corresponding system output are shown in Figure 3, while the first and second derivatives of the command input are plotted in Figure 4.
It can be deduced that the active constraint is the one related to the second time derivative of the input.

If the causal approach is desired, i.e. the pre-actuation time is to be avoided, then the function causaldyninv has to be adopted. In particular, we select the order of the boundary conditions as $h = \rho = 1$ and the desired transition time as $\tau = 3$. Then, if a post-actuation time interval is allowed, the MATLAB command to be adopted is:
[time,command,out,postaction]=causaldyninv(h,1,3,0.001,1)
In this case the resulting command input and the corresponding output are plotted in Figure 5. Note that the post-actuation time interval is $t_f = 10$ s. Conversely, if a post-actuation is not allowed, the MATLAB function to be employed is:
[time,command,out,postaction]=causaldyninv(h,1,3,0.001,1,'nopostaction')
The resulting command input and the corresponding output function are shown in Figure 6. It can be seen that in both cases the output function is not monotonic. Indeed, the possible overshoots and undershoots become more and more significant as the selected transition time decreases.

In order to verify the effectiveness of the inversion approach for PID control, consider first the system
$$p(s) = \frac{2}{10s+1}e^{-5s}, \qquad (21)$$
controlled by a PID controller (see (16)) with $K_p = 1.2$, $T_i = 10$, $T_d = 2.5$, $T_f = 0.01$. The transition time is fixed to 15 s and the sampling time to 0.01 s. Then, the following MATLAB command is adopted:
[time,command,pre,post]=piddyninvfopdt(2,10,5,1.2,10,2.5,0.01,15,0.01)
The resulting command input (applied to the closed-loop system) and output functions are plotted in Figure 7. The pre-actuation and post-actuation time intervals are $t_p = -16.67$ s and $t_f = 50.01$ s. Note that the output function is slightly different from the desired one because of the Padé approximation. Then, an IPDT transfer function is considered:
$$p(s) = \frac{0.1}{s}e^{-5s}. \qquad (22)$$
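For reference, the controller transfer function (16) with the parameters of the FOPDT example above can be evaluated at any complex frequency; a minimal pure-Python sketch (no Control System Toolbox required):

```python
def pid(s, kp, ti, td, tf):
    # PID with first-order filter, as in Eq. (16):
    # c(s) = kp * (1 + 1/(ti*s) + td*s) / (tf*s + 1)
    return kp * (1.0 + 1.0 / (ti * s) + td * s) / (tf * s + 1.0)

# Parameters of the FOPDT example: Kp = 1.2, Ti = 10, Td = 2.5, Tf = 0.01,
# evaluated here (for illustration) at the real point s = 1
val = pid(1.0, 1.2, 10.0, 2.5, 0.01)
```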
In this case the tuning of the PID parameters is $K_p = 0.12$, $T_i = 10$, $T_d = 2.5$, $T_f = 0.01$. The transition time and the sampling time are the same as before, so the following function is employed:
[time,command,pre,post]=piddyninvipdt(0.1,10,5,0.12,10,2.5,0.01,15,0.01)
Figure 8 reports the determined closed-loop command input and the resulting system output. Also in this case the pre-actuation and post-actuation time intervals are $t_p = -16.67$ s and $t_f = 50.01$ s.

5 Conclusions

A MATLAB-based toolbox for the input-output inversion of linear scalar systems has been described in this paper. The toolbox is very useful for understanding and verifying the effectiveness of a feedforward action in a control scheme and, in general, of a combined feedback/feedforward synthesis. In this context, the role played by the planned output function can be easily evaluated, as well as the influence of the noncausal approach with respect to a causal one. Indeed, all the aspects related to the use of an input-output inversion-based control design can be thoroughly investigated and analyzed. The toolbox can be exploited in different fields such as motion control, robust control, and process control.
It can be downloaded from the website http://www.ing.unibs.it/∼visioli/ioitoolbox.htm

[Figure 1: Command input and resulting system output with system (19) and desired output function (20)]
[Figure 2: Command input and resulting system output with system (19) and a polynomial desired output function (13)]
[Figure 3: Optimal command input and resulting minimum-time system output with system (19) and a polynomial desired output function (13)]
[Figure 4: First and second derivatives of the optimal command input with system (19) and a polynomial desired output function (13)]
[Figure 5: Command input and resulting system output with system (19) and a causal approach with post-actuation]
[Figure 6: Command input and resulting system output with system (19) and a causal approach without post-actuation]
[Figure 7: Command input and resulting system output with system (21) with a PID controller]
[Figure 8: Command input and resulting system output with system (22) with a PID controller]

Bibliography

[1] B. C. Kuo, Automatic Control Systems, Prentice Hall, Englewood Cliffs (NJ), 1995.
[2] A. Wallen, K. J.
Åström, Pulse-step control, Preprints of the 15th IFAC World Congress on Automatic Control, Barcelona (Spain), 2002.
[3] A. Visioli, A new design for a PID plus feedforward controller, Journal of Process Control, vol. 14, no. 4, pp. 455-461, 2004.
[4] H. Perez, S. Devasia, Optimal output transitions for linear systems, Automatica, vol. 39, pp. 181-192, 2003.
[5] Q. Zou, S. Devasia, Preview-based optimal inversion for output tracking: application to scanning tunneling microscopy, Proceedings of the IEEE International Conference on Decision and Control, Las Vegas (USA), pp. 79-85, 2002.
[6] D. Iamratanakul, H. Perez, S. Devasia, Feedforward trajectory design for output transitions in discrete-time systems: disk-drive example, Proceedings of the American Control Conference, Denver (USA), pp. 3142-3147, 2003.
[7] A. Piazzi, A. Visioli, Minimum-time system-inversion-based motion planning for residual vibration reduction, IEEE/ASME Transactions on Mechatronics, vol. 5, no. 1, pp. 12-22, 2000.
[8] A. Piazzi, A. Visioli, Optimal inversion-based control for the set-point regulation of nonminimum-phase uncertain scalar systems, IEEE Transactions on Automatic Control, vol. 46, no. 10, pp. 1654-1659, 2001.
[9] M. Benosman, G. Le Vey, Stable inversion of SISO nonminimum phase linear systems through output planning: an experimental application to the one-link flexible manipulator, IEEE Transactions on Control Systems Technology, vol. 11, no. 4, pp. 588-597, 2003.
[10] C. Guarino Lo Bianco, A. Piazzi, A servo control system design using dynamic inversion, Control Engineering Practice, vol. 10, no. 8, pp. 847-855, 2002.
[11] A. Piazzi, A. Visioli, Optimal dynamic-inversion-based control of an overhead crane, IEE Proceedings - Control Theory and Applications, vol. 149, no. 5, pp. 405-411, 2002.
[12] A. Visioli, A. Piazzi, Improving set-point following performance of industrial controllers with a fast dynamic inversion algorithm, Industrial & Engineering Chemistry Research, vol. 42, pp.
1357-1362, 2003.
[13] A. Piazzi, A. Visioli, A noncausal approach for PID control, Journal of Process Control, vol. 16, pp. 831-843, 2006.
[14] J. W. Polderman, J. C. Willems, Introduction to Mathematical Systems Theory, Springer-Verlag, New York, 1998.
[15] D. Pallastrelli, A. Piazzi, Stable dynamic inversion of nonminimum-phase scalar linear systems, Preprints of the 16th IFAC World Congress on Automatic Control, Prague (CZ), 2005.
[16] Q. Zou, S. Devasia, Preview-based inversion of nonlinear nonminimum-phase systems: VTOL example, Proceedings of the IEEE International Conference on Decision and Control, Paradise Island (The Bahamas), pp. 4350-4356, 2004.
[17] A. Piazzi, A. Visioli, Optimal noncausal set-point regulation of scalar systems, Automatica, vol. 37, no. 1, pp. 121-127, 2001.
[18] A. Piazzi, A. Visioli, Using stable input-output inversion for minimum-time feedforward constrained regulation of scalar systems, Automatica, vol. 41, no. 2, pp. 305-313, 2005.
[19] E. Hansen, G. W. Walster, Global Optimization Using Interval Analysis, 2nd edition, Marcel Dekker, 2003.

Antonio Visioli
University of Brescia
Dipartimento di Elettronica per l'Automazione
Via Branze 38, I-25123 Brescia, Italy
E-mail: antonio.visioli@ing.unibs.it

Aurelio Piazzi
University of Parma
Dipartimento di Ingegneria dell'Informazione
Parco Area delle Scienze 181A, I-43100 Parma, Italy

Received: January 2, 2007

Antonio Visioli received the Laurea degree in electronic engineering from the University of Parma, Parma, Italy, and the Ph.D. degree in applied mechanics from the University of Brescia, Brescia, Italy, in 1995 and 1999 respectively. His Ph.D. dissertation was on control strategies for industrial robot manipulators. He is currently an Associate Professor of Automatic Control, Department of Electronics for Automation, University of Brescia, Italy.
His research interests include industrial robot control and trajectory planning, dynamic-inversion-based control and process control. He is the author or co-author of more than 100 papers in international journals and refereed conference proceedings, and he is the author of the book Practical PID Control published by Springer. Dr. Visioli is a Senior Member of IEEE and a member of IFAC and ANIPLA (Italian Association for Automation).

Aurelio Piazzi received the Laurea degree in nuclear engineering in 1982 and the Ph.D. degree in system engineering in 1987, both from the University of Bologna, Italy. From 1990 to 1992 he was a research associate in system theory at DEIS, University of Bologna. Since 1992 he has been at the University of Parma, where he is currently Full Professor of Automatic Control at the Department of Information Engineering. His main research interests are in system and control theory and related engineering applications. His recent research activities have focused on optimization and dynamic inversion techniques for autonomous vehicle guidance and for the design of high-performance control systems. As scientific coordinator of bilateral research programs in collaboration with various industries (among them CNR, ENEL, RFI - Ferrovie dello Stato), in 2002 and 2003 he directed the European project COOKIES within the EU cluster EUTIST-IMV (Integrated Machine Vision), in collaboration with Gruppo Colussi (Perugia), for the artificial-vision-based control of food industrial ovens. He is a member of IEEE and SIAM and has published over 90 scientific papers in international journals and conference proceedings.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 892-899

Information Sharing in Vehicular Adhoc Network

A. Rahim, Z.S. Khan, F.B. Muhaya, M. Sher, M.K. Khan

Aneel Rahim, Zeeshan Shafi Khan
1. Prince Muqrin Chair for IT Security, King Saud University, Saudi Arabia
2.
International Islamic University, Pakistan
E-mail: aneelrahim,zeeshanshafi@ksu.edu.sa

Fahad Bin Muhaya
Prince Muqrin Chair for IT Security, King Saud University, Saudi Arabia
E-mail: fmuhaya@ksu.edu.sa

Muhammad Sher
International Islamic University, Pakistan
E-mail: m.sher@iiu.edu.pk

Muhammad Khurram Khan
Center of Excellence in Information Assurance, King Saud University, Saudi Arabia
E-mail: mkhurram@ksu.edu.sa

Abstract: The relevance technique broadcasts useful information and removes redundant data. The 802.11e protocol implementation has certain flaws and is not suitable for VANET scenarios. The main issues in the 802.11e protocol are the lack of internal sorting of packets, the absence of a priority mechanism within the queues, and the fact that lower-priority traffic often gets more medium access than high-priority traffic. In this paper, the mathematical model of the relevance scheme is enhanced so that it can account for network control traffic in a realistic scenario by considering the impact of malicious nodes in the network. The problems of the 802.11e protocol can be resolved by introducing virtual queues at the application level. We compare the simple virtual queue with the overall impact of the virtual queue combined with the mathematical model; similarly, we compare the mathematical model with the overall impact of the virtual queue combined with the modified mathematical model, using the NS-2 simulator.
Keywords: VANETs, broadcast, 802.11e, malicious.

1 Introduction

Vehicle-to-vehicle (V2V) communication enhances the safety of passengers and drivers [1]. V2V communication is unreliable because of shadowing, Doppler shifts and multi-path fading [2]. Security is an important concern in mobile ad hoc networks [3] [4] [5]. Attacks are easily launched in VANETs [6] because of high speeds [7], the absence of infrastructure, and frequently changing topology [8] [9]. Several security attacks are possible on safety applications, including denial of service [10], masquerade [11], fake information, false position information and ID disclosure [12].
Vehicular communication vulnerabilities are explained in [13]; they include jamming, forgery, in-transit traffic tampering, impersonation, privacy violation and on-board tampering. Malicious data in VANETs arise because of the distributed environment and the unreliable components of data generation [14]. In this paper we resolve the problems of the 802.11e protocol by introducing virtual queues at the application level, and we enhance the mathematical model of message benefit so that it takes network traffic into account. We measure the global benefit in a realistic scenario by considering the impact of malicious nodes. We also compare the simple virtual queue with the overall impact of the virtual queue combined with the mathematical model; similarly, we compare the simple mathematical model with the overall impact of the virtual queue combined with the mathematical model. This paper is organized as follows: in Section 2 we discuss the relevance-based approach, its characteristics and its implementation using a cross-layer design, 802.11e, and 802.11e with virtual queues; in Section 3 the proposed study and results are presented using NS-2; finally, conclusions are given in Section 4.

Copyright © 2006-2010 by CCC Publications

2 Related Work

The relevance technique disseminates useful information and removes redundant data [16]. A vehicle holds a large amount of information that cannot all be shared, due to the high speed of vehicles. A technique that gives high-priority traffic more medium access than low-priority traffic is a suitable approach for VANETs, so the relevance technique is the natural option, as it forwards data according to its relevance. Relevance techniques are based upon the calculation of the relevance value of a message and its distribution according to its priority [15, 16]. Altruism, application-oriented information differentiation and controlled unfairness are the basic characteristics of the relevance-based approach [15, 17, 18].
2.1 Cross-Layer and 802.11e Implementation

The relevance technique can be implemented through a cross-layer design or via the 802.11e protocol. In the cross-layer design, the relevance value of every packet is measured at the application layer and passed to the link layer through the packet header. A modified medium access control and interface queue broadcast the high-priority traffic with the help of the application-layer information [15, 17]. The 802.11e protocol implementation has certain flaws and is not suitable for VANET scenarios [18]. The main issues in the 802.11e protocol are that it does not provide internal sorting of packets, there is no priority mechanism within the queues, and network performance degrades because lower-priority traffic sometimes gets more medium access than high-priority traffic [15].

2.2 802.11e Implementation with Virtual Queues

The problems of the 802.11e protocol are overcome by adding four virtual queues at the application level. Packets are sorted according to their priority, with the most important messages near the head of the queues. Sorting is done by getting the current information from the application layer, and the length of the 802.11e queue is set to one. When the 802.11e queue is empty, one high-priority packet is moved from a virtual queue to the 802.11e queue. A packet in the 802.11e queue is not necessarily broadcast immediately: if a packet arrives in a virtual queue with higher relevance than the packet in the 802.11e queue, the two packets are swapped in order to achieve a higher global benefit [19].

2.3 Mathematical Model for the Relevance-Based Approach

The mathematical model for the relevance-based approach is given below [18]:
$$\text{Message Benefit} = \frac{1}{\sum_{i=0}^{n}\alpha_i}\sum_{i=0}^{n}\alpha_i\,b_i(m, v, i).$$
To determine the relevance value of a message, context parameters of the message (m), the vehicle (v) and the information (i) are used. The n parameters are computed with the help of application-dependent functions $b_i$ and weighted with application-dependent factors $\alpha_i$. In the end, all weighted parameters are summed up and divided by the sum of all $\alpha_i$.
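The swap-based virtual queue of Section 2.2 can be sketched with a priority heap; this is an illustration of the scheme, not the NS-2 implementation used in the paper (class and method names are ours):

```python
import heapq

class VirtualQueue:
    """Application-level virtual queue feeding a length-one 802.11e queue.

    Packets are kept sorted by relevance (highest first). If a newly queued
    packet outranks the packet staged in the MAC queue, the two are swapped,
    as in the scheme of Section 2.2.
    """
    def __init__(self):
        self.heap = []           # max-heap via negated relevance values
        self.mac_slot = None     # the single 802.11e queue entry

    def push(self, relevance, packet):
        if self.mac_slot is not None and relevance > self.mac_slot[0]:
            # Swap: stage the more relevant packet for transmission
            heapq.heappush(self.heap, (-self.mac_slot[0], self.mac_slot[1]))
            self.mac_slot = (relevance, packet)
        else:
            heapq.heappush(self.heap, (-relevance, packet))
            if self.mac_slot is None:
                r, p = heapq.heappop(self.heap)
                self.mac_slot = (-r, p)

    def transmit(self):
        # Broadcast the staged packet and refill the MAC slot
        out = self.mac_slot
        if self.heap:
            r, p = heapq.heappop(self.heap)
            self.mac_slot = (-r, p)
        else:
            self.mac_slot = None
        return out

vq = VirtualQueue()
vq.push(1.0, "a")
vq.push(3.0, "b")   # outranks "a": swapped into the MAC slot
vq.push(2.0, "c")
order = [vq.transmit()[1] for _ in range(3)]
```

Packets leave strictly in order of relevance ("b", "c", "a"), even though "a" reached the MAC slot first.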
2.4 Enhanced Mathematical Model for the Relevance-Based Approach

The existing model has no support for network control traffic, so network performance can be improved by adding a network control term to the mathematical model [20]:

EnhancedMessageBenefit = (1 / Σ_{i=0}^{n} α_i) · Σ_{i=0}^{n} α_i · b_i(M, V, I) + Σ_{i=0}^{n} p_i

a) Σ_{i=0}^{n} p_i = 0 for user traffic, whereas Σ_{i=0}^{n} p_i = 1 for an operational-level network problem, Σ_{i=0}^{n} p_i = 2 for the administrative level, and Σ_{i=0}^{n} p_i = 3 for the maintenance level.

b) MessageBenefit = Σ_{i=0}^{n} p_i (for network traffic only); if 0 < Σ_{i=0}^{n} p_i ≤ 3, then (1 / Σ_{i=0}^{n} α_i) · Σ_{i=0}^{n} α_i · b_i(M, V, I) = 0.

User and network traffic are assigned values between zero and three so that they map directly onto the four queues (Q0, Q1, Q2 and Q3) of 802.11e. The highest-priority traffic is assigned to Q0 so that it is forwarded before the packets in Q1, Q2 and Q3. Queues are assigned to user and network control traffic according to their relevance values. The existing approach, however, has no priority mechanism for network control traffic, so the global benefit is enhanced by taking network traffic into account.

3 Proposed Study and Results

In this study we simulate the relevance-based approach and calculate the global benefit in an ideal scenario, in which all nodes behave properly and there is no malicious node in the network. In a second scenario we consider the impact of a malicious node and measure how much the global benefit decreases; here the malicious nodes forward the relevant messages first but also inject some surplus information. In a last scenario the malicious node forwards the surplus messages first and ignores the relevant messages. To validate the proposed study, we compare the performance of the relevance-based approach in the real and ideal scenarios with the 802.11e protocol. NS-2, a network simulator [21], is used to simulate the behavior of the relevance-based approach in VANET scenarios.
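The base and enhanced message-benefit formulas above can be sketched as follows. This is a minimal illustration with hypothetical function names; the context functions b_i and weights α_i are application dependent, and the branching follows cases (a) and (b) of the model as printed.

```python
def message_benefit(alphas, b_values):
    """Base model: weighted average of the context functions b_i(M, V, I)."""
    return sum(a * b for a, b in zip(alphas, b_values)) / sum(alphas)

def enhanced_message_benefit(alphas, b_values, p_sum):
    """Enhanced model. p_sum = 0 for user traffic, or 1..3 for network
    control traffic (operational / administrative / maintenance level).
    Per case (b), for network traffic the user term is zero and the
    benefit reduces to p_sum."""
    if p_sum == 0:                      # user traffic: base model applies
        return message_benefit(alphas, b_values)
    return float(p_sum)                 # network control traffic
```

A network maintenance message (p_sum = 3) thus always outranks user traffic, whose benefit is a weighted average of values in [0, 1], which is the priority inversion fix the enhanced model is after.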
We use the Manhattan mobility model, and traffic is generated by the Generic Mobility Simulation Framework [22]. Vehicles move at speeds of 72 km/h to 108 km/h within an area of 3000 m x 3000 m with a transmission range of 300 m. The performance of the relevance-based approach is measured by calculating the global benefit.

Table 1: Simulation parameters

    Parameter          Setting
    Channel            wireless
    Vehicles           50, 100, 150
    MAC protocol       802.11e
    Simulation time    50 s
    Routing protocol   DSDV

The network simulator is used for the simulation, and the different parameters used in the following study are given in Table 1.

3.1 Improvement Due to the Mathematical Model

In this study we simulate the enhanced mathematical model of message benefit shown above together with the existing relevance-based approach. Figure 1(a) shows 50 vehicles moving at high speed and sharing safety and comfort information with each other. The relevance-based approach considers only user traffic and ignores network traffic, so its global benefit can be improved by improving the mathematical model. We therefore evaluate the performance of the relevance-based approach after adding the network control parameter to the existing formula. Figure 1(a) shows the global benefit with the enhanced relevance-based approach. It is clear from Figure 1(a) that the global benefit of the existing relevance-based approach is low because network control traffic is set to a lower priority and gets less bandwidth than user traffic, so lower-priority traffic can end up with more bandwidth than higher-priority traffic. That is why the global benefit improves when the network parameter is added to the relevance-based approach.

Figure 1: (a) Improvement due to the mathematical model (b) Improvement due to the virtual queue

3.2 Improvement Due to the Virtual Queue

Figure 1(b) compares plain 802.11e with the virtual queue on top of 802.11e; safety messages and route messages are forwarded by the vehicles. In this study 150 vehicles exchange information with each other.
In plain 802.11e there is no priority-assignment mechanism. The virtual queue resolves this problem, so its global benefit is greater than that of plain 802.11e: it does not allow lower-priority traffic to obtain more of the medium than higher-priority traffic.

3.3 Improvement Due to the Virtual Queue and the Mathematical Model

So far we have checked the improvements due to the mathematical model and the virtual queue separately; we now consider their combined impact on the global benefit of the network. Figure 2 shows the global benefit of the existing and enhanced relevance-based approaches with both the virtual queue and the mathematical model. The enhanced relevance-based approach achieves a higher global benefit because it resolves both the missing priority mechanism and the neglect of network control traffic.

Figure 2: Improvement due to the virtual queue and the mathematical model

3.4 Comparison

This study compares the simple virtual queue with the overall impact of the virtual queue plus mathematical model, and similarly compares the simple mathematical model with that overall impact. Figure 3(a) shows the global benefit obtained with the message benefit (MB), the enhanced message benefit (EMB), and the virtual queue + EMB. It is clear from the figure that the global benefit with virtual queue + EMB is greater than with EMB alone, because within a queue no priority mechanism is otherwise available. Figure 3(b) shows the global benefit obtained with 802.11e, the virtual queue, and EMB + virtual queue. It is clear from the figure that the global benefit with EMB + virtual queue is greater than with 802.11e or the simple queue, because the simple queue does not discriminate between user traffic and network traffic.
Figure 3: (a) Comparison of the mathematical model with virtual queue + mathematical model (b) Comparison of the virtual queue with virtual queue + mathematical model

3.5 Impact of a Malicious Node

In this study we consider the impact of a malicious node on the EMB, on the virtual queue, and on both together (EMB + virtual queue). Figure 4(a) shows 50 vehicles moving at high speed and sharing safety and comfort information with each other. First we simulate MB and EMB in the ideal scenario, in which no malicious node exists and all nodes try to improve the benefit of the network rather than their own benefit. We then simulate EMB in the real scenario, in which malicious nodes exist and damage the performance of the network. Figure 4(a) shows that the global benefit of EMB in the real scenario lies between EMB and MB in the ideal scenario. Figure 4(b) shows 150 vehicles exchanging information with each other. First we simulate 802.11e and the virtual queue in the ideal scenario; we then simulate the virtual queue in the real scenario. Figure 4(b) shows that the global benefit in the real scenario lies below both 802.11e and the virtual queue in the ideal scenario. Figure 5 shows 150 vehicles moving at high speed and sharing safety and comfort information with each other. First we simulate MB and VQ + EMB in the ideal scenario; we then simulate VQ + EMB in the real scenario. Figure 5 shows that the global benefit of EMB + VQ in the real scenario lies between EMB + VQ and MB in the ideal scenario.
Figure 4: (a) Impact of a malicious node on EMB (b) Impact of a malicious node on the virtual queue (VQ)

Figure 5: Impact of a malicious node on the enhanced message benefit and virtual queue

4 Conclusion

The relevance scheme relies on intermediate nodes for communication, so it assumes that no selfish node exists in the network; this assumption does not hold in real scenarios. In this paper we simulated the relevance-based approach using 802.11e, the virtual queue with 802.11e, and the enhanced message benefit in both real and ideal scenarios. The simulation results show that the global benefit is improved by using the virtual queue together with the enhanced mathematical model.

5 Acknowledgments

This research is supported by the Prince Muqrin Chair (PMC) for IT Security at King Saud University, Riyadh, Saudi Arabia.

Bibliography

[1] Y. Wu, L. Yang, G. Wu, J. Guo, An improved coded repetition scheme for safety messaging in VANETs, IEEE, 2009.
[2] R.K. Shrestha, S. Moh, I. Chung, D. Choi, Vertex-based multihop vehicle-to-infrastructure routing for vehicular ad hoc networks, Proceedings of the 43rd Hawaii International Conference on System Sciences, IEEE, 2010.
[3] H. Kumar, R.K. Singla, S. Malhotra, Issues and trends in autoconfiguration of IP address in MANET, International Journal of Computers Communications and Control, Vol. 3, Suppl. S, pp. 353-357, 2008.
[4] M.A. Rajan, M.G. Chandra, L.C. Reddy, P. Hiremath, Concepts of graph theory relevant to ad-hoc networks, International Journal of Computers Communications and Control, Vol. 3, Suppl. S, pp. 465-469, 2008.
[5] J. Sun, C. Zhang, Y. Zhang, Y. Fang, An identity-based security system for user privacy in vehicular ad hoc networks, IEEE Transactions on Parallel and Distributed Systems, 2010.
[6] S. Dietzel, E. Schoch, B. Konings, M. Weber, Resilient secure aggregation for vehicular networks, IEEE Network, Vol. 24, pp. 26-31, 2010.
[7] I. Jang, W. Choi, H.
Lim, A forwarding protocol with relay acknowledgement for vehicular ad-hoc networks, IEEE, 2008.
[8] H.L. Nguyen, U.T. Nguyen, Study of different types of attacks on multicast in mobile ad hoc networks, International Conference on Mobile Communications and Learning Technologies, IEEE, 2006.
[9] S. Mao, S. Lin, S.S. Panwar, Y. Wang, E. Celebi, Video transport over ad hoc networks: multistream coding with multipath transport, IEEE Journal on Selected Areas in Communications, Vol. 21, No. 10, December 2003.
[10] B.R. Moyers, J.P. Dunning, R.C. Marchany, J.G. Tront, Effects of Wi-Fi and Bluetooth battery exhaustion attacks on mobile devices, Proceedings of the 43rd Hawaii International Conference on System Sciences, 2010.
[11] K.A. Bakar, B.S. Doherty, Evaluation of the recorded state mechanism for protecting agent integrity against malicious hosts, International Journal of Computers Communications and Control, Vol. 3, No. 1, pp. 60-68, 2008.
[12] M. Raya, J.-P. Hubaux, The security of vehicular ad hoc networks, Workshop on Security in Ad Hoc and Sensor Networks (SASN), 2005.
[13] M. Raya, P. Papadimitratos, J.-P. Hubaux, Securing vehicular communications, IEEE Wireless Communications Magazine, Special Issue on Inter-Vehicular Communications, October 2006.
[14] K. Sha, S. Wang, W. Shi, RD4: role-differentiated cooperative deceptive data detection and filtering in VANETs, IEEE Transactions on Vehicular Technology, Vol. 59, No. 3, March 2010.
[15] T. Kosch, C.J. Adler, S. Eichler, C. Schroth, M. Strassberger, The scalability problem of vehicular ad hoc networks and how to solve it, IEEE Wireless Communications, October 2006.
[16] C. Adler, S. Eichler, T. Kosch, C. Schroth, M. Strassberger, Self-organized and context-adaptive information diffusion in vehicular ad hoc networks, 3rd International Symposium on Wireless Communication Systems, 2006.
[17] S. Eichler, C. Schroth, T. Kosch, M.
Strassberger, Strategies for context-adaptive message dissemination in vehicular ad hoc networks, Second International Workshop on Vehicle-to-Vehicle Communications, July 2006.
[18] C. Schroth, R. Eigner, S. Eichler, M. Strassberger, A framework for network utility maximization in VANETs, ACM International Conference on Mobile Computing and Networking, USA, September 29, 2006.
[19] A. Rahim, M. Yasin, I. Ahmad, Z.S. Khan, M. Sher, Relevance based approach with virtual queue using 802.11e protocol for vehicular adhoc networks, 2nd International Conference on Computer, Control and Communication, Karachi, 14 February 2009.
[20] A. Rahim, F.B. Muhaya, Z.S. Khan, M.A. Ansari, M. Sher, Enhance relevance based approach for network control relevance, accepted in Informatica Journal, ISSN 0350-5596.
[21] Network Simulator NS-2, http://www.isi.edu/nsnam/ns
[22] R. Baumann, F. Legendre, P. Sommer, Generic Mobility Simulation Framework (GMSF), ACM MobilityModels'08, Hong Kong SAR, China, May 26, 2008.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 4, pp. 324-335

On DPA-Resistive Implementation of FSR-Based Stream Ciphers Using SABL Logic Styles

Reza Ebrahimi Atani, Sattar Mirzakuchaki, Shahabaddin Ebrahimi Atani, Willi Meier

Abstract: The threat of DPA attacks is of crucial importance when designing cryptographic hardware. This contribution discusses the DPA-resistant implementation of two eSTREAM finalists using SABL logic styles. In particular, two feedback shift register (FSR) based stream ciphers, Grain v.1 and Trivium, are designed in both BSIM3 130 nm and typical 350 nm technologies and simulated with the HSPICE software. Circuit simulations and statistical power analysis show that the DPA resistivity of the SABL implementations of both stream ciphers is a major improvement. The paper presents the tradeoffs involved in the circuit design and the design-for-performance issues.
Keywords: DPA attack, stream cipher, Grain v.1, Trivium, SABL, standard CMOS.

1 Introduction

The security of a cryptographic primitive can be considered from two points of view: mathematical security (resistance against classical cryptanalysis) and physical security. Physical attacks on cryptographic devices take advantage of implementation-specific characteristics to recover the secret parameters. They are therefore much less general, since they are specific to a given implementation, but often much more powerful than classical cryptanalysis, and they are taken very seriously by implementors of cryptographic devices. A side-channel attack occurs when an attacker is able to use some additional information leaked from the implementation of a cryptographic function to cryptanalyze the function. Clearly, given enough side-channel information, it is trivial to break a cipher. One side-channel attack in particular, differential power analysis (DPA), is of great concern. It was first reported by Kocher et al. in 1998 that the power consumption of a smart card could reveal the secret key of the cryptographic algorithm [1]. DPA is a well-known and thoroughly studied threat for implementations of block ciphers (DES and AES), public key algorithms (RSA) and, recently, stream ciphers (Grain and Trivium [4]). Stream ciphers, as part of the symmetric key cryptography family, have always had a reputation for efficiency and speed in hardware, and they have attracted much attention since the beginning of the eSTREAM project in 2004. Although there is a vast literature about DPA on implementations of block ciphers and public key algorithms, only a few publications can be found about DPA attacks on stream ciphers ([2], [3], [4], [8], [13], [14]). In power analysis attacks, it is assumed that the power consumption of a circuit is correlated to the data handled.
An attacker can therefore recover secret information by simply monitoring the power signals of a running device. Stream ciphers require frequent resynchronization to prevent synchronization loss between sender and receiver. Normally the initialization is done with the same secret key and a different initial value (IV). An attacker can thus disrupt the synchronization, apply a new known IV, and measure the power traces during the initialization phase to mount a DPA on the embedded implementation of the stream cipher. So far, there is only one report of a practical DPA targeting hardware implementations of stream ciphers [4]; in that paper, a chosen-IV DPA attack on the Grain and Trivium stream ciphers is described and executed. Protecting implementations against DPA attacks is usually difficult and expensive. The goal of countermeasures against DPA attacks is to make the power consumption independent of the intermediate values of the stream cipher. In general, these countermeasures can be divided into three basic groups:

Copyright © 2006-2008 by CCC Publications
On DPA-Resistive Implementation of FSR-Based Stream Ciphers Using SABL Logic Styles 325

protocol countermeasures, algorithmic countermeasures, and hardware countermeasures [11]. The principles of the countermeasures can be implemented at different levels in a cryptographic device. In general, these techniques are theoretical countermeasures that only reduce the side-channel leakage and do not fundamentally prevent a DPA; their advantage is that they make the attack significantly harder. In this article, we provide a brief overview of hiding and masking logic styles (hardware countermeasures), and in particular we use Sense Amplifier Based Logic (SABL) for the secure implementation of stream ciphers. SABL is a logic style that uses a fixed amount of charge for every transition, including the degenerated events in which a gate does not change state.
In every cycle, a SABL gate charges a total capacitance with a constant value. So far, there has been no unified architecture that can serve as a test bench for the applicability of logic styles to stream ciphers. For this reason, the two FSR-based stream ciphers Grain v.1 and Trivium are implemented at cell level to identify the tradeoffs involved in designing the architecture and the performance issues. Power traces of the resulting circuits show that SABL significantly reduces the signal-to-noise ratio (SNR). The rest of the paper is structured as follows: a general model of power analysis attacks on stream ciphers is given in Section 2. Section 3 gives an overview of DPA countermeasures at cell level, and Section 4 presents Sense Amplifier Based Logic. Sections 5 and 6 describe Grain v.1 and Trivium. Design and simulation results are presented in Section 7, and conclusions close the paper.

2 Differential Power Analysis of Stream Ciphers

DPA is based on the fact that CMOS logic and application-specific details cause logic operations to have power characteristics that depend on the input data. It relies further on statistical analysis and error correction to extract, from the power consumption, the information that is correlated to the secret key [1]. In a DPA, a hypothetical model of the device under attack is used to predict the power consumption. The classical setup for a DPA on stream ciphers is illustrated in Fig. 1. The output power traces are determined by the input data, the IV, the private key, the output of the device and many other parameters. An attacker has potential knowledge of some of them (e.g. IV, input data and output data), while others are unknown. In a DPA attack, multiple measurements of the power consumption of a cryptographic device are made; for each measurement, a different chosen IV is sent to the device.
Since the cryptographic algorithm is known, a hypothesis on intermediate values can be used to calculate the targeted data values from the random input values. If the correct hypothesis is used, the targeted data values are calculated correctly for all measurements. According to (1), the total power consumption of an embedded device depends on three factors:

P_total = P_const + P_noise + P_dd   (1)

With the help of statistical methods (calculation of correlations, mean values, etc.), the randomness of the data values that are not targeted (P_const: leakage currents and data-independent power consumption, and P_noise: electrical noise) is exploited to reduce their effect on the power consumption traces. P_dd is the data-dependent power consumption and is the target of the statistical analysis. In the end, the result of the statistical operation indicates which key hypothesis is correct. Normally, a Hamming-distance power model is used to map the transitions that occur at the outputs of the cells of a netlist to power consumption values. In CMOS gates, it is reasonable to assume that the main component of the data-dependent power consumption is the dynamic power consumption, i.e. the power dissipated by charging and discharging the output capacitance nodes (P_{0→1} or P_{1→0}). For a CMOS gate, the dynamic power consumption can be expressed as:

P_dynamic = n · C_L · f · V_DD²   (2)

where C_L is the gate load capacitance, n is the probability of a 0 → 1 or 1 → 0 output transition, and f is the clock frequency. This equation shows that the power consumption of CMOS circuits is data dependent. Note that n is the most important factor in the hypothetical model, and there are different techniques for calculating it. For example, a variable gate delay model can be used to measure the number of transitions and glitches of a circuit [7].
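The statistical step can be illustrated with a toy DPA in Python. The data, leakage model and key size here are entirely hypothetical (a single 8-bit secret, Hamming-weight leakage with Gaussian noise), not the authors' measurement setup; the point is only the mechanics: predict an intermediate value for each key guess, map it to power via a Hamming-distance-style model, and correlate against the traces.

```python
import random

def hamming_weight(x):
    return bin(x).count("1")

def correlation(xs, ys):
    # Pearson correlation, written out explicitly.
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs) ** 0.5
    vy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (vx * vy)

# Toy device: leaks the Hamming weight of (IV XOR secret key) plus noise.
random.seed(1)
SECRET_KEY = 0x5A
ivs = [random.randrange(256) for _ in range(500)]
traces = [hamming_weight(iv ^ SECRET_KEY) + random.gauss(0, 0.5)
          for iv in ivs]

def dpa_recover(ivs, traces):
    # For every key hypothesis, predict the leakage and correlate; the
    # correct hypothesis predicts all targeted values correctly, so it
    # yields the highest correlation.
    best_key, best_corr = None, -1.0
    for guess in range(256):
        predicted = [hamming_weight(iv ^ guess) for iv in ivs]
        c = abs(correlation(predicted, traces))
        if c > best_corr:
            best_key, best_corr = guess, c
    return best_key
```

With 500 noisy traces, `dpa_recover(ivs, traces)` singles out the secret byte, because every wrong guess mispredicts the targeted values for a fraction of the measurements and its correlation drops accordingly.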
The variable gate delay technique can easily be applied to circuits using a VHDL simulator at register transfer level.

Figure 1: Differential power analysis model of stream ciphers.

3 DPA Countermeasures at Cell Level

Several methods have been proposed to counteract DPA attacks in different ways. In this section, the known DPA countermeasures at cell level (hiding and masking techniques) are briefly presented, and their merits and disadvantages are discussed. The first structured approach to counteracting DPA attacks at the cell level was the use of hiding logic styles. These styles try to break the correlation between an algorithm's intermediate results and the power consumption of the cryptographic device that executes the algorithm by making the instantaneous power consumption of the cells either random or the same in each clock cycle. The three major types of hiding logic styles are dual-rail precharge (DRP), asynchronous, and current mode logic (CML). DRP logic styles are the most popular. For instance, SABL [10] and Wave Dynamic Differential Logic (WDDL) [15] are dual-rail precharge logic styles whose gates are driven by a precharge signal to prevent glitches, and in which each logic signal is represented by two complementary wires. Other examples of DRP logic styles are dual spacer dual rail logic (DSDR), three-phase dual-rail precharge logic (TDPL) [16], and three-state dynamic logic (3SDL). The data-dependent evaluation time of WDDL and its memory effect make it vulnerable to DPA attacks. One of the major drawbacks of hiding logic styles is the need to balance the cells and interconnect layouts to achieve constant power consumption. Since the charging and discharging of the output nodes of dynamic and differential styles follow a simple RC behavior, cells and wires must be balanced both capacitively and resistively; due to process variations, complex cross-coupling effects, and area limitations, this is a hard task.
Besides hiding, masking at the cell level has become popular in the past few years. With a masked logic style, designers also break the correlation between an algorithm's intermediate values and the power consumption of the cryptographic device that executes the algorithm: all intermediate values are masked by a random value. The cells then process only the masked intermediate values and their corresponding masks. Because the unmasked values and the masked values are uncorrelated, the power consumption of the cells also remains uncorrelated to the intermediate values. Generally there are two types of masking operation: Boolean masking and arithmetic masking. If the masked cells are not activated in a data- or operation-dependent manner, masked logic styles counteract DPA attacks. Two different masking schemes are possible: one mask per circuit (single masking) or one mask per signal. The mask bits are normally produced by random number or sequence generators. Previously, masking was mainly used at the architecture level, so only a few practical results are available for this type of cell-level countermeasure. For example, Masked Dual-Rail Precharge Logic (MDPL) [17] and Dual-Rail Random Switching Logic (DRSL) [18] were introduced by combining the masking scheme with dual-rail precharge logic, so that semi-custom design tools can be used without routing constraints. Designers can implement MDPL cells using commonly available conventional single-rail standard cells. Only sequential cells are connected to the clock signal, and combinational cells precharge their outputs when their inputs have been set to the precharge value. The memory effect can reduce the DPA resistance of masked logic styles, and practical evaluations of manufactured chips have shown that early propagation is also a major threat to their DPA resistance.
Despite all these efforts, it has been shown ([19], [20], [21]) that MDPL leaks information; for example, [19] shows that MDPL is susceptible to the early propagation effect. To combat the early propagation issues, the designers of MDPL introduced the so-called improved MDPL (iMDPL). Each iMDPL gate contains an evaluation precharge detection unit consisting of three (CMOS) AND gates and two (CMOS) OR gates, so it is not surprising that the area requirements of iMDPL gates increased significantly compared to MDPL gates. Another threat to masked circuits is the detection of the mask value, which lets attackers completely cancel out the effect of masking in a DPA attack. Such an attack is particularly dangerous for single-masked circuits, where only one mask value is used for all signals in the circuit. Increasing the number of mask values per circuit is an option, but it is impractical because of its high complexity and area utilization.

4 Sense Amplifier Based Logic

In this paper we concentrate on SABL [10] for the DPA-resistive implementation of stream ciphers. SABL belongs to the DRP logic styles. Fig. 2 shows the transistor schematics of the standard SABL gate library used for the implementation of the ciphers. Equation (3) gives the power consumption of a SABL gate:

P = C_L · f · V_DD² + C_clk · f · V_DD²   (3)

where C_L represents the total output capacitance of the gate and C_clk the capacitance of the clock propagation circuitry. As can be seen in Fig. 2, SABL gates can be designed using differential pull-down networks (DPDN) or differential pull-up networks (DPUN), controlled by clk and its complement, respectively. This allows two modes of cascading SABL gates: domino connection (connecting the outputs of a gate to the inputs of the next gate through inverters) or np-connection (n-gates followed by p-gates, as in np-logic). In SABL, the concepts of dual-rail and precharge logic are combined to achieve constant power consumption.
Precharging breaks a signal's sequence of values by splitting each clock cycle into a precharge and an evaluation phase. In the precharge phase, the complementary wires encoding a signal are set to a predefined precharge value, such as 1. In the subsequent evaluation phase, one of the two complementary wires is set to 1 according to the actual value that is processed. As a result, for each signal in the circuit, exactly one 0 → 1 transition and one 1 → 0 transition occur per clock cycle. By balancing the complementary wires between cells on the one hand and the internal structure of the cells on the other, designers can achieve constant power consumption. The price is the high power consumption and the high current spikes of these gates, which appear at the beginning of the precharge phase. The delayed clock mechanism introduced in [13] and [14] can reduce the peak of these spikes. In practice, however, the throughput depends strongly on the layout design of the chip, which must provide balanced complementary wires. Since the charging and discharging of the output nodes of differential styles follow a simple RC behavior, cells and wires must be balanced both capacitively and resistively; due to process variations, complex cross-coupling effects, and area limitations, this is hard to achieve. Avoiding these effects often requires custom cell design, which involves considerably more design effort than using available standard cells.

Figure 2: (i) SABL D-flip-flop (ii) SABL NAND2 gate (iii) SABL XOR2 gate

5 Grain Stream Cipher

Grain v.1 [5] is a stream cipher introduced in 2005 as a candidate for the hardware profile of the eSTREAM project. Grain v.1 is a binary additive synchronous stream cipher with an internal state of 160 bits, s_i, s_{i+1}, ..., s_{i+79} and b_i, b_{i+1}, ...
, b_{i+79}, residing in a linear feedback shift register (LFSR) and a nonlinear feedback shift register (NLFSR), respectively. The design of the algorithm mainly targets hardware environments where gate count, power consumption and memory are very limited. The key size of Grain is 80 bits (k_i, 0 ≤ i ≤ 79), and an initial value of 64 bits (iv_i, 0 ≤ i ≤ 63) is additionally required. In the initialization phase, all 80 NLFSR elements are loaded with the key bits (b_i = k_i, 0 ≤ i ≤ 79), then the first 64 LFSR elements are loaded with the IV bits (s_i = iv_i, 0 ≤ i ≤ 63); the last 16 bits of the LFSR are filled with ones. f(x) and g(x) are the two polynomials used as feedback functions for the LFSR and NLFSR:

f: s_{i+80} = s_{i+62} ⊕ s_{i+51} ⊕ s_{i+38} ⊕ s_{i+23} ⊕ s_{i+13} ⊕ s_i   (4)

g: b_{i+80} = s_i ⊕ b_i ⊕ b_{i+9} ⊕ b_{i+14} ⊕ b_{i+21} ⊕ b_{i+28} ⊕ b_{i+33} ⊕ b_{i+37} ⊕ b_{i+45} ⊕ b_{i+52} ⊕ b_{i+60} ⊕ b_{i+62} ⊕ b_{i+63}·b_{i+60} ⊕ b_{i+37}·b_{i+33} ⊕ b_{i+15}·b_{i+9} ⊕ b_{i+60}·b_{i+52}·b_{i+45} ⊕ b_{i+33}·b_{i+28}·b_{i+21} ⊕ b_{i+63}·b_{i+45}·b_{i+28}·b_{i+9} ⊕ b_{i+60}·b_{i+52}·b_{i+37}·b_{i+33} ⊕ b_{i+63}·b_{i+60}·b_{i+21}·b_{i+15} ⊕ b_{i+63}·b_{i+60}·b_{i+52}·b_{i+45}·b_{i+37} ⊕ b_{i+33}·b_{i+28}·b_{i+21}·b_{i+15}·b_{i+9} ⊕ b_{i+52}·b_{i+45}·b_{i+37}·b_{i+33}·b_{i+28}·b_{i+21}   (5)

The output function h(x) takes as input selected bits from both feedback shift registers:

h(x) = x_1 ⊕ x_4 ⊕ x_0·x_3 ⊕ x_2·x_3 ⊕ x_3·x_4 ⊕ x_0·x_1·x_2 ⊕ x_0·x_2·x_3 ⊕ x_0·x_2·x_4 ⊕ x_1·x_2·x_4 ⊕ x_2·x_3·x_4   (6)

where the variables x_0, x_1, x_2, x_3 and x_4 correspond to the tap positions s_{i+3}, s_{i+25}, s_{i+46}, s_{i+64} and b_{i+63}, respectively. The output of the filter function is masked with some state bits from the NLFSR to produce the keystream z_i:

z_i = b_{i+1} ⊕ b_{i+2} ⊕ b_{i+4} ⊕ b_{i+10} ⊕ b_{i+31} ⊕ b_{i+43} ⊕ b_{i+56} ⊕ h(s_{i+3}, s_{i+25}, s_{i+46}, s_{i+64}, b_{i+63})

During the initialization phase this output is fed back into the LFSR and NLFSR as additional feedback; during normal operation it is used as the keystream output.
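The register updates above can be transcribed into a bit-level Python sketch. This is a readability aid only, following equations (4)-(6) as printed and covering the keystream (non-initialization) mode; it is not an optimized or independently verified reference implementation of Grain v.1.

```python
def grain_keystream_step(s, b):
    """One clock of the Grain v.1 state per (4)-(6); s and b are lists of
    80 bits (LFSR and NLFSR). Returns (z, s, b) in keystream mode."""
    f = s[62] ^ s[51] ^ s[38] ^ s[23] ^ s[13] ^ s[0]                    # (4)
    g = (s[0] ^ b[0] ^ b[9] ^ b[14] ^ b[21] ^ b[28] ^ b[33] ^ b[37]
         ^ b[45] ^ b[52] ^ b[60] ^ b[62]
         ^ b[63] & b[60] ^ b[37] & b[33] ^ b[15] & b[9]
         ^ b[60] & b[52] & b[45] ^ b[33] & b[28] & b[21]
         ^ b[63] & b[45] & b[28] & b[9]
         ^ b[60] & b[52] & b[37] & b[33]
         ^ b[63] & b[60] & b[21] & b[15]
         ^ b[63] & b[60] & b[52] & b[45] & b[37]
         ^ b[33] & b[28] & b[21] & b[15] & b[9]
         ^ b[52] & b[45] & b[37] & b[33] & b[28] & b[21])               # (5)
    x0, x1, x2, x3, x4 = s[3], s[25], s[46], s[64], b[63]
    h = (x1 ^ x4 ^ x0 & x3 ^ x2 & x3 ^ x3 & x4 ^ x0 & x1 & x2
         ^ x0 & x2 & x3 ^ x0 & x2 & x4 ^ x1 & x2 & x4 ^ x2 & x3 & x4)  # (6)
    z = b[1] ^ b[2] ^ b[4] ^ b[10] ^ b[31] ^ b[43] ^ b[56] ^ h
    # Shift both registers and append the feedback bits.
    return z, s[1:] + [f], b[1:] + [g]
```

Loading follows the text: the NLFSR gets the 80 key bits, the LFSR gets the 64 IV bits followed by 16 ones; in initialization mode z would additionally be XORed into both feedback bits before shifting.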
The number of output bits generated per clock cycle is called the radix. By implementing the small feedback functions f(x) and g(x) and the output function several times, the speed of Grain can easily reach radix-32.

6 Trivium Stream Cipher

Trivium [6] is a stream cipher introduced in 2005 as a candidate for the hardware profile of the eSTREAM project. Trivium has an internal state of 288 bits, a_i, ..., a_{i+92}, b_i, ..., b_{i+83} and c_i, ..., c_{i+110}, residing in three coupled NLFSRs A, B, and C of 93, 84, and 111 bits, respectively. Trivium takes a key K = (k_0, ..., k_79) of 80 bits as well as an initial value IV = (iv_0, ..., iv_79) of 80 bits. The key and IV are initialized as follows:

(a_0, ..., a_92) = (0, ..., 0, k_79, ..., k_0)
(b_0, ..., b_83) = (0, 0, 0, 0, iv_79, ..., iv_0)   (7)
(c_0, ..., c_110) = (1, 1, 1, 0, 0, ..., 0, 0)

The state is then updated over four full cycles according to (8), but without generating keystream bits; after 1152 clockings it outputs a keystream bit z_i:

a_{i+93} = a_{i+24} ⊕ c_i ⊕ (c_{i+1} · c_{i+2}) ⊕ c_{i+45}
b_{i+84} = b_{i+6} ⊕ a_i ⊕ (a_{i+1} · a_{i+2}) ⊕ a_{i+27}   (8)
c_{i+111} = c_{i+24} ⊕ b_i ⊕ (b_{i+1} · b_{i+2}) ⊕ b_{i+15}

z_i = a_i ⊕ b_i ⊕ c_i ⊕ a_{i+27} ⊕ b_{i+15} ⊕ c_{i+45}   (9)

Trivium has a very simple structure that is well suited to different radix implementations, from radix-1 to radix-64, without noticeable hardware penalties. The basic structures of the Grain v.1 and Trivium stream ciphers are shown in Fig. 3. On April 15, 2008, the eSTREAM competition finished, and according to the final report [12] both ciphers were selected among the four finalists of the hardware profile.

7 Design and Simulation Results

Both eSTREAM candidates were modeled at transistor level using a SPICE netlist. The circuit designs of Grain v.1 and Trivium are mainly based on the techniques presented in [13] and [14].
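Before turning to the circuit-level results, the Trivium update (7)-(9) above can likewise be transcribed as a bit-level sketch. It follows the paper's indexing as printed, with hypothetical helper names; it is not an independently verified reference implementation.

```python
def trivium_init(key, iv):
    """Load the three registers per (7); key and IV are lists of 80 bits."""
    a = [0] * 13 + key[::-1]                 # 93 bits: zeros, then k_79..k_0
    b = [0] * 4 + iv[::-1]                   # 84 bits: zeros, then iv_79..iv_0
    c = [1, 1, 1] + [0] * 108                # 111 bits: three ones, then zeros
    return a, b, c

def trivium_step(a, b, c):
    """One clock per (8)-(9); returns (z, a, b, c)."""
    z = a[0] ^ b[0] ^ c[0] ^ a[27] ^ b[15] ^ c[45]
    na = a[24] ^ c[0] ^ (c[1] & c[2]) ^ c[45]
    nb = b[6] ^ a[0] ^ (a[1] & a[2]) ^ a[27]
    nc = c[24] ^ b[0] ^ (b[1] & b[2]) ^ b[15]
    return z, a[1:] + [na], b[1:] + [nb], c[1:] + [nc]
```

Running 1152 warm-up clocks (four full cycles of the 288-bit state) before emitting bits mirrors the initialization described in the text.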
in order to quantify the impact of the minimum feature size on the design, the ciphers are designed in two technologies: a typical bsim3 0.13µm cmos soi technology and a typical 0.35µm cmos soi technology. spice simulations were run to test the circuits against test vectors provided by the inventors of the ciphers, using the hspice circuit simulator and a c compiler. a domino cascading scheme is used for all sabl gate connections to make sure that every cascaded gate sees a 0 → 1 transition at its input, preventing possible glitches. first, a new standard gate library based on sabl logic is designed. minimum-sized transistors are used to lower the total capacitance and thus the dynamic power in (2). this also minimizes the charging time during the precharge phase and helps to cut the current spikes at the beginning of the precharge phase of each cycle. in order to get rid of these spikes, a delayed clocking mechanism is used [13], [14].

figure 3: (i) grain stream cipher (ii) trivium stream cipher.

in order to increase the security and speed of the initialization phase, a parallel data loading scheme is used, since in the case of serial bit loading a straightforward simple power analysis attack is very likely to succeed in recovering all key bits. in parallel loading, the key and iv are loaded into the state bits after the first rising edge of the clk signal. the gate level architecture of the parallel data loading scheme for standard cmos is shown in fig. 4. note that in the case of sabl all wires and gates are dual rail. the area overhead is three nand2 gates for each flipflop of the fsrs in the ciphers. since all components in the architecture need a clk signal for switching from the precharge phase into evaluation and vice versa, a chained buffer clock signal is needed.
for the standard cmos implementation of the stream ciphers, standard two-input nand gates (4 transistors), 8-transistor two-input xor gates, and edge-triggered d-flipflops of 24 transistors (using eight nand2 gates) are used. in order to monitor all current variations, one sample was taken every 50 ps. both simulations were run for four different 80-bit keys and ivs (64-bit ivs for grain v.1) in both the sabl and the standard cmos designs. all power simulations are performed with a 5 mhz clock signal. the average power consumption per cycle was extracted by averaging the power consumption over 100 consecutive clock cycles. then the mean power consumption (mpc), the power consumption standard deviation (pcsd), the normalized energy deviation (ned) and the normalized standard deviation (nsd) were extracted for each simulated logic style, as defined in (10). as an example, the supply current traces for the standard cmos design of grain v.1, for the choice of k2, iv3 (in table 1), in the initialization phase are shown in fig. 5.

ned = (max(energy/cycle) − min(energy/cycle)) / max(energy/cycle), nsd = pcsd / mpc (10)

in terms of transistor cost, the complete trivium (including the parallel data loading and clock buffering circuitry) required ≈ 23000 transistors for sabl and ≈ 8500 transistors for standard cmos. in the case of grain v.1, ≈ 13500 transistors for sabl and ≈ 6000 transistors for standard cmos are needed, confirming a more than two times higher hardware cost for the sabl style. table 1 shows the summary of the final statistical power analysis results. for example, in the 0.13µm technology and for k1, iv1, for grain v.1, pcsd_sabl / pcsd_scmos = 0.016, which shows that the power consumption fluctuation of the sabl implementation is nearly 1.6% of that of standard cmos (power = current × constant supply voltage).
this is a major improvement, but still pcsd_sabl ≠ 0 and very small current variations are detectable.

figure 4: parallel data loading scheme in fsrs (standard cmos)

figure 5: supply current variation of the standard cmos design of grain v.1 in 350nm technology

table 1: statistical power analysis of trivium and grain v.1 for different 80-bit hexadecimal keys and ivs (k1 = aa...a, k2 = 80...0, iv1 = 55...5, iv2 = ff...f, iv3 = 00...0, iv4 = 11...1). [note that in the case of grain v.1 the ivs are 64 bits]

                 trivium                              grain
          mpc [µw]  pcsd [µw]  ned     nsd       mpc [µw]  pcsd [µw]  ned     nsd

sabl 0.35µm, vdd = 3.3v, vtn = 0.6v, vtp = −0.85v
k1, iv1     949     1.2632   0.0091  0.00133      616     0.9497   0.0136  0.00154
k1, iv2     940     1.2469   0.0106  0.00132      605     0.9375   0.0111  0.00155
k2, iv3     938     1.2403   0.0089  0.00131      601     0.9318   0.0122  0.00155
k2, iv4     943     1.2531   0.0117  0.00133      611     0.9393   0.0120  0.00153

s-cmos 0.35µm, vdd = 3.3v, vtn = 0.6v, vtp = −0.85v
k1, iv1     641     49.1     0.3292  0.0766       421     26.8     0.2784  0.0636
k1, iv2     637     19.3     0.2351  0.0303       402     18.5     0.1905  0.0460
k2, iv3     629     23.5     0.3139  0.0374       397     17.1     0.2187  0.0430
k2, iv4     635     41.7     0.2842  0.0657       415     29.6     0.3882  0.0713

sabl 0.13µm, vdd = 1.2v, vtn = 0.4v, vtp = −0.39v
k1, iv1     545     0.8435   0.0061  0.0015       378     0.7610   0.0124  0.0020
k1, iv2     537     0.7927   0.0054  0.0014       371     0.7424   0.0082  0.0020
k2, iv3     536     0.7831   0.0050  0.0014       369     0.7291   0.0079  0.0019
k2, iv4     541     0.8237   0.0059  0.0015       374     0.7482   0.0101  0.0020

s-cmos 0.13µm, vdd = 1.2v, vtn = 0.4v, vtp = −0.39v
k1, iv1     337     31.20    0.8945  0.0926       258     22.50    0.7889  0.0872
k1, iv2     321     16.12    0.8191  0.0190       246     14.14    0.8026  0.0545
k2, iv3     319     17.14    0.8614  0.0537       242     12.31    0.8402  0.0509
k2, iv4     326     29.94    0.9218  0.0918       252     21.72    0.7924  0.0862
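the statistics reported in table 1 are straightforward to reproduce from per-cycle measurements. the fragment below is a stdlib-only illustration on made-up sample values (not the paper's simulation data) applying the definitions in (10):

```python
import statistics

def ned(energy_per_cycle):
    # normalized energy deviation, (10): (max - min) / max over per-cycle energies
    return (max(energy_per_cycle) - min(energy_per_cycle)) / max(energy_per_cycle)

def nsd(power_per_cycle):
    # normalized standard deviation, (10): pcsd / mpc
    mpc = statistics.fmean(power_per_cycle)    # mean power consumption
    pcsd = statistics.pstdev(power_per_cycle)  # population standard deviation
    return pcsd / mpc

# hypothetical per-cycle values for 4 cycles (arbitrary units)
samples = [10.0, 12.0, 11.0, 9.0]
print(ned(samples))   # (12 - 9) / 12 = 0.25
print(nsd(samples))
```

a near-constant per-cycle energy, as in the sabl rows of table 1, drives both ratios toward zero.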
the overall comparison between the sabl and standard cmos designs for the 4 different key and iv choices of table 1 is shown in table 2. the dpa resistivity factor is calculated as in (11):

dpa resistivity ∝ 1 / pcsd (11)

one of the fundamental parameters of a cryptographic algorithm is the amount of data it can process within a given period. the total throughput of the algorithm is expressed in mbits/s and can be calculated from t = f × radix, where f is the clock frequency of the design (e.g. 5 mhz). since the trivium throughput rates for the sabl and s-cmos designs are equal, in order to make a fair comparison a new normalized qualifying factor (qf) is defined in (12):

qf = (dr × t) / (p × a) (12)

where a, p and dr correspond to the transistor cost, power consumption and dpa resistancy, respectively.

table 2: overall comparison for the sabl and s-cmos designs of trivium and grain (all data are normalized)

cipher                      trivium                     grain
technology               0.13µm        0.35µm        0.13µm        0.35µm
logic style            sabl  s-cmos  sabl  s-cmos  sabl  s-cmos  sabl  s-cmos
transistor cost (a)     1    0.37     1    0.37     1    0.44     1    0.44
power consumption (p)   1    0.60     1    0.67     1    0.66     1    0.67
dpa resistancy (dr)     1    0.0374   1    0.0435   1    0.045    1    0.043
qualifying factor (qf)  1    0.062    1    0.065    1    0.068    1    0.064

at the end of the simulations and data analysis we showed that sabl logic styles allow a significant decrease of the supply current variations of both estream circuits, but in both designs very small current variations are still detectable. as a disadvantage, this could be the starting point of a dpa attack, since the predictability of the energy variations is more critical than their amplitude. it is clear that decreasing the power consumption variations will affect all the stream cipher design components in exactly the same way, and therefore will not affect the snr.
since the efficiency of dpa depends on the ability to predict the power consumption of a device as a function of its input data, and on the value of the correlation coefficient, the attack is still theoretically feasible against sabl circuits. but these current differences are due to the presence of parasitic capacitances in the design and therefore cannot be predicted without precise transistor-level knowledge of the circuit. as a consequence, an attacker can only target one specific implementation and first needs to build a table containing the power consumption differences as a function of the circuit input data. this information is not usually made available to the users of a full custom design. moreover, under the assumption that the power consumption can be perfectly predicted and measured, the resistance of a circuit is equal for any logic style. nevertheless, in practice, measurements are not perfect and induce noise, independently of the logic style considered. this causes a reduction of the correlation values, depending on the power consumption variances, which is hard to evaluate and highly dependent on the attacker's measurement setup. as can be seen in table 1 and table 2, the dpa resistancy is improved for designs with a smaller minimum feature size, although the current variations do not follow the scaling rules. this is mainly because of the clock feedthrough effect and the subthreshold leakages, which play a big role in deep submicron designs. stream ciphers have always had a reputation of efficiency in hardware. their smaller architecture makes it possible to use a full custom design flow in order to obtain balanced routing of component wires, so simpler stream cipher designs have lower design costs. regarding the design flow, trivium has lower hardware complexity and its circuit design is easier. although trivium has a bigger architecture, the timing constraints and clock distribution of trivium are the same as those of grain.
comparing the resistance of the two estream candidates against dpa attacks, simulations show that grain has lower current spikes and smaller current variations. this is thanks to the higher circuit complexity of grain, which combines the different current variations of its gates to achieve a semi-random supply current variation. the current spikes in trivium are due to its higher number of flipflops. another disadvantage of trivium is the large number of iterations in its initialization phase (1152 rounds), which lets attackers collect more power traces.

8 summary and conclusions

this paper investigated the use of sabl logic to counteract power analysis attacks. in particular, efficient dpa-resistive circuits for the grain v.1 and trivium stream ciphers have been designed and compared with their standard cmos implementations. first we showed that sabl allows a significant decrease of the circuit energy variations. this is due to the equal amount of power consumed in each clock cycle by sabl gates. all implementations have been done at transistor level, but in practice the cipher itself is part of a system on chip with many other circuits, which can increase pcons. + pnoise in (1) and thus lower the snr. although sabl cannot be completely tamper resistant, this logic probably presents acceptable security margins for general applications of stream ciphers. as future work, interested researchers can investigate circuit changes in sabl styles to counteract other side channel attacks, such as fault attacks, in order to obtain more security.

acknowledgment

reza ebrahimi atani wishes to thank the iran telecommunication research center (itrc) for their financial support (www.itrc.ac.ir).

bibliography

[1] p. c. kocher, j. jaffe, and b. jun, “differential power analysis,” advances in cryptology crypto’99, springer-verlag, lncs vol. 1666, pp. 388–397, 1999.

[2] ch. rechberger and e.
oswald, “stream ciphers and side-channel analysis,” in sasc 2004 the state of the art of stream ciphers, brugge, belgium, workshop record, pp. 320–326, oct. 14-15, 2004.

[3] j. lano, n. mentens, b. preneel, and i. verbauwhede, “power analysis of synchronous stream ciphers with resynchronization mechanism,” in sasc 2004 the state of the art of stream ciphers, brugge, belgium, workshop record, pp. 327–333, oct. 14-15, 2004.

[4] w. fischer, b. m. gammel, o. kniffler, j. velton, “differential power analysis of stream ciphers,” topics in cryptology ct-rsa 2007, springer-verlag, lncs vol. 4377, pp. 257–270, 2007.

[5] m. hell, th. johansson, a. maximov, and w. meier, “grain a stream cipher for constrained environments,” 2006, estream project website.

[6] c. de canniere and b. preneel, “trivium specifications,” 2005, estream project website.

[7] t. seko, a. nakamura, and t. kikuno, “measurement of glitches based on variable gate delay model using vhdl simulator,” asia-pacific conference on circuits and systems, nov. 1998, pp. 767–770.

[8] b. gierlichs et al., “susceptibility of estream candidates towards side channel analysis,” sasc 2008, switzerland, feb. 13-14, 2008, workshop record, pp. 320–326.

[9] k. tiri and i. verbauwhede, “charge recycling sense amplifier based logic: securing low power security ics against dpa,” 30th european conference on solid-state circuits, 21-23 sept. 2004, pp. 179–182.

[10] k. tiri, m. akmal, and i. verbauwhede, “a dynamic and differential cmos logic with signal independent power consumption to withstand differential power analysis on smart cards,” 28th european solid state circuits conference, ieee press, pp. 403–406, 24-26 sep. 2002.

[11] s. mangard, e. oswald, and t. popp, power analysis attacks: revealing the secrets of smart cards, springer, 2007.

[12] s. babbage et al., the estream portfolio, april 2008, estream project website.

[13] r.e. atani, w. meier, s.
mirzakuchaki, and s.e. atani, “design and implementation of dpa resistive grain-128 stream cipher based on sabl logic,” international journal of computers, communications & control, vol. iii (suppl. issue), pp. 293–298, 2008.

[14] r.e. atani, w. meier, s. mirzakuchaki, and s.e. atani, “design and simulation of a dpa resistive circuit for trivium stream cipher based on sabl styles,” mixdes 2008, 19-21 june 2008, pp. 203–208.

[15] k. tiri and i. verbauwhede, “a logic level design methodology for a secure dpa resistant asic or fpga implementation,” date 2004, 2004, pp. 246–251.

[16] m. bucci, l. giancane, r. luzzi, and a. trifiletti, “three-phase dual-rail precharge logic,” in cryptographic hardware and embedded systems ches 2006, vol. 4249 of lncs, springer-verlag, 2006, pp. 232–241.

[17] t. popp and s. mangard, “masked dual-rail pre-charge logic: dpa-resistance without routing constraints,” in cryptographic hardware and embedded systems ches 2005, vol. 3659 of lncs, springer, 2005, pp. 172–186.

[18] z. chen and y. zhou, “dual-rail random switching logic: a countermeasure to reduce side channel leakage,” in cryptographic hardware and embedded systems ches 2006, vol. 4249 of lncs, springer-verlag, 2006, pp. 242–254.

[19] d. suzuki and m. saeki, “security evaluation of dpa countermeasures using dual-rail pre-charge logic style,” in cryptographic hardware and embedded systems ches 2006, vol. 4249 of lncs, springer-verlag, 2006, pp. 255–269.

[20] p. schaumont and k. tiri, “masking and dual-rail logic don't add up,” in cryptographic hardware and embedded systems ches 2006, vol. 4249 of lncs, springer-verlag, 2006, pp. 95–106.

[21] b. gierlichs, “dpa-resistance without routing constraints?,” in cryptographic hardware and embedded systems ches 2006, vol. 4249 of lncs, springer-verlag, 2006, pp. 107–120.
reza ebrahimi atani, electrical engineering department, iran university of science and technology (iust), narmak, 16846, tehran, iran. e-mail: rebrahimi@iust.ac.ir

sattar mirzakuchaki, electrical engineering department, iran university of science and technology (iust), narmak, 16846, tehran, iran. e-mail: m-kuchaki@iust.ac.ir

shahabaddin ebrahimi atani, mathematics department, university of guilan, p.o.box 1914, rasht, iran. e-mail: ebrahimi@guilan.ac.ir

willi meier, iast, institute fhnw, ch 5210, windisch, switzerland. e-mail: willi.meier@fhnw.ch

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 2, pp. 161-171

antenna arrays principle and solutions: robust control approach

florin hutu, sebastien cauet, patrick coirault

abstract: this paper treats solutions to the problem of driving antenna arrays with a chain of non-identical oscillators. frequency approaches have been studied in order to solve the problem of synchronization of the oscillators; in this article, however, a new structure of a chain of oscillators is introduced. secondly, lyapunov stability theory is used to design a dynamical controller guaranteeing the synchronization of the oscillators. the problem of synchronization is transformed into a problem of asymptotic stabilization for a nonlinear system. it is formulated as a system of linear matrix inequalities in which the parameter variations of the two oscillators and their differences are modeled by polytopic matrices. the theoretical result is successfully applied to an array of transistor-based oscillators used in "smart antenna" systems.

keywords: nonlinear systems, control applications, antenna arrays

1 introduction

the demand for mobile communication services is growing continuously, and it is estimated that this rate of growth will be maintained in the coming years.
this continuous development has stimulated research into new hardware and software solutions for increasing the volume of exchanged data and better managing the emitted or received electromagnetic field. smart antenna arrays comprise a number of antennas working in conjunction with an intelligent system that processes the received and transmitted data. the processing can be realized in hardware or in software and allows smart antenna arrays to focus beams in particular directions. this problem can be partially solved by using several directional antennas, a solution that divides the 360-degree coverage area into sectors. however, smart antenna arrays provide a much more effective solution by focusing the transmitted power toward the user and only looking in the direction of the user for the uplink signal. this ensures that the user receives the optimum quality of service and the maximum coverage from a base station. the development of new technologies increases antenna array performance and minimizes the costs of production and the occupied space, making arrays implementable in domains like wireless or satellite communications, radar systems, missile defense systems, the automobile industry, etc. smart antennas, or antenna arrays, are a part of communication systems that can improve their global performance. this technique can increase the spectral efficiency and reduce multipath fading, the bit error rate (ber), the co-channel interference (cci) and the system complexity [1]. this is possible by electronically adjusting the beam pattern of the antenna array in order to provide an important gain for the desired signals and a small gain for interference signals. at emission, the purpose of smart antennas is to minimize the interference between the different transmitters that work on the same communication channel and thus to use the emitted power more efficiently.
for this reason, the beam shape must be controlled in order to minimize the amplitude of the side lobes and to maximize the energy in the main lobe. moreover, the direction of the main lobe must be controlled. the targeted application is inter-vehicle communication. the bandwidth will be, first of all, in the area of 24 ghz and finally around 79 ghz. at these frequencies, techniques like "software defined radio" cannot be used. one of the main objectives is to develop a structure which will extend, in both space and time, the safety information available to drivers by using the infrastructure and vehicles as sources.

copyright © 2006-2008 by ccc publications

when smart antennas are used in reception systems, the signals coming from interference directions must be rejected and those coming from the desired directions must be privileged. for this reason, different phases and amplitudes must be assured for the locally generated carrier signals [2, 3, 4]. the work that has been done in the field of the dynamics of coupled nonlinear systems using the frequency approach [5, 6, 7, 8] shows that such systems offer methods of phase control among array elements and beam scanning capabilities, but also imply problems of stabilization. the proposed structure of the antenna array is based on unidirectionally coupled oscillators. in detail, this paper treats the synchronization of a system made of two oscillators with a unidirectional coupling, and this problem of synchronization is transformed into a problem of stabilization for a nonlinear system. the chosen strategy is to find an output feedback dynamic controller, using lyapunov functions, that assures robust synchronization despite parameter variations of the oscillators. the problem of computing dynamic output feedbacks for lti (linear time-invariant) systems in terms of matrix inequalities is difficult to solve.
there are two known techniques: iterative algorithms and the cancellation of variable products by using the matrix separation lemma. the reader can find papers treating this problem by lmis (linear matrix inequalities), which can be numerically solved [9, 10, 11, 12, 13]. the variations of the parameters of the oscillators are taken into account by considering the state matrix as a polytopic one. the polytopic structure is easily tractable by linear matrix inequalities: once the stability of the vertices defining the polytope is proved, the stability and the synchronization of the two oscillators are assured for all systems inside the polytope. the nonlinear character of the oscillators allows the synchronization (if their free-running frequencies are in a certain domain [6]) but also makes them dependent on initial conditions. the main objective is to cancel the nonlinear effect and to maintain the synchronization when the physical parameters of the oscillators and the external conditions are modified. the originality of the method comes from the inclusion of the nonlinear term and the undesired variations in a perturbation. this problem is transformed into an h∞ optimization. section 2 presents an overview of antenna array theory. in section 3, a model of the master and slave oscillators and the controller synthesis are introduced. section 4 presents the numerical results in the case of an array of coupled nonlinear oscillators.

2 antenna array theory overview

2.1 theoretical background

a smart antenna is composed of an array of individual radiative elements (elementary antennas) placed in a particular configuration (linear, circular or matrix). by gathering these elementary antennas in such arrays and by changing the characteristics of the signals associated with each element, the array can present different gains according to the direction.
let us consider a uniform linear array of n identical patch antennas placed at the same distance d from each other, as in fig. 1. for the theoretical study of this configuration, it is assumed that harmonic signals of the same frequency but different amplitudes and phases are injected into the elementary antennas. in far-field (fraunhofer) regions, the mathematical expression of the total electromagnetic field generated by the array can be written as the product of the electromagnetic field of the reference antenna (which is considered to be the first antenna in the array) and a term which depends on the amplitudes and phases of the injected signals:

etotal = eref · f(θ). (1)

figure 1: a uniform antenna array

thereafter, the gain of the antenna array will be considered, knowing that it is a normalization of the amplitude of the electromagnetic field. the array factor can be written as

f(θ) = ∑_{m=1}^{n} wm e^{−j(m−1) k0 d cos θ}, (2)

where wm = am e^{jϕm}. in the simplest emission case, where the amplitudes all have the same value, the mathematical expression of the radiation pattern is

f(θ) = (1/n) · sin(nγ/2) / sin(γ/2), (3)

where γ = ϕ − k0 d cos(θ), n is the number of antennas and d the distance between them. it can be seen that a quantity of the radiated energy is lost in the side lobes, which implies a certain weakening of the antenna array gain. it is obvious that in the emission case, the variations of both the amplitudes and the phases of the carrier signals are in a large interval. this paper proposes a new technique to generate such signals, which have the same frequency but different phases and amplitudes.
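the closed form (3) can be checked numerically against the sum (2). the stdlib sketch below is illustrative; it assumes uniform amplitudes am = 1 and a progressive phase ϕm = (m − 1)ϕ, so that each term of (2) reduces to e^{j(m−1)γ}, and compares the normalized magnitudes of the two expressions:

```python
import cmath
import math

def array_factor_sum(n, gamma):
    # |(1/n) * sum_{m=1..n} e^{j(m-1)gamma}| -- equation (2) with the phase
    # progression and the propagation term absorbed into the single angle gamma
    s = sum(cmath.exp(1j * (m - 1) * gamma) for m in range(1, n + 1))
    return abs(s) / n

def array_factor_closed(n, gamma):
    # |(1/n) * sin(n*gamma/2) / sin(gamma/2)| -- equation (3)
    if abs(math.sin(gamma / 2)) < 1e-12:   # limit gamma -> 0: factor -> 1
        return 1.0
    return abs(math.sin(n * gamma / 2) / (n * math.sin(gamma / 2)))

# the two forms agree for any gamma away from the removable singularity
for g in (0.4, 1.0, 2.0, -1.3):
    assert abs(array_factor_sum(8, g) - array_factor_closed(8, g)) < 1e-12
```

sweeping γ (i.e. θ for a fixed progressive phase ϕ) with such a model reproduces the main lobe and side lobes discussed above.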
3 problem formulation

3.1 problem statement

while the amplitude variation can easily be handled by using variable gain amplifiers, the problem of the phase variation is more constraining. there are several techniques which permit solving this problem. the main technical problem with beamforming for transmission is realizing the phase and amplitude of the signals in each antenna channel. the first step is generating the reference signal with the selected or required frequency. it has to be modulated with the information to be transmitted (e.g. radar, communication), then amplified and distributed to all transmit channels. then the individual signals have to be weighted for beamforming, that is, amplified or attenuated according to the desired weighting amplitude. the desired phase has to be realized by a suitable steerable phase shifter or delay line. digital devices are now available under the headline "software defined radio", but this technique is not usable at frequencies over 400 mhz. the techniques pointed out here are designed for applications over 2 ghz. they can be divided into two main approaches: the first uses the signal generated by one oscillator, while the second uses signals generated by an array of coupled oscillators. signals with the same frequency but different phases and amplitudes can be built by delaying the signal generated by one master oscillator using high-frequency power dividers and variable delay lines or butler couplers. this approach is very useful when discrete-time systems are built. another approach is to use polyphase oscillators and a multiplexing system [5]. but with these techniques, continuous phase variations cannot be obtained. the second approach is based on the synchronization of arrays of oscillators whose free-running frequencies have a weak dispersion.
in [6, 8] it was demonstrated that arrays of coupled nonlinear oscillators can synchronize. moreover, according to the coupling strength and to the free-running frequencies, phase variations can be obtained. recent work [14] shows how the phase variation can be guaranteed by changing only the free-running frequencies of all the coupled oscillators in the array. in order to generate these carrier signals, the general schematic of fig. 2 is proposed.

figure 2: unidirectional coupling of a chain of oscillators

this schematic is a variation of york's approach shown in [15]. because of the unidirectional coupling, each slave oscillator is driven only by its left neighbor. hence, the study of this configuration is reduced to the study of a pair of two nonlinear systems (master-slave synchronization). the purpose is to design the parameters of the controller system σc in order to make the output ye(t) tend toward zero. when this objective is fulfilled, the delayed output of the master oscillator and the output of the slave oscillator become identical, so the two oscillators are synchronized. generally, because of the technological realization, the oscillators do not have the same free-running frequencies. this is the reason why the feedback loop was introduced: to guarantee robust synchronization between the two oscillators. the delay element and the variable gain amplifier will guarantee different phases and amplitudes for the output signals. the difference between the oscillators is modeled as a variation of the slave oscillator parameters around those of the master oscillator, which is considered as the reference. the variations due to temperature or to the ageing of the components are modeled by a polytopic uncertainty of the master oscillator parameters around the nominal values.
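the qualitative master-slave behaviour can be previewed with a toy simulation. the sketch below uses dimensionless van der pol dynamics with made-up values (μ = 1, detuned frequencies, and a plain static coupling gain k injected into the slave, i.e. not the dynamic controller σc designed below); it shows a detuned slave being entrained by the master:

```python
# toy simulation: a master van der pol oscillator driving a detuned slave
# through a simple proportional coupling (illustrative values only).

def simulate(t_end=100.0, dt=1e-3, k=20.0, w_m=1.0, w_s=1.1, mu=1.0):
    """explicit euler integration; returns (max early error, max late error)."""
    xm, vm = 2.0, 0.0                  # master initial state
    xs, vs = 1.0, 0.0                  # slave starts elsewhere
    early, late = [], []
    steps = int(t_end / dt)
    for i in range(steps):
        # van der pol accelerations; the slave receives the coupling drive
        am = mu * (1 - xm * xm) * vm - w_m * w_m * xm
        a_s = mu * (1 - xs * xs) * vs - w_s * w_s * xs + k * (xm - xs)
        xm, vm = xm + dt * vm, vm + dt * am
        xs, vs = xs + dt * vs, vs + dt * a_s
        e = abs(xm - xs)
        if i < steps // 10:            # first 10% of the run (transient)
            early.append(e)
        if i >= 9 * steps // 10:       # last 10% of the run (entrained)
            late.append(e)
    return max(early), max(late)

e_early, e_late = simulate()
# with a sufficiently strong coupling the residual error stays small,
# despite the different free-running frequencies and initial conditions
```

this static coupling only illustrates entrainment; the point of the paper is precisely to replace the ad hoc gain k with a dynamic controller that is robust to parameter variations.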
the oscillators are built using a double differential pair structure. in order to determine the parameters of the dynamical controller, the nonlinear oscillators are modeled using the van der pol model. in this article, we consider that the system is a perturbed van der pol model:

ẋ = a(θ1)x + g(x,t, θ1) + bu
y = cx (4)

where

x = [il, v0]^t, a(θ1) = [[0, 1/l0], [−1/c0, 0]], u = iinj/c0,
g(x,t, θ1) = [0, (α/c0)x2 − (β/c0)x2^3]^t, b = [0, 1]^t, c = [0, 1], (5)

with the uncertain parameters θ1 = [α, l0, c0].

3.2 master-slave synchronization

the structure is made up of two different systems which belong to the class previously described. the master system is considered independent (u = 0) and the dynamical controller σc drives the slave system using the error signal as reference. the error signal is constituted by the difference between a delayed version of the master output and the slave output. the state-space representation of the master system can be written as follows:

σm : ẋm = am(θ1)xm + gm(xm,t, θ1), ym = cxm. (6)

for the slave system, the state-space representation can be written as

σs : ẋs = as(θ1)xs + gs(xs,t, θ1) + b2u, ys = cxs. (7)

between the parameters of the master and the slave oscillators, it is considered that there is the same relative difference δ:

ls = lm(1 + δ), cs = cm(1 + δ), αs = αm(1 + δ), βs = βm(1 + δ). (8)

in that case, the notation

am(θ1) = as(θ1) + b1(θ2) (9)

can be introduced: the parameter difference is transformed into a difference between the state matrices of the master and the slave. with the assumptions in (8), b1(θ2) can be written as follows:

b1(θ2) = [[0, −δ/(lm(1 + δ))], [δ/(cm(1 + δ)), 0]]. (10)

if an error state is defined as

e(t) = xm(t − τ) − xs(t), (11)

a state-space representation can be written:

σe : ė = am(θ1)e − b1(θ2)xs + eg(xm, xs,t, θ1) − b2u, ye = ce, (12)

where eg(xm, xs,t, θ1) = gm(xm,t, θ1) − gs(xs,t, θ1).
(13)

3.3 nonlinear bound determination

in order to determine the bounds of the difference of the nonlinearities, the scalar function f : d1 → d2, f(x) = −αx + βx^3, is used. the bounds can be considered as the slopes of the tangents to f(x) passing through x = xm and x = 0:

−α(x2 − x1) ≤ f(x2) − f(x1) ≤ (−α + 3βxm^2)(x2 − x1), ∀x1, x2 ∈ d1. (14)

considering the two nonlinear oscillators and the domain d1 = [−1.35v, 1.35v], the bound of the difference of the nonlinearities (13) can be written as follows:

[0, −(1/c0)(α + 3β−0.22)]^t ≤ eg(xm, xs,t, θ) ≤ [0, α/c0]^t. (15)

3.4 controller synthesis

assume that am(θ1) (resp. b1(θ2)) is a matrix belonging to a polytope of matrices, represented by a convex combination of the extreme matrices ai (resp. b1i), with i = 1 . . . 2^m:

am = { am(θ1) | am(θ1) = ∑_{i=1}^{2^m} ξi ai; ξi ∈ ∆1 } (16)

and consider that the matrix eg(xm, xs,t, θ1) can be bounded, with nb(θ1) being its upper bound:

eg(xm, xs,t, θ1) ≤ nb(θ1)e. (17)

the worst case for our system is the superior limit; the matrix an(θ) = am(θ1) + nb(θ1) is then considered. assume that the dynamical output controller of the system (12) is described by the following state-space representation, of dimension nc:

σc : ẋc = ac xc + bc ye, u = cc xc + dc ye. (18)

the purpose of this controller is to make the slave system follow the delayed output of the master system. this condition is fulfilled when the error signal defined in (11) tends toward zero. the term b1(θ2)xs, representing the difference between the two systems, acts as a perturbation on the error state e. the synthesis of this controller has been made with a technique similar to [9, 10]; the following theorem solves the problem of products of variable matrices in the synthesis problem by introducing extra unknown variable matrices.

theorem 1.
if there exists a set of matrices pi > 0, a state feedback controller k0, an unknown variable square and nonsingular matrix g ∈ r^(nu+nc)×(nu+nc), an unknown variable matrix h ∈ r^(nu+nc)×(nu+nc) and four unknown variable matrices f1, f4 ∈ r^(nx+nc)×(nx+nc), f2 ∈ r^nx×(nx+nc) and f3 ∈ r^(n∞+nc)×(nx+nc) such that the inequality (19) is verified, then the dynamical controller k = g⁻¹l makes the error system (12) asymptotically stable for all matrices an(θ1) and b1(θ2) described as a polytope:

φ2 + sym{ [f1; f2; f3; f4; 0] [0 0 0 0 b̃2] } + sym{ [0; 0; 0; 0; i] l [c̃ 0 0 0 0] } + sym{ [0; 0; 0; 0; i] g [−k0 0 0 0 −i] } < 0, (19)

where sym{x} = xᵀ + x. the matrix φ2 is defined as

φ2 = [[0, 0, cclᵀ, pi, 0], [0, −γi, 0, 0, 0], [ccl, 0, −γi, 0, 0], [pi, 0, 0, 0, 0], [0, 0, 0, 0, 0]] + sym{ [f1; f2; f3; f4; 0] [ã0i b̃1j 0 −i 0] },  ∀i ∈ {1, . . . , 2^m} and ∀j ∈ {1, . . . , 2^p}, (20)

where

ã0i = ãni + b̃k0  and  k = [[dc, cc], [bc, ac]]. (21)

the inequality (19) can be numerically solved using matlab's "lmi toolbox".

4 numerical results

in order to check the theoretical result, a transistor-based simulation has been done using agilent's ads software and mosfet transistors in a 0.35 µm silicon technology. it was considered that all the parameters of an(θ) have ±5% variation around their nominal values. this variation can be seen as a temperature-dependent variation of oscillators built on the same integrated circuit substrate. it is mathematically transformed into a variation of the state matrix an(θ1) inside the polytope. using matlab's "lmi toolbox" applied to the 8 vertices of the polytope, the following output-feedback controller was found:

k = [[dc, cc], [bc, ac]] = [[2.66588·10^11, 22.3994], [7.51499·10^9, −1.1885]].
(22)

it ensures the synchronization of oscillators having δ = ±5% difference between their parameters. this difference is represented by the variation of the perturbation matrix b1(θ2) inside the polytope. the variation between the lm, cm and ls, cs parameters corresponds to a possible difference between the free-running frequencies of the two oscillators:

f0s ∈ [ f0m(1 − |δ|)², f0m(1 + |δ|)² ]. (23)

the difference between αm, βm and αs, βs stands for a possible difference between the transistor operating points of the two nonlinear oscillators. this controller was applied to a pair of nonlinear oscillators whose free-running frequencies are f0m = 2 ghz and f0s = 2.2 ghz. these frequencies were chosen in order to build a discrete-component platform. fig. 3 presents the output voltages of both the master and the slave oscillator; it is divided into three sequences. in the first sequence, the controller σc is not activated and both oscillators oscillate at their free-running frequencies. in the second, at t = 55 ns, the controller σc is activated; the obtained delay is close to the imposed value (τ = t/4 = 1.25·10⁻¹⁰ s). this delay corresponds to an orientation of the main lobe at θ = 120°. finally, at t = 60 ns, in order to verify the robustness of the dynamical controller, the free-running frequency of the master oscillator was changed to fm = 2 ghz. fig. 4 presents the error between the two output signals; it can be seen that the error tends toward zero after a short period of time once the controller is started. (footnote: sym{x} = xᵀ + x, for any square matrix x.)

figure 3: output voltages of both oscillators

figure 4: the error between both signals provided by the oscillators

4.1 array of oscillators

consider the situation where the direction θp = 60° must be privileged and the directions θi1 = 90° and θi2 = 120° must be rejected. table 1 shows the necessary and the simulated values of the amplitudes and phases for n = 8 antennas.
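the correspondence between the imposed delay and the main-lobe direction can be checked with the classical phased-array steering relation. the sketch below assumes half-wavelength element spacing and the steering condition k·d·cos θ + ∆φ = 0 (both standard assumptions, not stated explicitly in the text); the quarter-period delay τ = t/4 at 2 ghz then yields a progressive phase of π/2 and a main lobe at θ = 120°, matching the value quoted above.

```python
import math

f0 = 2e9                       # oscillation frequency [Hz]
tau = 1.25e-10                 # imposed delay, T/4 for a 2 GHz carrier [s]
dphi = 2 * math.pi * f0 * tau  # progressive phase between adjacent elements: pi/2

# with half-wavelength spacing, k*d = (2*pi/lambda)*(lambda/2) = pi, and the
# steering condition k*d*cos(theta) + dphi = 0 gives the main-lobe direction
theta = math.degrees(math.acos(-dphi / math.pi))
print(round(theta, 2))   # -> 120.0
```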
fig. 5 depicts the corresponding radiation pattern. the orientation of the main lobe is close to the desired value (θp = 58.14°), and both interference directions θi1 = 90° and θi2 = 120° are rejected.

5 conclusion

this paper presents a novel method to drive antenna arrays, based on unidirectionally coupled oscillators. an output feedback controller has been designed to ensure synchronization, using advanced control theory with lmi (linear matrix inequalities) tools. the result was successfully extended to a chain of eight unidirectionally coupled oscillators. additional research will be carried out to constrain the dynamical controller to realize the desired delay directly, in order to eliminate the delay element.

table 1: computed and final values for the amplitudes and phases

amplitude necessary [v] | amplitude simulated [v] | phase necessary [°] | phase simulated [°]
0.269 | 0.21 | 0 | 0
0.21 | 0.2 | 143 | 138
0.14 | 0.14 | 180 | 179
0.4 | 0.4 | -162 | 160
0.08 | 0.08 | 0 | -2
0.14 | 0.13 | 63.1 | 57
0 | 0 | 0 | 7
0.3 | 0.29 | -24.4 | -30

figure 5: the radiation pattern in the particular case of θp = 60°

bibliography

[1] l. c. godara (ed.), applications of antenna arrays to mobile communications, part i: performance improvement, feasibility and system considerations, proceedings of the ieee, vol. 85, 1997.
[2] s. chandran, adaptive antenna arrays: trends and applications, springer-verlag, 2004.
[3] s. j. orfanidis, electromagnetic waves and antennas, 2004. http://www.ece.rutgers.edu/~orfanidi/ewa/
[4] r. c. hansen, phased array antennas, wiley-interscience, 2001.
[5] x. guan, h. hashemi, a. hajimiri, a fully integrated 24-ghz eight-element phased-array receiver in silicon, ieee journal of solid-state circuits, vol. 39, no. 12, pp. 2311–2320, 2004.
[6] p. liao, r. a. york, a six-element beam scanning array, ieee microwave and guided wave letters, vol. 4, no. 1, pp. 20–22, 1994.
[7] a. tombak, a.
mortazawi, a novel low-cost beam-steering technique based on the extended resonance power dividing method, ieee transactions on microwave theory and techniques, pp. 1–7, 2003.
[8] r. a. york, t. itoh, injection- and phase-locking techniques for beam control, ieee transactions on microwave theory and techniques, vol. 46, no. 11, pp. 1920–1929, 1998.
[9] d. arzelier, d. peaucelle, s. salhi, robust static output feedback stabilization for polytopic uncertain systems: improving the guaranteed performance bound, in: rocond, milan, italy, 2003.
[10] d. mehdi, e. boukas, o. bachelier, static output feedback design for uncertain linear discrete time systems, ima journal of mathematical control and information.
[11] t. iwasaki, r. e. skelton, k. grigoriadis, a unified algebraic approach to linear control design, taylor and francis, 1998.
[12] d. peaucelle, d. arzelier, an efficient numerical solution for h2 static output feedback synthesis, european control conference.
[13] s. boyd, l. el ghaoui, e. feron, v. balakrishnan, linear matrix inequalities in system and control theory, studies in applied mathematics, vol. 15, usa, 1994.
[14] t. heath, simultaneous beam steering and null formation with coupled, nonlinear oscillator arrays, ieee transactions on antennas and propagation, vol. 53, no. 6, pp. 2031–2035, 2005.
[15] r. a. york, nonlinear analysis of phase relationships in quasi-optical oscillator arrays, ieee transactions on microwave theory and techniques, vol. 41, no. 10, pp. 1799–1809, 1993.

florin hutu, university of poitiers, department laii-esip, 40 avenue du recteur pineau, e-mail: florin.hutu@etu.univ-poitiers.fr

received: november 28, 2007

florin hutu was born in romania in 1977 and received the ph.d. degree in automatic control in 2007. he is currently a postdoctoral researcher at the university of poitiers, where his research interests involve the design of microwave antenna arrays.
sébastien cauet was born in france in 1971 and received the ph.d. degree in automatic control in 1999. he is currently an associate professor of electrical and computer engineering at the university of poitiers, where his research interests involve the control of electrical power rotating motors, the design of microwave antenna arrays, and the application of control to chaos-based transmission.

patrick coirault was born in poitiers, france, in 1965. he received the ph.d. degree from the university of poitiers in 1991. he was an assistant professor at the institute of technology of poitiers from 1992 to 1997. he is now a professor at the institute of technology of chatellerault. his current research interests are in nonlinear identification and control, with applications to chaotic systems.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iv (2009), no. 1, pp. 92-98

a note on the generative power of axon p systems

xingyi zhang, jun wang, linqiang pan
huazhong university of science and technology
department of control science and engineering
key laboratory of image processing and intelligent control
wuhan 430074, hubei, people's republic of china
e-mail: xyzhanghust@gmail.com, junwangjf@gmail.com, lqpan@mail.hust.edu.cn

abstract: axon p systems are a class of spiking neural p systems. in this paper, axon p systems are used as number generators and as language generators. as language generators, the relationships between the families of languages generated by axon p systems and the finite and context-free languages are considered. as number generators, a characterization of the family of finite sets is obtained by axon p systems with only one node; the relationships between the sets of numbers generated by axon p systems and the semilinear sets of numbers are also investigated. this paper partially answers some open problems formulated by h. chen, t.-o. ishdorj and gh. păun.
keywords: membrane computing, sn p systems, axon p systems

1 introduction

spiking neural p systems (in short, sn p systems) are a class of bio-inspired computing devices introduced in [6], which attempt to incorporate the idea of spiking neurons into the area of membrane computing. the resulting models are a variant of tissue-like and neural-like p systems, with specific ingredients and a way of functioning inspired by spiking neurons. in sn p systems, the main "information-processor" is the neuron, while the axon is only a channel of communication without any other role – which is not exactly the case in neurobiology. recently, a special form of spiking neural p systems, called axon p systems, was therefore introduced in [3]; it corresponds to the activity of the ranvier nodes of the neuron axon. actually, axon p systems are a sort of linear sn p systems. spikes are transmitted along the axon, to the left and to the right, from one node to another, and an output is provided by the rightmost node. a symbol bi is associated with a step when i spikes exit the system; in this way a string is associated with a computation. in [3], the language generating power of axon p systems under the above definition was investigated, and many open problems and research topics were formulated. in this paper, we continue the study of axon p systems; specifically, the number generative power and the language generative power are investigated, and in this context we answer some open problems formulated in [3]. sn p systems can be used as number generators (e.g., [2, 4, 6, 7]) or as language generators (e.g., [1, 2, 5, 10, 12]). as a variant of sn p systems, axon p systems can also be used as number generators and language generators.
as a number generator, we do not care whether or not the computation halts; we only request that the output node spikes exactly twice during the computation, and the result of a computation is the number of steps elapsed between the two moments when the output node spikes. in this case, a characterization of the family of finite sets is given by axon p systems with one node, and the relationships between the sets of numbers generated by axon p systems and the semilinear sets of numbers are investigated. as a language generator, each configuration is described by a corresponding string, and the result of a halting computation is defined as the set of strings associated with the configurations where the system emits a spike. in this case, the relationships between the families of languages generated by axon p systems and the finite and context-free languages are considered. the paper is organized as follows. in section 2, formal language theory prerequisites useful in the following sections are recalled. in section 3, the definition of axon p systems and the problems considered in this paper are given. axon p systems as number generators and as language generators are investigated in section 4 and section 5, respectively. conclusions and remarks are drawn in section 6.

2 formal language theory prerequisites

we assume the reader to be familiar with basic language and automata theory, as well as basic membrane computing [8] (for more updated information about membrane computing, please refer to [11]), so we specify here only a few notations and basic definitions. let us start by mentioning the following convention: when comparing two generative or accepting devices, the number zero is ignored (this corresponds to the usual convention in language theory of ignoring the empty string). for an alphabet v, v∗ denotes the set of all finite strings over v, with the empty string denoted by λ.
the set of all nonempty strings over v is denoted by v+. when v = {a} is a singleton, we write simply a∗ and a+ instead of {a}∗, {a}+. the length of a string x ∈ v∗ is denoted by |x|. for a language l ⊆ v∗, the set length(l) = {|x| | x ∈ l} is called the length set of l. the families of finite, regular, linear and context-free languages are denoted by fin, reg, lin, cf, respectively. the families of length sets of languages in fin, reg, lin and cf are denoted by nfin, nreg, nlin, ncf, respectively. the family of languages generated by l systems is denoted by l, and we add the letter e in front of l if extended l systems are used. we also denote by slin the family of semilinear sets of numbers (we work here with one-dimensional vectors, not with semilinear sets of vectors in general). it is known that the following equalities hold: nreg = nlin = ncf = slin (see, e.g., [8]). a regular expression over an alphabet v is defined as follows: (i) λ and each a ∈ v is a regular expression, (ii) if e1, e2 are regular expressions over v, then (e1)(e2), (e1) ∪ (e2), and (e1)+ are regular expressions over v, and (iii) nothing else is a regular expression over v. with each regular expression e we associate a language l(e), defined in the following way: (i) l(λ) = {λ} and l(a) = {a}, for all a ∈ v, (ii) l((e1) ∪ (e2)) = l(e1) ∪ l(e2), l((e1)(e2)) = l(e1)l(e2), and l((e1)+) = l(e1)+, for all regular expressions e1, e2 over v. unnecessary parentheses are omitted when writing a regular expression, and (e)+ ∪ {λ} can also be written as e∗. a chomsky grammar is given in the form g = (n, t, s, p), with n being the nonterminal alphabet, t the terminal alphabet, s ∈ n the axiom, and p the finite set of productions.
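as a small illustration of the equality nreg = slin recalled above, the sketch below (a hypothetical toy regular grammar with productions a1 → bba1 and a1 → b) enumerates derivations and recovers the length set {1, 3, 5, . . .}, an arithmetic progression and hence a semilinear set:

```python
# length sets of regular languages are semilinear: enumerate derivations of
# the toy regular grammar A1 -> bbA1 | b (hypothetical example)
productions = {"A1": ["bbA1", "b"]}

def lengths(limit):
    found = set()
    frontier = ["A1"]          # sentential forms still containing the nonterminal
    while frontier:
        nxt = []
        for s in frontier:
            if "A1" not in s:
                found.add(len(s))        # terminal string: record its length
            elif len(s) <= limit + 2:    # allow one more expansion of A1
                for rhs in productions["A1"]:
                    nxt.append(s.replace("A1", rhs, 1))
        frontier = nxt
    return {n for n in found if n <= limit}

print(sorted(lengths(15)))   # -> [1, 3, 5, 7, 9, 11, 13, 15]
```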
for regular grammars, the productions are of the form u → v, for some u ∈ n, v ∈ t ∪ tn (in regular grammars we also allow productions of the form u → λ, but only when this is useful for simplifying the grammar: because of the convention that the empty string is not counted when comparing the languages generated by two grammars, such productions are not necessary in regular grammars).

3 axon p systems

we now introduce the axon p systems. an axon p system of degree m ≥ 1 is a construct of the form

π = (o, ρ1, . . . , ρm),

where:
1. o = {a} is the singleton alphabet (a is called spike);
2. ρ1, . . . , ρm are (ranvier) nodes, of the form ρi = (ni, ri), 1 ≤ i ≤ m, where:
a) ni ≥ 0 is the initial number of spikes contained in ρi;
b) ri is a finite set of rules of the form e/a^c → (a^l, a^r), where e is a regular expression over a, c ≥ 1, and l, r ≥ 0, with the restriction that r1 contains only rules with l = 0.

the nodes are arranged along an axon in the order ρ1, . . . , ρm, with ρm at the end of the axon; this means that node ρm is the output node of the system. a rule e/a^c → (a^l, a^r) ∈ ri is used as follows. if node ρi contains k spikes, with a^k ∈ l(e) and k ≥ c, then the rule can be applied; this means consuming (removing) c spikes from ρi (thus only k − c spikes remain in ρi) and sending l spikes to its left-hand neighbor and r spikes to its right-hand neighbor. the first node, ρ1, does not send spikes to the left, while in the case of the rightmost node, ρm, the spikes sent to the right are "lost" in the environment. a global clock is assumed, marking the time for the whole system, hence the functioning of the system is synchronized. if a rule e/a^c → (a^l, a^r) has e = a^c, then we write it in the simplified form a^c → (a^l, a^r). if several rules can be used at the same time, the one to be applied is chosen non-deterministically.
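the rule semantics just described can be made concrete with a small simulator. in the sketch below (hypothetical helper names), a rule is a tuple (applicable, c, l, r), where applicable(k) plays the role of the membership test a^k ∈ l(e); one synchronized step applies one applicable rule per node (here, the first applicable one, so this deterministic variant ignores the non-deterministic choice) and redistributes the emitted spikes:

```python
# one synchronized step of an axon P system: nodes hold spike counts and
# rules are tuples (applicable, c, l, r), with applicable(k) standing for
# the test a^k in L(e)  (hypothetical, simplified deterministic variant)

def step(config, rulesets):
    m = len(config)
    consumed = [0] * m
    left = [0] * m    # spikes each node sends to its left neighbor
    right = [0] * m   # spikes each node sends to its right neighbor
    for i, (k, rules) in enumerate(zip(config, rulesets)):
        for applicable, c, l, r in rules:
            if applicable(k) and k >= c:
                consumed[i], left[i], right[i] = c, l, r
                break    # deterministic variant: take the first applicable rule
    new = []
    for i in range(m):
        k = config[i] - consumed[i]
        if i > 0:
            k += right[i - 1]   # spikes arriving from the left neighbor
        if i < m - 1:
            k += left[i + 1]    # spikes arriving from the right neighbor
        new.append(k)
    # spikes sent right by the last node (right[m-1]) go to the environment
    return new

rules1 = [(lambda k: k >= 1, 1, 0, 1)]   # a+/a -> (lambda, a)
rules2 = [(lambda k: k == 2, 2, 0, 1)]   # a^2 -> (lambda, a)
print(step([3, 0], [rules1, rules2]))    # -> [2, 1]
```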
during a computation, a configuration of the system is described by the numbers of spikes present in the nodes; thus, the initial configuration is 〈n1, . . . , nm〉. using the rules as described above, one can define transitions among configurations. a transition between two configurations c1, c2 is denoted by c1 ⇒ c2. if c1 = 〈k1, k2, . . . , km〉, then c2 = 〈k′1, k′2, . . . , k′m〉, where k′i = (ki − ci) + r_{i−1} + l_{i+1}; here ci is the number of spikes consumed in node ρi, and r_{i−1}, l_{i+1} are the numbers of spikes sent to this node by the neighboring nodes, 2 ≤ i ≤ m − 1. in the case of i = 1 or i = m, there is only one neighbor. any sequence of transitions starting in the initial configuration is called a computation. a computation halts if it reaches a configuration where no rule can be used. in this paper, we consider the following two ways of defining the result of a computation of an axon p system.

(i) similarly to [6] and [10], we do not care whether or not the computation halts, but we only request that the output node spikes exactly twice during the computation. then, the number of steps elapsed between the two spikes is the number computed by the axon p system along that computation. we denote by n(π) the set of numbers computed in this way by the axon p system π, and by spikap_m(rule_k, cons_p) the family of sets n(π) generated by axon p systems with at most m nodes, at most k rules in each node, and each rule consuming at most p spikes. as usual, any of these parameters is replaced by ∗ if it is not bounded.

(ii) as formulated in [3], a language is associated with a computation of an axon p system in the following way: with each node ρi we associate a symbol ci, and a configuration 〈k1, . . . , km〉 is described by the string c1^{k1} . . . cm^{km}; then, the result of a halting computation is defined as the set of strings associated with the configurations where the system emits a spike.
all these strings form the language generated by the system. we denote by l(π) the language generated in this way by the axon p system π, and by lap_m(rule_k, cons_p) the family of languages l(π), with m, k, p having the same meaning as above.

4 axon p systems as number generators

in this section, we investigate the number generative power of axon p systems.

4.1 a characterization of nfin

in [6], it has been proved that sn p systems can characterize nfin by using only one neuron. for axon p systems, we have a similar result.

theorem 1. nfin = spikap_1(rule_∗, cons_∗).

proof. the inclusion spikap_1(rule_∗, cons_∗) ⊆ nfin is easily proved: in each step, the number of spikes in an axon p system with only one node decreases by at least one, hence any computation lasts at most as many steps as the number of spikes present in the system at the beginning. thus, axon p systems with only one node can compute only finite sets of numbers. to prove the opposite inclusion, nfin ⊆ spikap_1(rule_∗, cons_∗), let us take a finite set of numbers f = {n1, . . . , nk}, and assume that n1 < n2 < . . . < nk. an axon p system that generates f is shown in figure 1. initially, the node contains nk + 1 spikes, hence only the rule a^{nk+1}/a → (λ, a) can be used in the first step. it consumes one spike and immediately sends a spike to the environment. in the next steps, we continue with the rules a^{nk+1−t}/a → (λ, λ), for t = 1, then t = 2, and so on. but for t = n1 (in step n1 + 1), there is another rule, a^{nk+1−t} → (λ, a), which can be used, non-deterministically chosen. if we choose the second rule, the system sends a spike to the environment again and the computation halts; therefore, the number n1 is generated. if we choose the first rule, the process continues, and in a similar way the numbers n2, . . . , nk−1 can be generated in turn.
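the non-deterministic branching in this construction can be checked exhaustively for a small instance. the sketch below (a hypothetical instance f = {2, 5}) encodes the rules of the single node by the current spike count, explores both branches at every choice point, and collects the numbers of steps elapsed between the two spikes sent to the environment:

```python
# exhaustive exploration of the figure-1 construction for F = {2, 5}
F = {2, 5}
nk = max(F)
generated = set()

def options(k):
    # rules of the single node, selected by the current spike count k;
    # each option is (remaining spikes, does the node spike to the environment)
    opts = []
    t = nk + 1 - k
    if t == 0:
        opts.append((k - 1, True))    # a^{nk+1}/a -> (lambda, a)
    if 1 <= t <= nk - 1:
        opts.append((k - 1, False))   # a^{nk+1-t}/a -> (lambda, lambda)
    if t in F:
        opts.append((0, True))        # a^{nk+1-t} -> (lambda, a), consumes all
    return opts

def explore(k, step, first_spike):
    for new_k, spikes in options(k):
        if spikes and first_spike is not None:
            generated.add(step - first_spike)   # second spike: record the number
        else:
            explore(new_k, step + 1, step if spikes else first_spike)

explore(nk + 1, 1, None)
print(generated == F)   # the system generates exactly the numbers in F
```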
in step nk +  (the last step) only the rule ank+−t → (λ ,a) with t = nk can be used, therefore, the number nk can be generated and this concludes the computation. consequently, all the numbers in f can be generated by the above axon p system, which concludes the proof. a note on the generative power of axon p systems 95 ank+ ank+/a → (λ ,a) ank+−t/a → (λ , λ ) t = , , . . . ,nk −  ank+−t → (λ ,a) t = ni,  ≤ i ≤ k figure 1: an axon p system generating a finite set of numbers 4.2 relationships with semilinear sets of numbers as mentioned in section 4.1, sn p systems with one neuron generate exactly the family of finite sets of numbers. actually, sn p systems with two neurons also characterize the family of finite sets. so, the following results on axon p systems are unexpected: such systems with two nodes generate all semilinear sets of numbers. theorem 2. slin ⊆ spikap(rule∗,cons∗). proof. consider a regular grammar g = (n,t,s,p) with n = {a,a, . . . ,an},n ≥ , s = an, t = {b,b, . . . ,bs}, and the productions in p are of the forms ai → bka j, ai → bk,  ≤ i, j ≤ n,  ≤ k ≤ s. then length(l(g)) can be generated by an axon p system as shown in figure 2. in the initial configuration we have n +  spikes in node ρ and n +  spikes in node ρ, therefore, in the first step, only the rule an+/a → (λ ,a) can be used in node ρ, and the rule an+/a → (λ , λ ) is used in node ρ. thus, a spike is sent to the environment and n spikes remain in node ρ and n spikes in node ρ. in the next step, node ρ fires by a rule a n+i/an+i− j → (an, λ ) or an+i → (λ ,a) associated with a production an → bka j or an → bk from p, for i = n. if the second rule is used, another spike is sent to the environment and the computation halts. in this way, the generated number is . if the first rule is used, n − j spikes are consumed from node ρ. in this step node ρ also fires and sends n spikes to node ρ. 
it will send n spikes back to node ρ as long as it receives n spikes from node ρ. assume that in some step t, the rule an+i/an+i− j → (an, λ ), for ai → bka j, or an+i → (λ ,a), for ai → bk, is used, for some  ≤ i ≤ n, and n spikes are received from node ρ. if the first rule is used, then n spikes are sent to node ρ, n + i − j spikes are consumed, and j spikes remain in node ρ. then in step t + , we have n + j spikes in node ρ, and a rule for ai → bka j or ai → bk can be used. in step t +  node ρ also receives n spikes. in this way, the computation continues. if the second rule is used, then no spike is sent to node ρ, all spikes in node ρ are consumed, and n spikes are received from node ρ, which will remain in node ρ forever. moreover, in this step another spike is sent to the environment again. then the computation halts. we know that, if we use a production ai → bka j or ai → bk one time, then the length of a string generated by the grammar g increases by one, and this corresponds to the number of steps increasing by one by using the associated rule an+i/an+i− j → (an, λ ) or an+i → (λ ,a) one time in the axon p system. moreover, the second spike is sent to the environment and the computation halts whenever a rule an+i → (λ ,a) is used, which corresponds to a production ai → bk being used in g. therefore, the length set of all the strings in l(g) can be generated, and the proof is complete. the inclusion in the previous theorem is proper. theorem 3. spikap(rule,cons) − slin 6= /0. proof. let us consider the axon p system from figure 3. in the initial configuration, only node ρ contains  spikes, hence it fires in the first step by the rule a(a)+/a → (λ ,a) and sends  spikes to node ρ. in the next step, the rule a → (a, λ ) or a → (λ ,a) can be used in node ρ, non-deterministically chosen. 
if we use the rule a → (a, λ ), then we get a number of spikes of the form k +  in the second node, hence the first rule a(a)+/a → (a, λ ) in node ρ is applied as many times as possible, thus returning 96 xingyi zhang, jun wang, linqiang pan -¾ 1 2 an+ an+/a → (λ , λ ) an → (λ ,an) an+ an+/a → (λ ,a) an+i/an+i− j → (an, λ ) for ai → bka j ∈ p an+i → (λ ,a) for ai → bk ∈ p figure 2: an axon p system generating a semilinear set the spikes to node ρ. to end this returning, we have to use the rule a → (a, λ ) in node ρ, which makes again the number of spikes from node ρ be of the form k +  (note that no rule can be applied in any node when the number of spikes is multiple of 4). this process can be iterated any number of times, thus multiplying by  the number of spikes present in node ρ. assume that we have a configuration 〈n + , 〉 in step t, for some n ≥ ; initially, this is the case, with n = . in node ρ, the rule a(a )+/a → (λ ,a) can be used as many times as possible until the node remains with only one spike. it moves all spikes to the second node, multiplied by . therefore, in step t + n− + , we have a configuration 〈, n+〉. in the next step, node ρ can also use the rule a → (λ ,a) instead of a → (a, λ ). then the number of spikes from node ρ is of the form k +  (n+ +  spikes), hence the rule a(a)+/a → (λ ,a) should be applied in step t + n− +  and a spike is sent to the environment. at the same time, the number of spikes decreases by one and will be of the form k +  (n+ +  spikes). therefore, the rule a(a)+/a → (λ , λ ) should be applied in step t + n− + . this rule does not change the form of the number of spikes and it remains in the form of k + , hence it is used as many times as possible. in this way, no spike is sent to the environment until the number of spikes from ρ becomes  and this process needs  n −  steps. 
then, in the step t + n− + n + , the rule a → (λ ,a) should be used and the second spike is sent to the environment. therefore, the number of steps elapsed between the two spikes which were sent to the environment is n and the computation halts. consequently, the set {n | n ≥ } can be generated by this system, which, obviously, is not a semilinear set. -¾ 1 2 a a(a)+/a → (λ ,a) a → (λ ,a) a → (λ ,a) a(a)+/a → (a, λ ) a → (a, λ ) a(a)+/a → (λ ,a) a(a)+/a → (λ , λ ) a → (λ ,a) figure 3: an axon p system generating a non-semilinear set {n|n ≥ } in [6], it has been shown that semilinear sets of numbers can be characterized by sn p systems with a bound on the number of spikes present in any neuron, but the number of neurons is not bounded. the following theorem also gives a characterization of semilinear sets of numbers, but here only two nodes are used in the axon p systems. theorem 4. slin = spikap(rule∗,cons∗,bound∗), where bound∗ indicates that axon p systems have a bound on the number of spikes present in any node, but this bound is not specified. proof. from theorem 2, it is enough to prove the inclusion spikap(rule∗,cons∗,bound∗) ⊆ slin. the proof is similar to the one of lemma 9.1 in [6], so we do not recall it here. a note on the generative power of axon p systems 97 5 axon p systems as language generators we now pass to considering the language generative power of axon p systems. all strings generated by an axon p system in the way mentioned in section 3 are of the form ck c k  . . .c km m , where 〈k,k, . . . ,km〉 is a configuration of the system. therefore, if there is a bound on the number of spikes in any node of axon p systems, we can directly have the following result. remark 5. lapn(rule∗,cons∗,bound∗) ⊆ fin, for n ≥ . moreover, as a direct consequence of this restrictive way of defining the languages associated with axon p systems, we cannot find a characterization of finite languages for the general axon p systems. 
for instance, it is not difficult to find that the string ccc cannot be generated by any axon p system. 5.1 beyond cf theorem 6. there exists at least a language l ∈ el −cf, which can be generated by axon p systems. proof. let us consider the language l = {cn c n  c n  | n ≥ }; it is obvious that l ∈ el −cf. we construct the axon p system whose initial configuration is shown in figure 4. 1 2 3 a (a)+/a → (λ ,a) a (a)+/a → (a,a) a (a)+/a → (a, λ ) (a)+/a → (a,a) -¾-¾ figure 4: an axon p system generating the language {cn c n  c n  | n ≥ } initially, each node of the system contains two spikes. hence, each node can be activated, and in node ρ both the rule (a )+/a → (a, λ ) and the rule (a)+/a → (a,a) can be applied, non-deterministically chosen. if the first rule is applied, then the two spikes in each node are consumed and no spike is sent to the environment. at the same time, each node accumulates 4 spikes (the 4 spikes in nodes ρ and ρ are received from node ρ; the 4 spikes in node ρ are received from both node ρ and node ρ, two from node ρ and two from node ρ). in this way, all the nodes can be activated in the next step. if in node ρ we continue using the rule (a)+/a → (a, λ ), then in a similar way each node obtains 6 spikes. this process can be iterated until the second rule is applied. in this case, the number of spikes in each node increases by two in each step. therefore, each node of the system accumulates n spikes in step n, for n ≥ . at any moment, we can also apply the second rule in node ρ. assume that it is applied in step n + , then node ρ sends a spike to the environment, which means that the string c n  c n  c n  is generated by this system. at the same time, node ρ accumulates n +  spikes, node ρ accumulates n +  spikes, and node ρ accumulates n +  spikes. in the next step, only node ρ can be activated. thus, it consumes two spikes and sends two spikes to node ρ. 
this process can be iterated until all the spikes in node ρ are moved to node ρ, and then the computation halts. therefore, the language {cn c n  c n  | n ≥ } can be generated by the axon p system and this concludes the proof. 6 conclusions and remarks in this paper, the number generative power and the language generative power of axon p systems are investigated. as a variant of sn p systems, there remain many open problems and research topics about axon p systems to be considered. one important aspect is suggested by the research about sn p systems. for instance, as usual in sn p systems, an arbitrary delay can be considered in axon p systems. what about considering the spike trains (finite or infinite) themselves as the result of a computation in axon p systems, as investigated in [10]. the various applications for axon p systems also deserve to be considered. as suggested by gheorghe păun in [9], a more general and probably more interesting problem is combining neurons and axons in a global model; maybe also astrocytes can be added, thus obtaining a 98 xingyi zhang, jun wang, linqiang pan more complex model, closer to reality. please refer to [3, 9] for more open problems and research topics related to axon p systems. acknowledgements comments and suggestions from gheorghe păun are greatly acknowledged. the project was supported by national natural science foundation of china (grant nos. 60674106, 30870826, 60703047, 60503002, and 60533010), 863 program of china (2006aa01z104), program for new century excellent talents in university (ncet-05-0612), ph.d. programs foundation of ministry of education of china (20060487014), chenguang program of wuhan (200750731262), and hust-srf (2007z015a). bibliography [1] h.m. chen, m. ionescu, m. j. pérez-jiménez, r. freund, and gh. păun, on string languages generated by spiking neural p systems, fundamenta informaticae, vol. 75(1–4), pp. 141–162, 2007. [2] h. m. chen, m. ionescu, t.-o. ishdorj, a. păun, gh. 
păun and m. j. pérez-jiménez, spiking neural p systems with extended rules, in: m. a. gutiérrez-naranjo, gh. păun, a. riscos-núñez, f. j. romero-campero, eds., fourth brainstorming week on membrane computing, vol. i, rgnc report 02/2006, research group on natural computing, sevilla university, fénix editora, pp. 241–266, 2006.

[3] h. m. chen, t.-o. ishdorj and gh. păun, computing along the axon, progress in natural science, vol. 17(4), pp. 417–423, 2007.

[4] o. h. ibarra, s. woodworth, f. yu and a. păun, on spiking neural p systems and partially blind counter machines, natural computing, vol. 7(1), pp. 3–19, 2008.

[5] o. h. ibarra and s. woodworth, characterizing regular languages by spiking neural p systems, international journal of foundations of computer science, vol. 18(6), pp. 1247–1256, 2007.

[6] m. ionescu, gh. păun and t. yokomori, spiking neural p systems, fundamenta informaticae, vol. 71(2–3), pp. 279–308, 2006.

[7] m. ionescu, gh. păun and t. yokomori, spiking neural p systems with exhaustive use of rules, international journal of unconventional computing, vol. 3(2), pp. 135–154, 2007.

[8] gh. păun, membrane computing – an introduction, springer-verlag, berlin, 2002.

[9] gh. păun, twenty six research topics about spiking neural p systems, in: m. a. gutiérrez-naranjo, gh. păun, a. romero-jiménez, a. riscos-núñez, eds., fifth brainstorming week on membrane computing, rgnc report 01/2007, research group on natural computing, sevilla university, fénix editora, pp. 263–280, 2007.

[10] gh. păun, m. j. pérez-jiménez and g. rozenberg, spike trains in spiking neural p systems, international journal of foundations of computer science, vol. 17(4), pp. 975–1002, 2006.

[11] the p system web page: http://ppage.psystems.eu

[12] x. y. zhang, x. x. zeng and l. q. pan, on string languages generated by spiking neural p systems with exhaustive use of rules, natural computing, to appear.

xingyi zhang was born in china on june 6, 1982.
he received his master's degree in applied mathematics from huazhong university of science and technology in 2006. currently, his main research fields are formal language theory and its applications and unconventional models of computation, especially membrane computing. he has published several scientific papers in international journals.

linqiang pan was born in zhejiang, china on november 22, 1972. he received his phd from nanjing university in 2000. since 2004, he has been a professor at huazhong university of science and technology, china. his main research fields are graph theory and membrane computing.

international journal of computers communications & control, issn 1841-9836, 11(2):209-223, april 2016.

influence model of user behavior characteristics on information dissemination

s.c. han, y. liu, h.l. chen, z.j. zhang

shaochun han, beijing jiao tong university, beijing, 100044, china, 15620009060@126.com
yun liu, beijing jiao tong university, beijing, 100044, china, liuyun@bjtu.edu.cn
huiling chen*, 1. school of pharmaceutical science and technology, tianjin university; 2. tianjin university of traditional chinese medicine; *corresponding author: 15022613010@163.com
zhenjiang zhang, beijing jiao tong university, beijing, 100044, china, zhjzhang1@bjtu.edu.cn

abstract: quantitative analysis of human behavior, especially mining and modeling its temporal and spatial regularities, is a common focus of statistical physics and complexity sciences. an in-depth understanding of human behavior helps in explaining many complex socioeconomic phenomena, and finds applications in public opinion monitoring, disease control, transportation system design, call center services, and information recommendation. in this paper, we study the impact of human activity patterns on information diffusion. using the sir propagation model and empirical data, we conduct quantitative research on the impact of user behavior on information dissemination.
it is found that when the exponent is small, user behavior is characterized by many new dissemination nodes and fast information dissemination, but the information propagates for only a short time and has limited influence; when the exponent is large, there are fewer new dissemination nodes, but the scope of information dissemination expands and its duration is extended. it is also found that the power-law characteristic of group behaviors has a greater impact on the speed of information dissemination than that of individual behaviors. this study provides a reference for better understanding the influence of the behavior characteristics of social networking users on information dissemination and its kinetic effects.

keywords: sir, behavior dynamics, scaling laws, information dissemination.

1 introduction

the analysis of the time characteristics of user behavior targets the statistical regularities that manifest when humans repeatedly engage in certain activities. it was first addressed by poisson, who introduced the corresponding probability distribution, namely the poisson distribution, in his work on the probability of judgments. when human data collection capabilities were limited, the poisson distribution was widely used as the classic means to depict human activity patterns. in recent years, the emergence of high-performance processors and the constant enhancement of parallel computing power have made the processing of massive social network data possible. at present, a large number of scholars analyze network relationships and identify the underlying objective laws through empirical analysis, research and mining of user behavior characteristics in large data sets, together with simulation techniques [1]-[8].
by analyzing massive data from various networks, more and more facts have proven that the distribution of time intervals between user behaviors has an obvious heavy-tailed effect, which can be well fitted by a power function [16]-[18]. at the same time, scholars have used massive social network data to study human behavior characteristics from many fields, angles and dimensions. for example, x. song et al. [9] analyzed the geographical distribution of twitter users, users' neighbor nodes and the degree correlation coefficient, and grouped twitter users. h. kwak et al. [10] studied the average shortest interval and length of twitter micro-blog posts, post survival time, maximum repost depth and user grouping features; they sorted twitter users according to the number of followers and according to pagerank value, and the final results of the two sorting methods are substantially the same, yet obviously different from the sorting obtained from users' information forwarding numbers, indicating that there is no tight dependency between a user's information forwarding number and the number of neighbor nodes he owns. m. cha et al. [11] studied the effect of core users on information dissemination by comparing the correlation coefficients among the forwarded numbers of twitter users' posts, the numbers of post replies and the numbers of neighbor nodes. the article [12] conducted further analysis of the factors relevant to the forwarding of twitter posts. the article [13] analyzed the download data of scholarly articles from an econophysics web site, and found that the download rate of different papers decreases exponentially per unit time, and that the average download rate f and its fluctuation approximately satisfy σ ∝ f^α, where α lies between 0.6 and 0.9.
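the contrast between poisson-like and heavy-tailed inter-event times can be illustrated with a short stdlib-only sketch; the pareto exponent and the threshold below are hypothetical stand-ins for the empirical fits, not values measured from any of the cited data sets:

```python
import random

def tail_fraction(samples, threshold):
    """fraction of inter-event times exceeding the threshold."""
    return sum(1 for s in samples if s > threshold) / len(samples)

rng = random.Random(42)
n = 100_000

# poisson process: exponential inter-event times (mean 1)
expo = [rng.expovariate(1.0) for _ in range(n)]
# heavy-tailed alternative: pareto inter-event times with exponent 1.5
pareto = [rng.paretovariate(1.5) for _ in range(n)]

# gaps ten times the exponential mean are common under the power law
# but essentially never occur under the poisson assumption
light = tail_fraction(expo, 10.0)
heavy = tail_fraction(pareto, 10.0)
```

for the pareto law the tail probability above 10 is about 10^(-1.5) ≈ 0.03, while for the exponential it is e^(-10) ≈ 4.5·10^(-5), which is the "heavy-tailed effect" described above.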
the paper [14] extracted sina blog users' interaction data; through an analysis of the network degree distribution, it was found that in the sina blog both the in-degree and the out-degree obey power-law distributions, but the exponent of the out-degree is larger than that of the in-degree, which indicates that some blog users add few friends and some users have no friends at all. the studies also found that the correlation coefficient of the blog network's in-link degree distribution is positive, while that of the out-link degree distribution is negative. in this paper, we first carry out a mathematical statistical analysis of the time intervals of user information posting and replying in a qq space data set, then use the sir model [15] to investigate the influence of the time characteristics of user behavior on the information dissemination process in the social network, and make a comparative analysis against a uniform time-stepping model.

2 data

in this paper, the authors utilized a qq space data set. the data set was obtained with a crawling program written in python. the program logs into qq space through a simulated browser, automatically accesses the interaction information between each user and his friends, and writes it into a corresponding xml file; the xml is then parsed into a database with python, and hidden qq numbers and other abnormal records are removed to obtain the final qq space data set. the topological statistics of the data set are as follows: v, the number of nodes, is 4800; e, the number of edges, is 66475; d, the network diameter, is 5; c, the network clustering coefficient, is 0.423; kmax, the maximum node degree, is 854; k, the average node degree, is 30.882.

2.1 all nodes interval features

all nodes features refers to the overall behavior features of all the nodes in the data set.
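the topological quantities listed above (node count v, edge count e, maximum degree kmax, average degree k and clustering coefficient c) can be recomputed from any edge list with the python standard library; the toy edge list below is purely illustrative and is not the qq data:

```python
from collections import defaultdict

def graph_stats(edges):
    """basic statistics of an undirected graph given as (u, v) pairs:
    node count, edge count, max degree, average degree, mean clustering."""
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:
            adj[u].add(v)
            adj[v].add(u)
    n = len(adj)
    m = sum(len(nb) for nb in adj.values()) // 2
    kmax = max(len(nb) for nb in adj.values())
    # local clustering: fraction of a node's neighbour pairs that are linked
    cs = []
    for u, nb in adj.items():
        k = len(nb)
        if k < 2:
            cs.append(0.0)
            continue
        links = sum(1 for a in nb for b in nb if a < b and b in adj[a])
        cs.append(2.0 * links / (k * (k - 1)))
    return n, m, kmax, 2.0 * m / n, sum(cs) / n

# toy graph: a triangle with one pendant node attached
v, e, kmax, kavg, c = graph_stats([(1, 2), (2, 3), (1, 3), (3, 4)])
```

on the toy graph this yields v = 4, e = 4, kmax = 3, average degree 2 and mean clustering 7/12.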
figure 1 shows the analysis of group posting behavior characteristics in each board; figure 2 shows the analysis of group reply behavior characteristics in each board. it can be found from the figures that all group behaviors are consistent with a power-law distribution, and the power exponent can be obtained by fitting. in the analysis of posting behavior characteristics, the exponent of the message board is the largest, reaching 1.1223, while the exponent of the talk board is only 0.816, with the exponents of the group posting and log boards near the average; in the analysis of reply behavior characteristics, however, the exponent of the talk board is the largest, 1.1041, and the smallest is the exponent of the log board. thus posting and replying behavior characteristics differ even within the same board.

figure 1: analysis of group posting behavior characteristics in each board

2.2 individual node interval features

the paper also analyzes the features of individual behavior. first, the nodes in the data set are sorted in descending order of the number of posts and the number of replies; then the data of four nodes are selected: the node ranked first (denoted a), at the three-quarters position (denoted b), in the middle (denoted c) and at the one-quarter position (denoted d), and the posting and reply behavior characteristics of these nodes are analyzed. as shown in figures 3 and 4, the actual numbers of posts and replies of nodes a and b are at a high level, so their power exponents of posting and replying are relatively close, which shows that active nodes usually deal with things through a specific behavior pattern, consistent with the conclusions in the literature; at nodes c and d, the actual number of posts is small but the number of replies is larger, so the power exponent of their posting behavior is small while the power exponent of their reply behavior is large.
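the exponents quoted above come from fitting a power law to the interval distributions; a minimal log-log least-squares estimator, checked here on synthetic noiseless data rather than on the qq measurements, can be sketched as:

```python
import math

def powerlaw_exponent(xs, ys):
    """least-squares slope of log(y) against log(x); for y ~ x^(-a)
    the estimate of a is minus the slope, reported positive as in the text."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mx, my = sum(lx) / n, sum(ly) / n
    num = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    den = sum((a - mx) ** 2 for a in lx)
    return -num / den

# sanity check: data drawn exactly from y = x^(-1.1) recovers exponent 1.1
xs = [float(i) for i in range(1, 50)]
a = powerlaw_exponent(xs, [x ** -1.1 for x in xs])
```

for real, noisy interval histograms, binning and cutoff choices matter; the sketch only shows the core regression step.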
to explain this phenomenon, the authors used the nc algorithm to calculate the network binding index of these four nodes and found that nodes a and b have greater binding, while the binding of nodes c and d is small.

figure 2: analysis of group reply behavior characteristics in each board

from the exponent distributions of the posting and reply behaviors of the two types of nodes and from the calculated nc index values, it can be seen that very active nodes such as a and b occupy a dominant position in message exchange, so their posted content has high credibility and timeliness and attracts increasing follow-up reply interaction; nodes c and d, being at an information disadvantage, can only attract other nodes to forward information through their replying behavior. therefore, the posting and replying behaviors of active nodes are positively related, while the posting and replying behaviors of non-active nodes are negatively related.

2.3 cluster interval features

according to the above sorting results, the data set is divided into 20 equal portions, which generate 20 clusters (higher-ranking clusters have a higher degree of activity), and the distribution exponent of each cluster's posting and reply behavior intervals is calculated. in figures 5 and 6, from left to right, are the posting and reply behavior time interval distributions of the eighth, tenth and twelfth clusters. it can be found that as the degree of cluster activity decreases, the power exponents of the three clusters' posting and reply behaviors also decrease. figure 7 shows the exponent distribution of the posting behavior of the 20 clusters; from the trend of the curve we can conclude that the degree of cluster activity is positively correlated with the power exponent, which is consistent with the analysis results in the literature.
2.4 bm phase diagram analysis

when the time interval between events follows a power-law distribution, many events occur concentrated within a relatively short period of time, followed by long idle periods; this situation is called the paroxysmal (bursty) feature of events. in terms of the event time interval distribution, for a system with a strong paroxysmal feature most time intervals are smaller than the average interval, but relatively large intervals may also occur; this means that the standard deviation of the interval distribution is relatively large. the paroxysmal feature can be measured by the variation coefficient b of the time intervals, as shown in formula 1:

b ≡ (σ_τ/m_τ − 1) / (σ_τ/m_τ + 1) = (σ_τ − m_τ) / (σ_τ + m_τ) (1)

figure 3: characteristics of individual posting behavior. figure 4: characteristics of individual reply behavior. figure 5: analysis of three clusters' post behavior characteristics. figure 6: analysis of three clusters' reply behavior characteristics. figure 7: relation diagram of the 20 clusters' exponent distribution and degree of activity.

here σ_τ and m_τ represent the standard deviation and the average value of the interval distribution p_τ, and the value range of b is [-1, 1]. for the poisson distribution the average value and the standard deviation are equal, so the paroxysmal measure is 0, which can be seen as the neutral equilibrium point in the sense of goh and barabási. for strictly recurring events, the time interval distribution is actually a δ function, the standard deviation is 0, and the b value is -1. for the power-law distribution, the standard deviation is much larger than the average value and b is close to 1. that is, a value close to 1 indicates strong paroxysmal behavior; a value close to 0 indicates neutrality, i.e. a random event series; a value close to -1 indicates no paroxysmal behavior, i.e. cyclical, periodic events.
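formula 1 is straightforward to compute from a list of inter-event times; a small sketch with two synthetic sequences (illustrative data, not measurements from the qq set):

```python
import math

def burstiness(intervals):
    """variation coefficient b of formula 1:
    b = (sigma - m) / (sigma + m) for a sequence of inter-event times."""
    n = len(intervals)
    m = sum(intervals) / n
    sigma = math.sqrt(sum((t - m) ** 2 for t in intervals) / n)
    return (sigma - m) / (sigma + m)

# strictly periodic events: sigma = 0, so b = -1
b_periodic = burstiness([5.0] * 100)
# a bursty sequence: many short gaps and one very long one pushes b towards 1
b_bursty = burstiness([1.0] * 99 + [1000.0])
```

the two extreme cases reproduce the interpretation given in the text: periodic events sit at b = -1, while a strongly bursty sequence approaches b = 1.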
in addition to the paroxysmal feature, event characteristics may also be depicted by memory: the time sequence of events has a certain memory if a long interval tends to be followed by another long interval, and a short interval by another short interval. arrange all the time intervals (between two successive behaviors) into a sequence according to their time of occurrence, and assume this sequence has n_τ elements, i.e. n_τ + 1 events occurred; define the first n_τ − 1 elements as sequence 1 and the last n_τ − 1 elements as sequence 2. as shown in formula 2, the pearson correlation of the two sequences can be used to measure the memory of the sequence:

m ≡ (1/(n_τ − 1)) ∑_{i=1}^{n_τ−1} (τ_i − m_1)(τ_{i+1} − m_2) / (σ_1 σ_2) (2)

m_1 and m_2 are the means of sequence 1 and sequence 2 respectively, and σ_1 and σ_2 are the standard deviations of sequence 1 and sequence 2. obviously, the value range of m is also [-1, 1]: m > 0 represents a memory effect and m < 0 an anti-memory effect. when m is close to 1, a long (short) time interval tends to be followed by another long (short) interval; when m is close to 0 the sequence is neutral; when m is close to -1, a long (short) interval tends to be followed by a short (long) interval.

figure 8: bm phase diagram of posting behavior

figure 8 shows the bm phase diagrams for the sets of users in the data set whose posting amounts exceed 10, 30, 50, 100, 200, 400, 600, 800 and 1000 posts, respectively. it can be found from the figure that, although the numbers of posts vary greatly, the average point locations in the bm phase diagram are stable, with horizontal coordinates between 0.244 and 0.26 and vertical coordinates between 0.416 and 0.464, indicating that in qq space user posting behavior has obvious paroxysmal and memory features.
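formula 2 is the pearson correlation between consecutive intervals; the sketch below, complementing the burstiness measure, checks it on two synthetic sequences where the sign of m is known by construction:

```python
import math

def memory_coefficient(intervals):
    """memory coefficient m of formula 2: pearson correlation between
    the sequences (tau_1 .. tau_{n-1}) and (tau_2 .. tau_n)."""
    seq1, seq2 = intervals[:-1], intervals[1:]
    n = len(seq1)
    m1, m2 = sum(seq1) / n, sum(seq2) / n
    s1 = math.sqrt(sum((t - m1) ** 2 for t in seq1) / n)
    s2 = math.sqrt(sum((t - m2) ** 2 for t in seq2) / n)
    return sum((a - m1) * (b - m2) for a, b in zip(seq1, seq2)) / (n * s1 * s2)

# monotonically growing gaps: long follows long, so m = +1
m_up = memory_coefficient([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
# alternating short/long gaps: long follows short, so m = -1
m_alt = memory_coefficient([1.0, 9.0, 1.0, 9.0, 1.0, 9.0])
```

plotting (b, m) pairs for many users gives exactly the bm phase diagram of figure 8.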
3 model building

in this paper, the information dissemination model is built on the sir model of infectious disease dynamics. users of the social network are defined as nodes, friendship relations between nodes are defined as edges, and nodes are divided into three categories: stiflers, ignorants and spreaders. a stifler is a node that has received and knows the content of the information but does not take the initiative to disseminate it; an ignorant is a node that has not yet received the information sent by its neighboring nodes, so it does not disseminate the information, but it will receive it with a certain probability; a spreader is a node that has received the information from one of its neighboring nodes and, at the next time step, will take the initiative to disseminate it to its neighboring nodes. for a spreader, if the neighbor node receiving the information is an ignorant, then that neighbor may become a spreader and disseminate the information at the next time step; if the neighbor node receiving the information is a spreader or a stifler, i.e. the receiver has already received the information, then the receiving party abandons further spreading of this information, and the sending spreader changes its own state into stifler. it can thus be seen that, for a social network node, the transition between the stifler, ignorant and spreader states depends not only on the node's state at the current time step but also on the states of the neighbor nodes with which it interacts.
at the same time, the model assumes that: the social network structure is relatively static, i.e. the number of nodes, the edge relationships and the weights between nodes do not change with time; a node sends messages to its neighbor nodes at certain time steps, and after a neighbor node receives the information it forwards it with a certain probability; information spreads only among neighbors. to sum up, the state transition rules are as follows:

1. infection probability λ: after a spreader sends a message to its neighbor nodes, an ignorant becomes a spreader with probability λ, and this information will be disseminated at the next time step;

2. stifling probability µ: a spreader becomes a stifler with probability µ if and only if it contacts spreaders or stiflers among its neighbors;

3. attenuation probability υ: with the passage of time, a spreader will no longer take the initiative to disseminate information; we therefore define the attenuation probability υ, namely, when a spreader has no interaction with any neighbor node, it automatically becomes a stifler with probability υ.

the probability λ that an ignorant i believes and actively disseminates the information is associated with the spreading influence of its neighbor nodes, so the spread probability of node i is:

λ_i = 1 − ∏_{j∈ϕ_i} (1 − ω_ij / ∑_{m=1}^{k_i} ω_im) (3)

where ϕ_i is defined as the set of neighbor nodes of node i that are in the spreading state, and ω_ij is defined as the connection weight between node i and node j. according to the sir propagation model, the entire information dissemination process is:

spreaders(j) + ignorant(i) --λ--> spreaders(j) + spreaders(i)
spreaders(m) + spreaders(n) --µ--> stiflers(m) + spreaders(n)
spreaders(m) + stiflers(k) --µ--> stiflers(m) + stiflers(k)
spreaders(p) --υ--> stiflers(p) (4)

3.1 propagation model

according to the evolution rules above, we build the mean-field differential evolution equations of the information dissemination model.
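the transition rules can also be checked directly with a stochastic simulation; the sketch below uses uniform probabilities in place of the weighted formula 3, and the small ring network, seed and parameter values are arbitrary choices for illustration, not the configuration used in the paper:

```python
import random

def simulate(adj, lam, mu, ups, seed_node, rng, max_steps=1000):
    """synchronous simulation of the rumor model: spreaders ('s') infect
    ignorant neighbors ('i') with prob lam, become stiflers ('r') with
    prob mu after contacting an aware (spreader/stifler) neighbor, and
    decay into stiflers with prob ups when no neighbor is aware.
    returns the final reach: nodes that ever learned the information."""
    state = {v: 'i' for v in adj}
    state[seed_node] = 's'
    for _ in range(max_steps):
        if all(s != 's' for s in state.values()):
            break
        nxt = dict(state)
        for v, s in state.items():
            if s != 's':
                continue
            met_aware = False
            for u in adj[v]:
                if state[u] == 'i':
                    if rng.random() < lam:
                        nxt[u] = 's'
                else:
                    met_aware = True
            if rng.random() < (mu if met_aware else ups):
                nxt[v] = 'r'
        state = nxt
    return sum(1 for s in state.values() if s != 'i')

# 20-node ring with symmetric chords, seeded at node 0
adj = {i: {(i - 1) % 20, (i + 1) % 20, (i + 5) % 20, (i - 5) % 20}
       for i in range(20)}
reach = simulate(adj, lam=0.8, mu=0.2, ups=0.1, seed_node=0,
                 rng=random.Random(7))
```

the returned reach lies between 1 (the seed alone) and the network size, and averaging it over many runs approximates the final stifler plus spreader density of the mean-field treatment below.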
define s, i and r to represent the spreaders, ignorant and stiflers states. for nodes of degree k, define the numbers of nodes in the spreaders, ignorant and stiflers states as m_{k,s}, m_{k,i} and m_{k,r}, and the total number of nodes of degree k as m_k, so that:

m_k = m_{k,s} + m_{k,i} + m_{k,r} (5)

assume that in the social network node i is in the ignorant state at time t. for the period [t, t+△t], define p^i_{ii} as the probability that i remains in the ignorant state, and p^i_{is} as the probability that i changes from the ignorant state to the spreaders state, with p^i_{is} = 1 − p^i_{ii}. if i remains in the ignorant state during △t, every neighbor of i in the spreaders state has failed to disseminate the information to i, so p^i_{ii} can be expressed as:

p^i_{ii} = ∏_{m=0}^{g} (1 − △t λ^i_m) (6)

where g = g(t) is defined as the number of spreader neighbors of node i at time t. the probability of having exactly g spreaders among the k neighbors is binomial:

π(g,t) = c(k,g) ϕ(k,t)^g (1 − ϕ(k,t))^{k−g} (7)

where ϕ(k,t) is defined as the probability that an ignorant node of degree k has an adjacency relationship with some spreader at time t:

ϕ(k,t) = ∑_{k′} p(k′ | k) p(s_{k′} | i_k) ≈ ∑_{k′} p(k′ | k) ρ_s(k′, t) (8)

in formula 8, p(k′ | k) is the degree correlation function, i.e. the conditional probability that a node of degree k has an adjacency relationship with a node of degree k′; p(s_{k′} | i_k) is the probability that a neighbor of degree k′ of an uninfected node of degree k is in the propagation state; ρ_s(k′, t) is defined as the density of spreaders of degree k′ at time t. by traversing all possible g and i, the mean of p^i_{ii} can be calculated, i.e. p_{ii}(k,t), which is used in the mean-field dynamic equations of the model:

p_{ii}(k,t) = (1/m_k) ∑_i ∑_{g=0}^{k} [∏_{m=0}^{g} (1 − △t λ^i_m)] c(k,g) ϕ(k,t)^g [1 − ϕ(k,t)]^{k−g} (9)

where m_k is the total number of nodes of degree k in the network.
for the period [t, t+△t], define p_{ss} as the probability that a spreader maintains the spreading state. traversing all possible g, the average probability p_{ss}(k,t) for a node of degree k to maintain the spreading state is:

p_{ss}(k,t) = ∑_{g=0}^{k} c(k,g) (1 − µ△t)^g ϕ(k,t)^g (1 − ϕ(k,t))^{k−g} (1 − υ△t)
= [(1 − µ△t) ϕ(k,t) + 1 − ϕ(k,t)]^k (1 − υ△t)
= (1 − µ△t ϕ(k,t))^k (1 − υ△t)
= (1 − µ△t ∑_{k′} p(k′ | k) [ρ_s(k′, t) + ρ_r(k′, t)])^k (1 − υ△t) (10)

where, for the stifling process, ϕ(k,t) is the probability that a neighbor is a spreader or a stifler, ϕ(k,t) = ∑_{k′} p(k′ | k) [ρ_s(k′, t) + ρ_r(k′, t)], since contact with either kind of node stifles a spreader. the probability for a spreader to become a stifler (by contact) can be expressed as p_{sr}(k,t) = 1 − p_{ss}(k,t). on the basis of the average state transition probabilities and the node transition rules, the changes of the numbers of degree-k nodes in the three states during [t, t+△t] can be obtained, as shown in equations 11, 12 and 13:

m_{k,i}(t+△t) = m_{k,i}(t) − m_{k,i}(t) (1 − p_{ii}(k,t)) (11)

m_{k,s}(t+△t) = m_{k,s}(t) + m_{k,i}(t) (1 − p_{ii}(k,t)) − m_{k,s}(t) (1 − p_{ss}(k,t)) (12)

m_{k,r}(t+△t) = m_{k,r}(t) + m_{k,s}(t) (1 − p_{ss}(k,t)) (13)

to simplify the calculations, denote Φ(k,g,t) = c(k,g) ϕ(k,t)^g (1 − ϕ(k,t))^{k−g}. then formula 11 can be rewritten as:

[m_{k,i}(t+△t) − m_{k,i}(t)] / m_k = −(m_{k,i}(t)/m_k) [1 − (1/m_k) ∑_i ∑_{g=0}^{k} ∏_{m=0}^{g} (1 − △t λ^i_m) Φ(k,g,t)] (14)

denote ρ_i(k,t) = m_{k,i}(t)/m_k, divide both ends of formula (14) by △t and let △t → 0.
since lim_{△t→0} [1 − ∏_{m=0}^{g} (1 − △t λ^i_m)] / △t = ∑_{m=0}^{g} λ^i_m, formula (14) becomes:

∂ρ_i(k,t)/∂t = −(ρ_i(k,t)/m_k) ∑_i ∑_{g=0}^{k} Φ(k,g,t) ∑_{m=0}^{g} λ^i_m (15)

similarly, it can be derived from formula (13) that:

[m_{k,r}(t+△t) − m_{k,r}(t)] / m_k = ρ_s(k,t) (1 − p_{ss}(k,t))
= ρ_s(k,t) [1 − (1 − µ△t ∑_{k′} p(k′ | k) [ρ_s(k′, t) + ρ_r(k′, t)])^k (1 − υ△t)] (16)

for the right-hand side of formula (16), expanding by the binomial theorem and keeping terms of first order in △t:

(1 − µ△t ∑_{k′} p(k′ | k) [ρ_s(k′, t) + ρ_r(k′, t)])^k
= ∑_{n=0}^{k} c(k,n) (−µ△t ∑_{k′} p(k′ | k) [ρ_s(k′, t) + ρ_r(k′, t)])^n
≈ 1 − kµ△t ∑_{k′} p(k′ | k) [ρ_s(k′, t) + ρ_r(k′, t)] (17)

therefore, formula (16) becomes:

[m_{k,r}(t+△t) − m_{k,r}(t)] / m_k = ρ_s(k,t) (kµ△t ∑_{k′} p(k′ | k) [ρ_s(k′, t) + ρ_r(k′, t)] + υ△t + o(△t²)) (18)

dividing both ends of formula (18) by △t and taking the limit △t → 0:

∂ρ_r(k,t)/∂t = kµ ρ_s(k,t) ∑_{k′} [ρ_s(k′, t) + ρ_r(k′, t)] p(k′ | k) + υ ρ_s(k,t) (19)

since ∂ρ_i(k,t)/∂t + ∂ρ_s(k,t)/∂t + ∂ρ_r(k,t)/∂t = 0, it is easy to obtain:

∂ρ_s(k,t)/∂t = (ρ_i(k,t)/m_k) ∑_i ∑_{g=0}^{k} Φ(k,g,t) ∑_{m=0}^{g} λ^i_m − kµ ρ_s(k,t) ∑_{k′} [ρ_s(k′, t) + ρ_r(k′, t)] p(k′ | k) − υ ρ_s(k,t) (20)

formulas (15), (19) and (20) together constitute the dynamical evolution equations of information dissemination in the social network, depicting how the densities of spreaders, ignorants and stiflers change over time.

4 simulation analysis

figure 9 shows the propagation and evolution of the stiflers, ignorant and spreaders densities under different power exponents α and under uniform time stepping (α = 0); from left to right are the spreaders density evolution diagram, the ignorant evolution diagram and the stiflers evolution diagram.
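evolution equations of this type can be explored numerically; below is a forward-euler sketch of the degree-homogeneous special case (every node of degree k, infection pressure k·λ·ρ_i·ρ_s), which is a deliberate simplification of the full degree-resolved system, with arbitrary illustrative parameter values:

```python
def integrate(k, lam, mu, ups, dt=0.01, t_max=200.0):
    """forward-euler integration of the homogeneous-network reduction:
      d(rho_i)/dt = -k*lam*rho_i*rho_s
      d(rho_r)/dt =  k*mu*rho_s*(rho_s + rho_r) + ups*rho_s
      d(rho_s)/dt = -(the two increments above), so densities sum to 1."""
    ri, rs, rr = 0.99, 0.01, 0.0
    t = 0.0
    while t < t_max and rs > 1e-7:
        d_ri = -k * lam * ri * rs
        d_rr = k * mu * rs * (rs + rr) + ups * rs
        ri += dt * d_ri
        rr += dt * d_rr
        rs -= dt * (d_ri + d_rr)  # conservation of total density
        t += dt
    return ri, rs, rr

ri, rs, rr = integrate(k=6, lam=0.3, mu=0.2, ups=0.05)
```

the run reproduces the qualitative picture of figure 9: the spreader density rises to a peak and then dies out, while the ignorant density decreases monotonically and the stifler density grows.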
it can be seen from the spreaders density evolution diagram that, as the power exponent α decreases, the peak of the spreaders density declines, but the final duration of information dissemination is prolonged; that is, the smaller the heterogeneity of the temporal characteristics of user behavior, the more the breadth of information dissemination may be affected, while the final dissemination duration is extended. it can be seen from the ignorant and stiflers evolution diagrams that, for the former, the smaller the power exponent α, the greater the final proportion of ignorants, while for the latter, the smaller the power exponent α, the smaller the final proportion of stiflers.

figure 9: influence of time-order character on propagation

figure 10 (left: evolution diagram of the proportion of new spreaders per time step; right: evolution diagram of the proportion of cumulative spreaders) confirms from another angle that the power-law characteristic of the user activity time distribution has a huge impact on information dissemination. when the power exponent is small, although there are many new spreaders, the information dissemination comes and goes quickly and may not have a great impact; when the power exponent is large, although the number of nodes disseminating the information is small, dissemination lasts longer and produces a more lasting influence, which explains the phenomenon of some information getting spread again after a long silence.

the relaxation time refers to the time required for the model to evolve from the start until it becomes stable. figure 11 (left) shows the relationship between the power exponent and the relaxation time, measured as the time t by which the number of spreaders passes one half; it can be seen from the figure that, as the power exponent increases, the relaxation time shows an obvious linear downward trend. since the power-law distribution of user behavior can be expressed at the group level and at the individual-node level, the paper also conducted a simulation analysis of the impact of the power-law characteristic at the group and individual levels on the information dissemination process. the so-called power-law distribution at the group level means that the behavior time sequence of each node is regular, i.e. the time interval of each node's behavior remains unchanged, but the degree of activity varies greatly between nodes and satisfies a power-law distribution. the power-law distribution at the individual level means that the time interval distribution of an individual's own behavior has the power-law characteristic, while the degrees of activity of different individuals are the same. the center and right panels of figure 11 show, respectively, the impact of the power-law distribution characteristic at the group level and at the individual level on the speed of information dissemination, again measured as the time by which the number of spreaders passes one half. it can be seen from these figures that, at the group level, the power-law distribution has a greater impact on the dissemination of information, while the power-law distribution at the individual level has less effect on the dissemination of information.
figure 12 shows the relation between the maximum propagation range of information dissemination c_max, the maximum node propagation density s_max and the power exponent. as can be seen from the figure, as the power exponent increases, the maximum propagation range c_max and the maximum node propagation density s_max increase simultaneously; but when the power exponent α > 1.5, the maximum propagation range stabilizes, while the exponent still has some impact on the maximum node propagation density s_max.

figure 11: left: relationship between power exponent α and relaxation time. center: impact of the power-law distribution characteristic at the group level on the speed of information dissemination. right: impact of the power-law distribution characteristic at the individual level on the speed of information dissemination.

figure 12: left: relation diagram between the maximum propagation range of information dissemination and the power exponent. right: relation diagram between the maximum node propagation density and the power exponent.

5 conclusion

using empirical data from social networks, we conduct a quantitative analysis of the time interval characteristics of user behavior, which helps to explain many information propagation phenomena in complex networks and can generate social benefits and application value in public opinion monitoring, disease prevention, information recommendation and other areas. firstly, the paper makes use of real social network user behavior data to analyze the information posting and reply behaviors of network groups, network individuals and network clusters, and then uses the bm phase diagram to analyze the paroxysmal and memory characteristics of users' information posting and reply behaviors.
by means of the sir propagation model and empirical data, we carry out a quantitative study of the impact of the time interval characteristics of user behavior on the information dissemination process. we find that when user behavior time intervals follow a power-law distribution, the speed of information dissemination is slowed to a great extent, but the duration of dissemination is extended and the ultimate scale of dissemination can increase, so that the information is more likely to have a greater impact on society. we also find that at the group level the power-law characteristic has a greater impact on the speed of information dissemination, while at the individual level the speed of information dissemination is less affected by the power-law characteristic.

acknowledgment

this research is supported by: 1. national natural science foundation of china (no. 61271308); 2. beijing natural science foundation (no. 4112045); 3. beijing city science and technology project (no. z121100000312024); 4. specialized research fund for the doctoral program of higher education of china (no. w11c100030).

bibliography

[1] zhou, t.; han, x.p. et al (2013); statistical mechanics on temporal and spatial activities of human, journal of university of electronic science and technology of china, issn 1674-862x, 4(4):481-540.

[2] zhang, h.p. (2015); an agent-based simulation model for supply chain collaborative technological innovation diffusion, international journal of simulation modelling, issn 1726-4529, 14(2):313-324.

[3] liu, s.; gong, d. (2014); modelling and simulation on recycling of electric vehicle batteries using agent approach, international journal of simulation modelling, issn 1726-4529, 13(1):79-92.

[4] pasztor, a. (2014); gathering simulation of real robot swarm, technical gazette, issn 1848-6339, 21(5):1073-1080.
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iv (2009), no. 3, pp. 244-252
first steps towards a cpu made of spiking neural p systems
miguel a. gutiérrez-naranjo, alberto leporati
miguel a. gutiérrez-naranjo: university of sevilla, department of computer science and artificial intelligence, avda. reina mercedes s/n, 41012, sevilla, spain, e-mail: magutier@us.es
alberto leporati: università degli studi di milano – bicocca, dipartimento di informatica, sistemistica e comunicazione, viale sarca 336/14, 20126 milano, italy, e-mail: alberto.leporati@unimib.it
received: april 5, 2009 accepted: may 30, 2009
abstract: we consider spiking neural p systems as devices which can be used to perform some basic arithmetic operations, namely addition, subtraction, comparison and multiplication by a fixed factor. the input to these systems consists of natural numbers expressed in binary form, encoded as appropriate sequences of spikes. a single system accepts input numbers of any size. the present work may be considered as a first step towards the design of a cpu based on the working of spiking neural p systems.
keywords: spiking neural p systems, arithmetic operations, membrane computing
1 introduction
spiking neural p systems (sn p systems, for short) have been introduced in [3] as a new class of distributed and parallel computing devices.
they were inspired by membrane systems (also known as p systems) [12, 13, 7] and are based on the neurophysiological behavior of neurons sending electrical impulses to other neurons. in sn p systems the processing elements are called neurons and are placed in the nodes of a directed graph, called the synapse graph. the contents of each neuron consist of a number of copies of a single object type, namely the spike. each neuron may also contain rules which allow it to remove a given number of spikes, or to send spikes (possibly with a delay) to other neurons. the application of every rule is determined by checking the contents of the neuron against a regular set associated with the rule.
formally, an sn p system of degree m ≥ 1, as defined in [4], is a construct of the form π = (O, σ1, σ2, . . . , σm, syn, in, out), where:
- O = {a} is the singleton alphabet (a is called spike);
- σ1, σ2, . . . , σm are neurons, of the form σi = (ni, Ri), with 1 ≤ i ≤ m, where ni ≥ 0 is the initial number of spikes contained in σi, and Ri is a finite set of rules of the following two forms:
  (1) E/a^c → a; d, where E is a regular expression over a, and c ≥ 1, d ≥ 0 are integer numbers. if E = a^c, then the rule is usually written in the simplified form a^c → a; d; similarly, if a rule E/a^c → a; d has d = 0, then we can simply write it as E/a^c → a. hence, if a rule E/a^c → a; d has E = a^c and d = 0, then we can write a^c → a;
  (2) a^s → λ, for s ≥ 1, with the restriction that for each rule E/a^c → a; d of type (1) from Ri we have a^s ∉ L(E) (where L(E) denotes the regular language defined by E);
- syn ⊆ {1, 2, . . . , m} × {1, 2, . . . , m}, with (i, i) ∉ syn for 1 ≤ i ≤ m, is the directed graph of synapses between neurons;
- in, out ∈ {1, 2, . . . , m} indicate the input and output neurons of π.
the rules of type (1) are called firing (also spiking) rules, and are applied as follows. if the neuron σi contains k ≥ c spikes, and a^k ∈ L(E), then the rule E/a^c → a; d ∈ Ri can be applied.
the execution of this rule removes c spikes from σi (thus leaving k − c spikes), and prepares one spike to be delivered to all the neurons σj such that (i, j) ∈ syn. if d = 0, then the spike is immediately emitted, otherwise it is emitted after d computation steps of the system. (observe that, as usually happens in membrane computing, a global clock is assumed, marking the time for the whole system; hence the functioning of the system is synchronized.) if the rule is used in step t and d ≥ 1, then in steps t, t + 1, t + 2, . . . , t + d − 1 the neuron is closed, so that it cannot receive new spikes (if a neuron has a synapse to a closed neuron and tries to send a spike along it, then that particular spike is lost), and cannot fire new rules. in step t + d, the neuron spikes and becomes open again, so that it can receive spikes (which can be used starting with step t + d + 1) and select rules to be fired. rules of type (2) are called forgetting rules, and are applied as follows: if the neuron σi contains exactly s spikes, then the rule a^s → λ from Ri can be used, meaning that all s spikes are removed from σi. in what follows we will use an extended version of forgetting rules, written in the form E/a^s → λ; d. the application of these rules is analogous to that of firing rules. with respect to their basic version, extended forgetting rules are controlled by a regular expression, and may compete against firing rules for their application. it is possible to prove that the use of extended forgetting rules does not modify the computational power of sn p systems. in each time unit, if a neuron σi can use one of its rules, then a rule from Ri must be used. in case two or more rules can be applied in a neuron at a given computation step, only one of them is nondeterministically chosen.
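the firing semantics just described can be made concrete with a small simulation. the sketch below is ours, not from the paper: each rule is a tuple holding its regular expression, the spikes it consumes and produces, and its delay; applicability is tested by matching a^k against the expression, and a fired delayed rule keeps the neuron closed (incoming spikes lost) until it reopens and emits. for simplicity the first applicable rule is chosen, whereas the model chooses nondeterministically among all applicable ones.

```python
import re

class Neuron:
    """a minimal sn p neuron. each rule is a tuple (E, c, p, d): regular expression E,
    c spikes consumed, p spikes produced (p = 1 for a firing rule E/a^c -> a; d,
    p = 0 for a forgetting rule), and delay d."""
    def __init__(self, spikes, rules):
        self.spikes = spikes      # current spike count (the multiset a^k)
        self.rules = rules
        self.closed_for = 0       # remaining steps in which the neuron is closed
        self.pending = 0          # spikes to emit when the neuron reopens

    def step(self, incoming):
        """one synchronized step; returns the spikes sent along outgoing synapses."""
        if self.closed_for > 0:
            self.closed_for -= 1
            if self.closed_for == 0:          # step t + d: spike and reopen
                self.spikes += incoming       # received now, usable from the next step
                out, self.pending = self.pending, 0
                return out
            return 0                          # still closed: incoming spikes are lost
        self.spikes += incoming
        for E, c, p, d in self.rules:
            if re.fullmatch(E, 'a' * self.spikes):   # rule applicable iff a^k in L(E)
                self.spikes -= c
                if d == 0:
                    return p                  # immediate emission
                self.closed_for, self.pending = d, p
                return 0
        return 0                              # no applicable rule
```

for instance, a neuron with the single rule a → a simply forwards each spike one step later, and a neuron with the rule a → a; 1 fires but emits only after the one-step refractory period.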
thus, the rules are used in a sequential manner within each neuron, but the neurons function in parallel with each other. the initial configuration of the system is described by the numbers n1, n2, . . . , nm of spikes present in each neuron, with all neurons being open. during the computation, a configuration is described both by the number of spikes present in each neuron and by the number of steps to wait until it becomes open (this number is zero if the neuron is already open). a computation in a system as above starts in the initial configuration. a positive integer number is given as input to a specified input neuron. usually, the number is specified as the time elapsed between the arrival of two spikes. however, as discussed in [4], other possibilities exist: for example, we can consider the number of spikes initially contained in the input neuron, or the number of spikes read in a given interval of time. all these possibilities are equivalent from the point of view of computational power. to pass from a configuration to another one, for each neuron a rule is chosen among the set of applicable rules, and is executed. generally, a computation may not halt. however, in any case the output of the system is usually considered to be the time elapsed between the arrival of two spikes in a designated output cell. defined in this way, sn p systems compute (partial) functions of the kind f : N → N; they can also indirectly compute functions of the kind f : N^k → N by using a bijection from N^k to N. it is not difficult to show that sn p systems can simulate register machines [4], and hence are universal. if we do not specify an input neuron (hence no input is taken from the environment) then we use sn p systems in the generative mode: we start from the initial configuration, and we look at the output produced by the system.
note that generative sn p systems are inherently nondeterministic, otherwise they would always reproduce the same sequence of computation steps, and hence the same output. dually, we can neglect the output neuron and use sn p systems in the accepting mode: for k ≥ 1, the natural numbers n1, n2, . . . , nk are read as input and, if the computation halts, then the numbers are accepted. also in these cases, sn p systems are universal computation devices [3, 4]. in this paper we consider sn p systems in a different way. we will use them to build the components of a restricted arithmetic logic unit in which one or several natural numbers are provided in binary form, some arithmetic operation is performed, and the result is sent out, also in binary form. the arithmetic operations we will consider are addition, subtraction and multiplication among natural numbers. each number will be provided to the system as a sequence of spikes: at each time step, zero or one spikes will be supplied to an input neuron, depending upon whether the corresponding bit of the number is 0 or 1. also the output neuron will emit the computed number to the environment in binary form, encoded as a spike train. the paper is organised as follows. in section 2 we present an sn p system which can be used to add two natural numbers expressed in binary form, of any length (that is, composed of any number of bits). in section 3 we present an analogous sn p system that computes the difference (subtraction) between two natural numbers. section 4 contains the description of a very simple system that can be used to compare two natural numbers. section 5 first extends the system presented in section 2 to perform the addition of any given set of natural numbers, and then describes a spiking neural p system that performs the multiplication of any natural number, given as input, by a fixed factor embedded into the system.
finally, section 6 concludes the paper and suggests some possible directions for future research.
2 addition
in this section we describe a simple sn p system that performs the addition of two natural numbers. we call such a system the sn p system for 2-addition. it is composed of three neurons (see figure 1): two input neurons and an addition neuron, which is also the output neuron. both input neurons have a synapse to the addition neuron. each input neuron receives one of the numbers to be added as a sequence of spikes that encodes the number in binary form. as explained above, no spike in the sequence at a given time instant means 0 in the corresponding position of the binary expression, whereas one spike means 1. note that the numbers provided as input to the system may be arbitrarily long. the input neurons have only one rule, a → a, which is used to forward the spikes to the addition neuron as soon as they arrive. the addition neuron has three rules: a → a, a^2/a → λ and a^3/a^2 → a, which are used to compute the result.

time step | input1 | input2 | add | output
t = 0     |   0    |   0    |  0  |   0
t = 1     |   0    |   1    |  0  |   0
t = 2     |   0    |   0    |  1  |   0
t = 3     |   1    |   1    |  0  |   1
t = 4     |   1    |   0    |  2  |   0
t = 5     |   1    |   1    |  2  |   0
t = 6     |   0    |   0    |  3  |   0
t = 7     |   0    |   0    |  1  |   1
t = 8     |   0    |   0    |  0  |   1

table 1: number of spikes in each neuron of πadd, and number of spikes sent to the environment, at each time step during the computation of the addition 56 + 42 = 98

figure 1: an sn p system that performs the addition of two natural numbers
theorem 1. the sn p system for 2-addition outputs the addition in binary form of two non-negative integers, provided to the neurons σinput1 and σinput2 in binary form.
proof. at the beginning of the computation, the system does not contain any spike. during the computation, neuron σadd may contain 0, 1, 2 or 3 spikes. we can thus divide the behavior of σadd into the following cases:
• if there are no spikes, no rules are activated and in the next step 0 spikes are sent to the environment. this encodes the operation 0 + 0 = 0.
• if there is 1 spike, then the rule a → a is triggered. the spike is consumed and one spike is sent out. this encodes 1 + 0 = 0 + 1 = 1.
• if there are 2 spikes, then the rule a^2/a → λ is triggered. no spike is sent out and one spike (the carry) remains in the neuron for the next step.
• if there are 3 spikes, then the rule a^3/a^2 → a is applied. one spike is sent to the environment, two of them are consumed and one remains for the next step.
from this behavior, it is easily seen that the output is computed correctly. at the third computation step, the first of the spikes in the spike train that encodes the output in binary form is emitted by σadd. as an example, let us consider the addition 56 + 42 = 98, which in binary form can be written as 111000 + 101010 = 1100010. table 1 reports the number of spikes contained in each neuron of πadd, as well as the number of spikes sent to the environment, at each time step during the computation. the input and the output sequences are written in bold.
3 subtraction
the subtraction sn p system, illustrated in figure 2, consists of ten neurons. the first input number, the minuend, is provided to neuron σinput1 in binary form, encoded as a spike train as described above.
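the addition neuron behaves as a serial binary adder: the spikes left in the neuron after each step play the role of the carry bit. the following sketch is ours, not from the paper; it mirrors the three rules a → a, a^2/a → λ and a^3/a^2 → a, omitting the one-step forwarding delay of the input neurons.

```python
def snp_add(x, y):
    """serial binary addition as performed by the addition neuron: the residue
    left in the neuron after each step plays the role of the carry bit."""
    spikes, out_bits = 0, []
    steps = max(x.bit_length(), y.bit_length()) + 2   # enough steps to flush the carry
    for t in range(steps):
        spikes += ((x >> t) & 1) + ((y >> t) & 1)     # spikes from the two input neurons
        if spikes == 1:
            out_bits.append(1); spikes = 0            # rule a -> a
        elif spikes == 2:
            out_bits.append(0); spikes = 1            # rule a^2/a -> lambda (keep the carry)
        elif spikes == 3:
            out_bits.append(1); spikes = 1            # rule a^3/a^2 -> a
        else:
            out_bits.append(0)                        # no spikes: no rule is applicable
    return sum(b << t for t, b in enumerate(out_bits))
```

on the worked example of table 1 this reproduces 56 + 42 = 98.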
similarly, the second input number (the subtrahend) is supplied in binary form to neuron σinput2. the neurons σaux1, σaux2 and σaux3 act as a multiplier of the minuend: they multiply by 3 the number of spikes provided by neuron σinput1. the system also contains a subsystem composed of neurons σgen, σaux_flow and σaux5, whose purpose is to provide a constant flow of spikes to σsub. all the neurons mentioned up to now have only one rule: a → a. the neurons σauxi, for 1 ≤ i ≤ 5, are connected with neuron σsub; this is both the output neuron and the neuron in which the result of the subtraction is computed, by means of six rules: a → λ, a^2/a → a, a^3/a^2 → λ, a^4 → a, a^5 → λ and a^6/a^5 → a. at the beginning of the computation all neurons are empty except σgen, which contains one spike.
figure 2: an sn p system that performs the subtraction of two natural numbers

time step | input1 | input2 | aux1 | aux2 | aux3 | aux4 | aux5 | sub | output
t = 0     |   0    |   0    |  0   |  0   |  0   |  0   |  0   |  0  |   0
t = 1     |   0    |   1    |  0   |  0   |  0   |  0   |  1   |  0  |   0
t = 2     |   0    |   1    |  0   |  0   |  0   |  1   |  1   |  1  |   0
t = 3     |   1    |   0    |  0   |  0   |  0   |  1   |  1   |  2  |   0
t = 4     |   1    |   0    |  1   |  1   |  1   |  0   |  1   |  3  |   1
t = 5     |   0    |   1    |  1   |  1   |  1   |  0   |  1   |  5  |   0
t = 6     |   1    |   1    |  0   |  0   |  0   |  1   |  1   |  4  |   0
t = 7     |   1    |   0    |  1   |  1   |  1   |  1   |  1   |  2  |   1
t = 8     |   0    |   0    |  1   |  1   |  1   |  0   |  1   |  6  |   1
t = 9     |   0    |   0    |  0   |  0   |  0   |  0   |  1   |  5  |   1
t = 10    |   0    |   0    |  0   |  0   |  0   |  0   |  1   |  1  |   0

table 2: number of spikes in each neuron of πsub, and number of spikes sent to the environment, at each time step during the computation of the subtraction 216 − 102 = 114

theorem 2. the subtraction sn p system outputs the subtraction, in binary form, of two non-negative integer numbers, provided in binary form to neurons σinput1 (the minuend) and σinput2 (the subtrahend). the result can be easily checked by direct inspection of all possible cases. a detailed proof of this theorem — not given here, due to lack of space — can be found in [2]. as an example let us calculate 216 − 102 = 114, which in binary form can be written as 11011000 − 1100110 = 1110010.
table 2 reports the number of spikes that occur in each neuron of πsub at each time step during the computation. note that at each step only one rule is active in the subtraction neuron, and thus the computation is deterministic. the first time step in which the output starts to be emitted by the system is t = 3.
figure 3: an sn p system that compares two natural numbers of any length, expressed in binary form
4 checking equality
checking the equality of two numbers is a different task with respect to computing addition or subtraction. when comparing two numbers the output should be a binary mark, which indicates whether they are equal or not. since an sn p system produces a spike train, we will encode the output as follows: starting from an appropriate instant of time, at each computation step the system will emit a spike if and only if the two corresponding input bits (that were inserted into the system some time steps before) are different. in this way, the system will emit no spike to the environment if the input numbers are equal, and at least one spike if they are different. stated otherwise, if we compare two n-bit numbers then the output will also be an n-bit number: if such an output number is 0, then the input numbers are equal, otherwise they are different. bearing in mind these marks for equality and inequality, the design of the sn p system is trivial. it consists of three neurons: two input neurons, having a → a as their single rule, linked to a third neuron, the checking neuron. this checking neuron is also the output neuron, and it has only two rules: a^2 → λ and a → a. the system is illustrated in figure 3.
5 multiplication
in this section we present a first approach to the problem of computing the multiplication of two binary numbers by means of sn p systems.
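the checking neuron described in section 4 computes, bit by bit, the exclusive or of its two inputs: one spike (bits differ) is forwarded, two spikes (bits agree) are erased, and the numbers are equal iff the output spike train is all zeros. a minimal sketch of this behavior (ours, not from the paper):

```python
def snp_equal(x, y, width=None):
    """compare two numbers the way the checking neuron does: at each step the
    neuron holds 0, 1 or 2 spikes; rule a -> a emits on a difference (1 spike),
    rule a^2 -> lambda erases agreement (2 spikes)."""
    width = width or max(x.bit_length(), y.bit_length(), 1)
    out = []
    for t in range(width):
        spikes = ((x >> t) & 1) + ((y >> t) & 1)
        out.append(1 if spikes == 1 else 0)   # spike iff the bits differ
    return all(b == 0 for b in out)           # equal iff no spike was emitted
```

note that, as in the paper's encoding, a single emitted spike anywhere in the output train already marks the inputs as different.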
the main difference between multiplication and the addition or subtraction operations presented in the previous sections is that in addition and subtraction the n-th digit in the binary representation of the inputs is used exactly once, to compute the n-th digit of the output, and can then be discarded. on the contrary, in the usual algorithm for multiplication the different digits of the inputs are reused several times; hence the design of a device that executes this algorithm needs some kind of memory. other algorithms for multiplication, such as booth's algorithm (see, for example, [1]), also need some kind of memory to store the intermediate results. we propose a family of sn p systems for performing the multiplication of two non-negative integer numbers. in these systems only one number, the multiplicand, is provided as input; the other number, the multiplier, is instead encoded in the structure of the system. the family thus contains one sn p system for each possible multiplier. in the design of our systems, we exploit the following basic fact concerning multiplication by one binary digit: any number remains the same if multiplied by 1, whereas it produces 0 if multiplied by 0. bearing this fact in mind, an sn p system associated to a fixed multiplier only needs to add different copies of the multiplicand, by feeding such copies to an addition device with the appropriate delay. before presenting this design, we extend the 2-addition sn p system from section 2 to an n-addition sn p system.
5.1 adding n numbers
in this section we present a family {πadd(n)}n∈N of sn p systems which allows to add numbers expressed in binary form. precisely, for any integer n ≥ 2 the system πadd(n) computes the sum of n natural numbers. in what follows we will call πadd(n) the sn p system for n-addition. for n = 2 we obtain the sn p system for 2-addition that we described in section 2.
the system πadd(n) consists of n + 1 neurons: n input neurons and one addition neuron, which is also the output neuron. each input neuron has only one rule, a → a, and is linked to the addition neuron. this latter neuron computes the result of the computation by means of n rules r_i, i ∈ {1, . . . , n}, which are defined as follows: r_i ≡ a^i/a^(k+1) → a if i is odd and i = 2k + 1, whereas r_i ≡ a^i/a^k → λ if i is even and i = 2k. as an example, figure 4 shows πadd(5), the sn p system for 5-addition.
figure 4: an sn p system that performs the addition of five natural numbers
theorem 3. the sn p system for n-addition outputs the addition in binary form of n non-negative integer numbers, provided to the neurons σinput1, . . . , σinputn in binary form.
proof. let a_1, . . . , a_n be the n numbers to be added, and let a_i^p a_i^(p−1) . . . a_i^0 be the binary expression of a_i, 1 ≤ i ≤ n, padded with zeros on the left to obtain (p + 1)-digit numbers (where p + 1 is the maximum number of digits among the binary representations of a_1, . . . , a_n). hence we can write a_i = Σ_{k=0}^{p} a_i^k 2^k for all i ∈ {1, 2, . . . , n}. for each i ∈ {1, . . . , n}, let a′_i be the number with binary expression a_i^p . . . a_i^1, i.e., a′_i = Σ_{k=1}^{p} a_i^k 2^(k−1). moreover, let u = Σ_{i=1}^{n} a_i^0, and let k ∈ N and α ∈ {0, 1} be such that u = 2k + α (α = 1 if u is odd and α = 0 if u is even). the addition of a_1, . . . , a_n can then be written as:

Σ_{i=1}^{n} a_i = Σ_{i=1}^{n} Σ_{k=0}^{p} a_i^k 2^k = (Σ_{i=1}^{n} Σ_{k=1}^{p} a_i^k 2^k) + Σ_{i=1}^{n} a_i^0 = 2 (Σ_{i=1}^{n} a′_i + k) + α

according to this formula, if b_r . . . b_0 is the binary expression of Σ_{i=1}^{n} a_i, then b_0 = α and b_r . . . b_1 is the binary expression of Σ_{i=1}^{n} a′_i + k. let us assume now that at time instant t there are i spikes in neuron σadd. these spikes can come from the input neurons, or they may have remained from the previous computation step.
let us compute b_t, the t-th digit of the output, dividing the problem into the following cases.
• let us assume that i is odd and i = 2k + 1. then, according to the previous formula, b_t = 1 and k units should be added to the computation of the next digit. this operation is performed by the rule a^i/a^(k+1) → a. by applying this rule, one spike is sent to the environment (b_t = 1) and k + 1 spikes are consumed, so that i − (k + 1) = 2k + 1 − (k + 1) = k spikes remain.
• let us assume that i is even and i = 2k. then, according to the previous formula, b_t = 0 and k units should be added to the computation of the next digit. this operation is performed by the rule a^i/a^k → λ. by applying this rule, no spike is sent to the environment (b_t = 0) and k spikes are consumed, so that i − k = 2k − k = k spikes remain for the next step.
as an example, let us consider the addition of the numbers 3, 4, 2, 7 and 1, whose binary representations are 11, 100, 10, 111 and 1, respectively. table 3 shows the evolution of the number of spikes in the neurons of the sn p system πadd(5) (illustrated in figure 4), as well as the number of spikes sent to the environment at each computation step, when performing such an addition. the input and the output sequences are written in bold. according to the computation, the result of the addition is 10001 = 17.

time step | input1 | input2 | input3 | input4 | input5 | add | output
t = 0     |   1    |   0    |   0    |   1    |   1    |  0  |   0
t = 1     |   1    |   0    |   1    |   1    |   0    |  3  |   0
t = 2     |   0    |   1    |   0    |   1    |   0    |  4  |   1
t = 3     |   0    |   0    |   0    |   0    |   0    |  4  |   0
t = 4     |   0    |   0    |   0    |   0    |   0    |  2  |   0
t = 5     |   0    |   0    |   0    |   0    |   0    |  1  |   0
t = 6     |   0    |   0    |   0    |   0    |   0    |  0  |   1

table 3: number of spikes in each neuron of πadd(5) (the system illustrated in figure 4) and number of spikes sent to the environment, at each time step during the computation of the addition 3 + 4 + 2 + 7 + 1 = 17

5.2 multiplication by a fixed multiplier
we now describe a family {πmult(n)}n∈N of sn p systems, one for each natural number n, that operate as multiplier devices. precisely, the system πmult(n) takes as input a number in binary form, and outputs the input multiplied by n, also expressed in binary form. given a natural number n, the sn p system πmult(n) is defined as follows. it consists of one input neuron, σinput, linked to k neurons σaux1, . . . , σauxk, where k is the number of occurrences of the digit 1 in the binary representation of n. for each i ∈ {1, . . . , k}, neuron σauxi is connected with a new neuron, which is connected with another one, and so on: this sequence of neurons forms a chain of length j_i, where j_i is the position of the corresponding digit 1 in the binary representation of n (the first digit corresponds to 2^0, the second one corresponds to 2^1, and so on). all the last neurons of the k chains are connected with a final neuron σadd, which is the same as the output neuron of the k-addition sn p system πadd(k) described above. this neuron has the rules for the addition of k natural numbers. all the other neurons have only the rule a → a. for example, let us consider n = 26, whose binary representation is 11010. such a representation has three digits equal to 1, at positions 2, 4 and 5. the system πmult(26), illustrated in figure 5, has 13 neurons: σinput, σadd, and three chains of neurons associated with the three digits equal to 1: a chain of two neurons corresponding to the 1 in the second position (corresponding to the power 2^1); a chain of four neurons corresponding to the 1 in the fourth position (corresponding to the power 2^3); and a chain of five neurons corresponding to the 1 in the fifth position (corresponding to the power 2^4). the last neurons of these three chains are linked to neuron σadd, which is also the output neuron.
the rules of this neuron are a → a, a^2/a → λ and a^3/a^2 → a, which are the same as in the addition neuron of the 3-addition sn p system πadd(3) described in the previous section.
figure 5: an sn p system that computes the product of the natural number given as input (in binary form) and the fixed multiplier 26 (binary 11010), encoded in the structure of the system
theorem 4. the sn p system πmult(n) built as above takes as input a number m in binary form and outputs the result of the multiplication m · n in binary form.
proof. since we have already proved that the neuron σadd performs the addition of several numbers in binary form, it only remains to transform the multiplication m · n (where n is a fixed parameter) into an appropriate addition. to this aim, let n = Σ_{j=0}^{q} n_j 2^j. then we can write:

m · n = m · (Σ_{j=0}^{q} n_j 2^j) = Σ_{j=0}^{q} (m · 2^j) n_j = Σ_{0 ≤ j ≤ q ∧ n_j = 1} (m · 2^j)

according to this expression, m · n can be calculated as the addition of as many copies of m as the number of digits n_j equal to 1 that appear in the binary representation of n. such copies have to be padded with j zeros on the right (that is, they have to be multiplied by 2^j), to take into account the correct weight of n_j.

time step | input | aux_a | aux_b | aux_c | add | out
t = 0     |   1   |   0   |   0   |   0   |  0  |  0
t = 1     |   0   |   0   |   0   |   0   |  0  |  0
t = 2     |   1   |   1   |   0   |   0   |  0  |  0
t = 3     |   1   |   0   |   0   |   0   |  1  |  0
t = 4     |   1   |   1   |   1   |   0   |  0  |  1
t = 5     |   0   |   1   |   0   |   1   |  2  |  0
t = 6     |   0   |   1   |   1   |   0   |  3  |  0
t = 7     |   0   |   0   |   1   |   1   |  3  |  1
t = 8     |   0   |   0   |   1   |   1   |  3  |  1
t = 9     |   0   |   0   |   0   |   1   |  3  |  1
t = 10    |   0   |   0   |   0   |   0   |  2  |  1
t = 11    |   0   |   0   |   0   |   0   |  1  |  0
t = 12    |   0   |   0   |   0   |   0   |  0  |  1

table 4: number of spikes in the last neuron of each of the three chains (here labelled aux_a, aux_b, aux_c) and in σadd of πmult(26) (the system illustrated in figure 5), and number of spikes sent to the environment, at each time step during the computation of the multiplication 29 · 26 = 754
hence, if k = Σ_{j=0}^{q} n_j, then to compute m · n it suffices to provide k copies of m — each shifted in time by a number of steps that corresponds to the weight of a bit n_j equal to 1 — to a neuron that computes the addition of k natural numbers.
6 conclusion and future work
in this paper we have presented some simple sn p systems that perform the following operations: addition, multiple addition, comparison, and multiplication by a fixed factor. all the numbers given as inputs to these systems are expressed in binary form, encoded as a spike train in which at each time instant the presence of a spike denotes 1, and the absence of a spike denotes 0. the outputs of the computations are also expelled to the environment in the same form. the motivation for this work lies in the fact that we would like to implement a cpu using only spiking neural p systems. to this aim, the first step is to design the arithmetic logic unit of the cpu, and hence to study a compact way to perform arithmetical and logical operations by means of spiking neural p systems. ours is certainly not the only possible way to approach the problem; two other possibilities are: (1) implementing the cpu as a network composed of and/or/not boolean gates, and (2) simulating the cpu by means of register machines. in both cases, using techniques widely known in the literature, one could design an sn p system that simulates the boolean network (resp., the register machine), thus implementing the cpu. in any case, an interesting extension of the present work is to try to design an sn p system for multiplication in which both the numbers m and n to be multiplied are supplied as inputs. and, of course, we would also need a system to compute the integer division of two natural numbers; probably, this last system is the most difficult to design.
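the constructions of sections 5.1 and 5.2 can be checked end to end with a short simulation. the sketch below is ours: each 1-digit of the multiplier n at position j contributes a copy of the multiplicand delayed by j steps, and the arriving spikes are reduced with the n-addition rules (for i spikes, emit i mod 2 and keep ⌊i/2⌋ as the carry, exactly as a^i/a^(k+1) → a for i = 2k + 1 and a^i/a^k → λ for i = 2k).

```python
def snp_multiply(m, n):
    """multiply m by the fixed multiplier n: each 1-digit of n (weight 2^j)
    contributes a copy of m delayed by j steps; the addition neuron combines
    the resulting spike trains with the n-addition rules."""
    ones = [j for j in range(n.bit_length()) if (n >> j) & 1]
    last = (m.bit_length() - 1) + max(ones, default=0)  # last step with input spikes
    t, spikes, out_bits = 0, 0, []
    while t <= last or spikes:
        # spikes arriving from the last neuron of each delay chain
        spikes += sum((m >> (t - j)) & 1 for j in ones if t >= j)
        if spikes % 2 == 1:
            out_bits.append(1); spikes = (spikes - 1) // 2   # a^(2k+1)/a^(k+1) -> a
        else:
            out_bits.append(0); spikes //= 2                 # a^(2k)/a^k -> lambda
        t += 1
    return sum(b << i for i, b in enumerate(out_bits))
```

on the worked example of table 4 this reproduces 29 · 26 = 754; setting n to a power of two simply shifts the input, as expected.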
acknowledgement
the first author wishes to acknowledge the support of the project tin2006–13425 of ministerio de educación y ciencia of spain, cofinanced by feder funds, and the support of the project of excellence with investigador de reconocida valía of the junta de andalucía, grant p08-tic-04200. the second author was partially supported by the miur project "mathematical aspects and emerging applications of automata and formal languages" (2007).
bibliography
[1] m.j. flynn. advanced computer arithmetic design. john wiley publisher, 2001.
[2] m.a. gutiérrez-naranjo and a. leporati. performing arithmetic operations with spiking neural p systems. in proc. of the seventh brainstorming week on membrane computing, vol. i, fénix editora, seville, spain, 2009, 181–198. available at http://www.gcn.us.es.
[3] m. ionescu, gh. păun and t. yokomori. spiking neural p systems. fundamenta informaticae, 71(2-3):279–308, 2006.
[4] m. ionescu, a. păun, gh. păun, m.j. pérez-jiménez. computing with spiking neural p systems: traces and small universal systems. in dna computing, 12th international meeting on dna computing (dna12), revised selected papers, lncs 4287, springer, 2006, 1–16.
[5] gh. păun. computing with membranes. journal of computer and system sciences, 61:108–143, 2000. see also turku centre for computer science — tucs report no. 208, 1998.
[6] gh. păun. membrane computing. an introduction. springer–verlag, 2002.
[7] the p systems web page: http://ppage.psystems.eu/
miguel a. gutiérrez-naranjo is an associate professor at the department of computer science and artificial intelligence of the university of seville in spain. he obtained his doctoral degree in mathematics in 2002. his main research area is natural computing, with a special interest in membrane computing. alberto leporati obtained a ph.d. in computer science from the university of milano (italy) in 2002.
since 2004 he has been an assistant professor at the university of milano – bicocca. his research interests are in membrane computing, theoretical computer science and computational complexity.
international journal of computers, communications & control vol. iii (2008), no. 1, pp. 80-89
adaptive compensation of friction forces with differential filter
kouichi mitsunaga, takami matsuo
abstract: in this paper, we design an adaptive controller to compensate for the nonlinear friction model when the output is the position. first, we present an adaptive differential filter to estimate the velocity. secondly, the dynamic friction force is compensated by a fuzzy adaptive controller with position measurements. finally, a simulation result for the proposed controller is demonstrated.
keywords: nonlinear friction, adaptive controller, fuzzy basis function expansion, adaptive differential filter.
1 introduction
friction is one of the greatest obstacles in high-precision positioning systems. since it can cause steady-state and tracking errors, its influence on the response of such systems must be considered seriously ([10]). many friction models have been proposed that differ in the friction effects that are modeled in a lubricated contact. these models are divided into two categories: kinetic and dynamic friction models. kinetic friction models take into account friction effects such as viscous friction, coulomb friction, and the stribeck effect. the other category comprises dynamic friction models that embody the natural mechanism of friction generation, such as the lugre model ([2]). the coefficients of the various friction-related effects are usually very difficult to measure. a number of methods for friction estimation and compensation have been proposed for these models: adaptive control, joint torque control, learning control, variable structure control, and so on.
Friedland et al. [5] proposed a reduced-order nonlinear observer to estimate the velocity-dependent coefficient of the classical nonlinear friction model using velocity measurements. Huang et al. [9] presented an adaptive radial basis function observer to compensate for the effects of the dynamic friction model with full-state measurements. Tomei [15] considered the tracking problem for robot manipulators with unknown parameters and the dynamic LuGre model using full-state information. Ge et al. [6] presented adaptive controllers combining neural-network parameterization, a dual observer for state estimation, and adaptive control techniques based on the dynamic LuGre friction model. Sato et al. [14] proposed an adaptive friction compensation method with an H∞ performance using a neural-network approximation that is equivalent to the radial basis function or fuzzy basis function approximation; they used the neural-network approximation to parameterize the nonlinear characteristics of the dynamic LuGre model, and introduced a sliding-mode type error function that requires full-state measurements. Canudas de Wit et al. [3] designed an observer-based adaptive friction compensation scheme for systems with generalized position/velocity static characteristics, based on full-state measurements. Their controller guarantees global asymptotic stability of the tracking error while preserving boundedness of all internal signals; the nonlinear friction is approximated either by a linear span of a set of continuous known functions or by a neural network with bounded basis functions. Putra et al. [12] proposed an observer-based friction compensation for a class of kinetic friction models with known system parameters, based on the strictly positive real condition. In the case of partial-state measurements, an adaptive estimator that does not require strict positive realness of the plant is needed.
Ray et al. [13] presented a non-model-based friction estimation method using extended Kalman-Bucy filtering; the filter estimates the friction force with full-state measurements. For the classical nonlinear friction model without velocity measurements, Xia et al. [17] employed a velocity observer based on a state-space nonlinear friction model and designed an adaptive controller that achieves semi-global asymptotic stability; that velocity observer is model-based and requires the plant parameters to estimate the velocity. Park et al. [11] proposed a non-model-based differential filter of a nonlinear function based on adaptive control theory. Their differential filter was applied to estimate the time-derivative of a nonlinear function of the intensity of each pixel, in order to detect moving objects within a scene acquired by a stationary camera. In friction estimation, such a differential filter can be applied to estimate the velocity from position measurements.

In this paper, we design a friction compensator that uses only position measurements. To begin with, an adaptive differential filter, a non-model-based adaptive algorithm for estimating the derivative of a signal, is designed to estimate the velocity. Next, the velocity signal in the friction compensator proposed by Canudas de Wit et al. [3] is replaced by the estimate from the adaptive differential filter. Finally, a simulation of the proposed compensator is presented; it shows that the compensator is robust against additive noise.

2 Adaptive Differential Filter

Velocity information is essential for estimating and compensating friction forces. Most previous papers assume that the velocity is measurable. If velocity information is not available, the velocity signal has to be estimated.
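To preview the filter derived in this section in numerical form, the sketch below (ours, not the authors' code) implements the observer and update laws of Eqs. (9)-(11), i.e. the special case f(y) = y for which the regressor is ξ(t) = 1. The gains k = 5 and γ = 50 and the test signal are chosen here purely for illustration; they are not the values used in the paper's simulation.

```python
import math

def adaptive_differential_filter(y_samples, dt, k=5.0, gamma=50.0):
    """Estimate the derivative of a sampled signal with the adaptive observer
        yh' = -k*e + th - eh*sgn(e),   e = yh - y,
    and the update laws
        th' = -gamma*e,   eh' = |e|.
    Returns the list of derivative estimates th(t)."""
    yh, th, eh = y_samples[0], 0.0, 0.0
    estimates = []
    for y in y_samples:
        e = yh - y
        yh += dt * (-k * e + th - eh * math.copysign(1.0, e))
        th += dt * (-gamma * e)
        eh += dt * abs(e)
        estimates.append(th)
    return estimates

# Illustration: a ramp y(t) = 0.5 + 2t, whose derivative is the constant 2.
dt = 1e-3
ys = [0.5 + 2.0 * i * dt for i in range(20000)]   # 20 seconds of samples
th = adaptive_differential_filter(ys, dt)
# th[-1] converges close to the true slope 2.
```

Since the true derivative is constant here, both e_t and its derivative vanish asymptotically, which is the situation in which Lemma 1 below guarantees that the estimate converges to the true derivative.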
Park et al. [11] proposed an adaptive estimator of the time-derivative of an output, called the adaptive differential filter. Their estimator suffers from estimation errors in the case of fast time-varying signals. In this paper, we present another adaptive differential filter that is applicable to fast time-varying signals.

Let y(t) be a measured signal at time t and f(y(t)) a nonlinear function of it. We define \theta_t(t) as the time-derivative of the nonlinear function f(y(t)), i.e.

\theta_t(t) = \frac{d f(y(t))}{dt} = \frac{d f(y)}{dy}\,\frac{d y(t)}{dt}. \quad (1)

If \frac{d f(y)}{dy} has an inverse \left(\frac{d f(y)}{dy}\right)^{-1}, we have

\dot{y}(t) = \theta_t(t)\,\xi(t), \quad (2)

\xi(t) = \left(\frac{d f(y)}{dy}\right)^{-1}. \quad (3)

The problem is defined as follows.

Problem 1. Design a differential filter that estimates the derivative \theta_t(t) of the nonlinear function f(y(t)) using the available signal y(t), without any knowledge of its dynamics.

If the signal \xi(t) is available, Problem 1 is equivalent to the estimation of the time-varying parameter \theta_t in Eq. (2) from the available signals y(t) and \xi(t). It is assumed that the time-varying parameter in (2) satisfies the inequality

|\theta_t(t) - \theta_0| \le \varepsilon_0,

where \theta_0 is an unknown constant and \varepsilon_0 is a known constant. Eq. (2) can be rewritten as

\dot{y} = \theta_0\,\xi(t) + \varepsilon_t(t)\,\xi(t), \quad (4)

where \varepsilon_t(t) = \theta_t(t) - \theta_0. We give the adaptive observer

\dot{\hat{y}}_t = -k(\hat{y}_t - y) + \hat{\theta}_t(t)\,\xi(t) - \hat{\varepsilon}_t(t)\,\mathrm{sgn}(\hat{y}_t - y)\,|\xi(t)|, \quad (5)

where k > 0. Defining the observer error as e_t(t) = \hat{y}_t(t) - y(t), we obtain the error system

\dot{e}_t = -k e_t + \tilde{\theta}_t(t)\,\xi(t) - \hat{\varepsilon}_t(t)\,\mathrm{sgn}(e_t)\,|\xi(t)| - \varepsilon_t(t)\,\xi(t), \quad (6)

where \tilde{\theta}_t(t) = \hat{\theta}_t(t) - \theta_0. The parameter update laws are selected as

\dot{\tilde{\theta}}_t(t) = \dot{\hat{\theta}}_t(t) = -\gamma\, e_t(t)\,\xi(t), \quad (7)

\dot{\hat{\varepsilon}}_t(t) = |e_t(t)|\,|\xi(t)|, \quad (8)

where \gamma > 0. We can prove the following lemma.

Lemma 1. Consider the error system (6). The parameter update laws guarantee the stability of the origin of the error system, in the sense that e_t(t) \to 0 as t \to \infty.
Moreover, if \lim_{t\to\infty} \dot{e}_t(t) = 0, then \lim_{t\to\infty} \tilde{\theta}_{t\varepsilon}(t) = 0, where \tilde{\theta}_{t\varepsilon}(t) = \hat{\theta}_t(t) - \theta_t(t).

Proof: The whole system can be written as

\dot{e}_t = -k e_t + \tilde{\theta}_t(t)\,\xi(t) - \hat{\varepsilon}_t(t)\,\mathrm{sgn}(e_t)\,|\xi(t)| - \varepsilon_t(t)\,\xi(t),
\dot{\tilde{\theta}}_t = -\gamma\, e_t\, \xi(t),
\dot{\hat{\varepsilon}}_t = |e_t|\,|\xi(t)|.

Define the Lyapunov-like function

V_1 = \frac{1}{2}\left( e_t^2(t) + \frac{1}{\gamma}\,\tilde{\theta}_t^2(t) + \tilde{\varepsilon}_t^2(t) \right),

where \tilde{\varepsilon}_t(t) = \hat{\varepsilon}_t(t) - \varepsilon_0. Its time derivative is

\dot{V}_1 = e_t \dot{e}_t + \frac{1}{\gamma}\tilde{\theta}_t \dot{\tilde{\theta}}_t + \tilde{\varepsilon}_t \dot{\hat{\varepsilon}}_t
= e_t\left( -k e_t + \tilde{\theta}_t(t)\xi(t) - \hat{\varepsilon}_t(t)\mathrm{sgn}(e_t)|\xi(t)| - \varepsilon_t(t)\xi(t) \right) + \frac{1}{\gamma}\tilde{\theta}_t \dot{\tilde{\theta}}_t + \tilde{\varepsilon}_t \dot{\hat{\varepsilon}}_t
= -k e_t^2 + \tilde{\theta}_t\left( e_t \xi(t) + \frac{1}{\gamma}\dot{\tilde{\theta}}_t \right) - e_t \hat{\varepsilon}_t \mathrm{sgn}(e_t)|\xi(t)| - e_t \varepsilon_t \xi(t) + \hat{\varepsilon}_t \dot{\hat{\varepsilon}}_t - \varepsilon_0 \dot{\hat{\varepsilon}}_t
= -k e_t^2 - \left( e_t \varepsilon_t \xi(t) + e_t \varepsilon_0\, \mathrm{sgn}(e_t)\,|\xi(t)| \right)
= -k e_t^2 - \left( e_t \varepsilon_t \xi(t) + \varepsilon_0 |e_t|\,|\xi(t)| \right)
\le -k e_t^2 + |e_t|\,|\varepsilon_t|\,|\xi(t)| - \varepsilon_0 |e_t|\,|\xi(t)|
\le -k e_t^2 \le 0.

Hence e_t \in L_2 \cap L_\infty, \tilde{\theta}_t \in L_\infty, \tilde{\varepsilon}_t \in L_\infty. Since e_t, \tilde{\theta}_t and \tilde{\varepsilon}_t are bounded, \dot{e}_t \in L_\infty. From Barbalat's lemma, we conclude that \lim_{t\to\infty} e_t(t) = 0. Moreover, setting

u_t = \tilde{\theta}_t(t)\,\xi(t) - \hat{\varepsilon}_t(t)\,\mathrm{sgn}(e_t)\,|\xi(t)| - \varepsilon_t(t)\,\xi(t),

the error dynamics can be expressed as the first-order system \dot{e}_t(t) = -k e_t(t) + u_t(t). If \lim_{t\to\infty} \dot{e}_t(t) = 0, then

\lim_{t\to\infty}\left( \hat{\theta}_t(t) - \theta_0 - \varepsilon_t(t) \right) = \lim_{t\to\infty}\left( \hat{\theta}_t(t) - \theta_t(t) \right) = 0.

We obtain an estimator for the derivative of the signal y(t) by selecting the nonlinear function f(y(t)) as the linear function f(y(t)) = y(t); in this case the regressor signal \xi(t) is equal to 1. The adaptive observer and the update laws become

\dot{\hat{y}}_t = -k(\hat{y}_t - y) + \hat{\theta}_t(t) - \hat{\varepsilon}_t(t)\,\mathrm{sgn}(\hat{y}_t - y), \quad (9)

\dot{\hat{\theta}}_t(t) = -\gamma\, e_t(t), \qquad \dot{\hat{\varepsilon}}_t(t) = |e_t(t)|, \quad (10)

where e_t(t) = \hat{y}_t(t) - y(t). Thus, we get the estimate of the derivative of the signal y(t) as

\hat{\dot{y}} = \hat{\theta}_t = -\int_0^t \gamma\, e_t(\tau)\, d\tau. \quad (11)

We call this estimator (11) the adaptive differential filter.

3 Nonlinear Friction Model

Canudas de Wit et al.
present a new dynamic friction model that captures dynamic friction effects (the Dahl effect, frictional memory, stick-slip motion) as well as steady-state friction effects, including the Stribeck effect ([2, 8]). The interface between two surfaces is modeled as contact between sets of bristles. If z denotes the average bristle deflection and \dot{y} the relative velocity between the two surfaces, the friction force F_f can be expressed as

F_f = \sigma_0 z + \sigma_1 \dot{z} + \sigma_2 \dot{y},
\dot{z} = -\alpha(\dot{y})\,|\dot{y}|\,z + \dot{y},
\alpha(\dot{y}) = \frac{\sigma_0}{f_c + (f_s - f_c)\exp\{-(\dot{y}/\dot{y}_s)^2\}},

where \sigma_0 is the bristle stiffness, \sigma_1 the damping coefficient for slip, \sigma_2 the viscous friction coefficient, f_c the Coulomb friction level, f_s the level of the stiction force, and \dot{y}_s the Stribeck velocity. The system under discussion is

m\ddot{y}(t) + F_f(t) = u(t), \quad (12)

where m is known and F_f is unknown. As in Canudas de Wit et al. [3], the nonlinear function \alpha(\dot{y}) is approximated by a fuzzy basis function expansion (FBFE) as

\alpha(\dot{y}) = k^T \zeta(\dot{y}) + \varepsilon_f, \quad (13)

k = [\,k_1, \cdots, k_m\,]^T, \quad (14)

\zeta(\dot{y}) = [\,\zeta_1(\dot{y}), \cdots, \zeta_m(\dot{y})\,]^T, \quad (15)

where \varepsilon_f is the approximation error and \zeta_j is defined as

\zeta_j(\dot{y}) = \frac{\mu_{A_j}(\dot{y}(t))}{\sum_{j=1}^m \mu_{A_j}(\dot{y}(t))}.

The membership functions are selected as Gaussian functions:

\mu_{A_j}(v) = \exp\left( -\frac{(v - \bar{v}_j)^2}{2\sigma_j^2} \right), \qquad \sigma_j = \sigma, \quad j = 1, \cdots, m.

The approximation error is assumed to be bounded and sufficiently small.

4 Controller Design

Consider the tracking control problem for system (12) under the assumptions that (i) the parameters m, \sigma_0, \sigma_1, \sigma_2 are known and the nonlinear function \alpha(\dot{y}) is unknown, and (ii) the desired trajectory y_d is sufficiently smooth and bounded. The tracking error and its derivatives are defined as e = y_d - y, \dot{e} = \dot{y}_d - \dot{y}, \ddot{e} = \ddot{y}_d - \ddot{y}.

4.1 Adaptive Controller with Full-State Measurements

We review the adaptive controller proposed by Canudas de Wit et al. [3] with full-state measurements.
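Before the observer is introduced, the fuzzy basis function expansion of Eqs. (13)-(15) can be made concrete. The short sketch below (ours, not the authors' code) evaluates the normalized Gaussian basis using the five centers and the width σ = 0.5 that appear later in the simulation example of Section 4.3; by construction the basis is a partition of unity.

```python
import math

def fuzzy_basis(v, centers, sigma):
    """Normalized Gaussian fuzzy basis functions zeta_j(v) (Eq. (15))."""
    mu = [math.exp(-(v - c) ** 2 / (2 * sigma ** 2)) for c in centers]
    total = sum(mu)
    return [m / total for m in mu]

# Centers and width taken from the simulation example in Section 4.3.
centers = [-1.0, -0.5, 0.0, 0.5, 1.0]
zeta = fuzzy_basis(0.3, centers, sigma=0.5)
# The components sum to 1, and the largest weight belongs to the
# membership function whose center is nearest to v = 0.3.
```

With this basis, the unknown α(ẏ) of Eq. (13) is a linear combination k^T ζ(ẏ) whose weight vector k is what the adaptive law later estimates.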
They presented the following internal state observer for the average bristle deflection:

\dot{\hat{z}} = \dot{y} - \hat{k}^T \zeta(\dot{y})\,|\dot{y}|\,\hat{z} + k_0 \tilde{z} + k_1 e, \quad (16)

\tilde{z} = \frac{m}{\sigma_1}\dot{e} + \frac{k_d - m\rho}{\sigma_1} e + e_f, \quad (17)

\dot{e}_f = -\rho\, e_f + c\rho\, e, \quad (18)

where \hat{z} is the estimate of the average bristle deflection, k_0 > 0, k_1 > 0, and c = \frac{k_p - \rho k_d + m\rho^2}{\sigma_0}. Defining the estimation error of the average bristle deflection as

\tilde{z} = z - \hat{z}, \quad (19)

we have the following estimation error equation:

\dot{\tilde{z}} = -\alpha(\dot{y})\,|\dot{y}|\,\tilde{z} - \tilde{k}^T \zeta(\dot{y})\,|\dot{y}|\,\hat{z} - k_0 \tilde{z} - k_1 e, \quad (20)

where \tilde{k} = k - \hat{k}. The controller is given as follows [3]:

u = m\ddot{y}_d + h(p)\,e + \hat{F}_f, \quad (21)

\hat{F}_f = \sigma_0 \hat{z} + \sigma_1 \dot{\hat{z}} + \sigma_2 \dot{y}, \quad (22)

where h(p) is the following differentiator:

h(p) = k_d\, p + k_p, \qquad p = \frac{d}{dt}. \quad (23)

Moreover, we obtain the tracking error equation

m\ddot{e} + k_d \dot{e} + k_p e = \sigma_1 \dot{\tilde{z}} + \sigma_0 \tilde{z}. \quad (24)

We use the same adaptive update law as in [3]:

\dot{\hat{k}} = -\Gamma\, \frac{1}{k_1}\, \zeta(\dot{y})\,|\dot{y}|\,\tilde{z}\,\hat{z}. \quad (25)

4.2 Adaptive Controller with Position Measurements

In the position-measurement case, we use the estimate \hat{\dot{y}} of the adaptive differential filter instead of the velocity signal \dot{y}. The internal state observer for the average bristle deflection becomes

\dot{\hat{z}} = \hat{\dot{y}} - \hat{k}^T \zeta(\hat{\dot{y}})\,|\hat{\dot{y}}|\,\hat{z} + k_0 \tilde{z} + k_1 e, \quad (26)

\tilde{z} = \frac{m}{\sigma_1}\hat{\dot{e}} + \frac{k_d - m\rho}{\sigma_1} e + e_f, \quad (27)

\dot{e}_f = -\rho\, e_f + c\rho\, e, \quad (28)

where \hat{\dot{y}} is the estimate of the velocity by the adaptive differential filter and \hat{\dot{e}} = \dot{y}_d - \hat{\dot{y}}. Defining the estimation error of the adaptive differential filter as \varepsilon = \hat{\dot{y}} - \dot{y}, we have \hat{\dot{e}} = \dot{e} - \varepsilon, and the estimation error equation becomes

\dot{\tilde{z}} = -\alpha(\dot{y})\,|\dot{y}|\,\tilde{z} - \tilde{k}^T \zeta(\hat{\dot{y}})\,|\hat{\dot{y}}|\,\hat{z} - k_0 \tilde{z} - k_1 e - k^T \varepsilon_0 \hat{z} - \varepsilon, \quad (29)

where \tilde{k} = k - \hat{k} and \varepsilon_0 is caused by the estimation error of the adaptive differential filter. Moreover, we obtain the tracking error equation

m\ddot{e} + k_d \dot{e} + k_p e = \sigma_1 \dot{\tilde{z}} + \sigma_0 \tilde{z} + (k_d - \sigma_2)\,\varepsilon. \quad (30)

The controller is given as follows:

u = m\ddot{y}_d + k_d \hat{\dot{e}} + k_p e + \hat{F}_f, \quad (31)

\hat{F}_f = \sigma_0 \hat{z} + \sigma_1 \dot{\hat{z}} + \sigma_2 \hat{\dot{y}}.
(32)

Replacing the velocity signal by its estimate from the adaptive differential filter, the adaptive update law becomes

\dot{\hat{k}} = -\Gamma\, \frac{1}{k_1}\, \zeta(\hat{\dot{y}})\,|\hat{\dot{y}}|\,\tilde{z}\,\hat{z}. \quad (33)

4.3 Simulation Example

The example uses the following LuGre model: m = 10, \sigma_0 = 5, \sigma_1 = 2\sqrt{5}, \sigma_2 = 0.01, f_c = 1, f_s = 1.5, v_s = 0.1. The initial conditions are selected as y(0) = 0, \dot{y}(0) = 0.2, z(0) = 1. The desired reference signals y_d, \dot{y}_d, \ddot{y}_d are

y_d = 3\sin 0.02\pi t, \qquad \dot{y}_d = 0.06\pi \cos 0.02\pi t, \qquad \ddot{y}_d = -0.0012\pi^2 \sin 0.02\pi t.

The parameters of the fuzzy basis functions are selected as \sigma_j = 0.5 (j = 1, \cdots, 5) and \bar{v}_1 = -1, \bar{v}_2 = -0.5, \bar{v}_3 = 0, \bar{v}_4 = 0.5, \bar{v}_5 = 1. The parameters of the adaptive differential filter are k = 1, \gamma = 0.5. The simulation parameters of the compensator are k_d = k_p = 1, k_0 = 1, k_1 = 0.1, and \Gamma = 50\, I_5, where I_5 is the 5×5 identity matrix.

Figure 1: The output y (solid line) and its desired reference y_d (dotted line) in the time range [0, 300]. Left: noise-free; right: additive noise N(0, 10^{-4}).

Figure 2: The output y (solid line) and its desired reference y_d (dotted line) in the time range [0, 100]. Left: noise-free; right: additive noise N(0, 10^{-4}).

The simulation is carried out in the noise-free case and in the case where the output y is corrupted by additive noise N(0, 10^{-4}). Figure 1 shows the output y (solid line) and its desired reference y_d (dotted line) in the time range [0, 300] in both cases.
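As a side check on the LuGre parameters listed above, the model of Section 3 settles, at constant velocity v, onto the Stribeck curve F_{ss}(v) = sgn(v)·g(v) + σ₂v, where g(v) = f_c + (f_s − f_c)exp(−(v/v_s)²), since ż = 0 implies z = g(v)sgn(v)/σ₀. The short Euler simulation below (a sketch under the paper's parameter values, not the authors' code) verifies this numerically.

```python
import math

# LuGre parameters from the simulation example (Section 4.3).
sig0, sig1, sig2 = 5.0, 2.0 * math.sqrt(5.0), 0.01
fc, fs, vs = 1.0, 1.5, 0.1

def g(v):
    """Stribeck function g(v) = fc + (fs - fc) * exp(-(v / vs)**2)."""
    return fc + (fs - fc) * math.exp(-(v / vs) ** 2)

def friction_at_constant_velocity(v, dt=1e-3, t_end=60.0):
    """Euler-integrate the bristle state z' = v - sig0*|v|*z/g(v)
    and return the friction force F = sig0*z + sig1*z' + sig2*v
    after the transient has died out."""
    z = 0.0
    for _ in range(int(t_end / dt)):
        z += dt * (v - sig0 * abs(v) * z / g(v))
    zdot = v - sig0 * abs(v) * z / g(v)
    return sig0 * z + sig1 * zdot + sig2 * v

v = 0.05                                   # a velocity below the Stribeck velocity region
f_ss = friction_at_constant_velocity(v)
expected = math.copysign(g(v), v) + sig2 * v   # analytic steady-state value
```

At v = 0.05 the force exceeds the Coulomb level f_c = 1 because of stiction, which is exactly the low-velocity behavior the compensator has to cancel.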
Figure 2 shows the output y (solid line) and its desired reference y_d (dotted line) in the time range [0, 100] in both cases. Figure 3 shows the velocity \dot{y} (solid line) and its estimate \hat{\dot{y}} by the adaptive differential filter (dotted line) in both cases. Figure 4 shows the input u in both cases. Figure 5 shows the friction force (solid line) and its estimate (dotted line). These figures show that the proposed friction compensator is robust against the additive noise.

5 Conclusion

We proposed the adaptive differential filter and applied it to friction compensation. Specifically, the velocity information in the friction compensator proposed by Canudas de Wit et al. [3] is replaced by the estimate from the adaptive differential filter. The simulation results show that the proposed friction compensator is robust against additive noise.

Figure 3: The velocity \dot{y} (solid line) and its estimate \hat{\dot{y}} (dotted line). Left: noise-free; right: additive noise N(0, 10^{-4}).

Figure 4: The input signal. Left: noise-free; right: additive noise N(0, 10^{-4}).

Figure 5: The friction force (solid line) and its estimate (dotted line). Left: noise-free; right: additive noise N(0, 10^{-4}).

Bibliography

[1] B. Armstrong-Hélouvry, P. Dupont, C.
Canudas de Wit, "A survey of models, analysis tools and compensation methods for the control of machines with friction", Automatica, Vol. 30, pp. 1083-1138, 1994.

[2] C. Canudas de Wit, H. Olsson, K.J. Åström, and P. Lischinsky, "A new model for control of systems with friction", IEEE Transactions on Automatic Control, Vol. 40, No. 3, pp. 419-425, 1995.

[3] C. Canudas de Wit and S.S. Ge, "Adaptive friction compensation for systems with generalized velocity/position friction dependency", Proc. of the 36th IEEE Conference on Decision and Control, pp. 2465-2470, 1997.

[4] D. V.-Díaz and Y. Tang, "Adaptive robust control of nonlinear systems", IEEE Trans. on Systems, Man, and Cybernetics, Part B, Vol. 34, No. 3, pp. 1596-1601, 2004.

[5] B. Friedland and Y.-J. Park, "On adaptive friction compensation", IEEE Transactions on Automatic Control, Vol. 37, No. 10, pp. 1609-1612, 1992.

[6] S.S. Ge, T.H. Lee, and J. Wang, "Adaptive NN control of dynamic systems with unknown dynamic friction", Proc. of the 39th IEEE Conference on Decision and Control, pp. 1760-1765, 2000.

[7] H. Han and C.-Y. Su, "Robust fuzzy control of nonlinear systems using shape-adaptive radial basis functions", Fuzzy Sets and Systems, Vol. 125, pp. 23-38, 2002.

[8] R.M. Hirschorn and G. Miller, "Control of nonlinear systems with friction", IEEE Trans. on Control Systems Technology, Vol. 7, No. 5, pp. 588-595, 1999.

[9] S.N. Huang, K.K. Tan, T.H. Lee, "Adaptive motion control using neural network approximations", Automatica, Vol. 38, pp. 227-233, 2002.

[10] E. Papadopoulos and G. Chasparis, "Analysis and model-based control of servomechanisms with friction", ASME J. Dynamic Systems, Measurement and Control, Vol. 126, No. 4, pp. 911-915, 2004.

[11] S.-H. Park and T. Matsuo, "Time-derivative estimation of noisy movie data using adaptive control theory", International Journal of Signal Processing, Vol. 2, No. 3, pp. 170-177, 2005.

[12] D. Putra, L.P.M. Moreau, and H. Nijmeijer, "Observer-based compensation of discontinuous friction", Proc.
of the 43rd IEEE Conference on Decision and Control, pp. 4940-4945, 2004.

[13] L.R. Ray, A. Ramasubramanian, and J. Townsend, "Adaptive friction compensation using extended Kalman-Bucy filter friction estimation", Control Engineering Practice, Vol. 9, pp. 169-179, 2001.

[14] K. Sato, Y. Mishima, K. Tsuruta, and K. Murata, "Adaptive positioning control for linear slider with friction compensation", Trans. of the Society of Instrument and Control Engineers, Japan, Vol. 40, No. 2, pp. 275-277, 2004 (in Japanese).

[15] P. Tomei, "Robust adaptive friction compensation for tracking control of robot manipulators", IEEE Transactions on Automatic Control, Vol. 45, No. 6, pp. 2164-2169, 2000.

[16] J. Wang, S.S. Ge and T.H. Lee, "Adaptive friction compensation for servo mechanisms", in: Adaptive Control of Nonsmooth Dynamic Systems, G. Tao and F.L. Lewis (Eds.), Springer, pp. 211-248, 2001.

[17] Q.H. Xia, S.Y. Lin, M.H. Ang Jr. and T.M. Lim, "Adaptive joint friction compensation using a model-based operational space velocity observer", Proc. of the IEEE International Conference on Robotics and Automation, pp. 3081-3086, 2004.

Kouichi Mitsunaga, Oita Institute of Technology, Control Engineering Department, 407-27 Higashihama, Nakatsu, Oita, Japan 871-0006. E-mail: mitu@oita-it.ac.jp

Takami Matsuo, Oita University, Department of Architecture and Mechatronics, 700 Dannoharu, Oita, Japan 870-1192. E-mail: matsuo@cc.oita-u.ac.jp

Received: August 4, 2007

Kouichi Mitsunaga was born in Japan on July 19, 1963. He received the B.S. degree in electrical engineering from Oita University, Oita, Japan, in 1986. He is currently an assistant professor in the Control Engineering Department, Oita Institute of Technology, Japan. He has worked in control system engineering. Takami Matsuo was born in Japan on March 24, 1958. He received the doctoral degree in engineering from Kyushu University, Fukuoka, Japan, in 1985.
He is currently a professor in the Department of Architecture and Mechatronics, Oita University, Japan. He has worked on control system theory and its applications. Matsuo is a member of IEEE and SIAM.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 2, pp. 95-101

Improvement and Extension of Virtual Reality for Flexible Manufacturing Systems

Flavio Véliz Vasconcelo, Gastón Lefranc Hernández

Abstract: This work presents the improvement and extension of the virtual reality software created by the Robotics, Artificial Intelligence and Advanced Automation Laboratory of the School of Electrical Engineering of the Pontificia Universidad Católica de Valparaíso.

Keywords: software, virtual reality, Java, FMS.

1 Introduction

The development of this program is based on version 1.0, in which a virtual prototype was created of a Cartesian manipulator belonging to an AS/RS storage cell of a flexible manufacturing system. Version 2.0, the version created in this project, introduces modifications in three respects:

• The 3D model is built using support tools that ease the work and allow a better design, in this case 3DStudio.
• The software is extended by adding to the virtual reality new objects from the complete flexible manufacturing system that is in the laboratory.
• A control of the SCARA manipulator is established, together with a virtual reality representation of the work performed in a flexible assembly cell consisting of a robotic manipulator, a vision system and an automated transport system.

The motivation for this project is to maintain control of a process and also to make modifications to the cell by simulating them in virtual reality, optimizing the work of the flexible system and reducing the dead time of the cell produced by halting the process, since modifications and tests of the cell
can be evaluated in the virtual reality, which accounts for all the factors that affect a process, for example gravity and the movement restrictions of the objects.

2 State of the Art

The generation of interactive three-dimensional graphics has advanced immensely in recent years; we can now simulate real worlds in real time with a good degree of realism, but man-machine interface techniques are still under investigation. The first virtual reality (VR) systems appeared at the end of the 80s and the beginning of the 90s, but research in this field began at the end of the 60s, especially in the aerospace and military industry and in research centers such as the Massachusetts Institute of Technology and the University of North Carolina. Between the 90s and the beginning of the XXI century, VR experienced great development, tied to the advances in communications, computers and digital technology in general. Virtual reality is now considered a work tool for industry, for example the automotive industry and robotics, and is used in universities such as the Universidad Nacional Autónoma de México (UNAM), the Pontificia Universidad Católica de Valparaíso and the Universidad Politécnica de Cataluña, and even in art universities such as Pompeu Fabra and museums such as the Guggenheim Museum in New York.

At the PUCV, virtual reality work began in 2001 with a prototype of this type of software: the representation of a manipulator in space. This manipulator is part of a flexible manufacturing system (FMS) built in the Robotics, Automation and Artificial Intelligence Laboratory of the School of Electrical Engineering. The first idea was to represent a cell of the FMS; the software had to be flexible and had to work on any operating system, so it was written entirely in Java. In v2.0 of this software, all the systems that exist in the laboratory are represented, using 3D development tools.
This version can control both manipulators in the FMS: the Cartesian manipulator, part of the AS/RS cell, and the SCARA manipulator, which works with artificial stereo vision in the assembly cell. At this moment the laboratory is working on the creation of the environment in real time using 3D vision, and the final idea is to add this application to the virtual reality v2.0 software to make it even more flexible.

3 Structure of the System

The represented cell is a storage cell composed of three stages: the first in charge of information management (the database), the second the interface for the administrator, and the third the mechanism that executes the indicated tasks. The storage system consists of a rectangular warehouse, called the matrix, and a Cartesian manipulator with 4 degrees of freedom. With respect to the design of the virtual world, a virtual universe was structured so that other devices of an FMS could be added, integrating in this way the different cells of a flexible production system, as shown in Figure 1.

Figure 1.

4 Representation and Control of the Graphical Devices

The movements of the manipulator established in the Cartesian manipulator classes are defined by incorporating the inverse kinematics of the manipulator as a general characteristic of the manipulator class, which each specific manipulator inherits. For the simulated Cartesian manipulator, the inverse kinematics is applied directly to the position given by the database or by a user, on the basis of several final positions. Since the manipulator is Cartesian, the final position and the position of each of its joints are referred to a single coordinate system, unlike other types of manipulators.
The information that the database gives when working in automatic mode, with respect to the task to be performed, is the following:

• Move the manipulator to a position (x1, y1, z1)
• Store the "pallet"
• Take out the "pallet"
• Take the "pallet" from the transport system (x, y, z)
• Put the "pallet" on the transport system (x, y, z)

The information entered by the user when working in manual mode updates the database so that it issues the action order to the manipulator. The orders that can be sent are the same ones the database gives when working in automatic mode, that is, store "pallet", remove "pallet", etc. The real manipulator is equipped with sensors that confirm its position when executing a movement, and this information is fed back to the virtual reality program so that it can make sure that what is happening in the virtual reality is exactly what is happening in the real manipulator.

5 Interface with the Reality

An interface with the database must exist, whose objective is to receive information from it or to send information to it in order to modify it. This interface is implemented by means of the Net8 program, which allows the connection between the virtual reality program and the database so that the stored information can be extracted through order or position tables. The program must also be able to control a new manufacturing cell, corresponding to the assembly and stereo vision cell. That cell is composed of a SCARA 7547 manipulator, the work table, the conveyor belts, the "pallets" used for the transport of products, and the computer vision system.
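The order table above can be modeled as a small command vocabulary shared by the database and the virtual manipulator. The sketch below is hypothetical (the paper does not show its data structures; names such as `Order` and `Command` are ours), but it illustrates how each database row could map to a typed command, with a position attached only to the orders that need one.

```python
from dataclasses import dataclass
from enum import Enum, auto
from typing import Optional

class Order(Enum):
    """Order types taken from the database command list above."""
    MOVE_TO = auto()
    STORE_PALLET = auto()
    REMOVE_PALLET = auto()
    TAKE_FROM_TRANSPORT = auto()
    PUT_ON_TRANSPORT = auto()

@dataclass
class Command:
    order: Order
    position: Optional[tuple] = None   # (x, y, z), when the order needs one

def describe(cmd: Command) -> str:
    """Render a command the way the virtual manipulator might log it."""
    if cmd.position is not None:
        return f"{cmd.order.name} at {cmd.position}"
    return cmd.order.name

cmd = Command(Order.MOVE_TO, (10, 5, 2))
```

A dispatcher keyed on `Order` would then let both the automatic (database-driven) and manual (user-driven) modes reuse the same execution code path.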
This flexible production system is represented in a virtual reality with the intention of having total control over it, being able to represent everything that is happening in reality in the virtual reality; in this way any fault in the real system can be detected while the behavior of the virtual reality is being observed. The virtual reality system communicates with the assembly system through the COM1 communications port with an RS-232 interface, over which the coordinates to which the manipulator must move are transmitted when it is commanded from the assembly station, or over which a manual control of the SCARA manipulator can be ordered from the virtual reality interface.

6 Programming and Design Tools

The program described previously was developed in the Java language using the 3D design API Java3D. This programming language was chosen for the following reasons:

1. There must be a way of creating an interface between the graphical generator and the source program; because Java3D is a Java API, this function is already built in.
2. The availability and ease of obtaining manuals and tutorials about Java and these programs.
3. The advantage of creating a multiplatform program, since Java runs on a virtual machine.
4. As version 1.0 of the virtual reality program already exists, created with the Java tools, its source code can be reused in new versions.

The great advantage of Java is that it is an object-oriented programming language with a neutral architecture, and the advantage of object-oriented programming (OOP) is the capacity to reuse code. Another great advantage of Java is that different applications do not have to be created to use the application on different platforms: with Java we can develop an application that can automatically be used on diverse platforms, such as Windows, Unix or Macintosh systems.
Using JBuilder, from the Borland company, the elements of a graphical interface for Java applications can be assembled quickly and easily. Building user interfaces is not very complex because several blocks of components are available, selected from a palette that contains components such as buttons, text areas, lists, dialog boxes, etc. Then the values of the properties of these components are set, and to each component event is attached the code that will handle it, telling the program how to respond to an event in the user interface. In order to obtain a better virtual reality design, one that more closely resembles the real FMS, the design program 3DStudio Max 5 (Spanish version) is used, supported by the design tool AutoCAD 2002 (Spanish version). With the support of these tools, and using the "Loader" class of J3D, the 3D object files of 3DStudio can be linked with Java. The design begins by creating the objects in the AutoCAD program, so as to manage the measures of the objects and to give them their different shapes more easily; then the AutoCAD files are exported to 3DStudio, where the design is completed by giving them colors and textures.

7 3D Model

The great improvement incorporated in version 2.0 of the virtual reality program lies in the generation of objects, since in version 1.0 the manipulator was created entirely with the Java 3D API, which implies writing a great number of lines of code for a simple object, and without great definition. Although a scene can be created using J3D, it is also possible to generate the objects of the scene in a design program such as AutoCAD or 3DStudio Max. This is a powerful way of bringing objects into the virtual world by importing the information from a file. Loading the data from a file gives access to data created in another application.
The 3DStudio Max 5 program provides the tools necessary to create the objects that compose the flexible manufacturing system. The simulation of a universe is generated using a modeler. The development of a 3-D modeler includes three main phases, as shown in Figure 2:

Figure 2.

Generation of primitives: simple geometric objects are created as the basis of any geometric model. Simple objects are created by parametric generation primitives, or by linear or radial sweeping.

Composition: compound objects are created by assembling simple objects. Composition generates more complex objects with rigid or articulated behavior.

Animation: animation consists of equipping the objects with movement within the environment. This can be applied to composed objects in their totality or to one of their joints. Animation includes camera movements (the observer's point of view) and manipulation of the lights.

Through different design techniques the images of Figures 3 and 4 were obtained.

Figure 3: Simple object. Figure 4: Compound object.

In order to obtain better quality in the design, support is sought in the Corel Draw programs and their scanner application to perform OCR vectorization. The measurement data of the manipulator are taken from its manual, where two views are available that are sufficient to make a 3D design of the manipulator. Similarly, all the components that form part of the flexible system mentioned in the previous points are created, arriving at the final design of version 2.0 that can be seen in Figure 5.

8 Inverse Kinematics

When we select a position in the Cartesian plane inside the workspace, the SCARA is instructed to move to the selected position.
with this information the inverse kinematics can be calculated to generate the animation of the 3d manipulator, moving the articulations or joints to the set point. figure 5: virtual fms. in every joint the z axis is chosen as the rotation axis, the x axis as the line joining the joints, and the y axis as the third, perpendicular axis. to describe the position of the joints in terms of the joint angles we can write a transformation matrix that relates the frame assigned to a joint to the frame of the previous joint. for the scara manipulator, the transformation matrix between the end frame and the base frame is:

$$T = \begin{pmatrix} \cos\theta_{12} & -\sin\theta_{12} & 0 & l_1\cos\theta_1 \\ \sin\theta_{12} & \cos\theta_{12} & 0 & l_1\sin\theta_1 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \qquad (1)$$

where $\theta_{12} = \theta_1 + \theta_2$. the inverse kinematics problem is to obtain the joint angles from the cartesian position (x, y) of the gripper. these angles can be obtained algebraically or geometrically; here the geometrical method was chosen because it is simpler. in this way we obtain:

$$\cos\theta_2 = \frac{r^2 - l_1^2 - l_2^2}{2\, l_1 l_2} \qquad (2)$$

$$\theta_1 = \beta + \delta \quad \text{if } \theta_2 < 0 \qquad (3)$$

$$\theta_1 = \beta - \delta \quad \text{if } \theta_2 > 0 \qquad (4)$$

where

$$r^2 = x^2 + y^2 \qquad (5)$$

$$\tan\beta = y/x \qquad (6)$$

$$\cos\delta = \frac{r^2 + l_1^2 - l_2^2}{2\, l_1 r} \qquad (7)$$

l1 and l2 are known values, the lengths of the two parts of the arm of the scara manipulator, so only the value of r is still needed to calculate δ. since the destination point of the gripper is also known in cartesian coordinates, the x and y values are substituted into equation (5) to obtain r. the x and y values are entered by the user through the virtual reality software interface, or through the database that commands the whole flexible manufacturing system. having the values of l1, l2 and r, the value of δ is obtained from equation (7), and substituting the x and y values into equation (6) gives the value of β.
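the geometric inverse kinematics derived above can be sketched in a few lines of code. this is an illustrative python version with function names and a numerical check of our own (the paper's actual implementation is in java inside the animation class), assuming a standard 2-link planar arm:

```python
import math

def scara_ik(x, y, l1, l2, elbow_up=True):
    """Geometric inverse kinematics for a 2-link planar (SCARA) arm,
    following equations (2)-(7): r from (5), theta2 from (2),
    beta from (6), delta from (7), theta1 from (3)/(4)."""
    r2 = x * x + y * y                                      # eq. (5)
    r = math.sqrt(r2)
    cos_t2 = (r2 - l1 * l1 - l2 * l2) / (2 * l1 * l2)       # eq. (2)
    theta2 = math.acos(max(-1.0, min(1.0, cos_t2)))
    if not elbow_up:
        theta2 = -theta2
    beta = math.atan2(y, x)                                 # eq. (6)
    cos_d = (r2 + l1 * l1 - l2 * l2) / (2 * l1 * r)         # eq. (7)
    delta = math.acos(max(-1.0, min(1.0, cos_d)))
    # eq. (4) when theta2 > 0, eq. (3) when theta2 < 0
    theta1 = beta - delta if theta2 > 0 else beta + delta
    return theta1, theta2

def scara_fk(theta1, theta2, l1, l2):
    """Forward kinematics, used only to verify an IK solution."""
    x = l1 * math.cos(theta1) + l2 * math.cos(theta1 + theta2)
    y = l1 * math.sin(theta1) + l2 * math.sin(theta1 + theta2)
    return x, y
```

running the forward kinematics on the angles returned for a reachable target point reproduces that point, for either elbow configuration.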
with equation (2) we obtain θ2, the rotation angle of l2, which then determines the rotation of the extremity l1 through equations (3) and (4), depending on the case. this information is passed to the animation class, which is able to move the 3d design using the behavior class of j3d. 9 conclusions the use of virtual reality in the industrial area makes it possible to increase production and to keep personnel away from dangerous tasks. by simulating the flexible system, tests are carried out on a simulated system that takes into account all the factors affecting the real system, reducing production costs: the best option for carrying out a task can be selected without having to stop the system for a long time, but only for as long as the modification requires. the advantage of building the virtual reality in java is that the application can be deployed on independent platforms without any problem, since it runs on a virtual machine; and because java was born oriented to the internet, the software can also be adapted to serve production over the web, rendering and allowing control of the flexible system from any part of the world. figure 6: virtual scara, joints and rotation axes. virtual reality can join two very different communities, artists and scientists, because such projects need computing and engineering experts as well as designers and experts in visual arts. references [1] zepeda, roberto (lefranc, gastón, p.g.), modelación en redes de petri y simulación en realidad virtual de sistemas flexibles de manufactura, informe final de proyecto de titulación, pontificia universidad católica de valparaíso, agosto 2001.
[2] flores, javier (lefranc, gastón, p.g.), realidad virtual a sistemas flexibles de manufactura, informe final de proyecto de titulación, pontificia universidad católica de valparaíso, mayo 2004. [3] aaron e. walsh, doug gehringer, java 3d api jump start. [4] selman, daniel, java 3d programming. [5] burgos, daniel, 3dstudio max práctico: guía de aprendizaje. [6] lemay, cadenhead, aprendiendo java en 21 días. flavio véliz vasconcelo, gastón lefranc hernández, pontificia universidad católica de valparaíso, escuela de ingeniería eléctrica, av. brasil 2147, valparaíso, chile. e-mail: flavio.veliz@vtr.net, glefranc@ieee.org

international journal of computers, communications & control vol. ii (2007), no. 2, pp. 143-148. self-organizing maps for analysis of expandable polystyrene batch process. mikko heikkinen, ville nurminen, yrjö hiltunen. abstract: self-organizing maps (som) have been successfully applied in many fields of research. in this paper, we demonstrate the use of a som-based method for the analysis of an expandable polystyrene (eps) batch process. to this end, a data set of the eps batch process was used for training a som. the reference vectors of the som were then classified by the k-means algorithm into six clusters, which represent product types of the process. the som could also be used for estimating the optimal amounts of the stabilisation agent. the results on a validation data set showed good agreement between the actual and estimated amounts of the stabilisation agent. based on this model, a web application was made for test use at the plant. the results indicate that the som method can also be applied efficiently to the analysis of a batch process. keywords: neural networks, self-organizing maps, process control, batch process. 1 introduction batch processes are typically based on predefined process recipes. if process circumstances, chemicals and recipes are constant, the product should basically always be the same.
a batch process is also commonly used for producing expandable polystyrene (eps). in practice, however, this polymerisation reaction is a very sensitive process and numerous variables affect it, which makes the process difficult to control. eps production must be able to follow quickly the aims and quality requirements of the market, which places additional demands on process control. archived process data is an important resource for the knowledge management of the process, and it can be used for optimization and improvement of productivity. recent applications have demonstrated that artificial neural networks can provide an efficient and highly automated method for modelling industrial data [1], [2]. in particular, studies which use standardized protocols are most likely to benefit from automated ann analysis [1], [2]. self-organizing maps [1], [3]-[5] have also been successfully applied in many areas of research and are thus a tool for process optimization. the som method offers an efficient means of handling complex multidimensional data, which is typically the situation in industrial applications. in addition, the som method is robust to missing values in the data. here, we apply self-organizing maps to the analysis of an eps batch process. the optimal amounts of the stabilisation agent can be estimated using the som model. in the study we have also included some features of the supervised approach in the designed unsupervised method. 2 methods 2.1 the process and the data the studied process was a typical suspension polymerisation batch process, which is commonly used for producing eps (expandable polystyrene). the polymerisation stage is executed in a pressure-temperature range below the boiling point of the styrene-water suspension system. after the polymerisation stage the process continues into the impregnation stage, where the blowing agent is impregnated into the beads.
the impregnation stage is assumed to have a negligible effect on the bead size distribution. the biggest challenge in the suspension polymerisation process is to achieve the required bead size distribution. it is common knowledge that the basic variables in terms of bead size are the mixing properties and the amount and quality of the suspension stabilizers. however, the suspension polymerisation of styrene is a very sensitive process and numerous variables affect it. most of these variables cannot be measured or followed in a reasonable way; for example, analysing all impurities in all raw materials is too heavy a task for any industrial laboratory. some variables are quite easily measurable, but have not been traced due to the assumption that they would not contribute significantly to the process. to be able to model the process, the studied system required elimination of the variables which were assumed to be inessential. the data for the model were divided into three groups: recipe, results and process parameters. process parameters, such as the actual reactor temperature, were measured and stored automatically from each batch every minute. process parameter data was not used for modelling in this first part of the study; the aim is to add it to the model later. table 1 shows the recipe and result variables. the data contained 15 production campaigns and about 450 batches in 4 reactors. the data had to be divided into two separate groups due to process changes in the stabilisation system. these changes limited the size of the useful data to 251 batches, which is a low amount for accurate modelling. fortunately, more production campaigns are executed almost every month, so more data will be available for further studies.
2.2 computational methods self-organizing maps self-organizing maps (soms) are an artificial neural network methodology which can transform an n-dimensional input vector into a one- or two-dimensional discrete map. input vectors which have common features are projected to the same area of the map (in this case described as "neurons"). each neuron is associated with an n-dimensional reference vector, which provides a link between the output and input spaces. during learning, an input data vector is mapped onto a particular neuron (the best matching unit, bmu) based on the minimal n-dimensional distance between the input vector and the reference vectors of the neurons. the reference vectors of the activated neurons are then updated. when the trained map is applied, the best matching units are calculated using these reference vectors. in this unsupervised methodology, the som can be constructed without a priori knowledge [1]. the data were coded into 11 inputs for the som. all input values were variance scaled. a som with 676 neurons in a 26x26 hexagonal arrangement was constructed. the linear initialization and batch training algorithms were used in the training of the map. a gaussian function was used as the neighbourhood function. the map was taught with 10 epochs and the initial neighbourhood had the value of 6.

table 1: recipe and result variables of the particular styrochem suspension polymerisation process.

recipe variables (unit):
- amount of stabilisation agent (% from ms)
- mixing speed (rpm)
- polymerisation temperature (°c)
- reactivity (min)
- amount of styrene (kg)
- additional stabilisation (kg)
- product type
- batch number in campaign
- reactor

result variables (unit):
- under sized (%)
- product a (%)
- product b (%)
- product c (%)
- product d (%)
- product e (%)
- product f (%)
- over sized (%)
- mean particle size (mm)
- delta (mm)
- narrowness
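the training cycle described above, bmu search followed by a neighbourhood update of the reference vectors, can be sketched as follows. this is a minimal sequential-training illustration with grid size, learning-rate and neighbourhood schedules of our own choosing; it is not the batch algorithm or the som toolbox code actually used in the paper:

```python
import numpy as np

def train_som(data, grid=(6, 6), epochs=10, sigma0=3.0, lr0=0.5, seed=0):
    """Minimal SOM: sequential training with a gaussian neighbourhood."""
    rng = np.random.default_rng(seed)
    n_neurons = grid[0] * grid[1]
    dim = data.shape[1]
    # one reference vector per neuron, initialized near the data mean
    w = rng.standard_normal((n_neurons, dim)) * 0.1 + data.mean(0)
    # 2-d grid coordinates of each neuron (rectangular grid for simplicity)
    coords = np.array([(i, j) for i in range(grid[0])
                       for j in range(grid[1])], dtype=float)
    for epoch in range(epochs):
        sigma = sigma0 * (1.0 - epoch / epochs) + 0.5   # shrinking neighbourhood
        lr = lr0 * (1.0 - epoch / epochs) + 0.01        # decaying learning rate
        for x in data:
            bmu = np.argmin(((w - x) ** 2).sum(1))      # best matching unit
            h = np.exp(-((coords - coords[bmu]) ** 2).sum(1) / (2 * sigma ** 2))
            w += lr * h[:, None] * (x - w)              # pull neighbours toward x
    return w, coords

def bmu_index(w, x):
    """Best matching unit of an input vector on a trained map."""
    return int(np.argmin(((w - np.asarray(x)) ** 2).sum(1)))
```

after training on data containing two well-separated groups, inputs from the two groups map to different neurons, which is the property the paper exploits when it reads estimates off the reference vectors.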
the som toolbox [7] was used in the analysis under a matlab software platform (mathworks, natick, ma, usa). the data set of the eps batch process (n = 251 batches) was divided into two subsets. the first subset (the first 190 batches) was the training set, which was used for training the map. the other subset (the last 61 batches) was the test set. the variables of each batch and the amounts of the stabilisation agent were used as input for the som in the training phase. in the test phase, however, the amounts of the stabilisation agent were treated as missing values, i.e. the best matching neurons for the test set were sought using only the other batch variables specified above. estimated values for the amounts of the stabilisation agent were obtained from the reference vectors of the neurons. k-means method the k-means algorithm was applied to the clustering of the map. the k-means method is a well-known non-hierarchical cluster algorithm [8]. the basic version begins by randomly picking k cluster centers, assigns each point to the cluster whose mean is closest in the euclidean distance sense, then computes the mean vectors of the points assigned to each cluster and uses these as new centers, in an iterative approach. 3 results and discussion the map was obtained by training a self-organizing network with the training set of the eps batch process. the map and the six clusters calculated by the k-means method are shown in figure 1. these clusters represent different types of products; brief descriptions of the clusters are also given in figure 1. the method was validated using the test set, i.e. the last 61 data vectors. the results, when the som was trained to estimate the amounts of the stabilisation agent, are illustrated in fig. 2. the correlation coefficient between the actual amount of the stabilisation agent and the estimated one was 0.851 (fig. 2 a). in fig.
2 b) the actual and estimated amounts of the stabilisation agent are shown as a function of the batch number.

table 2: variables used for modelling.
- under sized
- product a
- product b
- product c
- product d
- product e
- mean particle size
- batch number in campaign
- amount of stabilisation agent in previous batch
- amount of stabilisation agent

a web application based on this som model was made for estimating the optimal amount of the stabilisation agent in practice at the plant. figure 3 illustrates the interface of this application. figure 1: som using the data of the eps batch process; the background colours visualize the six clusters of the map, and short descriptions for each cluster are also shown. figure 2: a) the correlation between the actual amount of the stabilisation agent and the one estimated by the som analysis; the results are from the test set (the last 61 batches) and the value of the correlation coefficient is 0.851. b) the actual and estimated amounts of the stabilisation agent as a function of the batch number. figure 3: the interface of the web application. the same variables, which are shown in table 2, have been used in this application. a user gives as many as possible of the ten uppermost variables and the application calculates the lowest one. the first seven variables define the target amounts of the product. our earlier results showed that the som method could be successfully applied to process state monitoring and optimization of nox emissions in the case of a continuous process [6]. the results presented here also illustrate the advantages of using the som method in the analysis of a batch process. figure 3 shows that the method can be used for estimating optimal parameters of the process. it seems to reduce the variability of the process and so helps to obtain better products.
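the k-means step described in section 2.2 (random centers, nearest-center assignment, mean recomputation) can be sketched as follows. this is an illustrative python version of the basic algorithm, not the matlab/som toolbox routine the paper actually used:

```python
import numpy as np

def kmeans(points, k, iters=100, seed=0):
    """Basic k-means: pick k random points as centers, assign every point
    to the nearest center (euclidean), recompute the means, repeat."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    labels = np.zeros(len(points), dtype=int)
    for _ in range(iters):
        # squared euclidean distance from every point to every center
        d = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d.argmin(axis=1)
        # new center = mean of assigned points (keep old center if empty)
        new_centers = np.array([points[labels == j].mean(axis=0)
                                if np.any(labels == j) else centers[j]
                                for j in range(k)])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels
```

in the paper this clustering is applied not to the raw batches but to the 676 reference vectors of the trained map, yielding the six product-type clusters of figure 1.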
because the som method is also insensitive to the presence of missing values, it is feasible for the analysis of industrial data. furthermore, som analysis does not require extensive knowledge of neural networks and can easily be included in any kind of software. an attractive property of the som is also that it can be retrained if new product types of the process are to be analysed. 4 conclusion the som analysis provides an efficient and automated method for data analysis in the process industry. the present study shows that this kind of data-driven approach is a fruitful way of analysing a batch process. 5 acknowledgement this research was supported by styrochem ltd. references [1] t. kohonen, self-organizing maps, springer-verlag, berlin heidelberg new york, 2001. [2] s. haykin, neural networks: a comprehensive foundation, upper saddle river, nj: prentice hall, 1999. [3] j. kaartinen, y. hiltunen, p. t. kovanen, m. ala-korpela, classification of human blood plasma lipid abnormalities by 1h magnetic resonance spectroscopy and self-organizing maps, nmr biomed, vol. 11, pp. 168-176, 1998. [4] m. t. hyvönen, y. hiltunen, w. el-deredy, t. ojala, j. vaara, p. t. kovanen, m. ala-korpela, application of self-organizing maps in conformational analysis of lipids, journal of the american chemical society, vol. 123, pp. 810-816, 2001. [5] m. heikkinen, m. kolehmainen, y. hiltunen, classification of process phases using self-organizing maps and sammon's mapping for investigating activated sludge treatment plant in a pulp mill, proceedings of the fourth european symposium on intelligent technologies and their implementation on smart adaptive systems, pp. 281-297, 2004. [6] m. heikkinen, a. kettunen, e. niemitalo, r. kuivalainen, y. hiltunen, som-based method for process state monitoring and optimization in fluidized bed energy plant, icann 2005, lecture notes in computer science 3696, eds. w. duch, j. kacprzyk, e. oja, s.
zadrozny, springer-verlag berlin heidelberg, pp. 409-414, 2005. [7] homepage of som toolbox, helsinki university of technology, laboratory of computer and information science (cis), http://www.cis.hut.fi/projects/somtoolbox/. [8] j. macqueen, some methods for classification and analysis of multivariate observations, in proceedings of the fifth berkeley symposium on mathematical statistics and probability, vol. i: statistics, university of california press, berkeley and los angeles, pp. 281-297, 1967. mikko heikkinen, university of kuopio, department of environmental sciences, p.o. box 1627, fin 70211 kuopio, finland, e-mail: mikko.heikkinen@uku.fi. ville nurminen, styrochem ltd, p.o. box 360, fin 06101 porvoo, finland, e-mail: ville.nurminen@styrochem.com. yrjö hiltunen, university of kuopio, department of environmental sciences, p.o. box 1627, fin 70211 kuopio, finland, e-mail: yrjo.hiltunen@uku.fi. received: march 6, 2007

international journal of computers, communications & control vol. i (2006), no. 1, pp. 41-46. an exact algorithm for steiner tree problem on graphs. milan stanojević, mirko vujošević. abstract: the paper presents a new original algorithm for solving the steiner tree problem on graphs. the algorithm is simple and intended for solving problems of relatively low dimensions. it is based on the use of existing open source software for solving integer linear programming problems. the algorithm has been tested and shown to be very efficient on various randomly generated problems on graphs of up to 50 nodes, up to 10 terminals and average node degree 7. keywords: steiner tree problem on graphs, branch and cut, algorithm, optimization. 1 introduction the steiner tree problem (stp) arises in telecommunication and energy systems, vlsi technologies and other network planning tasks. the problem is to find a minimal-length tree which connects all terminal nodes of a given graph and may contain an arbitrary number of other nodes.
the problem is similar to the well-known shortest spanning tree problem (sstp), but unlike that one it does not necessarily contain all nodes of the given graph. another very important difference is that the stp is a much harder problem than the sstp. the usual formulation of the stp is the following: a connected undirected graph g = (n, e) is given, where n = {1, . . . , n} is a set of nodes and e ⊆ {{i, j} | i ∈ n, j ∈ n, i < j} denotes a set of edges. a positive value (length, weight, etc.) ce is associated with every edge e ∈ e. also, a set t ⊂ n of so-called terminal (steiner) nodes is given. definition 1. a steiner tree for t in g is a subgraph st = (n′, e′), n′ ⊆ n, e′ ⊂ e, which satisfies one of the following statements: 1. t ⊆ n′ and st is a tree; 2. for all s, t ∈ t there exists exactly one path in st from s to t. the length of a steiner tree is the sum of the lengths of all the edges it consists of. the steiner tree problem is to find the shortest steiner tree. this is an np-hard problem and its decision variant belongs to the class of np-complete problems [6]. the problem is presented in detail in the surveys [4], [5], [9] and [11]. in papers [2], [8] and [7] the problem was solved to optimality using sophisticated branch and cut methods. in this paper an original exact algorithm for solving steiner tree problems on graphs, generally based on a branch and cut procedure, is proposed. it uses code from an available open source project which develops software for solving linear and integer linear programming problems. the intention was to formulate an algorithm which can be implemented in a relatively short time and which is able to solve stps of "reasonable" dimensions in "reasonable" time. to be more precise, the implementation of the proposed algorithm took about 3-4 days, and the program can solve problems of dimensions larger than most real-life problems in, at most, several minutes.
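definition 1 can be made concrete with a toy brute-force search over edge subsets. this sketch (our own illustration, hopeless beyond a handful of edges) simply returns the cheapest edge set that connects every terminal, which is exactly what the algorithm of section 3 computes efficiently:

```python
def steiner_brute(n, edges, terminals):
    """Exhaustive search over all edge subsets of a graph with n nodes.
    edges is a list of (i, j, cost) triples; returns the total cost and
    the bitmask of the cheapest subset connecting every terminal."""
    best_cost, best_mask = float("inf"), None
    for mask in range(1 << len(edges)):
        parent = list(range(n))

        def find(a):
            while parent[a] != a:
                parent[a] = parent[parent[a]]  # path halving
                a = parent[a]
            return a

        cost = 0
        for e, (i, j, c) in enumerate(edges):
            if mask >> e & 1:
                parent[find(i)] = find(j)      # union the two endpoints
                cost += c
        # keep the subset only if it is cheaper and connects all terminals
        if cost < best_cost and all(find(t) == find(terminals[0])
                                    for t in terminals):
            best_cost, best_mask = cost, mask
    return best_cost, best_mask
```

on the path graph 0-1-2 with terminals {0, 2}, unit-length edges (0,1) and (1,2) and a direct edge (0,2) of length 3, the optimum routes through the non-terminal node 1 at total cost 2, illustrating why a steiner tree may include non-terminal nodes.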
in the next section of this paper, a mathematical model of the steiner tree problem and its explanation are given. in section 3, the algorithm and the process of implementation are explained in detail. in section 4 the experiments are described and some conclusions about the behaviour of the algorithm are drawn. section 5 concludes the paper. 2 mathematical model a steiner tree can be represented by a vector of binary variables x = (xe), one component per edge of the set e. the value of a variable xe indicates whether the corresponding edge e is in the steiner tree (xe = 1) or not (xe = 0). a mathematical model of the stp on undirected graphs has the following form:

$$\min \sum_{e \in E} c_e x_e$$

subject to

$$\text{(i)} \quad \sum_{e \in \delta(M)} x_e \ge 1 \qquad \forall\, M \subset N,\; M \cap T \neq \emptyset,\; (N \setminus M) \cap T \neq \emptyset$$

$$\text{(ii)} \quad x_e \in \{0, 1\} \qquad \forall\, e \in E \qquad (1)$$

where n, e and t are as in definition 1, ce is the positive value associated with each edge e, and δ(m) denotes the graph cut defined by a subset of nodes m ⊂ n, i.e. the set of edges with one end node in m and the other in the complement set n \ m: δ(m) = {{i, j} ∈ e | i ∈ m, j ∈ n \ m}. the mathematical model (1) is linear and integer. constraints (i) ensure that every cut which has terminal nodes on both sides (m and n \ m) contains at least one edge; in other words, they ensure that between every two terminal nodes there exists at least one path. a feasible solution of model (1) is not necessarily a steiner tree, but the optimal solution will be, because any edge closing a cycle would violate the optimality condition. so, although the formulation is rather comprehensive, it can be applied only to problems where the goal is to minimize the length of the steiner tree. a disadvantage of model (1) is that the number of constraints grows exponentially with the problem size.
on the other hand, in branch and cut methods a relaxation of the formulation may give acceptable results. 3 implementation the two main challenges in solving model (1) are: (i) the exponential number of constraints, and (ii) the exponential time needed to solve the integer program even with a smaller number of constraints. to overcome the first challenge, the proposed algorithm uses a relaxation of model (1). the idea was inspired by paper [7] and partly published in [10]. namely, because of the large number of constraints, solving a model which includes all of them is very hard and in many cases impossible. on the other hand, in most cases it is not necessary to include all the constraints in order to obtain an optimal solution: only the constraints which are active in the optimal solution are really necessary. of course, we cannot predict which of them will be active, but we can start with some smaller number of constraints, giving priority to those which are "more likely" to be necessary for obtaining a feasible solution (a steiner tree). if we solve a model with a smaller number of constraints and the solution is a steiner tree of the given graph, then this solution is optimal for the original problem, i.e. adding more constraints cannot improve it. otherwise, the obtained solution will consist of two or more subtrees. we then iteratively add those constraints which we find to be violated and which will probably lead to a feasible solution, until we finally obtain a steiner tree. the most common way to solve a binary linear programming problem is an implementation of the branch and bound method in combination with the simplex method. given one of the main intended features of the algorithm (quick and easy implementation), developing simplex and branch and bound algorithms from scratch would not be appropriate.
although it would ultimately give better performance if these procedures were incorporated in the core of the complete procedure, their development would take very long. an alternative was found among open source projects. the project used in this implementation was lp_solve. lp_solve [1] is an open source project that provides very robust procedures and techniques for solving linear programming problems. besides that, it implements the branch and bound method for solving binary, integer and mixed integer linear problems. it has many options through which it is possible to influence the branch and bound procedure by changing its strategies, so its performance can be improved significantly [13]. lp_solve can be used as an independent application, in which case it reads problem files in lp and mps formats, or as a set of functions which can be incorporated into other programs and controlled from the host code. the project itself does not impose any restrictions on problem dimensions; some successful applications to mixed integer programming problems with several thousand variables have been reported. the license of the project is the lgpl (gnu lesser general public license) [12], which allows free download, use, modification and redistribution of the source code of the lp_solve project. the algorithm is formally formulated as follows:

algorithm 1: simplified branch and cut algorithm for stp
1. formulation of the initial integer linear mathematical model:
(a) goal function formulation: one variable is introduced for each edge, and the corresponding edge length is associated as a parameter with each variable.
(b) for each terminal node, one constraint of type (i) is formulated, so that the terminal node is a single node on one side of the cut and the rest of the nodes are on the other side, i.e.

$$\sum_{e \in \delta(M)} x_e \ge 1 \qquad \forall\, M \in \{\{t\} \mid t \in T\}.$$

the number of constraints after step 1 will be |t|.
2. solve the current mathematical model.
3.
check if the obtained solution is a steiner tree, i.e. if there is a path between all pairs of terminal nodes. if so, the optimal solution was found in step 2; end of the procedure. otherwise, go to the next step.
4. if the solution is not a steiner tree, it represents two or more unconnected subtrees. for every subtree, add one constraint of type (i) defined by the cut δ(m), where the nodes of that subtree belong to the set m. go to step 2.

the proposed algorithm can be characterized as a simplified version of the branch and cut method, i.e. a combination of branch and bound and cutting planes. the steps of the algorithm are illustrated by an example. suppose we have to obtain the minimal steiner tree for the graph in figure 1, where the four terminal nodes are marked with bigger circles. the mathematical model created in step 1 and updated in step 4 has one column for every edge. the constraints added in the first step ensure that every terminal is connected to at least one edge; in the example, four such constraints are added, one for every terminal. in figure 2, the four cuts are marked as open curves surrounding each terminal node, and the edges marked by dashed lines are candidates to be in the first solution. figure 1: initial graph. figure 2: first step. a possible solution after step 2 could be the one in figure 3. as mentioned above, if the obtained solution is not a steiner tree, it will consist of several subtrees; the number of subtrees can generally be between 2 and |t|. for determining whether the obtained solution is a steiner tree (in step 3), dijkstra's shortest path algorithm was used. in step 4 a new constraint for each subtree is added. in the example, three new constraints, corresponding to the three subtrees shown in figure 3, are added.
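steps 3 and 4 hinge on detecting the subtrees (connected components containing terminals) in the current 0/1 solution. the paper performs this check with dijkstra's shortest path algorithm; the sketch below (our own illustration, with hypothetical names) does the same job with a union-find pass over the selected edges, returning one node set per cut constraint to be added:

```python
def terminal_components(n, edges, x, terminals):
    """Given edges as (i, j) pairs and a 0/1 vector x of the same length,
    return the node sets of the connected components (over selected edges)
    that contain at least one terminal. A single set means the terminals
    are already connected (step 3 succeeds); several sets mean step 4
    must add one cut constraint per set, with M = that set's nodes."""
    parent = list(range(n))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    for e, (i, j) in enumerate(edges):
        if x[e]:
            parent[find(i)] = find(j)      # merge endpoints of selected edges
    comps = {}
    for t in terminals:
        comps.setdefault(find(t), set())   # one bucket per terminal component
    for v in range(n):
        r = find(v)
        if r in comps:
            comps[r].add(v)
    return list(comps.values())
```

for instance, with selected edges {(0,1), (1,2), (3,4)} and terminals {0, 3}, two components come back, so two new constraints of type (i) would be generated before re-solving.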
each image of figure 4 shows one cut (represented by the curve surrounding the subtree) and the edges (dashed lines) among which at least one will be in the next solution. after the next optimization, a possible solution could be the one shown in figure 5. this solution satisfies all added constraints and its graph structure is a steiner tree; without further checking, we can claim that it is an optimal solution of the given steiner tree problem. figure 3: possible solution after the first iteration. figure 4: three new constraints. figure 5: final, feasible and optimal solution. the main criteria for choosing the data structures were access speed and simplicity of implementation. the amount of memory used was not considered, because of the relatively small graph dimensions of the target instances. the logical choice was static structures (vectors and matrices). the realized structures enabled fast data access and mapping between the graph structure (realized through a neighborhood matrix) and the vector of edges with all corresponding attributes. the algorithm was implemented in the c language and was compiled and tested on the linux operating system (with the gcc gnu c compiler) on a pc with a pentium iii processor at 600 mhz and 384 mb of ram. the implemented program supports the so-called stp format [15] for steiner tree problems and is compatible with the library of standard steiner tree problems [14]. 4 experimental results the developed program was tested on different problem instances. a characteristic of the steiner tree problem is that the complexity of a solution procedure depends on three attributes: the number of nodes (n), the number of edges and the number of terminal nodes (t). the largest instances successfully solved by the program had dimensions [n/t]: 20/10, 32/8 and 50/5, with average node degree 7. the solving procedure took between 2 seconds and 2 minutes for most of the instances.
these dimensions may look modest compared with the several-thousand-node instances successfully solved in papers [2] and [7]. however, considering the complexity of the procedure, the simplicity of the implementation, and the fact that in many real-life telecommunication planning processes even smaller problems appear, the proposed implementation can be very useful. to get a more precise insight into the behavior of the algorithm, experiments were performed in which some parameters were varied. the following table gives the results (mean values and standard deviations) of experiments in which every dimension was tested on 15 randomly generated instances. the columns named "no. of rows" give the number of constraints needed to obtain an optimal solution (the optimization in the last iteration). the columns named "no. of iterations" give the number of iterations of algorithm 1 when passing through steps 2-4, i.e. the total number of binary subproblems solved (in step 2). the columns "time" give the cpu time spent solving the problem. the column "ancai" shows the average number of constraints added per iteration, obtained by the formula (no. rows − t) / no. iterations, where t is the number of terminal nodes, i.e. the number of constraints added to the initial mathematical model in step 1.

table 1: the complexity analysis of the algorithm

row  dimensions  no. of rows    no. of iterations  time [sec.]    ancai
     [n/t]       mean    σ      mean    σ          mean    σ
     (1)         (2)     (3)    (4)     (5)        (6)     (7)    (8)
1    20/5        25      8.9    7.5     3.0        0.2     0.3    2.7
2    32/5        38      22.1   12.3    8.0        3.0     9.7    2.7
3    50/5        60      28.3   20.0    11.3       7.2     13.3   2.8
4    20/8        65      20.3   19.5    8.5        11.5    27.5   2.9
5    32/8        88      31.9   26.3    12.0       34.5    46.7   3.0
6    20/10       83      15.5   23.6    5.2        13.4    9.9    3.1

on the basis of the data in table 1, some conclusions can be drawn. the fact that the complexity of the stp grows with the number of nodes is obvious from the first three rows.
although the speed of this growth cannot be determined exactly on the basis of such a small sample, it is obviously nonlinear – probably exponential. a more interesting conclusion is that complexity grows faster with the number of terminal nodes than with the total number of nodes in the graph. comparing rows 1, 4 and 6, a kind of "explosion" of complexity can be seen: when the number of terminals was doubled, execution time increased 67 times. a similar conclusion can be drawn from rows 2 and 5: adding three terminals increased execution time more than 11 times. the most important analysis here concerns the number of constraints needed to get an optimal solution. according to column 2, that growth is almost linear in the problem size. we cannot be certain that it is linear, but it is definitely not exponential. finally, we can conclude that, although the number of constraints in model (1) grows exponentially, the number of constraints necessary to obtain an optimal solution grows much more slowly, and the number of iterations grows even less. the parameter in column 8 is also interesting. the average number of constraints added in each iteration also represents the average number of subtrees obtained in each sub-solution. the relatively small values in column 8 do not seem to depend much on the number of terminal nodes. the explanation could be that the current solution of the solving procedure relatively quickly forms a structure consisting of a few subtrees, each containing several terminals. this may seem to contradict the previous statement that complexity depends more on the number of terminals than on the total number of nodes, because what influences the number of iterations is the number of unconnected subtrees, not the number of terminals. one possible explanation is that the structure of the subtrees changes, so subtrees contain different terminals in different iterations.
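the two growth figures quoted above can be checked against the mean execution times of table 1 (a quick verification of our own):

```python
# mean execution times from table 1, keyed by the instance label n/t
time_mean = {"20/5": 0.2, "32/5": 3.0, "50/5": 7.2, "20/8": 11.5, "32/8": 34.5, "20/10": 13.4}

# doubling the terminals on 20 nodes (rows 1 -> 6): 13.4 / 0.2 = 67x
ratio_terminals = time_mean["20/10"] / time_mean["20/5"]

# adding three terminals on 32 nodes (rows 2 -> 5): 34.5 / 3.0 = 11.5x
ratio_added = time_mean["32/8"] / time_mean["32/5"]
```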
it is also important to bear in mind that every iteration lasts longer than the previous one, because every mathematical model has more rows (constraints) than the previous one. yet another interesting observation from table 1 is the relatively large dispersion (represented by the standard deviation) of the data obtained from different randomly generated instances with the same characteristics. this is a consequence of the nature of the algorithm (which is nondeterministic polynomial): it is impossible to predict the number of iterations necessary to obtain the final solution, and in the worst case it can be exponential.
5 conclusion
the first impressions and the experimental conclusions indicate that the proposed algorithm can be used efficiently when there is a need for rapid development of an algorithm for solving smaller size steiner tree problems. although the worst-case number of constraints needed to obtain a final solution remains exponential, the algorithm has shown a kind of "good behavior" – in all solved examples the number stayed relatively low. although the exponential complexity of the branch and bound method (step 2 of the algorithm) remains, instances of acceptable dimensions can be solved in real time. the idea of successively adding violated constraints could also be applied to other problems whose mathematical models have an exponential number of constraints. in that direction, new research has been planned in which the idea would be applied to the traveling salesman problem (tsp): the so-called dfj formulation of tsp [3] also has an exponential number of constraints, but it has shown good behavior in relaxation-based algorithms.
references
[1] m. berkelaar, k. eikland, p. notebaert, lp_solve, files and discussion group, ftp://ftp.es.ele.tue.nl/pub/lp_solve, http://groups.yahoo.com/group/lp_solve/, 1994-2006.
[2] s. chopra, e. gorres, m. r.
rao, "solving a steiner tree problem on a graph using branch and cut", orsa journal on computing, vol. 4, pp. 320-335, 1992.
[3] g. b. dantzig, d. r. fulkerson, s. m. johnson, "solution of a large-scale traveling-salesman problem", operations research, vol. 2, pp. 393-410, 1954.
[4] f. k. hwang, d. s. richards, "steiner tree problems", networks, vol. 22, pp. 55-89, 1992.
[5] f. k. hwang, d. s. richards, p. winter, the steiner tree problem, north-holland, amsterdam, 1992.
[6] r. m. karp, "reducibility among combinatorial problems", in r. e. miller, j. w. thatcher (eds.), complexity of computer computations, pp. 85-103, plenum press, new york, 1972.
[7] t. koch, a. martin, "solving steiner tree problems in graphs to optimality", networks, vol. 32, pp. 207-232, 1998.
[8] a. lucena, j. e. beasley, "a branch and cut algorithm for the steiner problem in graphs", networks, vol. 31, pp. 39-59, 1998.
[9] n. maculan, "the steiner tree problem in graphs", in s. martello, g. laporte, m. minoux, c. c. ribeiro (eds.), surveys in combinatorial optimization, annals of discrete mathematics, vol. 31, pp. 185-212, 1987.
[10] m. stanojević, m. vujošević, "a new algorithm for solving steiner tree problem on graph" (in serbian), 12th telecommunications forum telfor 2004, belgrade, http://www.telfor.org.yu/telfor2004/e-index.html (http://www.telfor.org.yu/telfor2004/radovi/tm-2-4.pdf), 2004.
[11] p. winter, "steiner problem in networks: a survey", networks, vol. 17, pp. 129-167, 1987.
[12] gnu lesser general public license, http://www.gnu.org/copyleft/lesser.html
[13] lp_solve reference guide, http://www.geocities.com/lpsolve/
[14] steinlib testsets – the library of standard steiner problems, http://elib.zib.de/steinlib/testset.php
[15] stp – description of the stp data format, http://elib.zib.de/steinlib/format.php
milan stanojević, mirko vujošević
university of belgrade, faculty of organizational sciences
address: jove ilića 154, belgrade, serbia and montenegro
e-mail: {milans,mirkov}@fon.bg.ac.yu
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iv (2009), no. 4, pp. 363-373
fuzzy logic in genetic regulatory network models
c. muñoz, f. vargas, j. bustos, m. curilem, s. salvo, h. miranda
carlos muñoz poblete, francisco vargas parra, jaime bustos gomez, millaray curilem saldias, sonia salvo garrido, horacio miranda vargas
university of la frontera, avenue francisco salazar 01145, temuco, chile
e-mail: comunoz@ufro.cl
abstract: interactions between genes and the proteins they synthesize shape genetic regulatory networks (grn). several models have been proposed to describe these interactions, the most commonly used being those based on ordinary differential equations (odes). some approximations using piecewise linear differential equations (pldes) have been proposed to simplify the model nonlinearities; however, they do not always give good results. in this context, a model capable of representing small grn has been developed, combining characteristics of the ode models and fuzzy inference systems (fis). the fis is trained through an artificial neural network, which forms an adaptive network-based fuzzy inference system (anfis). this network adapts the membership and output functions of the fis according to the training data, thus reducing the previous knowledge needed to model the specific phenomenon.
in addition, fuzzy logic makes it possible to express the rules through linguistic labels, which also allows expert knowledge to be incorporated in a friendly way. the proposed model has been used to describe the lac operon in e. coli and has been compared with the models already mentioned. the outcome errors due to the training process of the anfis network are comparable with those of the models based on odes. additionally, the fuzzy logic approach provides modeling flexibility and knowledge acquisition advantages.
keywords: genetic regulatory network, fuzzy logic, anfis, differential equations, lac operon
1 introduction
the factors in charge of regulating the expression of a gene can be environmental or can be produced by other genes, or even by the same gene under regulation. the latter is the basis for understanding the so-called genetic regulatory networks (grn), since they are real networks of interaction between the genes in a cell. being able to accurately predict these interactions using mathematical and/or computational models would benefit a wide variety of applications such as medicine or agriculture. there have been many approaches to modeling these networks, such as bayesian networks, boolean networks, models based on ordinary differential equations, piecewise linear models, stochastic models and others [13], [14], [17]. among them, we can highlight the models based on ordinary differential equations (odes), since they describe biological phenomena in great detail and are used primarily for modeling small regulatory networks. a disadvantage of these models is the large number of parameters (to be known a priori) needed for an acceptable biological description, requiring an exhaustive study of the available literature to specify the parameters and/or the design of experiments to estimate each of the parameters as required.
it is now possible, given the necessary experimental data, to use optimization tools or artificial intelligence to solve this problem [2], [25]. one approach to the ode-based models is based on piecewise linear differential equations (pldes) [3], [5], [7]. on the other hand, artificial intelligence techniques, including fuzzy logic, have been incorporated primarily into the classification and analysis of data obtained through microarrays [8], [11], [21], [15], [23]. moreover, fuzzy logic techniques have also been considered for describing biological systems of which some a priori knowledge exists [12], [18], [19]. in this context, it is necessary to propose models that attempt to obtain good predictions while reducing the need for prior knowledge. we must also consider that these models should have the ability to easily incorporate the knowledge of experts in the field of genomics, as well as experimental information, thus complementing previous work with new developments. similarly, the steady states of the model should be analyzed in order to ensure a biologically acceptable description. in this work we propose the development of a model that integrates fuzzy inference with differential equations. we have chosen differential equations because they constitute a model that describes the regulatory processes with sufficient fidelity; we also incorporate fuzzy logic because of its ability to work with nonlinear systems. the proposed model can reduce the need for prior knowledge of the phenomenon thanks to the training of the anfis network, transforming it into a grey box model [1] that combines differential equations and network training. in addition, the proposed model makes it possible to express the fuzzy rules through linguistic labels, which gives the ability to incorporate expert knowledge in a friendly language.
1.1 biological background
a gene is active when it is able to synthesize one or more (depending on the organism) types of proteins, which can play a regulatory role in the expression of the same or other genes, and can also catalyze chemical reactions within a cell. therefore the function of a cell in an organism depends on the genes that are active, or in other words, on the expression of its genes. the process of synthesis of a protein consists of 2 phases. the first of them is known as transcription: at this stage the segment of dna that contains the information of the gene is transcribed into a string of messenger rna (mrna) by the enzyme called rna polymerase. the action of this enzyme is regulated by a series of molecules called transcription factors (tf), which use certain areas of dna, called cis-regulatory zones, that are specific to this end. then, once the mrna chain with the information for protein synthesis is available, a second stage called translation begins. here an internal organelle called a ribosome reads the information in the mrna chain and, together with the transfer rna (trna), links the amino acids needed to form the protein indicated by the information of the gene. this protein may regulate the expression of the same or other genes, and can also participate in the metabolic processes of the cell.
2 materials and methods
this section defines the model based on a fuzzy inference system (fis), also presenting the characteristics of the earlier models based on odes and their piecewise linear approximations, in order to compare all these models on a real system.
2.1 odes based models
these models are based on a series of ordinary differential equations that relate mrna molecules with the proteins they synthesize; the action of other molecules present in the regulation can also be incorporated.
usually this kind of differential equation has the form:

dx_i/dt = Σ_j α_ij f_ij(x_j) − γ_i x_i   (1)

where x_i represents a molecule produced in the process, α_ij is the production rate of molecule i due to molecule j, γ_i is the degradation rate of molecule i, and f_ij is a function that determines the interaction of molecule x_j with molecule x_i, called the regulation function. this is a nonlinear function, which provides realism from a biological point of view. it is generally defined as a function of sigmoidal type, commonly the hill function [3], [5]:

f_ij = h+_ij(x_j, θ_ij, m_ij) = x_j^m_ij / (θ_ij^m_ij + x_j^m_ij)   (2)

this equation shows that for values of x_j well over the threshold θ_ij the function tends to 1, whereas when x_j falls below the threshold θ_ij the function approaches 0, as seen in figure 1.a. the speed with which the function passes from 0 to 1 (while x_j varies) depends on the slope at the threshold point, which changes depending on the value of m_ij.
2.2 plde-step models
due to the nonlinear nature of the odes-based models, piecewise linear approximations have emerged that attempt to simplify the odes-based model into a set of linear models. such models are based on piecewise linear differential equations (pldes). the number of potential resulting linear models depends on the number of regulation functions to approximate and the number of linear segments in each approximation. a widely used approach approximates the regulation function f_ij to only 2 cases:

f_ij = s_ij(x_j, θ_ij) = { 1, if x_j > θ_ij; 0, if x_j ≤ θ_ij }   (3)

the regulation function is then approximated by a step function [1], so this model has been named piecewise linear differential equations-step (plde-step) [20]. in this case, the value of the threshold θ_ij is the only parameter to estimate for each regulation function. the curve is shown in figure 1.b.
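the regulation functions compared in this section can be sketched in a few lines; the sketch below (python, rather than the matlab used later in the paper) covers the hill function of (2), the step approximation of (3), and the logoid approximation discussed in the next subsection, whose continuous-ramp constants are written in the standard form and are therefore an assumption on the exact expression of (4).

```python
# theta is the threshold, m the hill steepness, delta the half-width of the
# logoid ramp; all three functions map a concentration to a value in [0, 1].

def hill(x, theta, m):
    # h+(x, theta, m) = x^m / (theta^m + x^m); equals 0.5 exactly at x = theta
    return x ** m / (theta ** m + x ** m)

def step(x, theta):
    # s(x, theta): 1 above the threshold, 0 otherwise
    return 1.0 if x > theta else 0.0

def logoid(x, theta, delta):
    # linear ramp from 0 to 1 over (theta - delta, theta + delta],
    # continuous at both breakpoints
    if x > theta + delta:
        return 1.0
    if x <= theta - delta:
        return 0.0
    return (x - theta) / (2.0 * delta) + 0.5
```

for instance, all three functions agree that a concentration far above the threshold gives (approximately) 1 and one far below gives (approximately) 0; they differ only in how the transition is shaped.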
2.3 plde-logoid models
in addition to the step function, there are other functions for approximating a nonlinear model by a piecewise linear one. thus, we find in the literature approaches that use the ramp function as part of the linear segments of the model [5]. this approximation of the regulation function is also known as a logoid function [3], and such models are called piecewise linear differential equations-logoid (plde-logoid) models [20]. in this case, the curve takes the form shown in figure 1.c, and the regulation function is defined as:

f_ij = l_ij(x_j, θ_ij, δ_ij) = { 1, if x_j > θ_ij + δ_ij; (x_j − θ_ij)/(2δ_ij) + 1/2, if θ_ij − δ_ij < x_j ≤ θ_ij + δ_ij; 0, if x_j ≤ θ_ij − δ_ij }   (4)

where the new parameter δ_ij sets the half-width of the segment over which the function ramps from 0 to 1; the slope of the ramp in that segment is 1/(2δ_ij). as shown in (4), there are 3 possible cases for every regulation function, which increases the number of potential linear differential equations to solve compared to the plde-step model; however, this also increases the accuracy of the approximation.
figure 1: different regulation functions: a) the hill function, b) step function and c) logoid function.
2.4 proposed model
in this case each regulation function is approximated by a fuzzy inference system (fis) of the takagi-sugeno type. this fis is capable of representing the nonlinear behavior of the regulation function, making it possible to define linguistic labels for the concentrations of molecules. moreover, one can assume the production rates to be unknown and include their action within the fis, which diminishes the prior knowledge required for modeling. thus, the differential equations take the form:

dx_i/dt = Σ_j fis_ij(x_j) − γ_i x_i   (5)

comparing (5) to (1), we observe that the term α_ij f_ij has been replaced by the fuzzy inference system fis_ij, i.e.
not only approximating the regulation function but also including the production rates. for the design of the fis, the characteristics of the membership functions, their number, the fuzzy rules, and the output functions should mainly be considered. this implies that when designing the fuzzy inference system a good knowledge of the phenomenon is required, including aspects such as the ranges of concentrations, the fuzzy rules, and so on. that is why we use a training network called anfis [16] to obtain a fuzzy system that approximates the odes-based model. this network is trained with experimental data and is capable of adapting, using a hybrid learning algorithm, the characteristics of the membership functions and output functions so as to reduce the error between the experimental data and the data generated by the fis. figure 2 shows a fuzzy inference system with 2 inputs x and y, 2 membership functions for each input (a1, a2, b1 and b2), 2 fuzzy rules whose consequents are f1 and f2, and 1 output f. among the most important factors to be considered for the training of the network are the type and number of membership functions, the maximum number of training epochs and the error goal. we must also consider that the training data must provide sufficient information to model the dynamics of the system. the training network allows the phenomenological model to be complemented with the information obtained from the experimental data; consequently, the model is considered a hybrid model or grey box model. in this work the proposed model is called the ode-fis model due to these characteristics.
2.5 implementation in a real system
all models are compared by representing the lac operon in e. coli. the lac operon is a very well studied process in the bacterium escherichia coli [10], [22], [24], and although it broadly appears simple, in reality it can be modeled in such detail that more than 100 biochemical reactions can be included.
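the forward pass of a takagi-sugeno system with the structure of figure 2 can be sketched as follows. all numeric parameters here (membership centres, widths, consequent coefficients) are made up for illustration; in anfis they would be tuned by the hybrid learning algorithm.

```python
import math

def gauss(x, c, s):
    # gaussian membership degree with centre c and width s
    return math.exp(-((x - c) ** 2) / (2 * s ** 2))

def ts_output(x, y):
    # layer 1: membership degrees for A1, A2 (input x) and B1, B2 (input y)
    a1, a2 = gauss(x, 0.0, 1.0), gauss(x, 1.0, 1.0)
    b1, b2 = gauss(y, 0.0, 1.0), gauss(y, 1.0, 1.0)
    # layer 2: rule firing strengths (product t-norm)
    w1, w2 = a1 * b1, a2 * b2
    # consequents f_i = p_i*x + q_i*y + r_i (hypothetical linear parameters)
    f1 = 0.5 * x + 0.2 * y + 0.1
    f2 = 1.5 * x - 0.3 * y + 0.4
    # layers 3-5: normalise firing strengths and combine the consequents
    return (w1 * f1 + w2 * f2) / (w1 + w2)
```

the output is always a convex combination of the rule consequents, which is what makes the adaptation of the consequent parameters a (locally) linear estimation problem in the hybrid learning algorithm.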
in this paper we use the model of the lac operon shown in [24].
figure 2: structure of an anfis network: a) takagi-sugeno type fuzzy inference system, b) anfis representation of the system.
the description of the process is as follows. the main source of carbon for the bacterium e. coli is glucose. when glucose is not present in the cell's environment, the bacterium is able to form glucose from lactose. for this, there is a regulatory mechanism that allows the synthesis of the enzymes necessary to obtain glucose. this mechanism is called the lac operon. the model describes the synthesis of glucose from lactose by the bacterium e. coli. this is due to the fact that in the absence of glucose, but in the presence of lactose, the bacterium activates the synthesis of β-galactosidase and permeases. β-galactosidase breaks down lactose into allolactose, glucose and galactose, allolactose being the inducer in the operon regulation. moreover, permeases allow the passage of external lactose into the cell. an outline of the system as used in [7] is shown in figure 3.
figure 3: simplified outline of the lac operon in e. coli
the model presented in [24] is based on 5 nonlinear differential equations, in which we can find 6 hill functions. these equations describe the production of mrna, β-galactosidase, allolactose, lactose and permeases, also allowing the external lactose and the feed phosphate to be manipulated. since the model used is based on hill functions, for comparison purposes it must be approximated by the plde and ode-fis models. in the case of the plde-step model we use the same threshold values presented in the odes-based model, and therefore no new parameter design is required. in the case of the plde-logoid model the thresholds do not vary; however, we must design the value δ_ij shown in (4).
for this, the coefficient k must be calculated such that δ_ij = k·θ_ij, choosing the k that yields the lowest steady-state error with respect to the ode model. in the case of the ode-fis model we train the 6 regulation functions with data obtained from the model shown in [24], whose inputs correspond to states of the system. to cover a wide range of training conditions we reduce the external lactose concentration from 0.08 mm to 0 mm through 4 negative steps, which in turn allows more information on the dynamics of the system to be obtained. the structure of the model corresponds to 3 membership functions per input, each of the gaussian type, labeled low, medium and high, referring to the concentration level of each input. the training epochs are 80 and the error goal is 0. for comparison purposes we reproduce the 2 experiments shown in [24] using the 4 models described previously. in each case, both the expression of β-galactosidase and that of the permeases are plotted. the first of these experiments consists of monitoring the changing states of the system in time for a given set of initial conditions, where the external lactose level and the phosphate feeding rate are kept constant. in the second experiment we maintain a constant level of external lactose while changing the phosphate feeding periodically. for the comparison of β-galactosidase we also have the experimental data of [9] and [14] for experiment 1 and the data from [6] for experiment 2, which were provided by the authors of [24]. to compare, on a quantitative basis, the approximations of the plde and ode-fis models, we have replicated experiment 1 and built a table with the steady state error (sse) and the integrated square error (ise) with respect to the odes-based model, considering a simulation time of 500.
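the two comparison metrics can be sketched as follows (our own python formulation; the paper computed them in matlab): the sse compares the final values of the two trajectories, and the ise integrates the squared deviation over the whole simulation.

```python
def sse(reference, approximation):
    # steady-state error: absolute difference of the final sampled values
    return abs(reference[-1] - approximation[-1])

def ise(t, reference, approximation):
    # integrated square error via trapezoidal integration over the time grid t
    total = 0.0
    for k in range(1, len(t)):
        e0 = (reference[k - 1] - approximation[k - 1]) ** 2
        e1 = (reference[k] - approximation[k]) ** 2
        total += 0.5 * (e0 + e1) * (t[k] - t[k - 1])
    return total
```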
an experiment was also implemented to assess whether the training of the anfis network was able to capture the main equilibrium points of the system. thus, we performed a sweep sampling of the values of the external lactose and of the initial conditions to see whether the ode-fis model has the same equilibrium points as the odes model. all the experimental work was developed in the matlab software, using primarily the simulink, fuzzy logic and optimization toolboxes.
3 results
in the work of yildirim and mackey [24] the expression of β-galactosidase for two types of experiments is presented, and it is reproduced here for all models. the normalized profile of β-galactosidase is shown in figure 4 for the odes-based model, the plde-step model, the plde-logoid model, the ode-fis model, and the experimental data of [9] and [14]. it can be noted that all models evolve to the same steady state except plde-step, which clearly does not represent the profile of β-galactosidase adequately. in addition to β-galactosidase we present the dynamics of the permeases, normalized and shown in figure 5. once again the poor performance of the plde-step model is repeated; we also stress that the ode-fis model even tracks the overshoot. figure 6 shows the normalized profile of β-galactosidase for all models, in addition to the experimental data of [6]. this figure shows the similarity between the experimental data and the ode and ode-fis models, but not the plde models (which have a lower yield), particularly the plde-step model. for experiment 2, the permease level is followed properly by the ode-fis model, as shown in figure 7. again, the plde models show poor performance. in order to determine which of the approximations better represents the model based on odes, we obtained error rates by simulating experiment 1. table 1 shows the steady state error (sse) and the integrated square error (ise) for a simulation time of 500.
figure 4: comparison of the dynamics of β-galactosidase for the different models under the conditions of experiment 1.
figure 5: comparison of the dynamics of permeases for the different models under the conditions of experiment 1.
figure 6: comparison of the dynamics of β-galactosidase for the different models under the conditions of experiment 2.
figure 7: comparison of the dynamics of permeases for the different models under the conditions of experiment 2.

table 1: steady state error with respect to the odes model

                      sse                        ise
                β-gal        permease      β-gal        permease
  plde-step     7.300 e-4    1.503 e-2     2.272 e-4    1.100 e-1
  plde-logoid   2.739 e-8    3.349 e-7     1.030 e-6    9.080 e-4
  ode-fis       5.257 e-7    1.080 e-5     3.314 e-9    6.677 e-6

to demonstrate the use of rules and linguistic labels we mention the case of a term describing the behavior of allolactose, which depends on 2 inputs: internal lactose (l) and β-galactosidase (b). given that we define 3 fuzzy sets (low, medium, high), there are 8 possible rules connecting the 2 inputs. as an example we mention 3 rules that were obtained from the training process:
r1: if l is high and b is low then the influence of the term is null
r2: if l is medium and b is low then the influence of the term is null
r3: if l is low and b is low then the influence of the term is null
the label null is associated with the output function of the takagi-sugeno system, and corresponds to a value of 0. when analyzing these rules we observe that if the concentration of β-galactosidase is low, the associated term does not influence the production of allolactose, which is consistent with reality.
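the firing of the common antecedent of the three trained rules quoted above ("b is low") can be sketched with hypothetical gaussian membership functions on a normalised 0..1 concentration scale; the centres and width below are illustrative, not taken from the trained system.

```python
import math

def gauss(x, c, s):
    return math.exp(-((x - c) ** 2) / (2 * s ** 2))

# hypothetical label centres on a normalised 0..1 concentration scale
LABELS = {"low": 0.0, "medium": 0.5, "high": 1.0}

def membership(label, x, s=0.2):
    # degree to which concentration x belongs to the labelled fuzzy set
    return gauss(x, LABELS[label], s)

def null_rule_strength(b):
    """firing strength of the merged rule: if b is low then the term is null."""
    return membership("low", b)
```

with these membership functions, the "null" consequent fires strongly whenever the β-galactosidase concentration is near zero, regardless of the lactose level, which is exactly why the three rules collapse into one.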
since these 3 rules depend mainly on the value of b, they can be edited and replaced by a single rule of the form:
r: if b is low then the influence of the term is null
in addition to editing rules, one can edit the features of the membership functions. the matlab toolbox allows the membership functions to be edited graphically, without the need for a detailed mathematical knowledge of these functions. in addition, using simulink one can see the level of activation of each rule and the degree of membership of the states in the fuzzy sets of the inference systems, all while the simulation takes place.
figure 8: equilibrium points of the odes model and their comparison with the equilibrium points found for the ode-fis model.
with regard to the stability of the ode-fis model, figure 8 shows the equilibrium points of allolactose as a function of the external lactose level, as shown in [24]. additionally the figure also shows the steady states of the ode-fis model. most of the equilibrium points of the odes model are also equilibrium points of the ode-fis model. it must be noticed that for each value of the external lactose in the ode-fis model we do not see more than 1 equilibrium point; on the contrary, in the odes model we find up to 3 points.
4 summary and conclusions
the flexibility delivered by fuzzy logic and the training capacity provided by anfis made it possible to represent the nonlinear system behaviour with enough similarity to that obtained with the odes, showing a better performance than the plde models, even describing the dynamic transients of the interacting molecules without problems. it should be mentioned that the plde models can be enhanced with optimization techniques and/or artificial intelligence tools to better design the parameters θ_ij and δ_ij as needed; however, this requires additional algorithms which are not necessarily trivial.
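the equilibrium sweep behind figure 8 can be sketched generically: integrate from many initial conditions until the trajectory settles and collect the distinct steady states. the sketch below (our own, with forward euler and a toy one-dimensional system standing in for the full grn model) only illustrates the procedure; the actual sweep was run on the lac operon models in simulink.

```python
def steady_state(f, x0, dt=0.01, steps=20000):
    # forward-euler integration of dx/dt = f(x) until (assumed) convergence
    x = x0
    for _ in range(steps):
        x += dt * f(x)
    return x

def sweep(f, initial_conditions, tol=1e-2):
    # collect the distinct steady states reached from the sampled conditions
    found = []
    for x0 in initial_conditions:
        xs = steady_state(f, x0)
        if not any(abs(xs - y) < tol for y in found):
            found.append(xs)
    return found

# toy stand-in with stable equilibria at 0 and 2 and an unstable one at 1
def toy(x):
    return -x * (x - 1.0) * (x - 2.0)
```

sweeping the toy system from initial conditions on both sides of the unstable point recovers exactly its two stable equilibria, which is the kind of agreement checked between the odes and ode-fis models.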
the anfis network made it possible to obtain from the training data all the information needed to describe not only the transient state of the performed experiments but also a large number of the stable points of the odes. this shows the great training capacity of the network and the flexibility of the fuzzy inference system. fuzzy logic has proved to be an important tool due to its ability to represent nonlinear systems, its friendly language for expressing knowledge and the ability to incorporate and edit fuzzy rules. in addition, complementing fuzzy logic with an artificial neural network for training (anfis) turns out to be a powerful tool for obtaining knowledge from experimental data, suggesting the development of new techniques based on fuzzy logic as well as on training networks and differential equations. the natural next step is to work with larger regulatory networks, addressing the criteria for modeling them in such a way as to obtain an acceptable representation of the biological phenomena without compromising the viability of the computational implementation, while striving to maintain a simple and understandable language that allows systems analysis from a qualitative perspective.
bibliography
[1] g. acuña, e. cubillos, development of a matlab toolbox for the design of grey-box neural models, international journal of computers, communications and control, vol. 1, pp. 7-14, 2006.
[2] d. akçay, inference of switching networks by using a piecewise linear formulation, institute of applied mathematics, metu, msc thesis, 2005.
[3] f. a. cubillos, g. acuña, e. l. lima, real-time process optimization based on grey-box neural models, brazilian journal of chemical engineering, vol. 24, pp. 433-443, 2007.
[4] h. de jong, modeling and simulation of genetic regulatory systems: a literature review, journal of computational biology, vol. 9, pp. 67-103, 2002.
[5] n. friedman, m. linial, i. nachman, and d.
pe'er, using bayesian networks to analyze expression data, journal of computational biology, vol. 7, pp. 601-620, 2000.
[6] j. gebert, n. radde, g. weber, modeling gene regulatory networks with piecewise linear differential equations, european journal of operational research, in press, 2006.
[7] b. c. goodwin, oscillatory behaviour in enzymatic control processes, adv. enz. regul., vol. 3, pp. 425-438, 1969.
[8] a. halász, v. kumar, m. imielinski, c. belta, o. sokolsky, s. pathak, h. rubin, analysis of lactose metabolism in e. coli using reachability analysis of hybrid systems, iet systems biology, vol. 1, pp. 130-148, 2007.
[9] s. kim, j. kim, c. kwang-hyun, inferring gene regulatory networks from temporal expression profiles under time-delay and noise, computational biology and chemistry, vol. 31, pp. 239-245, 2007.
[10] w. a. knorre, oscillation of the rate of synthesis of β-galactosidase in escherichia coli ml 30 and ml 308, biochem. biophys. res. commun., vol. 31, pp. 812-817, 1968.
[11] a. kremling, k. bettenbrock, b. laube, k. jahreis, w. lengeler, e. gilles, the organization of metabolic reaction networks iii. application for diauxic growth on glucose and lactose, metabolic engineering, vol. 3, pp. 362-379, 2001.
[12] r. linden, a. bhaya, evolving fuzzy rules to model gene expression, biosystems, vol. 88, pp. 76-91, 2007.
[13] b. lee, j. yen, l. yang, j. liao, incorporating qualitative knowledge in enzyme kinetic models using fuzzy logic, biotechnology and bioengineering, vol. 62, pp. 722-729, 1999.
[14] h. mcadams, l. shapiro, circuit simulation of genetic networks, science, vol. 269, pp. 650-656, 1995.
[15] s. pestka, b. l. daugherty, v. jung, k. hotta, r. k. pestka, anti-mrna: specific inhibition of translation of single mrna molecules, proc. natl. acad. sci. usa, vol. 81, pp. 7525-7528, 1984.
[16] h. ressom, p. natarajan, r. s. varghese, and m. t.
Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol.
V (2010), No. 5, pp. 968-975

Robust 2-DOF PID Control for Congestion Control of TCP/IP Networks

R. Vilanova, V. M. Alfaro

Ramón Vilanova
Departament de Telecomunicació i Enginyeria de Sistemes, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain. E-mail: ramon.vilanova@uab.cat

Víctor M. Alfaro
Escuela de Ingeniería Eléctrica, Universidad de Costa Rica, San José, 11501-2060, Costa Rica. E-mail: victor.alfaro@ucr.ac.cr

Abstract: This paper presents how robust PID control can improve the performance of congestion control on TCP/IP networks. The proposed approach is compared with other control methods, such as PI control or RED/AQM, showing the advantages of the proposed technique.
Keywords: congestion control, active queue management, PID control.

1 Introduction

Internet congestion control and congestion avoidance have been active research topics in networking (see, for example, [1] [2] [3]) during the last two decades. Congestion control has two components: (1) the end-to-end congestion control protocol, such as TCP [4], and (2) an active queue management (AQM) scheme implemented in routers. AQM signals congestion by discarding or marking packets; when TCP detects congestion, it takes action to reduce the source sending rate. The usual AQM objectives are to stabilize the buffer queue length at a given target, thereby achieving a predictable queueing delay, and to minimize the occurrences of queue overflow and underflow, thus reducing packet loss and maximizing link utilization. It is therefore necessary to alleviate congestion as much as possible. At present there are two methodologies to deal with this issue [5]: congestion control, which acts after the network is overloaded, and congestion avoidance, which takes action before the problem appears. This paper deals with congestion control because it is where feedback control techniques can be most directly applied.
Recently, several mathematical models of active queue management (AQM) schemes supporting transmission control protocol (TCP) flows in communication networks have been proposed [2] [3]. From these models, a control-theoretic approach can be used to analyze or to design AQM schemes. The best-known AQM scheme is probably RED [1]. RED can detect and respond to long-term traffic patterns, but it cannot detect congestion caused by short-term traffic load changes. In addition, it is well known that an appropriate tuning of the RED parameters is not an easy task and may result in a non-stabilizing control scheme. This fact has motivated the search for alternative control approaches. This paper presents the application of a robust PID approach as an AQM controller. The controller can be easily tuned from the network parameters, and it is compared with RED and the PI controller proposed in [3]. The performance under different load conditions shows the robustness and superiority of the presented approach. In addition, the simple formulation of the PID controller also constitutes a motivation for implementation.

Copyright © 2006-2010 by CCC Publications

The rest of the paper is organized as follows. The next section presents the nonlinear model for a TCP router as well as the TCP control problem formulation. Section 3 reviews the RED and PI controller approaches to AQM control, whereas Section 4 presents the robust 2-DOF PID approach. In Section 5 a discussion and comparison is conducted. The paper ends by drawing some conclusions on the reported results.

2 AQM Router Dynamic Model and Control Problem Statement

In this section the dynamic nonlinear/linearized equations of TCP behavior developed in [3] are briefly reviewed, and the purposes of AQM control are stated.
2.1 Dynamic TCP Model

As is common in the literature, a network configuration consisting of a single congested router with a transmission capacity C is considered in this paper. TCP timeout mechanisms have been ignored for simplification. Using fluid-flow and stochastic differential equation analysis, the following coupled, nonlinear differential equations have been proposed as a dynamic model of TCP behavior:

\dot{W}(t) = \frac{1}{R(t)} - \frac{W(t)}{2} \frac{W(t-R(t))}{R(t-R(t))}\, p(t-R(t))    (2.1)

\dot{q}(t) = \begin{cases} -C + \frac{N(t)}{R(t)} W(t), & q(t) > 0 \\ \max\left\{ 0,\; -C + \frac{N(t)}{R(t)} W(t) \right\}, & q(t) = 0 \end{cases}

where
W = average TCP window size (packets)
q = average queue length (packets)
R(t) = round-trip time = q(t)/C + T_p (secs)
C = link capacity
T_p = propagation delay (secs)
N = number of active TCP sessions
p = probability of packet marking

The queue length q and window size W are positive, bounded quantities, i.e., q ∈ [0, q̄] and W ∈ [0, W̄], where q̄ and W̄ denote the buffer capacity and the maximum window size, respectively. In this formulation, the congestion window size W(t) is increased by one every round-trip time if no congestion is detected, and is halved upon congestion detection. Moreover, it has been assumed that the AQM scheme implemented at the router marks packets using explicit congestion notification (ECN) to inform the TCP sources of impending congestion. To linearize (2.1), it is assumed that the number of active TCP sessions and the link capacity are time-invariant, i.e., N(t) ≡ N and C(t) ≡ C. In addition, the dependence of the time-delay argument t − R on the queue length q is ignored, and it is assumed to be fixed at t − R_o. Local linearization of (2.1) about the operating point then results in the following equations:
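To make the interplay of the terms in (2.1) concrete, the model can be integrated numerically. The following is a minimal forward-Euler sketch, not the authors' simulator; the constant marking probability `p`, the step size and the horizon are illustrative assumptions.

```python
# Minimal forward-Euler integration of the TCP/AQM fluid model (2.1).
# W: average window (packets), q: queue length (packets).
# N, C, Tp and the constant marking probability p are illustrative values.
def simulate_tcp_fluid(N=60, C=3750.0, Tp=0.2, p=0.008, t_end=20.0, dt=1e-3):
    W, q = 1.0, 0.0
    hist_W, hist_q = [], []
    n_steps = int(round(t_end / dt))
    # Histories of W and R, needed for the terms delayed by one round-trip time.
    Ws = [W]
    Rs = [q / C + Tp]
    for k in range(n_steps):
        R = q / C + Tp                       # round-trip time R(t) = q(t)/C + Tp
        kd = max(0, k - int(R / dt))         # index of the sample delayed by R(t)
        Wd, Rd = Ws[kd], Rs[kd]
        # Additive increase (1/R) and multiplicative decrease on marked packets.
        dW = 1.0 / R - (W / 2.0) * (Wd / Rd) * p
        if q > 0:
            dq = -C + (N / R) * W
        else:
            dq = max(0.0, -C + (N / R) * W)
        W = max(W + dt * dW, 0.0)
        q = max(q + dt * dq, 0.0)
        Ws.append(W)
        Rs.append(q / C + Tp)
        hist_W.append(W)
        hist_q.append(q)
    return hist_W, hist_q
```

With a constant p the window settles near the AIMD equilibrium W² ≈ 2/p, which for p = 0.008 is about 15.8 packets, consistent with the operating point used later in the paper.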
Figure 1: AQM block diagram of the linearized model along with high-frequency uncertain dynamics

\delta\dot{W}(t) = -\frac{N}{R_o^2 C}\left(\delta W(t) + \delta W(t-R_o)\right) - \frac{1}{R_o^2 C}\left(\delta q(t) - \delta q(t-R_o)\right) - \frac{R_o C^2}{2N^2}\, \delta p(t-R_o)    (2.2)

\delta\dot{q}(t) = \frac{N}{R_o}\, \delta W(t) - \frac{1}{R_o}\, \delta q(t)

where δW(t) = W(t) − W_o and δq(t) = q(t) − q_o are the incremental variables with respect to an operating point. The operating point for a desired equilibrium queue length q_o is given by:

R_o = \frac{q_o}{C} + T_p, \qquad W_o = \frac{R_o C}{N}, \qquad p_o = \frac{2}{W_o^2}    (2.3)

This leads to a low-order nominal model of the network dynamics that is accurate at a particular operating point (R_o, q_o, C_o, W_o, p_o), given by:

P(s) = P_o(s)\, e^{-R_o s}, \qquad P_o(s) = \frac{C^2/(2N)}{\left(s + \frac{2N}{R_o^2 C}\right)\left(s + \frac{1}{R_o}\right)}    (2.4)

The high-frequency dynamics are modelled by a block Δ(s) such that

\Delta(s) = \frac{2N^2}{R_o^3 C^3}\, s \left(1 - e^{-R_o s}\right)    (2.5)

This term represents the high-frequency, necessarily parasitic, network uncertainty in the model. Computational experience has shown that it can adequately capture certain deviations from nominal network performance. These considerations lead to the simplified feedback control system shown in Fig. 1.

2.2 AQM Control Problem

The function of an AQM control law is to mark packets (with probability p) as a function of the measured queue length q. The marking of packets is subsequently used by the sender to throttle the amount of data sent: if no marked packets are received, the window size is increased; upon reception of a marked packet, the window size is halved. The principal performance objectives for an AQM control law are:

1. Efficient queue utilization: to avoid overflow or emptiness of the queue buffer.
2. Regulated queueing delay: to minimize (optimize) the time required for a data packet to be serviced by the routing queue. The queueing delay is equal to q/C.
3. Robustness: to maintain closed-loop performance in spite of uncertainties in the plant parameters N, R_o and C.
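As a quick numerical check of (2.3) and (2.4), the operating point and the corner frequencies of the nominal model can be computed directly from the network parameters. The sketch below is illustrative (the function names are ours); the values correspond to the simulation scenario used later in Section 5 (q_o = 175 packets, T_p = 0.2 s, C = 3750 packets/s, N = 60).

```python
# Operating point (2.3) and nominal-model data (2.4) of the linearized TCP/AQM model.
def operating_point(qo, C, Tp, N):
    Ro = qo / C + Tp          # equilibrium round-trip time (s)
    Wo = Ro * C / N           # equilibrium window size (packets)
    po = 2.0 / Wo ** 2        # equilibrium marking probability
    return Ro, Wo, po

def nominal_model(Ro, C, N):
    gain = C ** 2 / (2.0 * N)          # numerator of P_o(s)
    pole1 = 2.0 * N / (Ro ** 2 * C)    # slow pole location
    pole2 = 1.0 / Ro                   # fast pole location
    return gain, pole1, pole2

Ro, Wo, po = operating_point(qo=175, C=3750.0, Tp=0.2, N=60)
gain, p1, p2 = nominal_model(Ro, C=3750.0, N=60)
print(round(Ro, 4), round(Wo, 2), round(po, 4))    # ≈ 0.2467 15.42 0.0084
print(round(gain, 1), round(p1, 3), round(p2, 3))  # ≈ 117187.5 0.526 4.054
```

These values agree, up to rounding, with the R_o = 0.246, W_o = 15, p_o = 0.008 and the poles 0.53 and 4.1 reported in the numerical case of Section 5.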
3 RED and PI Approaches to AQM

This section reviews two of the most well-established approaches to AQM control, namely RED and the PI approach presented in [3].

3.1 RED Approach to AQM

Random Early Detection (RED) was presented in [1]. A RED gateway calculates the average queue size using a low-pass filter with an exponentially weighted moving average. The average queue size is compared to two thresholds (minimum and maximum). When the average queue size is less than the minimum threshold, no packets are marked. When the average queue size is greater than the maximum threshold, every arriving packet is marked. If marked packets are in fact dropped, or if all source nodes are cooperative, this ensures that the average queue size does not significantly exceed the maximum threshold. When the average queue size is between the minimum and the maximum threshold, each arriving packet is marked with probability p, where p is a function of the measured queue length q. Hollot et al. [3] proposed the following transfer function model for the RED controller:

C_{red}(s) = \frac{K L_{red}}{s + K} = \frac{L_{red}}{s/K + 1}    (3.1)

where

L_{red} = \frac{p_{max}}{max_{th} - min_{th}}, \qquad K = -C \ln(1 - \alpha_{red})    (3.2)

and α_red is RED's queue-averaging weight. The corresponding controller block diagram is shown in Figure 2.

Figure 2: Block diagram of RED as a cascade of a low-pass filter and a nonlinear gain element.

3.2 PI Approach to AQM

According to [3], the transfer function of a PI controller can be written as:

C_{PI}(s) = k_{PI}\, \frac{s/z + 1}{s}    (3.3)

This controller is very well known in the control community, and its parameters can be tuned following methods proposed in the control literature. For example, [3] gives guidelines based on the Bode-diagram tuning technique:

z = \frac{2N}{R_o^2 C}, \qquad k_{PI} = \frac{\omega_g\, z\, \left| j\omega_g + 1/R_o \right|}{C^2/(2N)}, \qquad \omega_g = \frac{\beta}{R_o}    (3.4)

where β determines the phase margin of the resulting nominal control system.

Figure 3: 2-DOF control system
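The PI tuning rule (3.4) is easy to evaluate numerically. The sketch below is illustrative; the value of β is our own assumption (it only sets the crossover ω_g = β/R_o), chosen to show that z ≈ 0.53 and a k_PI of the reported magnitude follow directly from the network parameters.

```python
# PI tuning rule (3.4): zero placement on the slow plant pole and gain chosen
# for unit loop gain at the crossover w_g = beta/Ro. The beta value used below
# is an illustrative assumption, not the one used in [3].
def pi_tuning(N, C, Ro, beta):
    z = 2.0 * N / (Ro ** 2 * C)      # controller zero cancels the slow pole
    wg = beta / Ro                   # gain-crossover frequency (rad/s)
    k_pi = wg * z * abs(1j * wg + 1.0 / Ro) / (C ** 2 / (2.0 * N))
    return z, wg, k_pi

z, wg, k_pi = pi_tuning(N=60, C=3750.0, Ro=0.246, beta=0.13)
print(round(z, 2))   # ≈ 0.53, the zero location reported in Section 5
```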
The PI controller proposed in [3] is designed for a phase margin of about 30°.

4 Robust PID Control

This section briefly presents the approach for Analytical Robust Tuning (ART) of 2-DOF PID controllers recently presented in [6]. Consider the two-degree-of-freedom (2-DOF) control system of Fig. 3, where P(s) is the controlled process transfer function, C_r(s) the set-point controller transfer function, C_y(s) the feedback controller transfer function, r(s) the set-point, d(s) the load disturbance, and y(s) the controlled variable. The output of the 2-DOF controller is given by

u(s) = C_r(s)\, r(s) - C_y(s)\, y(s)    (4.1)

For a PID2 controller [7] this is

u(s) = K_c \left( \beta + \frac{1}{T_i s} \right) r(s) - K_c \left( 1 + \frac{1}{T_i s} + T_d s \right) y(s)    (4.2)

where K_c is the controller gain, T_i the integral time constant, T_d the derivative time constant, and β the set-point weighting factor (0 ≤ β ≤ 1). We start from a second-order-plus-dead-time (SOPDT) model of the form

P(s) = \frac{K_p\, e^{-L'' s}}{(T'' s + 1)(a T'' s + 1)}, \qquad \tau_o = \frac{L''}{T''}    (4.3)

The PID controller parameters are determined by the following equations for processes with parameters in the ranges 0.1 ≤ τ_o ≤ 1.0 and 0.15 ≤ a ≤ 1.0:

\kappa_c = \frac{10\tau_i}{21\tau_c + 10\tau_o - 10\tau_i}    (4.4)

\tau_i = \frac{(21\tau_c + 10\tau_o)\left[(1+a)\tau_o + a\right] - \tau_c^2(\tau_c + 12\tau_o)}{10(1+a)\tau_o + 10a + 10\tau_o^2}    (4.5)

\tau_d = \frac{12\tau_c^2 + 10\tau_i\tau_o - (1+a)(21\tau_c + 10\tau_o - 10\tau_i)}{10\tau_i}    (4.6)

\beta = \min\left\{ \frac{1}{\kappa_c},\; \frac{\tau_c T''}{\tau_i},\; 1 \right\}    (4.7)

The controller normalized parameters κ_c, τ_i, τ_d and β depend on the model normalized dead-time τ_o, the time-constant ratio a, and the design parameter τ_c.
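The 2-DOF control law (4.2) can be realized in discrete time. The following backward-difference implementation is a minimal sketch under an assumed sampling period h; it is not the authors' implementation. As in (4.2), the derivative acts on the measurement only.

```python
# Discrete 2-DOF PID realizing (4.2):
#   u = Kc*(beta*r - y) + (Kc/Ti) * integral(r - y) - Kc*Td * dy/dt
# Backward-difference approximation; h is an assumed sampling period.
class TwoDofPid:
    def __init__(self, Kc, Ti, Td, beta, h):
        self.Kc, self.Ti, self.Td, self.beta, self.h = Kc, Ti, Td, beta, h
        self.integral = 0.0
        self.y_prev = None

    def update(self, r, y):
        self.integral += self.h * (r - y)       # integral of the error
        dy = 0.0 if self.y_prev is None else (y - self.y_prev) / self.h
        self.y_prev = y
        return (self.Kc * (self.beta * r - y)
                + self.Kc / self.Ti * self.integral
                - self.Kc * self.Td * dy)
```

Note that the set-point weight β only scales r in the proportional term; the integral and derivative terms are unchanged, which is exactly the structure of (4.2).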
A minimum system-robustness level is incorporated into the design process by estimating a recommended maximum speed (τ_cmin) of the resulting closed-loop control system, parameterized in terms of the maximum sensitivity function M_s:

\tau_{c,min} = k_{11}(M_s) + k_{12}(M_s)\, a^{\,k_{13}(M_s)}    (4.8)

k_{11}(M_s) = 2.442 - 2.219 M_s + 0.515 M_s^2
k_{12}(M_s) = 10.518 - 8.990 M_s + 2.203 M_s^2
k_{13}(M_s) = 0.949 - 0.197 M_s

Combining the performance and robustness considerations above, the design parameter may be selected in the range τ_cmin ≤ τ_c ≤ 1.25 + 2.25a. The range limits thus combine the restriction that all controller parameters be positive and that a specified maximum sensitivity be achieved, with the requirement that the obtained response does not deviate too much from the desired response, owing to the dead-time approximation used in obtaining the tuning equations. For a more detailed presentation and discussion of the method, see [6].

5 Discussion

In order to illustrate the effectiveness of the robust 2-DOF PID method, a numerical case is presented, taking the network simulation parameters of [3]: q_o = 175 packets, T_p = 0.2 seconds, and C = 3750 packets/s (this corresponds to a 15 Mb/s link with an average packet size of 500 bytes). For a load of N = 60 TCP sessions we have W_o = 15 packets, p_o = 0.008, and R_o = 0.246. Therefore

P(s) = \frac{1.17126 \times 10^5}{(s + 0.53)(s + 4.1)}\, e^{-0.246 s}, \qquad \Delta(s) = 2.24 \times 10^{-6}\, s \left(1 - e^{-0.246 s}\right)    (5.1)

The corresponding RED controller parameters are (see [3]) K = 0.005 and L_red = 1.86 × 10^{-4}, whereas those of the PI controller are k_PI = 9.64 × 10^{-6} and z = 0.53. On the other hand, for the application of the ART method the parameters of the SOPDT model are needed. These turn out to be a = 0.1297, L'' = 0.2467 and T'' = 1.9014. The controller parameters then depend on the desired robustness level, expressed in terms of the M_s level.
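Using the coefficients of (4.8) as printed above, the recommended range for the design parameter τ_c can be computed directly; the sketch below evaluates it for the M_s and a values of this scenario and is an illustration, not code from the paper.

```python
# Recommended range of the design parameter tau_c: the lower limit follows
# equation (4.8), the upper limit is 1.25 + 2.25*a. Coefficients as printed
# in (4.8); the (Ms, a) values are those of the Section 5 scenario.
def tau_c_range(Ms, a):
    k11 = 2.442 - 2.219 * Ms + 0.515 * Ms ** 2
    k12 = 10.518 - 8.990 * Ms + 2.203 * Ms ** 2
    k13 = 0.949 - 0.197 * Ms
    tau_c_min = k11 + k12 * a ** k13    # robustness-limited fastest tuning
    tau_c_max = 1.25 + 2.25 * a         # upper limit of the recommended range
    return tau_c_min, tau_c_max

lo, hi = tau_c_range(Ms=1.6, a=0.1297)
print(round(lo, 2), round(hi, 2))  # → 0.7 1.54
```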
A minimum robustness level is assured by M_s = 2.0, whereas highly robust systems are designed with M_s = 1.4. Here we take an intermediate level, M_s = 1.6. The resulting 2-DOF PID controller parameters are K_c = 4.4241 × 10^{-5}, T_i = 2.1443, T_d = 0.3436 and β = 0.6172. Figure 4 shows the performance of the three presented control strategies applied to the nonlinear system (2.1). As can be seen, the RED controller cannot reach the new references; this is an inherent drawback because it lacks the corresponding integrator. Both the PI and ART-PID controllers, on the other hand, reach the desired targets. On the right part of Figure 4 it can be observed that the ART-PID controller reaches the desired set-point faster and without any overshoot.

Figure 4: Changes in the desired queue length. Comparison of PI, PID and RED performance (queue length and packet-marking probability versus time). The plots on the right side show a zoomed version to better compare PI and PID.

In addition, a test under different load conditions has been performed. In this case the number of TCP sessions is a signal of the form n(t) = N(1 + 0.01 sin(0.05t)) + v(t), where v(t) is a normally distributed (Gaussian) random number of zero mean and variance 0.01N. v(t) is assumed to change with a sampling time of 5 s. It can be seen that the ART-PID provides a faster response to the load variation and a faster recovery of the desired queue length. Table 1 shows the mean value and standard deviation of the queue length error computed with respect to the desired target q_o = 175 packets.
Since a lower variation of the queue length is achieved, a more predictable performance, and therefore a better QoS, can be expected.

Table 1: Mean and standard deviation of the queue length error

  Controller    RED     PI      ART
  Mean         -0.43   -0.27   -0.1
  Std          10.9     9.2     5.0

6 Conclusions

In this paper the suitability of applying robust PID controllers for the purpose of improving Internet congestion control has been presented. The main advantage of the ART PID tuning is its one-parameter tuning; in addition, this parameter is a direct specification of the desired robustness level, making the approach suitable for situations with changes in load and system parameters. The performance has been compared with that of RED and a PI controller previously proposed in the literature.

Acknowledgment

This work has received financial support from the Spanish CICYT program under grant DPI2007-63356. Support from the Universidad de Costa Rica is greatly appreciated.

Figure 5: Regulation performance for a desired queue length of q_o = 175 packets, facing random variations of the number of TCP sessions.

Bibliography

[1] S. Floyd and V. Jacobson, "Random early detection gateways for congestion avoidance," IEEE/ACM Transactions on Networking, vol. 1, pp. 397-413, 1993.
[2] F. Kelly, "Mathematical modeling of the Internet," in Mathematics Unlimited - 2001 and Beyond, B. Enquist and W. Schmid, Eds., Berlin, Germany: Springer-Verlag, 2001.
[3] C. Hollot, V. Misra, D. Towsley, and W. Gong, "Analysis and design of controllers for AQM routers supporting TCP flows," IEEE Trans. Automat. Contr., vol. 47, pp. 945-959, 2002.
[4] J. Postel, "Transmission control protocol," RFC 793, 1981.
[5] S. Ryu, C. Rump, and C.
Qiao, "Advances in active queue management (AQM) based TCP congestion control," Telecommunication Systems, vol. 25, pp. 317-351, 2004.
[6] V. Alfaro, R. Vilanova, and O. Arrieta, "A single-parameter robust tuning approach for two-degree-of-freedom PID controllers," The European Control Conference, Budapest, Hungary, August 23-26, 2009.
[7] K. Åström and T. Hägglund, Advanced PID Control. ISA - The Instrumentation, Systems, and Automation Society, 2006.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 1, pp. 66-73

Development of an Algorithm for Groupware Modeling for a Collaborative Learning

Ikuo Kitagaki, Atsushi Hikita, Makoto Takeya, Yasuhiro Fujihara

Abstract: This paper reports an algorithm for forming groups of students, with regard to a computer system for collaborative learning designed to give a cue for debate, utilizing mobile terminals. With this system, questionnaires to be used as seeds for debate are prepared in advance on the web, and all students attending the class answer the questionnaires on the web through their mobile terminals. The computer then assigns the students to appropriate groups based on the answers, and transmits each answer and the group-membership information to the students' terminals. Based on this information, the students form groups and each group starts its debate. In this study, the system composition is dealt with first, and then the algorithm for forming groups from the students' answers is discussed.
Keywords: group forming, algorithm, collaborative learning, mobile terminal, debate system, groupware

1 Introduction

This study relates to university classes in Japan. In many cases, lectures are given in a one-sided manner, and it has been pointed out that some schemes are necessary to encourage students to express their opinions or to induce discussions among the students themselves [5][8].
Considering the recent trend that almost all students carry their own mobile terminals [6][7], this study concerns a learning system that utilizes mobile terminals as ancillary tools for debate. Students are divided into groups making use of their answers to questionnaires presented beforehand. On this occasion, the information necessary for grouping and the results of grouping are transmitted, together with the contents of each answer, to the students' mobile terminals. The students are then requested to form groups based on this information and to initiate debates in each group. In other words, this system is a sort of blended learning in which face-to-face debate is carried out with the aid of electronic educational information communication. In this study, the algorithm for grouping based on the students' answers is dealt with. From such collaborative learning, two benefits can be expected:
1. Groups are formed with educational effects in mind; therefore, every student can know why he/she is assigned to the current group.
2. Students are instructed to start the debate using the information in their answers as seeds; therefore, they can easily grasp a cue for the debate.
According to conventional grouping methods, groups are formed simply mechanically, in the order of student ID numbers, or students seated nearby are assigned to the same group. In this study, by contrast, we attempt to form groups appropriately based on the answers to some sort of assignment, including tests. Teaching in groups formed on the basis of the students' answers was attempted in prior studies [2][3]. At the time of those studies, since possession of a mobile terminal by every student was not practical, the test results were collected and marked by the teacher, and groups were formed based on the results.
Test papers were returned to the students, groups were formed, and the students in each group were requested to discuss primarily the incorrect answers. For this reason, it took an enormous amount of time from the students' answering of the questionnaires to the group discussion. This time, making full use of the information technology available today, we have configured a new collaborative learning system.

Copyright © 2006-2007 by CCC Publications. Selected paper from ICVL 2006.

Thanks to the assistance of mobile terminals, it is expected that everything from the students' answering to the execution of the discussions can proceed promptly and smoothly. This system is currently under development. In Section 2, an outline of the system composition is described, and the algorithm for forming groups is introduced from Section 3 onward. Although the algorithm is only one portion of the system, building it is a significant step toward putting the system into practical use, which may promote practical e-learning in universities [1].

2 System Flow

Four files in total are prepared and used in this system: the student file, the questionnaire file, the aggregation file and the groupware file. These files are used in this order.
1. Student file: basic information on the students, such as mail address, age and gender, is summarized in this file.
2. Questionnaire file: the questionnaires concerning the discussion theme are summarized in this file. Each questionnaire can be of any of three types: multiple choice allowing only one selection, multiple choice allowing a plurality of selections, and free composition. (Only the multiple-choice types are subjected to the processing of item 4 onward.)
3. Aggregation file: the students' answers to the questionnaires in 2 are aggregated in this file.
4. Groupware file: the computer forms student groups appropriately based on the students' answers in 3.
This file also includes information on which students form each group, in addition to the students' answers. The execution procedures for preparing and utilizing the above-mentioned files are shown in Fig. 1.
(1) A student accesses the predetermined URL, enters basic information such as gender and ID number, and transmits it. His/her file (Excel) is then completed; the teacher may also input and delete information by operating Excel directly. The basic information of a certain class is stored here, and this file can be utilized at every round-table discussion held by the class.
(2) With regard to a discussion planned by a certain teacher, a group of questionnaires which may benefit grouping is prepared and registered in the system as a questionnaire file.
(3) Several groups of questionnaires are registered in the questionnaire file. The teacher picks up specific groups of questionnaires and transmits the URL for browsing the questionnaire items to the mail addresses of all students.
(4) The students answer the questionnaires and transmit them, creating an aggregation file.
(5) Based on the information in the aggregation file, each student is assigned appropriately to a certain group.
(6) The grouping information is transmitted to the students' mobile terminals.
Upon completion of the above-mentioned procedures, the students are grouped in the class and start discussions using the results of their answers as the cue.
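The data flow of steps (1)-(6) can be sketched as a small transformation: registered students answer a known questionnaire, and only valid answers enter the aggregation file. The code below is purely illustrative; the names, e-mail addresses and dictionary layout are our own assumptions, not the actual Excel/CSV implementation.

```python
# Minimal data-flow sketch of the four files used by the system:
# student file -> questionnaire file -> aggregation file (-> groupware file).
def build_aggregation(students, questionnaire, submitted_answers):
    # Keep only answers from registered students to known questionnaire items.
    return {sid: {q: a for q, a in ans.items() if q in questionnaire}
            for sid, ans in submitted_answers.items() if sid in students}

students = {"s1": {"mail": "s1@example.com"}, "s2": {"mail": "s2@example.com"}}
questionnaire = {"q1": ["yes", "no"], "q2": ["a", "b", "c"]}
submitted = {"s1": {"q1": "yes", "q2": "b"},
             "s2": {"q1": "no", "q2": "b"},
             "s9": {"q1": "yes"}}           # s9 is not registered and is dropped
agg = build_aggregation(students, questionnaire, submitted)
print(sorted(agg))  # → ['s1', 's2']
```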
Figure 1: System flow

3 Construction of Student Groups

Suppose that there are four students a, b, c and d, and that three questions A, B and C are given. The results of evaluating the answers to these questions are shown as correct (○) or incorrect (×) in Table 1. One group consists of two students, and discussion is held within the group primarily on the questions that were answered incorrectly. Two types of grouping, shown in the table, are compared. With grouping P, students a and b form one group and students c and d the other. In both groups, however, the members present the same correct/incorrect pattern: in group 1, both members gave an incorrect answer to question B, so the possibility that they can teach each other toward solving the problem is low. The same applies to group 2. With grouping Q, on the contrary, students a and c form one group and students b and d the other, and this problem does not occur. From these cases it may be said that grouping Q is superior to grouping P. When we focus on resolving the correct and incorrect answers of a test, the possibility of solving the problems is considered higher when there are different answer patterns within the same group than when there is only one pattern.

Table 1: Example of evaluated answers and groupings

  Student   Question A   Question B   Question C   Grouping P (group id)   Grouping Q (group id)
  a         ○            ×            ○            1                       1
  b         ○            ×            ○            1                       2
  c         ×            ○            ×            2                       1
  d         ×            ○            ×            2                       2

Suppose now that the number of members in each group is the same.
The excellence u_g of the grouping of a certain group g (g ∈ G) is then defined. Question i and the question set are denoted by m_i and M (m_i ∈ M), respectively; student k in a group g and the student set of that group are denoted by s_k and S_g (s_k ∈ S_g), respectively. Further, the evaluation result e of the answer of student s_k to question m_i is written e(m_i, s_k):
• answer correct → e(m_i, s_k) = 1
• answer incorrect → e(m_i, s_k) = 0

u_g = \frac{1}{|M|\,|S_g|} \sum_{s_k \in g} \sum_{m_i \in M} \left(1 - e(m_i, s_k)\right) \bigcup_{s_j \in g} e(m_i, s_j)    (1)

where |M| denotes the number of elements of the set M, and ⋃ denotes the logical OR over the group members. The excellence U of the grouping of the entire class is expressed by

U = \frac{1}{|G|} \sum_{g \in G} u_g    (2)

Although two u_g's obtained by equation (1) are comparable to each other only when both groups consist of the same number of students, the idea proposed in this paper can be extended to groups of different sizes within a classroom by equation (3), in which equations (1) and (2) are combined:

U = \frac{1}{|M|\, n} \sum_{g \in G} \sum_{s_k \in g} \sum_{m_i \in M} \left(1 - e(m_i, s_k)\right) \bigcup_{s_j \in g} e(m_i, s_j)    (3)

where n is the number of students in the classroom. In practice, several groups may differ in the number of members, because the total number of students in the classroom cannot always be divided evenly. Whether the number of students per group is equal or not, it is necessary to evaluate equation (2) or (3) for all combinations of student groups in order to obtain the maximum. Although this is possible in theory, it is difficult to obtain the exact solution because of the enormous number of calculations; in reality, with groups of 4 members, the computer can handle a class of 50 people at most. A more simplified method should therefore be used. In other words, in the present algorithm, supposing that the students of one class are re-expressed as s_1, . .
. , s_n, the following replacements are considered, resulting in n(n−1)/2 evaluations in total, and the grouping for which U becomes maximum is judged to be the optimum grouping:

s_1 and s_2, s_1 and s_3, · · · , s_1 and s_n
s_2 and s_3, s_2 and s_4, · · · , s_2 and s_n
· · · · · ·
s_{n−1} and s_n    (4)

It is already known that when a class consists of twenty-odd members and each group consists of four members, the excellence of the grouping obtained by the simplified algorithm used in the current proposal reaches approximately 60-100% of the optimum, depending on the answer pattern, compared with the grouping obtained by examining all combinations without simplification [4].

4 Discussion

Two kinds of classroom discussion were administered using the proposed method. The first kind is for discussing test answers, where an evaluation of correct or wrong is determined. Digital mathematics served as the subject. Several test problems were given to the students; all had multiple-choice answers, and the students selected an answer among them. Because the evaluation was already done, the instructor, after the group forming, told the students to start by stating their evaluated results and then to discuss the answers that were evaluated as wrong. The second kind is for discussing opinions, where an evaluation of correct or wrong cannot be determined. A training program for job hunting served as the subject. An example of a question asked in an interview, together with the interviewee's answer, was presented, followed by several opinions on the answer; the students then selected the opinion they thought closest to their own impression. The instructor, after the group forming, told the students to start by stating their own choice, to discuss a better answer, and then to produce a report as group co-work.
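The grouping measure of Section 3 and the simplified search can be sketched in code. The sketch below is an illustrative implementation, reading the pair enumeration in (4) as a swap-and-keep-if-better search; the function names and data layout are ours, not the authors'.

```python
from itertools import combinations

# Excellence U of a grouping (equations (1)-(2)): a wrong answer by a student
# counts only when at least one member of the same group answered that question
# correctly. E[student][question] is 1 for correct, 0 for incorrect.
def grouping_U(E, groups, questions):
    total = 0.0
    for g in groups:
        s = sum((1 - E[sk][m]) * max(E[sj][m] for sj in g)
                for sk in g for m in questions)
        total += s / (len(questions) * len(g))
    return total / len(groups)

# Simplified search of (4), read as a pairwise-exchange hill climb: try swapping
# every pair of students in different groups and keep swaps that increase U.
def improve_by_swaps(E, groups, questions):
    groups = [list(g) for g in groups]
    improved = True
    while improved:
        improved = False
        for gi, gj in combinations(range(len(groups)), 2):
            for i in range(len(groups[gi])):
                for j in range(len(groups[gj])):
                    before = grouping_U(E, groups, questions)
                    groups[gi][i], groups[gj][j] = groups[gj][j], groups[gi][i]
                    if grouping_U(E, groups, questions) > before:
                        improved = True    # keep the beneficial swap
                    else:                  # undo the swap
                        groups[gi][i], groups[gj][j] = groups[gj][j], groups[gi][i]
    return groups

# Table 1 example: starting from grouping P (identical answer patterns together),
# the swap search reaches a grouping with U = 0.5, the level of grouping Q.
E = {"a": {"A": 1, "B": 0, "C": 1}, "b": {"A": 1, "B": 0, "C": 1},
     "c": {"A": 0, "B": 1, "C": 0}, "d": {"A": 0, "B": 1, "C": 0}}
Q = ["A", "B", "C"]
best = improve_by_swaps(E, [["a", "b"], ["c", "d"]], Q)
print(grouping_U(E, best, Q))  # → 0.5
```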
Through several administrations, the method was judged to give the students learning motivation, because every time the proposed discussions were held the group members changed for a clear reason, which the students found refreshing. In addition, we thought that if more detailed directions were given on what points the discussion ought to address, the students might find it easier to start.

Development of an Algorithm for Groupware Modeling for a Collaborative Learning

5 Consideration

According to the algorithm for forming groups used in the current proposal, grouping is made so that the students' results of evaluation of test answers differ within each group as much as possible. To establish a more generalized algorithm, a broader investigation is necessary, including what sorts of group discussion are actually available, what sorts of information are effective for forming groups, and what judgment criteria should be used to decide whether a grouping based on that information is good or bad. Although in the present study the results of evaluation of answers are digitized to either 0 or 1, the authors intend to investigate another algorithm capable of coping with more diversified evaluation results. At the same time, the utility of this group discussion system and the points to be improved will be examined from both the software and the hardware aspects through practice.

6 Acknowledgements

The present study has been promoted under grants partly from the Grant-in-Aid for Scientific Research: Germination Study No. 17650260, Fundamental Study (B) No. 18300288, and Germination Study No. 17300275, each sponsored by the Ministry of Education, Culture, Sports, Science and Technology. We are sincerely thankful to all the persons and bodies that helped our work.

References

[1] Styliadis D.A., Karamitsos D.I. and Zachariou I.D., Personalized e-learning implementation - the GIS case, International Journal of Computers, Communications & Control, Vol. 1, No. 1, pp. 59-66, 2006.
[2] Kitagaki I., Shimizu Y. and Suetake K., An instructional method which permits the students to critically discuss their own test answers, Jour. of Educ. Technol., Vol. 5, No. 1, pp. 23-33, 1980.
[3] Kitagaki I., Shimizu Y., Consideration on an educational system which permits the students to discuss over their own test answers, Res. Sci. Educ., Vol. 5, No. 1, pp. 22-28, 1981.
[4] Kitagaki I., A consideration on an educational groupware algorithm using fuzzy integral, IEICE Tech. Rep., Vol. ET94-101, pp. 61-66, 1994.
[5] Nagai M., Kitazawa T., Koshikawa H., Kato H., Akahori K., Development and verification of the formative evaluation system with utilizing mobile phone for web-based collaborative learning, Japan Journal of Educational Technology, Vol. 28, No. 4, pp. 333-342, 2005.
[6] Nagamori M., Ueno M., Ando M., Pokpong S., Endo K., Nagaoka K., Response analyzer system using mobile phones for distance education, Japan Journal of Educational Technology, Vol. 29, Suppl., pp. 57-60, 2005.
[7] Otsuka K., Yahiro T., The input interface of the value of evaluation in an evaluation system of instruction using mobile phones, Japan Journal of Educational Technology, Vol. 30, No. 2, pp. 125-134, 2006.
[8] Ozawa S., Mochizuki T., Egi H., Kunifuji S., Facilitating reflection in collaborative learning using formative peer evaluation among groups, Japan Journal of Educational Technology, Vol. 28, No. 4, pp. 281-294, 2005.
Ikuo Kitagaki
Hiroshima University, Japan
Graduate School of Education / The Research Institute for Higher Education
1-2-2 Kagamiyama, Higashi-Hiroshima, 739-8512, Japan
E-mail: kitagaki@hiroshima-u.ac.jp

Atsushi Hikita
Hiroshima University, Japan
Community Cooperation Center
E-mail: hikita@hiroshima-u.ac.jp

Makoto Takeya
Takushoku University, Japan
Faculty of Engineering, Department of Computer Science
815-1 Tate-machi, Hachioji, Tokyo, 193-0985 Japan
E-mail: takeya@cs.takushoku-u.ac.jp

Yasuhiro Fujihara
Iwate Prefectural University
Faculty of Software and Information Science
E-mail: fuji@iwate-pu.ac.jp

Received: November 7, 2006

Editor's note about the authors:

Ikuo Kitagaki, born in Aichi on August 9, 1947, received his BE and ME degrees in electronics in 1970 and 1972, and his Doctor of Engineering degree in 1981, from Tokyo Institute of Technology. Joining Tokyo Institute of Technology in 1973 and the Employment Promotion Corporation in 1986, he was engaged in various research areas including the development of computer applications in education, fuzzy science, the science of laughter, and so on. Since 2000 he has been with Hiroshima University, Hiroshima, Japan. Presently he is professor at the Graduate School of Education / Research Institute for Higher Education, Hiroshima University. One of his most important recent works is "University Authority" (in Japanese). His career is also listed in "Who's Who in Science and Engineering", Marquis, 9th ed. (2006-2007), USA.

Atsushi Hikita (born on January 30, 1968) graduated from the Faculty of Physics of Sophia University, Japan, in 1991, and received his master's degree from the same faculty in 1993. He worked at Mitsubishi Research Institute as a media and information planner and researcher (1993-2000).
His main research fields are information design (information and its places, especially maps and pictograms as nonverbal communication) and media design (media communication for society, museums, libraries and classrooms). He has co-authored and co-edited several books and papers in these fields. He is a chair of the APNG (Asia Pacific Networking Group) Education and Live E! WG, and a committee member of the Chugoku-Shikoku Internet Council and the Hiroshima Region IPv6 Deploy Committee.

Makoto Takeya, born in Tokyo on November 2, 1941, received his BE and ME degrees in applied physics in 1966 and 1968, and his Doctor of Science degree in 1981, from Waseda University. Joining NEC Corporation in 1968, he was engaged in various research areas including the development of computer applications in education. Since 1986 he has been with Takushoku University, Tokyo, Japan. During 1992-1993 he was a visiting scholar at the Department of Educational Psychology, University of Illinois. Presently he is professor at the Department of Computer Science, Takushoku University. Among his most important books are "A New Test Theory: Structure Analysis Methods for Educational Information" (in Japanese) and "Structure Analysis Methods for Instruction: Theory and Practice of Instructional Architecture, Design and Evaluation". He is the recipient of the 1976 Yonezawa Memorial Award from the Institute of Electronics and Communications Engineers of Japan, the 1989 Excellent Research Award from the Behaviormetric Society of Japan, the 1996 Winning Paper Award from the Japan Society of Educational Technology, and the 1999 Engineering Education Award from the Japanese Society for Engineering Education.

Yasuhiro Fujihara (born on January 8, 1971) graduated from the Faculty of Education of Kobe University in 1993. Presently he is an assistant professor at the Faculty of Software and Information Science, Iwate Prefectural University, Japan.
His main research fields are educational technology (educational evaluation, e-learning) and ICT education. He is a member of the Japan Society for Educational Technology and the Institute of Electronics, Information and Communication Engineers.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 3, pp. 325-335

Consensus Problem of Second-Order Dynamic Agents with Heterogeneous Input and Communication Delays

C.-L. Liu, F. Liu

Cheng-Lin Liu, Fei Liu
Jiangnan University, Institute of Automation
Wuxi 214122, Jiangsu, People's Republic of China
E-mail: liucl@jiangnan.edu.cn, fliu@jiangnan.edu.cn

Abstract: The consensus problem of second-order multi-agent systems with a velocity damping term in the agents' dynamics is investigated. Based on frequency-domain analysis, a decentralized consensus condition, which depends on the input delays, is obtained for the system based on an undirected and symmetric graph with heterogeneous input delays. For the system based on a directed graph with both heterogeneous input delays and communication delays, a decentralized consensus condition, which depends on the input delays but is independent of the communication delays, is also obtained. Simulations illustrate the correctness of the results.

Keywords: coordination control, consensus, second-order multi-agent systems, communication delay, input delay

1 Introduction

In the last decade, distributed coordination of multiple autonomous agents has attracted more and more attention from various research communities for its broad applications, including automated highway systems, air traffic control, congestion control in the Internet, etc. The consensus problem, which is one of the most fundamental and important issues in the coordination control of multi-agent systems, requires that the outputs of several spatially distributed agents reach a common value without recourse to a central controller.
For first-order multi-agent systems, with agents' dynamics modeled by single integrators, and second-order multi-agent systems, with agents' dynamics modeled by double integrators, consensus algorithms have been proposed to solve the consensus problem, and sufficient conditions have been obtained for the system to converge to consensus under static or switched interconnection topologies [1–4]. Recently, more and more attention has been paid to the effect of delays on the consensus convergence of multi-agent systems. Generally speaking, two kinds of time delays cannot be neglected in multi-agent systems. One is the communication delay, which is related to the information transmission between neighboring agents. The other is the input delay, which is related to the processing and connecting time for the packets arriving at each agent [5]. The consensus problem under communication delays has been extensively studied for first-order multi-agent systems with different analysis methods, such as Lyapunov function analysis [6, 7], frequency-domain analysis [2, 8], methods based on delayed and hierarchical graphs [9, 10], methods based on the difference between the maximum and minimum values [11, 12], etc. However, consensus analysis of second-order multi-agent systems with communication delay is much more difficult, and most existing results concern the synchronous consensus algorithm [13–15], in which self-delays equal to the corresponding communication delays are introduced for each agent in the coordination control part. Compared with first-order multi-agent systems, the consensus algorithm without any self-delay, which is called the asynchronous consensus algorithm, has not been studied extensively for second-order multi-agent systems. Using the small-µ stability theorem, Yang et al. [16] obtained frequency-domain consensus conditions for second-order multi-agent systems with time-varying communication delays. Based on frequency-domain analysis [17] and the Lyapunov-Krasovskii functional method [18], Spong et al. proved that, by choosing a proper consensus protocol and control parameters, second-order multi-agent systems with heterogeneous communication delays converge to a stationary consensus regardless of the delays. Using the properties of nonnegative matrices, Lin and Jia [19] obtained delay-independent sufficient conditions for second-order discrete-time multi-agent systems with heterogeneous communication delays to converge to a stationary consensus under dynamically changing topologies. To our knowledge, however, the consensus problem under input delays has not attracted as much attention. In some reports, the identical communication delay introduced in the synchronous consensus algorithm can be treated as an identical input delay [2, 14, 15]. Using the frequency-domain analysis method, Tian and Liu [5] considered the consensus problem of first-order multi-agent systems with heterogeneous input delays based on undirected graphs, and obtained a decentralized consensus criterion depending on the input delays. Moreover, a decentralized consensus condition, which depends only on the input delays, was also obtained in [5] for first-order multi-agent systems with both heterogeneous communication delays and input delays based on a digraph. In [20], Tian and Liu investigated the leader-following consensus problem of second-order multi-agent systems with heterogeneous input delays and symmetric coupling weights, and a decentralized consensus condition with some prerequisites was obtained for the system to converge to the states of the dynamic leader asymptotically.

Copyright © 2006-2010 by CCC Publications
Furthermore, the robustness of the symmetric system under asymmetric weight perturbations is also investigated in [20], and a bound on the largest singular value of the perturbation matrix is obtained as the robust consensus condition. In this paper, we consider the consensus problem of second-order multi-agent systems with a velocity damping term in the agents' dynamics, and analyze the consensus conditions for the system with heterogeneous delays to converge to a stationary consensus. Firstly, we investigate the consensus problem for the system based on an undirected and symmetric graph with heterogeneous input delays, and a decentralized consensus condition, which is delay-dependent, is obtained by using some early results on Internet congestion control with heterogeneous communication delays [21]. Then, we study the consensus problem for the system based on a general directed graph with both heterogeneous input delays and communication delays by using the Gershgorin disc theorem, and another decentralized consensus condition, which depends on the input delays only, is also obtained. This consensus condition is more conservative than the former because of the existence of heterogeneous communication delays and the asymmetry of the coupling weights, but it can be applied to systems based on directed graphs with asymmetric weights.

2 Preliminaries on Graph Theory

A weighted directed graph (digraph) $G = (V, E, A)$ of order $n$ consists of a set of vertices $V = \{1, \ldots, n\}$, a set of edges $E \subseteq V \times V$ and a weighted adjacency matrix $A = [a_{ij}] \in \mathbb{R}^{n \times n}$ with nonnegative adjacency elements $a_{ij}$. The node indexes belong to a finite index set $I = \{1, 2, \ldots, n\}$. An edge of the weighted digraph $G$ is denoted by $e_{ij} = (i, j) \in E$, i.e., $e_{ij}$ is a directed edge from $i$ to $j$. We assume that the adjacency elements associated with the edges of the digraph are positive, i.e., $a_{ij} > 0 \Leftrightarrow e_{ij} \in E$. Moreover, we assume $a_{ii} = 0$ for all $i \in I$.
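The graph-theoretic objects of this section translate directly into code. The following minimal sketch (an illustration, not code from the paper) builds the degree matrix and the Laplacian $L = D - A$ used throughout the analysis, and tests whether a node is globally reachable by searching backwards along the edges $(i, j)$ with $a_{ij} > 0$:

```python
def laplacian(a):
    """Laplacian L = D - A of a weighted digraph, where D is the diagonal
    out-degree matrix with deg_out(i) = sum_j a[i][j]; a[i][i] is assumed 0."""
    n = len(a)
    return [[(sum(a[i]) if i == j else 0) - a[i][j] for j in range(n)]
            for i in range(n)]

def globally_reachable(a, j):
    """True if node j is reachable from every other node: depth-first search
    from j along reversed edges (an edge i -> j exists when a[i][j] > 0)."""
    n = len(a)
    seen, stack = {j}, [j]
    while stack:
        v = stack.pop()
        for u in range(n):
            if a[u][v] > 0 and u not in seen:
                seen.add(u)
                stack.append(u)
    return len(seen) == n
```

By construction every row of $L$ sums to zero, which is why $L\,[1, \cdots, 1]^T = 0$ in the consensus analysis below.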
The set of neighbors of node $i$ is denoted by $N_i = \{j \in V : (i, j) \in E\}$. In the digraph $G$, if $(i, j) \in E \Leftrightarrow (j, i) \in E$, we usually say that $G$ is an undirected (or bidirectional) graph. The out-degree of node $i$ is defined as $\deg_{out}(i) = \sum_{j=1}^{n} a_{ij}$. Let $D$ be the diagonal matrix with the out-degree of each node along the diagonal, and call it the degree matrix of $G$. The Laplacian matrix of the weighted digraph is defined as $L = D - A$. If there is a path in $G$ from one node $i$ to another node $j$, then $j$ is said to be reachable from $i$; if not, $j$ is said to be not reachable from $i$. If a node is reachable from every other node in the digraph, we say it is globally reachable. A digraph is strongly connected if every node in the digraph is globally reachable. An undirected graph is connected if it contains a globally reachable node.

3 Problem Formulation

In a multi-agent system composed of $n$ agents, each agent can be regarded as a node in a digraph, and the information flow between neighboring agents can be considered as directed paths between the nodes in the digraph. Thus, the interconnection topology of a multi-agent system can be described by a digraph $G = (V, E, A)$. In this paper, we consider only a static topology $G$, i.e., the connections of the nodes in the digraph $G$ do not change with time. Consider second-order dynamic agents modeled by

$$\dot{x}_i = v_i, \quad m_i \dot{v}_i = f_i, \quad i \in I, \qquad (1)$$

where $x_i \in \mathbb{R}$ and $v_i \in \mathbb{R}$ are the position and the velocity of agent $i$ respectively, $m_i > 0$ is the mass of agent $i$, and $f_i$ is the total force on agent $i$. Analogous to [22], the total force $f_i$ in the second-order model (1) consists of two components: $f_i = -\alpha_i v_i + u_i$, where $u_i$ is the external control input, and $-\alpha_i v_i$ with $\alpha_i > 0$ denotes the velocity damping term caused by resistance, e.g., friction. Then, the second-order model (1) becomes

$$\dot{x}_i = v_i, \quad m_i \dot{v}_i = -\alpha_i v_i + u_i, \quad i \in I.$$
(2)

With non-negligible input delays in the external control, the agents (2) become

$$\dot{x}_i(t) = v_i(t), \quad m_i \dot{v}_i(t) = -\alpha_i v_i(t) + u_i(t - t_i), \quad i \in I, \qquad (3)$$

where $t_i > 0$ is the input delay of agent $i$. For the system (3), we take a consensus protocol based on the agents' position states as follows:

$$u_i = \kappa_i \sum_{j \in N_i} a_{ij}(x_j - x_i), \qquad (4)$$

where $\kappa_i > 0$, $N_i$ denotes the neighbors of agent $i$, and $a_{ij} > 0$ is the adjacency element of $A$ in the digraph $G = (V, E, A)$. Under communication delays, the protocol (4) becomes

$$u_i(t) = \kappa_i \sum_{j \in N_i} a_{ij}(x_j(t - \tau_{ij}) - x_i(t)), \qquad (5)$$

where $\tau_{ij}$ is the communication delay from agent $j$ to agent $i$. With the protocol (5), the closed-loop form of the system (3) is

$$\dot{x}_i(t) = v_i(t), \quad m_i \dot{v}_i(t) = -\alpha_i v_i(t) + \kappa_i \sum_{j \in N_i} a_{ij}(x_j(t - t_i - \tau_{ij}) - x_i(t - t_i)), \quad i \in I. \qquad (6)$$

Remark 1. In [20], Tian and Liu have studied the leader-following consensus problem of second-order multi-agent systems with heterogeneous input delays under a double-consensus algorithm, and obtained consensus conditions for the system with symmetric and asymmetric weights respectively. Differently from [20], we consider the stationary consensus of the second-order dynamic agents (6) with a velocity damping term, and analyze the consensus conditions for the system with heterogeneous communication delays and input delays.

4 Consensus Criterion

4.1 Consensus under heterogeneous input delays

In this section, we investigate the consensus problem of the multi-agent system (6) with heterogeneous input delays only, as follows:

$$\dot{x}_i(t) = v_i(t), \quad m_i \dot{v}_i(t) = -\alpha_i v_i(t) + \kappa_i \sum_{j \in N_i} a_{ij}(x_j(t - t_i) - x_i(t - t_i)), \quad i \in I. \qquad (7)$$

Firstly, we give an assumption on the velocity damping coefficient $\alpha_i$, the mass $m_i$ and the input delay $t_i$.

Assumption 2. $\left(t_i \frac{\alpha_i}{m_i} - t_j \frac{\alpha_j}{m_j}\right)(t_i - t_j) \leq 0, \quad \forall i, j \in I, \ i \neq j.$

Now, we present some sufficient conditions for second-order multi-agent systems with heterogeneous input delays.

Theorem 3.
Consider the network of $n$ dynamic agents (7) with a static interconnection topology $G = (V, E, A)$ that is undirected (or bidirectional) and connected, and whose topology graph has symmetric weights, i.e., $a_{ij} = a_{ji}$. Then, under Assumption 2, all the agents in system (7) asymptotically converge to a stationary consensus, i.e., $\lim_{t\to\infty} x_i(t) = c$, $\lim_{t\to\infty} v_i(t) = 0$, $\forall i \in I$, where $c$ is a constant, if

$$\sum_{j \in N_i} a_{ij} < \frac{m_i}{\kappa_i (g_i^m)^{-1}}, \quad \forall i \in I, \qquad (8)$$

where $g_i^m$ is the gain margin of the transfer function $w_i(s) = \frac{e^{-s t_i}}{s^2 + \frac{\alpha_i}{m_i} s}$.

Before proving Theorem 3, we list two useful lemmas.

Lemma 4. [23] Let $Q \in \mathbb{C}^{n \times n}$, $Q = Q^* \geq 0$, and $T = \mathrm{diag}\{t_i\}$, $t_i \in \mathbb{C}$. Then $\lambda(QT) \in \rho(Q)\,\mathrm{Co}(\{0\} \cup \{t_i\})$, where $\lambda(\cdot)$ denotes a matrix eigenvalue, $\rho(\cdot)$ denotes the matrix spectral radius, and $\mathrm{Co}(\cdot)$ denotes the convex hull.

Based on Remark 4 and Claim 1 in [21], we obtain the following lemma.

Lemma 5. Suppose that Assumption 2 holds for the frequency response of a family of systems described by

$$g_i(j\omega) = \frac{g_i^m}{(j\omega)^2 + j\frac{\alpha_i}{m_i}\omega}\, e^{-j t_i \omega}, \quad i \in I,$$

where $g_i^m$ is the gain margin of the transfer function $w_i(s) = \frac{e^{-s t_i}}{s^2 + \frac{\alpha_i}{m_i} s}$. Then, $\gamma\,\mathrm{Co}(\{0\} \cup \{g_i(j\omega), i \in I\})$ does not contain the point $(-1, j0)$ for any given real number $\gamma \in [0, 2)$ and any $\omega \in (-\infty, \infty)$.

Now, we give the proof of Theorem 3. The system (7) is rewritten as

$$\dot{x}_i(t) = v_i(t), \quad \dot{v}_i(t) = -\bar{\alpha}_i v_i(t) + \bar{\kappa}_i \sum_{j \in N_i} a_{ij}(x_j(t - t_i) - x_i(t - t_i)), \quad i \in I, \qquad (9)$$

where $\bar{\alpha}_i = \frac{\alpha_i}{m_i}$ and $\bar{\kappa}_i = \frac{\kappa_i}{m_i}$. Taking the Laplace transform of the system (9), we obtain the characteristic equation of the system (9) about $x(t) = [x_1(t), \cdots, x_n(t)]^T$ as

$$\det(\mathrm{diag}\{s^2 + \bar{\alpha}_i s, i \in I\} + \mathrm{diag}\{\bar{\kappa}_i e^{-t_i s}, i \in I\} L) = 0.$$

Define $d(s) = \det(\mathrm{diag}\{s^2 + \bar{\alpha}_i s, i \in I\} + \mathrm{diag}\{\bar{\kappa}_i e^{-t_i s}, i \in I\} L)$; we will prove in the following that all the zeros of $d(s)$ lie in the open left half complex plane or at $s = 0$.
when s = , d() = det(diag{ + ᾱi, i ∈ i} + diag{κ̄ie−ti, i ∈ i}l) = det(diag{κ̄i, i ∈ i}) det(l), because the interconnection graph g = (v, e, a) is connected,  is a simple eigenvalue of l [24], i.e., rank(l) = n − . hence, d(s) has only one zero at s = . when s 6= , define f(s) = det(i + diag{ κ̄is+ᾱis e −tis, i ∈ i}l). we will prove that the zeros of f(s) lie on the open left half complex plane. according to the general nyquist stability criterion [25], the zeros of f(s) are on the open left half complex plane, if λ (diag{ κ̄i (jω)+jω ᾱi e−jω ti , i ∈ i}l) does not enclose the point (−, j) for ω ∈ r. for the symmetric weights (ai j = a ji), we get l = lt ≥  according to the definition of the laplacian matrix. based on lemma 4, we get λ (diag{ κ̄i −ω  + jᾱiω e−jtiω }l) = λ (diag{ gmi −ω  + jᾱiω e−jtiω diag{ √ κ̄i(gmi )−}ldiag{ √ κ̄i(gmi )−}) ∈ ρ(diag{ √ κ̄i(gmi )−}ldiag{ √ κ̄i(gmi )−})co(∪ gmi −ω  + jᾱiω e−jtiω ). since the spectral radius of any matrix is bounded by its largest absolute row sum, it follows from the condition (8) that ρ(diag{ √ κ̄i(gmi )−}ldiag{ √ κ̄i(gmi )−}) = ρ(diag{κ̄i(g m i ) −}l) ≤ max i∈i κ̄i(gmi ) −( ∑ j∈ni ai j) < . therefore, from lemma 5, we obtain that (−, ) 6∈ ρ(diag{ √ κ̄i(gmi )−}ldiag{ √ κ̄i(gmi )−})co(∪ gmi −ω  + jᾱiω e−jtiω ), i.e., λ (diag{ κ̄i (jω)+jω ᾱi e−jω ti , i ∈ i}l) does not enclose the point (−, j) for ω ∈ r, which implies that the zeros of f(s) are all on the open left half complex plane. now, we have proved that d(s) has its zeros on the open left half complex plane except for one zero at s = . thus, the state xi(t) of the system (7) converges to a steady state, i.e., limt→∞ xi(t) = x∗i , i ∈ i, and limt→∞ vi(t) = ,∀i ∈ i holds for (7). it is obtained from (7) that l[x∗,··· , x∗n]t = . since rank(l) = n − and l[,··· , ]t =  from the definition of the laplacian matrix l, the roots of lx∗ =  can be expressed as x∗ = c[,··· , ]t , where c is a constant. theorem 3 is proved. 
Remark 6. Obviously, the consensus condition (8) in Theorem 3 depends strictly on Assumption 2 and on the symmetry of the coupling weights between the agents.

4.2 Consensus under heterogeneous input and communication delays

In multi-agent systems, the interconnection topology composed of dynamic agents is usually asymmetric, and the communication delays caused by information transmission always exist between neighboring agents. Thus, Lemma 4 and Lemma 5, which play important roles in the proof of Theorem 3, cannot be applied in these cases. In this section, we analyze the consensus of the second-order dynamic agents (6) with both heterogeneous input delays and communication delays under a general directed interconnection topology.

Theorem 7. Consider the network of $n$ dynamic agents (6) with a static interconnection topology $G = (V, E, A)$ that has a globally reachable node. If

$$\sum_{j \in N_i} a_{ij} < \frac{\alpha_i^2}{2\kappa_i(m_i + \alpha_i t_i)}, \quad \forall i \in I, \qquad (10)$$

all the agents in the system (6) converge to a stationary consensus asymptotically.

Proof: Firstly, rewrite the system (6) as

$$\dot{x}_i(t) = v_i(t), \quad \dot{v}_i(t) = -\bar{\alpha}_i v_i(t) + \bar{\kappa}_i \sum_{j \in N_i} a_{ij}(x_j(t - t_i - \tau_{ij}) - x_i(t - t_i)), \quad i \in I, \qquad (11)$$

where $\bar{\alpha}_i = \frac{\alpha_i}{m_i}$ and $\bar{\kappa}_i = \frac{\kappa_i}{m_i}$. Taking the Laplace transform of the system (11), we obtain the characteristic equation of the system (11) about $x(t) = [x_1(t), \cdots, x_n(t)]^T$:

$$\det(\mathrm{diag}\{s^2 + \bar{\alpha}_i s, i \in I\} + \mathrm{diag}\{\bar{\kappa}_i e^{-t_i s}, i \in I\} L(s)) = 0,$$

where the $n \times n$ matrix $L(s) = \{l_{ij}(s)\}$ is defined by

$$l_{ij}(s) = \begin{cases} -a_{ij} e^{-\tau_{ij} s}, & j \in N_i; \\ \sum_{j \in N_i} a_{ij}, & j = i; \\ 0, & \text{otherwise}, \end{cases}$$

and $L(0) = L$, the Laplacian matrix. Define $\tilde{d}(s) = \det(\mathrm{diag}\{s^2 + \bar{\alpha}_i s, i \in I\} + \mathrm{diag}\{\bar{\kappa}_i e^{-t_i s}, i \in I\} L(s))$; we will prove in the following that all the zeros of $\tilde{d}(s)$ lie in the open left half plane or at $s = 0$. When $s = 0$,

$$\tilde{d}(0) = \det(\mathrm{diag}\{0 + \bar{\alpha}_i \cdot 0, i \in I\} + \mathrm{diag}\{\bar{\kappa}_i, i \in I\} L(0)) = \det(\mathrm{diag}\{\bar{\kappa}_i, i \in I\}) \det(L),$$

and because the interconnection topology $G = (V, E, A)$ has a globally reachable node, $0$ is a simple eigenvalue of $L$ [24].
hence, d̃() = , i.e., d̃(s) has only one zero at s = . when s 6= , define f̃(s) = det(i + diag{ κ̄is+ᾱis e −tis, i ∈ i}l(s)). we will prove the zeros of f̃(s) lie on the open left half complex plane. according to the general nyquist stability criterion [25], the zeros of f̃(s) are on the open left half complex plane, if λ (diag{ κ̄i (jω)+jω ᾱi e−jω ti , i ∈ i}l(jω)) does not enclose the point (−, j) for ω ∈ r. based on the greshgorin’s disc theorem, λ (diag{ κ̄i −ω  + jᾱiω e−jtiω , i ∈ i}l(s)) ∈ ⋃ i∈i {ζ : ζ ∈ c, |ζ − κ̄i( ∑ j∈ni ai j) −ω  + jᾱiω e−jtiω | ≤ ∑ j∈ni | κ̄iai j −ω  + jᾱiω e−j(ti+τi j )ω |} = ⋃ i∈i {ζ : ζ ∈ c, |ζ − κ̄igi −ω  + jᾱiω e−jtiω | ≤ | κ̄igi −ω  + jᾱiω e−jtiω |} consensus problem of second-order dynamic agents with heterogeneous input and communication delays 331 holds for ω ∈ r, where gi = ∑ j∈ni ai j. then, λ (diag{ κ̄i (jω)+jω ᾱi e−jω ti , i ∈ i}l(jω)) does not enclose the point (−, j) for ω ∈ r as long as the point (−a, j) with a ≥  does not in the disc {ζ : ζ ∈ c, |ζ − κ̄igi −ω +jᾱiω e−jtiω | ≤ | κ̄igi −ω +jᾱiω e−jtiω |} for all ω ∈ r, i.e., | − a + j − κ̄igi −ω +jᾱiω e−jtiω | > | κ̄igi −ω +jᾱiω e−jtiω | holds for all ω ∈ r when a ≥ . by calculating, we obtain | − a + j − κ̄igi −ω  + jᾱiω e−jtiω | − | κ̄igi −ω  + jᾱiω e−jtiω | = a(a − κ̄igi cos(ω ti) + ᾱi sin(ω ti) ω ω  + ᾱ i ). because cos(ω ti) ≤  and sin(ω ti)ω ≤ ti hold for ω ∈ r, it follows from (11) that κ̄igi cos(ω ti) + ᾱi sin(ω ti) ω ω  + ᾱ i ≤ κ̄igi( + ᾱiti) ᾱ i < . hence, | − a + j − κ̄igi −ω +jᾱiω e−jtiω | > | κ̄igi −ω +jᾱiω e−jtiω | holds for all ω ∈ r when a ≥ . now, we have proved that d̃(s) has its zeros on the open left half complex plane except for a zero at s = . thus, the state xi(t) of the system (6) converges to a steady state, i.e., limt→∞ xi(t) = x∗i , i ∈ i, and limt→∞ vi(t) = ,∀i ∈ i holds for (6). 
Then, analogously to the proof of Theorem 3, the system (6) converges to a stationary consensus for a digraph that has a globally reachable node. Theorem 7 is proved.

Remark 8. In networks composed of interconnected dynamic systems, scalability is an important property that needs to be maintained [5, 20, 21, 23]. Obviously, the decentralized consensus conditions (8) and (10) maintain the scalability of the multi-agent network. Without having to redesign the entire network whenever an agent is added or removed, the networks (6) and (7) can achieve the desired collective behavior as long as the local conditions hold for each agent and its neighbors, and the connectedness of the interconnection topology is maintained.

Remark 9. In the consensus analysis of multi-agent systems, the Gershgorin disc theorem has been used extensively to obtain communication-delay-independent consensus conditions for systems with heterogeneous communication delays [8, 17]. In [17], decentralized frequency-domain consensus conditions were obtained for multi-agent systems whose agents' dynamics are modeled by strictly stable linear systems under heterogeneous communication delays; by transformation, the system (6) can be expressed as a special case of the system studied in [17]. However, (10) gives a concrete algebraic criterion, which is convenient for the design of the consensus algorithm.

Remark 10. According to [20] (inequality (24) therein), the consensus condition (8) in Theorem 3 satisfies

$$\frac{m_i}{\kappa_i (g_i^m)^{-1}} > \frac{\alpha_i}{2\kappa_i t_i} > \frac{\alpha_i^2}{2\kappa_i(m_i + \alpha_i t_i)}.$$

Thus, under the same conditions, the consensus condition (10) in Theorem 7 is more conservative than the consensus condition (8) given in Theorem 3.

5 Simulation

Example 11. Consensus under input delays based on a symmetric graph. Consider a multi-agent network of five dynamic agents described by (7). The interconnection topology is described in Figure 1; the graph is undirected and connected.
The symmetric weights of the edges are: a = a = ., a = a = ., a = a = ., a = a = ., a = a = .. The input delays of the agents are: t_1 = .(s), t_2 = .(s), t_3 = .(s), t_4 = .(s) and t_5 = .(s). The velocity damping coefficients of the agents are: α_1 = , α_2 = , α_3 = ., α_4 = , α_5 = , and the mass of each agent is assumed to be 1, i.e., m_i = 1, i = 1, ..., 5. Thus, Assumption 2 holds for all the agents. For the transfer functions w_i(s) = e^{-s t_i}/(s^2 + (α_i/m_i)s), i = 1, ..., 5, by using the Matlab simulator we obtain the gain margins: g_1^m ≈ ., g_2^m ≈ ., g_3^m ≈ ., g_4^m ≈ . and g_5^m ≈ .. According to condition (8), the control parameters κ_i must satisfy: κ_1 ∈ (0, .), κ_2 ∈ (0, .), κ_3 ∈ (0, .), κ_4 ∈ (0, .), κ_5 ∈ (0, .), and we choose κ_1 = , κ_2 = ., κ_3 = ., κ_4 =  and κ_5 = . Then, with the initial states generated randomly, the agents in the system (7) converge to a stationary consensus (see Figure 2).

Figure 1: Undirected graph with symmetric weights

Figure 2: Consensus convergence under input delays (positions x_i and velocities v_i versus time)

Example 12. Consensus under input and communication delays based on an asymmetric digraph. Consider a network of five agents described by (6). The interconnection topology is the digraph described in Figure 3, and the globally reachable node set of the digraph is {, }. The weights of the directed edges are: a = ., a = ., a = ., a = ., a = ., a = ., and the corresponding communication delays are: τ = .(s), τ = .(s), τ = .(s), τ = .(s), τ = .(s), τ = .(s). The velocity damping coefficients of the agents are: α_1 = ., α_2 = , α_3 = , α_4 = , α_5 = , and the mass of each agent is assumed to be 1.
Choosing the control parameters κ_1 = , κ_2 = , κ_3 = ., κ_4 =  and κ_5 = , we obtain from condition (10) that the constraints on the input delays are: t_1 ∈ (0, .)(s), t_2 ∈ (0, )(s), t_3 ∈ (0, )(s), t_4 ∈ (0, .)(s) and t_5 ∈ (0, .)(s). With t_1 = (s), t_2 = .(s), t_3 = .(s), t_4 = (s) and t_5 = .(s), the agents in the system (6) converge to a stationary consensus (see Figure 4).

Figure 3: Digraph composed of 5 agents

Figure 4: Consensus convergence under input and communication delays (positions x_i and velocities v_i versus time)

6 Conclusions

In this paper, we investigate the consensus problem of second-order multi-agent systems with a velocity damping term in the agents' dynamics. Based on frequency-domain analysis, two sufficient decentralized consensus conditions are obtained. One condition is for the system with heterogeneous input delays based on an undirected and symmetric graph, and is dependent on the input delays. The other is for the system with both heterogeneous input delays and communication delays based on a general directed graph, and depends on the input delays only. Although the latter consensus condition is more conservative than the former, it can be applied to systems based on directed graphs with asymmetric coupling weights.

Acknowledgements

This work was supported by the Specialized Research Fund for the Doctoral Program of Higher Education of China (Grant No. 20090093120006).

Bibliography

[1] A. Jadbabaie, J. Lin, A.S. Morse, Coordination of groups of mobile autonomous agents using nearest neighbor rules, IEEE Transactions on Automatic Control, 48(6):988-1001, 2003.

[2] R. Olfati-Saber, R.
murray, consensus problems in networks of agents with switching topology and time-delays, ieee transactions on automatic control, 49(9):1520-1533, 2004.
[3] w. ren, e. atkins, distributed multi-vehicle coordinated control via local information exchange, international journal of robust and nonlinear control, 17(10-11):1002-1033, 2007.
[4] y. hong, l. gao, d. cheng, j. jiang, lyapunov-based approach to multiagent systems with switching jointly connected interconnection, ieee transactions on automatic control, 52(5):943-948, 2007.
[5] y.-p. tian, c.-l. liu, consensus of multi-agent systems with diverse input and communication delays, ieee transactions on automatic control, 53(9):2122-2128, 2008.
[6] w. wang, j.j.e. slotine, contraction analysis of time-delayed communications and group cooperation, ieee transactions on automatic control, 51(4):712-717, 2006.
[7] y.-g. sun, l. wang, consensus of multi-agent systems in directed networks with nonuniform time-varying delays, ieee transactions on automatic control, 54(7):1607-1613, 2009.
[8] j. wang, n. elia, consensus over network with dynamic channels, proc. of the 2008 american control conference, seattle, pp.2637-2642, 2008.
[9] m. cao, a.s. morse, b.d.o. anderson, reaching an agreement using delayed information, proc. of the 45th ieee conference on decision and control, san diego, pp.3375-3380, 2006.
[10] f. xiao, l. wang, asynchronous consensus in continuous-time multi-agent systems with switching topology and time-varying delays, ieee transactions on automatic control, 53(8):1804-1816, 2008.
[11] v.d. blondel, j.m. hendrickx, a. olshevsky, j.n. tsitsiklis, convergence in multi-agent coordination, consensus, and flocking, proc. of the 44th ieee conference on decision and control, seville, pp.2996-3000, 2005.
[12] v. gazi, stability of an asynchronous swarm with time-dependent communication links, ieee transactions on systems, man, and cybernetics-part b: cybernetics, 38(1):267-274, 2008.
[13] j. hu, y.
hong, leader-following coordination of multi-agent systems with coupling time delays, physica a, 374(2):853-863, 2007.
[14] h. su, x. wang, second-order consensus of multiple agents with coupling delay, proc. of the 7th world congress on intelligent control and automation, chongqing, pp.7181-7186, 2008.
[15] p. lin, y. jia, j. du, s. yuan, distributed consensus control for second-order agents with fixed topology and time-delay, proc. of the 26th chinese control conference, zhangjiajie, pp.577-581, 2007.
[16] w. yang, a.l. bertozzi, x. wang, stability of a second order consensus algorithm with time delay, proc. of the 47th ieee conference on decision and control, cancun, pp.2926-2931, 2008.
[17] d.j. lee, m.k. spong, agreement with non-uniform information delays, proc. of the 2006 american control conference, minneapolis, pp.756-761, 2006.
[18] s. kawamura, m. svinin (eds.), advances in robot control: from everyday physics to humanlike movements, berlin: springer-verlag, 2006, pp.107-134.
[19] p. lin, y. jia, consensus of second-order discrete-time multi-agent systems with nonuniform time-delays and dynamically changing topologies, automatica, 45(9):2154-2158, 2009.
[20] y.-p. tian, c.-l. liu, robust consensus of multi-agent systems with diverse input delays and asymmetric interconnection perturbations, automatica, 45(5):1347-1353, 2009.
[21] y.-p. tian, g. chen, stability of the primal-dual algorithm for congestion control, international journal of control, 79(6):662-676, 2006.
[22] r. pedrami, b.w. gordon, control and analysis of energetic swarm systems, proc. of the 2007 american control conference, new york, pp.1894-1899, 2007.
[23] i. lestas, g. vinnicombe, scalable robustness for consensus protocols with heterogeneous dynamics, proc. of the 16th ifac world congress, prague, 2005.
[24] z. lin, b. francis, m.
maggiore, necessary and sufficient graphical conditions for formation control of unicycles, ieee transactions on automatic control, 50(1):121-127, 2005.
[25] c.a. desoer, y.t. wang, on the generalized nyquist stability criterion, ieee transactions on automatic control, 25(2):187-196, 1980.
cheng-lin liu was born in china in 1981. he received his phd from southeast university in 2008, and since 2008 he has been a faculty member at the institute of automation, jiangnan university, china. his current research interests include internet congestion control and coordination control of multi-agent systems.
fei liu was born in china in 1965. he is a professor at the institute of automation, jiangnan university, china. his research interests include the theory and application of advanced process control, process monitoring and diagnosis of industrial systems, and integrated automation systems for fermentation processes.
international journal of computers communications & control issn 1841-9836, 11(4):567-579, august 2016.
an abnormal network traffic detection algorithm based on big data analysis
h.p. yao, y.q. liu, c. fang
haipeng yao* 1. state key laboratory of networking and switching technology, beijing university of posts and telecommunications, no 10, xitucheng road, haidian district, beijing, prc 2. beijing advanced innovation center for future internet technology, beijing university of technology, 100 ping le yuan, chaoyang district, beijing, prc *corresponding author: yaohaipeng@bupt.edu.cn
yiqing liu state key laboratory of networking and switching technology, beijing university of posts and telecommunications, no 10, xitucheng road, haidian district, beijing, prc colin617@qq.com
chao fang 1. beijing advanced innovation center for future internet technology, beijing university of technology, 100 ping le yuan, chaoyang district, beijing, prc fangchao.bupt@gmail.com 2.
college of electronic information and control engineering, beijing university of technology, 100 ping le yuan, chaoyang district, beijing, prc
abstract: anomaly network detection is a very important way to analyze and detect malicious behavior in networks. how to effectively detect anomalous network flows under the pressure of big data is a very important area, which has attracted more and more researchers' attention. in this paper, we propose a new model based on big data analysis, which can avoid the influence brought by adjustment of the network traffic distribution, increase detection accuracy and reduce the false negative rate. simulation results reveal that, compared with the k-means, decision tree and random forest algorithms, the proposed model has a much better performance, achieving a detection rate of 95.4% on normal data, 98.6% on dos attacks, 93.9% on probe attacks, 56.1% on u2r attacks, and 77.2% on r2l attacks.
keywords: anomaly traffic detection, big data, k-means, decision tree, random forest.
1 introduction
with the improvement of network, storage, calculation and transmission capabilities, the internet is interacting more closely with people than ever before. while the internet is making our lives more convenient, it brings about some potential risks. for example, malicious attacks involving user privacy and security become more and more frequent. the change in how people use the internet is a new challenge for traditional abnormal network event detection techniques. it is harder for researchers to become aware of some new kinds of attacks. to resolve these problems, some abnormal network traffic detection methods have been proposed.
copyright © 2006-2016 by ccc publications
traditional abnormal traffic detection methods can be classified into two categories [1–3]. one is misuse detection, and the other is anomaly detection. the two methods have their own pros and cons.
misuse detection has high accuracy but needs support from known knowledge. anomaly detection does not need known knowledge, but it cannot categorize the type of attack, and its accuracy is also lower. for example, hari om [4] designs a hybrid detection system, which is a hybrid anomaly detection system combining the k-means, k-nearest neighbor and naïve bayes methods. however, the explosive increase of network traffic has directly or indirectly pushed the internet into the big data era, which makes anomaly traffic detection more difficult to deal with because of the high calculation volume and the constant changes of network data distribution caused by big data [5–8]. because network data is generated quickly, the volumes of normal traffic and abnormal traffic differ a lot, and the distribution of the data changes. besides, with big data, the difference between normal traffic and abnormal traffic is increasing. this makes traditional methods unable to effectively detect abnormal traffic. therefore, to increase the accuracy of abnormal traffic detection and avoid the loss caused by false negative detections, we propose a novel model based on big data analytics, which can avoid the influence brought by adjustment of the network traffic distribution, increase detection accuracy and reduce the false negative rate. the core of the proposed model is not simply a combination of traditional detection methods, but a novel detection model based on big data. in the simulation, we use the k-means, decision tree and random forest algorithms as comparative objects to verify the effectiveness of our model. simulation results reveal that the proposed model has a much better performance, achieving a detection rate of 95.4% on normal data, 98.6% on dos attacks, 93.9% on probe attacks, 56.1% on u2r attacks, and 77.2% on r2l attacks. the rest of this paper is organized as follows. in section 2, related work is presented. the system model is given in section 3.
simulation results are presented and discussed in section 4. finally, we conclude this study in section 4.3.
2 related work
2.1 k-means
k-means is a classic clustering algorithm [9, 10], which uses a simple iterative procedure to cluster the data set into a certain number of categories. commonly, the number of clusters is denoted by k. the four steps of k-means are:
1. initialization: randomly select k data points from the data set as the centers of the k clusters;
2. distribution: assign each point in the data set to the nearest center;
3. update: calculate new centers according to the cluster assignment; the new center is the average of all the points in a cluster;
4. repeat: repeat these steps until no center is updated in a round, at which point the clustering has converged.
k-means needs the number of clusters k to be specified. if k is not chosen properly, it will lead to an improper classification result, so choosing a proper cluster number is crucial to the result of k-means. another disadvantage of k-means is that it can only use euclidean distance. even though euclidean distance is convenient to calculate, it cannot take the differences between features into consideration: it treats all features the same, which in reality will sometimes lead to poor performance. nevertheless, k-means has its own advantages when dealing with big data:
1. k-means is simple. its time complexity is o(n^(dk+1) log n), so it can be fast when the number of clusters and the number of features are small;
2. k-means adjusts well to big data sets and has high performance.
2.2 decision tree
the decision tree [9] is a common algorithm used in machine learning. a complete decision tree is composed of three kinds of elements:
1. decision nodes, indicating which feature is used in a split;
2. chance nodes, indicating the possible values of each feature;
3.
leaf nodes, indicating which category the record is in.
there are two steps needed to use a decision tree:
1. tree generation: generate a tree according to the training set. it is necessary to determine which feature is used in each split, and which category each result belongs to.
2. classification: classify new records starting from the root of the decision tree; compare the record with each decision node and move to the corresponding branch according to the result. repeat this process, and when a record reaches a leaf node, the category of the leaf node is the category of the record.
quinlan proposed the c4.5 algorithm in [11], which is a well-known decision tree algorithm. the main idea is to generate the decision tree from root to leaf so as to reduce the level of uncertainty. the algorithm can be described as follows. the gain ratio is the index c4.5 uses to select features. let a_k be a feature in the feature set and t the training set. the information gain is defined as:
gain(t, a_k) = info(t) − info_{a_k}(t) (1)
where
info(t) = − ∑_{i=1}^{n} (freq(c_i, t)/|t|) log2 (freq(c_i, t)/|t|) (2)
info_{a_k}(t) = ∑_{a ∈ d(a_k)} (|t_a|/|t|) info(t_a) (3)
freq(c_i, t) is the number of records in t that belong to class c_i, t_a denotes the subset of t in which feature a_k takes the value a, and d(a_k) is the domain of a_k. splitinfo is defined as:
splitinfo(t, a_k) = − ∑_{a ∈ d(a_k)} (|t_a|/|t|) log2 (|t_a|/|t|) (4)
and the gain ratio is
gainratio(t, a_k) = gain(t, a_k) / splitinfo(t, a_k) (5)
the advantages of decision trees are:
1. the generated tree is easy to understand and easy to explain;
2. it performs well when dealing with large data sets.
2.3 random forest
the random forest algorithm [9, 12] is a classification algorithm that contains multiple decision trees, where each tree has a vote, and the result is the one with the highest vote. when generating a single decision tree, feature selection and pruning can be used to avoid overfitting, but when the number of features is large, these problems can hardly be avoided.
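the gain-ratio computation in (1)–(5) can be sketched in python. a minimal illustration on a toy data set; representing `records` as (feature value, class) pairs is our assumption, not the paper's code:

```python
import math
from collections import Counter

def entropy(labels):
    """info(T): Shannon entropy of the class labels, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def gain_ratio(records):
    """C4.5 gain ratio for one feature; records are (feature_value, class) pairs."""
    n = len(records)
    labels = [cls for _, cls in records]
    # info_A(T): weighted entropy of the subsets induced by the feature values
    subsets = {}
    for val, cls in records:
        subsets.setdefault(val, []).append(cls)
    info_a = sum(len(s) / n * entropy(s) for s in subsets.values())
    gain = entropy(labels) - info_a                       # (1)
    split_info = entropy([val for val, _ in records])     # (4): entropy of the split itself
    return gain / split_info if split_info > 0 else 0.0   # (5)

# a perfectly informative binary feature gives gain ratio 1.0
print(gain_ratio([("a", 0), ("a", 0), ("b", 1), ("b", 1)]))  # → 1.0
```

dividing by splitinfo, rather than using the raw gain, penalizes features with many distinct values, which is the point of the gain ratio in c4.5.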
random forest consists of multiple decision trees, which can effectively avoid those problems. random forest has the following advantages:
1. it can be used in various situations with a pretty high classification accuracy;
2. it can effectively support multi-feature situations without feature selection;
3. it can report the importance distribution of the features.
3 system model
influenced by big data, the network data distribution is gradually changing. this paper tries to solve the problem caused by the increasing difference between normal traffic and abnormal traffic. therefore, we propose a new abnormal traffic detection model based on big data analysis, and this model includes three sub-models.
3.1 normal traffic selection model
the normal traffic selection model uses classification and clustering algorithms to distinguish normal and anomalous behaviors, rather than identifying specific anomalous behaviors. this model includes two stages:
1. training stage: the model is trained using data labeled normal or abnormal, and the trained model is applied in the test stage.
2. test stage: the test stage is similar to detection in practice. using unlabeled data, the model classifies traffic data into normal or abnormal, and labels it.
the normal traffic selection model uses the k-means clustering algorithm and the knn, decision tree and random forest classification algorithms. traditionally, before using k-means, it is very important to set the number of categories, because we do not know how many categories there are. but in order to distinguish normal and abnormal behavior, the normal traffic selection model uses k-means in the following way. in the training stage, the labeled information is used to divide the data into normal and abnormal. k-means is then applied to these two categories separately instead of clustering all the data at once, obtaining cluster centers for each category. then, using these centers, a knn classifier labels the test data.
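the way the normal traffic selection model uses k-means can be sketched as follows. this is a simplified illustration with made-up 2-d points; classification uses the single nearest center (a simplification of the knn step), and the function names are ours, not the paper's:

```python
import math
import random

def kmeans(points, k, iters=50, seed=0):
    """Plain k-means: returns k centers for a list of 2-d points."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    for _ in range(iters):
        # distribution step: assign each point to its nearest center
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k), key=lambda j: math.dist(p, centers[j]))
            clusters[i].append(p)
        # update step: each new center is the mean of its cluster
        centers = [
            tuple(sum(c) / len(cl) for c in zip(*cl)) if cl else centers[i]
            for i, cl in enumerate(clusters)
        ]
    return centers

# training stage: cluster normal and abnormal records *separately*
normal = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2), (0.0, 0.0)]
abnormal = [(5.0, 5.1), (5.2, 4.9), (4.9, 5.0), (5.1, 5.2)]
centers = [(c, "normal") for c in kmeans(normal, 2)] + \
          [(c, "abnormal") for c in kmeans(abnormal, 2)]

# test stage: label each test record with the class of its nearest center
def classify(p):
    return min(centers, key=lambda cp: math.dist(p, cp[0]))[1]

print(classify((0.1, 0.1)), classify((5.0, 5.0)))  # → normal abnormal
```

clustering the two classes separately avoids having to guess a global number of categories: each class only needs enough centers to describe its own shape.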
the decision tree and random forest classification algorithms are trained with the labeled normal and abnormal data.
3.2 abnormal traffic selection model
the purpose of the abnormal traffic selection model is to avoid the influence caused by normal traffic greatly outnumbering abnormal traffic. this model classifies anomalous traffic into specific categories, and also includes two stages:
1. training stage: this stage only uses abnormal data to train the classification model, and every record is labeled with a specific attack group. classification algorithms learn the classification rules.
2. test stage: the test stage is similar to detection in practice, using unlabeled data (including normal behavior data). the classification model classifies anomalous traffic into specific categories according to the classification rules, and gives a specific label to every record.
table 1: distribution of kddcup99 data set
data set | normal | dos | probe | r2l | u2r
10 percent of training data set | 97278 | 391458 | 4107 | 1126 | 52
test data set | 60593 | 229853 | 4166 | 16189 | 228
the abnormal traffic selection model uses the decision tree and random forest classification algorithms. the abnormal traffic selection model and the normal traffic selection model are independent, without order of priority in the training stage or the test stage.
the mixed compensation model combines the results from the normal traffic selection model and the abnormal traffic selection model to produce a final result. although the abnormal traffic selection model is more effective because it is free of the influence of normal traffic data, this same characteristic gives it a high false negative rate. therefore, the normal set n produced by the normal traffic selection model is used to compensate the abnormal set a = {a1, a2, ..., ak} produced by the abnormal traffic selection model, where ai, i ∈ [1, k], denotes a specific attack category. if c denotes a detection result, the rule of compensation is as follows:
if c ∈ ai and c ∈ n, then c ∈ n; if c ∈ ai and c ∉ n, then c ∈ ai. (6)
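the compensation rule can be sketched directly in code. a minimal illustration; the record ids and variable names are ours:

```python
def mixed_compensation(attack_label, normal_set, record):
    """Rule (6): a record flagged as attack A_i by the abnormal traffic
    selection model keeps its attack label only if the normal traffic
    selection model did NOT place it in the normal set N."""
    return "normal" if record in normal_set else attack_label

# the abnormal model labels records 1-3 as "dos"; the normal model
# independently decides that record 1 is normal
normal_set = {1}
print([mixed_compensation("dos", normal_set, r) for r in (1, 2, 3)])
# → ['normal', 'dos', 'dos']
```

the normal model thus acts as a veto on the attack labels, which is how the compensation lowers the false positive cost of training the abnormal model without normal data.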
4 simulation results and discussions
before using the three sub-models of anomaly detection based on big data analysis, the data set needs to be preprocessed with labels for training the model. it should be noted that properly selecting features is a good way to reduce the dimension and increase the running efficiency. in the simulation, three different algorithms are used to verify the validity of the proposed model.
4.1 data set
in the simulation, we use the kddcup99 [13] data set to test our model. the kddcup99 data set is widely used for testing abnormal detection models, and is obtained and processed from the 1998 darpa intrusion detection evaluation [14]. the kddcup99 data set has 41 features, sorted into three groups: basic features, content features and time features [15]. the distribution of the data set is shown in table 1, where the training data has 5 million records, 10 percent of the training data has 494021 records, and the test data has 311029 records. every record is labeled normal or abnormal, and abnormal data can be classified into four groups: dos, u2r, r2l and probe. from table 1, we find that the proportions of normal and abnormal data differ between the training data set and the test data set. therefore, this data set can be used to test the performance of the proposed model under different circumstances.
4.2 simulation results
as shown in table 2, we have done eight experiments with the model based on big data analysis, and three control experiments which used k-means, decision tree or random forest respectively. in the control groups, the classification model is trained on the whole training data set with five categories, and then classifies the test data into five categories. another control group is the winner of kddcup99. in the simulation, prediction accuracy is used as the metric of detection effect, which is shown in table 3. besides, we adopt a way of sorting and grading for every type. for example, all experiments are sorted by the prediction accuracy on normal; the first is graded 1 point, the second 2 points, and so on. finally, the sum of the grades over the five groups is the final grade.
table 2: number of experiments
no. | normal traffic selection model | abnormal traffic selection model
1 | k-means1* | random forest
2 | k-means1* | decision tree
3 | k-means2* | random forest
4 | k-means2* | decision tree
5 | decision tree | decision tree
6 | decision tree | random forest
7 | random forest | decision tree
8 | random forest | random forest
no. of control group | algorithm
9 | k-means
10 | decision tree
11 | random forest
12 | winner of kddcup99
*note: in the normal traffic selection model, the numbers of clusters for normal and abnormal are 4 and 30 respectively in k-means1, and 100 and 300 respectively in k-means2.
table 3: prediction accuracy
no. | experiment | normal | dos | probe | u2r | r2l
1 | k-means1+random forest | 0.632 | 0.814 | 0.939 | 0.561 | 0.679
2 | k-means1+decision tree | 0.656 | 0.791 | 0.878 | 0.500 | 0.772
3 | k-means2+random forest | 0.945 | 0.983 | 0.910 | 0.513 | 0.510
4 | k-means2+decision tree | 0.946 | 0.979 | 0.852 | 0.500 | 0.504
5 | decision tree+decision tree | 0.951 | 0.984 | 0.829 | 0.500 | 0.512
6 | decision tree+random forest | 0.951 | 0.986 | 0.831 | 0.550 | 0.517
7 | random forest+decision tree | 0.954 | 0.980 | 0.861 | 0.500 | 0.521
8 | random forest+random forest | 0.952 | 0.985 | 0.872 | 0.520 | 0.510
9 | k-means | 0.938 | 0.968 | 0.785 | 0.500 | 0.528
10 | decision tree | 0.951 | 0.983 | 0.793 | 0.500 | 0.500
11 | random forest | 0.952 | 0.985 | 0.875 | 0.522 | 0.507
12 | winner of kddcup99 | 0.995 | 0.971 | 0.833 | 0.132 | 0.084
as shown in table 4, the experiment groups and the winner of kddcup99 are sorted by final grade. while the latter has a high detection rate on normal data, for the four attack types the results of the model based on big data analysis are better than those of the winner of kddcup99. the algorithm of the winner of kddcup99 is the c5 decision tree [16–19]. the training data of the winner of kddcup99 is a little different from our experiments.
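the sorting-and-grading scheme used for tables 4, 5 and 10 can be sketched as follows. a minimal illustration over a subset of the accuracies in table 3; breaking ties by list order is our assumption:

```python
def rank_sum(accuracies):
    """accuracies: {experiment: {class: accuracy}}. For every class,
    experiments are ranked by accuracy (best = 1 point); the final
    grade is the sum of ranks over the classes, and the lowest
    final grade wins."""
    classes = next(iter(accuracies.values())).keys()
    grades = {e: 0 for e in accuracies}
    for cls in classes:
        ordered = sorted(accuracies, key=lambda e: -accuracies[e][cls])
        for rank, e in enumerate(ordered, start=1):
            grades[e] += rank
    return sorted(grades.items(), key=lambda kv: kv[1])

# three experiments from table 3, two attack classes only, for illustration
table3 = {
    "no.8":  {"dos": 0.985, "probe": 0.872},
    "no.9":  {"dos": 0.968, "probe": 0.785},
    "no.11": {"dos": 0.985, "probe": 0.875},
}
print(rank_sum(table3))
```

note that rank-sum grading rewards consistency across all classes, which is why a model with no single best score can still rank first overall.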
thus, to evaluate the detection effect of the proposed model, we did three control experiments with the same training data and test data, using k-means, decision tree and random forest respectively. these experiments are numbered 11, 10 and 9. the sorting result shows that the detection effect of the algorithms used within the proposed model is better than that of the same algorithms used alone, as shown in table 5. we will discuss the experimental results, comparing no.8 with no.11, no.7 with no.5, and no.3 with no.4.
discussing the results of no.8 and no.11: the scores of the top three are the same. judging no.8 and no.11 by final grade, the detection results of the two experiments are almost the same, and both of them use the random forest algorithm. but the differences are:
1. the importance of the variables used in classifying is different;
2. no.8 has a lower false negative rate.
table 4: compared with winner of kddcup99
no. | experiment | normal | dos | probe | u2r | r2l | final score | rank
8 | random forest+random forest | 3 | 2 | 2 | 2 | 4 | 13 | 1
6 | decision tree+random forest | 4 | 1 | 6 | 1 | 2 | 14 | 2
7 | random forest+decision tree | 2 | 5 | 3 | 4 | 1 | 15 | 3
2 | k-means2+random forest | 7 | 4 | 1 | 3 | 5 | 20 | 4
5 | decision tree+decision tree | 4 | 3 | 7 | 6 | 3 | 23 | 5
4 | k-means2+decision tree | 6 | 6 | 4 | 5 | 6 | 27 | 6
12 | winner of kddcup99 | 1 | 7 | 5 | 7 | 7 | 27 | 6
table 5: compared with control group
no. | experiment | normal | dos | probe | u2r | r2l | final score | rank
6 | decision tree+random forest | 4 | 1 | 6 | 1 | 3 | 15 | 1
8 | random forest+random forest | 2 | 2 | 3 | 3 | 5 | 15 | 1
11 | random forest | 2 | 2 | 2 | 2 | 7 | 15 | 1
7 | random forest+decision tree | 1 | 7 | 4 | 5 | 2 | 19 | 4
3 | k-means2+random forest | 8 | 5 | 1 | 4 | 5 | 23 | 5
5 | decision tree+decision tree | 4 | 4 | 7 | 5 | 4 | 24 | 6
10 | decision tree | 4 | 5 | 8 | 5 | 9 | 31 | 7
9 | k-means | 9 | 9 | 9 | 5 | 1 | 33 | 8
4 | k-means2+decision tree | 7 | 8 | 5 | 5 | 8 | 33 | 8
• importance of variables: as shown in fig. 1, the variables chosen by random forest in no.8 and no.11 are different. the random forest algorithm can output the importance of the variables, denoted by the gini index [9]. fig.
1 shows the top 20 most important variables; variables nearer the top have higher values and are more important. in no.8, the ranking of the variables differs between the normal traffic selection model and the abnormal traffic selection model. this means that the variables used for predicting normal versus abnormal and those used for predicting a specific attack are different. therefore, the choice of variables in no.11 is influenced by both sides, and a compromised result is output when choosing variables; that is why the prediction of the model in no.11 has a deviation.
• comparison of false negative rates: in order to evaluate the effect on predicting abnormal behavior, the false negative rate is used as an important index, which measures how many attack events are omitted. table 6 shows the confusion matrices of the results of experiments no.8 and no.11 when using random forest. rows express prediction information, and columns express actual information. the false negative rate of no.8 on the normal type is very low, but high on the u2r and r2l types. in no.8, the false negative rate of the normal selection model on normal is low. without the influence of normal training data, the false negative rates of the abnormal selection model on the four specific attack types are lower than in no.11.
discussing the results of no.5 and no.7: no.5 and no.7 compare with no.6 and no.8 respectively, using the same algorithm in the normal traffic selection model, and their ranks are lower when using decision tree in the abnormal traffic
figure 1: importance of variables in random forest ((a) no.11, (b) no.8 normal traffic selection model, (c) no.8 abnormal traffic selection model).
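the gini index that random forest reports as variable importance is built from the impurity decrease of each split; a minimal sketch with toy labels (the function names are ours):

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 1 - sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def impurity_decrease(parent, left, right):
    """Decrease in Gini impurity produced by one split. Summing these
    decreases over all splits on a variable, and averaging over the
    trees, gives the forest's importance score for that variable."""
    n = len(parent)
    return (gini(parent)
            - (len(left) / n) * gini(left)
            - (len(right) / n) * gini(right))

parent = ["normal"] * 5 + ["dos"] * 5
print(impurity_decrease(parent, ["normal"] * 5, ["dos"] * 5))  # → 0.5
```

a split that perfectly separates the two classes removes all of the parent's impurity, so the variable that produced it collects the full decrease as importance.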
selection model.
table 6: confusion matrices (rows = prediction, columns = actual)
no.11
prediction | normal | dos | probe | u2r | r2l
normal | 60287 | 5967 | 847 | 159 | 15839
dos | 69 | 223814 | 191 | 8 | 0
probe | 233 | 72 | 3128 | 50 | 104
u2r | 1 | 0 | 0 | 10 | 5
r2l | 3 | 0 | 0 | 1 | 241
false negative | 0.00505 | 0.026273 | 0.24916 | 0.95614 | 0.985113
no.8 normal traffic selection model
prediction | normal | abnormal
normal | 60289 | 22853
abnormal | 304 | 227583
false negative | 0.005017 | 0.091253
no.8 abnormal traffic selection model
prediction | dos | probe | u2r | r2l
dos | 229231 | 769 | 20 | 4693
probe | 297 | 3393 | 135 | 5646
u2r | 0 | 0 | 39 | 32
r2l | 325 | 4 | 34 | 5818
false negative | 0.002706 | 0.18555 | 0.828947 | 0.64062
table 7: confusion matrix of abnormal traffic selection model with decision tree
prediction | dos | probe | u2r | r2l
dos | 227792 | 589 | 34 | 6245
probe | 1434 | 3192 | 20 | 283
u2r | 0 | 0 | 0 | 0
r2l | 627 | 385 | 174 | 9661
table 7 is the confusion matrix of the abnormal traffic selection model with the decision tree algorithm. it shows that u2r cannot be detected and that the false negative rate of r2l is higher. to find the reason, the classification tree is checked in fig. 2: the classification model prefers dos and probe attacks, then r2l attacks, and has no result node for u2r attacks. the distribution of the training data, shown in fig. 3, can explain this phenomenon. when generating a decision tree, the obtained information will favor features that have more samples. therefore, if the numbers of training records in the groups differ greatly, an efficient classification model cannot be obtained for the small-sample groups. moreover, when the numbers of training records are comparatively equal, the classification result is better, such as in no.6, when the normal traffic selection model uses decision tree.
discussing the results of no.3 and no.4: no.3 and no.4 use k-means in the normal traffic selection model to choose the clustering centers. table 8 shows the final prediction accuracies of no.3 and no.4.
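the false negative rates reported in table 6 follow from treating rows as predictions and columns as actual classes; a minimal sketch (the variable names are ours), reproducing the no.11 "normal" rate as a check:

```python
def false_negative_rate(matrix, classes, cls):
    """matrix[i][j] = count predicted as classes[i] whose actual class
    is classes[j]. The false negative rate of cls is the fraction of
    actual-cls records that were NOT predicted as cls."""
    j = classes.index(cls)
    actual_total = sum(row[j] for row in matrix)
    missed = actual_total - matrix[j][j]
    return missed / actual_total

# confusion matrix of experiment no.11 (rows = prediction, columns = actual)
classes = ["normal", "dos", "probe", "u2r", "r2l"]
matrix = [
    [60287, 5967, 847, 159, 15839],
    [69, 223814, 191, 8, 0],
    [233, 72, 3128, 50, 104],
    [1, 0, 0, 10, 5],
    [3, 0, 0, 1, 241],
]
print(round(false_negative_rate(matrix, classes, "normal"), 5))  # → 0.00505
```

the column sums also recover the actual class counts of the test set (e.g. 60593 normal records), which confirms how the matrix is oriented.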
because the final results are lower than those of the normal traffic selection model or the abnormal traffic selection model alone, we find that this problem is caused by using k-means in the normal selection model.
figure 2: classification tree of abnormal traffic selection model.
figure 3: distribution of training data.
table 8: accuracy of no.3 and no.4
no. | model | algorithm | accuracy
no.3 | normal traffic selection | k-means | 0.926
no.3 | abnormal traffic selection | random forest | 0.948
no.3 | mixed compensation model | | 0.923
no.4 | normal traffic selection | k-means | 0.925
no.4 | abnormal traffic selection | decision tree | 0.961
no.4 | mixed compensation model | | 0.918
table 9: confusion matrix of normal traffic selection model of no.3 and no.4
no. | prediction | normal | abnormal
no.3 | normal | 59189 | 21663
no.3 | abnormal | 1404 | 228773
no.4 | normal | 59428 | 22221
no.4 | abnormal | 1165 | 228215
table 9 shows the confusion matrices of the normal traffic selection model in no.3 and no.4. many abnormal records are predicted as normal, which causes a high false negative rate. therefore, many abnormal records predicted by the abnormal traffic selection model will be regarded as normal after the mixed compensation model. nowadays, many novel attacks are unknown to researchers, and many attacks will be disguised as normal. it is very dangerous to have a high false negative rate, and this does not fit the proposed model. because the effect of k-means is strongly correlated with the number of cluster centers, we can fine-tune the strength of clustering and lower the false negative rate to establish a strict normal selection model. in no.3 and no.4, the numbers of centers for normal traffic and attacks are 100 and 300, respectively. although this can achieve a good overall accuracy, its false negative rate is higher than that of the other models. however, according to table 10, by choosing 4 and 30 centers in no.1 and no.2, a lower false negative rate is obtained, and only the four kinds of attacks are classified.
besides, a strict normal detection model is established. by adjusting the parameters and reducing the false negative rate in no.1 and no.2, we find that the rank increases rapidly compared with no.3 and no.4. especially when k-means is combined with random forest, it has a very high accuracy on probe, u2r and r2l attacks. therefore, we can draw the conclusion that, by adjusting the parameters of k-means, the strength of abnormal traffic detection can be controlled through the strength of normal traffic identification.
4.3 summary
based on the results analyzed above, as shown in table 11, the following conclusions can be drawn:
1. the random forest classification algorithm can adapt to the change of the distribution of network data, and using this algorithm within the proposed model can reduce the false negative rate.
2. if the numbers of training records in the different groups differ greatly, the classification model built by decision tree will prefer the attack types that have more training data, so we should avoid using decision tree in the abnormal traffic selection model. however, in the normal traffic selection model, the difference between the groups is comparatively small. in this
table 10: results of experiments
no. | experiment | dos | probe | u2r | r2l | final | rank
1 | k-means1+random forest | 10 | 1 | 1 | 2 | 14 | 1
6 | decision tree+random forest | 1 | 8 | 2 | 5 | 16 | 2
8 | random forest+random forest | 2 | 5 | 4 | 7 | 18 | 3
11 | random forest | 2 | 4 | 3 | 9 | 18 | 3
3 | k-means2+random forest | 5 | 2 | 5 | 7 | 19 | 5
2 | k-means1+decision tree | 11 | 3 | 6 | 1 | 21 | 6
7 | random forest+decision tree | 7 | 6 | 6 | 4 | 23 | 7
5 | decision tree+decision tree | 4 | 9 | 6 | 6 | 25 | 8
9 | k-means | 9 | 11 | 6 | 3 | 29 | 9
4 | k-means2+decision tree | 8 | 7 | 6 | 10 | 31 | 10
10 | decision tree | 5 | 10 | 6 | 11 | 32 | 11
table 11: summary of model
| model 1 | model 2 | model 3
normal traffic selection model | k-means1 | decision tree | random forest
abnormal traffic selection model | random forest | random forest | random forest
situation, using decision tree can quickly obtain a classification model, and the results have higher accuracy.
3. there will be more and more unknown abnormal events in the future. in order to avoid the loss caused by false negative predictions, we can change the number of clusters in the normal traffic selection model with the k-means algorithm to reduce the false negative rate and increase the accuracy of detecting abnormal events.
conclusion
with the change of the distribution of network data, traditional anomaly traffic detection techniques cannot fit this situation. in order to solve the problem, we propose an anomaly traffic detection model based on big data analysis. simulation results show that the proposed model achieves a detection rate of 95.4% on normal data, 98.6% on dos attacks, 93.9% on probe attacks, 56.1% on u2r attacks, and 77.2% on r2l attacks. therefore, the model can increase the accuracy of detecting attack behavior and reduce the false negative rate.
acknowledgment
this work was supported by nsfc (61471056) and the china jiangsu future internet research fund (by2013095-3-1, by2013095-3-03).
bibliography
[1] patcha, a.; park, j.m. (2007); an overview of anomaly detection techniques: existing solutions and latest technological trends, computer networks, issn 1389-1286, 51(12): 3448-3470.
[2] lazarevic, a.; kumar, v.; srivastava, j.
(2005); intrusion detection: a survey, managing cyber threats, issn 0924-6703, 5: 19-78. an abnormal network traffic detection algorithm based on big data analysis 579 [3] axelsson, s. (1998); research in intrusion-detection systems: a survey, department of computer engineering, chalmers university of technology, goteborg. sweden, technical report 98-17. [4] om, h.; kundu, a. (2012); a hybrid system for reducing the false alarm rate of anomaly intrusion detection system, ieee 1st international conference on recent advances in information technology (rait), isbn 978-1-4577-0694-3, 131-136. [5] kaisler, s. et al (2013); big data: issues and challenges moving forward, ieee 46th hawaii international conference on system sciences (hicss), issn 1530-1605, 995-1004. [6] michael, k.; miller, k.w. (2013); big data: new opportunities and new challenges, computer, issn 0018-9162, 46(6):22-24. [7] russom, p. et al (2011); big data analytics, tdwi best practices report, fourth quarter. [8] fan, w.; bifet, a. (2013); mining big data: current status, and forecast to the future, acm sigkdd explorations newsletter, issn 1931-0145, 14(2): 1-5. [9] james, g. et al (2013); an introduction to statistical learning, springer, issn 1431-875x. [10] guan, y.; ghorbani, a.a.; belacel, n. (2003); y-means: a clustering method for intrusion detection, ieee canadian conference on electrical and computer engineering, issn 08407789, 2:1083-1086. [11] quinlan, j.r. (1993); c4.5: programs for machine learning, morgan kaufmann publishers inc., isbn 1558602402. [12] elbasiony, r.m. et al (2013); a hybrid network intrusion detection framework based on random forests and weighted k-means, ain shams engineering journal, issn 2090-4479, 4(4): 753-762. [13] kdd cup 1999, http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html. may 2015 [14] lippmann, r.p. 
et al (2000); evaluating intrusion detection systems: the 1998 darpa offline intrusion detection evaluation, ieee proceedings of darpa information survivability conference and exposition (discex), isbn 0-7695-0490-6, 2:12-26. [15] tavallaee, m. et al (2009); a detailed analysis of the kdd cup 99 data set, proceedings of the second ieee symposium on computational intelligence for security and defence applications (cisda), isbn 978-1-4244-3763-4, 1-6. [16] pfahringer, b. (2000); winning the kdd99 classification cup: bagged boosting, acm sigkdd explorations newsletter, issn 1931-0145, 1(2): 65-66. [17] yu, g. d. et al (2014); multi-objective rescheduling model for product collaborative design considering disturbance, international journal of simulation modelling, issn 1726-4529, 13(4): 472-484. [18] gusel, l. r. et al (2015); genetic based approach to predicting the elongation of drawn alloy, international journal of simulation modelling, issn 1726-4529, 14(1): 39-47. [19] prasad, k. et al (2016); a knowledge-based system for end mill selection, advances in production engineering & management, issn 1856-6250, 11(1): 15-28. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. v (2010), no. 2, pp. 251-260 implementing web services using java technology m. pirnau mironela pirnau “titu maiorescu” university bucharest romania, 22 dambovnicului street, district 4, 040051, bucharest, romania e-mail: mironela.pirnau@utm.ro abstract: there are several essential activities that need to take place in any service-oriented environment such as: a web service has to be created, to have its interfaces and invocation methods defined, its service has to be published to one or more intranet or internet repositories to locate potential users. a web service needs to be located in order to invoke potential different users. 
The technologies offered by web services enable us to wrap older applications so that we can reuse them, together with the packages already existing in an enterprise. The infrastructure associated with older applications can likewise be wrapped as a set of services. In the following pages I present the general characteristics of web services to date, as well as the use of Java technologies for the implementation of web services. The present paper aims at a systematisation of the main web services, as well as highlighting and testing the most important technologies used to develop them. Keywords: web service, Java, XML, SOAP and semantic web.

1 Introduction

Web services are software applications that conform to the standards of the Web Services Interoperability Organization. They are becoming more and more popular and embody a model in which discrete tasks within e-business processes are widely distributed throughout a value net. The label "web services" has two levels of meaning, one specific and one conceptual. Specifically, web services are standards that describe service-oriented, component-based application architecture: reusable software components with included functionality [11, 12, 16]. Web services use standard XML languages, use HTTP for sending messages, and are independent of platform and language. They are loosely coupled systems: a client may know nothing about a web service until the moment it invokes it, which makes them well suited to serving application requests on an internet-wide scale [22].

2 Web Services Structure

Web services systems maintain a strong decoupling and dynamic binding of their elements. All the elements in a system represent and publish an application programming interface (API) to the other components in the network that work together. Services are composed into applications by employing service discovery for the dynamic binding of the partners.
Web services thus exhibit a service-oriented architecture: applications are built by finding and orchestrating available network services rather than by tight integration of individual instances. The web services architecture relies on three basic operations: publish, find and bind. Providers publish services to a service broker; requesters "find" the services they need through the broker and "bind" to them [1, 4]. The architecture of web services describes principles for building dynamic systems without a single point of control. There are many ways to instantiate a web service, choosing various implementation techniques for the roles and operations of the web services structure. Among the components of this structure we may mention the service processes, which involve more than one web service; discovery belongs to this part of the architecture, as it allows us to locate one particular service from among a collection of web services. The architecture of web services involves many layers and associated technologies [22], as we can see in Figure 1. Web services can also support other network protocols and distributed computing technologies. An implementation following the SOA principles is based on the following protocols: HTTP, SOAP, WSDL, and UDDI [17].

Copyright © 2006-2010 by CCC Publications

HTTP [3] is the most widely used internet protocol and the leading method of transmitting information on the web; it is a request-response protocol between clients and servers. The Simple Object Access Protocol (SOAP) is a format used to transmit XML messages [1, 2]; it is a communication protocol between applications over the internet, independent of the platform and of the language used, and it can pass through firewalls.
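To make the "XML messages over HTTP" point concrete, a typical SOAP 1.1 request looks like the following (the host, operation and namespace names are illustrative only, patterned on the classic stock-quote example from the SOAP specification):

```xml
POST /StockQuote HTTP/1.1
Host: www.example.org
Content-Type: text/xml; charset="utf-8"
SOAPAction: "http://www.example.org/GetLastTradePrice"

<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">
  <soap:Header/>
  <soap:Body>
    <m:GetLastTradePrice xmlns:m="http://www.example.org/stock">
      <m:symbol>IBM</m:symbol>
    </m:GetLastTradePrice>
  </soap:Body>
</soap:Envelope>
```

The server's reply is an HTTP response carrying a SOAP envelope of the same shape, which is what lets the exchange traverse firewalls like ordinary web traffic.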
The Web Services Description Language (WSDL) is an XML language that specifies how a web service is accessed and allows the service to be described. UDDI (Universal Description, Discovery and Integration) is an XML-based, platform-independent registry system. UDDI is queried through SOAP messages that request a web service [10], and it also offers access to the WSDL document describing the service. Figure 1: the architecture layers of the web services. SOAP mainly specifies how requests to the server should be formatted and how the server can build its reply, and it is a popular choice for web services. During transport, messages are transmitted between servers and clients in a defined manner; the transport protocol of choice is HTTP. The stack depicted in Figure 2 enables web services to leverage the existing internet infrastructure, creating a low cost of entry to an omnipresent environment. The interoperability requirement does not compromise flexibility, since alternative and value-add technologies are provided [1]. One way of making a WSDL document available, called "direct publication", is for the service provider to send the WSDL document directly to a service requestor. The provider can also publish the WSDL document describing the service to a local WSDL registry, a private UDDI registry, or the public UDDI operator. Figure 2: the interoperable base web services stack.

3 Java Technologies

Web services are web-based enterprise applications that use XML-based transport protocols and open standards to exchange data with their clients [18]. Among the Java technologies we mention the following, presented in the table below [20].
Table 1: Java technologies

WSIT     Web Services Interoperability Technology
JAX-WS   The Java API for XML Web Services, package javax.xml.ws
JAX-RPC  The Java API for XML-based RPC
JAXP     The Java API for XML Processing; provides a Java interface to DOM, SAX, and XSLT
JAXB     Java Architecture for XML Binding, package javax.xml.bind
SAAJ     The SOAP with Attachments API for Java, package javax.xml.soap
JAXR     The Java API for XML Registries
JSTL     JavaServer Pages Standard Tag Library
JSR 181  Java Web Services Metadata, package javax.jws

As an example of these technologies I shall present SAAJ [22], the SOAP with Attachments API for Java, which provides a standard set of APIs for sending XML documents, including attachments, over the internet. It is similar to JAX-RPC but requires additional effort on both the client's and the server's side [15, 21]. The SAAJ APIs provide a standard means for handling SOAP messages: they can be used to create, inspect and alter SOAP messages. SAAJ also contains methods for creating and sending attachments with a SOAP message. Attachments may contain data in any format, not being limited to XML as in the case of SOAP message bodies; they can be used to send data that does not fit XML, such as images. The SAAJ API offers the AttachmentPart class to represent the attachment part of a SOAP message [16]. Every SOAPMessage object automatically has a SOAPPart object and its required sub-elements, but because AttachmentPart objects are optional, we must create and add them ourselves. SAAJ contains the API needed to create, populate and access SOAP messages, according to the SOAP and SOAP Attachments specifications, and to send non-provider-model messages [21]. We present the package relationships in Figure 3. Figure 3: SAAJ package relationships. SAAJ originated as part of JAXM (the Java API for XML Messaging), which was developed by a JSR expert group [1, 13, 21].
The final form of the specifications shows two different aspects: the core functionality for manipulating SOAP messages, and a higher-level messaging facility. SAAJ is a very low-level API, fully exposed to the messages exchanged between the web service and its clients [16]. An application that uses SAAJ needs to construct SOAP messages piece by piece and extract data from the reply messages, so using SAAJ demands considerably more work from the developer than JAX-RPC does. There are, however, a few cases in which it is better to use SAAJ and its relative JAXM instead of JAX-RPC [1, 7]. JAX-RPC can only be used when the client and the service are active at the same time and there is an open network path between them; if the service is not available at the moment the client calls it, JAX-RPC cannot be used [17, 20]. JAXM, by contrast, provides a reliable delivery service without requiring the client components to implement that reliability themselves. Both SAAJ and JAXM offer a wide range of options for handling XML-based messages. The API in the javax.xml.soap package makes it possible to: create a piece of XML; create, access, add to and modify parts of a SOAP message; create a point-to-point connection to a given endpoint; and create, add or modify SOAP fault information, etc. Table 2 summarises the javax.xml.soap package, which contains the API for building SOAP messages [4, 20]; the package is defined in the specification named SOAP with Attachments API for Java (SAAJ). To allow vendors to supply their own implementations, the entire SAAJ API consists of interfaces and abstract classes. In order to create a SOAP message, a MessageFactory object must be used.
MessageFactory, part of the javax.xml.soap package, is an abstract class; an instance is obtained with the newInstance method, declared as public static MessageFactory newInstance() throws SOAPException. The factory provides two methods for creating SOAP messages. The first is public SOAPMessage createMessage() throws SOAPException; the second, public SOAPMessage createMessage(MimeHeaders headers, InputStream in) throws SOAPException, is used to deserialize a message from an input stream.

Table 2: Classes and interfaces in the javax.xml.soap package

AttachmentPart         A single attachment to a SOAPMessage object
MessageFactory         A factory for creating SOAPMessage objects
MimeHeader             An object that stores a MIME header name and its value
SOAPConnection         A connection that a client can use for sending messages to a remote party (URL)
SOAPConnectionFactory  A factory for creating SOAPConnection objects
SOAPElementFactory     A factory for creating SOAPElements
SOAPFactory            A factory for creating the various objects in the SOAP XML tree
SOAPMessage            The root class for SOAP messages

The default message returned by MessageFactory's createMessage method contains a SOAPPart and no attachments. The SOAPPart contains a SOAPEnvelope object; inside the envelope are the header, an object implementing the SOAPHeader interface, and the SOAP message body, an object of the SOAPBody type. Both the body and the header contain XML elements. These Java technologies are the building blocks for constructing web services [8].
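The envelope/header/body structure described above can be sketched in code. SAAJ itself has been removed from recent JDKs, so this minimal sketch builds the same SOAP 1.1 structure with JAXP DOM (also listed in Table 1), which ships with the Java platform; the `st:getStudent` payload element and its namespace are hypothetical, chosen to echo the student-administration example later in the paper.

```java
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

public class SoapEnvelopeDemo {
    static final String SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/";

    // Build a minimal SOAP 1.1 envelope (Envelope > Header + Body > payload)
    // and serialize it to a string.
    static String buildEnvelope() {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element env = doc.createElementNS(SOAP_NS, "soap:Envelope");
            doc.appendChild(env);
            env.appendChild(doc.createElementNS(SOAP_NS, "soap:Header"));
            Element body = doc.createElementNS(SOAP_NS, "soap:Body");
            env.appendChild(body);
            // Hypothetical payload element for illustration only.
            Element op = doc.createElementNS(
                    "http://example.com/students", "st:getStudent");
            body.appendChild(op);

            StringWriter out = new StringWriter();
            Transformer t = TransformerFactory.newInstance().newTransformer();
            t.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            t.transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(buildEnvelope());
    }
}
```

With SAAJ available on the classpath, the equivalent steps would be MessageFactory.newInstance().createMessage() followed by navigation through SOAPPart, SOAPEnvelope, SOAPHeader and SOAPBody, as the text describes.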
The many demands on the web create a temptation towards highly expressive semantic formalisms, because the web is a principled architecture of standards, formalisms and languages [6, 14]. What is specific to Web 2.0 is that it connects humans to one another; what is specific to Web 3.0 is that it connects humans to information in an innovative new manner. Web 3.0 functions as an intelligent assistant that understands the concepts in web pages and delivers information relevant to an individual, rather than merely searching mechanically through keywords. Tim Berners-Lee, the founder of the World Wide Web, defines the future online environment as the semantic web [5, 8, 9]. This is the same Web 3.0: a type of internet that understands information rather than just transmitting it to the client. Web 3.0 will be capable of offering visitors complete data on any subject, and communities based on such complete data exchange will certainly develop. The mobile web, supported by intelligent mobile terminals, will enable data access everywhere and at any time, and rich internet applications (RIA) will appear more and more. Web 3.0 represents a much more direct, close, measurable, reliable and honest communication. The Metro stack is made up of JAX-WS, JAXB and WSIT; with it, one can create web services, and clients for them, that are secure, transactional, reliable and interoperable [15]. The Metro components are part of the Metro project, which is part of GlassFish and Java EE (Java Platform, Enterprise Edition), and only partly available in Java SE (Java Platform, Standard Edition). Both GlassFish and Java EE also support the JAX-RPC API [15, 17].

5 Web Applications.
Java Web Start Technology. JWS (Java Web Start) is a technology for installing, launching and updating Java applications directly from the web; it uses the JNLP (Java Network Launching Protocol). For Java applications that use JWS, the javaws launcher is invoked with the syntax javaws [options] [URL], the options being those in Figure 4. Figure 4: options for Java Web Start. The role of the programmer is to: create a Java application and archive the program along with all its resources; sign the JAR archive (if necessary) and create a configuration file (JNLP) that describes the application; create a web page with a link to the configuration file; and copy the JAR, JNLP and HTML files to the web server. The role of the user is to: access the link to the application on the programmer's web page (the application is installed automatically and locally, together with a JRE if necessary); launch the application; and, at each execution, let it check whether a new version exists on the programmer's web page. As a practical application, I have created a project that contains Java classes for the administration of students. The purpose of the application is to use the Java Web Start technology and to follow its use alongside the other technologies that I have tested (for example the servlet technology). To build the source I also used classes from the swing, awt and sql packages, among others. If we open the application in the NetBeans IDE 6.7 [23] environment, the project window looks like the one in Figure 5. Figure 5: the window of the application opened in NetBeans. Helped by Java Web Start, users can access a Java application with a single mouse click on an HTML link to a JNLP file from inside the web browser [19]. Only the Java runtime needs to be installed on the client machine.
It is not necessary to install the JDK (Java Development Kit). In order to allow our Java application to be run with Java Web Start, we have to configure how the IDE builds the project. When Java Web Start is enabled, the IDE automatically creates a JNLP file and an HTML page with a link to the JNLP file, along with the JAR file. To begin with, we must configure the project to activate Java Web Start and test the application locally: we choose the Properties option from the project's shortcut menu and make the suitable settings in the window shown in Figure 6. The application's project must be configured so that the Java Web Start technology is used at launch. The sources are then built together and the main class of the project is launched. While the IDE builds the sources, the Java Web Start screen appears, and in the warning window we are asked whether the application may be executed; we select the checkbox and click Run in that warning window. The "Application" application starts, as in Figure 7. After the build, the dist folder is created and contains, for Java Web Start, the following two additional files. launch.jnlp is an XML file with special elements and attributes instructing browsers how to run the application; the attributes of a JNLP file may include the JNLP version, the application title, the vendor's name, a link to the application's JAR file, etc. The JNLP configuration file is in XML format and provides the information about the application that will be installed using JWS; its structure is presented in Figure 8. launch.html is an automatically generated HTML page with a link to the JNLP file; users can click the link in order to start the application through Java Web Start (Figure 9).
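A launch.jnlp file of the kind described above typically has the following shape (a sketch only: the codebase URL, title, vendor, JAR name and main class here are placeholders, not taken from the paper's actual project):

```xml
<?xml version="1.0" encoding="UTF-8"?>
<jnlp spec="1.0+" codebase="http://www.example.com/app" href="launch.jnlp">
  <information>
    <title>Students Administration</title>
    <vendor>Example Vendor</vendor>
  </information>
  <resources>
    <j2se version="1.6+"/>
    <jar href="Application.jar" main="true"/>
  </resources>
  <application-desc main-class="app.Main"/>
</jnlp>
```

The information element carries the title and vendor shown to the user, the resources element names the required Java version and JAR files, and application-desc tells Java Web Start which main class to launch.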
How do we run the application from a remote location? Once we are sure that the application starts successfully with Java Web Start from local sources, we can upload it to a remote location and launch it from there. To deploy Java Web Start applications on the web, the web server must be able to handle JNLP files: it must be configured to recognize JNLP files as applications, i.e. a MIME type for JNLP must be registered, and each web server has its own way of adding MIME types. To deploy our application to the client, we must make sure that the web server exposes all the files that make up the application; this typically amounts to copying one or more JAR files, together with a JNLP file, to the directories of the web server. The settings needed for a web site to support Java Web Start are much the same as for deploying HTML-based content. In order to launch the application from the web, we have to reference the application's source files in the JNLP file: the project is selected and, in the Properties window, the codebase section is defined. The URL from which the files are loaded covers the JNLP file as well as the JAR and HTML files (Figure 10). Figure 6: project properties, Web Start. Figure 7: the main menu of the application. Figure 8: the structure of the launch.jnlp file. Figure 9: the structure of the launch.html file. Figure 10: settings for running applications on the web. At this point we can run the application: in a browser window, we type the URL of the launch.html file and then click the "Launch the application" link. Our "Application" application starts with Java Web Start.
After analyzing the present application, we may observe that the integration of applications and data sources must be achieved without significant changes to the applications and to the data. Because the web allows easy access to information and services, it has become a main element of communication, provided that the applications running on it use adequate technologies, such as those implemented on the Java platform.

6 Conclusions

Web services comprise a wide range of technologies for implementing distributed applications at the level of the web. These technologies build a program-to-program communication pattern on the XML standard, which assures a neutral representation with respect to platforms, programming languages and software specifications. A web services application is an application that interacts with the world using XML for data definition, WSDL for service definition and SOAP for communication with other software. The fundamental idea of web services is integration [16]: a set of standard technologies that facilitate interoperability among heterogeneous systems at the organizational level or on the internet. For the optimization of web applications, companies turn more and more to the Java programming environment and its platforms. Java is a platform-independent language designed for the web and, at the same time, a development platform for various applications, whatever a company's domain. Alongside the Java platform, the XML technology has developed with the goal of exchanging data among applications. As future research I intend to run more applications on different Java platforms and to compare both their usability in this environment and the security level the platforms implement.

Bibliography

[1] E. Hewitt, Java SOA Cookbook, O'Reilly, 2009.
[2] J. Governor, D. Hinchcliffe, D. Nickull, Web 2.0 Architectures, O'Reilly, 2009.
[3] J. Zukowski, Java 6 Platform Revealed, Springer, 2006.
[4] ***, SOAP with Attachments API for Java (SAAJ), Sun Microsystems, Tech. Rep., 2004.
[5] R. Volz, S. Handschuh, S. Staab, L. Stojanovic, N. Stojanovic, Unveiling the hidden bride: deep annotation for mapping and migrating legacy data to the semantic web, J. Web Semantics, 1(2):187-206, 2004.
[6] I. Dzitac, B.E. Barbat, Artificial intelligence + distributed systems = agents, International Journal of Computers, Communications and Control, 4(1):17-26, 2009.
[7] R. Chinnici, Java API for XML-based RPC (JAX-RPC), Java Community Process, Tech. Rep., 2003.
[8] R. Nagappan, R. Skoczylas, R. Sriganesh, Developing Java Web Services, Wiley, 2003.
[9] I. Dzitac, S. Dzitac, E. Valeanu, Web distributed computing for landscape architecture, in Eleftheriadis, N., Styliadis, A., Paliokas, I. (eds), Proceedings of LANT07, International Conference Landscape Architecture and New Technologies, 25-26 May 2007, Drama, Greece, pp. 25-36, 2007.
[10] ***, Java 2 Platform, Enterprise Edition Specification, Sun Microsystems, Tech. Rep., 2003.
[11] L. Duta, F.G. Filip, Control and decision making in disassembling used electronic products, Studies in Informatics and Control, 17(1):17-28, 2008.
[12] L.A. Zadeh, D. Tufis, F.G. Filip, I. Dzitac (eds), From Natural Language to Soft Computing: New Paradigms in Artificial Intelligence, Editing House of the Romanian Academy, Bucharest, 2008.
[13] N. Kassem, A. Vijendran, R. Mordani, Java API for XML Messaging (JAXM), Sun Microsystems, Tech. Rep., 2003, http://java.sun.com/xml/downloads/jaxm.html.
[14] http://www.nowpublishers.com/
[15] http://java.sun.com/webservices/technologies/index.jsp
[16] http://en.wikipedia.org/wiki/Service-oriented_architecture
[17] http://java.sun.com/developer/technicalarticles/webservices/soa/
[18] http://java.sun.com/javase/technologies/desktop/javawebstart/index.jsp
[19] http://java.sun.com/javase/6/docs
[20] http://java.sun.com/webservices/docs/1.6/tutorial/doc/
[21] http://www.javapassion.com/webservices
[22] http://www.w3.org/tr/ws-arch/
[23] http://www.netbeans.org

Mironela Pirnau (1966) has an M.Sc. in Automatics from the Faculty of Automatics and Computing, Polytechnic University of Bucharest (1992), and a Ph.D. in Electronics (2003) from the University of Pitesti. Her current interests include applied informatics. Since 2008 she has been a lecturer at the Faculty of Science and Technology of Information, Titu Maiorescu University in Bucharest. She has published books and articles on informatics, and has recently been interested in research on the use of Java technologies in developing informatics applications.

International Journal of Computers Communications & Control, ISSN 1841-9836, 9(4):471-481, August, 2014.

Networked Predictive Fuzzy Control of Systems with Forward Channel Delays Based on a Linear Model Predictor

S.W. Tong, D.W. Qian, G.P. Liu

Shiwen Tong* (corresponding author), College of Automation, Beijing Union University, Beijing 100101, China, shiwen.tong@buu.edu.cn
Dianwei Qian, School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China, dianwei.qian@ncepu.edu.cn
Guo-Ping Liu, 1. School of Engineering, University of South Wales, Pontypridd CF37 1DL, UK; 2. CTGT Center, Harbin Institute of Technology, Harbin 150001, China, guoping.liu@southwales.ac.uk

Abstract: This paper presents a novel networked control framework, using fuzzy logic control, for systems with network delays, which are known to greatly weaken the control performance of the controlled system. To deal with the network delays, the predicted differences between the desired future set-points and the predicted outputs from a model predictor are used as the inputs of a fuzzy controller, and a series of future control actions is thus generated.
By selecting the appropriate control action on the plant side, the network delays are compensated. The simulation results demonstrate that the proposed method can markedly reduce the effect of network delays and improve the system's dynamic performance. Keywords: networked control, model prediction, fuzzy control, delay compensation.

Copyright © 2006-2014 by CCC Publications

1 Introduction

The emergence of network technology has changed the communication architecture of control systems from the traditional point-to-point links to a common bus. Sensors, actuators and controllers are connected through a network, forming a feedback control system, namely a networked control system. This architecture has injected fresh blood into the classic and modern control theories and, at the same time, raises greater challenges for controller design. On the one hand, introducing the network into the control system brings many advantages, such as low cost, easy maintenance and high reliability. On the other hand, the unavoidable time delays, data dropouts and other complicated phenomena existing in the network must be considered. In recent years, networked control theory and technology have become an important and active research area, and scholars from different countries have made many breakthroughs in networked control [1]-[8]. Network delay has become one of the most studied issues in networked control systems, because network delays can dramatically degrade the control performance of a system and even make it unstable.
From recently published literature, the treatments of network delays can be summarized as follows. The first approach is to integrate delay information into the controller design, constructing a robust controller to decrease the effect of the time delay [3], [6]-[8]. The second is to estimate the delay in the backward or forward channel using reason-rule tables, averaging, or the delay window (DW) method [9, 10]. The third is to eliminate network delays in the return path by using a cascade control structure, with P (proportional) control in the inner loop and fuzzy adaptive control in the outer loop [4]. As is well known, one obvious characteristic of a networked control system is that the communication network can transmit a packet of data at a time. This feature provides another solution for compensating network delays in the forward channel [11, 12]. Based on it, Liu et al. [1, 2] proposed networked predictive control (NPC), which uses strings of future control actions to compensate for the forward channel delays. In this paper, we design a networked controller using fuzzy control theory. Unlike the networked predictive control method, we separate the model prediction from the controller design: the function of the model predictor is simply to produce future predicted outputs. A fuzzy controller is designed to generate a series of future control actions based on the errors between the desired future outputs and the model-predicted outputs. The strings of future control actions are then packed and sent to the plant side through the communication channel, and the effect of delays in the forward channel is lessened by a delay compensator on the plant side. Predictive control and fuzzy control are powerful tools, and both have been used in the design of networked controllers [1]-[5], [7, 8].
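The packet-based compensation idea just described can be sketched in a few lines. This is a minimal illustration under our own assumptions (integer time steps, one packet of future controls per step, the newest packet wins), not the paper's implementation; the class and method names are ours.

```java
// Plant-side delay compensator: keeps the newest packet of future control
// actions u(t|t), u(t+1|t), ..., u(t+Nu-1|t) and, at plant time "now",
// applies the element matching the experienced forward delay now - stamp.
public class DelayCompensator {
    private long stamp = -1;            // controller time of the stored packet
    private double[] sequence = new double[0];

    // Accept a packet; ignore it if an equally new or newer one is stored.
    public void receive(long packetStamp, double[] futureControls) {
        if (packetStamp > stamp) {
            stamp = packetStamp;
            sequence = futureControls;
        }
    }

    // Select the control action compensating a forward delay of now - stamp
    // steps; hold the last element if the delay exceeds the horizon.
    public double select(long now) {
        if (sequence.length == 0) return 0.0;   // nothing received yet
        int d = (int) Math.max(0, now - stamp);
        return sequence[Math.min(d, sequence.length - 1)];
    }

    public static void main(String[] args) {
        DelayCompensator c = new DelayCompensator();
        c.receive(10, new double[]{0.5, 0.7, 0.9});
        System.out.println(c.select(11));   // one-step delay: second element
    }
}
```

Holding the last element when the delay exceeds the control horizon is one simple policy; a real system would also have to handle packet loss and clock synchronization, which this sketch ignores.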
some researchers have connected prediction with fuzzy control and proposed the ’predictive fuzzy control’ method [13,14]. they use the ’future error’ and ’future error change’ as inputs of the fuzzy controller to produce the control actions at the current time. to our knowledge, the combination of networked control systems and predictive fuzzy control has not been reported except in one paper written by us in 2007 [15]. we call the method ’networked predictive fuzzy control’, abbreviated npfc. the core idea of npfc is to produce ’future control actions’ through fuzzy controller design according to the ’future error’ and ’future error change’ supplied by a model predictor. the network delays can then be compensated by choosing among the ’future control actions’. this paper is organized as follows: first, the architecture of networked predictive fuzzy control (npfc) is presented. second, a model predictor based on the diophantine equation is proposed. third, the fuzzy controller that uses ’future errors’ and ’future error changes’ as inputs to derive ’future control actions’ is designed. the delay compensation mechanism is then discussed and the method is applied to a servo control system. finally, conclusions are drawn in section 4.

2 design of networked control systems

2.1 structure of networked predictive fuzzy control systems

the networked predictive fuzzy control system shown in fig. 1 mainly consists of three key parts: the model predictor, the fuzzy controller and the delay compensator. the model predictor predicts the future outputs y(t|t), y(t+1|t), · · · , y(t+n−1|t) of the controlled system according to the delayed output y(t−1) of the controlled system in the backward channel and the past control actions u(t−d−1), u(t−d−2), · · · , u(t−d−nb).
the errors e(t|t), e(t+1|t), · · · , e(t+n−1|t) between the desired future outputs r(t|t), r(t+1|t), · · · , r(t+n−1|t) and the predicted future outputs y(t|t), y(t+1|t), · · · , y(t+n−1|t) of the controlled system are used by the fuzzy controller to produce the future control sequence u(t|t), u(t+1|t), · · · , u(t+nu−1|t). the future control sequence is then packed and sent to the plant side through the network. on the plant side, a delay compensator compensates for the forward network delays by selecting the appropriate control value.

figure 1: structure of networked predictive fuzzy control system

2.2 model predictor

the function of the model predictor is to produce a series of future outputs of the controlled object according to the delayed output in the backward channel and the past control actions. models of such processes, whether linear or nonlinear, can all serve as the predictor. to simplify the issue and advance the research step by step, we only consider forward channel delays in this paper. this is reasonable in some cases, for example in a network where transmission in the backward channel is much faster than in the forward channel, so that the delays in the backward channel can be neglected. a linear model predictor based on the diophantine equation is therefore proposed. consider a single-input single-output process of the form

a(z^-1) y(t) = z^-d b(z^-1) u(t)    (1)

where

a(z^-1) = 1 + a_1 z^-1 + · · · + a_na z^-na
b(z^-1) = b_0 + b_1 z^-1 + · · · + b_nb z^-nb.

a diophantine equation is introduced to derive the model predictor:

∆ a(z^-1) e_i(z^-1) + z^-i f_i(z^-1) = 1    (2)

where e_i(z^-1) is of order i−1 and f_i(z^-1) is of order na:

e_i(z^-1) = 1 + Σ_{j=1}^{i−1} e_{i,j} z^-j,  f_i(z^-1) = Σ_{j=0}^{na} f_{i,j} z^-j.

define n as the predictive horizon and nm as the model control horizon. from (1) and (2), the predicted value y_p(t+1) of the controlled system can be obtained:

y_p(t+1) = p ∆u(t−d) + q ∆u(t−d−1) + f y(t−1)    (3)

where

p = [ g_0       0         · · ·   0
      g_1       g_0       · · ·   0
      ...       ...       . . .   ...
      g_{nm−1}  · · ·     · · ·   g_0
      ...       ...               ...
      g_{n−1}   g_{n−2}   · · ·   g_{n−nm} ]

y_p(t+1) = [y_p(t|t), y_p(t+1|t), · · · , y_p(t+n−1|t)]^T
∆u(t−d) = [∆u(t−d), ∆u(t−d+1), · · · , ∆u(t−d+nm−1)]^T
∆u(t−d−1) = [∆u(t−d−nb), ∆u(t−d−nb+1), · · · , ∆u(t−d−1)]^T
f = [f_1(z^-1), f_2(z^-1), · · · , f_n(z^-1)]^T

q = [ g_{nb}      · · ·   g_1
      ...                 ...
      g_{nb+n−1}  · · ·   g_n ]

let g_i(z^-1) = b(z^-1) e_i(z^-1) = g_0 + g_1 z^-1 + · · · + g_{nb+i−1} z^-(nb+i−1); the matrices p and q can then be constructed by solving the diophantine equation. it should be noted that the term ∆u(t−d) = [∆u(t−d), ∆u(t−d+1), · · · , ∆u(t−d+nm−1)]^T in equation (3) cannot be obtained at the current time t. we assume ∆u(t−d+i) = 0, i = 0, 1, · · · , nm−1. this assumption is reasonable: on the one hand, the dynamic response at future times can be reflected by the past control actions and the previous process outputs; on the other hand, the term y(t−1) corresponds to the correction of the prediction model, that is, y(t−1) = y_m(t−1) + e(t−1), where y_m(t−1) is the model output and e(t−1) is the error between the process output and the model output.

2.3 fuzzy controller

the successful implementation of networked predictive fuzzy control relies on the assurance that the fuzzy controller can calculate all the pre-defined control actions within every sample time, so the fuzzy control algorithm must be simple and suitable for real-time control. a real-time simplified fuzzy controller [16] is used to generate the future control actions.
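since the paper does not give its numerical routine, the following is a sketch (with our own function names) of solving the diophantine equation (2) by polynomial long division: e_i is the first i terms of the series expansion of 1/(∆a), and f_i is the shifted remainder.

```python
def convolve(p, q):
    """multiply two polynomials in z^-1 given as ascending coefficient lists."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def diophantine(a, i):
    """solve delta*a(z^-1)*e_i(z^-1) + z^-i*f_i(z^-1) = 1 (eq. 2) by long
    division; a = [1, a_1, ..., a_na], e_i has i terms, f_i has na+1 terms."""
    a_tilde = convolve([1.0, -1.0], a)        # delta * a, degree na+1
    e, rem = [], [1.0] + [0.0] * (len(a_tilde) - 1)
    for _ in range(i):
        q = rem[0] / a_tilde[0]               # next coefficient of e_i
        e.append(q)
        rem = [r - q * c for r, c in zip(rem, a_tilde)]
        rem = rem[1:] + [0.0]                 # shift to the next power of z^-1
    return e, rem[:len(a_tilde) - 1]          # f_i = z^i * remainder
```

with e_i in hand, the coefficients g_i of the matrices p and q follow from g_i(z^-1) = b(z^-1) e_i(z^-1), i.e. one more polynomial multiplication.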
unlike ordinary fuzzy control, which uses the error and error change at the present time to produce the current control action, the networked predictive fuzzy controller uses the ’future error’ and ’future error change’ to derive ’future control actions’. ordinary fuzzy control can be regarded as a special case of the networked fuzzy control in which the predictive step equals zero. this networked predictive fuzzy controller has two inputs and one output. one input is the error e between the desired future output and the predicted output; the other is the change of the error, ec. the output of the fuzzy controller is the change of the future control action, ∆u. the membership functions of e and ec take triangular form, and the membership function of the output ∆u takes discrete form, as shown in fig. 2.

figure 2: membership functions of e, ec and ∆u. (a) is the membership function of input e, (b) is the membership function of input ec, (c) is the discrete-form membership function of output ∆u.

the ’simplification’ of the fuzzy controller lies in the inference process. for the typical two-input one-output fuzzy controller, only four control rules are excited in each cycle, rules of the form ’if e is l_e and ec is l_ec, then ∆u is l_∆u’, where l is one of the linguistic variables pb, pm, ps, ze, ns, nm, nb. due to the shape of the triangular membership functions, e belongs to at most two membership functions µ_e^i and µ_e^(i+1), and ec belongs to at most two membership functions µ_ec^j and µ_ec^(j+1); thus ∆u has 2 × 2 combinations, that is, four control rules:

if e is l_e^(i) and ec is l_ec^(j), then ∆u is l_∆u^(i,j)
if e is l_e^(i) and ec is l_ec^(j+1), then ∆u is l_∆u^(i,j+1)
if e is l_e^(i+1) and ec is l_ec^(j), then ∆u is l_∆u^(i+1,j)
if e is l_e^(i+1) and ec is l_ec^(j+1), then ∆u is l_∆u^(i+1,j+1)

as shown in fig. 2(c), the output ∆u takes a discrete-form membership function. it is assumed that the output domain has been divided into discrete points c_m, m = 1, 2, · · · , n. applying the minimum operator ∧, calculate

µ̃^(m)_{l_∆u^(i,j)} = ∧( µ_{l_e^(i)}, µ_{l_ec^(j)}, µ^(m)_{l_∆u^(i,j)} )
µ̃^(m)_{l_∆u^(i,j+1)} = ∧( µ_{l_e^(i)}, µ_{l_ec^(j+1)}, µ^(m)_{l_∆u^(i,j+1)} )
µ̃^(m)_{l_∆u^(i+1,j)} = ∧( µ_{l_e^(i+1)}, µ_{l_ec^(j)}, µ^(m)_{l_∆u^(i+1,j)} )
µ̃^(m)_{l_∆u^(i+1,j+1)} = ∧( µ_{l_e^(i+1)}, µ_{l_ec^(j+1)}, µ^(m)_{l_∆u^(i+1,j+1)} )    (4)

and, applying the maximum operator ∨,

µ̃̃^(m)_∆u = ∨( µ̃^(m)_{l_∆u^(i,j)}, µ̃^(m)_{l_∆u^(i,j+1)}, µ̃^(m)_{l_∆u^(i+1,j)}, µ̃^(m)_{l_∆u^(i+1,j+1)} )    (5)

where m = 1, 2, · · · , n. it is not easy to directly obtain the inference rules of the future; however, the typical dynamics of a second order system can be obtained ahead of time. fig. 3 presents the phase plane of a typical second order linear system, with the x-axis standing for the variable e and the y-axis for the variable ec.

figure 3: phase plane and fuzzy control rule analysis of a typical second order linear system.

from this figure, it can be seen that the points on the x-axis and y-axis are crucial: if these points are controlled well, the control performance is guaranteed. the main inference rules deduced in this way are shown in table 1.

table 1: main control rules of the fuzzy controller in npfc

no.  control rule
1    if e = pb and ec = ze then ∆u = pb
2    if e = ze and ec = nb then ∆u = nb
3    if e = nb and ec = ze then ∆u = nb
4    if e = ze and ec = pb then ∆u = pb
5    if e = ps and ec = ze then ∆u = ps
6    if e = ze and ec = nm then ∆u = nm
7    if e = ns and ec = ze then ∆u = ns
8    if e = ze and ec = pm then ∆u = pm
9    if e = ze and ec = ns then ∆u = ns
10   if e = ze and ec = ps then ∆u = ps
11   if e = ze and ec = ze then ∆u = ze

thus, the incremental control actions

∆u(t+η) = ( Σ_{m=1}^{n} µ̃̃^(m)_∆u · c_m ) / ( Σ_{m=1}^{n} µ̃̃^(m)_∆u )    (6)

and the predicted control actions

u(t+η) = u(t+η−1) + ∆u(t+η), where η is an integer and 0 ≤ η ≤ nu−1    (7)

can be given, where nu is the control horizon. if e is the error and ec is the error change at the present time, then ∆u is the incremental control action at the present time; if e and ec are values of a future time, then the future incremental control actions are derived. as papers [17]-[19] present, a variable domain implemented in the fuzzy control can greatly improve the control accuracy, and it has been successfully applied to the control of a quadruple inverted pendulum [20] and the wing rock phenomenon [21]. this is achieved by domain contraction and expansion; domain contraction is equivalent to an increase of the control rules. although the real-time algorithm considers only 11 rules, together with the variable domain strategy the fuzzy controller can achieve satisfactory control performance. the variation of the domains is achieved by multiplying the domains of e, ec and ∆u by flex factors α(e), β(ec) and γ(e, ec), respectively. there are many different forms of flex factors.
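for illustration, one inference cycle of the simplified controller, triangular input memberships, min-max inference as in equations (4)-(5), and the discrete-centre defuzzification of equation (6), can be sketched as follows. the seven-label normalised domains, the centre positions and the sum-of-indices rule table are illustrative assumptions chosen to agree with the rules listed in table 1, not the paper's tuned controller:

```python
LABELS = ["nb", "nm", "ns", "ze", "ps", "pm", "pb"]
STEP = 1.0 / 3                                  # spacing of membership centres
CENTRES = [-1.0, -2 * STEP, -STEP, 0.0, STEP, 2 * STEP, 1.0]

def tri_memberships(x):
    """triangular membership grades on a normalised [-1, 1] domain;
    at most two neighbouring labels fire, as the paper observes."""
    x = max(-1.0, min(1.0, x))
    grades = {}
    for lab, c in zip(LABELS, CENTRES):
        mu = 1.0 - abs(x - c) / STEP
        if mu > 0.0:
            grades[lab] = mu
    return grades

def rule(le, lec):
    """illustrative rule table: the output index is the clipped sum of the
    input indices (e.g. e = pb, ec = ze gives delta-u = pb, as in table 1)."""
    idx = LABELS.index(le) + LABELS.index(lec) - 3  # -3 recentres on 'ze'
    return LABELS[max(0, min(6, idx))]

def fuzzy_step(e, ec):
    """one cycle: 'and' by minimum (eq. 4), combine by maximum (eq. 5),
    then centre-of-gravity defuzzification over the centres (eq. 6)."""
    agg = {}
    for le, mu_e in tri_memberships(e).items():
        for lec, mu_ec in tri_memberships(ec).items():
            lu = rule(le, lec)
            agg[lu] = max(agg.get(lu, 0.0), min(mu_e, mu_ec))
    num = sum(mu * CENTRES[LABELS.index(l)] for l, mu in agg.items())
    den = sum(agg.values())
    return num / den if den else 0.0
```

only the handful of rules excited by the two fired labels per input is ever evaluated, which is exactly the real-time ’simplification’ the section describes.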
in this paper, we adopt the exponential form of the flex factors:

α(e) = (|e| / E)^τ1, 0 < τ1 < 1
β(ec) = (|ec| / EC)^τ2, 0 < τ2 < 1
γ(e, ec) = [ (|e| / E)^τ1 (|ec| / EC)^τ2 ]^τ3, 0 < τ1, τ2, τ3 < 1    (8)

where E and EC denote the bounds of the domains of e and ec. to summarize, the networked predictive fuzzy controller has eleven parameters to be designed. four parameters relate to the model predictor: the orders na and nb of the predictive model, the predictive horizon n, and the model control horizon nm. seven parameters belong to the networked fuzzy controller: the control horizon nu; the scaling gains ke, kec and k∆u of the error e, the error change ec and the incremental control action ∆u; and the variable-domain parameters τ1, τ2 and τ3. the program steps for networked predictive fuzzy control with the variable-domain strategy are summarized below:

step 1: using equation (3), calculate the future outputs yp(t|t), yp(t+1|t), · · · , yp(t+n−1|t) of the controlled system according to the delayed output of the feedback channel and the previous control actions.
step 2: calculate the differences between the desired future outputs r(t|t), r(t+1|t), · · · , r(t+n−1|t) and the model-predicted values yp(t|t), yp(t+1|t), · · · , yp(t+n−1|t) to get e(t|t), e(t+1|t), · · · , e(t+n−1|t) and ec(t|t), ec(t+1|t), · · · , ec(t+n−1|t).
step 3: adjust the input and output domains using equation (8) in terms of e(t+η) and ec(t+η).
step 4: calculate the membership functions of the inputs e(t+η) and ec(t+η) and the output ∆u(t+η).
step 5: apply the minimum-maximum inference method [see equations (4) and (5)].
step 6: calculate the predicted control actions u(t+η) using equations (6) and (7).
step 7: for η = 0 to nu−1, repeat steps 3-6.
step 8: send the control actions u(t), u(t+1), · · · , u(t+nu−1) in one packet to the plant side.
step 9: select the control action u(t|t−k) and apply it to the controlled process.
step 10: at the next sample time, repeat steps 1-9.
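the controller-side loop of steps 1-8 can be sketched with the model predictor and the fuzzy controller passed in as stub functions; the function names, the omission of the variable-domain adjustment, and the choice ec(t|t) = e(t|t) for the first error change (the previous error being unavailable at time t) are our simplifying assumptions:

```python
def npfc_step(predictor, fuzzy, r, y_delayed, u_hist, n, nu):
    """one controller-side cycle of steps 1-8 (a sketch with stubbed parts).
    predictor(y_delayed, u_hist) -> n predicted outputs yp(t+eta|t)
    fuzzy(e, ec) -> incremental control action delta-u(t+eta)
    r: desired future outputs r(t+eta|t); returns the nu-entry packet."""
    yp = predictor(y_delayed, u_hist)                  # step 1
    e = [r[eta] - yp[eta] for eta in range(n)]         # step 2: errors
    # the previous error is unknown at t, so ec(t|t) is taken as e(t|t)
    ec = [e[0]] + [e[i] - e[i - 1] for i in range(1, n)]
    u_prev, packet = u_hist[-1], []
    for eta in range(nu):                              # step 7: eta = 0..nu-1
        du = fuzzy(e[eta], ec[eta])                    # steps 3-6
        u_prev += du                                   # eq. (7)
        packet.append(u_prev)
    return packet                                      # step 8: one packet
```

step 9 happens on the plant side, where one entry of the packet is selected by the delay compensator of section 2.4.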
2.4 network delay compensation

it is assumed that the network communication delay in the forward channel is not greater than the length of the predicted control horizon. to exploit the ’packet transmission’ characteristic of the network, a string of future control actions containing u(t), u(t+1), · · · , u(t+nu−1) at sample time t is sent to the plant side at once. the control value from the latest control sequence available on the plant side is then chosen as the control input of the plant to compensate for the forward channel delay. for example, if the latest control sequence on the plant side is

[u(t−k|t−k), u(t−k+1|t−k), · · · , u(t−k+nu−1|t−k)]^T

then the selected output control signal will be u(t) = u(t|t−k).

3 simulations

consider the servo control system of reference [2]. the discrete-time system model is as follows:

g(z^-1) = (−0.00886 z^-1 + 1.268227 z^-2) / (1 − 1.66168 z^-1 + 0.6631 z^-2)    (9)

where the sample time is 0.04 second. suppose there is a one-step delay in the forward channel. following sections 2.2 and 2.3, we design a linear model predictor and a real-time fuzzy controller. the parameters of the model predictor are na = 2, nb = 2, n = 12, nm = 10. the parameters of the networked fuzzy controller are nu = 10, ke = 0.00125, kec = 0.02, k∆u = 0.6, τ1 = 0.1, τ2 = 0.1 and τ3 = 0.01. the npc parameters are set to n = 12, nu = 10, ρ = 1500. fig. 4 shows the control performance of npfc and npc: the dotted line is the set-point, the solid line stands for the npfc method, the dashed line for the npc method, and the dash-dot line for the npfc method without delay compensation.

figure 4: the step responses of npc and npfc with 1-step forward channel delay.
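the plant-side selection rule u(t) = u(t|t−k) can be sketched as follows, assuming each packet carries the integer sample index at which it was generated (the class and field names are ours):

```python
class DelayCompensator:
    """plant-side selector: keep the newest control packet and pick the
    entry predicted for the current instant (a sketch; timestamping and
    network transport are assumed handled elsewhere)."""
    def __init__(self):
        self.stamp = -1          # send time t of the stored packet
        self.packet = None       # [u(t|t), u(t+1|t), ..., u(t+nu-1|t)]

    def receive(self, t_sent, packet):
        if t_sent > self.stamp:  # discard out-of-order or older packets
            self.stamp, self.packet = t_sent, list(packet)

    def control(self, t_now):
        k = t_now - self.stamp   # forward-channel delay in samples
        if self.packet is None or k >= len(self.packet):
            return None          # delay exceeds the horizon: no valid entry
        return self.packet[k]    # u(t_now | t_now - k)
```

this also makes the standing assumption of the section concrete: once the delay k reaches the length nu of the packet, no predicted action is available.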
from the figure, it can be seen that the npfc achieves better control performance than the npc, with a rapid dynamic response and small overshoot; the delay compensation mechanism is very effective. now suppose that a six-step delay exists in the forward channel. the npfc controller parameters are adjusted to ke = 0.0004, kec = 0.008, k∆u = 0.08, and the npc parameters are set to n = 25, nu = 10, ρ = 100000. to verify the control performance of the networked predictive fuzzy control method, the results of the npc and the npfc are presented in fig. 5. through model prediction, fuzzy controller design and delay compensation, the npfc presents obviously better performance than the npc method: the rise time of npfc is about 1.1 seconds versus 1.5 seconds for the npc method, and npc has 3.75% overshoot while npfc has nearly no overshoot. when the npfc method does not include delay compensation, static errors can be seen in fig. 5; on the contrary, the dynamic response reaches steady state after 1.4 seconds when the delay compensator is acting.

figure 5: the step responses of npc and npfc with 6-step forward channel delay.

4 conclusions

this paper proposes a networked predictive fuzzy controller structure. by solving the diophantine equation, the linear model predictor generates a series of predicted outputs of the controlled system according to the past outputs and the past control sequence. the errors between the desired future outputs and the predicted outputs from the linear model predictor, together with the error changes, are used to design a real-time fuzzy controller, so that a series of future control actions is produced on the controller side.
by selecting the appropriate control sequence on the plant side, the delays in the forward channel are compensated. because the npfc has more parameters that can be tuned, its control performance can be adjusted better than that of the npc method.

acknowledgments

this work was supported in part by the beijing municipal excellent talents funded project (grant no. 2013d005022000003), the korea foundation for advanced studies fellowship to dianwei qian, and the national natural science foundation of china (grant no. 61273104 and 61333003).

bibliography

[1] liu g. p.; rees d.; chai s. c.; nie x. y. (2005); design, simulation and implementation of networked predictive control systems, measurement and control, issn 2029-2940, 38:17-21.
[2] liu g. p.; mu j. x.; rees d.; chai s. c. (2006); design and stability analysis of networked control systems with random communication time delay using the modified mpc, international journal of control, issn 0020-7179, 79(4):288-297.
[3] mahmoud m. s.; saif a. a. (2012); robust quantized approach to fuzzy networked control systems, ieee journal on emerging and selected topics in circuits and systems, issn 2156-3357, 2(1):71-81.
[4] du f.; qian q. q. (2008); the research of heterogeneous networked control systems based on modify smith predictor and fuzzy adaptive control, ieee international conference on fuzzy systems.
[5] tang p. l.; de silva c. w. (2006); compensation for transmission delays in an ethernet-based control network using variable-horizon predictive control, ieee transactions on control systems technology, issn 1063-6536, 14(4):707-718.
[6] jia x. c.; zhang d. w.; zheng l. h.; zheng n. n. (2008); modeling and stabilization for a class of nonlinear networked control systems: a t-s fuzzy approach, progress in natural science, issn 1002-0071, 18(8):1031-1037.
[7] jiang x. f.; han q. l.
(2008); on designing fuzzy controllers for a class of nonlinear networked control systems, ieee transactions on fuzzy systems, issn 1063-6706, 16(4):1050-1060.
[8] hajebi p.; almodarresi s. m. t. (2012); online adaptive fuzzy logic controller using neural network for networked control systems, international conference on advanced communication technology.
[9] ren c. q.; wu p. d.; wang x. f.; ma s. y.; chen z. l. (2002); a study on the forecast arithmatic of hydraulic telecontrol system based on internet, journal of beijing institute of technology, issn 1004-0579, 22(1):85-89. (in chinese)
[10] zhen w.; xie j. y. (2002); on-line delay-evaluation control for networked control systems, ieee conference on decision and control.
[11] zhang y. y.; zhang j. l.; luo x. y.; guan x. p. (2013); faults detection for networked control systems via predictive control, international journal of automation and computing, issn 1476-8186, 10(3):173-180.
[12] tang x. m.; ding b. c. (2012); design of networked control systems with bounded arbitrary time delays, international journal of automation and computing, issn 1476-8186, 9(2):182-190.
[13] li p. f.; yan x. p.; qiu l. p.; zhou q. y. (2009); study on predictive fuzzy control of great inertia system based on grey model, 2009 second international conference on intelligent computation technology and automation.
[14] hu j. q.; rose e. (1997); predictive fuzzy control applied to the sinter strand process, control engineering practice, issn 0967-0661, 5(2):247-252.
[15] tong s. w.; liu g. p. (2007); design and simulation of fuel cell networked predictive fuzzy control systems, proceedings of the 26th chinese control conference.
[16] tong s. w.; liu g. p. (2008); real-time simplified variable domain fuzzy control of pem fuel cell flow systems, european journal of control, issn 0947-3580, 14(3):223-233.
[17] li h. x.
(1995); to see the success of fuzzy logic from mathematical essence of fuzzy control, fuzzy systems and mathematics, issn 1001-7402, 9(4):1-14. (in chinese)
[18] oh s. y.; park d. j. (1995); self-tuning controller with variable universe of discourse, ieee international conference on systems, man and cybernetics.
[19] li h. x. (1999); variable universe adaptive fuzzy controller, science in china (series e), issn 1006-9321, 20(1):32-42.
[20] li h. x.; miao z. h.; wang j. y. (2002); variable universe adaptive fuzzy control on the quadruple inverted pendulum, science in china (series e), issn 1006-9321, 45(2):213-224.
[21] liu z. l.; su c. y.; svoboda j. (2004); control of wing rock phenomenon with a variable universe fuzzy controller, proceedings of the american control conference.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 2, pp. 160-170

ewma algorithm in network practice

p. čisar, s. bošnjak, s. maravić čisar

petar čisar, telekom srbija, prvomajska 2-4, subotica, serbia, e-mail: petarc@telekom.rs
saša bošnjak, faculty of economics, segedinski put 9-11, subotica, serbia, e-mail: bsale@eccf.su.ac.yu
sanja maravić čisar, subotica tech, marka oreškovića 16, subotica, serbia, e-mail: sanjam@vts.su.ac.rs

abstract: intrusion detection is used to monitor and capture intrusions into computer and network systems which attempt to compromise their security. many intrusions manifest themselves as changes in the intensity of events occurring in computer networks. because of the ability of exponentially weighted moving average (ewma) control charts to monitor the rate of occurrence of events based on their intensity, this technique is appropriate for implementation in algorithms based on control limits. the paper also gives a review of a possible optimization method. the validation check of the results is performed on authentic network samples.
keywords: intrusion detection, ewma, control limits, optimization, autocorrelation

1 introduction

the exponentially weighted moving average is a statistic for monitoring a process that averages the data in a way that gives less and less weight to data the further they are removed in time. for the ewma control technique, the decision regarding the state of control of the process depends on the ewma statistic, which is an exponentially weighted average of all prior data, including the most recent measurements. by the choice of the weighting factor λ, the ewma control procedure can be made sensitive to a small or gradual drift in the process. the statistic that is calculated is the following:

ewma_t = λ y_t + (1 − λ) ewma_{t−1},  t = 1, 2, ..., n    (1)

where
• ewma_0 is the mean of historical data (target)
• y_t is the observation at time t
• n is the number of observations to be monitored, including ewma_0
• 0 < λ ≤ 1 is a constant that determines the depth of memory of the ewma.

this equation was established by roberts, as described in [4]. the parameter λ determines the rate at which "older" data enter into the calculation of the ewma statistic. a value of λ = 1 implies that only the most recent measurement influences the ewma. thus, a large value of λ gives more weight to recent data and less weight to older data, while a small value of λ gives more weight to older data. the value of λ is usually set between 0.2 and 0.3 [2], although this choice is somewhat arbitrary. lucas and saccucci [3] have shown that although the smoothing factor λ used in an ewma chart is usually recommended to be in the interval between 0.05 and 0.25, in practice the optimally designed smoothing factor depends not only on the given size of the mean shift δ, but also on a given in-control average run length (arl).

copyright © 2006-2010 by ccc publications
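the ewma recursion of equation (1) translates directly into code; a minimal sketch:

```python
def ewma_series(y, lam, ewma0):
    """ewma_t = lam*y_t + (1 - lam)*ewma_{t-1}, with ewma_0 given (eq. 1).
    returns the full sequence [ewma_0, ewma_1, ..., ewma_n]."""
    out = [ewma0]
    for yt in y:
        out.append(lam * yt + (1 - lam) * out[-1])
    return out
```

with lam = 1 the series reproduces the raw observations (only the most recent measurement counts), matching the remark about λ = 1 above.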
the estimated variance of the ewma statistic is approximately:

σ²_ewma = λ / (2 − λ) · σ²    (2)

where σ is the standard deviation calculated from the historical data. the center line of the control chart is the target value, ewma_0. the upper and lower control limits are:

ucl = ewma_0 + k σ_ewma
lcl = ewma_0 − k σ_ewma    (3)

where the factor k is either set equal to 3 (the 3-sigma control limits) or chosen using the lucas and saccucci tables (arl = 370). in addition to the aforementioned authors, publications [6]-[13] have also dealt with the topic of ewma statistics and statistical anomaly detection in computer networks. control charts are specialized time series plots which assist in determining whether a process is in statistical control. some of the most widely used forms of control charts are x-r charts and individuals charts. these are frequently referred to as "shewhart" charts after the control-charting pioneer walter shewhart, who introduced such techniques. these charts are sensitive to detecting relatively large shifts in the process (i.e. of the order of 1.5σ or above). in computer network practice, shifts can be caused by an intrusion or attack, for example. two types of charts are primarily used to detect smaller shifts (less than 1.5σ), namely cumulative sum (cusum) charts and ewma charts. a cusum chart plots the cumulative sums of the deviations of each sample value from a target value. an alternative technique for detecting small shifts is the ewma methodology. this type of chart has some very attractive properties, in particular:

1. unlike x-r and individuals charts, all of the data collected over time may be used to determine the control status of a process.
2. like the cusum, the ewma utilizes all previous observations, but the weight attached to the data decreases exponentially as the observations become older and older.
3. the ewma is often superior to the cusum charting technique because it better detects larger shifts.
4. ewma schemes may be applied for monitoring standard deviations in addition to the process mean.
5. ewma schemes can be used to forecast values of a process mean.
6. the ewma methodology is not sensitive to normality assumptions.

in real situations, the exact value of the shift size is often unknown and can only reasonably be assumed to vary within a certain range. such a range of shifts deteriorates the performance of existing control charts. one of the algorithms for determining the maximal shift in normal traffic is described in [17]. the paper describes the application of the ewma algorithm to one major user, given as an example; it can be shown that the obtained results are valid for the other analyzed users as well. this research uses samples of authentic network traffic (i.e. traffic intensity in a unit of time). the traffic analysis is realized in the form of statistical calculations on samples derived from the traffic curve. from the appropriate pattern of internet traffic, 35 samples of local maximums are taken in order to ensure that the statistical analysis is performed on a large sample (number of samples n > 30), thus supporting general conclusions. the aim of this research is to determine those allowed ewma values of traffic whose exceedance is to be considered the appearance of a statistical anomaly suspected to be an attack. in this sense, the choice of only the local maximums for analysis can be accepted as logical, because the critical point of the maximum value of aggregate traffic is in this way also included. the proposed method of calculating the overall optimal value of λ is applied to the traffic patterns, on the basis of which the lower and upper control limits of traffic are determined. for the statistical detection of an attack, the primary interest is the situation in which the upper control limit is exceeded.
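the variance and control-limit formulas of equations (2) and (3) can be sketched as:

```python
def ewma_limits(mu0, sigma, lam, k=3.0):
    """asymptotic ewma control limits (eqs. 2-3):
    sigma_ewma^2 = lam/(2 - lam) * sigma^2,
    lcl/ucl = ewma_0 -/+ k * sigma_ewma (k = 3 by default)."""
    sigma_ewma = (lam / (2.0 - lam)) ** 0.5 * sigma
    return mu0 - k * sigma_ewma, mu0 + k * sigma_ewma
```

note that for small λ the limits tighten around the target, which is exactly why the ewma chart is sensitive to small, gradual drifts.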
the overstepping of the lower control limit can also be understood as a statistical anomaly, but in this research it relates only to the local maximums (and not to the aggregate network traffic) and as such does not endanger the security of the computer network in general. therefore, a situation in which the value of network traffic falls below some lower limit is not considered a suspicious event or attack, because the initial presumption of this research is an increase of traffic during an external attack. for the observed traffic pattern, ewma values are calculated, and if these values fall outside the control limits, the situation is interpreted as a statistical anomaly. emphasis in this work is placed on determining the occurrence of false alarms, as an important security feature of the applied algorithm.

2 optimized exponential smoothing

calculating the optimal value of the parameter λ is based on the study of authentic samples of network traffic. random variations of network traffic are a normal phenomenon in the observed sample. in order to decrease or eliminate the influence of individual random variations of network traffic on the occurrence of false alarms, the procedure of exponential smoothing is applied as an aspect of data preprocessing. for any time period t, the smoothed value s_t is determined by computing:

s_t = λ y_{t−1} + (1 − λ) s_{t−1},  where 0 < λ ≤ 1 and t ≥ 3    (4)

this is the basic equation of exponential smoothing; the formulation here is given by hunter [2]. this smoothing scheme begins by setting s_2 to y_1, where s_i stands for the smoothed observation or ewma, and y_i stands for the original observation. the subscripts refer to the time periods 1, 2, ..., n. for example, the third period is s_3 = λ y_2 + (1 − λ) s_2, and so on. there is no s_1. the optimal value of λ is the value which results in the smallest mean of the squared errors (mse). the initial ewma plays an important role in computing all the subsequent ewmas.
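the smoothing recursion of equation (4) together with the smallest-squared-error criterion can be sketched as a simple grid search over λ (the grid step, the sse form of the criterion and the default s_2 = y_1 are our assumptions in the spirit of the text):

```python
def sse_for_lambda(y, lam, s2=None):
    """sum of squared one-step errors for exponential smoothing (eq. 4):
    s_t = lam*y_{t-1} + (1 - lam)*s_{t-1}, with s_2 = y_1 by default."""
    s = y[0] if s2 is None else s2       # initial smoothed value s_2
    sse = (y[1] - s) ** 2
    for t in range(2, len(y)):
        s = lam * y[t - 1] + (1 - lam) * s
        sse += (y[t] - s) ** 2
    return sse

def optimal_lambda(y, s2=None, step=0.01):
    """grid search over lambda in (0, 1] for the smallest sse."""
    grid = [step * i for i in range(1, int(1 / step) + 1)]
    return min(grid, key=lambda lam: sse_for_lambda(y, lam, s2))
```

minimising the sse is equivalent to minimising the mse mentioned above, since the number of error terms is fixed for a given sample.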
there are several approaches to defining this value:

1. setting s_2 to y_1
2. setting s_2 to the target of the process
3. setting s_2 to the average of the first four or five observations

it can also be shown that the smaller the value of λ, the more important is the selection of the initial ewma. the user would be well advised to try several methods before finalizing the settings. for different input values of the initial parameter s_2, an application was created in "matlab" which calculates and plots the dependence of the sse on the value of λ in the range 0 ÷ 1, with an adjustable step. in addition, the optimal value λ_opt is also calculated: in accordance with the smoothing scheme, the value for which the sse is minimal is taken as optimal. the following figure shows an example of calculating the optimal value of the parameter λ for a specific s_2.

figure 1: calculation of λ_opt (sse)

due to the lack of an exact method of calculation in the available publications on the determination of the initial s_2 in the procedure of exponential smoothing, the authors of this paper have researched the link between the selection of s_2 = y_1 and λ_opt, i.e. s_2(λ_opt). in that sense, the range of s_2 is determined from the lowest to the highest sample value during the period of observation. this research was conducted on an authentic sample of network traffic of an internet service provider, and the segment of observation was the range of values of the local maximums (in this concrete case from 8 to 34 mb/s), with a large enough number of values, n = 33 > 30, to support the generality of the conclusions. the period of observation was one month. the next table (table 1) shows the numerical dependence s_2(λ_opt).
no.  s_2   λ_opt
1    8     0.72
2    9     0.72
3    10    0.72
4    11    0.72
5    12    0.71
6    13    0.71
7    14    0.72
8    15    0.72
9    16    0.72
10   17    0.72
11   18    0.72
12   18.5  0.73
13   19    0.73
14   19.5  0.73
15   20    0.73
16   20.5  0.73
17   21    0.74
18   21.5  0.74
19   22    0.74
20   22.5  0.75
21   23    0.75
22   23.5  0.75
23   24    0.75
24   25    0.76
25   26    0.77
26   27    0.77
27   28    0.78
28   29    0.79
29   30    0.8
30   31    0.8
31   32    0.81
32   33    0.82
33   34    0.82

table 1: calculation of s_2(λ_opt)

since a set of different results is obtained for the partial values of λ_opt, the authors suggest accepting the average of all the partial results as the overall optimal parameter λ_opt (in this particular case, 0.75).

3 autocorrelation

autocorrelation, or serial correlation, of a time series means that the value of the observed variable in one time unit depends on the values which appear earlier or later in the series. in practical situations, the autocorrelation of the first order is usually examined; it may be expressed by a simple correlation coefficient, the so-called autocorrelation coefficient. let r_t be the time series data, t = 1, 2, ..., T; then the first-order autocorrelation coefficient is given by:

ρ(r) = ( Σ_{t=2}^{T} r_t r_{t−1} ) / sqrt( Σ_{t=2}^{T} r_t² · Σ_{t=2}^{T} r_{t−1}² ),  −1 ≤ ρ ≤ 1    (5)

one of the standard features of traffic time series is that the increase rates of traffic r_t are not significantly mutually autocorrelated, i.e. the value of the autocorrelation coefficient is near zero. at the same time, this means that the distribution of positive and negative values of the increase rates is random and does not follow a specific systematic regularity. positive autocorrelation implies that positive values are followed mainly by positive values and negative values by negative ones, and then ρ ≈ +1. in the case of negative autocorrelation there is a frequent change of sign, i.e. a positive rate in most cases leads to a negative rate and vice versa, and then ρ ≈ −1.
Since there is no typical scheme, on the basis of a positive rate in a particular time period it cannot be concluded with significant probability that either growth or decline will appear in the next period. The same applies to a negative rate. The researchers in [5] dealt with the influence of autocorrelated and uncorrelated data on the behaviour of an intrusion detection algorithm. In their work they came to the conclusion that the EWMA algorithm works well, in the sense of intrusion detection in some information systems, for both autocorrelated and uncorrelated data. The advantage of the EWMA technique for uncorrelated data is that it (as opposed to the case of autocorrelated data) can detect not only rapid changes in the intensity of events, but also small changes in the mean value realized through a gradual increase or decrease of the intensity of events. However, in EWMA for uncorrelated data, the initial value of the smoothed intensity of events has to be reset after an intrusion is detected, in order to avoid the impact of the current parameter values on future results (the carry-over effect). In the case of EWMA for autocorrelated data this reset is not necessary, because EWMA automatically adjusts the upper and lower control limits. Generally, the smoothing constant should not be too small, so that a short-term trend in the intensity of events in the recent past can be detected. Other publications have also shown the need to take the autocorrelation of the input data into account. As emphasized in [18], in the case of dynamic systems the autocorrelation in the variables is taken into account by incorporating time lags of the time series during the modeling stage. Samples of network traffic were obtained with the network software MRTG (Multi Router Traffic Grapher), version 2.10.15.
This software generates three types of graphs:

• daily, with calculation of the 5-minute average;
• weekly, with calculation of the 30-minute average;
• monthly, with calculation of the 2-hour average.

The graphs also provide numerical information on the maximum and average traffic for the corresponding period of time. The daily, weekly and monthly graphs of the first measurement will be used for the calculation of the initial historical data, while the application of EWMA statistics, with the aim of checking the validity of certain parameters, will be realized on the daily, weekly and monthly traffic graphs of the second measurement. For the application of the exponential smoothing method to network traffic it is first necessary to determine the historical values: EWMA0 and the standard deviation σ0. For this purpose, appropriate traffic samples must be collected to perform the calculations. This study uses a total of 105 samples of local maximums: 35 samples from the daily traffic graph, 35 from the weekly traffic graph and 35 from the monthly traffic graph (Table 2).

Table 2: Network samples

Time  yt (daily)  yt (weekly)  yt (monthly)
  1     12          21           23
  2     10.5        22.5         30
  3      8.5        23           27
  4     10.5        20           27
  5     18          20.5         25
  6     22          23.5         27
  7     25.5        24           22
  8     20          21           24
  9     33.9        23           23
 10     25          25           20
 11     24          25.5         24.5
 12     26.5        24.5         26.5
 13     27.5        22           28
 14     23          25.5         27
 15     25          27           23
 16     24          28           22.5
 17     23          27           26.5
 18     23          28           31
 19     22          25.5         22.5
 20     23          30           22.5
 21     23          29           27
 22     23          26.5         25
 23     23          29           26
 24     16          26.5         28
 25     16          27.5         21
 26      9          26           24
 27     11.5        25           22
 28      8.5        24           22
 29      8.5        23.5         22
 30     14          22           23
 31     23          22.5         27
 32     23          24           29
 33     20          24           25
 34     23          25           25
 35     23          23           22

On the basis of the data presented in the table, the following can be calculated: EWMA0 = 23.10 and σ0 = 4.87. In accordance with the method described above, and to justify the usage of EWMA statistics, it is important to determine the statistical independence of the samples, which will be examined by checking for correlation between the data.
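The computation of the historical values from Table 2 can be sketched as follows. This is an illustrative Python fragment; the paper does not state whether the population or the sample standard deviation is used, so the sample form (division by n − 1) is an assumption here.

```python
from math import sqrt

def historical_params(samples):
    """EWMA0 as the sample mean and sigma0 as the sample standard
    deviation of the collected local-maximum traffic values."""
    n = len(samples)
    mean = sum(samples) / n
    var = sum((x - mean) ** 2 for x in samples) / (n - 1)  # sample variance
    return mean, sqrt(var)
```

Applied to all 105 samples of Table 2, this kind of computation yields the EWMA0 = 23.10 and σ0 = 4.87 quoted above.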
For this purpose Pearson's correlation coefficient will be used, defined as the ratio of the covariance of two variables to the product of their standard deviations:

\[
\rho_{XY} = \frac{\mathrm{cov}(X, Y)}{\sigma_X \sigma_Y}, \qquad -1 \le \rho_{XY} \le 1 \tag{6}
\]

Different authors have proposed different ways of interpreting the correlation coefficient. Cohen [1] noted that all such criteria are to a greater or lesser extent arbitrary and should not be observed too strictly. Nevertheless, one frequently used interpretation, described in [16], is given below:

• ρ between 0 and 0.2: no correlation, or insignificant
• ρ between 0.2 and 0.4: low correlation
• ρ between 0.4 and 0.6: moderate correlation
• ρ between 0.6 and 0.8: significant correlation
• ρ between 0.8 and 1: high correlation

The value of the correlation coefficient ρXY can be calculated using the statistical function CORREL(array1, array2) in MS Excel. Examining the table above, three series of data can be identified (daily, weekly and monthly), and in this sense three different correlation coefficients can be calculated:

• correlation coefficient for the daily-weekly series: ρ1 = 0.28 → low correlation
• correlation coefficient for the daily-monthly series: ρ2 = 0.04
• correlation coefficient for the weekly-monthly series: ρ3 = −0.04

Besides testing the correlation coefficient within a single measurement, it is important to check for correlation between the corresponding periods of different measurements.
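Eq. (6) and the verbal scale of [16] can be sketched in Python as follows (an illustrative fragment; the paper itself used Excel's CORREL, and the function names here are hypothetical).

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of Eq. (6):
    cov(X, Y) / (sigma_X * sigma_Y), like Excel's CORREL(array1, array2)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sx = sqrt(sum((a - mx) ** 2 for a in x) / n)
    sy = sqrt(sum((b - my) ** 2 for b in y) / n)
    return cov / (sx * sy)

def interpret(rho):
    """Map |rho| to the verbal scale of [16]."""
    bands = ((0.2, "no or insignificant"), (0.4, "low"),
             (0.6, "moderate"), (0.8, "significant"), (1.0, "high"))
    for limit, label in bands:
        if abs(rho) <= limit:
            return label + " correlation"
```

For example, ρ1 = 0.28 falls in the "low correlation" band, while ρ2 = 0.04 and ρ3 = −0.04 indicate no correlation.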
For that purpose, the correlation coefficients of two daily (weekly, monthly) intervals are checked, and the following results are obtained:

• correlation coefficient for the daily-daily series: ρ4 = −0.15
• correlation coefficient for the weekly-weekly series: ρ5 = 0.11
• correlation coefficient for the monthly-monthly series: ρ6 = −0.02

As all the calculated coefficients indicate low or no correlation, it can be concluded that the data used are statistically independent and that the application of EWMA statistics is justified.

4 Network practice

The values EWMA0 and σ0 were calculated for a period of one month. It can reasonably be assumed that these values would differ in another monthly period of observation, bearing in mind the various unpredictable traffic situations and the accidental nature of network traffic as a process. Therefore, in the further phase of the research, the extent of change of these values is studied. The determination of the maximum changes of the characteristic traffic values (maximum and average) is based on the analysis of numerical data of several larger Internet users, obtained with the popular network software MRTG for periods of one day, one week and one month. Without loss of generality, the traffic curves of three users are shown below, noting that the observed traffic curves of the other users do not deviate significantly from the forms shown here.

Figure 2: Traffic curves of different users

The maximum and average values of traffic were calculated twice in the period of a month. The results are arranged in the following table (the two measurements per period and the relative difference between them):

Table 3: Differences in characteristic values of traffic

              daily 1  daily 2  diff.   weekly 1  weekly 2  diff.   monthly 1  monthly 2  diff.
              [Mb/s]   [Mb/s]   [%]     [Mb/s]    [Mb/s]    [%]     [Mb/s]     [Mb/s]     [%]
User 1  max    33.9     33.1    -2.4     29.7      33.4     12.4      9.7        9.8       1
        avg    16.5     19.1    15.8     17.0      21.1     24.1      6.01       6.6       9.8
User 2  max     3.94     3.63   -7.8      3.98      3.68    -7.5     48.2       49.2       2
        avg     2.35     2.09   -11       2.28      2.09    -8.3     30.9       30        -3
User 3  max     9.31    10.0     7.4      9.71      9.99     2.9      9.9        9.7      -2
        avg     5.71     6.01    5.2      5.63      6.64    17.9      5.4        4.9      -9.2
User 4  max     9.69     9.99    3.1     10.0       9.91    -0.9     10         10         0
        avg     4.96     5.14    3.6      5.2       4.94    -5        7.4        7.6       2.7
User 5  max    48.2     46.3    -3.9     48.5      45.2     -6.8      1.8        1.8       0
        avg    29       24.4   -15.9     30.4      26.4    -13.1      0.14       0.14      0
User 6  max    10.1     10.1     0       10.0      10.0      0        3.94       3.66     -7.1
        avg     7.78     7.95    2.2      7.43      8.14     9.6      1.9        2.03      6.8
User 7  max     3.98     3.97   -0.02     3.94      3.99     1.2      3.9        3.9       0
        avg     1.74     1.79    2.9      1.88      1.99     5.9      1.9        2         5.2

By analysing the numerical data it can be concluded that the maximum change of the mean value, p, is not greater than 25%; that is the largest deviation between the two measurements. Since the main idea of this research is to find the maximum and minimum tolerable EWMA values of the local maximums, the following values will be used in further calculations (for the accepted p = 0.25):

EWMA0max = (1 + p) EWMA0 ≈ 1.25 EWMA0
EWMA0min = (1 − p) EWMA0 ≈ 0.75 EWMA0

Similarly, σ0max ≈ 1.25 σ0 and σ²EWMA = (λ / (2 − λ)) σ²0max. The previously calculated values of the parameters EWMA0 and λopt, as well as the values of UCL and LCL from the first measurement, will be verified in different situations of daily, weekly and monthly traffic from the second measurement. The UCL value is given by EWMA0max + kσEWMA, while the LCL is EWMA0min − kσEWMA.
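The control limits described above can be computed as in the following sketch. The width k of the control band is not fixed in the text, so k = 3, the conventional default for EWMA control charts, is an assumption here.

```python
from math import sqrt

def control_limits(ewma0, sigma0, lam, k=3, p=0.25):
    """UCL = EWMA0max + k*sigma_EWMA and LCL = EWMA0min - k*sigma_EWMA,
    using the p-widened historical values described in the text."""
    ewma0_max = (1 + p) * ewma0        # tolerated upper historical mean
    ewma0_min = (1 - p) * ewma0        # tolerated lower historical mean
    sigma0_max = (1 + p) * sigma0
    sigma_ewma = sqrt(lam / (2 - lam)) * sigma0_max
    return ewma0_max + k * sigma_ewma, ewma0_min - k * sigma_ewma
```

With the first-measurement values EWMA0 = 23.10, σ0 = 4.87 and λopt = 0.75, this gives UCL ≈ 43.0 Mb/s and LCL ≈ 3.2 Mb/s.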
The following results are obtained. It is important to notice that, as the figures show, in the case of appropriately determined EWMA parameters there is no situation of threshold exceedance, which eliminates the appearance of false alarms.

Figure 3: Verification (daily traffic, second measurement)
Figure 4: Verification (weekly traffic, second measurement)

5 Conclusions

The aim of this research was to examine the possibility of applying EWMA statistics to intrusion detection in network traffic. The research has shown that direct application of this algorithm to computer network traffic, as applied in industrial processes, does not provide acceptable results. Namely, the frequently proposed values of the exponential smoothing factor may, in the case of network application of the algorithm, lead in some circumstances to false alarms, thus endangering the security level of the system. Owing to the lack, in the available publications, of an acceptably precise method for determining the initial value of the coefficient in the exponential smoothing procedure, this research has been directed towards establishing a relation between the choice of the initial value and the optimal smoothing value. By creating an appropriate application, a practical way of testing the impact of different parameter values on the level of anomaly detection was presented. This enabled a graphical presentation of the dependence of the output quantities on the input ones, all of which contributed to the proposed method for calculating the optimal value of the smoothing factor. Before statistical analysis of the traffic is started, the extent of autocorrelation between the data used has to be examined by calculating the correlation coefficients.
One of the important results is that analysis of the properties of network traffic based only on individual patterns of daily traffic is shown not to be recommended, because of the increased level of autocorrelation. For this reason, when calculating the historical parameters, network traffic must be viewed in a wider time context, taking into account the weekly and monthly periods.

Figure 5: Verification (monthly traffic, second measurement)

Using the network monitoring software, it is also necessary to determine the maximum variations of the basic traffic characteristics (average and maximum). To make this algorithm properly applicable in a network environment it is necessary to pre-process the historical data in order to obtain the initial values of the key parameters. Based on the obtained results it can be concluded that the choice of the EWMA parameters significantly affects the operation of this algorithm in a network environment. Therefore, the optimization of the parameters before the application of the algorithm is of particular importance.

Bibliography

[1] J. Cohen, Statistical Power Analysis for the Behavioral Sciences (2nd ed.), Lawrence Erlbaum Associates, Hillsdale, New Jersey, 1988.
[2] J.S. Hunter, The exponentially weighted moving average, Journal of Quality Technology, 18:203-210, 1986.
[3] J.M. Lucas, M.S. Saccucci, Exponentially weighted moving average control schemes: properties and enhancements, Technometrics, 32:1-29, 1990.
[4] S.W. Roberts, Control chart tests based on geometric moving averages, Technometrics, 1959.
[5] N. Ye et al., Computer intrusion detection through EWMA for autocorrelated and uncorrelated data, IEEE Transactions on Reliability, Vol. 52, No. 1, 2003.
[6] G. Fengmin, Deciphering detection techniques: Part II. Anomaly-based intrusion detection, white paper, McAfee Security, 2003.
[7] S. Sorensen, Competitive overview of statistical anomaly detection, white paper, Juniper Networks, 2004.
[8] V.A. Mahadik, X. Wu and D.S. Reeves, Detection of denial-of-QoS attacks based on χ2 statistic and EWMA control charts, http://arqos.csc.ncsu.edu/papers/2002-02-usenixsec-diffservattack.pdf
[9] A.S. Neubauer, The EWMA control chart: properties and comparison with other quality-control procedures by computer simulation, Clinical Chemistry, http://www.clinchem.org/cgi/content/full/43/4/594
[10] D. Seibold, Enterprise campus security: addressing the imploding perimeter, http://www.itsa.ufl.edu/2003/presentations/intsec.ppt
[11] V.A. Siris and F. Papagalou, Application of anomaly detection algorithms for detecting SYN flooding attacks, http://www.ist-scampi.org/publications/papers/siris-globecom2004.pdf
[12] J. Viinikka and H. Debar, Monitoring IDS background noise using EWMA control charts and alert information, http://viinikka.info/viideb2004.pdf
[13] Y. Zhao, F. Tsung and Z. Wang, Dual CUSUM control schemes for detecting a range of mean shifts, IEEE Transactions, http://qlab.ieem.ust.hk/qlab/download/papers/paper%2035.pdf, 2005.
[14] Engineering Statistics Handbook: EWMA control charts, http://www.itl.nist.gov/div898/handbook/pmc/section3/pmc324.htm
[15] Engineering Statistics Handbook: Single exponential smoothing, http://www.itl.nist.gov/div898/handbook/pmc/section4/pmc431.htm
[16] Savannah State University, Office of Institutional Research & Planning, http://irp.savstate.edu/irp/glossary/correlation.html
[17] P. Čisar, S. Maravić Čisar, A first derivate based algorithm for anomaly detection, International Journal of Computers, Communications & Control, 3(S):238-242, 2008.
[18] J. Mina, C. Verde, Fault detection for large scale systems using dynamic principal components analysis with adaptation, International Journal of Computers, Communications & Control, 2(2):185-194, 2007.
Petar Čisar was born on September 8, 1965. He graduated from the Faculty of Electrical Engineering in Belgrade and completed his master's study in information engineering at the Faculty of Economics in Subotica. He is currently working on his PhD thesis. His spheres of interest are mobile technologies and the development of security methods in network environments.

Saša Bošnjak was born on December 31, 1961. He is an associate professor of computer science at the Faculty of Economics in Subotica, where he teaches a range of courses in information engineering. His research interests are databases, software development, computer networks, reuse methodology, e-business and Internet technology. He earned a PhD degree in information systems at the Faculty of Economics in Subotica in 1995.

Sanja Maravić Čisar was born on August 3, 1970. She graduated from the Faculty of Electrical Engineering in Belgrade and completed her master's study at the Technical Faculty in Zrenjanin. She lectures at Subotica Tech in the following courses: visual programming, object-oriented programming, Java and multimedia systems. She is currently working on her PhD thesis.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 819-824

Adaptive Web Applications for Citizens' Education. Case Study: Teaching Children the Value of Electrical Energy

I. Moisil, S. Dzitac, L. Popper, A. Pitic

Ioana Moisil, Alina Pitic
Lucian Blaga University of Sibiu, Romania
E-mail: im25sibiu@gmail.com

Simona Dzitac, Laurentiu Popper
University of Oradea, Romania
E-mail: simona.dzitac@gmail.com, director@perfect-service.ro

Abstract: "The foundation of every state is the education of its youth." (Diogenes Laertius) Long-term energy saving and the reduction of the environmental consequences of energy consumption are among the most challenging objectives of our time. People are prone to routine and habit, and changing these habits is almost a Sisyphean task.
In spite of the continuous efforts of environmental specialists, we are witnessing an increase in electricity (and gas) consumption, at least at the level of households. Studies have shown that consumers in many cases behave irrationally. To correct this, researchers are studying consumers' decision-making behaviour and trying out several intervention measures. In this paper we present the design and development of a web-based adaptive system aimed at educating citizens towards an electrical-energy-saving behaviour. The system is composed of three subsystems: the adaptation system, the user's profile and the knowledge base. We have used a user-centred design approach. For adults, users' profiles are built taking into account age group, educational level, gender, income, professional aspects and consuming behaviour; a set of questionnaires has been designed to collect users' data. For children, the standard profiles are more complicated and, depending on the age group, can be obtained offline through interviews and/or through online activities (games, quizzes, etc.). The knowledge base is built for the electrical energy domain. The adaptation subsystem presents information to the user based on his or her profile. The system is populated with data for users of 6 to 10 years of age. For this user group a social and affective interaction design approach was used.

Keywords: adaptive web, electrical energy saving, citizens' education, interaction design, user-centred design.

1 Introduction

The European Union (EU) has established that reducing energy consumption and eliminating energy wastage are among the main goals to be achieved in the near future. "At the end of 2006, the EU pledged to cut its annual consumption of primary energy by 20% by 2020. To achieve this goal, it is working to mobilise public opinion, decision-makers and market operators and to set minimum energy efficiency standards and rules on labelling for products, services and infrastructure."
[1] It seems more than natural, in the information society, to use the web as a means of making people aware of the energy saving problem. In fact there are many web sites with information about ways of reducing energy consumption [1-6].

Copyright © 2006-2010 by CCC Publications

Analysing the content of these web presentations, we found that, though very informative and scientifically correct, they do not always meet users' expectations, being too serious and too boring. This was the opinion of 56% of 200 users aged 6 to 22 investigated in 2009. It is clear that the different actors that have a say in the process of reducing energy consumption must be approached in different ways. It is also important to note that this is a problem of changing behaviour and mentality, which is a very sensitive process. With this in mind, we have designed and developed a web-based adaptive system aimed at educating citizens towards an electrical-energy-saving behaviour. The system is composed of three subsystems: the adaptation system, the user's profile or model, and the knowledge base (domain model). We have used a user-centred design approach. For adults, users' profiles are built taking into account age group, educational level, gender, income, professional aspects and consuming behaviour; a set of questionnaires has been designed to collect users' data. For children, the standard profiles are more complicated and, depending on the age group, can be obtained offline through interviews and/or through online activities (games, quizzes, etc.). The knowledge base is built for the electrical energy domain. The adaptation subsystem presents information to the user based on his or her profile. At present the system is populated with data for users of 6 to 10 years of age.

2 Design considerations

In this paper we present a first case study with the educational software for electrical energy saving.
The case study considers users of 6 to 10 years of age. For this category of users we used a hybrid user-centred design methodology, blending different kinds of user-centred design with interaction design (social interaction, affective interaction) and participatory design, and taking into account the learning objectives and the learners' age and preferences. The design research phase had two steps. First we identified the need for computer-based educational products, the interest of the users in computers, the level of satisfaction with other computer-based educational products, which characteristics of human-computer interaction they prefer (sounds, colours, mediating agents), and the general level of computer literacy. We also investigated users' behaviour towards the educational/formative software market (do they buy educational software, and if so, on what subjects, who is the buyer, how often, etc.). The second step aimed to investigate users' behaviour towards electrical energy consumption (how interested they are, whether they know what electricity is, how they perceive other educational software on this topic, whether their parents are concerned about energy saving, etc.). 130 young pupils from both rural and urban locations participated in the design: 63% girls and 37% boys, aged 6 to 10 years. 84% had a computer at home and 82% had an Internet connection.

• Children in the first and fourth grade (6 and 10 years old) are at the beginning of an educational cycle and therefore more exposed to change, with distributed interests, and are less interested than the others (50% are satisfied by the use of computers in class).
• 75% of the subjects that have a computer at home are very interested in using educational software, but only 40% of those without a computer at home showed maximum interest.
• 86% of the subjects prefer computer games, and 80% also like cartoons, movies, etc.; 66% of the girls choose stories, but only 43% of the boys; attraction to music grows with age, from 60% at the age of 6 to 85% at the age of 10.
• Most parents restrict their children's access to the Internet.
• Interest in something new decreases with age, from 93% at the age of 7 to 33% at the age of 11.
• The influence of the family environment is very important.
• 27% of the subjects voted for a software-based mediator agent.
• 94% of the subjects prefer software with sound incorporated.
• For 57% of the subjects software products are provided by parents; 27% buy these products themselves.
• 87% of the children do not know the most important sources of electrical energy ("electricity comes from the cable").
• 73% of the subjects appreciate the idea of quizzes on how to save electrical energy.
• Children with intellectual parents are more aware of the aspects of energy saving, especially those linked to global warming. Children from rural areas are more concerned about the cost of energy than about global warming; some of them think that this is not their problem.
• 76% of the subjects are attracted by computers.

Based on these results it was decided how to design the software interface and how to structure the content. The software has to use the children's preferred colours, to have sound facilities, and to enable the use of the mouse but also of the keyboard; the concepts introduced have to be transparent, with the children able to navigate freely through the software; games have to be included to make the product more attractive and less boring; and the mediator has to be included in the software. All these data were obtained during working meetings with the children.
For example, we discussed the video clip Energy, Let's Save It! [1]. The video clip was found attractive by children 8 to 10 years of age; for the others it was not interesting. One observation was that "it goes too quickly": in fact the video clip presents, in less than 4 minutes, a global view of energy waste in day-to-day activities. The presentation is user-friendly and attractive, and it raises worries, but no more. Another observation, concerning a different web application, was that "it sounds like Papa!" (citing: "Don't leave lights on when no one is in the room. If you are going to be out of the room for more than five minutes, turn off the light." [2]). So it was very clear that the site had to be designed taking the children's preferences into account. As a result we structured the web application in four main chapters:

1. What is electricity
   a. Short history
   b. Experiments
   c. Sources of electrical energy
      i. Classical
      ii. Alternative
2. Using and saving electrical energy
3. Games, activities, quizzes, puzzles, enigmas
4. Resources for teachers, parents and educators

The system architecture is composed of three subsystems: the adaptation system, the user's profile or model, and the knowledge base (domain model). The domain model for electrical energy is a set of concepts, each concept consisting of a set of topics. Topics are linked to each other, thus forming a semantic network. A topic is usually presented on a web page. A page is divided into sub-pages (chunks) that differ depending on the user type [7-9]. The adaptation module provides different interfaces and contents depending on the user profile (model). For the moment each page has two implemented variants: users in the age group 6 to 10, and children older than 10 years. We used images of plants and trees at the beginning because these were interesting for children of all ages (Fig. 2).

Figure 1: System architecture.
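The selection of a page variant by user profile can be illustrated with a minimal sketch. This is hypothetical Python; the real system is implemented in PHP/Flash with the chunks stored in MySQL, and the topic key and variant texts below are invented for illustration.

```python
# Hypothetical profile-driven chunk selection: each topic has one variant
# per age group, mirroring the two implemented variants described above.
PAGE_VARIANTS = {
    "what_is_electricity": {
        "6-10": "Short story with pictures, voice-over and a quiz.",
        "10+":  "Short history, experiments and energy-source diagrams.",
    },
}

def select_chunk(topic, age):
    """Pick the page variant matching the user's age group (6-10 vs. older),
    as the adaptation module does from the user profile."""
    group = "6-10" if 6 <= age <= 10 else "10+"
    return PAGE_VARIANTS[topic][group]
```

In the deployed system the profile (reached through the user name/password link) would supply the age, and the chunk would be fetched from the database rather than from an in-memory dictionary.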
Figure 2: (2a and 2b)

When accessing the web application the user is identified by a user name and a password; this is in fact the link to the user profile. The user can navigate freely in the site, but can also choose a certain topic by looking at the icons on the screen. For example, in Fig. 4 there are several icons (buttons) giving access to information on electrical energy sources. Each button is also explained by voice. The children can experience the effect of using too much light (Figs. 5 and 6).

Figure 3: Access to the web application.
Figure 4: Buttons giving access to information on electrical energy sources.
Figure 5: All electrical bulbs are off (the tree has all its leaves).
Figure 6: Four electrical bulbs are on (the tree has lost most of its leaves).

The client is implemented as web pages with a number of linked frames. We used Flash, PHP, MySQL and JavaScript, which made it possible to overcome HTML limitations.

Evaluation. The current interface was evaluated, from the point of view of presentation and functionality, by the children participating in the design and also by 34 other children (6 to 11 years of age). 92% were very satisfied with the web application, 7% would have liked more games, and 1% declared that he was not interested in the subject. Two teachers and two parents also participated in the evaluation process. The first appreciation was positive.

3 Conclusion and further research

In this paper we have described the design and development of an adaptive web-based system for citizens' education with respect to reducing electrical energy consumption. The system is a complex one; the core system and several modules are implemented. We have presented a first case study with the educational software for electrical energy saving, considering users of 6 to 10 years of age.
For this category of users we used a hybrid user-centred design methodology, blending different kinds of user-centred design with interaction design (social interaction, affective interaction) and participatory design, and taking into account the learning objectives and the learners' age and preferences. The results of the first evaluations were positive. Future research will continue to develop the system with other users' profiles (other age categories, in the first stage) and will evaluate the educational effectiveness of the system's adaptation.

Bibliography

[1] http://ec.europa.eu/energy/efficiency/index_en.htm
[2] http://www.energyquest.ca.gov/saving_energy/index.html
[3] http://tonto.eia.doe.gov/kids/energy.cfm?page=3
[4] http://www.kyotoinhome.info/
[5] www.eais.info
[6] http://www.oppapers.com/subjects/energy-saving-page1.html
[7] Ayersman, D.J. & Minden, A.V. (1995). Individual differences, computers, and instruction. Computers in Human Behavior, 11(3-4), 371-390.
[8] Brusilovsky, P. (1996). Methods and techniques of adaptive hypermedia. User Modeling and User-Adapted Interaction, 6, 87-129.
[9] Jonassen, D. & Wang, S. (1993). Acquiring structural knowledge from semantically structured hypertext. Journal of Computer-Based Instruction, 20(1), 1-8.
[10] Liu, Y., Ginther, D. (1999). Cognitive styles and distance education. On-line Journal of Distance Learning Administration, 2(3).
[11] C.Y. Huang, T.T. Yang, W.L. Chen, S.Y. Nof, Reference architecture for collaborative design, Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 1, pp. 71-90.
[12] Antonios Andreatos, Virtual communities and their importance for informal learning, International Journal of Computers, Communications & Control, Vol. II (2007), No. 1, pp. 39-47.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 4, pp.
53-60

An Iterative Method for the Design Process of Mode Handling Model

Nadia Hamani, Nathalie Dangoumau, Etienne Craye

Abstract: This paper focuses on the formal verification and validation of a model dedicated to mode handling of flexible manufacturing systems. The model is specified using the synchronous formalism Safe State Machines. A structured framework for the design process is presented. The obtained model is characterized by strong hierarchy and concurrency; that is why, within the design process, an iterative approach for specification, verification and validation is proposed in order to improve this process. The main properties to be verified are presented, and the approach is illustrated through an example of a manufacturing production cell.

Keywords: flexible manufacturing systems, supervision, mode handling, functional and behavioral modeling, verification, validation.

1 Introduction

According to our design approach for fault-tolerant control systems, mode handling is a function of supervision. In view of a disturbance (failures, breakdowns), mode handling allows implementing the decisions about mode and configuration changes. The design of the mode handling function requires a model representing the operating modes of the production system and its subsystems. To this aim, it is important to use an adequate modeling method and powerful specification formalisms; those characterizing the most significant approaches are compared in [9]. In our early work we proposed a modeling approach for reactive mode handling of flexible manufacturing systems (FMS) [10]. This approach is based on functional modeling [11] and a synchronous reactive approach using Safe State Machines (SSM) [1][2]. Owing to the increasing complexity and flexibility of FMS, mode changing raises problems if coherence and safety constraints are not taken into account in the specification/modeling stages.
So it is necessary to verify and validate the proposed models in the early stages of the design process. In recent years, increasing attention has been devoted to formal verification methods, which guarantee that several properties are satisfied for all possible evolutions of a model. For our modeling approach, one of the interests of using the SSM formalism lies in its strong semantics, which makes it possible to use analysis tools to formally prove that the behavior of the system respects certain properties. The purpose of this paper is to present the iterative approach used in the design process of the model dedicated to mode handling. The paper is organized as follows. In Section 2 the basic concepts of V&V are recalled and the design process based on formal methods is introduced. The stages of this design process, including V&V, are detailed in Sections 3 and 4; the iterative approach is introduced and the main properties to be verified within the design process are presented. Finally the approach is illustrated using an example of a flexible manufacturing cell.

2 The design process

A design process should involve many intermediary stages of V&V. This enables specification and modeling errors to be detected and eliminated as early as possible, so that modifications are carried out at a lower cost, particularly in the final step. Indeed, V&V characterize any correct development process. According to [14], verification is the proof that the internal semantics of a model is correct, independently of the modeled system, whereas validation determines whether the model corresponds to the intentions of the designer formulated in the functional requirements.

Copyright © 2006 by CCC Publications

The required properties are either generic, depending on the formalism used, or specific to the modeled system. V&V consists of analyzing, with analysis tools, the properties that should be satisfied by the final model.
Two complementary methods exist: simulation and formal verification. In our study we are interested only in formal methods for V&V. For formal verification, we have to build a formal model representing the behavior of the system and formal specifications of the properties. Fig. 1 shows the design process including V&V: the formal specification, as well as the structured modeling process that we recall in the next section, helps ensure a correct development process. V&V of the obtained model is an iterative approach characterized by corrections, which require reconsidering the specification/modeling stages as needed. Based on the verified and validated model, the implementation is ensured through automatic code generation.

Figure 1: The design process

3 Specification/modeling

The functional requirements provide the informal specifications about the mode handling function: the intended behavior of the FMS and the properties that should be satisfied. Other properties, such as determinism, are also mandatory. The designer formalizes those informal requirements to provide formal specifications of the model, of the required properties and, if necessary, some assumptions about the environment. The formalization task provides the following models.

3.1 The formal model representing the behavior of the system

The specifications of the model dedicated to FMS mode handling are detailed in our earlier work [10][11]. We recall below the main characteristics of the modeling process.

The FMS subsystems. In our approach, an operating sequence characterizes each part. The specification method of FMS subsystems was described in [11]. The idea is to decompose the FMS and identify the functional subsystems which take part in the realization of its missions. A mission corresponds to a set of operating sequences that the system should produce simultaneously.
The FMS functional model is obtained by a hierarchical decomposition leading to the elementary machining and transfer operations that the FMS performs. The structural aspect completes this specification by associating the resources with the elementary operations they perform. The obtained subsystems are organized in six layers, as shown in fig. 2. The functional representation is a graph composed of functional subsystems (called entities) linked by logical relationships (AND, OR). OR nodes correspond either to an inclusive or an exclusive logical operator according to the constraints given in the functional requirements.

Figure 2: The FMS functional model

Behavior specification. Given the FMS functional model, one should provide behavioral specifications. It is necessary to represent the activation and deactivation mechanisms at all the hierarchy levels of the specification, as well as the availability of the resources and the functions of the FMS. The activation/deactivation of an entity is represented by the working mode. The functioning mode represents the availability/unavailability of an entity. The aim is to handle concurrently the information flows downwards, to transmit high-level control and reconfiguration orders, and upwards, to follow up the reports and failures detected by the monitoring function. The reactivity needed for this bi-directional exchange of information, together with the characteristics of concurrency and preemption, requires a synchronous approach. The behavior of the identified FMS subsystems and the logical relationships between them are specified using the synchronous formalism SSM [1][2]. SSM inherits many features from Statecharts [12] but offers several forms of preemption and benefits from a stricter semantics, fully compatible with that of the Esterel synchronous language [4].
SSM also supports, with a very rigorous semantics, hierarchy, communication, concurrency and various forms of preemption, which characterize our modeling approach. In addition, SSM takes advantage of an industrial development environment [7], which provides the necessary tools for a design process. The behavior of each entity of the model, whatever its level, is represented by an SSM model. This model allows knowing, at any time, whether an entity belonging to the current mode of the FMS is in normal state e_n, degraded state e_d or out-of-order state e_out from the functioning mode point of view, and whether it is activated (x_active) or inactive, i.e. in stopping state (e_off), from the working mode point of view. The change of state of the entities belonging to successive levels is carried out according to precise rules which depend on the kind of logical relation and the number of child entities. Several cases are studied in [10]. For the representation of the model, we proposed some reference models specified using SSM [10]. These are generic models depending on the number of child entities and the logical relationships that connect them. The reference models are instantiated for each entity of the model. The proposed method is iterative: the models of level i are specified, then the SSM of level i is encapsulated in the SSM of level i+1, until the specification of the model is complete (see fig. 3). The encapsulation of the models enables representing the hierarchical levels which characterize the model. The aim is to improve the legibility of the specification/modeling stage. The hierarchy is represented using the encapsulation of successive levels, whereas the entities belonging to the same level are separated by dashed lines; these are used to represent concurrency in the SSM syntax (see [2]).
Figure 3: The modular specification

3.2 The formal specification of properties

Predictability and dependability are the main characteristics of reactive systems. In our work we consider only properties of logical behavior, such as determinism and safety properties. As we adopt a reactive synchronous approach based on the SSM formalism, the generic properties of reactivity and determinism should be verified. The rigorous semantics of SSM make it possible to provide formal verification mechanisms ensuring such properties. Several properties specific to the mode handling function can be verified. The main properties are presented in section 4.

4 Verification and validation

In the verification stage (fig. 1), one checks if the model satisfies the generic properties. SSM specifications are automatically translated into an Esterel program, which is compiled into a system of Boolean equations (logical circuits), an implicit form of finite state machines (FSMs). The properties of logical correctness of Esterel programs can then be verified on this system. The analysis technique used to this aim is the constructive causality analysis carried out by the Esterel compiler v5_x. Causality analysis is a semantic analysis which allows accepting or rejecting Esterel programs according to whether they are constructive or not [5]. The processor which implements the algorithms of constructive analysis builds the reachable state space using symbolic techniques based on binary decision diagrams (BDD). In the validation stage, one checks if the model satisfies the specific properties formalized in the specification stage. The analysis techniques used are model-checking and/or theorem proving. Several model-checking tools were developed for the Esterel language. For instance, the built-in verifier tool, an evolution of Xeve (xEsterel verifier) [6], is currently integrated into Esterel Studio.
Xeve takes as input the system of Boolean equations and builds the reachable state space using BDD (symbolic model-checking [13]) in order to check the statuses of the output signals of the model or of the observers [8] representing the properties. Failure of the V&V stage implies that there is a specification or a modeling error. It is then necessary to go back over the model as needed. The formal specifications may be incorrect: the errors lie in the specification of the model, of the property, of the correspondence between the model and the property, or of the assumptions related to the environment. If the specification/modeling stages are considered to be correct, then we should reconsider the informal requirements. Indeed, within the formal specification of the model or the properties, errors may be caused by the ambiguity or the incompleteness of the informal requirements provided in the functional requirements. In this case, these requirements must be improved and formalized again. We propose to complete the modular and hierarchical specification shown in fig. 3 by introducing an iterative method of V&V. This method is presented in the following. The aim is to reduce the specification/modeling errors and to enable their early correction by introducing intermediate stages of V&V into the multi-level specification stage.

4.1 An iterative method for V&V

We propose at first to check each single reference model used for the specification of the model. The aim is to prove reactivity and determinism. The verification of these reference models is performed using the analysis tools integrated into the Esterel Studio environment.

The single verification of the reference models:
for each reference model:
  write the model;
  verify the model;
  if the verification fails, correct the specification and repeat the verification as needed.
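The verify-correct loop just stated can be sketched as a small driver. This is an illustrative sketch only, not the authors' tooling: `verify` stands in for the Esterel Studio analysis step and `correct` for the manual repair of the specification, both modeled here as plain callables.

```python
def check_reference_models(models, verify, correct):
    """Single verification of the reference models: verify each one,
    correcting the specification and repeating as needed.

    `models` is a list of reference model specifications; `verify`
    returns True when the model passes (reactivity, determinism);
    `correct` returns a repaired specification.  All names are
    illustrative stand-ins for the manual/tool-assisted steps.
    """
    checked = []
    for m in models:
        while not verify(m):
            m = correct(m)      # fix the specification, then verify again
        checked.append(m)
    return checked
```

In practice the loop body corresponds to editing the SSM in the graphical editor and rerunning the analysis; if `correct` cannot make a model pass, one has to go back to the informal requirements, as discussed above.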
The single reference models checked in this way are instantiated in order to obtain the behavioral models of the FMS subsystems. The integration of these models then allows the whole model to be built progressively. Using the SSM graphical editor, the best way is to follow a bottom-up approach based on the encapsulation of the SSM of level i in the SSM of level i+1. At each level we should make sure that the intermediary SSM satisfies some properties, so that the accumulation of errors is avoided. That is why we propose to associate with this specification an iterative method of V&V. If the V&V does not converge, then it is necessary to correct the specification and to repeat the V&V process as needed. Indeed, V&V is carried out in an iterative way until the whole model (the final SSM) is verified and validated. The method described in the following is represented in fig. 4.

Figure 4: The iterative V&V

repeat for all the models of level i (i = 0, ..., n):
  (1) write the models of level i; the specification is carried out by instantiation of the predefined and verified models;
  (2) verify then validate the SSM of level i; if the V&V fails, correct the specification and repeat the previous step as needed; if i = n, go to (4);
  (3) otherwise, encapsulate the models of level i (the SSM of level i+1 is then created); i = i + 1;
end of the procedure
(4) the SSM represents the FMS model (the top level corresponds to the whole model).

If the previous steps converge, we conclude that the final SSM is verified and validated. When the final SSM is verified and validated, the code can be generated. The properties being verified within this process are presented in the following.

4.2 The properties

The model dedicated to mode handling is translated automatically into the Esterel language. The Esterel compiler is composed of several processors for type checking and for syntactical and lexical analysis.
The semantic analysis, also called causality analysis, allows checking the generic properties (deadlock freeness, reactivity, determinism).

Deadlock freeness. Causality analysis makes it possible to prove that the program is causal according to the constructive causality defined by Berry [5]. The constructive causality of Esterel programs ensures deadlock freeness, which characterizes synchronous languages. Deadlocks are due to the synchronism assumption (see causality problems in [3]).

Reactivity and determinism. If an Esterel program is constructive, then it is reactive and deterministic. A program is said to be reactive if it provides a well-defined solution for each input; it is deterministic if this solution is unique [5].

The aim of the specific properties is to ensure the coherence and safety constraints of mode handling. We have proposed three kinds of safety properties: mode reachability, the mission uniqueness of an FMS and the mutual exclusion of incompatible modes.

States reachability. We can check the reachability of the states specified by the model (according to the specifications, an output signal is associated with each control state of the model). To this aim, the statuses of the output signals that correspond to the control states of the model are tested.

The mission uniqueness of an FMS. This property ensures the operation safety of the FMS according to the selected mission. Indeed, selecting a mission implies performing some operations and activating the resources which take part in this mission. Thus, the operations and the resources which do not take part in the current mission should not be activated, for safety and coherence reasons. The property is guaranteed by construction of the model.

Incompatibility of the modes. In order to ensure mode coherence, the proposed specifications guarantee the mutual exclusion of incompatible states (each state belonging to a distinct mode) of the entities of the model.
This concerns the mutual exclusion of the working states of the entities related by the logical relation XOR (for example, redundant resources). The previous properties are verified within the iterative approach of V&V. The intermediate verifications are important because, if causality cycles are detected early, their correction will be easier. Contrary to verification, intermediate validations may be expensive, and some properties are checked only on the final model, in particular the properties related to the uniqueness of the FMS mission. In addition to the main properties, other properties of the mode handling model can be checked. They deal with the conditions of mode changing and the respect of the mode sequence [10].

5 Illustration

We have applied our approach to the example of the cell represented in fig. 5. This cell consists of two machines M1, M2 and input/output buffers (I/O). The transport system is composed of two robots R1, R2. We suppose that M1 is loaded by R1 and M2 is loaded by R2; the latter will be used to load the two machines when R1 is failing. The machining functions performed by this system are turning and milling. Turning (T) is carried out by M1, milling (F) by M1 or M2. The cell has three missions, m1, m2 and m3. The corresponding operating sequences are respectively OS1 (T), OS2 (F), and then OS1 and OS2 simultaneously.

Figure 5: Illustration example

The model obtained using our modeling method is specified using 4 reference models and 29 instances, for 809 lines of Esterel code generated from the SSM models. This model needs 34 input signals and 145 output signals. We first simulated some behavior scenarios in order to correct the specification errors, so that the intended behavior conforms to the functional requirements. The interactive simulator Xes [7] is used.
The studied cell can be extended by adding machining resources (simple or polyvalent) and redundancies in the transport system (robots, conveyor). This allows studying the problem with increasing complexity. For the specification, adding or removing reference or instantiated models enables the changes to be integrated easily into the model, thanks to the modularity and the hierarchy of the specification approach. However, due to the increasing complexity of the cell, some tests performed by the model-checker could not be completed (out-of-memory problem). We will try to solve this problem in future work.

6 Summary and conclusions

This paper deals with V&V within the design process of a model dedicated to mode handling of FMS. The first contribution of this study is a structured process for carrying out V&V. It is an extension of our control system design approach in order to take the V&V stage into account. An iterative method is used for the specification and V&V stages of the design process. The aim is to refine the model construction by introducing intermediate steps of V&V for an early correction of errors. Within the proposed framework, the required properties (determinism, reachability, mutual exclusion of incompatible modes, ...) are given. The analysis tools integrated into Esterel Studio are used in this process. The proposed approach can be extended to deal with environment constraints in the specification stage. In addition, further work aims at studying some examples of increasing complexity.

References

[1] C. André, Representation and analysis of reactive behaviors: a synchronous approach, in Proc. of the IEEE-SMC Computational Engineering in Systems Applications (CESA 96), Lille, France, July 1996, pp. 19-29.
[2] C. André, Semantics of SSM (Safe State Machines), Guyancourt, France, April 2003, available at: http://www.esterel-technologies.com
[3] A. Benveniste, P. Caspi, S. Edwards, N. Halbwachs, P. Le Guernic, and R.
De Simone, The synchronous languages twelve years later, Proc. of the IEEE, vol. 91(1), special issue on embedded systems, pp. 64-83, 2003.
[4] G. Berry, G. Gonthier, The Esterel synchronous programming language: design, semantics, implementation, Science of Computer Programming, vol. 19(2), pp. 87-152, 1992.
[5] G. Berry, The constructive semantics of pure Esterel, draft version 3, 1999, available at: http://www.esterel-technologies.com
[6] A. Bouali, Xeve, an Esterel verification environment, in Proc. 14th Int. Conf. on Computer Aided Verification (CAV'98), LNCS, UBC, Vancouver, Canada, June 1998.
[7] Esterel Studio(TM), available at: http://www.esterel-technologies.com
[8] N. Halbwachs, F. Lagnier and P. Raymond, Synchronous observers and the verification of reactive systems, in Proc. of AMAST'93, 1993.
[9] N. Hamani, N. Dangoumau, E. Craye, A comparative study of mode handling approaches, in Proc. of the 35th International Conference on Computers & Industrial Engineering (CIE 05), Istanbul, Turkey, June 19-22, 2005.
[10] N. Hamani, A contribution to modeling and verification for mode handling of manufacturing systems, PhD dissertation, Ecole Centrale de Lille, France, 2005 [in French].
[11] N. Hamani, N. Dangoumau, E. Craye, Functional modeling for mode handling of flexible manufacturing systems, in Proc. 12th IFAC Symposium on Information Control Problems in Manufacturing, Saint-Etienne, France, 2006.
[12] D. Harel, Statecharts: a visual formalism for complex systems, Science of Computer Programming, vol. 8, no. 3, pp. 231-275, 1987.
[13] K.L. McMillan, Symbolic Model Checking, Kluwer Academic Publishers, 1993.
[14] J.M. Roussel and J.J. Lesage, Validation and verification of Grafcet using finite state machines, in Proc. of IMACS-IEEE Multiconference on Computational Engineering in Systems Applications, Lille, France, 1996, pp. 758-764.
Nadia Hamani, Nathalie Dangoumau, Etienne Craye
Laboratoire d'Automatique, Génie Informatique et Signal (LAGIS), CNRS UMR 8146
Ecole Centrale de Lille, BP. 48, Villeneuve d'Ascq, F-59650 France
E-mail: nadia.hamani@ec-lille.fr
Received: November 6, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. VI (2011), No. 1 (March), pp. 158-165

Heuristic Algorithms for Solving the Generalized Vehicle Routing Problem

P.C. Pop, C. Pop Sitar, I. Zelina, V. Lupşe, C. Chira

Petrică Claudiu Pop, Ioana Zelina, Vasile Lupşe
North University of Baia Mare, Department of Mathematics and Informatics
Romania, V. Babeş, 430083, Baia Mare
E-mail: petrica.pop@ubm.ro, ioanazelina@yahoo.com, vasilelupse@yahoo.co.uk

Corina Pop Sitar
North University of Baia Mare, Department of Economics
Romania, V. Babeş, 430083, Baia Mare
E-mail: sitarcorina@yahoo.com

Camelia Chira
"Babes-Bolyai" University of Cluj-Napoca
Romania, M. Kogalniceanu, 400084, Cluj-Napoca
E-mail: cchira@cs.ubbcluj.ro

Abstract: The vehicle routing problem (VRP) is one of the most famous combinatorial optimization problems and has been intensively studied due to its many practical applications in the fields of distribution, collection, logistics, etc. We study a generalization of the VRP called the generalized vehicle routing problem (GVRP): given a partition of the nodes of the graph into clusters, we want to find optimal routes, starting and ending at the given depot, that include exactly one node from each cluster. The purpose of this paper is to present heuristic algorithms that solve this problem approximately. We present constructive algorithms and local search algorithms for solving the generalized vehicle routing problem.
Keywords: network design, combinatorial optimization, generalized vehicle routing problem, heuristic algorithms.
1 Introduction

Combinatorial optimization is a lively field of applied mathematics, combining techniques from combinatorics, linear programming and the theory of algorithms to solve optimization problems over discrete structures. The study of combinatorial optimization owes its existence to the advent of the modern digital computer. Most currently accepted methods of solution to combinatorial optimization problems would hardly have been taken seriously 30 years ago, for the simple reason that no one could have carried out the computations involved. Moreover, the existence of digital computers has also created a multitude of technical problems of a combinatorial character. Combinatorial optimization problems can be generalized in a natural way by considering a related problem defined relative to a given partition of the nodes of the graph into node sets, while the feasibility constraints are expressed in terms of the clusters. In this way, the class of generalized combinatorial optimization problems is introduced. In the literature one finds generalized problems such as the generalized minimum spanning tree problem [15], the generalized traveling salesman problem, the generalized vehicle routing problem, the generalized (subset) assignment problem, etc. These generalized problems belong to the class of NP-complete problems, are harder than the classical ones and are nowadays intensively studied due to their interesting properties and applications in the real world, even though many practitioners are reluctant to use them for practical modeling problems because of the complexity of finding optimal or near-optimal solutions. The generalized vehicle routing problem (GVRP) is an extension of the vehicle routing problem (VRP) and was introduced by Ghiani and Improta [4].
The GVRP is the problem of designing optimal delivery or collection routes, subject to capacity restrictions, from a given depot to a number of predefined, mutually exclusive and exhaustive node sets (clusters). The GVRP can be viewed as a particular type of location-routing problem (see, e.g., Laporte [7], Nagy and Salhi [10]) for which several algorithms, mostly heuristics, exist. Ghiani and Improta [4] showed that the problem can be transformed into a capacitated arc routing problem (CARP), and Baldacci et al. [1] proved that the reverse transformation is valid. Recently, Pop [14] provided a new efficient transformation of the GVRP into the classical vehicle routing problem (VRP). In 2003, Kara and Bektas [5] proposed an integer programming formulation for the GVRP with a polynomially increasing number of binary variables and constraints, and in 2008 Kara and Pop [6] presented two integer linear programming formulations for the GVRP with O(n^2) binary variables and O(n^2) constraints, where n is the number of customers, which are partitioned into a given number of clusters. As far as we know, the only specific algorithm for solving the GVRP was developed by Pop et al. [13] and was based on ant colony optimization. The complexity of obtaining optimal or even near-optimal solutions for generalized combinatorial optimization problems may lead to the development of:
• efficient transformations of the generalized combinatorial optimization problems into classical combinatorial optimization problems [4, 13];
• heuristic and metaheuristic algorithms [11].
The aim of this paper is to describe three classes of heuristic algorithms for approximately solving the generalized vehicle routing problem.

2 Definition of the GVRP

Let G = (V, A) be a directed graph with V = {0, 1, 2, ..., n} as the set of vertices and the set of arcs A = {(i, j) | i, j ∈ V, i ≠ j}. A nonnegative cost c_ij is associated with each arc (i, j) ∈ A.
The set of vertices (nodes) is partitioned into k + 1 mutually exclusive nonempty subsets, called clusters, V_0, V_1, ..., V_k (i.e. V = V_0 ∪ V_1 ∪ ... ∪ V_k and V_l ∩ V_p = ∅ for all l, p ∈ {0, 1, ..., k} with l ≠ p). The cluster V_0 has only one vertex, 0, which represents the depot, and the remaining n nodes, belonging to the remaining k clusters, represent geographically dispersed customers. Each customer has a certain amount of demand, and the total demand of each cluster can be satisfied via any of its nodes. There exist m identical vehicles, each with a capacity Q. The generalized vehicle routing problem (GVRP) consists in finding minimum total cost tours starting and ending at the depot, such that each cluster is visited exactly once, the entering and leaving nodes of each cluster are the same, and the sum of all the demands of any tour (route) does not exceed the capacity Q of the vehicle. An illustrative scheme of the GVRP and a feasible tour is shown in the next figure.

Figure 1: An example of a feasible solution of the GVRP (clusters V_0, ..., V_5 with the demands of their nodes; m = 2 and Q = 25)

Figure 1 presents a feasible solution consisting of a collection of two tours (routes), 0-3-5-0 and 0-11-7-6-0, satisfying the capacity restrictions and the condition that exactly one node is visited from each cluster. The cost of this feasible solution is obtained by summing the costs of the arcs belonging to the selected tours. The GVRP reduces to the classical vehicle routing problem (VRP) when all the clusters are singletons, and to the generalized traveling salesman problem (GTSP) when m = 1 and Q = ∞. The GVRP is NP-hard because it includes the generalized traveling salesman problem as a special case when m = 1 and Q = ∞. Several real-world situations can be modeled as a GVRP.
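The feasibility conditions just stated (routes start and end at the depot, exactly one node visited per cluster, vehicle capacity Q respected) and the cost of a solution can be checked mechanically. The sketch below is illustrative only and uses hypothetical data structures not taken from the paper: a `cluster_of` map from node to cluster index, a `demand` map and a cost matrix `c`.

```python
def is_feasible(routes, cluster_of, demand, Q, depot=0):
    """Check GVRP feasibility of a collection of routes (node lists)."""
    visited = []
    for r in routes:
        if r[0] != depot or r[-1] != depot:
            return False                       # must start/end at the depot
        customers = r[1:-1]
        if sum(demand[v] for v in customers) > Q:
            return False                       # capacity Q exceeded
        visited += [cluster_of[v] for v in customers]
    clusters = set(cluster_of.values()) - {cluster_of[depot]}
    return sorted(visited) == sorted(clusters)  # each cluster exactly once

def solution_cost(routes, c):
    """Sum the costs of the arcs belonging to the selected tours."""
    return sum(c[i][j] for r in routes for i, j in zip(r, r[1:]))
```

On a toy instance with clusters {1, 2} and {3}, a route 0-1-3-0 is feasible when the demands of nodes 1 and 3 fit within Q, while 0-1-2-0 is not, since it visits cluster {1, 2} twice and misses {3}.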
The post-box collection problem described in Laporte et al. [8] becomes an asymmetric GVRP if more than one vehicle is required. Furthermore, the GVRP is able to model the distribution of goods by sea to a number of customers situated in an archipelago, as in the Philippines, New Zealand, Indonesia, Italy, Greece and Croatia. In this application, a number of potential harbours is selected for every island, and a fleet of ships is required to visit exactly one harbour for every island. Several applications of the GTSP may be extended naturally to the GVRP.

3 Heuristic algorithms for solving the GVRP

Two fundamental goals in computer science are finding algorithms with provably good run times and with provably good or optimal solution quality. A heuristic is an algorithm that abandons one or both of these goals; for example, it usually finds pretty good solutions, but there is no proof that the solutions cannot get arbitrarily bad; or it usually runs reasonably quickly, but there is no argument that this will always be the case. Heuristics are typically used when there is no known method to find an optimal solution under the given constraints (of time, space, etc.), or at all. Several families of heuristic algorithms have been proposed for the classical VRP; see for example Laporte et al. [9]. These can be classified into two main classes: classical heuristics and metaheuristics. Most standard construction and improvement procedures in use belong to the first class. These methods perform a relatively limited exploration of the solution space and generally produce good quality solutions in reasonable computational times.
In what follows we provide three classes of heuristic algorithms for solving the GVRP:
• constructive heuristics: nearest neighbour and a Clarke-Wright based heuristic;
• improvement heuristics: string cross (SC), string exchange (SE), string relocation (SR) and string mix (SM);
• a local-global heuristic.

3.1 Constructive heuristics

Nearest neighbour. Perhaps the most natural heuristic for the GVRP is the famous nearest neighbour algorithm (NN). In this algorithm the rule is always to go next to the nearest as-yet-unvisited customer, subject to the following restrictions: we start from the depot, exactly one vertex (customer) is visited from each cluster, and the sum of all the demands of the current tour (route) does not exceed the capacity Q of the vehicle. If the sum of all the demands of the current tour (route) would exceed the capacity of the vehicle, then we start again from the depot and next visit the nearest customer from a not-yet-visited cluster. When all the clusters are visited, the algorithm terminates. A collection of routes traversing exactly one city from each cluster, in the constructed order, represents the output of the algorithm. The nearest neighbour algorithm is easy to implement and executes quickly, but it can sometimes miss shorter routes due to its greedy nature. The running time of the described nearest neighbour algorithm is O(n^2).

A Clarke-Wright based heuristic algorithm. The Clarke and Wright [2] savings algorithm is perhaps the best known heuristic for the VRP. It applies to problems for which the number of vehicles is a decision variable, and it works for both directed and undirected problems. In the case of the GVRP the algorithm works as follows:
Step 1 (savings computation). For each i ∈ V_l and j ∈ V_p, where l ≠ p and l, p ∈ {1, ..., k}, compute the savings s_ij = c_i0 + c_0j − c_ij. It is obvious that s_ij ≥ 0 and s_ij = s_ji.
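Returning to the nearest neighbour rule described above, one possible implementation can be sketched as follows. The inputs are illustrative assumptions, not the authors' data structures: a cost matrix, a dict mapping each non-depot cluster to its node set, node demands and the capacity Q; demands are assumed not to exceed Q individually.

```python
def nearest_neighbour_gvrp(cost, clusters, demand, Q, depot=0):
    """Greedy GVRP construction: repeatedly move to the cheapest customer
    in a not-yet-visited cluster; return to the depot and open a new
    route whenever the capacity Q would be exceeded."""
    unvisited = {cl: set(nodes) for cl, nodes in clusters.items()}
    routes, route, load, here = [], [depot], 0, depot
    while unvisited:
        # cheapest reachable customer among all remaining clusters
        _, cl, v = min((cost[here][v], cl, v)
                       for cl, nodes in unvisited.items() for v in nodes)
        if load + demand[v] > Q:
            route.append(depot)          # close the current route
            routes.append(route)
            route, load, here = [depot], 0, depot
            continue                     # restart the search from the depot
        route.append(v)
        load += demand[v]
        here = v
        del unvisited[cl]                # the whole cluster is now served
    route.append(depot)
    routes.append(route)
    return routes
```

Note that the sketch follows the rule literally: when the capacity is reached, the search restarts from the depot rather than trying a lower-demand customer from the current position.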
We order the savings in nonincreasing fashion. At the beginning we create k routes, denoted (0, i_l, 0), l ∈ {1, ..., k}, as follows: for each cluster V_l we define c_0i_l = min{c_0j | j ∈ V_l}. There will be as many routes as the number of clusters, and the total distance of the routes is d = c_0i_1 + c_0i_2 + ... + c_0i_k.
Step 2 (route extension). Consider in turn each route (0, i, ..., j, 0). Determine the first saving s_ui or s_jv that can feasibly be used to merge the current route with another route ending with (u, 0) or starting with (0, v), for any u ∈ V_l and v ∈ V_p, where l ≠ p, l, p ∈ {1, ..., k}, and V_l and V_p are clusters not visited by the route (0, i, ..., j, 0). Because at a given moment several feasible route extensions may exist, priority is given to the one that produces the biggest reduction of the total distance of the route. We implement the merge and repeat this operation on the current route. If no feasible merge exists, consider the next route and reapply the same operations. Stop when no route merge is feasible. The Clarke-Wright based algorithm for solving the GVRP is easy to implement and its running time is O(n^2 log n).

3.2 Improvement heuristics

The improvement heuristic algorithms for the GVRP are based on simple route modifications and may operate on each vehicle route taken separately, or on several routes at a time. In the first case, any improvement heuristic for the traveling salesman problem (TSP) can be applied, such as 2-opt, 3-opt, etc. In the second case, procedures that exploit the multi-route structure of the GVRP can be developed. We can see these improvements as a neighbourhood search process, where each route has an associated neighbourhood of adjacent routes.
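For the single-route case just mentioned, the classical 2-opt move (replace two edges of the tour by two others, i.e. reverse the segment between them, whenever this shortens the route) can be sketched as follows. This illustrative version works on one depot-to-depot route with a given cost matrix and keeps the node chosen inside each cluster fixed.

```python
def two_opt(route, cost):
    """2-opt improvement of a single depot-to-depot route: reverse
    route[i:j+1] whenever exchanging edges (i-1, i) and (j, j+1) for
    (i-1, j) and (i, j+1) reduces the total cost."""
    route = list(route)                  # do not mutate the caller's route
    improved = True
    while improved:
        improved = False
        for i in range(1, len(route) - 2):
            for j in range(i + 1, len(route) - 1):
                before = cost[route[i - 1]][route[i]] + cost[route[j]][route[j + 1]]
                after = cost[route[i - 1]][route[j]] + cost[route[i]][route[j + 1]]
                if after < before:
                    route[i:j + 1] = route[i:j + 1][::-1]
                    improved = True
    return route
```

A full GVRP local search would additionally re-optimize the node choice within each cluster after every move, as the paper does for the string operations described next.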
the heuristic algorithms for the gvrp that we are going to describe are based on van breedam's [16] classification of the improvement operations as string cross, string exchange, string relocation and string mix.

a) string cross (sc): two strings of vertices are exchanged by crossing two edges of two different routes.

figure 2: an example of a possible string cross; the left side shows the routes before the exchange of the vertex strings, the right side the routes after the exchange

to keep the figure clear, only the clusters of the exchanged vertex strings are shown. it is important to mention that we investigate all possible connections of the exchanged vertices within the clusters in order to obtain improved routes; as shown in figure 2, the nodes belonging to the marked clusters may be different after the exchange.

b) string exchange (se): two strings of at most r vertices are exchanged between two routes.

figure 3: an example of a possible string exchange

c) string relocation (sr): a string of at most k vertices is moved from one route to another (k = 1 or k = 2).

figure 4: an example of a possible string relocation

d) string mix (sm): consists of selecting the best move between string exchange and string relocation.

3.3 a local-global heuristic for the gvrp

the last heuristic algorithm for solving the gvrp that we are going to describe is based on the local-global approach, and it aims at distinguishing between global connections (connections between clusters) and local connections (connections between nodes from different clusters). as we will see, given a global collection of routes connecting the clusters, it is rather easy to find the corresponding best (w.r.t. cost minimization) solution of the gvrp.
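stepping back to the operations of section 3.2, a single string relocation (sr) move amounts to a simple list manipulation on the routes; the sketch below uses our own names, and capacity feasibility of the resulting routes is deliberately left to the caller.

```python
def string_relocation(routes, src, a, dst, b, length):
    """String relocation (sr): move the string of `length` consecutive
    customers starting at position a of route src into position b of
    route dst.  Returns new routes; checking the capacity q of the
    modified routes is the caller's responsibility in this sketch."""
    new = [r[:] for r in routes]          # copy, keep the originals intact
    string = new[src][a:a + length]
    del new[src][a:a + length]
    new[dst][b:b] = string
    return new
```

a neighbourhood search would enumerate such moves (and the analogous exchanges and crosses), keep only feasible ones, and accept the move with the largest cost reduction.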
there are several generalized collections of routes, i.e. collections of routes containing exactly one node from each cluster, corresponding to a given global collection of routes. among these there exists one, called the best generalized collection of routes (w.r.t. cost minimization), that can be determined either by dynamic programming or by solving a linear integer program. the local-global approach has been applied successfully to other generalized combinatorial optimization problems, such as the generalized minimum spanning tree problem (gmstp) and the generalized traveling salesman problem (gtsp), in order to provide exact exponential time algorithms, strong mixed-integer programming formulations, solution procedures based on these mixed-integer programming formulations and a heuristic algorithm for solving the gmstp, see [?, 12].

let g′ be the graph obtained from g after replacing all nodes of a cluster vi with a supernode representing vi. we will call the graph g′ the global graph. for convenience, we identify vi with the supernode representing it. edges of the graph g′ are defined between each pair of the graph vertices v1, ..., vk. figure 5 presents the collection of generalized routes corresponding to a global collection of routes.

figure 5: example showing a generalized collection of routes corresponding to a global collection of routes

given a global route and a sequence (v0, vk1, ..., vkp) in which the clusters are visited, we want to find the best feasible route r∗ (w.r.t. cost minimization) visiting the clusters according to the given sequence. this can be done in polynomial time, by solving a shortest path problem as described below. we construct a layered network, denoted by ln, having p + 2 layers corresponding to the clusters v0, vk1, ..., vkp and, in addition, a duplicate of the cluster v0; the cluster v0 contains the vertex denoted 0, representing the depot.
the duplicated layer contains the extra node denoted by 0′, a copy of the depot. there is an arc (i, j) for each i ∈ vkl and j ∈ vkl+1 (l = 1, ..., p − 1), having cost cij, and an arc (i, h), i, h ∈ vkl (l = 2, ..., p), having cost cih. moreover, there is an arc (i, 0′) for each i ∈ vkp, having cost ci0′.

figure 6: example showing a route in the constructed layered network ln

we consider paths from 0 to 0′ that visit exactly one node from each of the clusters vk1, vk2, ..., vkp; each such path gives a feasible route. conversely, every route visiting the clusters according to the sequence (v0, vk1, ..., vkp) corresponds to a path in the layered network from 0 to 0′. therefore, the best (w.r.t. cost minimization) route r∗ visiting the clusters in the given sequence can be found by determining a shortest path from 0 to 0′ that visits exactly one node from each of the clusters vk1, vk2, ..., vkp. the overall time complexity of the above procedure is o(m + n log n), where m denotes the number of edges and n the number of nodes. therefore, given a global collection of routes connecting the clusters, we can efficiently find the best corresponding collection of generalized routes. in order to provide global collections of routes we may use any improvement heuristic for the classical vrp.

4 conclusion and future work

the generalized vehicle routing problem (gvrp) is an extension of the vehicle routing problem (vrp) and consists of designing optimal delivery or collection routes, subject to capacity restrictions, from a given depot to a number of predefined, mutually exclusive and exhaustive node-sets (clusters). the gvrp is an np-hard problem and has many interesting real-world applications.
the aim of this paper was to present three classes of heuristic algorithms for the gvrp: constructive heuristics, including nearest neighbour and a clarke-wright based heuristic; improvement heuristics, including string cross (sc), string exchange (se), string relocation (sr) and string mix (sm); and a local-global heuristic. computational results are planned in order to assess the effectiveness of the proposed heuristic algorithms.

acknowledgement. this work was partially supported by cncsis-uefiscsu, project number pnii idei 508/2007.

bibliography

[1] r. baldacci, e. bartolini and g. laporte, some applications of the generalized vehicle routing problem, les cahiers du gerad, g-2008-82, 2008.
[2] g. clarke and j.w. wright, scheduling of vehicles from a central depot to a number of delivery points, operations research, vol. 12, pp. 568-581, 1964.
[3] m. fischetti, j.j. salazar and p. toth, a branch-and-cut algorithm for the symmetric generalized traveling salesman problem, operations research, vol. 45, pp. 378-394, 1997.
[4] g. ghiani and g. improta, an efficient transformation of the generalized vehicle routing problem, european journal of operational research, vol. 122, pp. 11-17, 2000.
[5] i. kara and t. bektas, integer linear programming formulation of the generalized vehicle routing problem, in proc. of the 5th euro/informs joint international meeting, 2003.
[6] i. kara and p.c. pop, new mathematical models of the generalized vehicle routing problem and extensions, in proc. of the international conference on applied mathematical programming and modelling, bratislava, slovakia, may 27-30, 2008.
[7] g. laporte, location-routing problems, in b.l. golden and a.a. assad (eds.), vehicle routing: methods and studies, north-holland, amsterdam, pp. 163-197, 1988.
[8] g. laporte, s. chapleau, p.e. landry and h.
mercure, an algorithm for the design of mail box collection routes in urban areas, transportation research b, vol. 23, pp. 271-280, 1989.
[9] g. laporte, m. gendreau, j-y. potvin and f. semet, classical and modern heuristics for the vehicle routing problem, international transactions in operational research, vol. 7, issue 4-5, pp. 285-300, 2006.
[10] g. nagy and s. salhi, location-routing issues: models and methods, european journal of operational research, vol. 177, pp. 649-672, 2007.
[11] c.m. pintea, d. dumitrescu and p.c. pop, combining heuristics and modifying local information to guide ant-based search, carpathian journal of mathematics, vol. 24, no. 1, pp. 94-103, 2008.
[12] p.c. pop, c. pop sitar, i. zelina and i. tascu, exact algorithms for generalized combinatorial optimization problems, lecture notes in computer science, vol. 4616, pp. 154-162, 2007.
[13] p.c. pop, c.m. pintea, i. zelina and d. dumitrescu, solving the generalized vehicle routing problem with an acs-based algorithm, american institute of physics (aip), conference proceedings: bics 2008, vol. 1117, no. 1, pp. 157-162, 2009.
[14] p.c. pop, efficient transformations of the generalized combinatorial optimization problems into classical variants, proceedings of the 9th balkan conference on operational research, constanta, romania, 2-6 september 2009.
[15] p.c. pop, a survey of different integer programming formulations of the generalized minimum spanning tree problem, carpathian journal of mathematics, vol. 25, no. 1, pp. 104-118, 2009.
[16] a. van breedam, an analysis of the behavior of the heuristics of the vehicle routing problem for a selection of problems, with vehicle-related, customer-related and time-related constraints, ph.d. dissertation, university of antwerp, belgium, 1994.

international journal of computers, communications & control vol. i (2006), no. 3, pp.
61-69

bringing iptv to the market through differentiated service provisioning

cathryn peoples, petre dini, sally mcclean, gerard parr, michaela black

abstract: the world of telecommunications continues to produce radical technologies. offering the benefits of a superior television experience at reduced long-term costs, iptv is the newest offering. deployments, however, are slow to be rolled out; the necessary hardware and software support is not uniformly available. this paper examines the challenges in providing iptv services and the limitations of developments to overcome these challenges. subsequently, a proposal is made which attempts to help solve the challenge of fulfilling real-time multimedia transmissions by provisioning for differentiated services. initial implementations in opnet are documented, and the paper concludes with an outline of future work.

keywords: transport layer, real-time communication, qos, multimedia protocols, reconfigurable stacks, opnet simulation.

1 introduction

it is anticipated that iptv will be enthusiastically accepted [1] [2; cited by 4] by early adopters, those with a passion for technology. however, while the desire for rich multimedia-based information grows, it is necessary to remove any service impairments to maximise the possibility of widespread attraction. iptv, with its high bandwidth and strict quality of service (qos) requirements, will currently not operate at a satisfactory quality in many networks throughout the world. investments and improvements are therefore required, and the potential for iptv to become a killer application will depend on such developments. without an efficient and reliable service, demand for it cannot grow. as with previous internet applications that achieved widespread demand, it is anticipated that iptv will have a similar appeal. the race is now on to make the necessary hardware and software upgrades to ensure an end-user experience of quality.
2 iptv

a report by accenture [3] revealed that there is a lack of consumer awareness of the definition of iptv; even among those who claimed to understand it, interpretations varied widely. in the context of this paper, it is our (the authors') understanding that iptv describes the transmission of traditional television over an ip network. in addition, iptv seeks to build further upon the user experience. the intention is to enable a customised and interactive service [4] [5]: it will be possible to search for on-demand viewing, obtain access to exclusive content, and act on product recommendations. to attract initial customers, it is not thought imperative to provide a service which is fully reliable or which exceeds the current viewing experience. in the early stages, iptv is likely to be used as a supplement to the traditional television service, adopted by those with a passion for new technologies. however, to ensure longevity, it is necessary to provide a service which exceeds the current experience. this demands a service which is uninterrupted, reliable, and ideally offers cost advantages. in addition, availability through minimal operational effort is advantageous in terms of encouraging widespread appeal.

copyright © 2006 by ccc publications

2.1 challenges of transmitting iptv

there are numerous challenges to providing iptv. voice and video are applications with stringent qos requirements, demanding an end-to-end delay of less than 100 milliseconds and a bit error rate (ber) of less than 10^-6. in addition, they require minimal variation in the amount of end-to-end delay; jitter has a negative effect on the ability to stream the service smoothly. indeed, sufficient accumulated jitter can render the communication useless [6].
ensuring high levels of qos is difficult, especially given the increasing popularity of the internet and the resulting pressure on restricted resources. in addition, the increased roll-out of wireless links creates connections which are notoriously error-prone. rejaie et al. (1999) [7] are of the opinion that "the quality of delivered service to real-time operations is neither controllable nor predictable." while this may have been true in 1999, developments are ongoing to remove the occurrence of such limited network performance. the ability to achieve certainty in the reliability and quality of transmissions is required: customers purchase internet packages according to different levels of qos, which guarantee that they will receive a certain level of performance. therefore, the ability to ensure the quality of a real-time service is required.

3 multimedia protocols

numerous multimedia protocols have been designed in recent years to meet the challenges involved in achieving real-time communication. in addition to those developed specifically for multimedia transmissions, protocols with rate-based control mechanisms are also applicable to multimedia transmissions. flow control is an important attribute where applications require a steady rate of throughput and where reliability is required. window-based flow control, as in the case of tcp [8], is inappropriate for multimedia applications: variable file sizes result in transportation through the network at different rates, and setting a retransmission time-out which does not match the time-out appropriate to the media can increase jitter and hence degrade qos. in addition, rate-based transport protocols are considered to be more reliable than window-based approaches. the design of the window-based approach is highly constrained in terms of window size and retransmission time-out, and changes in the external environment can cause the protocol to fail.
rate-based approaches, in contrast, adapt based on real-time, and not pre-defined, information. the broadband application transport system (bats) [9] was proposed in 1994 as a rate-based transport protocol. supporting four classes of service, it makes provisions for a variety of application requirements. it ensures the throughput of class a applications (those which are time-sensitive, connection-oriented, and which require a constant bit rate) by blocking all subsequent communications; this approach is limited, however, in its ability to cope when there is more than one class a transmission. as all connections are blocked while a class a application is transmitting, to ensure sufficient bandwidth, the implication is that a second class a transmission must wait until the first has completed. this approach, therefore, has significant limitations. rt-ring [10] was developed in 2002 to transmit multimedia in real time. it offers three qualities of service and a fairness control mechanism which provisions for traffic with both real-time and non-real-time requirements. however, a limitation also exists with this approach. the principle behind this protocol is that a non-satisfied station (one which has not achieved its real-time transmission) can maintain priority until it is satisfied. what happens, however, if there are multiple stations with the same priority? in large-scale deployments, this approach is likely to be unsuitable. the rate adaptation protocol (rap) [7] was created to provide qos for multimedia applications by adapting the quality of a flow of data. quality is adapted by using layered encoding, and simultaneously delivering a number of layers which can fit inside the available bandwidth. however, despite the benefits of bandwidth provisioning, doubts exist regarding the direct provisions made for the transmission of real-time traffic.
each packet is acknowledged and losses are detected by missing acks. lost data is not retransmitted; instead, the loss information is used to update the rate at which traffic flows. in addition, rap incorporates a feature of adapting the rate of flow depending on the feedback delay. adjusting the rate of flow does not seem a suitable response for video applications, where such actions will result in varying amounts of network jitter. the rate control scheme (rcs) is designed for environments with high error rates and high bandwidth-delay products. it uses dummy packets to perform congestion control and an aimd (additive increase, multiplicative decrease) algorithm to adjust its transmission rate. a flaw exists in this protocol in relation to its approach to flow and congestion control: due to the protocol's inability to perform retransmissions, if congestion occurs between a dummy packet being received at the source and the actual transmission being sent, the data will be lost with no ability to recover it. in addition, it appears as though this protocol could cause the application to experience variable jitter. if a dummy packet is not received, the transmission will cease at least until the next dummy segment is due to be sent. provided sufficient bandwidth exists, the transmission will then continue; however, a gap will have been introduced into the transmission, bringing jitter with it. a qos-guaranteed transport system (qts) [11] provides differentiated qos based on the needs of the application. a bandwidth allocator determines the rate at which data is transported through the network, and the transport protocol module performs rate-based congestion control at a rate which has been determined by the bandwidth allocator. as with bats, the limitation is that there is no mechanism to cope with multiple applications having the same qos requirements. it therefore seems difficult to ensure that applications will receive the required qos.
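the aimd rule mentioned for rcs can be sketched in a few lines. the constants below are illustrative assumptions, not values from the rcs design: the sending rate grows additively while the path appears clear and is cut multiplicatively at any sign of congestion.

```python
def aimd_update(rate, congested, alpha=1.0, beta=0.5, floor=1.0):
    """Additive-increase / multiplicative-decrease rate adjustment:
    grow the sending rate linearly while the path is clear, and scale
    it down by beta (never below floor) on any sign of congestion."""
    if congested:
        return max(floor, rate * beta)
    return rate + alpha
```

the asymmetry (slow linear growth, fast multiplicative backoff) is what lets aimd-style senders probe for bandwidth while reacting quickly to congestion.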
the main criticism of the current protocols which have been developed for multimedia applications is that they do not take network resource information into account when making protocol choices. while the qos requirements of the application are important, there are benefits in also incorporating network information. in this way, the available resources can be maximised for all applications, and not only those with class a (in bats [9] language) requirements.

3.1 reliable transportation from the application or transport layer?

it is appropriate at this stage to discuss the location of transportation reliability within the protocol stack. provisions for transmission reliability are increasingly being incorporated within the application layer. the real-time transport protocol (rtp) [12] is one such example, used in the application layer of the osi (open systems interconnection) [13] protocol stack for the transmission of applications with real-time requirements. cfdp [14], the ccsds [15] file delivery protocol studied by the delay tolerant network research group (dtnrg) [16], is another example, deployed within the ccsds protocol stack [17]. flow and error control functions are combined within the application layer in an attempt to compress the layered approach to solving the communication problem as introduced by the iso (international standards organisation) [18]. however, it is the opinion of the authors that it makes greater sense to retain the traditional layered approach to solving the communication problem, and to have each layer solve an individually complex problem. this explains why the protocols under investigation are located within the transport, and not the application, layer.
4 reconfigurable protocol stacks

the concept of reconfigurable protocol stacks has long been under investigation [19] [20] [21], and it is believed that significant performance improvements can be achieved in the real-time multimedia transmission challenge through the development of this technique. there are obvious benefits to operating a reconfigurable protocol stack in 21st century networks, due to increasing pressure on resources and the need to provide quality of service: a dynamic network architecture allows exactly the right combination of protocols to be chosen for each application. however, reconfigurable protocol stacks have yet to be deployed in today's networks. an obvious concern which arises from the consideration of reconfigurable protocol stacks is the risk of increased processing time when choosing the optimum stack configuration. there is also the risk that incorporating multiple choices into the stack will increase memory requirements, and hence hardware costs. there are several approaches to incorporating reconfigurability into the protocol stack. one approach is to empower the message with the intelligence to decide its path through the stack and the functions necessary in relation to the requirements of the packet [22] [23]. another approach is to group the functions specific to an application at all layers [24] [25]; while there may be multiple functions within each layer, only one from each will be relevant to an application, and the grouping will be determined in advance. the final approach investigated is to build a middleware layer into the stack which makes provisions for adaptability [26] [27]. the approaches presented are diverse and thorough, and many more exist in addition to those examined here [28] [29] [30]. however, each has certain features with specific application to various environments and applications.
as with the multimedia protocols, the main criticism is that the majority of the proposed reconfigurable protocol stacks do not take environmental information into account when making protocol choices, but rely on the needs of the application. as transmissions depend heavily on the environmental constraints in the network, it is important that explicit network information is used in the decision-making process.

5 research proposal

current transport layer protocols are incapable of achieving the stringent qos requirements of real-time multimedia applications. they possess the capability neither to characterise applications nor to adapt performance to maximise the possibility that qos is achieved. such functionalities, however, are thought paramount to the achievement of application qos in tomorrow's networks. the research proposal incorporates application and environmental information at the top of the stack to influence decisions made in the lower layers. in addition, the functionalities which are available in each layer will be increased. currently, within the transport layer, either tcp [8] or the user datagram protocol (udp) [31] will be available, according to the nature of the application. each protocol has a well-defined and contrasting approach to communication. however, network conditions can be extremely variable and dynamic. therefore, our proposal incorporates a greater amount of variability into the transport layer, different protocols being more appropriate in different circumstances. the protocol stack will be evaluated on a hop-by-hop basis. variability of protocol functions will be introduced in terms of the connection orientation of the communication, flow control and the rate of flow control, error control, and the retransmission strategy. the proposed protocol stack is shown in figure 1. environmental information is passed to the protocol stack via a connecting network diagnostic and management system.
the diagnostic system will not be discussed in depth in this paper; it represents future work and may be documented at a later date. several attributes are passed between the protocol stack and the diagnostic system. this is in keeping with bashir et al.'s (2005) [32] opinion that network performance is typically measured using metrics for throughput, connectivity, end-to-end and round-trip packet delay, and packet loss. several of these parameters have been selected to combine environmental and application information, and the subsequent result will be used to select the transport protocol. the formula, a combination of application and environmental information, is shown in equation 1:

result = √( √ber · ( √u / de2e · (db + dt) ) · tos )    (1)

where:

u: link utilisation
de2e: end-to-end delay
db: data drop (buffer overflow)
dt: data drop (retry threshold exceeded)
ber: bit error rate
tos: type of service

figure 1: proposed protocol stack and network management function (the figure lists the real-time contextual information exchanged between the context-aware protocol stack and the network agent & fault diagnosis system, e.g. timestamps, source and destination addresses, ip traffic dropped, background traffic delay, jitter, data drop rates, load, packet delay and delay variation, and tcp delay and retransmission counts)

the type of service (tos) attribute refers to the qos required by an application. the scale used runs from 0 (best effort) through 3 (excellent effort) to 6 (interactive voice).
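equation 1 can be read as a small scoring function. the sketch below is one plausible reading only: the exact nesting of the radicals is difficult to recover from the typeset original, so treat the grouping (and the function name) as our assumption.

```python
import math

def selection_result(ber, u, d_e2e, d_b, d_t, tos):
    """One plausible reading of equation 1: fold the measured link
    state (ber, utilisation, delay, drop counts) and the application's
    tos class into a single score from which a transport protocol is
    then selected.  The radical grouping is our reconstruction."""
    inner = math.sqrt(u) / d_e2e * (d_b + d_t)
    return math.sqrt(math.sqrt(ber) * inner * tos)
```

under this reading, a higher tos class or worse link state raises the score, which a selection table could then map to a protocol with more or less control overhead.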
the qos therefore increases in parallel with the tos value. as an example of how this protocol stack will be used, it is the intention that where the link is under-performing (i.e. low link utilisation, high end-to-end delay, a high number of data drops, and high bers), the transport protocol deployed is one with a reduced amount of overhead. the rationale behind this approach is that, where the network is operating in a sub-optimal manner, a protocol with less control overhead should be used to maximise the opportunity that some of the data transmitted will be received (taking into account that iptv is an application which can cope with some data loss). if a protocol with more stringent error-control requirements were used in such an environment, there is a greater chance that the data will not be received, due to an inability to cope with data loss. it should also be noted that, while the stack shows the complete protocols deployed at each hop when the conditions are evaluated, it is the intention that the separate functionalities of each protocol can be called. for example, it may be decided that the four classes of service from bats should be deployed alongside the qts error-control scheme (error control only for loss-sensitive applications). it is the intention to call the current multimedia protocol functions to maximise their potential and the success of the transmission, given the application requirements and the environmental constraints. the calling of individual error, flow, and retransmission strategies will depend on the individual formulaic conditions designed.

6 implementation

implementation of the research proposal has partially occurred. the functionality documented in this paper is the integration of adaptability in terms of the connection orientation of the communication.
within opnet [33], this has involved integrating environmental information, which had not previously been provisioned for. the environmental information and the evaluation criteria discussed in section 5 have been inserted into the application layer. defined within the function block, the changes are subsequently called from the process model states when spawning the application profiles. dependencies between the layers have also been provisioned for; this has involved integrating the changes made in the application layer within the subsequently called modules. opnet uses a global data structure from which it determines the transport protocol being used for any application; it is referred to several times during the initiation of a communication for the purpose of populating data structures with the transport protocol. one of the functions of this implementation was therefore to remove all reliance on the global transport protocol service, and to ensure that all references were instead made to the intelligently calculated protocol. changes were required in the tpal layer and in the process models generated by the spawning of the application process, the video_calling_manager and gna_profile_mgr models. after the tpal layer, the transport protocol module and the lower layers of the stack are called; as the changes are being made to the choice of transport protocol, all changes must therefore be made before this layer. figure 2 shows the dependencies between the process modules and the span of changes required to achieve implementation at the client. the changes implemented in a top-down fashion between the application and tpal layers have ensured developments at the client. developments must now be made in a bottom-up approach between the same layers to allow incorporation of the same changes at the server.
7 conclusions and future work

the research proposal has been thoroughly investigated, and it is believed that the proposed scheme can have a realistic existence in future networks. reconfigurable protocol stacks have been widely investigated in the literature, but have yet to be deployed. given the dynamism of network conditions, it seems sensible to incorporate flexibility into the protocol stack. conditions in the external environment are pivotal in enabling the transport protocol's operation, and hence the fulfilment of the application's requirements; the incorporation of environmental information into the decision-making process is therefore thought fundamental to providing future service differentiation. the challenge is to design a scheme which is technically and commercially feasible. this research has been conducted during a four-month internship with cisco systems in silicon valley. the protocol stack developments discussed in this paper are also simultaneously being developed for a phd research project. the internship has allowed research to focus specifically on the transmission of multimedia, although the applicability of all application types to this proposal will be considered for the final project. this period has seen the initiation of the task of delving into opnet's code to modify how functions are currently performed. future work will involve incorporating the other techniques with an adaptable status; these include flow control, error control, and the retransmission strategy.
Bringing IPTV to the Market through Differentiated Service Provisioning

Figure 2: Data flows from application to TPAL layers (process model states of the gna_clsvr_mgr, gna_profile_mgr, gna_video_calling_mgr and tpal_v3 modules)

References

[1] Multimedia Research Group, "IP TV Global Forecast 2006-2009: Semiannual IP TV Global Forecast", March 2006.
[2] Gartner Group, "IPTV Prediction", Gartner Group Research Report, Winter 2005.
[3] Zadeh, A.; Douglass, G. K.; Dogra, R.; "Communications and High Tech: "Infinite Possibilities" Television?", Accenture, May 2006. Available at: http://www.accenture.com/global/research_and_insights/outlook/by_issue/y2006/infinitetelevision.htm
[4] Shin, D. H.; "Potential User Factors Driving Adoption of IPTV. What Are Customers Expecting from IPTV?", Technological Forecasting and Social Change, 2006.
[5] Harris, A.; "White Paper of Enabling IPTV: What Carriers Need to Know to Succeed", May 2005, IDC. Available at: http://www.emc.com/analyst/pdf/idc_iptv_whitepaper_jun_9_05.pdf
[6] Chan, S.-P.; Kok, C.-W.; Wong, A. K.; "Multimedia Streaming Gateway with Jitter Detection", IEEE Transactions on Multimedia, Volume 7, Number 3, June 2005, pages 585-592.
[7] Rejaie, R.; Handley, M.; Estrin, D.; "RAP: An End-to-End Rate-Based Congestion Control Mechanism for Realtime Streams in the Internet", Proceedings of the Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies, Volume 3, 21-25 March 1999, pages 1337-1345.
[8] Postel, J.; "Transmission Control Protocol", Request for Comments 793, September 1981.
[9] Yuang, M. C.; Liu, J. C.; Shay, C. L.; "BATS: A High-Performance Transport System for Broadband Applications", IEEE Proceedings of the 19th Conference on Local Computer Networks, 2-5 October 1994, pages 448-455.
[10] Conti, M.; Donatiello, L.; Furini, M.; "Design and Analysis of RT-Ring: A Protocol for Supporting Real-Time Communications", IEEE Transactions on Industrial Electronics, Volume 49, Issue 6, December 2002, pages 1214-1226.
[11] Yuang, M. C.; Liu, J. C.; "QTS: A QoS-Guaranteed Transport System for Broadband Multimedia Communications", IEEE Transactions on Industrial Electronics, Volume 45, Number 1, February 1998, pages 69-77.
[12] Schulzrinne, H.; Casner, S.; Frederick, R.; Jacobson, V.; "RTP: A Transport Protocol for Real-Time Applications", Network Working Group, Request for Comments 3550, July 2003.
[13] International Organisation for Standardisation, "Open Systems Interconnection - Basic Reference Model: The Basic Model", ISO/IEC 7498-1:1994. Available at: http://isotec.iso.org/livelink/livelink/fetch/2000/2489/lttf_home/publiclyavailablestandards.htm
[14] Consultative Committee for Space Data Systems, "CCSDS File Delivery Protocol (CFDP): Recommendation for Space Data System Standards", Blue Book, October 2002.
[15] Consultative Committee for Space Data Systems, available at: www.ccsds.org
[16] Delay Tolerant Network Research Group, available at: www.dtnrg.org
[17] Consultative Committee for Space Data Systems, "Overview of Space Link Protocols", CCSDS 130.0-G-1, Green Book, June 2001.
[18] International Standards Organisation, available at: www.iso.org
[19] Curran, K.; Parr, G.; "The Use of Dynamically Reconfigurable Protocol Stacks for Streaming Multimedia to Mobile Devices", Proceedings of the 8th International Conference on Communication Systems, 25-28 November 2002, Volume 2, pages 947-951.
[20] Johnny, T. F.; "Adaptive Protocol Suite for Next Generation Wireless Internet", Proceedings of the IEEE 2005 International Conference on Personal Wireless Communications, 23-25 January 2005, pages 509-513.
[21] Akyildiz, I.; Altunbasak, Y.; Fekri, F.; Sivakumar, R.; "AdaptNet: An Adaptive Protocol Suite for the Next-Generation Wireless Internet", IEEE Communications Magazine, March 2004, pages 128-136.
[22] O'Malley, S. W.; Peterson, L. L.; "A Dynamic Network Architecture", ACM Transactions on Computer Systems, Volume 10, Number 2, May 1992, pages 110-143.
[23] Schöler, T.; Müller-Schloer, C.; "Design, Implementation and Validation of a Generic and Reconfigurable Protocol Stack Framework for Mobile Terminals", Proceedings of the IEEE 24th International Conference on Distributed Computing Systems Workshops, 2004.
[24] Haas, Z.; "A Protocol Structure for High-Speed Communication over Broadband ISDN", IEEE Network Magazine, January 1991, pages 64-70.
[25] Zitterbart, M.; Stiller, B.; Tantawy, A. N.; "A Model for Flexible High-Performance Communication Subsystems", IEEE Journal on Selected Areas in Communications, Volume 11, Number 4, May 1993, pages 507-518.
[26] Curran, K.; Parr, G.; "The Use of Dynamically Reconfigurable Protocol Stacks for Streaming Multimedia to Mobile Devices", Proceedings of the 8th International Conference on Communication Systems, 25-28 November 2002, Volume 2, pages 947-951.
[27] Zhang, C. H.; Chin, T. K.; Koh, K. Y.; Ong, G. M.; Suthon, S.; Peng, C. H.; Pung, H. K.; "Octopus: An Adaptive Group Communication Environment for Multimedia Application", The IEEE 10th International Conference on Networks, 27-30 August 2002, pages 145-150.
[28] Akyildiz, I.; Altunbasak, Y.; Fekri, F.; Sivakumar, R.; "AdaptNet: An Adaptive Protocol Suite for the Next-Generation Wireless Internet", IEEE Communications Magazine, March 2004, pages 128-136.
[29] An, L.; Pung, H. K.; Zhou, L.; "Design and Implementation of a Dynamic Protocol Framework", Computer Communications 29, 2006, pages 1309-1315.
[30] Guan, S.-U.; Lim, S.-S.; "Modeling Adaptable Multimedia and Self-Modifying Protocol Execution", Future Generation Computer Systems 20, 2004, pages 123-143.
[31] Postel, J.; "User Datagram Protocol", Request for Comments 768, August 1980.
[32] Bashir, O.; Parish, D.; Sandford, M.; Phillips, I.; "Optimising Data Processing in Network Performance Monitoring Systems", Proceedings of the IEE Communications, Volume 152, Issue 5, 7 October 2005, pages 633-642.
[33] OPNET, available at: www.opnet.com

Cathryn Peoples, Gerard Parr, Sally McClean, Michaela Black
University of Ulster
School of Computing and Information Engineering
Coleraine, Northern Ireland BT52 1SA
E-mail: {si.mcclean,gp.parr,mm.black}@ulster.ac.uk

Petre Dini
Cisco
170 West Tasman Drive
San Jose, CA 95134, USA
E-mail: pdini@cisco.com

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 1, pp. 6-16

Incremental Improvement of the Evaluation Algorithm in the Concept Map Based Knowledge Assessment System

Alla Anohina, Marks Vilkelis, Romans Lukasenko
Riga Technical University, Department of Systems Theory and Design
Kalku Street 1, Riga, Latvia, LV-1658
E-mail: alla.anohina@rtu.lv, markvilkel@inbox.lv, lrexpress@inbox.lv

Abstract: The paper is devoted to the knowledge assessment system that has been developed at the Department of Systems Theory and Design of Riga Technical University over the last four years. The system is based on concept maps, which display the knowledge structure of a particular learner in the form of a graph. The teacher's concept maps serve as a standard against which learners' concept maps are compared.
However, it is not correct to compare teacher's and learners' concept maps by examining the exact equivalence of relationships in both maps, because people construct knowledge in different ways. Thus, an appropriate mechanism is needed for the flexible evaluation of learners' concept maps. The paper describes the algorithm implemented in the concept map based knowledge assessment system and its evolution through four prototypes of the system.
Keywords: knowledge assessment system, concept maps, evaluation algorithm

1 Introduction

The rapid development of information and communication technologies and the availability of a huge amount of electronic information resources have led to changes in the roles of the main actors of the educational process, namely the teacher and the learner. Nowadays teachers should guide learners through the learning process by advising them and providing the necessary stimuli, while learners search for information in different places, inter alia in e-learning environments (for example, [1, 2]), and turn it into knowledge. Special methods are needed for the evaluation of learners' knowledge structures created in such conditions, and one of them is concept maps, which display the knowledge structure of a particular learner in the form of a graph. The Department of Systems Theory and Design of the Faculty of Computer Science and Information Technology of Riga Technical University has been developing the concept map based knowledge assessment system since 2005. The system has twofold goals in the context of the integration of technology into the traditional educational process: 1) to promote learners' knowledge self-assessment, and 2) to support the teacher in the improvement of the learning course through systematic assessment of learners' knowledge and analysis of its results. These goals are reached by the use of concept maps as an assessment tool.
Three prototypes of the system have already been implemented, and the fourth one is under development at the moment [3]. The paper is a logical continuation of the description of the system's working principles presented in [4], focusing on a different aspect of its functionality, namely the algorithm for the comparison of learners' and teacher's concept maps and its incremental improvement through the system's prototypes. The paper is organized as follows. Section 2 gives an overview of the system. Section 3 discusses related work concerning scoring systems for concept maps. The evolution of the algorithm for the comparison of learners' and teacher's concept maps is described in Section 4. Finally, conclusions are presented.

Copyright © 2006-2009 by CCC Publications

2 Overview of the System

As mentioned in the introduction, concept maps are used as an assessment tool in the developed knowledge assessment system. According to [5], they can foster the learning of well-integrated structural knowledge as opposed to the memorization of fragmentary, unintegrated facts, and externalize the conceptual knowledge (both correct and erroneous) that learners hold in a knowledge domain. A concept map is a graph with labeled nodes corresponding to concepts in a problem domain and with arcs indicating relationships between pairs of concepts. Arcs can be directed or undirected, with or without linking phrases on them; a linking phrase specifies the kind of relationship between concepts. Concept map based tasks can be divided into 1) "fill-in-the-map" tasks, where the structure of a concept map is given to the learner and he/she must fill it using the provided set of concepts and/or linking phrases, and 2) "construct-a-map" tasks, where the learner must decide on the structure of the concept map by him/herself.
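A concept map as defined above is a labeled graph. A minimal sketch of such a representation and of an exact-match comparison between a teacher's and a learner's map; the class and field names are our own illustration, not the system's code:

```python
# Minimal sketch of a concept map as a labeled graph. Names are
# illustrative, not taken from the described system.

class ConceptMap:
    def __init__(self):
        self.concepts = set()          # node labels
        self.relationships = {}        # frozenset({a, b}) -> linking phrase

    def relate(self, a, b, phrase=None):
        # Undirected arc between two concepts, optionally labeled.
        self.concepts.update((a, b))
        self.relationships[frozenset((a, b))] = phrase

teacher = ConceptMap()
teacher.relate("Graph", "Node", "consists of")
teacher.relate("Graph", "Arc", "consists of")

learner = ConceptMap()
learner.relate("Node", "Graph", "consists of")   # same pair, other order

# Exact-match comparison: concept pairs present in both maps.
common = teacher.relationships.keys() & learner.relationships.keys()
print(len(common))  # 1
```

Storing each relationship as a frozenset of its two endpoints makes the comparison insensitive to the order in which the learner named the concepts, which matters for undirected arcs.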
The developed concept map based knowledge assessment system is used in the following way. The teacher defines the stages of knowledge assessment and creates concept maps for all of them by specifying relevant concepts and relationships among them, in such a way that the concept map of each stage is nothing other than an extension of the previous one. Thus, the concept map of the last stage includes all concepts and relationships among them. The teacher's concept maps serve as a standard against which the learners' concept maps are compared. During knowledge assessment the learner solves a concept map based task corresponding to the assessment stage. After the learner has submitted his/her solution, the system compares the concept maps of the learner and the teacher, calculates the score of the learner's result and generates feedback. The system offers five concept map based tasks, ranged from the easiest to the most difficult (Figure 1) [6]. Eight transitions between tasks are implemented, allowing the learner to find the task most suitable for his/her knowledge level. Four transitions increase the degree of task difficulty. They are carried out, after the analysis of the learner's solution, if the learner has reached the teacher's specified number of points in the current assessment stage without reducing the degree of difficulty of the original task; this is the system's adaptive reaction to the learner's behavior. The other four transitions reduce the degree of task difficulty, and they are carried out at the voluntary request of the learner while solving the task. Thus, the system supports knowledge self-assessment, as it analyzes and evaluates learners' concept maps and provides feedback about the learner's errors. It promotes systematic knowledge assessment because it allows the extension of the initially created concept map over the assessment stages.
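The transition scheme just described (upward transitions triggered by reaching the teacher's point threshold without having reduced difficulty, downward transitions at the learner's request) can be sketched as follows; the function and its parameters are our own illustration, not the system's code:

```python
# Illustrative sketch of the adaptive transitions between the five tasks
# (1 = easiest, 5 = most difficult). Not the system's actual code.

def next_task(current, points, threshold, reduced_earlier, wants_easier):
    """Pick the next task number (1..5) after an assessment stage.

    wants_easier    -- learner voluntarily asked for an easier task
    reduced_earlier -- learner already reduced difficulty in this stage,
                       which disqualifies an upward transition
    """
    if wants_easier and current > 1:
        return current - 1          # downward, on the learner's request
    if points >= threshold and not reduced_earlier and current < 5:
        return current + 1          # upward, the system's adaptive reaction
    return current

print(next_task(2, points=18, threshold=15,
                reduced_earlier=False, wants_easier=False))  # 3
```

With four upward and four downward moves available across the five tasks, the eight transitions mentioned in the text fall out of this scheme.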
Moreover, statistical information about the differences between learners' concept maps and the teacher's concept map is collected, providing opportunities for the teacher to improve the learning course [4]. The system is implemented as a web-based three-tier client-server application [4, 7] consisting of the following architectural layers (Figure 2): 1) a data storage layer represented by a database management system (DBMS); 2) an application logic layer composed of the application server and the server-side code running on it (a special persistence and query framework is used to communicate with the DBMS); and 3) the representation layer, or graphical user interface. As shown in Figure 2, the concept map based knowledge assessment system can be divided into three logical domains: administrator, teacher and student. Each domain has its own goal, but they are strictly linked together. The functionality of each domain can be used by one of three user roles whose names correspond to the names of the domains. An administrator is responsible for the administration and maintenance of the whole system, using such functions as input, editing and deletion of data about users (teachers and students), courses and student groups. The teacher domain provides all necessary functions for the creation of concept maps and the definition of their attributes, as well as for the viewing of learners' results. The functionality of the student domain includes everything related to the completion of concept map based tasks by learners and the provision of feedback after the completion of a task.

Figure 1: Tasks offered in the concept map based knowledge assessment system
Figure 2: The three-tier architecture of the system

3 Related Works

There is a great deal of research related to scoring systems for concept maps.
In [14] three main approaches are highlighted: evaluation of the components of a concept map, comparison with an expert concept map, or a combination of both. Novak and Gowin [11] have offered a scheme in which different numbers of points are assigned to such components of a concept map as propositions, levels of hierarchy, cross-links and specific examples of concepts. In [13] the previously described set of components is extended by the number of branchings in the concept map. For the comparison of the learner's concept map with the expert map, Goldsmith [8] has offered "the closeness index", indicating the degree of similarity between the expert's and the learner's concept maps. In turn, Herl and colleagues [9] use a matching algorithm that involves several expert maps in evaluating each learner's concept map. In their work two evaluation indicators are calculated: (a) a stringent semantic content score, based on exact relationship matches between the learner's and expert's concept maps, and (b) a categorized semantic content score, when a learner-defined relationship matches some set of possible relationships in the expert concept map. In [10] concept maps are scored by comparing each learner's concept map with two expert maps. The learner receives half a point for each relationship that matches one expert's relationship and a full point if the relationship matches both experts' relationships. Learners also receive additional points for relationships that are more valued by experts ("critical relationships") and relationships that are less like what experts might include.
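The two-expert scheme of [10] reduces to a simple membership test per relationship. A sketch, treating each relationship as an unordered pair of concepts and omitting the additional points for critical relationships:

```python
# Sketch of the two-expert scoring described in [10]: half a point per
# relationship matching one expert map, a full point if it matches both.

def two_expert_score(learner, expert1, expert2):
    score = 0.0
    for rel in learner:
        matches = (rel in expert1) + (rel in expert2)
        if matches == 2:
            score += 1.0
        elif matches == 1:
            score += 0.5
    return score

e1 = {frozenset(p) for p in [("A", "B"), ("B", "C")]}
e2 = {frozenset(p) for p in [("A", "B"), ("C", "D")]}
learner = {frozenset(p) for p in [("A", "B"), ("B", "C"), ("A", "D")]}
print(two_expert_score(learner, e1, e2))  # 1.5
```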
Both the evaluation of the components of concept maps and the comparison with an expert concept map are combined in [12], where three different approaches to the scoring of concept maps are used: (a) a total score, defined as the number of valid learner's relationships; (b) a congruence score, defined as the proportion of valid learner's relationships to all relationships in the expert map; and (c) a salience score, defined as the proportion of valid learner's relationships to all relationships in the learner's concept map. Regardless of the diversity of schemes for the scoring of concept maps, it is necessary to note that they are basically developed for the evaluation of tasks in which learners must create their own concept maps; the question of the evaluation of fill-in-the-map tasks thus remains open. Moreover, the greater part of the offered schemes have not been considered in the context of computer-assisted assessment systems, so it is very difficult to evaluate whether they are feasible and useful in the concept map based knowledge assessment system. These factors motivated the authors of the paper to develop an algorithm suitable for the evaluation of both fill-in-the-map and construct-a-map tasks in the concept map based knowledge assessment system.

4 Evaluation Algorithm

The developed algorithm is sensitive to the arrangement and coherence of concepts in the learners' concept maps. It is based on the assumption that the learner's understanding of the presence of a relationship between concepts has primary value, while other aspects, such as the type of the relationship, the linking phrase, the direction of the arc and the places of concepts, are secondary. The algorithm is capable of recognizing different patterns in the learner's solution.
Two kinds of relationships are used in the concept maps: 1) important relationships, which represent knowledge considered important in the learning course, and 2) less important relationships, which specify desirable knowledge. For each correctly provided important relationship the learner receives 5 points, while for each less important relationship only 2 points are assigned. In the first system prototype, developed in 2005, only one fill-in-the-map task was offered to learners. All learners received the same structure of a concept map and a list of concepts, and had to insert the provided concepts into the correct nodes of the concept map structure. Arcs were undirected and had no semantics. Taking the value of a completely correct relationship as 100%, the following contributions of its constituent parts were defined: the presence of the relationship in the learner's concept map, 50% (the fact that the learner understands that a relationship exists between the concepts has primary value); the correct type of the relationship, 30% (the learner should be able to distinguish what is important and what is less important in the learning course); both concepts related by the relationship placed in the correct places, 20% (this factor has the greatest subjectivity). Thus, the patterns of the learner's solution which the algorithm was capable of recognizing were the following [15, 16]:
• Pattern 1. The learner has interrelated concepts in the same way as they are related in the teacher's concept map. In this case the learner receives the maximum score for that type of relationship. Figure 3.b shows that the concepts A and E in the learner's concept map are interrelated in the same way as in the teacher's concept map (Figure 3.a).
• Pattern 2. The learner has defined a relationship that does not exist in the concept map of the teacher.
In this case he/she does not receive any points. Figure 3.c demonstrates that the learner has made a relationship between the concepts A and H, which are not interrelated in the teacher's concept map (Figure 3.a).
• Pattern 3. The learner's defined relationship exists in the teacher's map, the type of the relationship is correct, but at least one of the concepts is placed in an incorrect place. The learner receives 80% of the maximum score for the correct relationship of this type. Figure 3.d shows that the learner has interrelated the concepts B and D. A similar relationship exists in the teacher's concept map (Figure 3.a); however, both concepts are located in incorrect places, although the type of the relationship is correct.
• Pattern 4. The learner's defined relationship exists in the teacher's map, the type of the relationship is wrong, and at least one of the concepts is placed in an incorrect place. The learner receives 50% of the maximum score for the correct relationship of this type. This pattern is displayed in Figure 3.e: comparing the learner's defined relationship between the concepts A and F with the same teacher's relationship (Figure 3.a), one can notice that the concept F is located in an incorrect place, and that a less important relationship is given instead of an important one.
• Pattern 5. A concept is placed in a wrong place, but its place is not important. The learner receives the maximum score for the corresponding relationship. Figure 3.f demonstrates that the learner has exchanged the places of the concepts M and L.

Figure 3: Patterns of the learner's solution that the first system prototype was able to recognize: a) the teacher's concept map; b)-f) the patterns in the learner-created concept map

New tasks (three fill-in-the-map and two construct-a-map tasks, Figure 1) were introduced in the second system prototype, developed in 2006.
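In each prototype the pattern percentages follow additively from the component weights: a relationship earns the sum of the weights of the components the learner got right, scaled by the relationship's maximum points. A sketch with the first prototype's weights (presence 50%, type 30%, placement 20%); the function and dictionary names are our own:

```python
# Sketch: a learner relationship earns the sum of the weights of its
# correct components. The weights below are the first prototype's.

WEIGHTS_V1 = {"presence": 0.5, "type": 0.3, "placement": 0.2}

def relationship_score(correct_components, weights, max_points):
    """max_points is 5 for an important, 2 for a less important relationship."""
    return max_points * sum(weights[c] for c in correct_components)

# Pattern 3: relationship present, type correct, a concept misplaced -> 80%.
print(relationship_score({"presence", "type"}, WEIGHTS_V1, 5))  # 4.0
# Pattern 4: relationship present, wrong type, misplaced concept -> 50%.
print(relationship_score({"presence"}, WEIGHTS_V1, 5))          # 2.5
```

The same function reproduces the later prototypes' percentages once their weight tables (e.g. presence 40%, linking phrase 30%, type 20%, placement 10% in the second prototype) are substituted.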
Moreover, linking phrases were added to the concept maps, causing a modification of the points received for relationships and an extension of the set of patterns recognized by the system. Taking the value of a fully correct relationship as 100%, the following contributions of its constituent parts were defined: the presence of the relationship in the learner's concept map, 40%; a correct linking phrase provided for the relationship, 30% (the semantics of relationships are important knowledge units); a correct type of the relationship, 20%; both concepts related by the relationship placed in the correct places, 10%. The extended set of patterns of the learner's solution was therefore the following [6]:
• Pattern 1. The learner has defined a completely correct relationship. In this case the learner receives the maximum score for that type of relationship. Figure 4.b demonstrates that the concepts A and E in the learner's concept map are interrelated in the same way as in the teacher's concept map (Figure 4.a).
• Pattern 2. The learner has defined a relationship that does not exist in the concept map of the teacher, and he/she does not receive any points. Figure 4.c shows that the learner has related the concepts A and H, which are not related in the teacher's map (Figure 4.a).
• Pattern 3. The learner's defined relationship exists in the teacher's map, both the type of the relationship and the linking phrase are correct, but at least one of the concepts is located in an incorrect place. The learner receives 90% of the maximum score for that relationship. This pattern is displayed in Figure 4.d: the learner has defined the relationship between the concepts B and D, providing the correct type of relationship and linking phrase.
However, both concepts are located in different places as compared to the teacher's concept map (Figure 4.a).
• Pattern 4. The learner's defined relationship exists in the teacher's map, but the type of the relationship is incorrect. The learner receives 80% of the maximum score for the correct relationship. This pattern is valid only for construct-a-map tasks (tasks 4 and 5 in Figure 1), where the places of concepts are not important. Assuming that Figure 4.e displays the learner's created concept map, one can see that the learner has interrelated the concepts B and C, defining the correct linking phrase but providing an important relationship instead of a less important one, as it is in the teacher's map (Figure 4.a).
• Pattern 5. The learner's defined relationship exists in the teacher's map, but the linking phrase is incorrect. The learner receives 70% of the maximum score for the correct relationship. Figure 4.f shows that the learner has located the concepts A and K in the same places as in the teacher's concept map (Figure 4.a), but the linking phrase differs.
• Pattern 6. The learner's defined relationship exists in the teacher's map, the type of the relationship is incorrect, and at least one of the concepts is placed in an incorrect place. The learner receives 70% of the maximum score. Figure 4.g shows that the learner has related the concepts A and F using a less important relationship instead of an important one (Figure 4.a) and has located the concept F in another place.
• Pattern 7. The learner's defined relationship exists in the teacher's map, the linking phrase is incorrect, and at least one of the concepts is located in an incorrect place. The learner receives 60% of the maximum score.
This pattern is displayed in Figure 4.h, where one can see that the learner has defined the relationship between the concepts G and F but has given a linking phrase that differs from the one in the teacher's concept map (Figure 4.a), as well as locating both concepts incorrectly.
• Pattern 8. The learner's defined relationship exists in the teacher's map, but both the type of the relationship and the linking phrase are incorrect. The learner receives 50% of the maximum score for the correct relationship. This pattern is valid only for construct-a-map tasks (tasks 4 and 5 in Figure 1), where the places of concepts are not important. Assuming that Figure 4.i displays the learner's created concept map, one can see that the learner has interrelated the concepts O and B, which are also interrelated in the teacher's concept map (Figure 4.a), but has provided an incorrect type of relationship (an important relationship instead of a less important one) and an incorrect linking phrase.

Figure 4: Patterns of the learner's solution that the second system prototype was able to recognize: a) the teacher's concept map; b)-j) the patterns in the learner-created concept map (numbers on links display different relationships and represent linking phrases)

• Pattern 9. The learner's defined relationship exists in the teacher's map, both the type of the relationship and the linking phrase are incorrect, and in addition at least one of the concepts is placed in an incorrect place. The learner receives 40% of the maximum score for that relationship. This pattern is shown in Figure 4.j.
Comparing the learner's defined relationship between the concepts J and N with the same relationship in the teacher's concept map (Figure 4.a), one can see that both concepts are located in incorrect places, and that the type of the relationship and the linking phrase are incorrect.
In the third prototype, implemented in 2007, directed arcs were introduced, causing the following modification of the points received for relationships: the presence of the relationship in the learner's concept map, 40%; a correct linking phrase provided for the relationship, 30%; a correct direction of the arc corresponding to the relationship, 15%; a correct type of the relationship, 10%; both concepts related by the relationship placed in the correct places, 10%. In total, 36 patterns were acquired for all tasks; they are summarized in Figure 5. No modification of the points has been made in the fourth system prototype, so the evaluation algorithm remains based on the set of patterns specified in Figure 5. However, improvement of the algorithm continues with a study of the possibility of revealing extra relationships in learners' concept maps [17]. Figure 6 displays a situation in which some relationships are "hidden": there are only 3 relationships in the teacher's concept map (Figure 6.a), but 2 more relationships can be derived (Figure 6.b). These derived relationships are also correct and could appear in learners' concept maps, so it is necessary to define a mechanism by which the system could detect extra relationships and thus make assessment more flexible and automated.
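One way to detect such derived relationships is to repeat the derivation until nothing new is added. The simplest case is a transitive relation such as "is-a"; the following is our own illustration of that fixpoint idea, not the mechanism of [17], which covers several relation types and their allowed combinations:

```python
# Illustrative fixpoint derivation of extra relationships, restricted to
# transitive "is-a" chains.

def derive_is_a(relations):
    """relations: set of (sub, sup) "is-a" pairs. Returns the closure."""
    derived = set(relations)
    while True:
        new = {(a, d)
               for (a, b) in derived
               for (c, d) in derived
               if b == c and a != d} - derived
        if not new:            # stop: the last iteration added nothing
            return derived
        derived |= new

explicit = {("Sparrow", "Bird"), ("Bird", "Animal")}
closure = derive_is_a(explicit)
# The hidden relationship ("Sparrow", "Animal") has been added.
print(sorted(closure))
```

A learner who draws the derived arc directly is then matched against the closure rather than against the teacher's explicit relationships alone.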
To determine extra relationships, 6 types of relationships are considered [17]: is-a, a relationship meaning that one concept is a sub-class of the other; part-of, a relationship meaning that one concept is a part of the other; attribute, a relationship between a concept and its attribute; example, a relationship between a general concept and a particular example of it; value, a relationship between an attribute and its value; and kind-of, a relationship between levels of hierarchy. Figure 7 shows the structure of a pattern, further explained in Figure 8. The pattern has two main relationships (Relation 1 and Relation 2) of the types mentioned previously. In some cases a combination of relationships is not allowed; in other cases an extra relationship (Relation 3) can be formed. The column "Combination allowed" identifies whether the combination of Relation 1 and Relation 2 is allowed. The entry "cannot be specified" in the column "Relation 3" indicates that there can be situations in which an extra relationship can be added, but not always; in turn, the entry "no extra relationship" in the same column points out that no additional relationship of the considered 6 types can be added. Additional relationships that can be derived from more than three concepts and two relationships between them can be revealed as well. In such cases the algorithm must iteratively go through the concept map, searching for patterns and adding extra relationships whenever possible; the algorithm stops when no new relationship has been added during the last iteration.

5 Conclusions

The use of concept maps for the evaluation of learners' knowledge structures demands an appropriate mechanism for the comparison of learners' concept maps with the teacher's one. The mechanism should consider that people construct knowledge in different ways, and that there can be differences between concept maps that are not indicators of incorrect knowledge.
The paper presents an algorithm capable of recognizing several patterns in the learner's solution, on the basis of such criteria as the location of concepts, the types of relationships, the direction of arcs and the correctness of linking phrases, allowing the learner to receive some points when his/her relationships match the teacher's only partly. Moreover, the detection of extra relationships that can appear in learners' concept maps makes assessment more automated.

Figure 5: The extended set of patterns of the learner's solution in the third system prototype
Figure 6: Hidden relationships (adopted from [17])
Figure 7: Structure of the pattern (adopted from [17])
Figure 8: Patterns containing three concepts and two relationships (adopted from [17])

Bibliography

[1] A. Styliadis, I. Karamitsos, D. Zachariou, Personalized e-learning implementation - the GIS case, International Journal of Computers, Communications & Control, Vol. 1, No. 1, pp. 59-67, 2006.
[2] G. Moise, A software system for online learning applied in the field of computer science, International Journal of Computers, Communications & Control, Vol. 2, No. 1, pp. 84-93, 2007.
[3] J. Grundspenkis, A. Anohina, Evolution of the concept map based adaptive knowledge assessment system: implementation and evaluation results, Scientific Proceedings of Riga Technical University, RTU Publishing, Riga, 2008 (submitted).
[4] M. Vilkelis, A. Anohina, R. Lukashenko, Architecture and working principles of the concept map based knowledge assessment system, Proceedings of the 3rd International Conference on Virtual Learning, October 31 - November 2, Constanta, Romania, pp. 81-90, 2008.
[5] A. J. Cañas, A summary of literature pertaining to the use of concept mapping techniques and technologies for education and performance support, Technical Report, 2003.
[6] A. Anohina, D. Pozdnakovs, J.
grundspenkis, changing the degree of task difficulty in concept map based assessment system, proceedings of the iadis international conference "e-learning 2007", july 6-8, lisbon, portugal, pp. 443-450, 2007.
[7] r. lukashenko, m. vilkelis, a. anohina, deciding on the architecture of the concept map based knowledge assessment system, proceedings of the international conference on computer systems and technologies, june 12-13, gabrovo, bulgaria, pp. v.3-1 - v.3-6, 2008.
[8] t. e. goldsmith, p. j. johnson, w. h. acton, assessing structural knowledge, journal of educational psychology, vol. 83, no. 1, pp. 88-96, 1991.
[9] h. herl, e. baker, d. niemi, construct validation of an approach to modelling cognitive structure of u.s. history knowledge, journal of educational research, no. 89, pp. 206-219, 1996.
[10] d. c. d. klein, g. k. w. k. chung, e. osmundson, et al., examining the validity of knowledge mapping as a measure of elementary students' scientific understanding, cse technical report no. 557, 2002.
[11] j. d. novak, d. b. gowin, learning how to learn, london, 1984.
[12] m. a. ruiz-primo, r. j. shavelson, s. e. schultz, on the validity of concept map-based assessment interpretations: an experiment testing the assumption of hierarchical concept maps in science, cse technical report 455, 1997.
[13] j. d. wallace, j. j. mintzes, the concept map as a research tool: exploring conceptual change in biology, journal of research in science teaching, vol. 27, pp. 1033-1052, 1990.
[14] m. a. ruiz-primo, r. j. shavelson, problems and issues in the use of concept maps in science assessment, journal of research in science teaching, no. 6, pp. 569-600, 1996.
[15] a. anohina, j. grundspenkis, prototype of multiagent knowledge assessment system for support of process oriented learning, proceedings of the 7th international baltic conference on databases and information systems, july 3-6, vilnius, lithuania, pp. 211-219, 2006.
[16] a.
anohina, v. graudina, j. grundspenkis, using concept maps in adaptive knowledge assessment, advances in information systems development "new methods and practice for the networked society", springer, pp. 469-480, 2006.
[17] j. grundspenkis, m. strautmane, usage of graph patterns for knowledge assessment based on concept maps, scientific proceedings of riga technical university, rtu publishing, riga, 2008 (submitted).
alla anohina is a lecturer at the department of systems theory and design of riga technical university in latvia. she obtained her doctoral degree in computer systems from riga technical university in 2007. her main research fields are intelligent tutoring systems, computer-assisted assessment systems and artificial intelligence. she has seven years' experience of teaching in the field of computer science, both at riga technical university and at other educational institutions of latvia. she has participated in several research projects related to the development of educational software. marks vilkelis is currently working on the master's thesis "the knowledge assessment system as a client-server java application" at the institute of information technology of riga technical university. in 2005 he obtained the bachelor's degree from the same institution. his scientific interests cover java open source technologies and applications of artificial intelligence. he has experience of working as a programmer in several research projects. his professional activities are related to the programming of complex business applications in an international software development company. romans lukasenko is a phd student in computer systems at riga technical university in latvia. he obtained the master's degree in computer systems from the same institution in 2006. his main research fields are intelligent tutoring systems, computer-assisted assessment systems and student modelling.
his thesis is related to the implementation of a student model in the knowledge assessment system. romans lukashenko also works as a researcher at the department of systems theory and design of riga technical university. he has participated in several research projects related to the development of educational software.
international journal of computers, communications & control vol. ii (2007), no. 3, pp. 217-228
reactive control using behavior modelling of a mobile robot
imen ayari, abderrazak chatti
abstract: this paper deals with the reactive control of an autonomous robot which should move safely in a crowded unknown environment to reach a goal. a behavior-based approach is used to realize obstacle avoidance within a neural model conceived from a set of examples of perception/action relations; supervised learning is used for this purpose, while the goal-reaching task is realized using a fuzzy rule-based system. a task activation module is used to generate the overall command resulting from the fuzzy controller and the neural model. real-time simulation examples of paths generated with the proposed techniques are presented.
keywords: reactive control, mobile robots, neural networks, learning, fuzzy control.
1 introduction
one of the major challenges in the development of intelligent mobile robotic systems is endowing them with an ability to plan motions and to navigate autonomously in a crowded environment while avoiding any type of obstacles. different kinds of the path planning problem can be illustrated; the simplest one is to find a continuous path from a starting position to a goal position given an exact description of the environment. different global approaches, such as decomposition, road-map, and retraction methods, have been applied for this purpose.
when dealing with an unknown environment, far fewer approaches are available; obstacles in that case are either detected locally during the robot's movement and dynamically incorporated into the path generation process, or approximated in the workspace (miura, uozumi, and shirai [9]; yu and su [10]; bennewitz, burgard and thrun [11]; [12]). a way to deal with the navigation problem is through behavior-based navigation approaches. the main idea is to subdivide the navigation task into small subtasks. several behavior-based control schemes have been achieved, inspired by the subsumption architecture of brooks [13][14], such as the reactive behaviors defined by arkin using motor schemas (arkin [15][16][17]); the damn architecture presented by rosenblatt et al. [18][19][12], in which a centralized module combines votes provided by independent behaviors into a voted output; and the fuzzy logic approach to managing behaviors used by saffiotti [13], seraji et al. [14][15] and others [16][17][18]. this paper provides an overview of the contribution of soft computing to the field of behavior-based robot control. a hybrid approach is adopted, using both neural and fuzzy logic methods, to realize the specified task of reaching a goal while avoiding obstacles in a crowded unknown environment. a neural model is conceived to synthesize the obstacle-avoiding behavior from a set of examples representing the perception/action relation, using a supervised learning strategy. the goal-reaching task is realized using a fuzzy rule-based system. a task activation module is used to generate the overall command resulting from the fuzzy controller and the neural model. simulation examples of paths generated with the proposed techniques are presented.
2 neural networks in robotics
historically, robots for industrial purposes involved little or no learning. recently, a growing interest in unstructured environments has encouraged learning-intensive design methodologies.
the emerging class of robots must be able to interact responsively with people and other robots, providing assistance and services that will increasingly affect everyday life. robots may learn by adjusting parameters, building environmental models such as maps, exploiting patterns, evolving rule sets, generating entire behaviors, devising new strategies, predicting environmental changes, recognizing the strategies of opponents or exchanging knowledge with other robots. essentially, neural networks deal with cognitive tasks such as learning, adaptation, generalization and optimization. indeed, recognition, learning, decision-making and action constitute the principal navigation problems. to solve these problems, fuzzy logic and neural networks are used; they improve the learning and adaptation capabilities in the face of variations in an environment where information is qualitative, inaccurate, uncertain or incomplete. artificial neural networks (anns) are algorithms based very loosely on the neural phenomenon of spreading activation. anns can encode knowledge and skill implicitly as associative connections between nodes. stimulation introduced at the input nodes of a neural network travels between the layers of the network to produce some output at the other end. this output is evaluated by a trainer who applies supervised or unsupervised learning to alter the weights of the synaptic connections and thereby change the way the network will respond. in this way, anns allow human knowledge and guidance to orchestrate the learning process. such techniques, where learning is applied by a knowledgeable teacher, are often referred to as robot shaping. usually robot shaping involves symbolic interaction between a human and a robot and may even involve a high-level tasking language.
such an interface allows humans to supply high-level assistance and allows the robot to accomplish the low-level learning necessary to achieve the goal [19].
3 robot description
the robot used in this work has been conceived in our laboratory [20][21] around a hardware architecture that facilitates communication with a remote computer in different ways. it is based on different modules, as shown in figure (1). every module has processing capabilities that allow updating the type of the low-level current controller for the motor driver (2), testing different dead-reckoning approximations for the wheel encoders that determine the position (3), or computing different averaging methods for the infrared sensors (4). the robot can be totally free, with an embedded master controller transmitting information wirelessly to a computer, or the computer can be its master, taking inputs and giving orders. the mechanical structure supporting this architecture is based on two dc motors driving two differential wheels through gears. the wheels are provided with encoders that allow, after processing, determining an approximate relative position of the robot. eight infrared sensors are placed on a circle every 45°; they sense obstacles by measuring the reflected light.
figure 1: robot architecture
figure 2: ir sensors (ir diode + phototransistor)
figure 3: robot parameters
figure (3) shows the robot parameters, where c is the center of the wheels and g is the center of gravity; d separates these two centers and in the general case is always different from zero.
the movement vector q containing the variables of the robot has the cartesian coordinates of the center, the orientation and the angle of each of the left and right wheels:

\[ q = (x_c, y_c, \phi, \theta_r, \theta_l) \quad (1) \]

the kinematics equations of this model are:

\[ \dot{y}_c \cos\phi - \dot{x}_c \sin\phi - d\dot{\phi} = 0 \]
\[ \dot{x}_c \cos\phi + \dot{y}_c \sin\phi - b\dot{\phi} = r\dot{\theta}_l \]
\[ \dot{x}_c \cos\phi + \dot{y}_c \sin\phi + b\dot{\phi} = r\dot{\theta}_r \]

from these we obtain the pfaffian kinematics constraint \( A(q)\dot{q} = 0 \) with

\[ A(q) = \begin{pmatrix} -\sin\phi & \cos\phi & -d & 0 & 0 \\ -\cos\phi & -\sin\phi & -b & r & 0 \\ -\cos\phi & -\sin\phi & b & 0 & r \end{pmatrix} \]

deriving the kinetic energy equation and considering the system with the pfaffian constraint, we obtain the matrix differential system:

\[ M(q)\ddot{q} + V(q,\dot{q}) = E(q)\tau - A^{T}(q)\lambda, \qquad A(q)S(q) = 0, \qquad S^{T}\!\left( M S \dot{\upsilon}(t) + M \dot{S} \upsilon(t) + V \right) = \tau \]

where \( S(q) \) spans the null space of \( A(q) \), so that \( \dot{q} = S(q)\upsilon(t) \). the resulting system is based on non-stationary matrices that are re-computed at every simulation step:

\[ \dot{x} = \begin{bmatrix} S\upsilon \\ 0 \end{bmatrix} + \begin{bmatrix} 0 \\ I \end{bmatrix} u \]

thus we have an approximate model of the robot dynamics. the matrices of this differential equation have parameters measured on the real robot, while others are variables computed at run time during the simulation, like position and speed. the input of this equation is the torque applied on each wheel. solving these equations provides the speed and position of the robot.
figure 4: robot dynamic model input output
4 proposed approach
a hybrid approach is adopted, using both neural and fuzzy logic methods, to realize the specified task of reaching a goal point while avoiding obstacles. the control structure is made of two tasks, as seen in figure (5): a task for reaching the goal and a second one for avoiding obstacles; the input of the control system is the sensor data and the outputs are the actuator commands.
figure 5: control structure
5 goal reaching task
the goal reaching task is expected to align the robot's heading with the direction of the goal; \( \theta_{goal} \) is the orientation difference between the robot axis and the direction of the goal.
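adding and subtracting the two wheel equations gives the linear speed v = r(θ̇r + θ̇l)/2 and the yaw rate φ̇ = r(θ̇r − θ̇l)/(2b), which can be integrated numerically to simulate the robot's motion. the sketch below is not the authors' simulator: the wheel radius r, the half axle length b, and the step size are illustrative assumptions, and the torque-level dynamic model is deliberately left out.

```python
import math

def step_unicycle(x, y, phi, w_r, w_l, r=0.03, b=0.05, dt=0.01):
    """One Euler step of the differential-drive kinematics.

    w_r, w_l are the wheel angular speeds; r and b are assumed values
    for the wheel radius and half axle length (not from the paper).
    """
    v = r * (w_r + w_l) / 2.0          # linear speed of the axle center
    dphi = r * (w_r - w_l) / (2.0 * b) # yaw rate
    x += v * math.cos(phi) * dt
    y += v * math.sin(phi) * dt
    phi += dphi * dt
    return x, y, phi

# equal wheel speeds drive the robot straight along its heading
x, y, phi = 0.0, 0.0, 0.0
for _ in range(100):
    x, y, phi = step_unicycle(x, y, phi, w_r=10.0, w_l=10.0)
```

with equal wheel speeds the yaw rate is zero, so after one simulated second the robot has advanced v·t = 0.3 m along the x axis.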
a fuzzy controller is used to accomplish the task, as shown in figure (7), with fc_lw and fc_rw the fuzzy commands of the left wheel and the right wheel respectively.
figure 6: goal orientation resolving
figure 7: goal reaching controller
fuzzy sets: the variable θgoal has 8 fuzzy sets (left, front left, front, front right, right, right back, back, left back). the variables fc_lw and fc_rw have 6 fuzzy sets (go ahead, go more on left, go more on right, quick turn left, quick turn right, go back).
fuzzy inference system: the inference system is defined by the following rules:
• if goal is on front then go ahead
• if goal is on left then go more on left
• if goal is on right then go more on right
• if goal is back then go back
• if goal is on front left then quick turn left
• if goal is left back then go back and go more on left
• if goal is on right back then go back and go more on right
• if goal is on front right then quick turn right
figure 8: inputs and outputs membership functions
6 obstacle avoidance task using neural robot behavior modelling
for the task of avoiding obstacles, a neural model reproducing the robot behavior in the given environment is utilized. it is a way of learning the environment with its obstacles by assigning command values to each data vector from the sensors. in our application the model inputs are the 8 infrared sensor readings and the outputs are the commands of each wheel motor.
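one way the eight input sets of figure 8 and the rule base above could be evaluated is sketched below. the triangular membership shapes and the singleton wheel-command values in RULES are illustrative assumptions, not the paper's actual membership functions; the back set is split into two triangles at ±180° so the input wraps around.

```python
def tri(x, a, b, c):
    """Triangular membership function peaking at b, zero outside [a, c]."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x < b else (c - x) / (c - b)

# centers (degrees) of the eight fuzzy sets for theta_goal, as in figure 8;
# "back" is split into its -180 and +180 halves
SETS = {"back-": -180, "left back": -135, "left": -90, "front left": -45,
        "front": 0, "front right": 45, "right": 90, "right back": 135,
        "back+": 180}

# assumed singleton (left, right) wheel commands per rule consequent
RULES = {"front": (1.0, 1.0),         # go ahead
         "left": (0.2, 1.0),          # go more on left
         "right": (1.0, 0.2),         # go more on right
         "front left": (-0.5, 1.0),   # quick turn left
         "front right": (1.0, -0.5),  # quick turn right
         "back-": (-1.0, -1.0), "back+": (-1.0, -1.0),  # go back
         "left back": (-1.0, -0.2),   # go back and go more on left
         "right back": (-0.2, -1.0)}  # go back and go more on right

def goal_reaching(theta_goal):
    """Weighted-average defuzzification of the two wheel commands."""
    num_l = num_r = den = 0.0
    for name, center in SETS.items():
        mu = tri(theta_goal, center - 45, center, center + 45)
        if mu > 0:
            cl, cr = RULES[name]
            num_l += mu * cl
            num_r += mu * cr
            den += mu
    return num_l / den, num_r / den
```

with the goal dead ahead (θgoal = 0°) only the front set fires and both wheels get the full go-ahead command; intermediate angles blend the two neighbouring rules.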
figure 9: obstacle avoidance strategy
figure 10: infrared sensors position
the corresponding neural network model is a normalized multilayer perceptron (mlp) composed of 5 hidden neurons with sigmoidal activation functions and two linear output neurons representing the wheel commands. the training algorithm used is backpropagation with simple gradient. the simple gradient version consists in modifying the weights w according to the following formula:

\[ w(k) = w(k-1) + \Delta w(k), \qquad \Delta w(k) = -\mu \frac{\partial J}{\partial w} \]

where \( \mu \) is the training step.
figure 11: neural network architecture
the network is trained using a conceived indoor environment with many obstacles of different shapes. the training data was collected by using a joystick to guide the robot navigating in the environment while avoiding obstacles. a sequence of the 8 sensor readings and the corresponding wheel commands is recorded in a file used for training and testing the achieved model. the model is validated by computing the prediction error and by testing the robot behavior in the same environment after training. the training examples describing the obstacle-avoiding behavior should be chosen carefully in order to be representative and generalizable to other environments. a first training was made in a learning environment endowed with obstacles of simple shape; a more complex type of learning environment was then used. the training principle is to teach the robot how to dodge obstacles: the robot goes forward, its front should be free, otherwise it turns and follows the obstacle border, as seen in figures (12) and (13). the environment shown in figure (1.a) was used as the basic one; the test result was satisfactory in the same environment, but collisions were detected when adding particular shapes of obstacles like corners or u-traps.
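a toy version of this training rule is sketched below. the 8-5-2 layout (8 sensor inputs, 5 sigmoid hidden neurons, 2 linear outputs) follows the paper; the learning rate, the initialization, and the single synthetic sensor/command pair standing in for the recorded joystick sequences are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# 8 sensor inputs -> 5 sigmoid hidden neurons -> 2 linear wheel commands
W1 = rng.normal(scale=0.5, size=(5, 8)); b1 = np.zeros(5)
W2 = rng.normal(scale=0.5, size=(2, 5)); b2 = np.zeros(2)

def forward(x):
    h = sigmoid(W1 @ x + b1)
    return h, W2 @ h + b2

def train_step(x, target, mu=0.1):
    """Plain-gradient update w(k) = w(k-1) - mu * dJ/dw, J = 0.5*||y - t||^2."""
    global W1, b1, W2, b2
    h, y = forward(x)
    e = y - target                   # dJ/dy for the linear output layer
    dW2 = np.outer(e, h)
    db2 = e
    dh = (W2.T @ e) * h * (1 - h)    # backprop through the sigmoid layer
    dW1 = np.outer(dh, x)
    db1 = dh
    W2 -= mu * dW2; b2 -= mu * db2
    W1 -= mu * dW1; b1 -= mu * db1
    return 0.5 * float(e @ e)

# synthetic "sensor reading -> wheel command" pair; the loss should shrink
x = rng.random(8)
t = np.array([0.3, -0.2])
losses = [train_step(x, t) for _ in range(200)]
```

repeating the update on the same example drives the squared prediction error toward zero, which is exactly the prediction-error criterion the paper uses for validation.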
a second training was then carried out to learn to avoid this particular type of obstacles, as seen in figure (13); the navigation test was satisfactory for corners but not for u-traps: the robot begins to oscillate between the right and left side without finding an outlet; in fact the same command is alternately applied to the left and right wheels, which causes the oscillating behavior. this situation is treated in the task activation module that gives the overall command for the goal-reaching and obstacle-avoiding tasks.
figure 12: behavior robot learning in simple environment
7 task activation
the overall command, resulting from the fuzzy controller for reaching the goal and the neural model for avoiding obstacles, is computed by a task activation module, as shown in figure (14). a third unit is used at the input of the task activation module; it determines the situation (context) concerning the robot localization. three situations are considered according to the sensor data giving information about the existence of obstacles:
s1: if c1 is far and c2 is far ... and c8 is far then obstacle is far
s2: if c1 is near or c2 is near ... or c8 is near then obstacle is near
s3: if c1 is near and c2 is near and c4 is near and c6 is near and c7 is near and c8 is near then trap situation
figure 13: behavior robot learning in complex environment (particular shaped obstacles)
figure 14: task activation module
the input fuzzy sets describing the distance of each sensor to the nearest obstacle are defined in figure (15).
figure 15: membership functions for distance to obstacle
the trap situation is activated when the robot is trapped in a narrow corridor; the neural controller exhibits a bad behavior in this case: the robot begins to oscillate between the right and left side without finding an outlet. figure (16) shows an example of a trap situation.
figure 16: example of a trap situation
the task activation module is made up of a fuzzy rule base of the form "if situation then task_i". three rules are considered as follows:
• if obstacle is far then goal reaching
• if obstacle is near then obstacle avoidance
• if trap situation and 0 < θgoal < π then turn left, else turn right
8 simulation results
to verify the validity of the proposed approach, some cases were illustrated using a real-time simulation of the robot presented above; it has to move from a given current position to a desired goal position in an unknown environment endowed with different obstacle shapes: simple shapes as seen in figure (17) and particular ones depicted in figure (18.2). the learned environment and a new one were both tested for robot navigation, depicted respectively in figures (18) and (17). the robot is, in every case, able to reach the goal placed behind obstacles while avoiding the obstacles successfully.
9 summary and conclusions
this paper presented a behavior-based approach making use of neural networks and fuzzy control to realize the task of reaching a goal in a crowded environment. a neural network modelling the robot behavior when encountering obstacles is conceived using supervised learning. different shapes of obstacles were learned; the neural model should associate a particular sensor data vector with a given action. the goal-reaching task was achieved using a fuzzy rule-based system; coordination of the two tasks was addressed using a task activation module giving the overall action depending on the context.
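a crisp sketch of how the situation-selection unit and the three activation rules could be combined is given below. the 20/100 breakpoints of the near set follow figure 15, while the 0.5 firing threshold, the sensor scaling, and the sensor ordering c1..c8 are assumptions for illustration.

```python
import math

def mu_near(dist, lo=20.0, hi=100.0):
    """Membership of 'near' per figure 15: 1 below lo, 0 above hi, linear between."""
    if dist <= lo:
        return 1.0
    if dist >= hi:
        return 0.0
    return (hi - dist) / (hi - lo)

def select_task(sensors, theta_goal):
    """Crisp version of rules s1-s3 plus the three activation rules.

    sensors is the list of 8 distance readings c1..c8 (index 0 is c1).
    """
    near = [mu_near(s) > 0.5 for s in sensors]
    # s3: c1, c2, c4, c6, c7 and c8 all near -> trap situation
    trap = all(near[i] for i in (0, 1, 3, 5, 6, 7))
    if trap:
        return "turn left" if 0 < theta_goal < math.pi else "turn right"
    if any(near):               # s2: any sensor near -> obstacle is near
        return "obstacle avoidance"
    return "goal reaching"      # s1: every sensor far
```

note that the trap rule is checked first: it is a special case of s2, so ordering the rules resolves the conflict the same way the module's priority does.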
the simulation examples of the generation of a collision-free path with goal reaching show that the designed strategy is an acceptable solution to this problem.
figure 17: collision free goal reaching in an unlearned environment
figure 18: collision free goal reaching in a learned environment
bibliography
[1] j. miura, h. uozumi and y. shirai, mobile robot motion planning considering the motion uncertainty of moving obstacles, in proceedings of ieee international conference on systems, man, and cybernetics, pp. 692-697, 1999.
[2] h. yu and t. su, a destination driven navigator with dynamic obstacle motion prediction, in proceedings of ieee international conference on robotics and automation, pp. 2692-2697, seoul, korea: ieee, 2001.
[3] m. bennewitz, w. burgard and s. thrun, adapting navigation strategies using motion patterns of people, in proceedings of ieee international conference on robotics and automation, pp. 2000-2005, taipei, taiwan: ieee, 2003.
[4] d. v. lebedev, j. j. steil, h. j. ritter, the dynamic wave expansion neural network model for robot motion planning in time-varying environments, university of bielefeld, faculty of technology, neuroinformatics group, 2005.
[5] r. c. arkin, towards cosmopolitan robots: intelligent navigation in extended man-made environments, phd thesis, university of massachusetts, department of computer and information science, 1987.
[6] r. c. arkin, motor schema-based mobile robot navigation, int. j. of robotic research, vol. 8, pp. 92-112, 1989.
[7] r. c. arkin and t. balch, principles and practice in review, journal of experimental and theoretical artificial intelligence (jetai), vol. 9, no. 2/3, pp. 175-188, april 1997.
[8] r. a. brooks, a robust layered control system for a mobile robot, ieee journal of robotics and automation, vol. 2, no. 1, march 1986, pp. 14-23; also mit ai memo 864, september 1985.
[9] r. a.
brooks, a robot that walks: emergent behavior from a carefully evolved network, neural computation, vol. 1, no. 2, pp. 253-262, 1989; also in ieee international conference on robotics and automation, scottsdale, az, pp. 292-296, may 1989.
[10] j. rosenblatt, damn: a distributed architecture for mobile navigation, ph.d. dissertation, carnegie mellon university robotics institute technical report cmu-ri-tr-97-01, pittsburgh, pa, 1995.
[11] j. rosenblatt and d. w. payton, a fine-grained alternative to the subsumption architecture for mobile robot control, proceedings of the ieee/inns international joint conference on neural networks, washington dc, vol. 2, pp. 317-324, june 1989.
[12] d. langer, j. k. rosenblatt, and m. hebert, a behavior-based system for off-road navigation, ieee journal of robotics and automation, vol. 10, no. 6, pp. 776-782, 1994.
[13] a. saffiotti, the uses of fuzzy logic for autonomous robot navigation: a catalogue raisonné, soft computing research journal, vol. 1, no. 4, pp. 180-197, 1997.
[14] h. seraji and a. howard, behavior-based robot navigation on challenging terrain: a fuzzy logic approach, ieee trans. rob. autom., vol. 18, no. 3, pp. 308-321, 2002.
[15] s. x. yang, h. li, m. q.-h. meng, and p. x. liu, an embedded fuzzy controller for a behavior-based mobile robot with guaranteed performance, ieee transactions on fuzzy systems, vol. 12, no. 4, pp. 436-446, august 2004.
[16] x. yang, m. moallem, and r. v. patel, a layered goal-oriented fuzzy motion planning strategy for mobile robot navigation, ieee transactions on systems, man, and cybernetics, vol. 35, no. 6, pp. 1214-1224, december 2005.
[17] m. f. selekwa, d. d. dunlap, and e. g. collins, jr., implementation of multivalued fuzzy behavior control for robot navigation in cluttered environments, proceedings of the 2005 ieee international conference on robotics and automation, barcelona, spain, pp. 3699-3706, april 2005.
[18] fatmi, a. al yahmadi, l.
khriji, and n. masmoudi, a fuzzy logic based navigation of a mobile robot, transactions on engineering, computing and technology, vol. 15, issn 1305-5313, 2006.
[19] d. janglová, neural networks in mobile robot motion, international journal of advanced robotic systems, vol. 1, no. 1, pp. 15-22, 2004.
[20] w. filali, i. ayari, a. chatti, modelling, real time simulation and fuzzy control of a differential wheels mobile robot, congrès international en sciences et techniques de l'automatique sta, hammamet, 2006.
[21] w. filali, i. ayari, a. chatti, research platform on mobile robotics with modular architecture, tunisian-japanese seminar on culture, science and technology tjcst, sousse, 2005.
[22] i. rivals, l. personnaz, g. dreyfus and d. canas, real-time control of an autonomous vehicle: a neural network approach to the path following problem, in neuro-nimes, pp. 219-229, nimes, 1994.
imen ayari, abderrazak chatti
institut national des sciences appliquées et de technologie (insat)
centre urbain nord, bp. 676, 1080 tunis cedex, tunisie
e-mail: ayari.imen@gmail.com, abderrazak.chatti@insat.rnu.tn
received: february 12, 2007; revised: june 26, 2007
imen ayari (born on july 10, 1979) received the engineer diploma degree in 2003 and the master degree in 2005, both in automatic and industrial computing, from the "institut national des sciences appliquées et de technologie", tunisia. she is currently preparing the ph.d. degree in automatic and computer science within the framework of the lagis-ec-lille and lara-enit (tunisia) cooperation. her research is related to the application of soft computing methods, especially neural networks, to mobile robot control. abderrazak chatti received the master of science degree in 1988 from the polytechnical school of saint-pétersbourg, russia, and the phd degree in technical sciences in 1994 from the polytechnical university of saint-pétersbourg, russia, both in automatics and industrial computing.
presently he is an associate professor at the "institut national des sciences appliquées et de technologie", tunisia. his research interests include topics such as modelling and control of nonlinear dynamic processes and robust and intelligent control.
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 664-664
a fuzzy control heuristic applied to non-linear dynamic system using a fuzzy knowledge representation
f.m. cordova, g. leyton
felisa m. cordova, university of santiago of chile, ecuador 3769, estacion central, santiago, chile, e-mail: felisa.cordova@usach.cl
guillermo leyton, university of la serena, benavente 980, e-mail: gleyton@userena.cl
abstract: this paper presents the design of a fuzzy control heuristic that can be applied for modeling nonlinear dynamic systems using a fuzzy knowledge representation. nonlinear dynamic systems have traditionally been modeled on the basis of the connections between the subsystems that compose them. nevertheless, this model design does not consider some of the following problems: the dynamics existing between the subsystems; the order and priority of the connections between subsystems; the degrees of influence or causality between subsystems; the particular state of each subsystem and the state of the system on the basis of the combination of the diverse states of the subsystems; positive or negative influences between subsystems. in this context, the main objective of this proposal is to manage the whole system state by managing the state combination of the subsystems involved. in the proposed design the diverse states of subsystems at different levels are represented by a knowledge base matrix of fuzzy intervals (kbmfi). this type of structure is a fuzzy hypercube that provides operations like insert, delete, and switching. it also allows boolean operations between different kbmfis, and inferences.
each subsystem at a specific level and its connectors are characterized by factors with fuzzy attributes represented by membership functions. measures of the existing degree of influence among the different levels (negative, positive) are obtained. in addition, the system state is determined based on the combination of the states of the subsystems (stable, oscillatory, attractor, chaos). this allows introducing the dynamic effects in the calculation of each output level. the control and search of knowledge patterns are made by means of a fuzzy control heuristic. finally, an application to the coordination of the activities among the different levels of the operation of an underground mine is developed and discussed.
keywords: fuzzy systems, knowledge representation, heuristics, nonlinear dynamic systems.
1 introduction
organizations can be visualized as complex systems composed of various subsystems that respond to different problems and have their own dynamics. this process in turn is recursive, so each subsystem has a particular dynamics. such is the case of managements, business areas, departments, primary and support activities of the value chain, activity plans, and many other systems and subsystems existing in the organization. each subsystem is characterized by its variables, by inputs that can alter its performance, and by its outputs, which are the inputs of other subsystems, whose dependent effects are known only approximately. this constitutes a set of highly dynamic subsystems with clearly nonlinear characteristics. usually, these factors are not considered in decision making processes. it is clear that a universe of this kind is quite heterogeneous, dynamic, and growing.
also, because of the nature of the stated problem, it must be considered that these subsystems represent inputs among themselves, giving the problem a high dose of parallelism. insofar as these subsystems serve as inputs among themselves, feedback takes place continuously, making the system's dynamics difficult to control, predict, manage and administer [1]. it is also necessary to take into account the increasing amount of data, information and knowledge that current systems must administer, and in particular its adequate representation [9]. if we consider that knowledge-based management and decision making must be carried out in organizations having these characteristics, then it is ever more important to support conceptual models and tools adequate for the planning, management and control of this dynamics. on the other hand, knowledge representation is a fundamental component of any intelligent system, allowing the coding of knowledge, objects, objectives, actions, and processes. the scheme chosen for the representation of knowledge determines the reasoning process and its efficiency. numerous studies on the representation of knowledge show that one representation can be more adequate than another for a particular case, or can be capable of covering a greater number of cases [8]. the more traditional methods used are semantic networks, frames, production rules, trees, and bit matrices. cazorla et al. [3] suggest that knowledge can be classified according to the specific application in which it is used: procedural, declarative, meta-knowledge, heuristic, or structural. however, the theory of fuzzy sets proposed by zadeh [12], [13] allows the generation of knowledge representations that are closer to the very nature of what is to be represented.
The conceptual models of systems, their knowledge-based representation, and the tools for supporting management and decision making must then consider in their design factors such as high dynamism, parallelism, feedback, incompleteness, handling of uncertainty, nonlinearity, vagueness, qualitative definitions and behaviors, personal opinions, etc. Along this line, some authors [1], [16], [15] develop in depth various concepts such as fuzzy function approximation, chaos and fuzzy control, and processing of fuzzy signals. Kosko's greatest contribution, however, refers to the calculation and representation of knowledge by means of fuzzy cubes and fuzzy cognitive maps. McNeill [6] also works with fuzzy theory as a means of representing environments with uncertainty, usually characterized by their nonlinearity. Welstead, on the other hand, supported by one of Kosko's results [11], suggests that fuzzy rules can be represented by one or more fuzzy associative memory (FAM) matrices; combining the above with genetic algorithms, he proposes a model to approach prediction problems. These authors also use fuzzy representations centered mainly on the interaction of fuzzy theory, neural networks, and genetic algorithms, supporting a new line of work known as computational intelligence. Tsoukalas [10] is more centered on the interaction and creation of fuzzy theory and neural network hybrids. To approach these kinds of problems, models are designed making use mainly of causal diagrams or knowledge maps, with a series of nodes that represent the concepts relevant to the system and links between them that show the causal relation (influence) between the concepts.
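As a rough illustration of how such a causal map can be simulated, the sketch below shows a minimal fuzzy-cognitive-map update in Python. It is not taken from the paper: the three-node weight matrix and the logistic squashing function are assumptions chosen only for the example.

```python
import math

def fcm_step(state, weights):
    """One synchronous update of a fuzzy cognitive map.

    state: list of node activations in [0, 1].
    weights: weights[i][j] is the causal influence of node i on node j,
             a value in [-1, 1] (positive = increases, negative = decreases).
    A logistic squashing function keeps activations inside (0, 1).
    """
    n = len(state)
    nxt = []
    for j in range(n):
        total = sum(state[i] * weights[i][j] for i in range(n))
        nxt.append(1.0 / (1.0 + math.exp(-5.0 * total)))  # squash into (0, 1)
    return nxt

# Three illustrative nodes (think: production, reduction, transport).
W = [[0.0, -0.6,  0.7],
     [0.5,  0.0, -0.4],
     [0.6, -0.5,  0.0]]
s = [0.8, 0.5, 0.3]
for _ in range(20):          # iterate and let the trajectory settle
    s = fcm_step(s, W)
```

Iterating the step function traces exactly the kind of trajectory discussed later in the paper: depending on the weights, the state vector can settle to a fixed point, cycle, or wander.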
In this context, the objective of this paper is to carry out a study and analysis that will allow modeling some types of dynamic systems, representing knowledge by means of a knowledge base matrix of fuzzy intervals and fuzzy cognitive maps [4], [14], [15], with the purpose of achieving their categorization and fuzzy weighting, as well as the levels of incidence on other subsystems, in this way characterizing the complete system with its levels of fuzzy incidence [5], [10].

2 Modeling of the Fuzzy Knowledge Base Matrix

In this proposal each of the map's concepts corresponds to a fuzzy set, and specifically to a particular knowledge base matrix of fuzzy intervals (KBMFI). The connections between concepts have an associated value in the [-1, 1] range that represents the degree of influence of one KBMFI node on another. If the value is positive, it indicates that an increase in the evidence of the origin concept increases the meaning, the evidence or the truth value of the destination concept. If it is negative, an increase in the evidence of the source causes a decrease in that of the destination. If the value is 0, there is no connection and no causal relation. In this way it is possible to obtain fuzzy cognitive maps from the opinion of one or several experts on the relations between some aspects of the evaluation process of a hypothetical case. The clear recursiveness involved in these types of systems is also considered, and a vision of granularity is proposed that allows overcoming the various levels of abstraction underlying the dissimilar subsystems. On the other hand, internally each subsystem can be represented by KBMFIs, allowing its incidence weight with respect to other subsystems to be obtained while at the same time representing its particular behavior.

Definition 1. Let X be a classical set of objects, called the universe.
Belonging to a subset A of X can be defined in terms of the characteristic function

µA : X → {0, 1}, x → µA(x),   (1)

where µA(x) = 1 if x ∈ A and µA(x) = 0 if x ∉ A. If the evaluation set {0, 1} is extended to the real interval [0, 1], then it is possible to talk about partial belonging in A, where µA(x) is the degree of belonging of x in A, and the values 0 and 1 are interpreted as "non-belonging" and "total belonging", respectively. In that case A is a subset of X that has no sharply defined boundaries. This leads to the following definition.

Definition 2. Let X be a space of objects. A fuzzy set A of X is characterized by the set of pairs

A = {(x, µA(x)) | x ∈ X}, where µA : X → [0, 1].   (2)

The fuzzy concept proposed by Zadeh [12] is based on allowing the partial belonging of certain elements of a given universe to a set.

Definition 3. A fuzzy hypercube can be considered as a unit hypercube, i.e., the hypercube Iⁿ = [0, 1]ⁿ. The fuzzy n-cube has 2ⁿ vertices, the binary subsets. A fuzzy cube contains all the fuzzy sets of a set X of n objects: the non-fuzzy sets are found at the vertices of the cube, and the continuum of fuzzy sets lies inside the cube.

Definition 4. A knowledge base matrix of fuzzy intervals (KBMFI) is the hypercube constituted by the various pieces of knowledge e1, e2, e3, ..., en relative to a domain of knowledge, considering also the different weight or importance that each of them has in the particular domain. The KBMFI is a fuzzy hypercube where e1, e2, ..., en represent the various contingencies or characteristics of the area under discussion, according to the opinion of the experts.
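The contrast between the characteristic function of Definition 1 and the membership function of Definition 2 can be made concrete with a small sketch. The "tall" predicate and its 160/190 cm thresholds are invented for illustration and are not part of the paper.

```python
def char_fn(a_set, x):
    """Classical characteristic function: 1 if x belongs to the set, else 0."""
    return 1 if x in a_set else 0

def mu_tall(height_cm):
    """Illustrative fuzzy membership function for 'tall':
    0 below 160 cm, 1 above 190 cm, linear in between."""
    if height_cm <= 160:
        return 0.0
    if height_cm >= 190:
        return 1.0
    return (height_cm - 160) / 30.0
```

Where `char_fn` can only answer 0 or 1, `mu_tall` returns any degree in [0, 1], e.g. 0.5 for 175 cm: exactly the extension from {0, 1} to [0, 1] described in the text.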
The ej, with j = 1, 2, ..., n, do not necessarily have the same relevance or weight; they can in particular be fuzzy frames consisting of factors s1, s2, ..., sm, not necessarily disjoint, such that each characteristic ei can be expressed by means of some particular union of the factors s1, s2, ..., sm. The particular determination of each ei through its factors s1, s2, ..., sm then models systems composed of a range of nodes n1, n2, ..., nn that continually influence each other, where the incidence of one with respect to the others is completely dynamic. In particular, this outlines a vision of dynamic nonlinear systems which, in similar but not identical versions, are seen as causality maps. If the map is to be adjusted to the opinions of several experts, one has to gather the assessments of all of them and thereby establish the definitive values associated with the causality relations. It must be noted that, in general, the causalities mentioned by the experts with respect to the various influences exerted by the nodes of the maps are attributable to qualitative rather than quantitative concepts. As already stated, nonlinear dynamic systems involve nonlinear and feedback behaviors. In these systems the output of a process or node is used as input for the following node or iteration, and the output of this can again be the input of the same previous node, i.e., self-recurrent behaviors. This behavior corresponds to the iteration

x1 = f(x0), x2 = f(x1), ..., xn+1 = f(xn),

so that when modeling the system the sequence x1, x2, x3, ..., xn is obtained.

Definition 5. Let x0 be an arbitrary starting node; then the above sequence is called the trajectory of x0. Considering these definitions, several behaviors can occur, such as, for example: fixed points; periodic trajectories; behaviors given by attractor nodes; and chaos.
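The behaviors listed after Definition 5 can be observed by iterating a simple one-dimensional map. The logistic map used below is a standard textbook illustration, not part of the paper's model; it is chosen only because its parameter selects between a fixed point and chaos.

```python
def trajectory(f, x0, steps):
    """Iterate x_{n+1} = f(x_n) from a starting point x0 (Definition 5)."""
    xs = [x0]
    for _ in range(steps):
        xs.append(f(xs[-1]))
    return xs

# Logistic map x -> r*x*(1-x): with r = 2.5 the trajectory settles to the
# fixed point x* = 1 - 1/r = 0.6; with r = 4.0 it wanders chaotically in [0, 1].
fixed = trajectory(lambda x: 2.5 * x * (1 - x), 0.2, 200)
chaotic = trajectory(lambda x: 4.0 * x * (1 - x), 0.2, 200)
```

Two trajectories of the same family of maps thus exhibit two of the behaviors named in the text: a fixed point and chaos.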
3 Case Study

The case study corresponds to the situation of an underground mine which has three levels: production level, reduction level, and transport level. The problem consists in "providing support to activity scheduling management" [2]. The total system shown in Figure 1 consists of these three subsystems and the dynamics that exists between them. This situation is denoted as level 1.

Figure 1: production, reduction and transport levels.

n1: production level, with n11, n12, n13 as subsystems; n2: reduction level, with n21, n22 as subsystems; n3: transport level, with no subsystems. Looking at a more particular abstraction level, level 2 appears, as shown in Figure 2. From the situation shown in Figure 1 it is seen that: n1 influences n2 negatively and n3 positively; n2 influences n1 positively and n3 negatively; n3 influences n1 positively and n2 negatively. However, Figure 2 shows that the information obtained at level 1 of abstraction of the system does not have the sensitivity or reliability that is obtained at level 2 of abstraction, whose granularity or disaggregation is slightly higher.

Figure 2: diagram of influence at the different levels.

If both levels are compared, it may be incorrectly deduced that apparently contradictory information is obtained. For example, if we look at level 1 and level 2 for the case of n3 with n2: at level 1 it was stated that n3 influences n2 negatively, but at level 2 it could be concluded that the two influences have opposite signs, since n31 influences n22 negatively while n22 influences n31 positively.
This apparent contradiction can be explained, for example, by saying that when production at the reduction level decreases, there is less pressure on the demand for trains or cars; on the other hand, if there is not sufficient transport from n31, there is an impact due to accumulation of material at the reduction level, which is considered a negative influence. The question then is: which of the two situations has the greater incidence weight? According to Figure 3, and only as an example, it can be stated that the negative impact from n31 to n22 is greater than the influence of n22 on n31. The main observations on the system are: it is clearly a dynamic fuzzy system; in turn, every ni is a dynamic fuzzy subsystem; the connections between the various ni are fuzzy; and these connections can be positive or negative: if positive, ni influences nj positively; if negative, ni influences nj negatively.

4 Design and Implementation of the KBMFI Matrix

Going more deeply into Table 1, the experts draw these KBMFIs as causal tables. They do not state equations, but make links between subsystems. The KBMFI systems convert each pictograph into a fuzzy rules weight matrix. The nodes of the KBMFI can model the complex nonlinearities between the input and output nodes, and the KBMFI can model the dynamics that occur in the multiple iterations that take place in these dynamic systems. A KBMFI with n nodes has n² arcs. Since the ni(t) nodes are fuzzy concepts, their values lie in [0, 1]; a state of a KBMFI is the vector n(t) = (n1(t), n2(t), ..., nn(t)), so it is a point of the hypercube Iⁿ = [0, 1]ⁿ. An inference in a KBMFI is a path, or sequence of points, in Iⁿ, i.e., it is a fuzzy process or an indexed family of fuzzy sets n(t).
It is clearly seen that the KBMFIs can perform "forward chaining"; whether they can perform "backward chaining" (nonlinear inverse causality) is an open question. The KBMFIs form, as nonlinear dynamic systems, semantic fuzzy networks and act as neural networks. The KBMFIs can converge to a fixed point or to a limit cycle, which can be a stable or oscillating state, or to a chaotic attractor in the fuzzy cube Iⁿ. In this context, one of the basic questions to be answered is: what happens if the input to the KBMFI system is known? In this sense, each KBMFI stores a set of global rules of the form:

if n(0) then attractor A   (3)

A KBMFI with a single fixed global point has only one global rule. The size of the attractor regions in the fuzzy cube governs the number of these global regions or hidden patterns. The KBMFIs can have large and small attractor regions in Iⁿ, each of them with a different degree of complexity. Therefore one input state can lead to chaos, while a relatively close input state can end up in a fixed point, a limit cycle or a stable state. Since the KBMFIs correspond to a semantic fuzzy network structure, it is possible to associate a matrix M with them. This matrix lists the causal links between the ni nodes. As an example, considering again the case described by Figure 2, the corresponding KBMFI is the following, where row i gives the incidence of node ni on each node nj, the columns list the influenced nodes, α, β, γ, δ, η, τ are values, and µ is a fuzzy function taking values such as [little, more or less, much, etc.]:

      n11   n12   n13   n21   n22   n31
n11    0   -αµ   +αµ    0     0    +αµ
n12    0    0     0    +βµ    0     0
n13  +γµ    0     0     0    -γµ    0
n21    0   -ηµ   -ηµ    0    -ηµ    0
n22  +δµ    0     0     0     0    +δµ
n31  +τµ    0     0     0    -τµ    0

The proposed model is decomposed into diverse abstraction levels, and each level is represented by a corresponding KBMFI. Initially, observing Figure 2, abstraction level 0 appears; only the shapes of the influences are observed: a node ni can influence the node nj positively or negatively.
Abstraction level 0 appears:

     n1   n2   n3
n1    0    -    +
n2    +    0    -
n3    +    -    0

Experts are asked to qualify the degree of influence µ = [nothing, irrelevant, few, influence, regular, alter, a lot, very much, so much], as shown in the incidence graphic of Figure 3. Applying the incidence graphic, a second level of abstraction, 01, is obtained:

     n1   n2   n3
n1    0   -µ   +µ
n2   +µ    0   -µ
n3   +µ   -µ    0

Observing the degree of incidence between a node ni and a node nj means that a greater degree of specificity (granulation) exists between them. This enhancement of specificity is made explicit at the following level, where a "slot" exists between ni and nj.

Figure 3: incidence graphic.

In this case the different situations are: n1 influences n2 negatively; n1 influences n3 positively; n2 influences n1 positively; n2 influences n3 negatively; n3 influences n1 positively; n3 influences n2 negatively. If it is considered that a node ni can be decomposed into ni1, ni2, ..., nik, where the nim, m = 1, 2, ..., k, each with a particular dynamics, make up ni, the situation in the analyzed case is as follows.

n1 = (n11, n12, n13) at level 0: node or subsystem n1. n1 at level 01, node or subsystem n1i, is defined by:

      n11  n12  n13
n11    0    -    +
n12    0    0    0
n13    +    0    0

n1 at level 011 is defined by:

      n11   n12   n13
n11    0   -αµ   +αµ
n12    0     0     0
n13  +αµ     0     0

n2 = (n21, n22) at level 0: node or subsystem n2. n2 at level 01, node or subsystem n2i, is defined by:

      n21  n22
n21    0    -
n22    0    0

n2 at level 011 is defined by:

      n21   n22
n21    0   -ηµ
n22    0     0

Applying the same procedure to node n3, which is characterized only by n31, n3 at level 011 is defined by:

      n31
n31    0

At this point only the fuzzy subsystem cohesion has been developed.
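One way to hold such a KBMFI in code is a sparse map from (source, destination) pairs to signed weights. The sketch below follows the signs of the example matrix in the text, but the numeric magnitudes standing in for αµ, βµ, γµ, etc. are invented for illustration.

```python
NODES = ["n11", "n12", "n13", "n21", "n22", "n31"]

# Signed weights per the example matrix; zero entries are simply omitted.
# Magnitudes are placeholders for the fuzzy values (little, much, ...).
K = {
    ("n11", "n12"): -0.4, ("n11", "n13"): +0.4, ("n11", "n31"): +0.4,
    ("n12", "n21"): +0.5,
    ("n13", "n11"): +0.3, ("n13", "n22"): -0.3,
    ("n21", "n12"): -0.6, ("n21", "n13"): -0.6, ("n21", "n22"): -0.6,
    ("n22", "n11"): +0.7, ("n22", "n31"): +0.7,
    ("n31", "n11"): +0.2, ("n31", "n22"): -0.2,
}

def influence(src, dst):
    """Signed degree of influence of src on dst (0.0 = no causal link)."""
    return K.get((src, dst), 0.0)

def received(dst):
    """All nodes that influence dst, with their signed weights."""
    return {s: w for (s, d), w in K.items() if d == dst}
```

With this encoding the contradiction discussed earlier becomes a simple lookup: `influence("n31", "n22")` is negative while `influence("n22", "n31")` is positive.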
It is thus necessary to visualize what happens with the external dynamics between subsystems, in order to obtain the fuzzy matching between systems. Continuing with the fuzzy cohesion procedure, the links between nodes n1, n2 and n3 are obtained. At level 0, by node n1:

     n1   n2   n3
n1    0    -    +

At level 01, by node n1:

     n1    n2    n3
n1    0   -αµ   +αµ

At level 011, by node n1:

      n1   n21   n22   n31
n11    0    0     0   +αµ

At level 02, by node n1:

      n1   n21   n22   n31
n11    0   +βµ    0     0

In this way the influences allowing the fuzzy matching are obtained.

5 Heuristic Control for the KBMFI

Each level ni has factors fij that determine it, with i = 1, 2, 3; j = 1, 2, ..., m. Table 1 shows relevant characteristics, factors, attributes and fuzzy functions at the production level, and Table 2 shows relevant factors, attributes and fuzzy functions at the production level. Each factor fij has attributes aijs that determine it, where i = 1, 2, 3; j = 1, 2, ..., m; s = 1, 2, ..., k (see Table 1). Each aijs has attribute metrics associated with its nature; these metrics are fuzzy membership functions (see Table 2). From the above, it is possible to state that the degrees of influence (negative or positive) that exist between the various levels can be measured, allowing the calculation of the existing dynamics of the system in order to achieve an intelligent fuzzy control with the purpose of keeping the system in a desirable (stable) state.

Characteristics of level 1 (production), with factor relative cardinality (crc), attributes (metrics or fuzzy functions, numbered 1-9) and the relative cardinality of level 1 (crn1); the factors are: 1. number of workmen present; 2. drilling, agents, and resources; 3. blasting, agents and resources; 4. technologies involved; 5. number of equipments; 6. lectures.

Table 1: relevant characteristics of level 1 at the production level.

6 Heuristic

The proposed heuristic consists of the following stages:

Stage 1: obtaining the fij factors of each level ni.
Stage 2: obtaining the aijs attributes of each fij factor.
Stage 3: determining the metrics associated with each aijs attribute, i.e., determining the fuzzy membership functions for each aijs.
Stage 4: determining the "formula" that corresponds to each fij from the aijs, for example:

fij = λ1 aij1 ⊕ λ2 aij2 ⊕ ... ⊕ λk aijk   (4)

where ⊕ is an operator to be determined (⇒, ∨, ∪, etc.) and Σ λk = 1.
Stage 5: determining ni from the fij, for example:

ni = λ1 fi1 ⊕ λ2 fi2 ⊕ ... ⊕ λm fim   (5)

where ⊕ is an operator to be determined (⇒, ∨, ∪, etc.) and Σ λm = 1. Note that the output of every nj must be between 0 and 1.
Stage 6: determining whether the "influence" of the output of ni on the other levels is negative or positive.
Stage 7: recalculating the output of nt, with its internal values, considering the influence exerted on it by the recursive dynamics of the nodes ni of stages 1, 2, ..., 5.
Stage 8: determining the output of nt, input of nl, and determining whether we feed nl or ni, and specifying the times. Note that in this step we distinguish what influences what: whether we make a push, a pull, or both at the same time, with a delay of one with respect to the other, etc.

Factors and attributes, with their fuzzy functions, for the project system:

1. Number of workmen present (decision-making complexity):
1.1 number of engineers: µ11(x) = 1 - (25 - x)/25, 10 ≤ x ≤ 25
1.2 number of technicians: µ12(x) = 1 - ((75 - x)/75)², 30 ≤ x ≤ 75
1.3 number of miners: µ13(x) = 1 - ((150 - x)/150)², 60 ≤ x ≤ 150
1.4 number of equipments: µ14(x) = 1 - ((30 - x)/30)², 12 ≤ x ≤ 30
where x is the number of engineers, miners, ...

2. Drilling, agents and resources. For evaluating this characteristic, first the predominant factor must be identified and then the calculation can be made. For example, if x = 25, 30, 90 or 21, respectively:
2.1 planned drillings.
µ1i: µ11(25) = 1; µ12(30) = 0.64; µ13(90) = 0.84; µ14(21) = 0.91.
2.2 direct agents involved: µ21(x) = 1 - √((50 - x)/50), 20 ≤ x ≤ 50
2.3 indirect agents involved: µ22,3(x) = 1 - ((15 - x)/30)³, 6 ≤ x ≤ 15

Table 2: factors, attributes and fuzzy functions at the production level.

7 Conclusions and Future Works

The work done in this paper allows the characterization of a complex system through subsystems, considering the dynamics and the incidence of each subsystem on the others. From the display of the complexity of the system and subsystems, the KBMFI is constructed, which allows an adequate representation of fuzzy knowledge and of the dynamics associated with the system. A fuzzy control heuristic is also designed that allows managing the KBMFI. In the case of the planning of mining operations, the KBMFI and the associated heuristic allow the evaluation of the impact of the incidence of various factors such as a reduction of the number of planned workers in a shift, or faults in load-haul-dump (LHD) equipment, rock breakers, shafts, and trains, among others. If software is to be developed from this proposal, it should be kept in mind that the tool should include an agent module that is informed (alerted) of the acceptable critical values for each node, so that the node does not alter the acceptable (expert-defined) states of the nodes with which it interacts. In such a case the agent must learn the acceptable critical values, and must know and learn preventive measures, mitigation measures, and corrective measures.

Bibliography

[1] D. Alahakoon, S. K. Halgamuge, B. Srinivasan, Dynamic Self-Organizing Maps with Controlled Growth for Knowledge Discovery, IEEE Trans. Neural Networks, Vol. 11: 601-614, 2000.
[2] F. Cordova, L. Canete, L. Quezada, F. Yanine, An Intelligent Supervising System for the Operation of an Underground Mine, International Journal of Computers, Communications and Control, Vol. III: 259-269, 2008.
[3] M. Gupta, R. K.
Ragade, Yager, Advances in Fuzzy Sets Theory and Applications, North Holland, Amsterdam, 1979.
[4] B. Kosko, Fuzzy Engineering, Prentice Hall, 1997.
[5] G. Martinez, Servente and Pasquini, Sistemas Inteligentes, NL Nueva Libreria, Argentina, 2003.
[6] T. McNeill, Fuzzy Logic: A Practical Approach, Academic Press, 1997.
[7] H. Roman, Sobre Entropias Fuzzy, Tesis de Doctorado, Universidad de Campinas, Brasil, 1989.
[8] E. Schnaider, A. Kandel, Applications of the Negation Operator in Fuzzy Production Rules, Fuzzy Sets and Systems, Vol. 34: 293-299, North Holland, 1990.
[9] W. Siler, J. Buckley, Fuzzy Expert Systems and Fuzzy Reasoning, John Wiley and Sons Inc., New Jersey, 416, 2005.
[10] U. Tsoukalas, Fuzzy and Neural Approaches in Engineering, Wiley Interscience, 1997.
[11] S. Welstead, Neural Network and Fuzzy Logic Applications in C++, Wiley Interscience, 1994.
[12] L. A. Zadeh, The Role of Fuzzy Logic in the Management of Uncertainty in Expert Systems, Approximate Reasoning in Expert Systems, Elsevier Science Pub., North Holland, 3-31, 1985.
[13] L. A. Zadeh et al. (Eds.), From Natural Language to Soft Computing: New Paradigms in Artificial Intelligence, Editing House of the Romanian Academy, 2008.
[14] L. A. Zadeh, Outline of a New Approach to the Analysis of Complex Systems and Decision Processes, IEEE Trans. Syst. Man Cybern., Vol. 3: 28-44, 1973.
[15] L. A. Zadeh, Fuzzy Sets and Fuzzy Information: Granulation Theory, Beijing Normal University Press, Beijing, 1997.
[16] L. Zhong, W. A. Halang, G. Chen, Integration of Fuzzy Logic and Chaos Theory, Springer-Verlag, Berlin Heidelberg, 2006.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 735-743

Full-Text Search Engine Using MySQL

C. Gyorodi, R. Gyorodi, G. Pecherle, G. M.
Cornea

Cornelia Gyorodi, Robert Gyorodi, George Pecherle, George Mihai Cornea
Department of Computer Science, Faculty of Electrical Engineering and Information Technology, University of Oradea, Str. Universitatii 1, 410087, Oradea, Romania
E-mail: cgyorodi@uoradea.ro, rgyorodi@rdsor.ro, gpecherle@uoradea.ro, generalmip@yahoo.com

Abstract: In this article we explain how a search engine can be created using the powerful MySQL full-text search. The ever increasing demands of the web require cheap and elaborate search options. One of the most important issues for a search engine is the capacity to order its result set by relevance and to provide the user with suggestions in the case of a spelling mistake or a small result set. In order to fulfill this requirement we turned to the powerful MySQL full-text search. This option is suitable for small to medium scale websites. In order to provide sounds-like capabilities, a second table is created containing a bag of words from the main table together with the corresponding metaphones. When a suggestion is needed, this table is interrogated for the metaphone of the searched word and the result set is computed, producing a suggestion.
Keywords: full-text, search, MySQL, index, search engine, ranking, metaphone, Levenshtein

1 Introduction

Before the advent of the search engine, users had to search manually through dozens or hundreds of articles to find the ones that were right for them. Nowadays, in our more user-centered world, we expect the results to come to the user, not the other way around: the search engine gets the computer to do the work for the user. Full-text search is widely used for various services of the Internet. High-speed and more efficient full-text search technology is necessary because of the ever increasing amount of handled documents and corresponding document data every day [6].
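The suggestion mechanism described in the abstract (phonetic key lookup, then picking the closest candidate) can be sketched as follows. This is not the paper's implementation: a simplified soundex stands in for the metaphone algorithm, and the small vocabulary is invented for the example.

```python
def levenshtein(a, b):
    """Classic edit distance via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1,
                           prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

SOUNDEX_CODES = {**dict.fromkeys("bfpv", "1"), **dict.fromkeys("cgjkqsxz", "2"),
                 **dict.fromkeys("dt", "3"), "l": "4",
                 **dict.fromkeys("mn", "5"), "r": "6"}

def phonetic_key(word):
    """Simplified soundex, used here as a stand-in for metaphone."""
    word = word.lower()
    key = word[0].upper()
    prev = SOUNDEX_CODES.get(word[0], "")
    for ch in word[1:]:
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != prev:   # skip vowels and collapse repeats
            key += code
        prev = code
    return (key + "000")[:4]

def suggest(word, vocabulary):
    """Prefer words sharing the query's phonetic key; break ties by distance."""
    same_key = [w for w in vocabulary if phonetic_key(w) == phonetic_key(word)]
    pool = same_key or vocabulary
    return min(pool, key=lambda w: levenshtein(w, word))

bag = ["mysql", "search", "engine", "fulltext", "relevance"]
```

In the paper's design the phonetic keys live in a second MySQL table built from the bag of words; here the lookup is done in memory only to show the two-step flow.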
According to the MySQL manual, full-text is a "natural language search": the words are indexed and appear to represent the row, using the columns you specified. As an example, if all your rows contain "MySQL", then "MySQL" won't match much: it is not terribly unique, and it would return too many results. However, if "MySQL" were present in only 5% of the rows, it would return those rows, because a word that does not appear too often is treated as a meaningful keyword. MySQL full-text search does not have many user-tunable parameters; if you want more control over your search, you will have to download the sources and compile them yourself after making the changes you want. In any case, MySQL full-text is tuned for best effectiveness, and modifying the default behaviour in most cases actually decreases effectiveness [1]. Our implementation brings additional functionality to the basic search capabilities that MySQL offers, by returning better quality results to the user. For small data sets to be searched, its performance can easily be compared with that of the more advanced dedicated full-text search engines.

2 Full-Text Search

In text retrieval, full-text search refers to a technique for searching a computer-stored document or database. In a full-text search, the search engine examines all of the words in every stored document as it tries to match the search words supplied by the user. When dealing with a small number of documents, it is possible for the full-text search engine to directly scan the contents of the documents with each query, a strategy called serial scanning. This is what some rudimentary tools, such as grep, do when searching. However, when the number of documents to search is potentially large, or the quantity of search queries to perform is substantial, the problem of full-text search is often divided into two tasks: indexing and searching.
The indexing stage will scan the text of all the documents and build a list of search terms, often called an index, but more correctly named a concordance. In the search stage, when performing a specific query, only the index is referenced, rather than the text of the original documents. Some indexers also employ language-specific stemming on the words being indexed, so for example any of the words "drives", "drove", or "driven" will be recorded in the index under the single concept word "drive" [2]. Before passing on to the search, we would like to discuss briefly how MySQL full-text indexes work. A MySQL full-text query returns rows according to relevance. But what is relevance? It is a floating-point number based on formulas; researchers have shown that these formulas produce results that real users want. For every word that is not too short or too common, MySQL calculates a number to determine its relevance inside the text. Note that MySQL will not increase the weight if two keywords are close to each other: local weight, global weight, and query weight are the only things that matter. MySQL can work with stemming, but as we have seen from different tests it does not work very well, usually bombarding the user with tons of irrelevant results. There are three formulas to mention for the full-text index [3]:

local weight = (log(dtf) + 1) / sumdtf * u / (1 + 0.0115 * u)
global weight = log((N - nf) / nf)
query weight = local weight * global weight * qf

where:
dtf: how many times the term appears in the row
sumdtf: the sum of (log(dtf) + 1) for all terms in the same row
u: how many unique terms are in the row
N: how many rows are in the table
nf: how many rows contain the term
qf: how many times the term appears in the query

Notice that the local weight depends on a straight multiplier, the term-within-row frequency times the unique-terms factor. Put most simply: if a term appears many times in a row, the weight goes up.
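The two table-side formulas above can be written directly in code (natural logarithms throughout). The sketch below reproduces the worked example from the text: a term appearing twice in a row of 7 unique terms, in a 4-row table where the term occurs in 3 rows.

```python
import math

def local_weight(dtf, sumdtf, u):
    """Term weight within one row, per the formula above."""
    return (math.log(dtf) + 1) / sumdtf * u / (1 + 0.0115 * u)

def global_weight(n_rows, nf):
    """Inverse-frequency weight of a term across the table."""
    return math.log((n_rows - nf) / nf)

# "mysql" appears twice in row 0; the other 6 unique terms appear once each.
sumdtf = (math.log(2) + 1) + 6 * (math.log(1) + 1)
lw = local_weight(2, sumdtf, 7)   # local weight of "mysql" in row 0
gw = global_weight(4, 3)          # "mysql" occurs in 3 of 4 rows (negative)
```

The negative global weight for a term present in more than half the rows is exactly the behaviour the text describes: such a term is effectively ignored in a real search.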
Why does the local weight depend on how many times the term is in the row? Think of the document you are reading now: we inevitably mention "MySQL" and "full-text" several times. That is typical: if words appear several times, they are likely to be relevant. Notice that the global weight depends on an inverse multiplier, the count of rows minus the count of rows that the term appears in. Put most simply: if a term appears in many rows, the weight goes down. To illustrate this better we have chosen to give a practical example. For this example, we have used a smaller database in order to be able to calculate the relevancies. To see what is in a full-text index, we can use the myisam_ftdump program, which comes with the standard distribution. The name of the owner table is demo and it is located in the dmp database. The table consists of 2 columns, the second one having the full-text index. In order to see the local weights of the different words, we use the command dump index; it will return the data offsets and the word weights (Fig. 1).

Figure 1: dump index.

In order to see the global weights of the words, we use the command calculate per-word stats; it will return a word count and the global weight (Fig. 2).

Figure 2: calculate per-word stats.

For the first formula, (log(dtf) + 1) / sumdtf * u / (1 + 0.0115 * u), where dtf is how many times the term appears in the row: "mysql" appears 2 times in row 0, so log(dtf) + 1 = 0.6931472 + 1 = 1.6931472. sumdtf is the sum of (log(dtf) + 1) for all terms in the same row: "mysql" appears 2 times in row 0, so add log(2) + 1; "world", "popular", "open", "source", "database" and "free" each appear 1 time in row 0, so add log(1) + 1 for each of them. Therefore sumdtf = log(2) + 1 + (log(1) + 1) * 6 = 7.6931472. u is how many unique terms are in the row; there are 7 unique terms in row 0, so u / (1 + 0.0115 * u) = 7 / (1 + 0.0115 * 7) = 6.478482. Thus local weight = 1.6931472 / 7.6931472 * 6.478482 = 1.4258175 (check Fig. 1 for the first occurrence of the term "mysql").

For the second formula, log((N - nf) / nf), where N is how many rows are in the table and nf is how many rows contain the term: there are 4 rows in the demo table, and the term "mysql" occurs in 3 rows, so log((4 - 3) / 3) = -1.0986123 (check Fig. 2 for the term "mysql"). Note that because this term appears in more than 50% of the rows, it has a negative global weight; in an actual search this term will practically be ignored, only the adjacent terms being representative.

3 Implementation

The primary table is the table where we keep all the data that has to be searched and ranked. Also, from this table we will later create the "bag of words" together with their metaphones, in order to provide correction suggestions to the user. The primary table will consist of 6 columns, the first one being the row id and all the others being meant for full-text search. The structure of the table is:

id: unique identifier for every row
url: the url where the text was taken from
title: the title of the page the text was taken from
content: everything on the page that is not part of the formatting and functionality (only text, no html or other scripts, including the headers and strong text)
headers: h1, ... contents from the page; they will have higher relevancy than strong and content, but lower than url and title
strong: words that appear between b or strong tags; they will have higher relevancy than ordinary text from content, but lower than all the others

During the construction of the table, in the indexing process we decompose every page to obtain the required fields. First we obtain the url and the title fields. After we have those two, we can get rid of the head part of the web page and move on to the body.
Before stripping the remaining HTML and script tags from the text, we first take out the header and strong keywords, which will have a higher relevancy in our future search. After that we can parse out all the remaining tags and send the resulting data into the MySQL database. An improvement to this table would be a new column containing all the keywords linking to this page. This way the article relevancy can be extended beyond the actual page. Those linking keywords can be the ones between the anchor tags, or the most relevant keywords from that page (ranked by their density after the stop words have been removed), or a combination of both. However, this approach requires many more pages to be scanned, and many of them could come from outside our website. After the first indexing is complete we can create the full-text index on the table. From then on we can run full-text searches on that table. Any new insert into the database will be indexed on the fly internally by MySQL, so we do not have to worry about this [9]. The SQL syntax for creating the index is:

ALTER TABLE `content` ADD FULLTEXT (`url`, `title`, `content`, `headers`, `strong`)

Searching the database is not very hard: we only use a query to apply a search on the database and let MySQL do the rest. It is a lot harder to determine the best values for the weights of the results. Depending on those constants we will get different search results, as we show in the following examples. To query the database we use a query that is essentially composed of three parts: the first part determines the relevancy of each column individually, the second determines the matching rows and the last one orders the results. You can see the query in Fig. 3.
Figure 3: search query

relurl, reltitle, relcontent, relheaders and relstrong are the different relevancies returned by MySQL after the preliminary search of each individual column. The parameters u, t, c, h and s represent the importance of those individual relevancies. The query executes like this: first, the different relevancies are returned from the search on the individual columns. Those relevancies are later used to calculate the order of the results. After we have the relevancies we move on to the retrieval of all the rows that contain the searched terms. To do this we select all the matches inside the text fields (`content`, `url`, `title`, `headers`, `strong`). Once we have those results we reorder them by relevance, determining the ORDER BY argument with the following formula:

relevance = relurl*u + reltitle*t + relcontent*c + relheaders*h + relstrong*s

Figure 4: relevance formula

The relevance of every column should be carefully selected in order to achieve the best result order. The url relevance is not so important in our opinion; we added it because all the big search engines have it, and we treat it as if it had the same relevance as the page title. The next column on the relevance scale, after the url and the title, is the headers relevance. Consider a typical search: we want to find an article about "mysql fulltext". The script will first determine the pages that contain the keywords inside the title and url of the page, because those are the most suitable for us. Next in importance is the headers column. The last two columns are the content column and the strong column. The strong column contains keywords that the writer of the page considered to be of higher importance, with strong relevance to the subject. The order of the relevancies is clear.
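Since the query of Fig. 3 is only shown as a figure, the sketch below reconstructs its general three-part shape from the description above; the concrete SQL is our illustration, not the authors' verbatim query (note that in MySQL the per-column MATCH clauses each require their own FULLTEXT index, in addition to the combined one used for row matching):

```python
def build_search_query(u, t, c, h, s):
    """Assemble the three-part full-text query: per-column relevancies,
    row matching, and ordering by the relevance formula of Fig. 4."""
    order_by = (f"relurl*{u} + reltitle*{t} + relcontent*{c}"
                f" + relheaders*{h} + relstrong*{s}")
    return (
        "SELECT id, url, title, "
        "MATCH (url) AGAINST (%s) AS relurl, "
        "MATCH (title) AGAINST (%s) AS reltitle, "
        "MATCH (content) AGAINST (%s) AS relcontent, "
        "MATCH (headers) AGAINST (%s) AS relheaders, "
        "MATCH (strong) AGAINST (%s) AS relstrong "
        "FROM content "
        "WHERE MATCH (url, title, content, headers, strong) AGAINST (%s) "
        f"ORDER BY {order_by} DESC"
    )

query = build_search_query(u=1.14, t=1.14, c=1, h=1.3, s=1.2)
```

The same search term is bound to all six placeholders; only the ORDER BY clause changes when the weight parameters are tuned.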
The plain content relevance is the lowest, followed by the keywords relevance (strong), then headers, then title, which is about the same as the url relevance. The problems appear when we have to determine the best magnitude for those relevance parameters. If the difference between the relevance parameters is too small, we will not get effective ordering of the search results. But if we go too far the other way, we impose excessive ordering based only on the hierarchy we determined for the parameters and less on the relevancies of the results.

A schematisation of the proposed indexing and search algorithm is provided below:

a) Indexing
1. The web page is processed by reading its contents.
2. The internal links from the page are determined.
3. The unique internal links and the full page contents are inserted into a 'pages' table.
4. The next link to follow is taken from the 'pages' table and the new corresponding page is processed by going to step 1, until there are no more unprocessed pages.
5. The 'pages' table is processed to determine the page title, headings and strong text, and the results are placed in a new 'contents' table.

b) Full-text search
1. Query the 'contents' table against a search term.
2. Determine the relevancy of each column individually.
3. Determine the matching rows.
4. Order the results by relevance, calculated using the formula in Fig. 4. The parameters u, t, c, h and s are chosen by us and indicate the importance of each of the relevancies (url, title, content, headers and strong text).

4 Testing

To test the algorithm, we set up a database of 320 MB (including data and index) with a total of 11,670 records. The database was built by indexing a subset of the wikipedia.com website, using the indexing algorithm described in the previous section. For testing purposes we chose the following values for the u, t, c, h and s parameters; the user can set his own values.
A higher value means a higher importance of that element: u=1.14, t=1.14, c=1, h=1.3, s=1.2. The testing process consisted of asking different users to rate each of the search results; we performed a total of 143 tests. The ratings given by the users showed that our algorithm is more relevant than the default MySQL method by 19.47%. Here are some examples of tests done with specific keywords:

Keyword: "programming". Results: our algorithm (the left panel) returned relevant results in positions 1 and 2 (an article titled "Computer programming" and another titled "Application programming interface"). This was because our algorithm assigned a higher importance to the page title and url (1.14) than to the page contents (1). The right panel did not return many relevant results (a result close to our expectations was in position 2, but it was not very specific, because it was about one particular programming language and not about programming languages in general).

Keyword: "universal serial bus". Results: our algorithm (the left panel) returned a relevant result in position 1 (an article titled "Universal Serial Bus"). The right panel did not return any relevant results (instead it returned pages that contain only "universal" and not the whole search term).

5 Search suggestions

In many cases it is necessary to correct the spelling mistakes users make when they do not know exactly how to spell something or simply mistype. To do this, we have to offer them the closest correct form that best matches the initial search. Metaphone is a phonetic algorithm, published in 1990, for indexing words by their English pronunciation. The algorithm produces variable-length keys as its output, as opposed to Soundex's fixed-length keys; similar-sounding words share the same key. Metaphone was developed by Lawrence Philips as a response to deficiencies in the Soundex algorithm. It is more accurate than Soundex because it uses a larger set of rules for English pronunciation. Metaphone is available as a built-in operator in a number of systems, including later versions of PHP. The original author later produced a new version of the algorithm, named Double Metaphone, that produces more accurate results than the original. However, we use the first version because it returns accurate enough keys with much better performance than the second one [4]. In order to provide fast enough processing for the suggestions, we create a "bag of words" containing all the distinct words inside the content column; to each of them we attach its metaphone key. When a search is done we select the candidates by calculating the metaphone of every word inside the search string and selecting the matches inside the metaphone table. After that operation we will have about 3 to 8 possible words to choose from. The Levenshtein distance is defined as the minimal number of characters you have to replace, insert or delete to transform str1 into str2. The complexity of the algorithm is O(m*n), where m and n are the lengths of str1 and str2 (rather good compared to similar_text(), which is O(max(m, n)**3), but still expensive). In its simplest form the function takes only the two strings as parameters and calculates the number of insert, replace and delete operations needed to transform str1 into str2 [5]. We use this algorithm to determine the closest form from the list of candidates by selecting the candidate with the smallest Levenshtein distance to the searched keyword. Note that this algorithm only works for word spelling mistakes; it does not work for mistakes like a missing space or composed words.
Missing space example: "mysqlfultext" will not suggest "mysql fultext". Composed words example: "pine apple" will not suggest "pineapple". Although some of the suggestion results can be hidden from the users because of the use of metaphone equivalence, this approach gives an impressive boost in performance. With a suggestion table containing over 130,000 rows it would take far too long to determine the Levenshtein distance between the searched form and every correct form, given that, as we said before, the Levenshtein algorithm has a complexity of O(m*n); in our case we would have a complexity of O(m*n)*nrrows, where nrrows exceeds 130,000. Also, this direct approach would not solve the problems illustrated above. By applying our algorithm we can reduce the candidates from 130,000 to approximately 4-7 words. This represents a significant performance improvement with only small disadvantages. Furthermore, those disadvantages could be counteracted by completing the suggestion table with the newly searched term when the suggestion we provided is wrong and there is sufficient similarity between the previous and the actual search, similarity which can be determined by dividing the Levenshtein distance by the string length.

A schematisation of the proposed search suggestions algorithm is provided below:
1. Create a table containing all the words in the contents column and their attached metaphone keys.
2. The metaphone of the searched keyword is determined using the metaphone function [4].
3. The Levenshtein distance is calculated as in [5].
4. The word with the smallest Levenshtein distance is chosen.

6 Comparison with other similar technologies

In comparison with other similar technologies we have strong points and weak points at the same time.
It is impossible to have everything at the same time; there has to be a compromise between cost, speed, maintenance and quality of the results. Compared with MySQL's basic full-text search, we have the advantage of a better ordered result set. However, this comes at the price of an increased search time, because of the extra computation needed for the relevance ordering according to Fig. 4. Both methods have the advantage of being free, with easy maintenance (the update of the full-text index is done on the fly), and they do not require special permissions on the server. If we compare our solution with the free open-source SQL full-text search engine Sphinx, we can mention other differences. What the two solutions have in common is that they are both free. The strong points of our solution are a better ordered result set and easier maintainability. The strong point of Sphinx is its search speed, which makes it suitable for searching bigger databases. A weak point of Sphinx is maintainability: its full-text index cannot be updated on the fly, so it has to be recomputed periodically, a task that can take from 15 minutes for small databases to a few hours for large ones.

7 Conclusions

By using our implementation it is possible to offer a complete and powerful search option. In the future we would like to improve the search speed by using a dedicated full-text search engine. Another improvement we would like to implement is a new parameter expressing the popularity of a result, based on determining whether a user found what he wanted or not. This would use a combination of the time the user spent on a certain page divided by the content length, how many times the user returned to the page, and whether or not the page is a stop page (which can indicate that the user found what he wanted to know).
Also, in addition to search, we would like to implement high-speed insert and delete, allowing full-text search to be used in the same way as other types of database search, in which data can be searched right after it is inserted [7]. Another idea for future improvement is handling scientific document searches, especially mathematical text and mathematical operations [8]. For the suggestion tool we want to improve the algorithm so that it can correct as many mistakes as possible.

Bibliography

[1] Fine-tuning MySQL full-text search, http://dev.mysql.com/doc/refman/5.0/en/fulltext-fine-tuning.html
[2] Full text search, Wikipedia, http://en.wikipedia.org/wiki/full_text_search
[3] MySQL's full-text formulas, Database Journal, http://www.databasejournal.com/features/mysql/article.php/3512461/mysqls-full-text-formulas.htm
[4] Metaphone, Wikipedia, http://en.wikipedia.org/wiki/metaphone
[5] The Levenshtein distance, http://us2.php.net/levenshtein
[6] Atlam, E.-S., Ghada, E.-M., Fuketa, M., Morita, K., Aoe, J., A compact memory space of dynamic full-text search using bi-gram index, Computers and Communications, Proceedings ISCC 2004, Ninth International Symposium, 2004.
[7] Ikeda, T., Mano, H., Itoh, H., Takegawa, H., Hiraoka, T., Horibe, S., Ogawa, Y., "TRMeister: a DBMS with high-performance full-text search functions", Data Engineering, Proceedings ICDE 2005, 21st International Conference, 2005.
[8] Misutka, J., Galambos, L., "Mathematical extension of full text search engine indexer", Information and Communication Technologies: From Theory to Applications, ICTTA 2008, 3rd International Conference, 2008.
[9] D. Zmaranda, G. Gabor, Issues on optimality criteria applied in real-time scheduling, International Journal of Computers Communications & Control, ISSN 1841-9836, Suppl. S, 3(S):536-540, 2008.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 2, pp.
161-171

Antenna Arrays Principle and Solutions: Robust Control Approach

Florin Hutu, Sebastien Cauet, Patrick Coirault

Abstract: This paper treats solutions for driving antenna arrays with a chain of non-identical oscillators. Frequency approaches have been studied in order to solve the problem of synchronization of the oscillators. In this article, however, a new structure of a chain of oscillators is introduced. Lyapunov stability theory is then used to design a dynamical controller guaranteeing the synchronization of the oscillators. The problem of synchronization is transformed into a problem of asymptotic stabilization for a nonlinear system. It is formulated as a system of linear matrix inequalities, where the parameter variations of the two oscillators and their differences are modeled by polytopic matrices. The theoretical result is successfully applied to an array of transistor-based oscillators used in "smart antenna" systems.

Keywords: nonlinear systems, control applications, antenna arrays

1 Introduction

The demand for mobile communication services is growing continuously, and it is estimated that this rate will be maintained in the coming years. This continuous development has stimulated research into new hardware and software solutions for increasing the volume of exchanged data and better managing the emitted or received electromagnetic field. Smart antenna arrays comprise a number of antennas that work in conjunction with an intelligent system that processes the received and transmitted data. The processing can be realized in hardware or in software and allows smart antenna arrays to focus beams in particular directions. This problem can be partially solved by using several directional antennas, a solution that divides the 360-degree coverage area into sectors. However, smart antenna arrays provide a much more effective solution by focusing the transmitted power toward the user and only looking in the direction of the user for the uplink signal. This ensures that the user receives the optimum quality of service and the maximum coverage from a base station. New technology developments increase antenna array performance and minimize the cost of production and the occupied space, making antenna arrays implementable in domains like wireless or satellite communications, radar systems, missile defense systems, the automobile industry, etc. Smart antennas, or antenna arrays, are a part of communication systems that can improve their global performance. This technique can increase the spectral efficiency and reduce multipath fading, the bit error rate (BER), the co-channel interference (CCI) and the system complexity [1]. This is possible by electronically adjusting the beam pattern of the antenna array in order to provide an important gain for the desired signals and a small gain for interference signals. In emission, the purpose of smart antennas is to minimize the interference between the different transmitters that work on the same communication channel, and thus to use the emitted power more efficiently. For this reason, the beam shape must be controlled in order to minimize the amplitude of the side lobes and to maximize the energy in the main lobe. Moreover, the direction of the main lobe must be controlled. The targeted application is inter-vehicle communication. The bandwidth will be, first of all, in the area of 24 GHz and finally around 79 GHz. At these frequencies, techniques like "software-defined radio" cannot be used. One of the main objectives is to develop a structure which will extend, in both space and time, the safety information available to drivers, by using the infrastructure and vehicles as sources.
Copyright © 2006-2008 by CCC Publications

When smart antennas are used in reception systems, the signals coming from interference directions must be rejected and those coming from the desired directions must be privileged. For this reason, different phases and amplitudes must be assured for the locally generated carrier signals [2, 3, 4]. The work done in the field of the dynamics of coupled nonlinear systems using the frequency approach [5, 6, 7, 8] shows that such systems offer methods of phase control among array elements and beam-scanning capabilities, but they also imply stabilization problems. The proposed structure of the antenna array is based on unidirectionally coupled oscillators. In detail, this paper treats the synchronization of a system made of two oscillators with a unidirectional coupling, and this synchronization problem is transformed into a stabilization problem for a nonlinear system. The chosen strategy is to find an output-feedback dynamic controller, using Lyapunov functions, that assures robust synchronization despite parameter variations of the oscillators. The problem of computing dynamic output feedbacks for LTI (linear time-invariant) systems in terms of matrix inequalities is difficult to solve. There are two known techniques: iterative algorithms and the cancellation of variable products by using the matrix separation lemma. The reader can find papers treating this problem by LMIs (linear matrix inequalities), which can be numerically solved [9, 10, 11, 12, 13]. The variations of the parameters of the oscillators are taken into account by considering the state matrix as a polytopic one. The polytopic structure is easily tractable by linear matrix inequalities. Once the stability of the vertices defined for the polytope is proved, the stability and the synchronization of the two oscillators are assured for all systems inside the polytope.
The nonlinear character of the oscillators allows the synchronization (if their free-running frequencies are in a certain domain [6]) but also makes them dependent on initial conditions. The main objective is to cancel the nonlinear effect and to maintain the synchronization when the physical parameters of the oscillators and the external conditions are modified. The originality of the method comes from the inclusion of the nonlinear term and the undesired variations in a perturbation. The problem is transformed into an $H_\infty$ optimization. Section 2 presents an overview of antenna array theory. In Section 3 a model of the master and slave oscillators and the controller synthesis are introduced. Section 4 presents the numerical results in the case of an array of coupled nonlinear oscillators.

2 Antenna array theory overview

2.1 Theoretical background

A smart antenna is composed of an array of individual radiative elements (elementary antennas) placed in a particular configuration (linear, circular or matrix). By gathering these elementary antennas in such arrays and by changing the characteristics of the signals associated with each element, the array can present different gains according to the direction. Let us consider a uniform linear array of n identical patch antennas placed at the same distance d from each other, as in Fig. 1. For the theoretical study of this configuration, it is assumed that harmonic signals of the same frequency but different amplitudes and phases are injected into the elementary antennas. The mathematical expression of the total electromagnetic field generated by the array in far-field regions (Fraunhofer regions) can be written as the product of the electromagnetic field of the reference antenna (which is considered to be the first antenna in the array) and a term which depends on the amplitudes and phases of the injected signals:

$$E_{total} = E_{ref} \cdot F(\theta).$$
(1)

Figure 1: A uniform antenna array

Thereafter, the gain of the antenna array will be considered, knowing that it is a normalization of the amplitude of the electromagnetic field. The array factor can be written as

$$F(\theta) = \sum_{m=1}^{n} w_m \, e^{-j(m-1)k_0 d \cos\theta}, \tag{2}$$

where $w_m = a_m e^{j\varphi_m}$. In the easiest emission case, where the amplitudes all have the same value, the mathematical expression of the radiation pattern is

$$F(\theta) = \frac{1}{n}\,\frac{\sin(n\gamma/2)}{\sin(\gamma/2)}, \tag{3}$$

where $\gamma = \varphi - k_0 d \cos\theta$, n is the number of antennas and d the distance between them. It can be seen that a quantity of the radiated energy is lost in the side lobes, which implies a certain weakening of the antenna array gain. It is obvious that in the emission case, the variations of both the amplitudes and the phases of the carrier signals span a large interval. This paper proposes a new technique to generate such signals, which have the same frequency but different phases and amplitudes.

3 Problem formulation

3.1 Problem statement

While the amplitude variation can easily be handled by using variable-gain amplifiers, the problem of the phase variation is more constraining. There are several techniques which permit solving this problem. The main technical problem with beamforming for transmission is realizing the phase and amplitude of the signals in each antenna channel. The first step is generating the reference signal with the selected or required frequency. It has to be modulated with the information to be transmitted (e.g. radar, communication). Then it has to be amplified and distributed to all transmit channels. Then the individual signals have to be weighted for beamforming, that is, amplified or attenuated according to the desired weighting amplitude.
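The array factor (2) and, for uniform excitation, the closed form (3) can be checked numerically; a minimal sketch (antenna count, spacing and weights are illustrative values, not taken from the paper):

```python
import cmath, math

def array_factor(theta, weights, k0, d):
    """F(theta) = sum_m w_m * exp(-j*(m-1)*k0*d*cos(theta)), eq. (2)."""
    return sum(w * cmath.exp(-1j * m * k0 * d * math.cos(theta))
               for m, w in enumerate(weights))  # enumerate gives (m-1) = 0, 1, ...

# Uniform excitation: n = 8 antennas, half-wavelength spacing (k0*d = pi).
n, k0, d = 8, math.pi, 1.0
af = array_factor(1.0, [1.0] * n, k0, d)

# Compare |F|/n against the closed form (3) with phi = 0.
gamma = -k0 * d * math.cos(1.0)
closed_form = abs(math.sin(n * gamma / 2) / (n * math.sin(gamma / 2)))
print(abs(abs(af) / n - closed_form) < 1e-12)
```

At broadside ($\theta = \pi/2$) all terms add in phase and $|F|$ reaches its maximum n, which is why the normalized pattern (3) peaks at 1.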
The desired phase has to be realized by a suitable steerable phase shifter or delay line. Digital devices are now available under the headline "software-defined radio", but this technique is not usable at frequencies over 400 MHz. The techniques pointed out here are designed for applications over 2 GHz. They can be divided into two main approaches: the first uses the signal generated by one oscillator, and the second uses the signals generated by an array of coupled oscillators. Signals with the same frequency but different phases and amplitudes can be built by delaying the signal generated by one master oscillator using high-frequency power dividers and variable delay lines or Butler couplers. This approach is very useful when discrete-time systems are built. Another approach is to use polyphase oscillators and a multiplexing system [5]. With these techniques, however, continuous phase variations cannot be obtained. The second approach is based on the synchronization of arrays of oscillators whose free-running frequencies have a weak dispersion. In [6, 8] it was demonstrated that arrays of coupled nonlinear oscillators can synchronize. Moreover, according to the coupling strength and to the free-running frequencies, phase variations can be produced. Recent work [14] shows how the phase variation can be guaranteed by changing only the free-running frequencies of the coupled oscillators in the array. In order to generate these carrier signals, the general schematic of Fig. 2 is proposed.

Figure 2: Unidirectional coupling of a chain of oscillators

This schematic is a variation of York's approach shown in [15]. Because of the unidirectional coupling, each slave oscillator is driven only by its left neighbor.
Hence, the study of this configuration is reduced to the study of a pair of two nonlinear systems (master-slave synchronization). The purpose is to design the parameters of the controller system $\Sigma_c$ in order to make the output $y_e(t)$ tend toward zero. When this objective is fulfilled, the delayed output of the master oscillator and the output of the slave oscillator become identical, so the two oscillators are synchronized. Generally, because of the technological realization, the oscillators do not have the same free-running frequencies. This is the reason why the feedback loop was introduced: to guarantee the robust synchronization of the two oscillators. The delay element and the variable-gain amplifier will guarantee different phases and amplitudes for the output signals. The difference between the oscillators is modeled as a variation of the slave oscillator parameters around those of the master oscillator, which is considered the reference. The variations due to temperature or to the ageing of the components are modeled by a polytopic uncertainty of the master oscillator parameters around their nominal values. The oscillators are built using a double differential pair structure. In order to determine the parameters of the dynamical controller, the nonlinear oscillator is modeled as a van der Pol oscillator. In this article, we consider that the system is a perturbed van der Pol model:

$$\begin{cases} \dot{x} = A(\theta_1)x + g(x,t,\theta_1) + Bu \\ y = Cx \end{cases} \tag{4}$$

where

$$x = \begin{bmatrix} i_L \\ v_0 \end{bmatrix},\quad A(\theta_1) = \begin{bmatrix} 0 & \dfrac{1}{L_0} \\ -\dfrac{1}{C_0} & 0 \end{bmatrix},\quad u = \frac{i_{inj}}{C_0},\quad g(x,t,\theta_1) = \begin{bmatrix} 0 \\ \dfrac{\alpha}{C_0}x_2 - \dfrac{\beta}{C_0}x_2^3 \end{bmatrix},\quad B = \begin{bmatrix} 0 \\ 1 \end{bmatrix},\quad C = \begin{bmatrix} 0 & 1 \end{bmatrix}, \tag{5}$$

with the uncertain parameters $\theta_1 = [\alpha, L_0, C_0]$.

3.2 Master-slave synchronization

The structure is made up of two different systems which belong to the class previously described.
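The perturbed van der Pol model (4)-(5) can be simulated directly. Below is a minimal sketch with normalized component values ($L_0 = C_0 = \alpha = \beta = 1$ and $u = 0$ are illustrative choices, not the paper's transistor parameters): starting from a small perturbation, the output voltage grows onto a stable limit cycle, which is the free-running oscillation the controller later synchronizes.

```python
def f(state, l0=1.0, c0=1.0, alpha=1.0, beta=1.0, u=0.0):
    """Right-hand side of eq. (4) with x = [iL, v0]:
    diL/dt = v0/L0, dv0/dt = -iL/C0 + (alpha*v0 - beta*v0**3)/C0 + u."""
    il, v0 = state
    return [v0 / l0, -il / c0 + (alpha * v0 - beta * v0**3) / c0 + u]

def rk4_step(state, dt):
    """One fourth-order Runge-Kutta integration step."""
    k1 = f(state)
    k2 = f([s + dt / 2 * k for s, k in zip(state, k1)])
    k3 = f([s + dt / 2 * k for s, k in zip(state, k2)])
    k4 = f([s + dt * k for s, k in zip(state, k3)])
    return [s + dt / 6 * (a + 2 * b + 2 * c + d)
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)]

# Integrate from a small perturbation; v0 settles onto the limit cycle.
state, dt, trace = [0.0, 0.05], 0.01, []
for _ in range(20000):
    state = rk4_step(state, dt)
    trace.append(state[1])
amplitude = max(abs(v) for v in trace[-5000:])
```

With these normalized values the steady amplitude is about $2\sqrt{\alpha/(3\beta)} \approx 1.15$, the classical van der Pol limit-cycle estimate.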
The master system is considered autonomous (u = 0) and the dynamical controller $\Sigma_c$ drives the slave system using the error signal as reference. The error signal is the difference between a delayed version of the master output and the slave output. The state-space representation of the master system can be written as

$$\Sigma_m: \begin{cases} \dot{x}_m = A_m(\theta_1)x_m + g_m(x_m,t,\theta_1) \\ y_m = Cx_m \end{cases} \tag{6}$$

For the slave system, the state-space representation can be written as

$$\Sigma_s: \begin{cases} \dot{x}_s = A_s(\theta_1)x_s + g_s(x_s,t,\theta_1) + B_2 u \\ y_s = Cx_s \end{cases} \tag{7}$$

Between the parameters of the master and the slave oscillators, it is considered that there is the same relative difference $\delta$:

$$L_s = L_m(1+\delta), \quad C_s = C_m(1+\delta), \quad \alpha_s = \alpha_m(1+\delta), \quad \beta_s = \beta_m(1+\delta). \tag{8}$$

In that case, the following notation can be introduced:

$$A_s(\theta_1) = A_m(\theta_1) + B_1(\theta_2). \tag{9}$$

The parameter difference is thus transformed into a difference between the state matrices of the master and the slave. With the assumptions in (8), $B_1(\theta_2)$ can be written as

$$B_1(\theta_2) = \begin{bmatrix} 0 & -\dfrac{\delta}{L_m(1+\delta)} \\ \dfrac{\delta}{C_m(1+\delta)} & 0 \end{bmatrix}. \tag{10}$$

If an error state is defined as

$$e(t) = x_m(t-\tau) - x_s(t), \tag{11}$$

a state-space representation can be written:

$$\Sigma_e: \begin{cases} \dot{e} = A_m(\theta_1)e - B_1(\theta_2)x_s + e_g(x_m,x_s,t,\theta_1) - B_2 u \\ y_e = Ce \end{cases} \tag{12}$$

where

$$e_g(x_m,x_s,t,\theta_1) = g_m(x_m,t,\theta_1) - g_s(x_s,t,\theta_1). \tag{13}$$

3.3 Nonlinear bound determination

In order to determine the bounds of the difference of the nonlinearities, the scalar function $f: D_1 \mapsto D_2$, $f(x) = -\alpha x + \beta x^3$, is used. The bounds can be taken as the slopes of the tangents of $f(x)$ passing through $x = x_M$ and $x = 0$:

$$-\alpha(x_2 - x_1) \le f(x_2) - f(x_1) \le (-\alpha + 3\beta x_M^2)(x_2 - x_1) \quad \forall x_1, x_2 \in D_1. \tag{14}$$

Considering the two nonlinear oscillators and the domain $D_1 = [-1.35\,\mathrm{V}, 1.35\,\mathrm{V}]$, the bound on the difference of nonlinearities (13) can be written as

$$\begin{bmatrix} 0 \\ \dfrac{1}{C_0}\left(\alpha - 3\beta\,(1.35)^2\right) \end{bmatrix} \le e_g(x_m,x_s,t,\theta) \le \begin{bmatrix} 0 \\ \dfrac{\alpha}{C_0} \end{bmatrix}. \tag{15}$$

3.4 Controller synthesis

Assume that $A_m(\theta_1)$, resp.
$B_1(\theta_2)$, are two matrices that belong to a polytope of matrices, each represented by a convex combination of the extreme matrices $A_i$, resp. $B_{1i}$, with $i = 1 \dots 2^m$:

$$\mathcal{A}_m = \left\{ A_m(\theta_1) \ \middle|\ A_m(\theta_1) = \sum_{i=1}^{2^m} \xi_i A_i;\ \xi_i \in \Delta_1 \right\} \tag{16}$$

and consider that the matrix $e_g(x_m,x_s,t,\theta_1)$ can be bounded, with $N_b(\theta_1)$ its upper bound:

$$e_g(x_m,x_s,t,\theta_1) \le N_b(\theta_1)\,e. \tag{17}$$

The worst case for our system is the superior limit; we therefore use the matrix $A_N(\theta) = A_m(\theta_1) + N_b(\theta_1)$. Assume that the dynamical output controller of the system (12) is described by the following state-space representation, of dimension $n_c$:

$$\Sigma_c: \begin{cases} \dot{x}_c = A_c x_c + B_c y_e \\ u = C_c x_c + D_c y_e \end{cases} \tag{18}$$

The purpose of this controller is to make the slave system follow the delayed output of the master system. This condition is fulfilled when the error signal defined in (11) tends toward zero. The term $B_1(\theta_2)x_s$, representing the difference between the two systems, acts as a perturbation on the error state e. The synthesis of this controller has been made with a technique similar to [9, 10]. The following theorem solves the problem of products of variable matrices in the synthesis problem by introducing extra unknown variable matrices.

Theorem 1. If there exist a set of matrices $P_i > 0$, a state-feedback controller $K_0$, an unknown variable square and nonsingular matrix $G \in \mathbb{R}^{n_u+n_c}$, an unknown variable matrix $H \in \mathbb{R}^{(n_u+n_c)\times(n_u+n_c)}$ and four unknown variable matrices $F_1, F_4 \in \mathbb{R}^{(n_x+n_c)\times(n_x+n_c)}$, $F_2 \in \mathbb{R}^{n_x\times(n_x+n_c)}$ and $F_3 \in \mathbb{R}^{(n_\infty+n_c)\times(n_x+n_c)}$ such that inequality (19) is verified, then the dynamical controller $K = G^{-1}L$ makes the error system (12) asymptotically stable for all matrices $A_N(\theta_1)$ and $B_1(\theta_2)$ described as a polytope.
Φ2 + sym¹{ col(F1, F2, F3, F4, O) · [O O O O B̃2] } + sym{ col(O, O, O, O, I) · L · [C̃ O O O O] } + sym{ col(O, O, O, O, I) · G · [−K0 O O O −I] } < O.   (19)

Antenna arrays principle and solutions: robust control approach 167

The matrix Φ2 is defined as follows:

Φ2 = [ O     O    C̃cl^T  Pi  O
       O    −γI   O       O   O
       C̃cl  O   −γI      O   O
       Pi    O    O       O   O
       O     O    O       O   O ] + sym{ col(F1, F2, F3, F4, O) · [Ã0i  B̃1j  O  −I  O] },  ∀i ∈ {1 ... 2^m}, ∀j ∈ {1 ... 2^p},   (20)

where

Ã0i = ÃNi + B̃ K0  and  K = [ Dc  Cc
                              Bc  Ac ].   (21)

The inequality (19) can be solved numerically using MATLAB's LMI Toolbox.

4 Numerical results

In order to check the theoretical result, a transistor-level simulation has been performed using Agilent's ADS software with MOSFET transistors in 0.35 µm silicon technology. All the parameters of AN(θ) were assumed to have ±5% variation around their nominal values. This variation can be seen as a temperature-dependent variation of oscillators built on the same integrated-circuit substrate; mathematically, it translates into a variation of the state matrix AN(θ1) inside the polytope. Using MATLAB's LMI Toolbox applied to the 8 vertices of the polytope, the following output-feedback controller was found:

K = [ Dc  Cc
      Bc  Ac ] = [ 2.66588·10^11   22.3994
                   7.51499·10^9   −1.1885 ].   (22)

It ensures the synchronization of oscillators having a δ = ±5% difference between parameters. This difference is represented by the variation of the perturbation matrix B1(θ2) inside the polytope. The variation between Lm, Cm and Ls, Cs corresponds to a possible difference between the free-running frequencies of the two oscillators:

f0s ∈ [ f0m (1 − |δ|)^2,  f0m (1 + |δ|)^2 ].   (23)

The difference between αm, βm and αs, βs stands for a possible difference between the transistor operating points of the two nonlinear oscillators.
This controller was applied to the pair of non-linear oscillators. Their free-running frequencies are f0m = 2 GHz and f0s = 2.2 GHz; these frequencies were chosen in order to build a discrete-component platform. Figure 3 presents the output voltages of the master and slave oscillators; the simulation is divided into three sequences. In the first sequence, the controller Σc is not activated and each oscillator runs at its own free-running frequency. In the second, at t = 55 ns, the controller Σc is activated; the obtained delay is close to the imposed value (τ = T/4 = 1.25·10^−10 s). This delay corresponds to an orientation of the main lobe at θ = 120°. Finally, at t = 60 ns, in order to verify the robustness of the dynamical controller, the free-running frequency of the master oscillator was changed to fm = 2 GHz. Figure 4 presents the error between the two output signals: it can be seen that the error tends toward zero a short time after the controller is started.

¹ sym{X} = X^T + X, ∀X ∈ R^(n×n).

Figure 3: Output voltages of both oscillators
Figure 4: The error between both signals provided by the oscillators

4.1 Array of oscillators

Consider the situation where the direction θp = 60° must be privileged while θi1 = 90° and θi2 = 120° must be rejected. Table 1 shows the necessary and the obtained values of the amplitudes and phases for N = 8 antennas. Figure 5 depicts the corresponding radiation pattern: the orientation of the main lobe is close to the desired value (θp = 58.14°) and both interference directions θi1 = 90° and θi2 = 120° are rejected.

5 Conclusion

This paper presents a novel method to drive antenna arrays, based on unidirectionally coupled oscillators. An output-feedback controller has been designed to ensure synchronization, using advanced control theory with LMI (linear matrix inequality) tools.
The result was successfully extended to a chain of eight unidirectionally coupled oscillators. Additional research will be made to constrain the dynamical controller to realize the desired delay, in order to eliminate the delay element.

    amplitudes [V]           phases [°]
    necessary  simulated     necessary  simulated
    0.269      0.21            0          0
    0.21       0.2           143        138
    0.14       0.14          180        179
    0.4        0.4          -162        160
    0.08       0.08            0         -2
    0.14       0.13           63.1       57
    0          0               0          7
    0.3        0.29          -24.4      -30

Table 1: Computed and final values for the amplitudes and phases

Figure 5: The radiation pattern in the particular case of θp = 60°

Bibliography

[1] L. C. Godara (Ed.), Applications of Antenna Arrays to Mobile Communications, Part I: Performance Improvement, Feasibility and System Considerations, Proceedings of the IEEE, Vol. 85, 1997.
[2] S. Chandran, Adaptive Antenna Arrays: Trends and Applications, Springer-Verlag, 2004.
[3] S. J. Orfanidis, Electromagnetic Waves and Antennas (2004). http://www.ece.rutgers.edu/ orfanidi/ewa/
[4] R. C. Hansen, Phased Array Antennas, Wiley-Interscience, 2001.
[5] X. Guan, H. Hashemi, A. Hajimiri, A fully integrated 24-GHz eight-element phased-array receiver in silicon, IEEE Journal of Solid-State Circuits 39 (12) (2004) 2311–2320.
[6] P. Liao, R. A. York, A six-element beam scanning array, IEEE Microwave and Guided Wave Letters 4 (1) (1994) 20–22.
[7] A. Tombak, A. Mortazawi, A novel low-cost beam-steering technique based on the extended resonance power dividing method, IEEE Transactions on Microwave Theory and Techniques (2003) 1–7.
[8] R. A. York, T. Itoh, Injection- and phase-locking techniques for beam control, IEEE Transactions on Microwave Theory and Techniques 46 (11) (1998) 1920–1929.
[9] D. Arzellier, D. Peaucelle, S. Salhi, Robust static output feedback stabilization for polytopic uncertain systems: improving the guaranteed performance bound, in: ROCOND, Milan, Italy, 2003.
[10] D.
Mehdi, E. Boukas, O. Bachelier, Static output feedback design for uncertain linear discrete-time systems, IMA Journal of Mathematical Control and Information.
[11] T. Iwasaki, R. E. Skelton, K. Grigoriadis, A Unified Algebraic Approach to Linear Control Design, Taylor and Francis, 1998.
[12] D. Peaucelle, D. Arzellier, An efficient numerical solution for H2 static output feedback synthesis, European Control Conference.
[13] S. Boyd, L. El Ghaoui, E. Feron, V. Balakrishnan, Linear Matrix Inequalities in System and Control Theory, Studies in Applied Mathematics, Vol. 15, USA, 1994.
[14] T. Heath, Simultaneous beam steering and null formation with coupled, nonlinear oscillator arrays, IEEE Transactions on Antennas and Propagation 53 (6) (2005) 2031–2035.
[15] R. A. York, Nonlinear analysis of phase relationships in quasi-optical oscillator arrays, IEEE Transactions on Microwave Theory and Techniques 41 (10) (1993) 1799–1809.

Florin Hutu
University of Poitiers
Department: LAII-ESIP
40 Avenue du Recteur Pineau
E-mail: florin.hutu@etu.univ-poitiers.fr

Received: November 28, 2007

Florin Hutu was born in Romania in 1977 and received the Ph.D. degree in automatic control in 2007. He is currently a postdoc at the University of Poitiers, where his interests involve the design of microwave antenna arrays.

Sébastien Cauet was born in France in 1971 and received the Ph.D. degree in automatic control in 1999. He is currently an associate professor of electrical and computer engineering at the University of Poitiers, where his interests involve the control of electrical power rotating motors, the design of microwave antenna arrays, and the application of control to chaotic transmission.

Patrick Coirault was born in Poitiers, France, in 1965. He received the Ph.D. degree from the University of Poitiers in 1991.
He was an assistant professor at the Institute of Technology of Poitiers from 1992 to 1997. He is now a professor at the Institute of Technology of Châtellerault. His current research interests are in nonlinear identification and control, with applications to chaotic systems.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 4, pp. 303-313

DOMINO: Trivalent Logic Semantics in Bivalent Syntax Clothes

Boldur E. Bărbat

Abstract: The paper describes a rather general software mechanism developed primarily for decision making in dynamic and uncertain environments (typical application: managing overbooking). DOMINO (Decision-Oriented Mechanism for "IF" as Non-deterministic Operator) is meant to deal with undecidability due to any kind of future contingents. Its description here is self-contained but, since a validation is underway within a much broader undertaking involving agent-oriented software, to avoid redundancy, several aspects explained in very recent papers are abridged here. In essence, DOMINO acts as an "IF" with enhanced semantics: it can answer "yes", "no" or "undecidable in the time span given" (in which case it renders control to an exception handler). Despite its trivalent logic semantics, it respects the rigours of structured programming and the syntax of bivalent logic (it is programmed in plain C++ to be applicable to legacy systems too). As for most novel approaches, expectations are high, involving a less algorithmic, less probabilistic, easier-to-understand method to treat undecidability in dynamic and uncertain environments, where postponing decisions means keeping alternatives open (to react better to rapid environment changes).

Keywords: undecidability; open, heterogeneous, dynamic and uncertain environments (OHDUE); decision-making; trivalent logic semantics; agent-oriented software engineering.
1 Introduction

Despite the major changes taking place in the Internet and globalization era, and their increasing speed due to the geometrically increasing computing power (Moore's law is expected to hold for at least another ten years), basic software mechanisms have advanced much too slowly. As regards uncertain knowledge processing, the hindrance is obvious: available software tools are either hardly affordable (because of high complexity, both cognitive and structural) or rather ineffective (designed for other environments, applied to ill-defined problems, or lacking expected functionality). For instance, the very concept of "uncertainty" has been treated inadequately, regardless of its growing relevance for application domains (mainly where real-time decision-making is involved [14]), environments (ever more dynamic and uncertain [19]), end-user expectations (requesting anthropocentric interfaces [4]), IT paradigms (predominantly "computing as interaction" [1]) and so on. The main weaknesses are: a) insufficient theoretical rigour (undecidability is considered primarily atemporal, keeping its initial mathematical meaning); b) poor practical effectiveness (confusing "unknown" with "unknowable" and applying sophisticated prediction methods in inappropriate contexts); c) an unfit apparatus (algorithmic approaches implying determinism, decidability, and bivalence). Since the first two issues are investigated in very recent papers [8] [6] [9] [7], to reduce redundancy and to keep the focus on the mechanisms themselves, the approach here is an explicitly software-engineering one: computer-science aspects are dealt with only at the beginning (why new mechanisms?) and at the end (what are the expectations?). In particular, the paper presents in detail DOMINO, a mechanism developed primarily for decision making in dynamic and uncertain environments (a typical potential application area comes from the overbooking policy of carrier companies).
The Decision-Oriented Mechanism for "IF" as Non-deterministic Operator (DOMINO) is meant to deal with undecidability due to any kind of future contingents. As a rather general software mechanism, its description here is self-contained. However, since a validation is underway within a much broader undertaking involving agent-oriented software (including non-algorithmic approaches to treat uncertainty), several aspects explained in the papers mentioned above are abridged or skipped over here.

Copyright © 2006-2007 by CCC Publications

As a result, the paper is structured as follows: Section 2 shows the rationale (i.e., premises and diagnosis) for developing new methods to handle uncertainty in decision support systems (DSS). Section 3 delimits the problem in both meanings: first it restricts it (narrowing the scope to undecidability due to future contingents) and then it defines it (as a software-engineering task). Section 4 outlines the architecture, basically in search of the third-value semantics. On this groundwork, Section 5 explains the structure, in search of bivalent syntax clothes. Since conclusions are prohibited for a bottom-up project before its validation, Section 6 lists the expectations, arranged in three time horizons (in fact, they are first conclusions).

2 Rationale: Premises and Diagnosis

Since in this section the paper is to some extent also a position paper, for the sake of simplicity, the premises, opinions, criteria, motives, and their corollaries are not separated into conceptual categories but asserted clustered around two "attractor words": premises and diagnosis. Thus, consensus is not mandatory to assess the research results. In other words, the utility of DOMINO can be evaluated even when some of the claims made here are rejected.
Most of the assertions below are valid in any modern IT context, but DSS are explicitly referred to because "IF" is the basic tool (again in both senses: essential and simple) for decision-making. Likewise, the emphasis is on the main changes affecting the essence of "IF". To preserve both text autonomy and shortness, for topics dealt with recently, details should be found in the paper the quotation comes from.

Premises. They reflect the general context, representing a very simplified input vector:

• Environment. "Present-day IT environments, except for some irrelevant applications, are open and heterogeneous (the resources involved are unalike and their availability is not warranted), dynamic (the pace of exogenous and endogenous changes is high) and uncertain (both information and its processing rules are revisable, fuzzy, uncertain and intrinsically non-deterministic)" [6].

• System. Except for trivial applications, the system exposed to decision-making is a "man-machine system [...], highly complex (multi-distributed mainly in space, but also in time and organization), under real-time control (the subsystems act on each other, at least communicate intensely), almost always online" [7].

• User expectations. "Since most decision makers are already familiar with Google, most available general information is either known or easily accessed; now they need an acceptably good answer, but very fast and with incomplete or even uncertain information" [7]. "Intelligent system behaviour, whatever that could mean, becomes a crucial user expectation" [8].

• Task. Most tasks aim at "managing situations"; the main attributes are: "high complexity, multicriterial, Pareto optimality, approximate, online, parallel, critical response time, high risk" [7].

• DSS architectonics. The software process (not package) is "vaguely specified, validated against end-user expectations" and has "two new fundamental design-space dimensions: time and uncertainty" [7].

• IT infrastructure and paradigms.
"The IT infrastructure is sufficiently advanced (in both facts and trends: nanoelectronics, broadband communication, semantic web, multimodal interfaces, etc.) to allow anthropocentrism for most IT applications" [8]. The leading paradigm is "computing as interaction" [1] [19]. Second-echelon paradigms (prevalent in modern artificial intelligence) are assessed through an "affordability filter" in [6]. As regards software engineering, the paradigm that is becoming dominant is agent-orientation.

Diagnosis. The main weaknesses of current IT systems are investigated in [7], where the focus is on conventional modelling. Since DSS weaknesses are very similar, they are stated here adapted and abridged from [7]: they stem from inappropriate conceptualising, based on rigid, algorithmic thinking (i.e., deterministic, almost sequential, "computational", atemporal processing), meant for decision making as "step by step solving of arising sub-problems", not for decision making as a "continuous process of dealing with unexpected, potentially risky, fast changing situations requesting immediate albeit not optimal response". Sectorial aspects are:

• Poorly reflected (or absent) temporal dimension: limited parallelism (if any), ineffective multithreading, poor reactivity (scarce interrupt handling impairing proper reaction to environment stimuli), no exception propagation, no dynamic priorities, no proper thread synchronization. Since the agent is a process, now acknowledged as such by a formal standard [13], its temporality cannot be disregarded anymore.

• Poor concurrent programming. Often such situations are handled by resource-wasting "active wait loops" or counterproductive "mutexes" instead of a simple wait (for an event) with timeout.

• Misunderstood uncertainty.
Even if the fact that accurate numeric data are hard to get is accepted, the emphasis is on approximated, predicted, rule-of-thumb-evaluated, or even intrinsically fuzzy data, rather than on missing data (lacking sensor information, delayed previous decisions, server crash, etc.).

• Distorted prediction. Bayesian inferences are considered unsuitable for decision-making because "even if decision-makers could get all [...] answers in due time, would they believe them strongly enough to make critical decisions only on their basis? Humans are not 'probabilistic beings' and are very prone to any sort of 'gambler's fallacy'" [9].

In short, the real-world decision-making challenge is: the situations are such that there is no time to solve (accurately) complex problems.

3 Delimiting the Problem

The multifaceted issue of handling uncertainty in DSS must first be restricted, to arrive at manageable complexity, and afterwards defined as a software-engineering task.

Restricting the scope. The target was narrowed to dealing with undecidability due to future contingents for five reasons:

• Unaffordable complexity. Uncertainty as an epistemic concept, together with its species and degrees, was investigated in [9] starting from the 28 definitions found on the web. Beside the overwhelming diversity of those definitions, ranging from "doubt" to "statistically defined discrepancy", the very meaning of uncertainty "depends on the professional background and on the task to carry out (better said, mostly on the time available to complete it)" [9]. Thus, "uncertain" means practically (mainly, subconsciously): for mathematicians, unknowable; for software developers, undependable; and for end users (decision-makers), undecidable. Recent related work attests the link between uncertainty and complexity: a terminology and typology of uncertainty "together with the role of uncertainty at different stages in the modelling processes.
Brief reviews have been made of 14 different (partly complementary) methods commonly used in uncertainty assessment and characterisation" is presented in [17]. A rare case where undecidability is discussed outside mathematics is a sociological context, in which the authors "suggest that as well as being able to consider organizational decision-making as an instance of (albeit bounded) rationality or calculability, it can also be regarded as a process of choice amongst heterogeneous possibilities" [12]. "In this context, could a common denominator be found for a general definition of uncertainty, at least acceptable to the three categories mentioned above? Uncertainty, in its widest sense, comprises any unsure link in the chain of steps necessary to fulfil a task" [9]. Much too complex to cope with.

• Avoidable complexity. On the other hand, of the 28 definitions "only a few are interesting, since they are anthropocentric, mirroring the common user (mainly decision-maker) stance: a) 'doubt' [...]; b) 'the fundamental inability to make a deterministic prognosis' [...]; c) 'lack of knowledge of future events'" [9].

• Intrinsic importance. "Real-world problems show that the most important and ill-treated kind of uncertainty is that due to future contingents: decisions are difficult to make because a relevant event has not happened yet, not because a result is imprecise. Moreover, its pragmatic corollary highlights a key aspect in decision-making: since any statement about a future event is undecidable, how to proceed in this case? Should it be predicted, circumvented, waited for?" [9].

• Time pressure. The geometrically increasing computing power due to Moore's law (mentioned in Section 1) promotes factors tending to reduce radically the role of algorithms and (bivalent) logic in IT [7]. Two reasons are already manifest: "since deterministic applications are vanishing (because of OHDUE), the conventional algorithm is not anymore the program backbone.
Even when still useful, the conventional algorithm is not anymore the main programming instrument (being hidden in procedures easily reached in a host of libraries, or being generated by 4GLs)" [8].

• Risky side effects. An algorithm is almost unable to feel time: no sense of future events, no such step as "warning: I don't know yet". Worse: often the algorithm cheats, confusing undecidability with negation.

Defining the task. "Since here the issue is to design a mechanism, not a particular application, the cardinal concern, from a clear-cut software engineering perspective, is about reducing complexity, both structural (to make the mechanism useful to legacy systems too) and cognitive (to motivate system designers as well as to increase user acceptance)" [9]. Software-engineering constraints imply: simple tools, with immediate applicability in current designs; no sophisticated concepts or instruments (such as agents, temporal logics, explicit uncertain-information processing, computability theory and computational complexity theory, Bayesian methods, certainty factors, etc.); bottom-up design and testing; as much as possible conventional development (prevalent algorithmic reasoning, usual API functions, straightforward implementation, downward compatibility, etc.). While most restrictions are comprehensible, albeit very tough, prohibiting both temporal logics and explicit uncertain-information processing seems counterproductive, since one of the premises claimed time and uncertainty as fundamental design-space dimensions. Unfortunately, even the most responsive and appropriate approaches ([14] included) are less applicable because they are sectorial (e.g., treating time without uncertainty or vice versa).
Other interesting temporal logics are too sophisticated: for instance, the "linear time" logic (used for specifying reactive systems) is based on the two modalities "since" and "until"; in [16], extensions of this logic are investigated from the perspective of undecidability for "X will happen within t units of time", showing that the extension is undecidable whenever t is irrational. Although aspects of a primitive temporal dimension could be implemented using common API functions (e.g., wait (for an event) with timeout, as well as the synchronization methods used in multithreading), no similar mechanism is available for uncertainty. Here lies the innovative core of the undertaking, with all its potential, openings and risks: the only mechanism on hand in common operating systems able to shift initiative from algorithm to environment is exception handling. Thus, the algorithm gives up a morsel of its proactiveness (deterministic par excellence) to gain a touch of reactivity; this "small amount of non-determinism" is mandatory to be able to mirror (in real time) its epistemic facet: uncertainty. (In DOMINO, when a conditional expression in IF is undecidable, an exception is raised and control is handed over to its handler.) The idea of assigning a new semantic value — that is, enabling a software entity (labelled or not as agent) to react promptly to asynchronous environment stimuli — to a mechanism meant to increase robustness was advanced (and defended in detail) in [3], after testing it in expressing emotions of pathematic agents. Later, using exceptions to achieve agent reactivity was suggested in the context of sub-symbolic inferences [8] and of emergence as leverage [9]: "even primeval animals move 'algorithmically' ('if gap then avoid, else go on') only a few steps, in very hostile environments. Moreover, reaction to stimuli cannot mean perpetual looking for the stimulus. The cardinal hindrance stems [...]
from the mechanisms employed: neither nature nor technology can afford, in the long run, mechanisms involving large amounts of testing, because they are too time-consuming tools". (Currently, exceptions are tested to help self-referencing agents show some primitive form of self-awareness [5]. However, in recent related work, exceptions are still regarded solely as a "mechanism for structuring error recovery in software systems so that errors can be more easily detected, signaled, and handled" [11]. One of the "fundamental motivations for employing exception handling in [...] robust applications" is to "improve the maintainability of the normal code and promote reuse of error handling code" [11]. In [10], the exception-handling mechanism of a state-of-the-art industrial embedded software system is analysed and the fault-proneness of the return-code idiom for dealing with exceptions is evaluated.) As a result, since a conditional expression in an ordinary IF now has a third exit variant (the exception), the IF itself acquires a trivalent logic semantics. Then why stay at the stage of "trivalent logic semantics" and not go further, towards "trivalent logic"? Because such logics are far from meeting the challenge, despite the huge effort to conceptualise, develop and implement them. A three-valued logic is a "logical structure which does not assume the law of the excluded middle. Three truth values are possible: true, false, or undecided. There are 3072 such logics" [18]. To avoid the 3073rd one¹, "for the sake of simplicity, the trivalent semantics should be grounded on a usual bivalent software infrastructure" [9]. Indeed, the proposed solution for DOMINO considers the revisited concept of undecidability [7] [9], filtered through the double sieve of relevance to usual decision-making (Section 4) and design simplicity as a vital software-engineering requirement (Section 5).
4 In Search of the Third Value Semantics

Since the key aspect in decision-making is to handle "don't know"-like uncertainty [7] (e.g., when a relevant component of an IF condition is undecidable because an expected event has not happened yet), it is appropriate to add a third value meaning something like "caveat: I don't know (yet)", "unknowable" or "unknown". In many-valued logics it "is general usage [...] to assume that there are two particular truth degrees, usually denoted by '0' and '1', respectively, which act like the traditional truth values 'falsum' and 'verum'" [15]. "Obviously, any decision-making needs those pillars of bivalence" [9]. Thus, the third value should be added to the two Boolean constants of IF. In short, the three outputs are: yes, no, undecidable, where it must still be decided what "undecidable" should really mean. The investigation starts with the first and most relevant three-valued logics and the respective motivations and meanings of their third truth value:

• Łukasiewicz. The first intention was to use the third value for "possible" (to model modalities), to deal with future contingents (first of all, the "paradox of the sea battle" posed by Diodorus Cronus). Ignoring the philosophical motivation and context (the ontological status of the future, is it determined or not, does free will actually exist, etc.), the third value "I" for "indeterminate", interpreted as "unknowable" or "problematical", is semantically very close to real-world decision-making problems. (On the contrary, the "1/2" notation, although elegant from a (meta)mathematical perspective, is totally unacceptable in software because, for both theoretical and practical reasons, interpreting it as an intermediate value of "half true and half false" is at best confusing and useless; moreover, it could lead to computational disaster.)

¹ At least, supposing that in this century no other trivalent logic has been concocted.

• Kleene.
The third value in this logic is "U", interpreted as "undefined", "undetermined" or "unknown", with two connotations: "permanently unknown" or "temporary lack of knowledge". The second meaning is helpful for postponing decisions and is actually used in database applications. For instance, "SQL implements ternary logic as a means of handling NULL field content. SQL uses NULL to represent missing data in a database. If a field contains no defined value, SQL assumes this means that an actual value exists, but that value is not currently recorded in the database. [...] Comparing anything to NULL, even another NULL, results in an unknown truth state. For example, the SQL expression "city = 'Paris'" resolves to false for a record with "Chicago" in the city field, but it resolves to unknown for a record with a NULL city field. In other words, to SQL, an undefined field represents potentially any possible value: a missing city might or might not represent Paris." (en.wikipedia.org/wiki/Ternary_logic). Moreover, the Kleene logic, as a natural sublogic of Belnap's four-valued logic [2], is an important framework for many paraconsistent logics, a major feature for applications in computer science.

• Bochvar. Inspired by the examination of semantic paradoxes, the third truth value "M" means "meaningless" or "paradoxical". This logic is useful when syntactic meaningfulness (rather than semantic meaningfulness) is looked for (crucial for program verification but dispensable for decision-making).

Hence, the semantics of the third value in a "three-output IF" should be based on a blend of a Łukasiewicz "I", interpreted as "unknowable" or "problematical"², and a Kleene "U", interpreted as "temporary lack of knowledge". Thus, the semantics of "undecidable" is refined to "undecidable in the time span given".
in fact, it gives a chance to the "yet" in "i don't know (yet)", postponing the verdict of "undecidable" as much as possible (depending heavily on both the problem to solve and the decision-making strategies applied). of course, to be effective, such a procrastination strategy must be put to work only in a distributed environment (reflected in software through multithreading); otherwise, the user would be frustrated by the frequent wait periods.

5 in search of bivalent syntax clothes

domino respects the rigours of structured programming and the syntax of bivalent logic, is programmed in plain c++ (to be applicable to legacy systems too), is based on a few functions of the win32 api, was tested only within the toy problems the examples are taken from (regarding real-time decisions in industrial control or in medical informatics), and is currently being tested in a real-world problem (a simplified version of implementing an overbooking policy). its flowchart is in figure 1 (the only symbol that is not among the common flowchart notation is the one for raising exceptions). the (windows 2000) api functions involved are:

a) bool setevent(handle hevent); /* hevent: event-object handle */
example: setevent(g_hevent_phlimit);

2 according to the ancient stoic perspective, updated by łukasiewicz.

[figure 1: domino: architecture expressed in bivalent logic — the flowchart tests evifcond, waits for it with a timeout, runs the usual then/else branches when the event is signalled in time, and otherwise reports "undecidable (in the timespan given)" with a warning or action via the exception handler.]

b) bool resetevent(handle hevent); /* likewise */

c) dword waitforsingleobject(handle hobject, dword dwtimeout);
/* hobject: handle of the event waited for; dwtimeout: maximal wait duration in ms (infinite: unlimited wait; 0: testing the event state) */
example from an alarm-bell program:
dwresult = waitforsingleobject(g_heventoverheating, infinite);

d) dword waitformultipleobjects(dword cobjects, const handle *lphobjects, bool fwaitall, dword dwtimeout);
/* cobjects: number of objects (maximum 64); lphobjects: handle table address; fwaitall: flag for conjunction/disjunction (true: all events are waited for; false: only the first event is waited for); dwtimeout: likewise */
example from a domotics program (for 5 boilers; uses multiple events):
dwresult = waitformultipleobjects(5, hboilerpressure, true, 4000);

remarks. 1. the exception handler can: a) give control to the human decision maker (after a warning); or b) act (for instance, propagating the exception dynamically from the callee to the caller). 2. time is not just explicitly present in "wait" but, more importantly, hidden in thread synchronization (setevent is employed in the thread providing the information needed to make a decision). 3. uncertainty is dealt with through the intrinsically uncertain interrupts generated by environment stimuli, expressed at the program level through the exception handler. 4. if dwtimeout = 0, the mechanism is a conventional "if" (however, a more robust one, since evaluation is preceded by a real-time validation test).

6 expectations

the three time horizons are roughly delimited by: a) validating domino in the overbooking application; b) designing and validating other (similar) mechanisms of the agorithm toolbox [9]; c) api functions for domino-like primitives. short range.
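as a platform-neutral sketch, the wait-with-timeout mechanism above can be emulated with python's threading.Event standing in for the win32 event object (setevent / resetevent / waitforsingleobject); the name domino_if and the timeouts are illustrative assumptions, not from the paper:

```python
# a sketch of a domino-like three-output "if", assuming python's threading.Event
# in place of win32 event objects; names and timeouts are illustrative.
import threading

def domino_if(cond, ev_if_cond, timeout_s):
    """wait until the event validating the condition is signalled (cf.
    waitforsingleobject with dwtimeout); if it never arrives, answer
    'undecidable' instead of blocking forever."""
    if ev_if_cond.wait(timeout_s):
        ev_if_cond.clear()               # cf. resetevent
        return 'then' if cond() else 'else'
    return 'undecidable'                 # in the time span given

# usage: a provider thread would call ev.set() (cf. setevent) once the data
# behind the condition is valid; here we signal it directly for brevity.
ev = threading.Event()
ev.set()
print(domino_if(lambda: 2 + 2 == 4, ev, timeout_s=0.1))   # 'then'
print(domino_if(lambda: True, ev, timeout_s=0.05))        # 'undecidable'
```

with dwtimeout-style timeout 0, the same function degenerates into a conventional "if" guarded by an event-state test, mirroring remark 4.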
a) domino addresses the main problem of the current "if" in applications running in ohdue: "if" is inadequate not because it cannot say "probably 80 [...]". b) it solves this problem in a simple way (for both designer and user). c) it is easy to implement due to its straightforward structure involving only common api functions: despite its enriched semantics, the emulated "if" does not need a modified compiler. [figure 2: domino: semantics interpreted in trivalent logic — evifcond leads either to the usual then/else branches or, via a raised exception, to "undecidable" (łukasiewicz semantics).] middle range. a) corollary 1: being downward compatible, it should be useful for legacy systems. b) corollary 2: emulating a language primitive, it should be useful not only for dss, but for any application in ohdue. c) corollary 3: dealing with both time and uncertainty, it should be useful for any agent-oriented application: other agorithm mechanisms could be outlined similarly. d) hopefully it will start a "virtuous circle" able to adapt the dss to the "information age" requirements, changing both the decision-makers' expectations and the programming paradigm. long range. a) a new interpretation of domino: a three-valued if within the frame of bivalent logic. b) the badly needed shift in software engineering towards exception handling: the stimulus causes an interrupt that is treated as an exception (real-world applications cannot afford large amounts of "if temperature > n °c then alarm" testing). c) motivating applied research in ai logics, avoiding the gap between new logics (dealing at highly abstract mathematical levels with either time or non-determinism, but rarely with both) and decision-support applications (either remaining at the level of a rigid algorithmic approach or using approaches that ignore the fundamentals of human decision making).
d) a new, "procrastination logic": less algorithmic, less probabilistic, less difficult to understand and to apply in ohdue, where postponing a decision means keeping alternatives open (to react better to rapid environment changes).

bibliography

[1] agentlink roadmap: overview and consultation report, agentlink iii. agent based computing, university of southampton, 2004. [2] anderson, a.r., n.d. belnap, j.m. dunn. entailment: the logic of relevance and necessity, volume 2. princeton university press, princeton, 1992. [3] barbat, b.e. agent-oriented intelligent systems. romanian academy publishing house, bucharest, 2002 (in romanian; "grigore moisil" prize of the romanian academy). [4] barbat, b.e. the impact of broad-band communication upon hmi language(s) (chapter 7); communicating in the world of humans and icts (chapter 8). in cost action 269. e-citizens in the arena of social and political communication (l. fortunati, ed.), pp. 113-142, eur21803, office for official publications of the european communities, luxembourg, 2005. [5] barbat, b.e., a. moiceanu, i. pah. gödelian self-reference in agent-oriented software. proc. of the 11th wseas international conference on computers (iccomp '07) (n.e. mastorakis et al., eds.), 92-97, agios nikolaos, crete, 2007. [6] barbat, b.e., a. moiceanu, s. plesca, s.c. negulescu. affordability and paradigms in agent-based systems. computer science journal of moldova, 2007. (in print.) [7] barbat, b.e., r.s. muntean, r. fabian. approximation versus undecidability in economic modelling. proc. of the international workshop new approaches, algorithms and advanced computational techniques in approximation theory and its applications (d. simian, ed.), 2007. (in print.) [8] barbat, b.e., s.c. negulescu. from algorithms to (sub-)symbolic inferences in multi-agent systems. international journal of computers, communications & control, 1, 3, 5-12, 2006. (paper selected from the proc. of icccc 2006.)
[9] barbat, b.e., s.c. negulescu, s. plesca. emergence as leverage and non-algorithmic approaches in agent-oriented software. studies in informatics and control journal, 16, 4, 2007. (in print.) [10] bruntink, m., a. van deursen, t. tourwé. discovering faults in idiom-based exception handling. proc. of the 28th international conference on software engineering, 242-251, acm press, new york, 2006. [11] castor filho, f., a. garcia, c.m.f. rubira. error handling as an aspect. proc. of the 2nd workshop on best practices in applying aspect-oriented software development, acm press, new york, 2007. [12] clegg, s., m. kornberger, c. rhodes. organizational ethics, decision making, undecidability. the sociological review, 55, 2, 393-409, blackwell publishing, 2007. [13] fipa tc agent management. fipa agent management specification. standard sc00023k (2004/18/03). http://www.fipa.org/specs/fipa00023/sc00023k.pdf, 2004. [14] fisher, m. temporal development methods for agent-based systems. autonomous agents and multi-agent systems, 10, 41-66, springer science + business media inc., 2005. [15] gottwald, s. many-valued logic. in stanford encyclopedia of philosophy (e.n. zalta, ed.). http://plato.stanford.edu/entries/logic-manyvalued/, 2004. [16] rabinovich, a. temporal logics with incommensurable distances are undecidable. information and computation, 205, 5, 707-715, elsevier, 2007. [17] refsgaard, j.c. et al. uncertainty in the environmental modelling process - a framework and guidance. environmental modelling & software, 1543-1556, elsevier, 2007. [18] weisstein, e.w. three-valued logic. from mathworld - a wolfram web resource. http://mathworld.wolfram.com/three-valuedlogic.html. crc press llc, wolfram research, 1999. [19] zambonelli, f., a. omicini. challenges and research directions in agent-oriented software engineering.
autonomous agents and multi-agent systems, 9, 253-283, kluwer academic publishers, 2004.

boldur e. bărbat, "lucian blaga" university of sibiu, department of computer science, 5-7 ion raţiu st., 550012, sibiu, romania. e-mail: bbarbat@gmail.com. received: march 24, 2007. boldur bărbat: m.sc. in electronic engineering, postgraduate specialisation in programming, ph.d. in digital computers ("politehnica" university bucharest). he is with "lucian blaga" university of sibiu, faculty of sciences (full professor) and "politehnica" university timisoara, faculty of automation and computers (advisor for doctoral studies in computer science). over 20 books (romanian academy it prize, 2002). about 50 papers/articles in english in the last five years. current research interests: emergence in agent-based systems (self-awareness, stigmergy), human-agent interaction (transdisciplinary), agent-oriented software engineering (non-algorithmic real-time software, uncertainty).

international journal of computers communications & control, issn 1841-9836, 10(5):746-759, october 2015. multi-attribute collaborative filtering recommendation. c. yu, y. luo, k. liu. changrui yu, school of information management and engineering, shanghai university of finance and economics, shanghai 200433, china, yucr@sjtu.edu.cn. yan luo*, shu-uts silc business school, shanghai university, 20 chengzhong rd, shanghai 201800, china; *corresponding author: luoyan@shu.edu.cn. kecheng liu, informatics research centre, henley business school, university of reading, reading, rg6 3xa, u.k., k.liu@reading.ac.uk. abstract: currently, researchers in the field of personalized recommendation give little consideration to users' interest differences in resource attributes, although resource attributes are usually among the most important factors in determining user preferences.
to solve this problem, the paper builds an evaluation model of user interest based on resource multi-attributes, proposes a modified pearson-compatibility multi-attribute group decision-making algorithm, and introduces the algorithm to solve the recommendation problem of k-neighbor similar users. this study addresses the issues of preference differences among similar users, incomplete values, and premature convergence of the algorithm, and realizes multi-attribute collaborative filtering. the algorithm is proved to be effective by an experiment of collaborative recommendation among multiple users in a virtual environment. the experimental results show that the algorithm has a high accuracy in predicting target users' attribute preferences and a strong anti-interference ability against deviations and incomplete values. keywords: personalized recommendation, group decision-making, multi-attribute, collaborative filtering, pearson-compatibility.

1 introduction

a recommender system aims to generate meaningful recommendations to users for items or products that might interest them [1]. in many markets, consumers face a wealth of products and information from which they can make choices. to alleviate this problem, many web sites attempt to help users by incorporating a recommender system that provides users with a list of items and/or web pages that are likely to interest them. there are real-world operations of industrial recommender systems, such as the recommendations for books on amazon, or movies on netflix. as one of the most successful approaches to building recommender systems, collaborative filtering (cf) uses the known preferences of a group of users to make recommendations or predictions of the unknown preferences for other users [2].
the developers of one of the first recommender systems, tapestry [3], coined the phrase "collaborative filtering (cf)", which has been widely adopted despite the facts that recommenders may not explicitly collaborate with recipients and that recommendations may suggest particularly interesting items, in addition to indicating those that should be filtered out [4]. the fundamental assumption of cf is that if users x and y rate n items similarly, or have similar behaviors (e.g., buying, watching, and listening), then they will rate or act on other items similarly [5]. studies [3] and [4] have shown that users' interest in a product or service is affected by user topic preferences, content preferences, user habits, public evaluation and other factors, and that these factors are decided by the different attributes of items. for example, users liking a new movie may be caused by one or more attributes of the movie, such as the director, star, theme, content, style, public comment and so forth. thus, in the application of a collaborative filtering algorithm, it is necessary to use a multi-attribute analysis model, in which the user rating of an item is based on different perspectives (attributes) that describe the user's interest preferences. although user and resource clustering based on attribute information has been widely discussed in the multi-attribute collaborative filtering literature, the recommendation method is still traditional [6]. such methods can only obtain a set of potentially interesting items for target users, but the reasons for such a recommendation are not given to the target user. in addition, the present studies scarcely consider the differences in the item-attribute preferences of similar users, which can lead to recommendation deviation [6].
for example, in a traditional approach, user b is the most similar to the target user a because a and b show the same degree of interest in the same film. however, if the film properties they prefer are completely different, giving greater weight to b will lead to recommendation deviation. based on our previous research, we propose that multi-attribute collaborative filtering can be treated as a group decision-making process. by building the rating matrix of target items for the similar users, we remove from the nearest-user set any user who has a large attribute-preference difference from the target user, and thus solve the problem of recommendation deviation. in addition, we can analyze the user's interest from the viewpoint of item attributes and give descriptions for the recommendation. accordingly, this paper proposes a modified pearson-compatibility multi-attribute group decision-making algorithm and introduces the algorithm to solve the recommendation problem of k-neighbor similar users. this paper extends our previous research significantly with much more detail on the theoretical model, such as the characteristics of the multi-attribute evaluation, the selection of the nearest neighbors of the target user, and the group decision-making model of personalized recommendation. more importantly, we have enriched and refined the collaborative filtering recommendation algorithm, which is the core of this paper. this paper is organized as follows. we review recommender systems and multi-attribute utility theory. we then introduce the establishment of the user interest model, using an applied-ontology method to describe the user profile, and illustrate the algorithm in specific steps. finally, we present an experiment and conclude with findings and discussions.

2 descriptions of basic model

2.1 user rating matrix

a user's comment on a certain item is usually an integration of multi-attribute comments made from different angles [6].
suppose an item is denoted as p = {a1, a2, a3, ..., an}. based on the revised rating model, the paper establishes the user rating matrix. suppose the user set is denoted as u = {u1, u2, ..., up} and the rating of user uj for item pi is denoted as a(uj, pi):

$$A(u_j,p_i)=\begin{pmatrix}\omega_{11}&\omega_{12}&\omega_{13}&\cdots&\omega_{1n}\\ \omega_{21}&\omega_{22}&\omega_{23}&\cdots&\omega_{2n}\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ \omega_{n1}&\omega_{n2}&\omega_{n3}&\cdots&\omega_{nn}\end{pmatrix}$$

where rows and columns are both indexed by the attributes a1, ..., an, and ωxy is the importance of attribute ax of product pi in comparison with attribute ay for user uj. here we use the 1-9 scale paired comparison method to analyze the relative importance of each attribute of the product that a user evaluates [7]. the rating matrix of an item is mainly acquired through user scoring, or through user behavior analysis, or with the approaches of web semantic mining and fuzzy mathematics [6]. a present user rating system, such as movielens, only asks a user to make a synthetic rating for the movie he or she watched and to give a quantitative score between 1 and 5. this approach is not accurate in identifying the similarity in the preferences of two users. for example, two users interested in the same movie may give it the same score when they rate it, yet the angles of their preference for the movie may be totally different: one user may like the star and the style, while the other prefers the theme and the content of the movie. therefore, we propose to build a multi-attribute rating system to evaluate a product, i.e., a product has many attributes, and when evaluating the product a user mainly gives preference ratings for each product attribute. another notable characteristic of the multi-attribute evaluation is the absolute sparsity of the judgment matrix. this is caused by users who only know a few attributes of the product.
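assuming the row geometric-mean approximation commonly used with 1-9 scale (ahp-style) pairwise comparison matrices — the paper does not fix the weight-extraction method — a feature weight vector can be derived as follows; the matrix values are illustrative:

```python
# deriving a feature weight vector from a 1-9 scale pairwise comparison matrix.
# the row geometric-mean approximation is an assumption here; the paper does
# not specify how weights are extracted from the rating matrix.
import math

def feature_weights(M):
    n = len(M)
    gm = [math.prod(row) ** (1.0 / n) for row in M]  # geometric mean per row
    s = sum(gm)
    return [g / s for g in gm]                        # normalize to sum 1

# example: 3 attributes; a1 is 3x as important as a2 and 5x as important as a3
M = [[1,   3,   5],
     [1/3, 1,   2],
     [1/5, 1/2, 1]]
w = feature_weights(M)
print([round(x, 3) for x in w])  # weights sum to 1, a1 largest
```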
this matrix is like the incomplete-value judgment matrix of group decision-making, which easily leads to obvious deviations in the prediction results, so we must take corrective measures to solve the problem.

2.2 user interest model

suppose user uj has rated several items and the rating matrix set is as = {a(uj,p1), a(uj,p2), ..., a(uj,pt)}, where a(uj,pi) (i = 1, 2, ..., t) is the rating matrix of user uj for product pi. this paper applies the rating matrix set to establishing the user interest model [6]. the specific steps are as follows. 1. calculate the feature weight vector of each rating matrix, and then acquire the feature weight vector set $VS=\{V^{p_1}_{u_j},V^{p_2}_{u_j},\ldots,V^{p_t}_{u_j}\}$, where $V^{p_i}_{u_j}=(w_1,w_2,w_3,\ldots,w_{size(A(u_j,p_i))})$ denotes the feature weight vector of the user rating matrix a(uj,pi) (i = 1, 2, ..., t) and size(a(uj,pi)) denotes the length of the feature weight vector. 2. according to the category of each attribute, calculate the user interest weights of the relevant attribute in the related resource category. referring to the methods proposed by [8] and [9], we propose the following formula for calculating the degree of user interest:

$$VA(u_j,a_y,n)=\frac{\sum_{k=1}^{n}A(u_j,p_k)\times V^{p_k}_{u_j}(w_y)}{n}\quad(1)$$

where va(uj,ay,n) denotes the degree to which user uj is interested in attribute ay, n is the number of items that have attribute ay and that user uj has rated, and $A(u_j,p_k)\times V^{p_k}_{u_j}(w_y)$ (k = 1, 2, 3, ..., n) denotes the degree of interest of user uj in attribute ay of product pk, i.e., the extent to which the preference of user uj for item pk is determined by attribute ay.

2.3 selection of the nearest neighbor of target user

we define the target user as the online user who requires evaluations and preliminary recommendations.
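formula (1) can be sketched directly: average, over the n rated items carrying attribute a_y, the overall item rating times that attribute's weight. the ratings, weights and the helper name interest_degree are illustrative assumptions:

```python
# formula (1) as code: degree of interest of user u_j in attribute a_y.
# ratings[k] plays the role of a(u_j, p_k); weights[k] plays v^{p_k}_{u_j}(w_y).
def interest_degree(ratings, weights):
    n = len(ratings)
    return sum(r * w for r, w in zip(ratings, weights)) / n

# user rated 3 items containing attribute a_y (illustrative values)
print(interest_degree([4.0, 5.0, 3.0], [0.5, 0.2, 0.3]))  # (2.0 + 1.0 + 0.9) / 3 ≈ 1.3
```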
the set of nearest neighbors is composed of the users whose interests and item preferences are most similar to the target user's. traditionally, researchers use the k-nearest neighbor (knn) algorithm and the pearson correlation-based similarity formula to do cluster analysis on the target user and the similar users, according to their similar interests and preferences. through these methods, similar user sets with different group standards can be obtained. in the process of collaborative filtering recommendation with the group decision-making method, we consider the characteristics of the target user's preferences and search the similar user sets. the specific steps are as follows. step 1. search for the user set which has a similar interest distribution to target user ut, i.e., obtain the intersection set of the interest distribution of all users and the target user. two users can rate the same attributes in a number of categories and have similar interest weights. this approach can be applied to obtain the initial nearest neighbor set: iu = {(u1,ω1), (u2,ω2), (u3,ω3), ..., (uw,ωw)}, where w is the total number of users in the initial nearest neighbor set, uk denotes the kth user, whose interest set is sk, and ωk denotes the interest intersection between user uk and the target user ut. step 2. use the pearson correlation-based similarity formula to calculate the similarity between target user ut and a random user uk. when the degree of similarity reaches the threshold, uk is added to sut, the similar user set of ut. finally, sut = {u1, u2, ..., us} is obtained, i.e., s similar users meet the threshold. the set of interest intersections between the target user ut and the s similar users that meet the threshold is seut = ω1 ∪ ω2 ∪ ω3 ∪ ... ∪ ωs.
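step 2 can be sketched as follows, with a hand-rolled pearson correlation; the 0.5 threshold and the toy interest-weight vectors are illustrative assumptions:

```python
# step 2 in miniature: keep every candidate whose pearson similarity with the
# target user's interest weights reaches a threshold (0.5 is illustrative).
def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den if den else 0.0

def similar_users(target, candidates, threshold=0.5):
    return [uid for uid, vec in candidates.items()
            if pearson(target, vec) >= threshold]

target = [0.9, 0.1, 0.5]                 # interest weights over shared attributes
candidates = {'u1': [0.8, 0.2, 0.6], 'u2': [0.1, 0.9, 0.4]}
print(similar_users(target, candidates))  # ['u1']
```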
3 collaborative filtering recommendation algorithm based on multi-attribute group decision-making

3.1 group decision-making model of personalized recommendation

after acquiring a similar user set sut, we need to predict and recommend the items that target user ut has not commented on yet. suppose the item set is source = {p1, p2, p3, ..., pn} and pk (k = 1, 2, ..., n) is an item that the target user has not commented on yet. the traditional collaborative filtering recommendation algorithm is applied to calculate the overall evaluation value a(ut,pk) of the item given by similar users and then obtain the initial recommended source sinitial. suppose a random item pk ∈ sinitial has n attributes, denoted as a set spk = {a1, a2, ..., an}; the set satisfies the condition spk ⊆ seut. the comment matrices, denoted a1, a2, a3, ..., ap, are the comments that similar users in sut make on pk. we suppose p users are similar to the target user and have commented on the item. the algorithm proposed in this paper requires that p ≥ 3; when p < 3, refer to [10]. after obtaining the evaluation matrices a1, a2, a3, ..., ap, it is vital for this research to use the comment information to get the preference matrix of the target user. the collaborative filtering recommendation of multi-attribute similar users is a typical group decision problem, in which many similar users make their decisions independently without any discussion and then the computer synthesizes their opinions to make recommendations to the target user. the result requires that all similar users' comprehensive evaluation matrices and the comprehensive characteristics weight vector {w1, w2, w3, ..., wn} be calculated. wi denotes the comprehensive preference of the similar users for attribute ai, compared with the other attributes of the product that has not been recommended.
according to the comprehensive characteristics weight vector, we can know which attribute determines a user's interest in a specific item. applying the semantic analysis method and the semi-structured description language to the attributes, we can make better recommendations to users. a recommendation model can thus be transferred to a group decision-making model in order to solve the problem. however, the traditional group decision-making algorithm still needs to be improved when applied, since there exist differences between similar users and decision-making experts. the differences mainly lie in the following aspects: 1. it is hard to use the weight vectors to measure the influence of similar users in the recommendation process, mainly because of the complexity of user preferences. according to the traditional method, two users may have a high similarity in their interest preferences; however, it is hard to identify the deviation existing between the preferences of the two users on a specific item. 2. there are a large number of incomplete values in user comment information, because some users may make no comments on unfamiliar attributes in order to ensure the accuracy of their evaluations. in this situation, the application of some traditional group decision-making algorithms, such as the weighted average method or the weighted least squared logarithm method, may result in a great deviation of the final result. to solve the two problems mentioned above, this paper improves the group decision-making compatibility test algorithm and builds a collaborative correction algorithm based on the pearson-compatibility model. the core of this algorithm is to simulate the expert group decision-making process, discover the common opinions of most experts via data analysis, and revise the opinions of each expert to reach final consistent compatibility. moreover, to solve the problem of premature convergence, this paper adopts a mode of multi-user collaborative correction.
in addition to the usual algorithms, when calculating the value of each user's impact, this paper uses the common opinion of all users rather than the target user's opinion as a measurable index. the user whose opinion is more similar to the common opinion is given a higher weight, which differs from the traditional method.

3.2 the related definitions

firstly, we introduce the calculation of the user impact weight. this value is an important indicator to measure the degree of consistency between the evaluation information of a user and that of the others [6]. a user matrix with higher group evaluation consistency gets a higher weight, and vice versa. this paper adopts the concept of user rating similarity [11, 12]. we turn each similar user's n×n rating matrix into an n²×1 one-dimensional vector, so the judgment matrix ak of user uk can be denoted as $V^k=\{\omega^k_{11},\omega^k_{12},\omega^k_{13},\ldots,\omega^k_{1n},\omega^k_{21},\omega^k_{22},\ldots,\omega^k_{2n},\ldots,\omega^k_{n1},\ldots,\omega^k_{nn}\}$. the pearson similarity formula for the rating matrices of user uk and user ul is:

$$SI(A_k,A_l)=\frac{\sum_{i=1}^{n^2}(V^k(i)-\bar{V}^k)\times(V^l(i)-\bar{V}^l)}{\sqrt{\sum_{i=1}^{n^2}[V^k(i)-\bar{V}^k]^2}\times\sqrt{\sum_{i=1}^{n^2}[V^l(i)-\bar{V}^l]^2}}\quad(2)$$

where $\bar{V}^k$ is the average value of all elements of user uk's rating matrix: $\bar{V}^k=(\omega^k_{11}+\omega^k_{12}+\ldots+\omega^k_{nn})/n^2$. the similarity between user uk and the other users is $SI_k=\sum_{l=1,l\neq k}^{p}SI(A_k,A_l)/(p-1)$, where p denotes the number of users. we propose $d_k=1-SI_k$ as an approximate measure of variance, which indicates the deviation degree of the evaluation matrix. the approximate influence weight of user uk is:

$$\theta_k=\frac{(1-\max\{SI_l,\ l=1,2,\ldots,p\})^2}{d_k^2}\quad(3)$$

after acquiring the similar users' influence weights, we suppose the group integrated approximate evaluation matrix of the p users is A*, with each element $\omega^*_{ij}$ of A* given by:

$$\omega^*_{ij}=\sum_{k=1}^{p}\theta_k\times\omega^k_{ij}\Big/\sum_{k=1}^{p}\theta_k\quad(4)$$

matrix A* is not a positive reciprocal matrix. suppose X is a positive reciprocal matrix composed of $x_{ij}$. this paper uses the least squares method to modify X, as follows:

$$f(X)=\min\sum_{i=1}^{n}\sum_{j=1}^{n}(x_{ij}-\omega^*_{ij})^2\quad\text{s.t.}\ x_{ij}\times x_{ji}=1,\ x_{ij}>0\ (i,j=1,2,\ldots,n)\quad(5)$$

the definitions of compatibility and comprehensive compatibility are as follows:

definition 1. suppose X is the group user comprehensive evaluation matrix obtained with the least squares method. then the judgment matrix compatibility between user k and the other users is:

$$S(A_k,X)=\frac{\displaystyle\sum_{i=1}^{n}\sum_{j=1}^{n}\frac{\omega^{(k)}_{ij}\times x_{ij}}{\max\big((\omega^{(k)}_{ij})^2,(x_{ij})^2\big)}}{n^2-\alpha}\quad(6)$$

although [13] has defined expert judgment matrix compatibility in the usual case, it does not consider incomplete values; formula (6) is a modified approach to solve the problem. firstly, each block that the user does not rate is given the value 0; then α indicates the number of zeros. the aim of this approach is to eliminate the influence of missing entries in the user judgment matrix on the compatibility indicator.

definition 2. suppose A1', A2', ..., Ap' are the compatibility-corrected matrices of the p users' judgment matrices. then the comprehensive consistency indicator s̄ is:

$$\bar{S}=\frac{\sum_{k=1}^{p}S(A,A_k')}{p}\quad(7)$$

readers can refer to the simulation result of [7]. when S(A,B) ≥ 0.8, the two evaluation matrices are considered nearly compatible; when s̄ ≥ 0.8, the evaluation matrices of all p similar users are considered compatible.

3.3 collaborative correction algorithm based on pearson-compatibility

the pearson-compatibility model can be used to simulate the process of experts holding group discussions and finally making group decisions.
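formulas (3) and (4) can be sketched in miniature: from pairwise matrix similarities, derive each user's impact weight θk and a θ-weighted comprehensive entry; the similarity values and matrix entries below are illustrative:

```python
# formulas (3) and (4) in miniature. si[k][l] plays si(a_k, a_l); entries[k]
# plays omega^k_ij for one fixed cell (i, j). values are illustrative.
def impact_weights(si):
    p = len(si)
    # average similarity of each user to the others (si_k)
    sik = [sum(si[k][l] for l in range(p) if l != k) / (p - 1) for k in range(p)]
    smax = max(sik)
    d = [1 - s for s in sik]                 # deviation degree d_k = 1 - si_k
    return [(1 - smax) ** 2 / dk ** 2 for dk in d]   # formula (3)

def comprehensive_entry(entries, theta):
    # formula (4): theta-weighted mean of the users' entries
    return sum(t * e for t, e in zip(theta, entries)) / sum(theta)

si = [[1, 0.9, 0.8],
      [0.9, 1, 0.7],
      [0.8, 0.7, 1]]
theta = impact_weights(si)
print(theta)                                  # most consistent user gets weight 1
print(comprehensive_entry([3.0, 3.0, 5.0], theta))
```

note that the outlier opinion (5.0) is pulled toward the majority because its owner receives the smallest θ.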
the model mainly consists of two indicator calculation formulae: the pearson similarity calculation and the compatibility test. the pearson similarity formula is mainly used to calculate the degree of information retention of a user rating matrix after the matrix has been revised, and the compatibility test is used to test the degree of consistency of the user rating matrices after they have been revised. the main function of the pearson-compatibility model is to build the association between these two varying indicators. based on this association, the compatibility correction algorithm chooses the best revised matrix in each iteration, which makes the user matrices unanimous and most similar to the true value. referring to [14] and based on the related methods [15, 16], this paper proposes the pearson-compatibility model as follows:

$$\omega(t)=SI(t)^{1-\beta}\times S(t)^{1+\beta}\quad(8)$$

where ω(t) denotes the calculation result of the pearson-compatibility model after t iterations. with the continuous revision of the user matrices, si(t) decreases from 1 to 0, and s(t) increases from 0 to 1. each modification therefore generates a better revised scheme which minimizes the change of si(t) while maximizing the change of s(t). this indicates that all the experts acquire a better compatibility with the least possible loss of original information. this method is similar to real decision processes [16] and provides an effective way to approach the true value. in the pearson-compatibility model, the best revised scheme in each iteration is the one that maximizes ω(t). β is used to control the marginal diminishing rate and the marginal increasing rate of si(t) and s(t). the efficiency of this algorithm is low when the consistency of the evaluation matrices is low. moreover, it is possible that the algorithm converges prematurely, i.e., ω(t) reaches its maximum value at the beginning of the calculation.
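formula (8) is easy to sketch; the (si, s) pairs below are illustrative and only show how β tilts the choice of revised scheme toward compatibility:

```python
# formula (8) as code: omega(t) = si(t)**(1 - beta) * s(t)**(1 + beta)
# trades information retention si against group compatibility s.
def omega(si, s, beta=0.0):
    return si ** (1 - beta) * s ** (1 + beta)

# two candidate revised schemes, as (si, s) pairs (illustrative values)
a, b = (0.95, 0.70), (0.85, 0.80)
print(omega(*a), omega(*b))            # beta = 0: nearly even trade-off
print(omega(*a, 0.5), omega(*b, 0.5))  # beta = 0.5: compatibility weighs more
```

raising β widens the gap in favour of the scheme with higher s, which is how a larger β speeds convergence in the paper's description.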
at that time, if the value of β increases, the efficiency of the algorithm can be improved and the problem of premature convergence can simultaneously be solved to some degree. regarding the design and realization of the compatibility correction algorithm based on pearson-compatibility, readers can refer to [17, 18]. in order to solve the problem of premature convergence, this paper introduces a multi-user collaborative correction mechanism based on the simulated annealing algorithm (saa). the following experiments in this paper testify that this method can solve the problem of premature convergence and further improve the accuracy of the experimental result. the algorithm is described as follows: step 1: suppose u is the number of compatibility test iterations and zs is the number of users whose evaluation matrices cannot be revised further in each step, with original value 0. suppose the original user matrices of all p users are a1(0), a2(0), ..., ap(0). then, after u compatibility correction iterations, the p users' evaluation matrices are a1(u), a2(u), ..., ap(u). step 2: use formulae (2) and (3) to calculate the influence weights of the evaluation matrices of the p users. step 3: use formulae (4) and (5) to calculate the user comprehensive rating matrix x(u) after the uth compatibility correction. step 4: use formula (6) to calculate the compatibility between the kth user's ak(u) and x(u), then apply formula (7) to calculate the comprehensive consistency degree s̄(u). step 5: when zs = p and s̄(u) < 0.8, go to step 7. when zs ≠ p and s̄(u) < 0.8, go to step 6. when zs ≠ p and s̄(u) ≥ 0.8, we believe the p users' comprehensive consistency passes the test; go on with step 8. step 6: referring to [17], we can revise x(u) and ak(u). suppose the deviation matrix of ak(u) is $D_k^{(u)}=\{e^{k(u)}_{ij}\}$, where $e^{k(u)}_{ij}=x^{(u)}_{ij}-\omega^{k(u)}_{ij}$.
Suppose e_wr^{k(u)} = max(|x_ij^{(u)} − ω_ij^{k(u)}|) is the maximum deviation item after u iterations, where ω_ij^{k(u)} is a nonzero term. Let u = u + 1 and begin a new round of revision. The revised rating matrix A^{k(u)} has entries

a_ij^{k(u)} = x_ij^{(u−1)} − e_ij^{k(u−1)} × 0.95^µ,         when i = w, j = r;
a_ij^{k(u)} = 1 / [x_ij^{(u−1)} − e_ij^{k(u−1)} × 0.95^µ],   when i = r, j = w;
a_ij^{k(u)} = x_ij^{(u−1)} − e_ij^{k(u−1)},                  when i ≠ w, i ≠ r, j ≠ r, j ≠ w;    (9)

where µ is the step-length adjusting factor, restricted in this paper to the range 0 to 10, so that 0.95^µ lies in [0.5987, 1]. The larger µ is, the faster the rating matrix of user k converges. To prevent an oversized adjustment of a single user matrix, which would lose the information contained in the initial rating matrix, the maximum adjustment extent is capped by the factor 0.5987. If µ is given an appropriate value in each iteration, the speed and the accuracy of the algorithm improve considerably. The existing algorithm, however, is not effective enough, so this paper builds a multi-user collaborative correction model on the simulated annealing algorithm (SAA): SAA is applied to revise the maximum deviation item in each iteration and to calculate the optimal correction factor µ_k for each user k (k = 1, 2, ..., p). In the SAA, each point s of the search space is analogous to a state of some physical system, and the goal is to bring the system from an arbitrary initial state to a state with the minimum possible energy. According to the Metropolis principle, the probability that a solid particle at temperature T passes from a disordered state to a steady state is exp(−ΔE / (k·T)), where ΔE is the change in internal energy when the temperature changes to T and k is the Boltzmann constant. When the SAA is applied to a combinatorial optimization problem, the internal energy E corresponds to the value of the target function and the temperature T to a control factor; this yields the simulated annealing algorithm. In the SAA we take the initial value of the target function in the model as the energy E and T as the control factor. The attenuation amplitude is then controlled in each iteration, a random function simulates the random motion of particles, the objective-function difference is calculated as Δf, and the result is chosen according to the Metropolis principle. When T decreases to a critical value, or the model solution can no longer be improved, the algorithm stops and an approximately optimal result is obtained. Applied to the multi-user collaborative correction algorithm, the SAA is described as follows:

Step 6.1: Suppose this is the u-th iteration. Take formula (8) as the energy function. Let c = 0 count the iterations for calculating µ_k, and let sc = 0 be the verdict factor that decides whether to stop the iteration.

Step 6.2: For each user matrix A^{k(u)} (k = 1, 2, ..., p), apply formula (9) to locate the maximal deviation item and calculate the evaluation set A_c^{1(u)}, A_c^{2(u)}, ..., A_c^{p(u)}. Randomly choose the initial state set [µ_c^1, µ_c^2, ..., µ_c^p] between 0 and 1.

Step 6.3: Use MATLAB to generate p random numbers Δ1, Δ2, ..., Δp between 0 and 1, and set µ_c^k = Δk (k = 1, 2, ..., p; the Δk differ in each iteration). Let c = c + 1 and calculate the energy function ω_c^{(u)}. Let T be the temperature schedule, with initial value T0 = 100 and attenuation function Tc = Tc−1 × 0.95, and let rt be a counting variable, rt = rt + 1. Let Δf = ω_c^{(u)} − ω_{c−1}^{(u)}. Then make the following judgment: when Δf > 0, [µ_c^1, µ_c^2, ..., µ_c^p] is accepted as the new state solution; go on with Step 4. When Δf < 0, there are two possibilities: if exp(Δf / Tc) > random[0, 1] holds, [µ_c^1, µ_c^2, ..., µ_c^p] is accepted as the new state solution; go on with Step 4.
Otherwise, let µ_c^k = µ_{c−1}^k and sc = sc + 1.

Step 6.4: When sc > 10 or Tc < 0.01, the end condition is reached; export the result µ_c^1, µ_c^2, ..., µ_c^p. Otherwise, go on with Step 2 once the value of u is obtained.

Step 7: Pop up a dialog asking whether to set a new threshold sxy, the user-acceptable level. If the user chooses to reset, zs is set to zero, the Pearson value in each loop iteration is compared with sxy to obtain zs, and Step 5 is executed. If the user chooses not to reset, Step 8 is executed.

Step 8: Export the calculation results X(u) and s̄(u) and the p users' final evaluation matrices A^{k(u)} (k = 1, 2, ..., p).

We have verified the effectiveness of the algorithm in previous studies; readers interested in the test process may refer to [16] and [17].

4 Experimentation

We built an experiment environment to validate the effectiveness of the algorithm under realistic conditions [6]. The environment is described as follows. Ontology and related methods were adopted to design and develop a movie information database: Jena 2.6.2 stores the movie information in RDF format and ARQ 2.2 is used to manage it. We imported 300 movies covering 10 categories. A semantic analysis of each movie extracts keywords to form the initial attribute set; the synonyms and similar words in the initial set are then merged. Topical words are taken as characteristic attributes and used to represent the movies. Finally, 15 attribute categories and 282 concrete attributes were extracted. On top of the movie database, an online multi-attribute rating system and a collaborative filtering recommendation system based on group decision making were designed and developed. The concrete process that tests the algorithm is as follows [6]:

1.
Select four evaluated movies for which g(u, p) is comparatively large and use them as the test set; they contain 6, 7, 8 and 9 attributes respectively. Then use the target user's evaluation matrices, which serve as the real weight vectors, to calculate the user interest vectors for each movie.

2. Based on the set of user-evaluated movies (excluding the 4 movies in the test set), apply the methods of Sections 2.3 and 2.4 to search for the set of users most similar to the target user (i.e., those with similar interest distributions). Take movie 1, with 6 attributes, as an example. The real interest vector is s = [3.7288, 2.7053, 1.9627, 0.4657, 0.3293, 0.3293]. The total score of this movie is 4.5, which indicates that the target user has a strong preference for it, determined mainly by the first three attributes. In total, 9 similar users have evaluated this movie. First, the traditional collaborative filtering algorithm is applied to obtain the weighted average of the movie's total score, giving 3.94. From this alone we cannot tell whether the target user is interested in the movie, so we use the similar users' evaluation matrices to make the judgment.
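The similar-user search relies on comparing interest vectors. The paper's own weighting formulas (2)-(7) are not reproduced here, so the following is only a sketch of the standard Pearson correlation that underlies such a ranking; the candidate vector is a hypothetical similar user:

```python
import math

def pearson(u, v):
    """Standard Pearson correlation between two interest/rating vectors.
    A sketch of the neighbour-ranking similarity; the paper's formulas
    (2)-(7) additionally apply influence weights on top of this."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = math.sqrt(sum((a - mu) ** 2 for a in u))
    sv = math.sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv)

target = [3.7288, 2.7053, 1.9627, 0.4657, 0.3293, 0.3293]  # real interest vector, movie 1
candidate = [4.0, 2.5, 2.0, 0.5, 0.4, 0.3]                 # hypothetical similar user
print(round(pearson(target, candidate), 4))
```

A correlation close to 1 marks the candidate as a member of the similar user set for the target user.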
The evaluation matrices of six of the similar users are listed below (a 0 denotes a missing comparison):

A = [ 1    2    2    5    7    9
      1/2  1    1    6    5    6
      1/2  1    1    7    7    6
      1/5  1/6  1/7  1    1    2
      1/7  1/5  1/7  1    1    3
      1/9  1/6  1/6  1/2  1/3  1 ]

B = [ 1    2    3    3    7    7
      1/2  1    0    4    7    6
      1/3  0    1    0    4    5
      1/3  1/4  0    1    1    2
      1/7  1/7  1/4  1    1    3
      1/7  1/6  1/5  1/2  1/3  1 ]

C = [ 1    2    2    4    0    0
      1/2  1    1    3    0    0
      1/2  1    1    2    0    0
      1/4  1/3  1/2  1    0    0
      0    0    0    0    1    0
      0    0    0    0    0    1 ]

D = [ 1    2    3    4    6    8
      1/2  1    2    2    4    6
      1/3  1/2  1    1/2  4    7
      1/4  1/2  2    1    3    2
      1/6  1/4  1/4  1/3  1    2
      1/8  1/6  1/7  1/2  1/2  1 ]

E = [ 1    2    0    7    8    0
      1/2  1    2    8    7    7
      0    1/2  1    0    7    8
      1/7  1    0    1    1    2
      1/8  1/7  1/7  1    1    1
      0    1/7  1/8  1/2  1    1 ]

F = [ 1    2    4    0    7    0
      1/2  1    2    6    7    7
      1/4  1/2  1    0    2    2
      0    1/6  0    1    0    2
      1/7  1/7  1/2  0    1    1
      0    1/7  1/2  1/2  1    1 ]

3. Apply the four algorithms to calculate the scores of the four movies and compare the deviations from the real weight vectors of the target user. The results are listed in Table 1:

Table 1: Comparison between algorithms (deviation from the real weight vectors)

                                   Movie 1     Movie 2     Movie 3     Movie 4
                                   (order 6)   (order 7)   (order 8)   (order 9)
Arithmetic weighted average        0.1589      0.0564      0.1985      0.1132
Logarithmic least squares          0.1054      0.0534      0.1398      0.0831
Compatibility correction           0.0877      0.0556      0.1042      0.0687
Our algorithm                      0.0780      0.0543      0.0885      0.0683

As Table 1 shows, when the scores of some of the similar users deviate strongly from those of the others, the algorithm proposed in this paper handles the early-convergence problem better than the other algorithms and obtains a more accurate result. The core of our algorithm is that the revised values of the comprehensive evaluation matrix are determined by the majority of users; accordingly, the highly deviated evaluation values are revised.
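The single-entry revision rule of formula (9) can be sketched in code. This is a simplified illustration with our own function name: it finds the entry of a user matrix with the largest absolute deviation from the comprehensive matrix X, damps that deviation by 0.95^µ, and restores the reciprocal entry, leaving all other entries unchanged:

```python
def revise_once(a, x, mu):
    """One revision step in the spirit of formula (9): locate the entry of
    user matrix `a` with the largest absolute deviation from comprehensive
    matrix `x` (ignoring missing, i.e. zero, entries), damp the deviation
    by 0.95**mu, and keep the reciprocal entry consistent."""
    n = len(a)
    w, r, best = 0, 0, -1.0
    for i in range(n):
        for j in range(n):
            if a[i][j] != 0 and abs(x[i][j] - a[i][j]) > best:
                w, r, best = i, j, abs(x[i][j] - a[i][j])
    revised = [row[:] for row in a]        # all other entries stay unchanged
    e = x[w][r] - a[w][r]                  # deviation e = x - omega
    revised[w][r] = x[w][r] - e * 0.95 ** mu   # damped move toward x
    revised[r][w] = 1.0 / revised[w][r]        # reciprocity of the pairwise matrix
    return revised
```

With µ = 0 the matrix is unchanged (x − e = a), and with µ = 10 the entry moves the maximum allowed fraction, 1 − 0.5987, of the way toward x, matching the adjustment cap described in the text.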
The seven-order matrix experiment shows that the deviations of all the algorithms are small when the similar users have unanimous evaluation matrices. The nine-order matrix experiment shows that when the similar users have unanimous evaluation matrices that still contain some incomplete values, the result of the proposed algorithm is close to that of the compatibility correction algorithm and clearly better than the other two algorithms. When our algorithm is run with 5 similar users and a six-order evaluation matrix, the change tendencies of the main indicators (the compatibility value, the Pearson value and the Pearson-compatibility value over the iterations) are shown in Figure 1.

Figure 1: The main indicators' change in our example

4. Again for the first movie with its six-order evaluation matrix, the influence of the number of users on the accuracy of the recommendation results is examined. Suppose the user number is 3, 5, 7 and 9. The accuracy and the number of iterations are calculated for different permutations and combinations of users. Part of the result is shown in Table 2:

Table 2: Comparison of different numbers of similar users

User number                                   3        5        7        9
Initial comprehensive consistency degree      0.5872   0.6890   0.6872   0.7081
Deviation of result                           0.1680   0.1093   0.0828   0.0780
Number of iterations                          12       16       28       37

More deviation items are generated as the number of users increases, so the number of iterations of the algorithm rises. The test indicates that the effectiveness of the algorithm depends strongly on the initial consistency degrees of the users and on the number of users. In general, when the initial consistency degree is low and the similar user set is small (e.g.
there are only 3 users), it is hard for the algorithm to dig out the common information among the users, so the deviation of the result is large. However, when the number of similar users increases beyond a certain point (e.g. to 7 or more), the algorithm retains good accuracy even if the initial compatibility is low. To provide personalized services to the target user, this paper calculates the comprehensive evaluation weight vector of each movie with the group-decision-making model. Take the first movie, with 6 attributes, as an example. The comprehensive evaluation score of the nine similar users is g(u, p) = 3.94, and the comprehensive evaluation vector is v = [4.0653, 2.9492, 1.7630, 0.3972, 0.3044, 0.3607]. Each value of the weight vector represents the potential interest degree of the target user in the corresponding product attribute. The total score is calculated as

Tscore = g(u, p) × (1/n) Σ_{i=1}^{n} v_i    (10)

where Tscore denotes the total score, n the number of attributes, and v_i the comprehensive evaluation value of the i-th attribute of the product. The recommendation set can then be fixed by ranking or by threshold setting. The total scores of the four movies are 6.4616, 5.9400, 5.6734 and 4.6622, as shown in Figure 2.

Figure 2: The Tscore of the four movies

The first movie, with six attributes, has the highest score. Its characteristics are analyzed as follows using the user interest model. First, the weight vector v is normalized, giving v = [0.4131, 0.2997, 0.1792, 0.0404, 0.0309, 0.0367]. The three attributes whose values exceed the average value 0.1666 are picked out; for these attributes the majority of users show an evident preference for movie one.
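Formula (10) is straightforward to evaluate. Using the values reported above for movie 1 (the function name is ours), the computed total score matches the value in Figure 2 up to rounding:

```python
def tscore(g, v):
    """Total score of formula (10): the comprehensive score g times the
    mean of the comprehensive evaluation weight vector v."""
    return g * sum(v) / len(v)

g = 3.94                                              # comprehensive score of movie 1
v = [4.0653, 2.9492, 1.7630, 0.3972, 0.3044, 0.3607]  # comprehensive evaluation vector
print(round(tscore(g, v), 4))  # ≈ 6.4615, matching Figure 2 up to rounding
```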
In the target user interest model there are 124 attributes in total. The 3 attributes of movie one that exceed the average, namely connotation, characteristic and special efficiency, are also larger than the average value (1/124 ≈ 0.0081) among the 124 attribute preferences of the target user. This indicates that the target user has an evident preference for these 3 attributes and that the popularity of this movie is mainly determined by them. Therefore we can recommend movie one to the target user and provide the reasons why it is recommended. We can also use semantic analysis techniques to describe each attribute in detail and provide a more personalized service to the target user. The comparison is shown in the histogram of Figure 3, where the M bars give the percentages of the attributes ranked in the top 3 for movie one, and the U bars give the percentages of the attributes ranked in the top 3 in the target user interest model. The analysis shows that the target user is likely to have the strongest interest preference for attribute one of movie one.

Figure 3: Comparison analysis histogram of movie one with the target user interest

5 Conclusion

Traditional collaborative filtering recommendation algorithms seldom consider the multi-attribute problem. Our approach is based on group decision making: we propose an improved Pearson-compatibility algorithm and apply it to the collaborative filtering recommendation field. We then build a virtual recommendation environment and verify the effectiveness and feasibility of the algorithm. The advantages of the group-decision-based collaborative filtering recommendation algorithm include: Identifying a more suitable set of similar users for the target user. An accurate target user model can be set up via field subdivision according to field attributes.
Then the users who have an interest distribution similar to the target user's can be found, and the similar user set is generated. Providing a more accurate and personalized recommendation service to the target user. The traditional collaborative filtering method can neither recommend a result set to the target user nor provide an analysis service [6]; this weakness is overcome by integrating the information to identify the main factors determining the user's preference, so that the user's needs can be handled more accurately. Considering the evaluation deviation between the similar users and revising the user evaluations. Instead of a weighted mean, the group-decision-making method is used to calculate the comprehensive evaluation score; deleting the deviation items and revising the evaluation matrices gives the result a better fit [6]. The collaborative filtering method based on Pearson-compatibility is thus applied to the personalized recommendation field. The experiment shows that the algorithm is stable when dealing with deviation items and identifies the common preference information among similar users.

Acknowledgment

This research work is supported by the Innovation Program of Shanghai Municipal Education Commission (No. 12ZZ070, No. 12YS018) and the Humanity and Social Science Youth Foundation of the Ministry of Education, China (No. 12YJC630136).

Bibliography

[1] G. Adomavicius, A. Tuzhilin (2005); Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions, IEEE Trans. on Knowl. and Data Eng., 17(6):734-749.
[2] Xiaoyuan Su, Taghi M. Khoshgoftaar (2009); A survey of collaborative filtering techniques, Advances in Artificial Intelligence, 2009:1-19.
[3] D. Goldberg, D. Nichols, B. M. Oki, D. Terry (1992); Using collaborative filtering to weave an information tapestry, Communications of the ACM, 35(12):61-70.
[4] P. Resnick, H. R.
Varian (1997); Recommender systems, Communications of the ACM, 40(3):56-58.
[5] K. Goldberg, T. Roeder, D. Gupta, C. Perkins (2001); Eigentaste: a constant time collaborative filtering algorithm, Information Retrieval, 4(2):133-151.
[6] C. Yu, Y. Luo, K. Liu (2014); A multi-attribute collaborative filtering recommendation algorithm based on improved group decision-making, ICISO 2014, IFIP AICT, 426:320-330.
[7] F. Herrera, E. Herrera-Viedma, F. Chiclana (2001); Multiperson decision-making based on multiplicative preference relations, European J. of Operational Research, 129:372-385.
[8] A. Shepitsen, J. Gemmell, B. Mobasher, R. Burke (2008); Personalized recommendation in social tagging systems using hierarchical clustering, Proc. of the 2008 ACM Conference on Recommender Systems, Lausanne, Switzerland.
[9] A. Ypma, T. Heskes (2002); Categorization of web pages and user clustering with mixtures of hidden Markov models, Proc. of the WebKDD 2002 Workshop: Web Mining for Usage Patterns and User Profiles, SIGKDD 2002, Edmonton, Alberta, Canada.
[10] Z.S. Hua, B.G. Gong, X.Y. Xu (2008); A DS-AHP approach for multi-attribute decision making problem with incomplete information, Expert Systems with Applications, 34(3):2221-2227.
[11] H. Jeon, T. Kim, J. Choi (2010); Personalized information retrieval by using adaptive user profiling and collaborative filtering, AISS: Advances in Information Sciences and Service Sciences, 2(4):134-142.
[12] A. Liu, Z. Yang (2010); Watching, thinking, reacting: a human-centered framework for movie content analysis, International Journal of Digital Content Technology and its Applications, 4(5):23-37.
[13] Y. Dong, Y. Chen, S. Wang (2004); Algorithm of solving weights for group decision making by improving compatibility, Systems Engineering-Theory & Practice, 2004-10.
[14] C.-S. Yu (2002); A GP-AHP method for solving group decision-making fuzzy AHP problems, Computers & Operations Research, 29(14):1969-2001.
[15] Y. Zhang, H.
Wang (2002); Development and application of P-S aided decision system, Systems Engineering-Theory Methodology Application, 2002-04.
[16] L. Liang, L. Xiong, G. Wang (2004); A new method of determining the reliability of decision-makers in group decision, Systems Engineering, 22(6):91-94.
[17] J. Barzilai, F. A. Lootsma (1994); Power relations and group aggregation in the multiplicative AHP and SMART, Proc. 3rd Int. Symp. AHP, 157-168.
[18] J. Sun, W. Xu, Q.D. Wu (2005); A new algorithm for incomplete matrixes' compatibility improvement and group ranking in group decision making, Systems Engineering-Theory & Practice, 10(10):89-94.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 910-918

A New Model for Cluster Communications Optimization

A. Rusan, C.-M. Amarandei

Andrei Rusan, Cristian-Mihai Amarandei
"Gheorghe Asachi" Technical University of Iaşi
Department of Computer Science and Engineering
Address: Bd. Dimitrie Mangeron, nr. 53A, 700050, Iaşi, Romania
E-mail: andrei.rusan@tuiasi.ro, camarand@cs.tuiasi.ro

Abstract: Performance losses of cluster applications can arise from various sources in the communications network of computer clusters. Typically, a CPU-intensive application generates a small amount of network traffic, so the overall influence of the network subsystem is minimal. On the other hand, a data-intensive, network-aware application generates a large amount of network traffic, and the influence of the network subsystem is significantly greater. This paper presents a model that aims to improve the cluster's network performance by reducing the data transfer time; the solution has the advantage of requiring no modification of the original applications or of the kernel.

Keywords: cluster communications optimization, network performance

1 Introduction

The computing performance of a cluster depends on the performance of the cluster components: the computing nodes and the communication infrastructure.
The cluster communication infrastructure is built from network devices whose performance can be changed only by hardware replacement, i.e. replacing 100 Mbps Ethernet switches with Gigabit switches; this component of the cluster is therefore neglected here. The performance of a computing node is defined by its hardware and software performance, where the hardware performance can be considered constant, since it too can be influenced only by hardware changes. The software can be separated into two components, the application and the operating system, and each of them can influence the computing performance of the cluster. Increasing cluster performance by modifying the applications is a goal very hard to achieve, as some applications would require a complete rewrite. There are exceptions, of course, such as network-aware applications, which are built with exactly this goal in mind and do not require further optimization. The last component that can influence performance is the operating system, through the kernel configuration and network-layer optimizations. Since clusters typically contain a large number of computing nodes, any solution for cluster performance improvement must require only minimal modifications of the systems; this is necessary to keep the administrative tasks at a decent level. There is research work on network tuning mechanisms and network-aware applications that tries to solve these issues, but the projects developed so far, such as WAD (Work Around Daemon) [1] and Enable [2], do not meet the imposed requirements: for WAD a modified kernel must be used, and for Enable the applications must be rewritten. The WAD project provides a transparent mechanism to work around a variety of network issues, including TCP buffer size, MTU size, packet reordering, and leaky network loss [1].
WAD's goal is to eliminate the "wizard gap", the difference between the network performance achievable by manually handcrafting the optimal tuning parameters and that of an untuned application [1]. To attain this goal, WAD requires a modified kernel from the Web100 project. This solution could not be applied in our case because of the different Linux kernel versions involved: the Web100 project provides a kernel patch only starting from version 2.6.12, and using a kernel version different from the one provided by the Linux distribution (CentOS 4.5 in our case) can cause problems in maintaining the operating system across clusters. Our solution, by contrast, works no matter which kernel version is used. The Enable project includes monitoring tools, visualization tools, archival tools, problem detection tools, and monitoring data summary and retrieval tools, and it provides an API that makes it very easy for application or middleware developers to determine the optimal network parameters [2]. However, the solution provided by this project was not applicable in our case either, because the applications could not be rewritten. This paper presents a model for self-optimization of the network communications that improves cluster performance by shortening the data transfer time. The implementation of the model requires no modification of the applications or of the kernel structure and no new kernel modules; it uses only the tools provided by the operating system for runtime configuration, so automatic operating system and kernel updates can be applied immediately. In the next section, a brief review of the relevant TCP transport protocol issues, the Linux kernel network subsystem, and the NetPIPE network performance measurement tool is presented. Section 3 describes the network self-optimization model and the proposed algorithm.
Section 4 presents the test environment and the experimental results. The final section summarizes the authors' efforts on tuning the cluster communications network and considers future extensions of the work.

2 Background

TCP transmits new data into the network when old data has been received, as indicated by acknowledgments from the receiver to the sender. The data rate is determined by the window size and is limited by the application, by the buffer space at the sender or the receiver, and by the congestion window. TCP adjusts the congestion window to find an appropriate share of the network capacity of the path between source and destination. Missing or corrupted data segments are repaired by TCP by retransmitting the data from the sender's buffer; this process requires an entire window of data to fit into both the sender and receiver buffers [1]. The largest TCP window is 65535 bytes (about 64 KB), because the TCP header uses 16 bits to report the receive window size to the sender. The window scale option was therefore introduced; it defines an implicit scale factor by which the window size value from the TCP header is multiplied to obtain the real TCP window size, as described in [3]. These buffers have default values that may be changed either by the applications, using system calls, or with tools provided by the operating system, i.e. the sysctl tool on Linux/Unix. The second part of this section focuses on the network subsystem of the Linux kernel. Starting from version 2.4 of the Linux kernel, an auto-tuning technique is used to perform memory management: buffer sizes are simply increased and decreased depending on the available system memory and the available socket buffer space. By increasing buffer sizes when they are full of data, TCP connections can increase their window size; the performance improvement is an intentional side effect [4].
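To see why the window scale option matters on a cluster network, consider the bandwidth-delay product: on a Gigabit link even a modest round-trip time calls for a window far larger than the unscaled 16-bit maximum. A sketch with illustrative numbers (the helper name is ours):

```python
import math

def window_scale_needed(bandwidth_bps, rtt_s, max_unscaled=65535):
    """Smallest TCP window-scale shift such that a scaled 16-bit window
    can cover the bandwidth-delay product of the path."""
    bdp = bandwidth_bps / 8 * rtt_s  # bytes in flight needed to fill the pipe
    if bdp <= max_unscaled:
        return 0
    return math.ceil(math.log2(bdp / max_unscaled))

# Gigabit Ethernet with a 2 ms RTT: BDP = 1e9 / 8 * 0.002 = 250000 bytes.
print(window_scale_needed(1e9, 0.002))  # → 2, since 250000 <= 65535 * 2**2
```

With a scale shift of 2 the advertised window covers the pipe; without scaling the connection is capped at 65535 bytes in flight regardless of the buffer sizes.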
On the other hand, this is done within the limits of the available system memory and socket buffer space, which on a busy cluster are valuable resources. The network subsystem of the Linux operating system should be tuned in order to obtain optimal performance of a computing system. Changes can be made at the following levels: the network interface and the kernel parameters. The kernel parameters that change the network behaviour can be tuned by modifying the following files located in /proc/sys/net:

/proc/sys/net/core/rmem_max
/proc/sys/net/core/rmem_default
/proc/sys/net/core/wmem_max
/proc/sys/net/core/wmem_default
/proc/sys/net/ipv4/tcp_sack
/proc/sys/net/ipv4/tcp_timestamps
/proc/sys/net/ipv4/tcp_keepalive_time
/proc/sys/net/ipv4/tcp_mem
/proc/sys/net/ipv4/tcp_rmem
/proc/sys/net/ipv4/tcp_wmem
/proc/sys/net/ipv4/tcp_window_scaling

The network interface can also be tuned by modifying the speed and duplex settings and the MTU size. Two problems have to be addressed while setting up the cluster:

• the default kernel values do not provide the best performance for the custom environment, and
• the number of communication devices that need to be set.

Since ready-made solutions to these two problems are missing, determining an optimal value for each system in the cluster, to get the best possible performance, is proposed. With the right tools, changes to the network settings take effect immediately; the optimization algorithm presented in this paper is based on this feature. The values of the send/receive buffers (tcp_wmem and tcp_rmem) can be changed by specifying the minimum, initial and maximum sizes, as follows:

sysctl -w net.ipv4.tcp_rmem="4096 87380 8388608"
sysctl -w net.ipv4.tcp_wmem="4096 87380 8388608"

The third value must be the same as or less than the wmem_max and rmem_max values.
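The constraint just stated (the third value of tcp_rmem/tcp_wmem must not exceed the corresponding rmem_max/wmem_max limit) can be checked before applying a setting. The following sketch uses a helper of our own, not part of the paper's tooling, to build the sysctl command only when the triple is valid:

```python
def sysctl_cmd(name, triple, core_max):
    """Return the sysctl command for a min/default/max buffer triple,
    after checking that min <= default <= max and that max does not
    exceed the corresponding net.core limit (rmem_max or wmem_max)."""
    lo, default, hi = triple
    if not (lo <= default <= hi):
        raise ValueError("triple must be ordered min <= default <= max")
    if hi > core_max:
        raise ValueError("max value exceeds the net.core limit")
    return 'sysctl -w net.ipv4.%s="%d %d %d"' % (name, lo, default, hi)

print(sysctl_cmd("tcp_rmem", (4096, 87380, 8388608), core_max=8388608))
```

Running the sketch prints exactly the first command shown above; an invalid triple is rejected instead of being silently clamped by the kernel.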
The first value can be increased on high-speed, high-quality networks so that the TCP window starts out at a sufficiently high value [5]. TCP window scaling is a further option for enlarging the transfer window. Performance measurements of the cluster network can be made with a wide range of tools such as Iperf [6], Netperf [7] or NetPIPE (Network Protocol Independent Performance Evaluator). Because the performance measurements must cover both the TCP and the MPI layer, and NetPIPE provides a complete measurement of the communication performance on both of them, the tests were performed using this tool. The NetPIPE utility performs simple ping-pong tests, bouncing messages of increasing size between two computers. To provide a complete test, NetPIPE varies the message size, with a slight perturbation, at regular intervals and measures the point-to-point communication performance between nodes [8]. Because we want to determine the maximum available bandwidth for different use cases, using different message sizes is a must; of all the available performance measurement tools, only NetPIPE has this feature by default, which makes it the right tool for this kind of test. An in-depth description of the NetPIPE utility can be found in [8], [10]. Information about the Linux kernel network subsystem gathered by running NetPIPE on the cluster was used to improve the throughput.

3 Proposed Model

The proposed model implies the computation of the best possible set of values for a given set of parameters. Figure 1 presents the model's schematics, composed of three parts: the "control logic", the "parameter computation" and the "network test tool". The first component is responsible for sending the starting set of values to the "parameter computation" and keeps the running flow under control.
The second component computes the sets of values, based on previously known sets of data and on the results of the current run received from the "network test tool", and sends them to the kernel in order to configure the network subsystem. The "network test tool" is responsible for running the sets of tests and providing the results to the "parameter computation" component. The optimization process is started by the "control logic", which sends the starting values to the "parameter computation"; these are set in the kernel network subsystem, and the first set of tests is started through the "network test tool". After the tests finish, the results are sent to the "parameter computation", which computes a new set of values, and the process continues until the end condition is met.

Figure 1: The cluster communication optimization model

The process provides self-optimization of the kernel network subsystem; the only necessary interaction with the application is the configuration file, which contains the values needed for the application startup and for the components' behaviour. These values can be set by administrators to meet specific needs. An algorithm implementing the model's functionality is proposed. This algorithm performs the bandwidth measurements and adjusts the sets of parameters to obtain the highest bandwidth usage for each case and test run. Given l, the number of tests to be performed, let N = {n1, n2, ..., nk} be the set of nodes in the cluster, T = {t1, t2, ..., tl} the set of test variables (i.e. tcp_rmem, tcp_wmem, tcp_window_scaling), I = {i1, i2, ..., il} the start values of the kernel network subsystem parameters, and E = {e1, e2, ..., el} the set of computed values for each test ti.
also, given m as the number of messages, ms = {ms1, ms2, . . . , msm} is defined as the set of message sizes used by the testing tool, x = {x1, x2, . . . , xm} as the set of best result values for each test ti, ri = {r1, r2, . . . , rk}i as the results set, one for each cluster node, s = {s1, s2, . . . , sm}, where si = ∑ j=1..k ri j, and b = {b1, b2, . . . , bm} as the set of best values for each test ti. the algorithm computes the values for the test variables as follows. it has two main components: the network test component, corresponding to the "network test tool" in figure 1 and implemented by lines 3-19 of the algorithm, and the computational component, corresponding to "parameter computation" in figure 1 and implemented by lines 20-30. the methods used in the algorithm implement the following actions:
• generate_set: produces a new set of parameters used for network testing;
• start_remote_testing_program: launches the remote component of the testing application (netpipe in our case);
• prepare_local_testing_program: prepares the local component of the testing application, necessary to maximize the accuracy of the measured values;
• execution_of_local_testing_programs: runs the testing application;
• get_max_count: extracts the parameter value corresponding to the maximal throughput.
finally, line 31 sequentially sets the kernel parameters to the best computed values on the entire cluster.

4 implementation and experimental results

to improve the cluster communications, dynamic tests and adjustments are performed for the following linux kernel network parameters: tcp_window_scaling, tcp_rmem and tcp_wmem. bandwidth measurements and tcp parameter adjustments were carried out to obtain the highest bandwidth usage for each case and to determine the maximum bandwidth available for different use cases.
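of the methods listed above, get_max_count is the only one with real arithmetic: sum the per-node results ri j into si for each candidate value and take the argmax of the sums; a sketch with invented throughput numbers:

```python
def get_max_count(results):
    """results maps a candidate parameter value to the list of
    per-node throughputs r_ij; return the value whose summed
    throughput s_i is maximal."""
    sums = {value: sum(per_node) for value, per_node in results.items()}
    return max(sums, key=sums.get)

runs = {
    4096:  [310, 305, 298],   # mb/s per node, illustrative numbers
    16384: [480, 470, 465],
    65536: [455, 450, 440],
}
best_value = get_max_count(runs)
```

here 16384 wins because its summed throughput across the nodes is the largest.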
the environment used to test the proposed model consists of a grid cluster with the following configuration: one front-end computer with 4 x 3.66 ghz intel xeon processors, 4 x 146 gb hard drives and 8 gb of ram, and 12 computing nodes, each with 1 x 2.33 ghz intel core2 duo cpu, 1 x 160 gb hard drive and 2 gb of ram, with gigabit ethernet cards connected with cat6 cables via a gigabit switch. the first implementation of the algorithm was made in bash but, due to the difficulties of working with data structures, it was switched to perl. also, to preserve measurement accuracy and performance, the test results were saved to files for further usage, such as graphical presentation. the performance results were obtained based on a test array with three elements: one for the tcp window scaling, a second one for the tcp read buffer and a third one for the tcp write buffer kernel parameters. for each of these there is a graphical presentation, where the x and y axes show, respectively, the message size used during the tests and the resulting bandwidth values. the results for the tcp window scaling parameter are shown in figure 2(a), where one line is for net.ipv4.tcp_window_scaling=0 and the other one is for net.ipv4.tcp_window_scaling=1. for an easier reading of the results graph, we apply a bezier function in order to obtain the presentation from figure 2(b). the influence of the tcp read/write buffer size on the available bandwidth is presented in figure 3(a)/figure 4(a). in this case, there are a large number of graphic representations and observation is very difficult, so a bezier function was applied to the result values to make the graphical presentation more readable, as shown in figure 3(b)/figure 4(b). the red line in the graphical representations corresponds to the default value for tcp_rmem (4 kb) and the blue dotted line to the best value resulting from this model, with 100 mbps more than the default value.
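the three parameters being tuned are ordinary sysctl keys, so applying a candidate set amounts to writing net.ipv4.* values (tcp_rmem and tcp_wmem take "min default max" triples); a dry-run sketch that only builds the command strings — the concrete values are examples, and the paper's actual implementation was in perl:

```python
def sysctl_commands(params):
    """build the sysctl invocations that would apply one candidate
    parameter set; tuple values become space-separated triples."""
    cmds = []
    for key, value in params.items():
        if isinstance(value, (list, tuple)):
            value = " ".join(str(v) for v in value)
        cmds.append(f"sysctl -w net.ipv4.{key}='{value}'")
    return cmds

cmds = sysctl_commands({
    "tcp_window_scaling": 1,
    "tcp_rmem": (4096, 87380, 524288),   # example triple, not the paper's
    "tcp_wmem": (4096, 16384, 524288),
})
```

running the commands (or writing /proc/sys/net/ipv4/... directly) requires root, which is why the sketch stops at building the strings.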
by applying different values for the tcp buffers and running the tests, the bandwidth variation for each of them is presented (figure 5). the tcp buffer size starts from 4 kb and each algorithm step doubles the previous value.

figure 2: tcp window scaling influence over bandwidth (panels (a) raw and (b) bezier-smoothed; bandwidth in mbps versus message size in bytes, for scaling disabled and scaling enabled)

figure 3: tcp read buffer influence over bandwidth (panels (a) raw and (b) bezier-smoothed; one curve per buffer size from 4096 to 524288 bytes)

figure 4: tcp write buffer influence over bandwidth (panels (a) raw and (b) bezier-smoothed; one curve per buffer size from 4096 to 524288 bytes)

figure 5: bandwidth achieved for different tcp buffer sizes

a ram drive was built on all computers in order to test the results without any delays introduced by the hard disk drive. because of the 2 gb memory limit on the computing nodes, the transferred file and the ram drive size were 512 mb. figure 6(a) shows the transfer time for the file.
in this test the tcp buffer size was changed from 4 kb to 512 kb and the data transfer was started in both directions for each value, from the front-end to the cluster nodes and back.

figure 6: (a) transfer time for a 512 mb file across the cluster, for tcp_wmem and tcp_rmem buffer sizes from 4096 to 524288 bytes; (b) the benefits of the parameter adjustments: bandwidth before and after tuning, as read from the chart roughly 547 versus 724 for window scaling, 480 versus 715 for tcp_wmem and 371 versus 695 for tcp_rmem

the best transfer time was obtained when both tcp_rmem and tcp_wmem were 16 kb or 32 kb. the tests also provide values for all the considered tcp parameters, which can be useful in other scenarios. in figure 6(b), the bandwidth improvement is presented. using only the default values, the available bandwidth of the cluster internal network is not optimally used, with a strong impact on the overall computing performance. using this optimization model, the bandwidth available in the cluster is used efficiently.

5 conclusions and future work

using the proposed model, the communication between cluster nodes has been improved. all results refer to the specific needs of the mentioned cluster, where a significant amount of data needs to be transferred between cluster nodes. for other applications, such as a web server farm, the final results may be slightly different, but they can be optimized by adjusting the test tool for those specific needs. the computed network kernel parameters can be reused later if the use case changes, e.g. to a web server farm. the algorithm can be used for ipv6 too; however, the authors have not implemented or tested this.
the further development of the presented application will follow two directions: one is to extend its capabilities to support udp traffic performance adjustment; the second is to support tuning parameters other than those related to the linux kernel.

bibliography

[1] t. dunigan, m. mathis, b. tierney, a tcp tuning daemon, proceedings of the 2002 acm/ieee conference on supercomputing, baltimore, maryland, 2002.
[2] b.l. tierney, tcp tuning guide for distributed applications on wide area networks, usenix ;login:, http://www-didc.lbl.gov/tcp-wan-perf.pdf, 2001.
[3] v. jacobson, r. braden, d. borman, rfc 1323: tcp extensions for high performance, may 1992.
[4] b.l. tierney, d. gunter, j. lee, m. stoufer, j.b. evans, enabling network-aware applications, proceedings of the 10th ieee international symposium on high performance distributed computing, pp. 281-288, 2001, isbn: 0-7695-1296-8.
[5] netpipe webpage: http://www.scl.ameslab.gov/netpipe/
[6] a. tirumala, l. cottrell, "iperf quick mode", http://www-iepm.slac.stanford.edu/bw/iperfres.html
[7] netperf homepage, http://www.netperf.org/netperf/netperfpage.html
[8] d. turner, a. oline, x. chen, t. benjegerdes, integrating new capabilities into netpipe, lecture notes in computer science, springer-verlag, september 2003, pp. 37-44.
[9] d. turner, x. chen, protocol-dependent message-passing performance on linux clusters, proceedings of the ieee international conference on cluster computing, september 2002, pp. 187-194.
[10] q.o. snell, a. mikler, j.l. gustafson, netpipe: a network protocol independent performance evaluator, iasted international conference on intelligent information management and systems, june 1996.
[11] h. sivakumar, s. bailey, r. l.
grossman, psockets: the case for application-level network striping for data intensive applications using high speed wide area networks, proceedings of ieee supercomputing 2000, november 2000, http://www.ncdm.uic.edu/html/psockets.html
[12] j. postel, rfc 793: transmission control protocol, september 1981.
[13] r. braden, rfc 1122: requirements for internet hosts - communication layers, october 1989.
[14] v. paxson, g. almes, j. mahdavi, m. mathis, rfc 2330: framework for ip performance metrics, may 1998.
[15] e. ciliendo, t. kunimasa, b. braswell, linux performance and tuning guidelines, ibm, july 2007.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 4, pp. 336-342

fuzzy szpilrajn theorem through indicators

irina georgescu

abstract: in this paper some numerical indicators are studied which measure the degree to which a fuzzy relation verifies certain properties (reflexivity, transitivity, etc.). the main result is a fuzzy generalization of the szpilrajn theorem in terms of such numerical indicators; it applies to any fuzzy relation. keywords: fuzzy relation, szpilrajn theorem, similarity

1 introduction

the classical szpilrajn theorem [10] asserts that any partial order can be extended to a total order. this result has been followed by refinements and generalizations and has been used in applications too (e.g. consumer theory [8]). the first fuzzy version of the szpilrajn theorem was established by zadeh [11]. other fuzzy versions of this theorem can be found in [4], [6]. in [3] the topic is systematically studied in the framework of fuzzy orders with respect to a left-continuous t-norm ∗ and a ∗-similarity relation ω. the idea of this paper is the following: instead of studying a property p of a fuzzy relation r (e.g. reflexivity, transitivity, etc.),
to define numerical indicators which express "the degree to which the fuzzy relation r verifies the property p". in this way, instead of considering a fuzzy order r on a set x, we will have a number ord(r) which measures "the degree to which r is a fuzzy order". the main result of the paper is a generalization of the szpilrajn theorem expressed in terms of such numerical indicators. it is a refinement of theorem 6.2 in [3] and it applies to any fuzzy relation.

2 preliminaries

in this section we recall some basic facts about the residuum associated with a left-continuous t-norm and about fuzzy relations ([1], [2], [4], [5], [7], [9]). for any a, b ∈ [0,1] we denote a ∨ b = max(a, b) and a ∧ b = min(a, b). more generally, for any set {ai}i∈i ⊆ [0,1] we denote ∨i∈i ai = sup{ai | i ∈ i} and ∧i∈i ai = inf{ai | i ∈ i}. let ∗ be a left-continuous t-norm [7], [5]. the residuum → associated with ∗ is defined by a → b = ∨{c ∈ [0,1] | a ∗ c ≤ b}. the biresiduum ↔ is defined by a ↔ b = (a → b) ∧ (b → a). we fix a left-continuous t-norm ∗.

lemma 1. [1], [5] for any a, b, c ∈ [0,1] the following properties hold:
(1) a ∗ b ≤ c iff a ≤ b → c;
(2) a ∧ b = a ∗ (a → b);
(3) a ≤ b iff a → b = 1;
(4) a = 1 → a;
(5) 1 = a → a;
(6) a ∗ (a ↔ b) ≤ b.

lemma 2. [1], [5] for any {ai}i∈i ⊆ [0,1] and a ∈ [0,1] the following properties hold:
(1) (∨i∈i ai) ∗ a = ∨i∈i (ai ∗ a);
(2) a → (∧i∈i ai) = ∧i∈i (a → ai);
(3) (∨i∈i ai) → a = ∧i∈i (ai → a).

let x be a non-empty set. a fuzzy subset of x is a function a : x → [0,1]. denote by f(x) the family of fuzzy subsets of x. for any a, b ∈ f(x) denote a ⊆ b if a(x) ≤ b(x) for any x ∈ x. a fuzzy relation on x is a function r : x2 → [0,1].
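section 4 of the paper works with the gödel t-norm a ∗ b = min(a, b), whose residuum has the closed form a → b = 1 if a ≤ b, else b; the identities of lemma 1 can then be spot-checked numerically — a sketch, not part of the paper:

```python
def t(a, b):          # goedel t-norm: a * b = min(a, b)
    return min(a, b)

def res(a, b):        # residuum: sup{c : min(a, c) <= b}
    return 1.0 if a <= b else b

def bires(a, b):      # biresiduum: (a -> b) ^ (b -> a)
    return min(res(a, b), res(b, a))

# spot-check lemma 1 on a grid of values
grid = [i / 10 for i in range(11)]
for a in grid:
    for b in grid:
        assert min(a, b) == t(a, res(a, b))        # (2)
        assert (a <= b) == (res(a, b) == 1.0)      # (3)
        assert res(1.0, a) == a                    # (4)
        assert t(a, bires(a, b)) <= b              # (6)
```

such a check is of course no proof, but it makes the residuation identities concrete for the t-norm actually used later.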
r is said to be:
• reflexive if r(x, x) = 1 for any x ∈ x;
• symmetric if r(x, y) = r(y, x) for any x, y ∈ x;
• ∗-transitive if r(x, y) ∗ r(y, z) ≤ r(x, z) for all x, y, z ∈ x;
• strongly complete if r(x, y) ∨ r(y, x) = 1 for any x, y ∈ x.
a reflexive, symmetric and ∗-transitive fuzzy relation ω on x will be called a ∗-similarity relation. let ω be a ∗-similarity relation on x and r a fuzzy relation on x. r is said to be:
• ω-reflexive if ω(x, y) ≤ r(x, y) for any x, y ∈ x;
• (∗, ω)-antisymmetric if r(x, y) ∗ r(y, x) ≤ ω(x, y) for any x, y ∈ x.
a (∗, ω)-order is a ∗-transitive, ω-reflexive and (∗, ω)-antisymmetric fuzzy relation r on x. let r, q be two (∗, ω)-orders on x. we say that q is an extension of r if r ⊆ q, i.e. r(x, y) ≤ q(x, y) for all x, y ∈ x. the following fuzzy generalization of the szpilrajn theorem was proved in [3]:
theorem 3. let ω be a ∗-similarity relation on x. then any (∗, ω)-order on x has a strongly complete extension.

3 some indicators

let ∗ be a left-continuous t-norm and ω a ∗-similarity relation on x.
definition 4. for any fuzzy relation r on x let us define:
ref(r) = ∧x∈x r(x, x);
trans(r) = ∧x,y,z∈x [r(x, y) ∗ r(y, z) → r(x, z)];
refω(r) = ∧x,y∈x [ω(x, y) → r(x, y)];
antω(r) = ∧x,y∈x [r(x, y) ∗ r(y, x) → ω(x, y)];
sc(r) = ∧x,y∈x (r(x, y) ∨ r(y, x));
ordω(r) = refω(r) ∧ antω(r) ∧ trans(r).
lemma 5. for any fuzzy relation r the following equivalences hold: (1) ref(r) = 1 iff r is reflexive; (2) trans(r) = 1 iff r is ∗-transitive; (3) refω(r) = 1 iff r is ω-reflexive; (4) antω(r) = 1 iff r is (∗, ω)-antisymmetric; (5) sc(r) = 1 iff r is strongly complete; (6) ordω(r) = 1 iff r is a (∗, ω)-order.
ref(r) will be called the degree of reflexivity of r, trans(r) the degree of ∗-transitivity of r, etc. the indicators introduced above refine the properties of reflexivity, transitivity, etc. of fuzzy relations.
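on a finite set the infima of definition 4 are just minima over all index tuples, so the indicators can be computed directly; a sketch for the gödel t-norm min, with a small invented relation:

```python
from itertools import product

def indicators(R, Omega):
    """degrees of definition 4 for the goedel t-norm (min) on the
    finite set {0, ..., n-1}; res is the goedel residuum."""
    n = len(R)
    res = lambda a, b: 1.0 if a <= b else b
    pairs = list(product(range(n), repeat=2))
    return {
        "ref":   min(R[x][x] for x in range(n)),
        "trans": min(res(min(R[x][y], R[y][z]), R[x][z])
                     for x, y, z in product(range(n), repeat=3)),
        "ref_w": min(res(Omega[x][y], R[x][y]) for x, y in pairs),
        "ant_w": min(res(min(R[x][y], R[y][x]), Omega[x][y])
                     for x, y in pairs),
        "sc":    min(max(R[x][y], R[y][x]) for x, y in pairs),
    }

Omega = [[1.0, 0.2], [0.2, 1.0]]   # a min-similarity relation
R     = [[1.0, 0.6], [0.2, 1.0]]   # invented fuzzy relation
deg = indicators(R, Omega)
deg["ord_w"] = min(deg["ref_w"], deg["ant_w"], deg["trans"])
```

for this r the computation gives ordω(r) = 1 but sc(r) = 0.6, so by lemma 5 it is a (∗, ω)-order that is not strongly complete — exactly the situation the extension theorem below addresses.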
thus, instead of saying that the fuzzy relation r is reflexive, the real number re f (r) will measure "the degree to which r is reflexive". proposition 6. let r be a fuzzy relation on x and x, y, z ∈ x . then (1) re f (r) ≤ r(x, x); (2) trans(r) ∗ r(x, y) ∗ r(y, z) ≤ r(x, z); (3) re fω(r) ∗ ω(x, y) ≤ r(x, y); (4) antω(r) ∗ r(x, y) ∗ r(y, x) ≤ ω(x, y); (5) ordω(r) ≤ ω(x, y) ↔ (r(x, y) ∗ r(y, x)); (6) ordω(r) ≤ r(x, x). 4 main result in this section we shall prove a generalization of the theorem of szpilrajn formulated in terms of the indicators introduced in the previous paragraph. the result will be valid for any fuzzy relations and in particular one will obtain theorem 3. let ∗ be a left–continuous t–norm and ω a ∗–similarity relation on x. if r and q are two fuzzy relations on x then we denote r 4 q iff r ⊆ q and ordω(r) ≤ ordω(q). it is easy to see that 4 is a partial order on the set of the fuzzy relations defined on x. if q is a fuzzy relation on x then we denote by ext(q) the set of all fuzzy relations r on x with the property that q 4 r. lemma 7. if q is a fuzzy relation on x then the partially ordered set (ext(q), 4) admits a maximal element. proof. we prove that (ext(q), 4) is inductive. we consider a chain (ri)i∈i in ext(q): for any i, j ∈ i we have ri 4 r j or r j 4 ri. of course q 4 ri for any i ∈ i. we will denote r = ⋃ i∈i ri. it suffices to prove that r ∈ ext(q). it is obvious that q ⊆ r therefore we have to prove that ordω(q) ≤ ordω(r). we show first that (a) ordω(q) ≤ re fω(r). let x, y ∈ x and i ∈ i. since q ⊆ ri it follows immediately ordω(q) ≤ re fω(q) ≤ re fω(ri) ≤ ω(x, y) → ri(x, y). by applying lemma 1 (2) and the previous inequality ordω(q) ∗ ω(x, y) ≤ ω(x, y) ∗ (ω(x, y) → ri(x, y)) = = ω(x, y) ∧ ri(x, y) ≤ ri(x, y) ≤ r(x, y) from where according to lemma 1 (1), ordω(q) ≤ ω(x, y) → r(x, y). it follows fuzzy szpilrajn theorem through indicators 339 ordω(q) ≤ ∧ x,y∈x (ω(x, y) → r(x, y)) = re fω(r). 
we intend to prove that (b) ordω(q) ≤ antω(r) let x, y ∈ x. then ordω(q) ∗ r(x, y) ∗ r(y, x) = ordω(q) ∗ [ ∨ i∈i ri(x, y)] ∗ [ ∨ j∈i r j(y, x)] = = ∨ i, j∈i ordω(q) ∗ ri(x, y) ∗ r j(y, x). let i, j ∈ i. assume that ri 4 r j therefore ri ⊆ r j and ordω(ri) ≤ ordω(r j). then, according to q 4 r j and proposition 6 (4): ordω(q) ∗ ri(x, y) ∗ r j(y, x) ≤ ordω(r j) ∗ r j(x, y) ∗ r j(y, x) ≤ ≤ antω(r j) ∗ r j(x, y) ∗ r j(y, x) ≤ ω(x, y). since this inequality is valid for any i, j ∈ i it follows ordω(q)∗ r(x, y)∗ r(y, x) ≤ ω(x, y), therefore, according to lemma 1 (1), ordω(q) ≤ r(x, y) ∗ r(y, x) → ω(x, y). from here we deduce ordω(q) ≤ ∧ x,y∈x [r(x, y) ∗ r(y, x) → ω(x, y)] = antω(r). we still have to prove (c) ordω(q) ≤ trans(r). let x, y ∈ x. then ordω(q) ∗ r(x, y) ∗ r(y, z) = ∨ i, j∈i ordω(q) ∗ ri(x, y) ∗ r j(y, x) let i, j ∈ i. assume ri 4 r j therefore ri ⊆ r j and ordω(ri) ≤ ordω(r j). according to proposition 6 (2): ordω(q) ∗ ri(x, y) ∗ r j(y, z) ≤ ordω(r j) ∗ r j(x, y) ∗ r j(y, z) ≤ ≤ trans(r j) ∗ r j(x, y) ∗ r j(y, z) ≤ r j(x, z) ≤ r(x, z) from where, ordω(q) ∗ r(x, y) ∗ r(y, z) ≤ r(x, z). by applying lemma 1 (1) it follows that for any x, y, z ∈ x we have ordω(q) ≤ r(x, y) ∗ r(y, z) → r(x, z), from where ordω(q) ≤ ∧ x,y,z∈x [r(x, y) ∗ r(y, z) → r(x, z)] = trans(r). from (a), (b) and (c) one obtains ordω(q) ≤ ordω(r). we have shown that (ext(q), 4) is inductive. according to zorn’s axiom a maximal element exists in ext(q). in the following we will situate ourselves in the case of the gödel t–norm ∧. theorem 8. let q be a fuzzy relation on x . then there exists a fuzzy relation r on x such that q 4 r and ordω(q) ≤ sc(r). proof. according to lemma 7 there exists a fuzzy relation r on x maximal in (ext(q), 4). then q 4 r. it remains to prove that ordω(q) ≤ sc(r). we assume by absurdum that ordω(q) 6≤ sc(r) = ∧ x,y∈x (r(x, y) ∨ r(y, x)). therefore there exist a, b ∈ x such that ordω(q) 6≤ r(a, b) ∨ r(b, a) from where ordω(q) 6≤ r(a, b) and ordω(q) 6≤ r(b, a). 
assume r(a, b) ≤ r(b, a). according to the lines above, r(b, a) < ordω(q). we define a new fuzzy relation r′ on x by r′(x, y) = r(x, y) ∨ (r(x, b) ∧ r(a, y)) for any x, y ∈ x. we intend to prove that r 4 r′. it is obvious that r ⊆ r′ therefore it remains to prove that ordω(r) ≤ ordω(r′). from r ⊆ r′ it follows immediately (1) ordω(r) ≤ re fω(r′) we establish now the inequality: (2) ordω(r) ≤ antω(r′) let x, y ∈ x. then 340 irina georgescu ordω(r) ∧ r′(x, y) ∧ r′(y, x) = = ordω(r) ∧ [r(x, y) ∨ (r(x, b) ∧ r(a, y))] ∧ [r(y, x) ∨ (r(y, b) ∧ r(a, x))] = = [ordω(r) ∧ r(x, y) ∧ r(y, x)] ∨ [ordω(r) ∧ r(x, y) ∧ r(y, b) ∧ r(a, x)]∨ ∨[ordω(r) ∧ r(y, x) ∧ r(x, b) ∧ r(a, y)] ∨ [ordω(r) ∧ r(x, b) ∧ r(a, y) ∧ r(y, b) ∧ r(a, x)] we will establish the following inequalities: (a) ordω(r) ∧ r(x, y) ∧ r(y, x) ≤ ω(x, y); (b) ordω(r) ∧ r(x, y) ∧ r(y, b) ∧ r(a, x) ≤ ω(x, y); (c) ordω(r) ∧ r(y, x) ∧ r(x, b) ∧ r(a, y) ≤ ω(x, y); (d) ordω(r) ∧ r(x, b) ∧ r(a, y) ∧ r(y, b) ∧ r(a, x) ≤ ω(x, y). in order to obtain (a) we use proposition 6 (4): ordω(r) ∧ r(x, y) ∧ r(y, x) ≤ antω(r) ∧ r(x, y) ∧ r(y, x) ≤ ω(a, b). we treat now the other three cases. according to proposition 6 (5) we have ordω(r) ≤ ω(a, b) ↔ (r(a, b) ∧ r(b, a)) = ω(a, b) ↔ r(a, b) we consider first the case r(x, y) ≤ r(y, x). then ordω(r) ≤ ω(x, y) ↔ r(x, y) with the same argument as above. (b) results like this: ordω(r) ∧ r(x, y) ∧ r(y, b) ∧ r(a, x) ≤ r(x, y) ∧ [ω(x, y) ↔ r(x, y)] ≤ ω(x, y) now we treat cases (c) and (d). first we notice that according to proposition 6 (2): ordω(r) ∧ r(y, x) ∧ r(x, b) ∧ r(a, y) ≤ trans(r) ∧ r(a, y) ∧ r(y, x) ∧ r(x, b) ≤ r(a, b) therefore ordω(r) ∧ r(y, x) ∧ r(a, y) ≤ r(a, b) ∧ [r(a, b) ↔ ω(a, b)] ≤ ω(a, b). analogously we obtain: ordω(r) ∧ r(x, b) ∧ r(a, y) ∧ r(y, b) ∧ r(a, x) ≤ ω(a, b). we consider the possible subcases: (i) ω(a, b) ≤ r(x, y); (ii) ω(a, b) > r(x, y). according to the proof above, in case (i) the inequalities (c) and (d) are immediate. we are situated now in case (ii). 
one notices that ordω(r) ∧ r(x, b) ∧ ω(a, b) ∧ r(a, y) ≤ ω(a, b) ∧ [ω(a, b) ↔ r(a, b)] ≤ r(a, b) ≤ r(b, a) therefore ordω(r) ∧ r(x, b) ∧ ω(a, b) ∧ r(a, y) ≤ ordω(r) ∧ r(x, b) ∧ r(b, a) ∧ r(a, y) ≤ trans(r) ∧ r(x, b) ∧ r(b, a) ∧ r(a, y) ≤ r(x, y) from ordω(r) ∧ r(x, b) ∧ ω(a, b) ∧ r(a, y) ≤ r(x, y) and ω(a, b) > r(x, y) it follows ordω(r) ∧ r(x, b) ∧ r(a, y) ≤ r(x, y). by using this last inequality we have ordω(r) ∧ r(y, x) ∧ r(x, b) ∧ r(a, y) ≤ ordω(r) ∧ r(x, b) ∧ r(a, y) ≤ r(x, y) from where ordω(r) ∧ r(y, x) ∧ r(x, b) ∧ r(a, y) ≤ r(x, y) ∧ [r(x, y) ↔ ω(x, y)] ≤ ω(x, y) thus (c) was proved and (d) follows analogously. the case r(y, x) ≤ r(x, y) is treated analogously. therefore the inequalities (a)–(d) are true, so ordω(r) ∧ r′(x, y) ∧ r′(y, x) ≤ ω(x, y). cf. lemma 1 (1) for any x, y, z ∈ x we have ordω(r) ≤ (r′(x, y) ∧ r′(y, x)) → ω(x, y) therefore ordω(r) ≤ ∧ x,y∈x [(r′(x, y) ∧ r′(y, x)) → ω(x, y)] = antω(r′) now we establish the inequality (3) ordω(r) ≤ trans(r′). let x, y, z ∈ x. we prove (4) ordω(r) ∧ r′(x, y) ∧ r′(y, z) ≤ r′(x, z). we notice that ordω(r) ∧ r′(x, y) ∧ r′(y, z) = fuzzy szpilrajn theorem through indicators 341 = ordω(r) ∧ [r(x, y) ∨ (r(x, b) ∧ r(a, y))] ∧ [r(y, z) ∨ (r(y, b) ∧ r(a, z))] = = [ordω(r) ∧ r(x, y) ∧ r(y, z)] ∨ [ordω(r) ∧ r(x, y) ∧ r(y, b) ∧ r(a, z)]∨ ∨[ordω(r) ∧ r(y, z) ∧ r(x, b) ∧ r(a, y)] ∨ [ordω(r) ∧ r(x, b) ∧ r(a, y) ∧ r(y, b) ∧ r(a, z)]. then to prove (4) is equivalent with establishing the following inequalities: (e) ordω(r) ∧ r(x, y) ∧ r(y, z) ≤ r′(x, z); (f) ordω(r) ∧ r(x, y) ∧ r(y, b) ∧ r(a, z) ≤ r′(x, z); (g) ordω(r) ∧ r(y, z) ∧ r(x, b) ∧ r(a, y) ≤ r′(x, z); (h) ordω(r) ∧ r(x, b) ∧ r(a, y) ∧ r(y, b) ∧ r(a, z) ≤ r′(x, z). (e) follows by applying proposition 6 (2): ordω(r) ∧ r(x, y) ∧ r(y, z) ≤ trans(r) ∧ r(x, y) ∧ r(y, z) ≤ r(x, z) ≤ r′(x, z). 
(f) and (g) follow like this: ordω(r) ∧ r(x, y) ∧ r(y, b) ∧ r(a, z) ≤ (trans(r) ∧ r(x, y) ∧ r(y, b)) ∧ r(a, z) ≤ r(x, b) ∧ r(a, z) ≤ r′(x, z); ordω(r) ∧ r(y, z) ∧ r(x, b) ∧ r(a, y) ≤ (trans(r) ∧ r(a, y) ∧ r(y, z)) ∧ r(x, b) ≤ r(x, b) ∧ r(a, z) ≤ r′(x, z). (h) follows similarly. we established (e)–(h), therefore (4) is true. cf. lemma 1 (1) for any x, y, z ∈ x we have ordω(r) ≤ (r′(x, y) ∧ r(y, z)) → r′(x, z), from where ordω(r) ≤ ∧ x,y,z∈x [(r′(x, y) ∧ r′(y, z)) → r′(x, z)] = trans(r′) from (1), (2) and (3) we deduce ordω(r) ≤ ordω(r′) therefore r 4 r′. we can see that r′(a, b) = r(b, a) ∨ (r(b, b) ∧ r(a, a)) and r(b, a) < ordω(q) ≤ ordω(r) (since q 4 r). then, by applying proposition 6 (6): r(b, a) < ordω(r) ≤ r(b, b) ∧ r(a, a) ≤ r′(a, b). it follows r 6= r′, contradicting the maximality of r. we conclude ordω(q) ≤ sc(r), therefore the theorem is proved. remark 9. by applying lemma 5 we see that theorem 3 is a particular case of theorem 8. bibliography [1] r. bělohlávek, fuzzy relational systems. foundations and principles, kluwer, 2002. [2] u. bodenhofer, similarity–based generalizations of fuzzy orderings preserving the classical axioms, international journal of uncertainty, fuzziness and knowledge based systems, vol. 3, pp. 593–610, 2000. [3] u. bodenhofer, f. klawonn, a formal study of linearity axioms for fuzzy orderings, fuzzy sets and systems, vol. 145, pp. 323–354, 2004. [4] s. gottwald, fuzzy sets and fuzzy logic, vieweg, braunschweig, 1993. [5] p. hájek, methamathematics of fuzzy logic, kluwer, 1998. [6] u. höhle, n. blanchard, partial ordering in l–undeterminate sets, information sciences, vol. 35, pp. 135–144, 1985. [7] e. p. klement, r. mesiar, e. pap, triangular norms , kluwer, 2000. [8] m. richter, revealed preference theory, econometrica, vol. 34, pp. 635–645, 1966. [9] i. j. rudas, j. fodor, information aggregation in intelligent systems using generalized operators, international journal of computers, communications and control, vol. 1, pp. 
47-57, 2006.
[10] e. szpilrajn, sur l'extension de l'ordre partiel, fundamenta mathematicae, vol. 16, pp. 386-389, 1930.
[11] l. a. zadeh, similarity relations and fuzzy orderings, information sciences, vol. 3, pp. 177-200, 1971.

irina georgescu, academy of economic studies, department of economic cybernetics, piata romana no 6, r 70167, oficiul postal 22, bucharest, romania, e-mail: irina.georgescu@csie.ase.ro

irina georgescu received her phd in economics (information systems) from abo akademi university, turku, finland in 2005. since then she has been a teaching assistant at the department of economic cybernetics, academy of economic studies, bucharest, romania. between 2007 and 2008 she was a postdoctoral researcher at abo akademi university, turku, finland. she is the author of about 30 scientific publications and of a monograph issued by springer. her research interests are mainly in the area of soft computing techniques, consumer theory, social choice and welfare economics.

international journal of computers, communications & control vol. iii (2008), no. 1, pp. 6-20

robust fuzzy sliding mode controller for discrete nonlinear systems

hafedh abid, mohamed chtourou, ahmed toumi

abstract: in this work we are interested in discrete robust fuzzy sliding mode control. the discrete siso nonlinear uncertain system is represented by a takagi-sugeno type fuzzy state model. we recall the principles of sliding mode control theory, then we combine fuzzy systems with the sliding mode control technique to compute the control law at each sampling time. the control law comprises two terms: an equivalent control law and a switching control law with a high frequency. the uncertainty is replaced by its upper bound. an inverted pendulum and a mass-spring-damper are used to check the performance of the proposed robust fuzzy sliding mode control scheme. keywords: nonlinear systems, sliding mode, t-s fuzzy systems, reaching law.
1 introduction

many industrial plants include nonlinearities and/or uncertainties. to reach the desired performance using the classical theories, the nonlinearities must be identified in order to compute the appropriate controller. robust control theory is one of the techniques that allows the desired performance to be reached in the presence of external and/or internal disturbances. in addition to stability, the tracking problem must be solved independently of the uncertainties. in the literature, many methods have been developed, in continuous as well as in discrete time, to solve the tracking problem for nonlinear systems. in the last decade many researchers have combined classical techniques with intelligent ones, such as sliding mode with neural systems or sliding mode with fuzzy systems [3][4], to benefit from the advantages of both control techniques. sliding mode control (smc) was originally developed for variable structure systems in the continuous domain. utkin [12] gives a thorough description of the sliding mode theory in continuous time. also, slotine and li [16] describe continuous sliding mode controllers in detail. at the end of the twentieth century, research on discrete-time smc attracted more attention [6],[8],[9],[10], as the implementation of the controller on a digital computer requires a sampling time and the assumption of an infinite switching frequency does not hold any more. fuzzy systems have been combined with classical sliding mode control to provide robust stability to the fuzzy controller. the combination of the two control principles is called fuzzy sliding mode control (fsmc); it provides an alternative way to design a robust controller for nonlinear systems with uncertainties [15],[14]. our contribution in this work consists in presenting a new robust fuzzy sliding mode controller based on the takagi-sugeno fuzzy state model for discrete nonlinear systems. this paper is organised as follows.
in section 2, we recall the discrete takagi-sugeno type fuzzy state model for nonlinear systems. in the first part of the third section, we describe the sufficient and necessary reaching conditions of sliding mode control for discrete nonlinear systems; then, a fuzzy sliding mode controller for discrete-time nonlinear systems is developed in the second part, and the tracking robust fuzzy sliding mode control law is described in the third part. the simulation results of two nonlinear systems show the performance of the proposed fsmc in section 4. conclusions are drawn in the final section.

2 problem statement and fuzzy systems

2.1 problem statement

consider a class of discrete nonlinear siso systems described by the following equations:

x1(k + 1) = x2(k)
xi(k + 1) = xi+1(k)
xn(k + 1) = f(x(k)) + g(x(k)) u(k)
y(k) = c x(k)     (1)

where c = [1, 0, ..., 0] and x(k) = [x1(k) . . . xn(k)]t ∈ rn is the state vector, which is assumed to be observable. we note that f(x(k)) and g(x(k)) represent two discrete bounded nonlinear functions of the nonlinear siso system. they can be obtained from the continuous form by a first-order discretization of the system using euler's approximation. for (1) to be controllable, it is required that g(x(k)) ≠ 0. if both functions f(x(k)) and g(x(k)) in (1) are available for feedback, feedback linearization control can be used to design a well-defined controller, which is usually given in the form:

u(k) = (1 / g(x(k))) ( −kt e(k) − f(x(k)) + xnd(k + 1) )     (2)

where the state vector x(k) and the desired state vector xd(k) are defined as x(k) = [x1(k) . . . xn(k)]t ∈ rn and xd(k) = [x1d(k) . . . xnd(k)]t ∈ rn. e(k) represents the state tracking error, defined as e(k) = x(k) − xd(k). the vector k = [kn, kn−1, . . . ,
k1]t ∈ rn will be chosen such that all roots of the polynomial h(z) = zn + k1 zn−1 + ... + kn are situated inside the unit complex disc. in the general case, the functions f(x(k)) and g(x(k)) are badly known nonlinear functions, so the control law (2) cannot be implemented. to overcome this difficulty, many approaches are used, such as adaptive control, linearization around operating points, fuzzy control, etc.

2.2 discrete takagi-sugeno type fuzzy systems

the advantage of the t-s type fuzzy models is that their description permits the utilization of the state representation and, consequently, the exploitation of the maximum of this representation's potential. the takagi-sugeno (t-s) type fuzzy model can be viewed as a natural expansion of a piecewise linear partition for nonlinear systems. the nonlinear system is represented as a collection of fuzzy if-then rules, where each rule describes the local dynamics by a linear system model. the overall fuzzy model is achieved by fuzzy amalgamation of the linear system models [1][2]. the ith rule of the discrete fuzzy model has the following form:

ith plant rule: if z1 is µi1 and ... and zn is µin then x(k + 1) = (adi + ∆adi) x(k) + (bdi + ∆bdi) u(k)     (3)

where {µi j} are the fuzzy sets, adi ∈ rn×n and bdi ∈ rn×m are respectively the ith state matrix and input matrix, c is the number of if-then fuzzy rules, u(k) is the input vector, and z(k) = [z1(k) . . . zn(k)] are the premise variables; they represent some measurable system variables and can be chosen as state variables. to each rule ri is attributed a weight wi(z(k)) which depends on the grade of membership of the premise variables z j(k) in the fuzzy sets µi j: wi(z(k)) = ∏ j=1..n µi j(z j(k)), with wi(z(k)) ≥ 0 for i = 1, ..., c and ∑ i=1..c wi(z(k)) > 0, where µi j(z j(k)) is the grade of membership of z j(k) in the fuzzy set µi j.
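going back to the problem statement, the exact-cancellation law (2) is straightforward to simulate when f and g are known; a minimal sketch for an invented second-order system (the dynamics, gains and horizon below are illustrative, not from the paper):

```python
import math

# toy plant of form (1): x1(k+1) = x2(k), x2(k+1) = f(x) + g(x) u(k)
f = lambda x: 0.1 * math.sin(x[0]) + 0.9 * x[1]   # invented dynamics
g = lambda x: 1.0 + 0.1 * x[1] ** 2               # never zero -> controllable

# K = [k_n, ..., k_1]: here z^2 + k1 z + k2 = z^2 + z + 0.25,
# a double root at -0.5, inside the unit disc
K = [0.25, 1.0]
x = [1.0, -0.5]
xd = [0.0, 0.0]          # regulate to the origin, so x_nd(k+1) = 0

for k in range(60):
    e = [x[0] - xd[0], x[1] - xd[1]]
    # control law (2): cancel f and g, impose the error dynamics
    u = (-(K[0] * e[0] + K[1] * e[1]) - f(x) + 0.0) / g(x)
    x = [x[1], f(x) + g(x) * u]
```

with f and g cancelled exactly, the error obeys the linear recursion chosen through k and decays geometrically; the point of the rest of the section is precisely that f and g are usually not known well enough for this.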
The discrete Takagi-Sugeno type fuzzy model is inferred as follows:

$$x(k+1) = \frac{\sum_{i=1}^{c} w_i(z(k))\left((A_{di} + \Delta A_{di})x(k) + (B_{di} + \Delta B_{di})u(k)\right)}{\sum_{i=1}^{c} w_i(z(k))} \quad (4)$$

The normalized weight is defined as:

$$h_i(z(k)) = \frac{w_i(z(k))}{\sum_{i=1}^{c} w_i(z(k))}; \quad 0 \leq h_i(z(k)) \leq 1,\ i = 1, \ldots, c; \quad \sum_{i=1}^{c} h_i(z(k)) = 1.$$

The output of the discrete Takagi-Sugeno type fuzzy model for the uncertain nonlinear systems can then be inferred as:

$$\begin{cases} x(k+1) = \sum_{i=1}^{c} h_i(z(k))\left((A_{di} + \Delta A_{di})x(k) + (B_{di} + \Delta B_{di})u(k)\right) \\ y(k) = \sum_{i=1}^{c} h_i(z(k))\,C_i\,x(k) \end{cases} \quad (5)$$

where $C_i = [1, 0, 0, \ldots, 0]$. It is required that all $B_{di}(k)$ are different from zero to ensure the controllability of (5).

3 Sliding mode control law and reaching conditions

Sliding mode control first appeared in the Soviet literature and has been widely recognized as a potential approach to uncertain dynamical nonlinear systems subject to external disturbances and parameter variations [12]. In sliding mode control (SMC), the control action forces the system trajectories to cross a manifold of the state space, called the sliding surface, designated by the designer [12]. The system trajectories are then constrained to the sliding surface for all subsequent time by means of high-speed switching controls. The most significant advantage of the sliding mode is robustness against changes in system parameters or disturbances. The major disadvantage of sliding mode control is the chattering phenomenon, because it can excite undesirable high-frequency dynamics. Sliding mode control comprises three modes, namely the reaching mode (RM), the sliding mode (SM) and the steady-state mode (SS). Let us now describe discrete sliding mode control; relatively few studies, however, have addressed discrete-time systems.
A discrete version of SMC is of great importance when the control is implemented with digital components, which need a sampling period to compute the appropriate control. It must be pointed out that the discrete version of SMC cannot be obtained from its continuous counterpart by simple equivalence. Among the first to address the discrete SMC problem, Dote and Hoft [5] used an equivalent form of the continuous reaching condition to give a discrete reaching condition:

$$[s(k+1) - s(k)]\,s(k) < 0 \quad (6)$$

Milosavljevic [6] introduced the concept of the quasi-sliding mode and signalled that condition (6) is not sufficient for a discrete sliding mode control. Sarpturk et al. [7] used the following reaching condition:

$$|s(k+1)| < |s(k)| \quad (7)$$

Furuta [8] used the equivalent form of a Lyapunov-type continuous reaching condition to give the discrete version:

$$V(k+1) - V(k) < 0, \quad \text{with } V(k) = \tfrac{1}{2}\,s(k)^2 \quad (8)$$

Weibing Gao et al. [9] pointed out that all these forms of reaching conditions are incomplete for a satisfactory characterization of a discrete-time sliding mode. They suggest that the state trajectory of a discrete sliding mode control system must have certain attributes which form the basis of discrete sliding mode control; for more information see [9].

3.1 Discrete fuzzy sliding mode control law

For a discrete SMC the following reaching law has been chosen:

$$s(k+1) = s(k) - qT\,s(k) - \varepsilon T\,\mathrm{sgn}(s(k)), \quad \text{with } 1 - qT > 0,\ \varepsilon > 0,\ q > 0. \quad (9)$$

The sliding surface is defined as $s(k) = G^T x(k)$, where $G^T = [g_1, \ldots, g_{n-1}, 1]$ is a constant row vector chosen such that all roots of the polynomial $h(s) = s^{n-1} + g_{n-1}s^{n-2} + \cdots + g_1$ lie in the open left half of the complex plane. The sliding mode control comprises two terms: an equivalent control term and a switching control term [3][5][7][12]:
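As a quick sanity check on the reaching law (9), one can iterate it directly and watch $|s(k)|$ contract into the quasi-sliding band. A minimal sketch, using the gains quoted later for the pendulum example ($q = 70$, $T = 0.01$, $\varepsilon = 0.1$):

```python
import math

def reaching_step(s, q=70.0, T=0.01, eps=0.1):
    # s(k+1) = s(k) - q*T*s(k) - eps*T*sgn(s(k)), with 1 - q*T > 0
    return (1.0 - q * T) * s - eps * T * math.copysign(1.0, s)

s = 1.0
history = [s]
for _ in range(50):
    s = reaching_step(s)
    history.append(s)
# |s| shrinks geometrically, then chatters inside a band of width O(eps*T)
```

The iterate contracts by the factor $1 - qT = 0.3$ per step until it reaches a residual band of order $\varepsilon T$, which is exactly the quasi-sliding behaviour the text describes.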
$$u_g = u_e + u_s \quad (10)$$

The equivalent control law. In this first part we assume that $\Delta A_{di} = 0_{n \times n}$ and $\Delta B_{di} = [0, \ldots, 0]^T$. The switching function is defined as $s(k) = G^T x(k)$, and the ideal quasi-sliding mode satisfies $s(k+1) = s(k) = 0$. We deduce:

$$0 = G^T \sum_{i=1}^{c} h_i(z(k))\left(A_{di}\,x(k) + B_{di}\,u_e(k)\right), \quad k = 0, 1, \ldots \quad (11)$$

The equivalent control term is given by:

$$u_e(k) = -\left(\sum_{i=1}^{c} h_i(z(k))\,G^T B_{di}\right)^{-1}\left[G^T \sum_{i=1}^{c} h_i(z(k))\,A_{di}\,x(k)\right] \quad (12)$$

We assume that the following condition holds: $\sum_{i=1}^{c} h_i(z(k))\,G^T B_{di} \neq 0$.

The switching control law. From the reaching law we can write $s(k+1) - s(k) = -qT\,s(k) - \varepsilon T\,\mathrm{sgn}(s(k))$, and

$$s(k+1) - s(k) = G^T \sum_{i=1}^{c} h_i(z(k))\left(A_{di}\,x(k) + B_{di}\,u(k)\right) - G^T x(k) \quad (13)$$

Comparing the two last equations, we deduce the global control law:

$$u_g(k) = -Z^{-1}\left[\sum_{i=1}^{c} h_i(z(k))\left(G^T A_{di}\right)x(k) - (1-qT)\,s(k) + \varepsilon T\,\mathrm{sgn}(s(k))\right] \quad (14)$$

where $Z = \sum_{i=1}^{c} h_i(z(k))\,G^T B_{di}$. From the equations above we obtain the switching control term:

$$u_s(k) = -\left(\sum_{i=1}^{c} h_i(z(k))\,G^T B_{di}\right)^{-1}\left[-(1-qT)\,s(k) + \varepsilon T\,\mathrm{sgn}(s(k))\right] \quad (15)$$

3.2 Robust fuzzy sliding mode control law

Consider the discrete system in the perturbed condition, described by the T-S type fuzzy model, where $\Delta A_{di}$ represents the system parameter variation and $\Delta B_{di}$ the external disturbance for each sub-model. We assume the matching conditions are satisfied: $\Delta A_{di} = B_{di}\,\hat{a}_{di}$ and $\Delta B_{di} = B_{di}\,\hat{b}_{di}$, where $\hat{a}_{di}$ is a row vector and $\hat{b}_{di}$ is a scalar. They can be written as $\hat{a}_{di} = [-\Delta a_{i1}\ -\Delta a_{i2}\ \ldots\ -\Delta a_{in}]$ and $\hat{b}_{di} = -\Delta b_i$.
Then equation (5) becomes:

$$x(k+1) = \sum_{i=1}^{c} h_i(z(k))\left(A_{di}\,x(k) + B_{di}\,u(k) + B_{di}\,\hat{a}_{di}\,x(k) + B_{di}\,\hat{b}_{di}\right) \quad (16)$$

and the global control law becomes:

$$u_g(k) = -Z^{-1}\left[\sum_{i=1}^{c} h_i(z(k))\left(G^T A_{di}\right)x(k) + \gamma - (1-qT)\,s(k) + \varepsilon T\,\mathrm{sgn}(s(k))\right] \quad (17)$$

where $\gamma = G^T B_{di}\left(\hat{a}_{di}\,x(k) + \hat{b}_{di}\right)$. In the general case $\hat{a}_{di}$ and $\hat{b}_{di}$ are unknown, but their upper bounds are known, so this last global control law cannot be implemented. To overcome this difficulty we replace the unknown terms $\hat{a}_{di}$ and $\hat{b}_{di}$ respectively by the following expressions:

$$a_i = \left(\sqrt{\mathrm{eig}\left(\hat{a}_{diup}^{\,T}\,\hat{a}_{diup}\right)}\right)^T; \quad b_i = \Delta b_{i\,\max};$$
$$\hat{a}_{diup} = [-\Delta a_{i1\,\max}\ -\Delta a_{i2\,\max}\ \ldots\ -\Delta a_{in\,\max}]; \quad \hat{b}_{diup} = -\Delta b_{i\,\max}.$$

We define a new set of perturbation and control parameters as follows:

$$s_{ig} = G^T B_{di}\,a_i\,x(k); \quad f_{ig} = G^T B_{di}\,b_i.$$

The choice of $s_{ig}$ and $f_{ig}$ ensures that the sign of the increment of $s(k)$ is opposite to the sign of $s(k)$. The global control law is then:

$$u_g(k) = -Z^{-1}\left[\sum_{i=1}^{c} h_i(z(k))\left(G^T A_{di}\right)x(k) + q_i - (1-qT)\,s(k) + \varepsilon T\,\mathrm{sgn}(s(k))\right] \quad (18)$$

where $q_i = (s_{ig} + f_{ig}) - (s_{ig} + f_{ig})\,\mathrm{sgn}(s(k))$.

3.3 Tracking robust fuzzy sliding mode control law

The tracking problem is transformed into a stability problem: indeed, $s(k) = 0$ is an equation whose unique stationary solution is $\tilde{x}(k) = 0$. The problem of tracking the desired vector $x_d$ comes down to keeping the sliding surface inside the quasi-sliding band for all sampling times. The sliding surface is expressed as $s(k) = G^T \tilde{x}(k)$, where $\tilde{x}(k) = x(k) - x_d(k)$, and the control law becomes:

$$u_g(k) = -Z^{-1}\left[\sum_{i=1}^{c} h_i(z(k))\left(G^T A_{di}\right)\tilde{x}(k) + q_i - (1-qT)\,s(k) + \varepsilon T\,\mathrm{sgn}(s(k))\right] \quad (19)$$

4 Illustration

To illustrate the performance of the presented approach, we choose the inverted pendulum and the mass-spring-damper, which are widely used in the control literature on nonlinear systems.
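Before turning to the examples, the nominal global law (14) can be sketched numerically. This is an illustrative sketch, not the authors' code; the single-rule check below reuses the pendulum sub-model $A_{d1}$, $B_{d1}$ quoted in Section 4.1, and with an exact T-S model one closed-loop step must reproduce the reaching law by construction:

```python
import numpy as np

def fsmc_u(h, Ad, Bd, G, x, q=70.0, T=0.01, eps=0.1):
    """Nominal law (14): u = -Z^{-1}[ sum_i h_i (G^T A_i) x - (1-qT)s + eps*T*sgn(s) ]."""
    s = float(G @ x)
    Z = sum(hi * float(G @ Bi) for hi, Bi in zip(h, Bd))
    drive = sum(hi * float(G @ (Ai @ x)) for hi, Ai in zip(h, Ad))
    return -(drive - (1.0 - q * T) * s + eps * T * np.sign(s)) / Z

# single-rule check: closed-loop s(k+1) must equal the reaching-law value
A = np.array([[1.0, 0.01], [0.1729, 1.0]])
B = np.array([0.0, 0.0018])
G = np.array([10.0, 1.0])
x = np.array([0.5, 0.0])
u = fsmc_u([1.0], [A], [B], G, x)
x_next = A @ x + B * u
s, s_next = float(G @ x), float(G @ x_next)
```

With one rule firing, $s(k+1) = G^T(Ax + Bu) = (1-qT)s(k) - \varepsilon T\,\mathrm{sgn}(s(k))$ exactly, which is what the test below verifies.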
4.1 Inverted pendulum

The equations of the system in continuous form are given by (20) [18]:

$$\begin{cases} \dot{x}_1(t) = x_2(t) \\ \dot{x}_2(t) = f(x_1, x_2) + g(x_1, x_2)\,u + d(t) \end{cases}$$
$$f(x_1, x_2) = \frac{m\,l\,x_2^2\,\sin x_1\,\cos x_1 - (M+m)\,g\,\sin x_1}{m\,l\,\cos^2 x_1 - \frac{4l}{3}(M+m)}; \quad g(x_1, x_2) = \frac{\cos x_1}{m\,l\,\cos^2 x_1 - \frac{4l}{3}(M+m)} \quad (20)$$

where $x_1$ is the angle (in radians) of the pendulum from the vertical axis, $x_2$ is the angular velocity in rad/s, $g$ is the gravitational acceleration, $m$ and $2l$ are respectively the mass and the length of the pendulum, $M$ is the mass of the cart, and $u$ is the force applied to the cart. The nominal values of the parameters are: $g = 9.81\ \mathrm{m/s^2}$, $m = 2\ \mathrm{kg}$, $M = 8\ \mathrm{kg}$, $2l = 1\ \mathrm{m}$.

The membership functions for $x_i \in\ ]-\pi/2, \pi/2[$ are:

$$\mu_{1i}(x_i) = 1 - \left|\frac{x_i(k)}{\pi/2}\right| \quad \text{and} \quad \mu_{2i}(x_i) = \left|\frac{x_i(k)}{\pi/2}\right|.$$

The state matrices and input vectors of the sub-systems are:

$$A_{d1} = \begin{bmatrix} 1 & 0.01 \\ 0.1729 & 1 \end{bmatrix}; \quad A_{d2} = \begin{bmatrix} 1 & 0.01 \\ 0.0936 & 1 \end{bmatrix}; \quad B_{d1} = \begin{bmatrix} 0 \\ 0.0018 \end{bmatrix}; \quad B_{d2} = \begin{bmatrix} 0 \\ 0.000052 \end{bmatrix};$$
$$\Delta A_{d1up} = \begin{bmatrix} 0 & 0 \\ 0.05229 & 0 \end{bmatrix}; \quad \Delta A_{d2up} = \begin{bmatrix} 0 & 0 \\ 0.028 & 0 \end{bmatrix}; \quad \Delta B_{d1up} = 0.0005; \quad \Delta B_{d2up} = 0.00002.$$

We have chosen $q = 70$, $T = 0.01$, $\varepsilon = 0.1$, $G^T = [10\ \ 1]$. Figure 1 presents the simulation results for the behaviour of the state variable $x_1(k)$ and of $s(k)$ for the nominal inverted pendulum. The initial conditions are $x(0) = [\pi/3;\ 0]$.
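The pendulum's two membership functions can be sketched directly; at $x_1 = 0$ all weight is on the first rule, and the two grades always sum to one on $]-\pi/2, \pi/2[$:

```python
import math

def pendulum_memberships(x1):
    # mu_1(x1) = 1 - |x1/(pi/2)|, mu_2(x1) = |x1/(pi/2)|, for x1 in (-pi/2, pi/2)
    a = abs(x1 / (math.pi / 2))
    return 1.0 - a, a

m1_mid, m2_mid = pendulum_memberships(math.pi / 4)   # halfway: equal weights
m1_zero, m2_zero = pendulum_memberships(0.0)         # upright: rule 1 only
```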
Figures 2, 3, 4, 5, 6 and 7 present the simulation results for the behaviour of the state variables $x_1(k)$ and $x_2(k)$, the position and velocity errors $e_1(k)$ and $e_2(k)$, the sliding surface and the control law, respectively, for the inverted pendulum with parameters varying in an uncertain way in time. The initial conditions are $x(0) = [-\pi/60;\ 0]$, with $G^T = [15\ \ 1]$, $q = 80$, $\varepsilon = 1.5$. The sign function is replaced by the well-known sat function, defined as:

$$\mathrm{sat}(s) = \begin{cases} s/\Phi & \text{if } |s/\Phi| < 1 \\ \mathrm{sgn}(s) & \text{if } |s/\Phi| \geq 1 \end{cases}$$

Figure 1: Stabilization of $x_1(k)$ and the sliding surface $s(k)$.

4.2 Mass-spring-damper

The mass-spring-damper system is described in continuous time by the following equation [17]:

$$M_1\,\ddot{x}_1(t) + c_1\,x_1(t) + c_2\,\dot{x}_1(t) + c_3\,x_1(t) + c_4\,x_1^3(t) = \left(1 + c_5\,\dot{x}_1^3(t)\right)u(t)$$

The T-S fuzzy model is described by the following rules:

Rule 1: If $x_1$ is $M_{11}$ and $x_2$ is $M_{12}$ Then $x(k+1) = A_{d1}x(k) + B_{d1}u(k)$
Rule 2: If $x_1$ is $M_{21}$ and $x_2$ is $M_{22}$ Then $x(k+1) = A_{d2}x(k) + B_{d2}u(k)$
Rule 3: If $x_1$ is $M_{31}$ and $x_2$ is $M_{32}$ Then $x(k+1) = A_{d3}x(k) + B_{d3}u(k)$
Rule 4: If $x_1$ is $M_{41}$ and $x_2$ is $M_{42}$ Then $x(k+1) = A_{d4}x(k) + B_{d4}u(k)$

For the nominal values of $M_1$, $c_2$, $c_3$ and $c_4$, the matrices $A_i$ and $B_i$ are given by:

$$A_{d1} = A_{d2} = \begin{bmatrix} 1 & 0.01 \\ -0.0001 & 0.99 \end{bmatrix}; \quad A_{d3} = A_{d4} = \begin{bmatrix} 1 & 0.01 \\ -0.0023 & 0.99 \end{bmatrix};$$
$$B_{d1} = B_{d3} = \begin{bmatrix} 0 \\ 0.0143 \end{bmatrix}; \quad B_{d2} = B_{d4} = \begin{bmatrix} 0 \\ 0.0056 \end{bmatrix};$$
$$\Delta A_{d1up} = \Delta A_{d2up} = \begin{bmatrix} 0 & 0 \\ 0 & 0.003 \end{bmatrix}; \quad \Delta A_{d3up} = \Delta A_{d4up} = \begin{bmatrix} 0 & 0 \\ 0.0007 & 0.003 \end{bmatrix};$$
$$\Delta B_{d1up} = \Delta B_{d3up} = 0.043; \quad \Delta B_{d2up} = \Delta B_{d4up} = 0.0017.$$

The initial conditions and parameters are chosen as $x(0) = [-\pi/60;\ 0]$, $G^T = [15\ \ 1]$, $T = 0.01$, $q = 70$, $\varepsilon = 0.15$. Figures 8 and 9 present the simulation results for the behaviour of the state variables $x_1(k)$ and $x_2(k)$ of the mass-spring-damper with parameters varying in an uncertain way in time.
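The boundary-layer sat function used above for the simulations can be sketched as follows (the exact form is reconstructed from the usual definition, with $\Phi$ the boundary-layer width):

```python
def sat(s, phi):
    # linear inside the boundary layer |s| < phi, saturates to sign(s) outside
    if abs(s) < phi:
        return s / phi
    return 1.0 if s > 0 else -1.0

inside = sat(0.05, 0.1)   # within the layer: scaled value, no chattering
outside = sat(2.0, 0.1)   # outside the layer: behaves like sgn
```

Replacing the discontinuous sign by this continuous approximation is the standard remedy for the chattering phenomenon mentioned in Section 3.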
Figures 10, 11, 12 and 13 present the simulation results for the behaviour of the position and velocity errors $e_1(k)$ and $e_2(k)$, the sliding surface and the control law, respectively, for the mass-spring-damper with parameters varying in an uncertain way in time.

Figure 2: Evolution of $x_1$ and $x_{1d}$.
Figure 3: Evolution of $x_2$ and $x_{2d}$.
Figure 4: Evolution of the position error.
Figure 5: Evolution of the speed error.
Figure 6: Evolution of the sliding surface.
Figure 7: Evolution of the control law.
Figure 8: Evolution of $x_1$ and $x_{1d}$.
Figure 9: Evolution of $x_2$ and $x_{2d}$.
Figure 10: Evolution of the position error.
Figure 11: Evolution of the speed error.
Figure 12: Evolution of the sliding surface.
Figure 13: Evolution of the control law.

5 Conclusions

In this paper we have presented a robust fuzzy sliding mode controller for discrete nonlinear systems. First we recalled the discrete Takagi-Sugeno type fuzzy model, then the principle of sliding mode control in discrete time. The uncertainties are assumed to verify the matching conditions. We developed a robust controller based on the sliding mode and the dynamic T-S fuzzy state model; the uncertainties are replaced by the largest eigenvalue of the upper-bound matrices of the uncertainties. The discrete-time expressions of both the equivalent control term and the hitting term were developed, as well as the tracking control law. Simulation results for the inverted pendulum and the mass-spring-damper with parameter variations show the performance of the proposed control law.

Bibliography

[1] M. Sugeno, "On stability of fuzzy systems expressed by rules with singleton consequents," IEEE Transactions on Fuzzy Systems, Vol. 7, No. 2, Feb. 1999.

[2] T. Takagi and M. Sugeno, "Fuzzy identification of systems and its applications to modeling and control," IEEE Trans. Syst., Man, Cybern., Vol. 15, pp. 116-132, Jan./Feb. 1985.
[3] W. Chang, J. Bae Park, Y. Hoan Joo and G. Chen, "Design of robust fuzzy model-based controller with sliding mode control for SISO non-linear systems," Fuzzy Sets and Systems, 125 (2002), pp. 1-22.

[4] Hafedh Abid, Mohamed Chtourou and Ahmed Toumi, "A sliding mode based robust fuzzy controller for a class of uncertain systems," SSD05, March 2005, Sousse, Tunisia.

[5] Y. Dote and R. G. Hoft, "Microprocessor based sliding mode controller for DC motor drives," presented at the Industrial Application Soc. Annu. Meeting, Cincinnati, OH, 1980.

[6] D. Milosavljevic, "General conditions for the existence of a quasi-sliding mode on the switching hyperplane in discrete variable systems," Automat. Remote Contr., Vol. 46, pp. 307-314, 1985.

[7] S. Z. Sarpturk, Y. Istefanopulos, and O. Kaynak, "On the stability of discrete-time sliding mode control systems," IEEE Trans. Automat. Contr., Vol. 32, No. 10, pp. 930-932, 1987.

[8] K. Furuta, "Sliding mode control of a discrete system," Systems and Control Letters, Vol. 14, pp. 145-152.

[9] Weibing Gao, Yufu Wang and Abdollah Homaifa, "Discrete-time variable structure control systems," IEEE Transactions on Industrial Electronics, Vol. 42, No. 2, April 1995, pp. 117-122.

[10] Weibing Gao and J. C. Hung, "Variable structure control of nonlinear systems: a new approach," IEEE Transactions on Industrial Electronics, 40, 45, 1993.

[11] H. Lee, E. Kim, H. J. Kang and M. Park, "A new sliding-mode control with fuzzy boundary layer," Fuzzy Sets and Systems, 120 (2001), pp. 135-143.

[12] V. I. Utkin, Sliding Modes and Their Application in Variable Structure Systems, Moscow: Nauka, 1974 (in Russian; also 1978, Mir, in English).

[13] V. I. Utkin, "Variable structure systems with sliding mode," IEEE Transactions on Automatic Control, Vol. AC-22, No. 2, pp. 212-222, April 1977.

[14] X. H. Yu, Z. H. Man and B. L. Wu, "Design of fuzzy sliding-mode control systems," Fuzzy Sets and Systems, 95, pp. 295-306, 1998.

[15] Ting, C.
S., Li, T. H. S., and Kung, F. C., "An approach to systematic design of the fuzzy control system," Fuzzy Sets and Systems, 77, pp. 151-166, 1996.

[16] Slotine, J. J. E. and Li, W., Applied Nonlinear Control, Prentice Hall, Englewood Cliffs, NJ, 1991.

[17] Kazuo Tanaka, Takayuki Ikeda and Hua O. Wang, "Robust stabilization of a class of uncertain nonlinear systems via fuzzy control: quadratic stability, H∞ control theory, and linear matrix inequalities," IEEE Trans. on Fuzzy Systems, Vol. 4, No. 1, Feb. 1996.

[18] Mehrdad Hojati and Saeed Gazor, "Hybrid adaptive fuzzy identification and control of nonlinear systems," IEEE Transactions on Fuzzy Systems, Vol. 10, No. 2, April 2002, pp. 198-210.

Hafedh Abid (1,3), Mohamed Chtourou (2), Ahmed Toumi (3)
(1) Institut Supérieur des Études Technologiques de Sfax, Laboratoire d'Automatique, Génie Informatique et Signal, Cité Scientifique, BP 48, 59651 Villeneuve d'Ascq, France
(2) Unité de Commande Intelligente, Design et Optimisation des Systèmes Complexes (ICOS), ENIS, B.P. W, 3038 Sfax, Tunisie
(3) Unité de Procédés Industriels, Unité de Commande Automatique (UCPI), ENIS, B.P. W, 3038 Sfax, Tunisie
E-mail: hafedh.abid@isetso.rnu.tn, mohamed.chtourou@enis.rnu.tn, ahmed.toumi@sta-tn.com

Received: June 13, 2007

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 2, pp. 172-182

Analytical Model for a Multiprocessor with Private Caches and Shared Memory

Angel Vassilev Nikolov

Abstract: We develop an analytical model of a multiprocessor with private caches and shared memory and obtain the following results: the instantaneous state probabilities and the steady-state probabilities of the system. Both transient behaviour and equilibrium can be studied and analyzed. We show that the results can be applied to determine the output parameters for both blocking and non-blocking caches.
Keywords: invalidate cache-coherence protocol, queuing system, discrete transform

1 Introduction

Shared-memory multiprocessors are widely used as platforms for technical and commercial computing [2]. Performance evaluation is a key technology for design in computer architecture, and the continuous growth in the complexity of systems is making this task increasingly complex [7]. In general, the problem of developing effective performance-evaluation techniques can be stated as finding the best trade-off between accuracy and speed. The most common approach to estimating the performance of a superscalar multiprocessor is to build a software model and simulate the execution of a set of benchmarks. Since processors are synchronous machines, however, simulators usually work at cycle level, and this leads to an enormous slowdown [9]; a simulation might take hours, even days. For memory structures, relatively accurate analytical models have been developed [3, 7, 9, 10] through extensive use of various queuing systems. An open queuing system with Poisson arrivals and exponential service times is considered quite good for the description of memory hierarchies [7]. Our focus is on the impact of cache-coherence protocols on the overall system performance. The most commonly used technique for this purpose is mean value analysis (MVA) [3, 5, 7, 8, 9]. It allows the total number of customers to be fixed (closed queue system), and this seems to be a more adequate representation of the processes of self-blocking requestors [5]. Calculations of output parameters such as residency times, waiting times and utilization are shown in [3, 8, 9]. MVA is based on forced flow, which means that in equilibrium the output rate equals the input rate. Instantaneously, however, the input rate can differ from the output rate, so that the instantaneous probabilities can differ from equilibrium [7]. MVA offers no possibility to study transient effects.
Moreover, the assumption of exponential service times is not realistic; in fact all bus access times and memory access times are constants. It will be seen later in this paper that the state probabilities depend on the server's time density function. We use the technique of Markov processes to describe the behaviour of a multiprocessor implementing cache-coherence protocols.

2 Definition and analysis of the model

A multiprocessor consists of several processors connected to a shared main memory by a common complete-transaction bus. Each processor has a private cache. When a processor issues a request to its cache, the cache controller examines the state of the cache and takes suitable action, which may include generating a bus transaction to access main memory. Coherence is maintained by having all cache controllers "snoop" on the bus and monitor the transactions. Snoopy cache-coherence protocols fall into two major categories: invalidate and update [2, 3, 10]. Invalidating protocols are studied here, but the concepts can be applied, with some modifications, to updating protocols too. Transactions may or may not include the memory block and the shared bus. A typical transaction that does not include the memory block is "invalidate cache copy", which occurs when a processor requests a write in its cache; all other processors simply change the status bit(s) of their own copies to invalid. If the memory block is uncached or not clean it can be uploaded from the main memory, but in today's multiprocessors it is rather uploaded from another cache designated as owner (O) (cache-to-cache transfer). A memory-to-cache transfer occurs when the only clean copy is in the main memory. A cache block is written back (WB) to the main memory (the bus is used) when a dirty copy is evicted [6].
The bus and the main memory are also used when synchronization procedures are executed [2]. Apparently the bus can be considered the bottleneck of the system. In terms of queuing theory, the processors can be viewed as customers (clients) and the bus as a server. Inter-arrival times are exponentially distributed with parameter $\lambda$; this assumption is adequate for most applications [7]. Requests are served on a first-come-first-served (FCFS) basis. Immediately after issuing a request for a cache-to-cache transfer or a synchronization procedure, the customer blocks itself. The service time for a blocking request has density function $f_1(x)$. When service is completed, the processor (customer) resumes processing with probability $p$, or resumes processing and generates a new request with probability $q$ ($p + q = 1$). Details on how to obtain the input parameters are given in [2, 3, 8, 9]. This new request has a different density function $f_2(x)$ and corresponds to a WB transaction. It does not block the customer, but the server is held until completion of the WB transaction, thereby adding to the queue. The system can be in one of the following states: 1) $N$: all $N$ customers are doing internal processing; 2) $(j, 1)$: $j$ customers are doing internal processing ($N - j$ are blocked) and all requests are of type 1 ($0 \leq j \leq N-1$); 3) $(j, 2)$: $j$ customers are doing internal processing, the server is serving a request of type 2, and $N - j$ customers are waiting in the queue for service of type 1 ($0 \leq j \leq N$). The transitions between these states are illustrated in Fig. 1.
Throughout this paper we use the following notation:

$P_N(t)$ — probability [all $N$ customers are doing internal processing at time $t$]
$P_{j,i}(t,x)$ — probability [$j$ customers are doing internal processing, $N-j$ are in the queue and/or in the server, the server is busy doing service of type $i$ at time $t$, and the elapsed service time lies between $x$ and $x + dx$]
$P_{j,i}(x)$ — probability [in the equilibrium state, $j$ customers are doing internal processing, $N-j$ are in the queue and/or in the server, the server is busy doing service of type $i$, and the elapsed service time lies between $x$ and $x + dx$]
$P_{j,i}(t)$ — probability [$j$ customers are doing internal processing, $N-j$ are in the queue or in the server, and the server is busy doing service of type $i$ at time $t$]
$P_N$, $P_{j,i}$ — steady-state probabilities: $P_N = \lim_{t \to \infty} P_N(t)$, $P_{j,i} = \int_0^\infty P_{j,i}(x)\,dx$
$\beta_j = j\lambda$
$F_i(x)$ — cumulative distribution function (c.d.f.) of the service time of type $i$; $i = 1, 2$
$f_i(x)$ — probability density function (p.d.f.) of the service time of type $i$; $i = 1, 2$
$\delta_{m,n}$ — Kronecker delta
$1/\mu_i = \int_0^\infty x f_i(x)\,dx$; $i = 1, 2$
$h_i(x) = \dfrac{f_i(x)}{1 - F_i(x)}$ — service rate for type $i$; $i = 1, 2$
$f_i(s)$, $f_i(s+\beta_n)$, $f_i(\beta_n)$ — Laplace transforms (LT) of $f_i(x)$
t.u. — time unit

In view of the nature of the system, we obtain the following set of integro-differential equations:

$$\left[\frac{d}{dt} + \beta_N\right]P_N(t) = p\int_0^t P_{N-1,1}(t,x)\,h_1(x)\,dx + \int_0^t P_{N,2}(t,x)\,h_2(x)\,dx \quad (1)$$

Figure 1: State-transition diagram of the model.
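The service rate $h_i(x) = f_i(x)/(1-F_i(x))$ defined above has a simple closed form for the two distributions used later in the examples. A sketch: the exponential case is constant by memorylessness, and for Erlang-2 (with rate $\mu$ per stage, so $f(x) = \mu^2 x e^{-\mu x}$ and $F(x) = 1 - e^{-\mu x}(1 + \mu x)$) the rate is $\mu^2 x / (1 + \mu x)$:

```python
def hazard_exponential(mu):
    # exponential service: h(x) = mu for all x (memoryless)
    return lambda x: mu

def hazard_erlang2(mu):
    # Erlang-2 with rate mu per stage: h(x) = mu^2 * x / (1 + mu * x)
    return lambda x: (mu * mu * x) / (1.0 + mu * x)

h_exp = hazard_exponential(0.1)
h_erl = hazard_erlang2(2.0)
```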
$$\left[\frac{\partial}{\partial t} + \frac{\partial}{\partial x} + \beta_{N-1} + h_1(x)\right]P_{N-1,1}(t,x) = 0 \quad (2)$$

$$\left[\frac{\partial}{\partial t} + \frac{\partial}{\partial x} + \beta_N + h_2(x)\right]P_{N,2}(t,x) = 0 \quad (3)$$

$$\left[\frac{\partial}{\partial t} + \frac{\partial}{\partial x} + \beta_j + h_i(x)\right]P_{j,i}(t,x) = \beta_{j+1}P_{j+1,i}(t,x) \quad (4)$$

for $i = 1$, $1 \leq j \leq N-1$; $i = 2$, $1 \leq j \leq N$;

$$\left[\frac{\partial}{\partial t} + \frac{\partial}{\partial x} + h_i(x)\right]P_{0,i}(t,x) = \beta_1 P_{1,i}(t,x) \quad (5)$$

for $i = 1, 2$, with the following boundary and initial conditions:

$$P_{j,1}(t,0) = (1-\delta_{j,0})\,p\int_0^\infty P_{j-1,1}(t,x)h_1(x)\,dx + \int_0^\infty P_{j,2}(t,x)h_2(x)\,dx + \delta_{j,N-1}\beta_N P_N(t) \quad (6)$$

for $0 \leq j \leq N-1$;

$$P_{j,2}(t,0) = q\int_0^\infty P_{j-1,1}(t,x)h_1(x)\,dx \quad (7)$$

for $1 \leq j \leq N$;

$$P_N(0) = 1, \quad P_{0,2}(t,0) = 0, \quad P_{j,i}(0,0) = 0 \quad (8)$$

for $i = 1$, $1 \leq j \leq N-1$; $i = 2$, $1 \leq j \leq N$. By using the Laplace transform and the discrete transform [4, 8] the above equations are transformed as follows:

$$(s + \beta_N)P_N(s) = 1 + p\int_0^\infty P_{N-1,1}(s,x)h_1(x)\,dx + \int_0^\infty P_{N,2}(s,x)h_2(x)\,dx \quad (9)$$

$$\left[s + \frac{d}{dx} + \beta_j + h_i(x)\right]U_{j,i}(s,x) = 0 \quad (10)$$

for $i = 1$, $1 \leq j \leq N-1$; $i = 2$, $1 \leq j \leq N$;

$$\left[s + \frac{d}{dx} + h_i(x)\right]P_{0,i}(s,x) = \beta_1 P_{1,i}(s,x) \quad (11)$$

for $i = 1, 2$, where

$$U_{j,1}(s,x) = \sum_{n=j}^{N-1}\binom{n}{j}P_{n,1}(s,x), \quad P_{j,1}(s,x) = \sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}U_{n,1}(s,x) \quad \text{for } 1 \leq j \leq N-1,$$

$$U_{j,2}(s,x) = \sum_{n=j}^{N}\binom{n}{j}P_{n,2}(s,x), \quad P_{j,2}(s,x) = \sum_{n=j}^{N}(-1)^{n-j}\binom{n}{j}U_{n,2}(s,x) \quad \text{for } 1 \leq j \leq N.$$

Let $V_{j,i}(s,x) = \dfrac{U_{j,i}(s,x)}{1 - F_i(x)}$ and $P'_{0,i}(s,x) = \dfrac{P_{0,i}(s,x)}{1 - F_i(x)}$. Then from (10) and (11) we have, after some transformations,

$$\left[s + \frac{d}{dx} + \beta_j\right]V_{j,i}(s,x) = 0 \quad \text{for } i = 1,\ 1 \leq j \leq N-1;\ i = 2,\ 1 \leq j \leq N,$$

and $\left[s + \frac{d}{dx}\right]P'_{0,i}(s,x) = \beta_1 P_{1,i}(s,x)$ for $i = 1, 2$. Hence the solutions of (9)-(11) are:

$$U_{j,i}(s,x) = [1 - F_i(x)]\,U_{j,i}(s,0)\,e^{-(s+\beta_j)x} \quad (12)$$

$$P_N(s) = \frac{1 + p\,f_1(s+\beta_{N-1})\,U_{N-1,1}(s,0) + f_2(s+\beta_N)\,U_{N,2}(s,0)}{s + \beta_N} \quad (13)$$

$$P_{0,1}(s,x) = [1-F_1(x)]\,\beta_1 e^{-sx}\left[P_{0,1}(s,0) + \sum_{n=1}^{N-1}(-1)^{n-1}n\,\frac{1-e^{-\beta_n x}}{\beta_n}U_{n,1}(s,0)\right] \quad (14)$$

$$P_{0,2}(s,x) = [1-F_2(x)]\,\beta_1 e^{-sx}\left[\sum_{n=1}^{N}(-1)^{n-1}n\,\frac{1-e^{-\beta_n x}}{\beta_n}U_{n,2}(s,0)\right] \quad (15)$$

By integrating (12), (14) and (15) we obtain the LT of the instantaneous probabilities:

$$P_{j,1}(s) = \sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}\left[\frac{1-f_1(s+\beta_n)}{s+\beta_n}\right]U_{n,1}(s,0) \quad (16)$$

for $1 \leq j \leq N-1$;

$$P_{j,2}(s) = \sum_{n=j}^{N}(-1)^{n-j}\binom{n}{j}\left[\frac{1-f_2(s+\beta_n)}{s+\beta_n}\right]U_{n,2}(s,0) \quad (17)$$

for $1 \leq j \leq N$;

$$P_{0,1}(s) = P_{0,1}(s,0)\left[\frac{1-f_1(s)}{s}\right] + \beta_1\sum_{n=1}^{N-1}(-1)^{n-1}n\left[\frac{1-f_1(s)}{s} - \frac{1-f_1(s+\beta_n)}{s+\beta_n}\right]\frac{U_{n,1}(s,0)}{\beta_n} \quad (18)$$

$$P_{0,2}(s) = \beta_1\sum_{n=1}^{N}(-1)^{n-1}n\left[\frac{1-f_2(s)}{s} - \frac{1-f_2(s+\beta_n)}{s+\beta_n}\right]\frac{U_{n,2}(s,0)}{\beta_n}. \quad (19)$$

Taking the LT of (6)-(7) and using (8) and (12)-(15) we get, after some transformations, the following system of linear equations:

$$\sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}U_{n,1}(s,0) = p\sum_{n=j}^{N-1}(-1)^{n-j+1}\binom{n}{j-1}f_1(s+\beta_n)U_{n,1}(s,0) + \sum_{n=j}^{N}(-1)^{n-j}\binom{n}{j}f_2(s+\beta_n)U_{n,2}(s,0) + \delta_{j,N-1}\beta_N P_N \quad (20)$$

for $2 \leq j \leq N-1$;

$$\sum_{n=j}^{N}(-1)^{n-j}\binom{n}{j}U_{n,2}(s,0) = q\sum_{n=j-1}^{N-1}(-1)^{n-j+1}\binom{n}{j-1}f_1(s+\beta_n)U_{n,1}(s,0) \quad (21)$$

for $2 \leq j \leq N$;

$$\sum_{n=1}^{N-1}(-1)^{n-1}n\,U_{n,1}(s,0) = p\,P_{0,1}(s,0)f_1(s) + p\,\beta_1\left[\sum_{n=1}^{N-1}(-1)^{n-1}n\,\frac{f_1(s)-f_1(s+\beta_n)}{\beta_n}U_{n,1}(s,0)\right] \quad (22)$$

$$\sum_{n=1}^{N}(-1)^{n-1}n\,U_{n,2}(s,0) = q\,P_{0,1}(s,0)f_1(s) + q\,\beta_1\left[\sum_{n=1}^{N-1}(-1)^{n-1}n\,\frac{f_1(s)-f_1(s+\beta_n)}{\beta_n}U_{n,1}(s,0)\right] \quad (23)$$

The coefficients $U_{j,i}(s,0)$ can now be determined from the above equations. We could apply the final-value theorem to (16)-(19) to obtain the steady-state probabilities, but this would require the use of l'Hôpital's rule and seems difficult and impractical [11]. Instead we set up the following differential equations:

$$\beta_N P_N = p\int_0^\infty P_{N-1,1}(x)h_1(x)\,dx + \int_0^\infty P_{N,2}(x)h_2(x)\,dx \quad (24)$$

$$\left[\frac{d}{dx} + \beta_{N-1} + h_1(x)\right]P_{N-1,1}(x) = 0 \quad (25)$$

$$\left[\frac{d}{dx} + \beta_N + h_2(x)\right]P_{N,2}(x) = 0 \quad (26)$$

$$\left[\frac{d}{dx} + \beta_j + h_i(x)\right]P_{j,i}(x) = \beta_{j+1}P_{j+1,i}(x) \quad (27)$$

for $i = 1$, $1 \leq j \leq N-1$; $i = 2$, $1 \leq j \leq N$;

$$\left[\frac{d}{dx} + h_i(x)\right]P_{0,i}(x) = \beta_1 P_{1,i}(x) \quad (28)$$

for $i = 1, 2$.
Equations (24)-(28) are to be solved under the following boundary conditions and normalizing condition:

$$P_{j,1}(0) = (1-\delta_{j,0})\,p\int_0^\infty P_{j-1,1}(x)h_1(x)\,dx + \int_0^\infty P_{j,2}(x)h_2(x)\,dx + \delta_{j,N-1}\beta_N P_N \quad (29)$$

for $0 \leq j \leq N-1$;

$$P_{j,2}(0) = q\int_0^\infty P_{j-1,1}(x)h_1(x)\,dx \quad (30)$$

for $1 \leq j \leq N-1$;

$$P_{0,2}(0) = 0 \quad (31)$$

$$P_N + \sum_{j=0}^{N-1}P_{j,1} + \sum_{j=0}^{N}P_{j,2} = 1. \quad (32)$$

The solutions of (24)-(32) are:

$$P_N = \frac{1 + p\,f_1(\beta_{N-1})U_{N-1,1}(0) + f_2(\beta_N)U_{N,2}(0)}{\beta_N} \quad (33)$$

$$P_{j,1} = \sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}\left[\frac{1-f_1(\beta_n)}{\beta_n}\right]U_{n,1}(0) \quad (34)$$

for $1 \leq j \leq N-1$;

$$P_{j,2} = \sum_{n=j}^{N}(-1)^{n-j}\binom{n}{j}\left[\frac{1-f_2(\beta_n)}{\beta_n}\right]U_{n,2}(0) \quad (35)$$

for $1 \leq j \leq N-1$;

$$P_{0,1} = \frac{P_{0,1}(0)}{\mu_1} + \sum_{n=1}^{N-1}(-1)^{n-1}n\left[\frac{1}{\mu_1} - \frac{1-f_1(\beta_n)}{\beta_n}\right]U_{n,1}(0) \quad (36)$$

$$P_{0,2} = \sum_{n=1}^{N}(-1)^{n-1}n\left[\frac{1}{\mu_2} - \frac{1-f_2(\beta_n)}{\beta_n}\right]U_{n,2}(0) \quad (37)$$

For $U_{j,i}(0)$ and $P_{0,1}(0)$ we have:

$$\sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}U_{n,1}(0) = p\sum_{n=j}^{N-1}(-1)^{n-j+1}\binom{n}{j-1}f_1(\beta_n)U_{n,1}(0) + \delta_{j,N-1}\beta_N P_N \quad (38)$$

for $2 \leq j \leq N-1$;

$$\sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}U_{n,2}(0) = q\sum_{n=j}^{N-1}(-1)^{n-j+1}\binom{n}{j-1}f_2(\beta_n)U_{n,2}(0) \quad (39)$$

for $2 \leq j \leq N-1$;

$$P_{0,1}(0) = \beta_1\sum_{n=1}^{N}(-1)^{n-1}n\left[\frac{1-f_2(\beta_n)}{\beta_n}\right]U_{n,2}(0) \quad (40)$$

$$\sum_{n=1}^{N-1}(-1)^{n-1}n\,U_{n,1}(0) = p\,P_{0,1}(0) + p\,\beta_1\sum_{n=1}^{N}(-1)^{n-1}n\left[\frac{1-f_1(\beta_n)}{\beta_n}\right]U_{n,1}(0) + \sum_{n=1}^{N}n\,f_2(\beta_n)U_{n,2}(0) \quad (41)$$

$$\sum_{n=1}^{N}(-1)^{n-1}n\,U_{n,2}(0) = q\,P_{0,1}(0) + q\,\beta_1\sum_{n=1}^{N}(-1)^{n-1}n\left[\frac{1-f_1(\beta_n)}{\beta_n}\right]U_{n,1}(0) \quad (42)$$

The coefficients $U_{j,i}(0)$ can be determined from (32) and (38)-(42).

3 Examples

To obtain the transient state probabilities we first determine $P_N(s)$ and $P_{j,i}(s)$ from (16)-(19) and (20)-(23), and then apply the inverse Laplace transform to them. We used the Maple 8 packages on a standard PC platform under Windows XP for these computations [12]; results were produced and printed in less than a second. For $N = 4$ the instantaneous probabilities are listed in Appendix A. Various performance characteristics can be computed using the steady-state probabilities.
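Such characteristics are simple weighted sums of the steady-state probabilities. For instance, the average number of blocked customers for blocking caches can be sketched as follows (state keys and the toy probability values are hypothetical, for illustration only):

```python
def anbc_blocking(p, n):
    # p maps a state (j, i) to its steady-state probability P_{j,i};
    # each state contributes (n - j) blocked customers
    return sum((n - j) * prob for (j, _i), prob in p.items())

# toy two-processor distribution: half the mass on "all busy processing",
# half on "everyone blocked on a type-1 request"
p = {(2, 1): 0.5, (0, 1): 0.5}
value = anbc_blocking(p, 2)   # 0 * 0.5 + 2 * 0.5
```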
for example, the average number of blocked customers (anbc) in the case of blocking caches will be given by anbc = 2 ∑ i=1 n ∑ j=0 (n − j)pj,i. (43) in the case of non-blocking caches anbc will be anbc = n ∑ j=0 (n − j −1 + k)pj,1 + n−1 ∑ j=0 (n − j)pj,2. (44) where k is the ratio of average memory stall time [2] . k depends strongly on the application. (1-k) actually refers to the fraction time the processor is consuming data while cache-to-cache or memory-tocache transfer is in progress. in appendix b we list the anbc for two popular service time distributions: exponential and erlangian [1], for blocking and fully non-blocking caches (k=0). the time to solve (33-42) and calculate anbc was meaninglessly short. 4 concluding remarks this work presented a model for a shared bus, shared memory multiprocessor with private caches and captures the whole spectrum of invalidate type cache coherence protocols. although we started with fairly sophisticated set of integro-differential equations, the output of the model is a set of few linear equations from which the state probabilities can be determined. the approach eliminates the main drawbacks of the most commonly used mva analysis: inability to deal with transients and constraint on the service time distribution. the model gives insights into the transient behaviour of the system. moreover, the assumption of exponentially distributed service times can be dropped; any continuous distribution can be used. the ease of obtaining performance measures in a meaningless time makes very feasible the incorporation of the model in a multiprocessor design tool. bibliography [1] s. k. bose, introduction to queuing systems, kluwer/plenum publishers, 2001 [2] j. l. hennessy, d. a. patterson; computer architecture: a quantitative approach, pearson publishers, 2003 [3] m. c. chiang, memory system design for bus based multiprocessor, phd thesis, university of wisconsin, 1991 [4] t. itoi, t. nishida, m. kodama and e. 
ohi, n-unit parallel redundant system with correlated failures and single repair facility, microelectronics and reliability, vol. 17, pp. 279-285, 1978
[5] e. lazowska, j. zahorjan, g. graham, and k. sevcik, quantitative system performance: computer system analysis using queuing network models, prentice-hall, englewood cliffs, nj, may 1984
[6] a. louri, a. k. kodi, an optical interconnection network and a modified snooping protocol for the design of large-scale symmetric multiprocessors (smps), ieee transactions on parallel and distributed systems, vol. 15, no. 12, dec. 2004, pp. 1093-1104
[7] r. e. matick, comparison of analytic performance models using closed mean-value analysis versus open-queuing theory for estimating cycles per instruction of memory hierarchies, ibm journal of research and development, jul 2003
[8] d. j. sorin et al., a customized mva model for ilp multiprocessors, technical report no. 1369, university of wisconsin-madison, 1998
[9] d. j. sorin et al., evaluation of shared-memory parallel systems with ilp processors, proc. 25th int'l symp. on computer architecture, june 1998, pp. 180-191
[10] j. sustersic, a. hurson, coherence protocol for bus-based and scalable multiprocessors, internet and wireless distributed computing environments: a survey, advances in computers, vol. 59, 2003, pp.
211-278
[11] j. l. schiff, the laplace transform, springer, 1999
[12] waterloo maple inc., introduction to maple 8, 2002

appendix a

for N=4, λ=0.001 [1/t.u.], F1(x)=0.1 exp(-0.1x) and F2(x)=0.01 exp(-0.01x), the instantaneous probabilities are

p4(t) = 0.9211361286 + 0.8058476879e-2*exp(-0.1248619627*t)
 + 0.8535072295e-2*exp(-0.1089825679*t) + 0.9049529656e-2*exp(-0.9494144284e-1*t)
 + 0.9696074769e-2*exp(-0.8072343638e-1*t) + 0.1774027054e-3*exp(-0.1510201407e-1*t)
 + 0.1728181365e-2*exp(-0.1398702636e-1*t) + 0.5618533851e-2*exp(-0.1256085210e-1*t)
 + 0.1211910345e-1*exp(-0.1067946234e-1*t) + 0.2388149701e-1*exp(-0.8161235321e-2*t),

p31(t) = 0.3792913471e-1 - 0.1093143496e-1*exp(-0.1248619627*t)
 - 0.1007354818e-1*exp(-0.1089825679*t) - 0.9271731350e-2*exp(-0.9494144284e-1*t)
 - 0.8405607569e-2*exp(-0.8072343638e-1*t) + 0.2658572506e-2*exp(-0.1510201407e-1*t)
 - 0.2212663963e-5*exp(-0.1398702636e-1*t) - 0.3015750621e-3*exp(-0.1256085210e-1*t)
 - 0.6739486112e-3*exp(-0.1067946234e-1*t) - 0.9276492013e-3*exp(-0.8161235321e-2*t),

p21(t) = 0.1420742616e-2 + 0.2324288902e-2*exp(-0.1248619627*t)
 + 0.2986557798e-3*exp(-0.1089825679*t) - 0.1243034332e-2*exp(-0.9494144284e-1*t)
 - 0.2544948528e-2*exp(-0.8072343638e-1*t) - 0.5329230583e-2*exp(-0.1510201407e-1*t)
 + 0.6737760872e-2*exp(-0.1398702636e-1*t) + 0.4442015626e-3*exp(-0.1256085210e-1*t)
 - 0.5688666731e-3*exp(-0.1067946234e-1*t) - 0.1539483112e-2*exp(-0.8161235321e-2*t),

p11(t) = 0.7684028624e-4 - 0.2290986748e-3*exp(-0.1248619627*t)
 + 0.3160128043e-3*exp(-0.1089825679*t) + 0.2148505581e-3*exp(-0.9494144284e-1*t)
 - 0.3252577041e-3*exp(-0.8072343638e-1*t) + 0.3775725072e-2*exp(-0.1510201407e-1*t)
 - 0.8400171763e-2*exp(-0.1398702636e-1*t) + 0.4974831098e-2*exp(-0.1256085210e-1*t)
 + 0.5143708805e-3*exp(-0.1067946234e-1*t) - 0.9181578239e-3*exp(-0.8161235321e-2*t),

p01(t) = 0.9242283829e-5*exp(-0.1248619627*t) - 0.3513395647e-4*exp(-0.1089825679*t)
 + 0.4257071327e-4*exp(-0.9494144284e-1*t) - 0.1688587212e-4*exp(-0.8072343638e-1*t)
 - 0.9200675435e-3*exp(-0.1510201407e-1*t) + 0.2810623081e-2*exp(-0.1398702636e-1*t)
 - 0.3183584754e-2*exp(-0.1256085210e-1*t) + 0.1611954912e-2*exp(-0.1067946234e-1*t)
 - 0.3239077071e-3*exp(-0.8161235321e-2*t) + 0.5218152790e-5,

p42(t) = 0.2709223908e-1 + 0.9859983387e-3*exp(-0.1248619627*t)
 + 0.1060558367e-2*exp(-0.1089825679*t) + 0.1145474465e-2*exp(-0.9494144284e-1*t)
 + 0.1259769099e-2*exp(-0.8072343638e-1*t) - 0.2412466943e-1*exp(-0.1510201407e-1*t)
 - 0.1705507775e-2*exp(-0.1398702636e-1*t) - 0.2095511260e-2*exp(-0.1256085210e-1*t)
 - 0.2029637013e-2*exp(-0.1067946234e-1*t) - 0.1588776483e-2*exp(-0.8161235321e-2*t),

p32(t) = -0.2421204825e-3*exp(-0.1248619627*t) - 0.7509940526e-4*exp(-0.1089825679*t)
 + 0.9576676158e-4*exp(-0.9494144284e-1*t) + 0.3013803504e-3*exp(-0.8072343638e-1*t)
 + 0.7126069503e-1*exp(-0.1510201407e-1*t) - 0.6135152996e-1*exp(-0.1398702636e-1*t)
 - 0.8971987351e-2*exp(-0.1256085210e-1*t) - 0.5950006752e-2*exp(-0.1067946234e-1*t)
 - 0.4494935895e-2*exp(-0.8161235321e-2*t) + 0.9428952497e-2,

p22(t) = 0.2421271696e-2 + 0.2626333487e-4*exp(-0.1248619627*t)
 - 0.3154175021e-4*exp(-0.1089825679*t) - 0.2945613244e-4*exp(-0.9494144284e-1*t)
 + 0.3412946115e-4*exp(-0.8072343638e-1*t) - 0.8108903801e-1*exp(-0.1510201407e-1*t)
 + 0.1349032466*exp(-0.1398702636e-1*t) - 0.4071010637e-1*exp(-0.1256085210e-1*t)
 - 0.9622074403e-2*exp(-0.1067946234e-1*t) - 0.5904604182e-2*exp(-0.8161235321e-2*t),

p12(t) = -0.1765308077e-5*exp(-0.1248619627*t) + 0.4800731626e-5*exp(-0.1089825679*t)
 - 0.4448905932e-5*exp(-0.9494144284e-1*t) + 0.1599282603e-5*exp(-0.8072343638e-1*t)
 + 0.4177917201e-1*exp(-0.1510201407e-1*t) - 0.9973555226e-1*exp(-0.1398702636e-1*t)
 + 0.7256040480e-1*exp(-0.1256085210e-1*t) - 0.9747995399e-2*exp(-0.1067946234e-1*t)
 - 0.5300997812e-2*exp(-0.8161235321e-2*t) + 0.4449749927e-3,

p02(t) = -0.4618227199e-6*exp(-0.1248619627*t) - 0.2203030325e-6*exp(-0.1089825679*t)
 + 0.4881890483e-7*exp(-0.9494144284e-1*t) - 0.1699392257e-7*exp(-0.8072343638e-1*t)
 - 0.8188760719e-2*exp(-0.1510201407e-1*t) + 0.2501502200e-1*exp(-0.1398702636e-1*t)
 - 0.2833447693e-1*exp(-0.1256085210e-1*t) + 0.1434663085e-1*exp(-0.1067946234e-1*t)
 - 0.2882912592e-2*exp(-0.8161235321e-2*t) + 0.4449749927e-4.

in the above expressions, e-i means 10^(-i) for i = 1, ..., 7.

appendix b

table 1: N=8, F1(x)=0.1 exp(-0.1x), F2(x)=0.01 exp(-0.01x)

λ [1/t.u.]   p     ANBC, blocking caches   ANBC, fully non-blocking caches
0.001        0.9   0.154099881194466       0.075640880006411
0.002        0.9   0.441552853804251       0.290383910880334
0.003        0.9   0.822750601431095       0.607433119474025
0.004        0.9   1.253944990222998       0.984102789831906
0.001        0.8   0.230012889507952       0.152313018403034
0.002        0.8   0.729883782777377       0.584481458432927
0.003        0.8   1.382033782478873       1.183494795953230
0.004        0.8   2.063720956300253       1.826269794552253

table 2: N=8, F1(x)=0.1^3 x^2 exp(-0.1x)/2!, F2(x)=0.01^3 x^2 exp(-0.01x)/2!

λ [1/t.u.]   p     ANBC, blocking caches   ANBC, fully non-blocking caches
0.001        0.9   0.384839057891723       0.211437492029451
0.002        0.9   1.313451009452606       0.582993712839022
0.003        0.9   2.390481400874492       1.782339618354729
0.004        0.9   3.691834116720534       2.882438452093385
0.001        0.8   0.614956120345239       0.400820549913285
0.002        0.8   2.611487230549326       1.722034656332087
0.003        0.8   4.062557145097248       3.429652938504840
0.004        0.8   5.899361833023557       5.394204692051840

angel vassilev nikolov
national university of lesotho
department of mathematics and computer science
roma 180, lesotho
e-mail: av.nikolov@nul.ls

received: december 17, 2007

angel vassilev nikolov received the beng degree in electronic and computer engineering from the technical university of budapest, hungary, in 1974 and the phd degree in computer science from the bulgarian academy of sciences in 1982, where he worked as a research associate. in 1989 he was promoted to associate research professor in bulgaria.
dr nikolov also served as a lecturer in computer science at the national university of science and technology, bulawayo, zimbabwe, and at grande prairie regional college, alberta, canada, and as an associate professor at sharjah college, united arab emirates. his research interests include computer architecture, performance evaluation of multiprocessors, and reliability modeling. he has published numerous journal and conference articles and holds four patents on the above topics.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 1, pp. 8-19

introducing collaborative practices in small medium enterprises

d. antonelli, p. chiabert

dario antonelli, paolo chiabert
politecnico di torino
department of production systems and economics
corso duca degli abruzzi 24, 10129 torino, italy
e-mail: {dario.antonelli,paolo.chiabert}@polito.it

abstract: in an enterprise, collaborative working schemes are obtained only by adopting a suitable organization of the enterprise functions. at the informative level, collaboration can be enabled by suitable project management tools for the exchange of information, which is the basis of collaboration. one of these tools is product lifecycle management (plm). on the basis of an exhaustive analysis of the plm impact on the different aspects of the enterprise, the paper proposes an original methodology to evaluate the possibility of using plm as a framework in which to exploit collaboration links within the enterprise. the methodology is founded not on the formally declared organization but on the actual data flows induced by the relations among the documents used to develop a project. data are collected in the field using questionnaires. the links are made explicit by applying agglomerative hierarchical clustering with the single-linkage technique. the attitude towards the implementation of plm is then analysed with respect to the organization and to the informative system.
such a methodology is general and can be profitably applied to assist plm implementation in enterprises.
keywords: product life-cycle management, concurrent engineering, business process management, cluster analysis, collaboration.

1 introduction

small and medium enterprises (smes) are companies with a turnover of less than 50 million euro and fewer than 250 employees, according to eu recommendation 2003/361/ec. in the european countries they are the backbone of the economic system [7]. concurrent engineering (ce) is an effective strategy to shorten the 'time to market' of innovative products. the key to its accomplishment lies in the collaboration among all the enterprise personnel appointed to the project. the goal is accomplished by the parallel development of different activities, obtained by integrating several company functions in a multidisciplinary group in order to anticipate the constraints generated by the industrial dynamics [14]. the core aspect of ce is therefore the creation of tight co-operation links among all the industrial actors involved in the definition of product and process. a significant tool to support ce effectiveness is plm. unfortunately, plm implementation inside smes is scarce. the reason goes beyond implementation difficulties or the excessive costs of it systems. it derives from the poor conformity of the ordinary sme organization of work to the requirements of ce in terms of strategy, organization and technology [13]. it is useful to remark that in the literature the term collaboration is often given the same significance as the term coordination. collaboration means working with others for the success of an activity. coordination can refer to optimisation of the work by finding the best division of activities to be performed by humans and machines. the definition put forth by malone and crowston [15] is: "coordination is managing dependencies between activities".
coordination, recalling the theory of systems, also implies the presence of a coordinator, and hence some hierarchical shape of the organisation [18]. instead, collaboration is the process of various individuals, groups or systems working together on a voluntary basis, without the need for a manager or a work program [16]. obviously, collaboration works only if some behaviour rules are respected, such as the ones proposed in [21] for collaborative software agents. in organizations oriented to ce, different coordination mechanisms have been adopted [10]. they differ mainly in the instant of application of the coordination: predetermined before starting the work, or determined during the work as a result of a negotiation. an annotated list of coordination systems is given in [20]: information workflow, negotiated objectives, milestones, professional rules, mapped parameters, design space sharing, joint parameters, data coherency. some coordination systems need the active participation of a coordinator, e.g. the project manager. conversely, other systems, like design space sharing and data coherency, must be enforced by the use of appropriate software tools, e.g. the plm. a comprehensive definition of product lifecycle management is "a strategic business approach for the effective management and use of corporate intellectual capital", where "corporate intellectual capital" consists of product definitions, product history and best practices [6]. several plm systems already operate in large companies, demonstrating their effectiveness. they range from the ford project c3p (cad/cae/cam and product information management), started in 1996 and based on sdrc's i-deas and metaphase software, to the boeing 7e7 dreamliner airplane, entirely designed on the dassault systemes plm software platform, to the siemens teamcenter suite, largely adopted in the automotive industry.
presently, new lighter software suites, based on the "off-the-shelf" commercial policy, make plm systems affordable for smes. despite the availability of software, plm does not have an adequate diffusion among them. in the authors' opinion, the most important justification of enterprises' interest in plm should be the competitive edge provided by collaboration. as a matter of fact, plm assures the most effective control of the enterprise's projects by increasing the efficiency of information management along three directions:

• increase of efficiency in the management of technical product information. product data represent the core of a manufacturing enterprise. they should be protected against unauthorized access but, at the same time, should be promptly shared for design activities as well as for many other functions in the enterprise.
• ease of product data exchange, by establishing proper interfaces towards enterprise functions (manufacturing, maintenance ...) and, vice versa, by defining communication channels to gather any information relevant for upgrading.
• management of the enterprise knowledge database for storing and retrieving historical products. the carry-over process, widely adopted to speed up the time to market of new products, benefits from the use of plm tools.

the reasons for its inadequate diffusion are spread uniformly over enterprise organization and the information technology system:

• accessing product data from enterprise functions not directly involved in product development is a functionality provided by it tools like erp that are seldom present inside smes.
• product/process development is historically a self-governed function, with human resources, activities and hardware/software tools sharply separated from other industrial functions.
• the implementation of it tools linking different enterprise areas requires a clear definition of decisional and informational processes.
the idea behind this research is that work organization inside an sme is based on spontaneous collaboration, which overcomes attempts to organize the activities by introducing coordination mechanisms, as in the case of plm implementation. spontaneous collaboration works well as long as the firm is small enough for everyone to know where to find the information they need. as the enterprise grows, a technology like plm becomes necessary to assist information retrieval. to assess the validity of this supposition we need a way to measure the extent of collaboration. the paper illustrates an original approach to the analysis of the management processes of sme product data. using simple data-driven procedures, it is possible to forecast the impact of collaboration on sme organization without inferring the formal models otherwise available in the literature [4]. in detail, the proposed analysis evaluates the sme's readiness for plm implementation by matching the collaborative network with the functional organization of the enterprise.

2 product and process information in sme: dataflow

it is useful to distinguish between products and the processes used to develop products. innovative and competitive products increase the enterprise's market share, whilst efficient and robust industrial processes allow the enterprise to emerge over its competitors. as a consequence, the management of information on enterprise products and processes, in terms of data and metadata (table 1), has to be as efficient as possible along the whole product lifecycle.

table 1: enterprise information

product data                      processes
business metadata (step, pdm)     enterprise content management
engineering data (gps, step)      manufacturing planning

pdm/plm systems were born to control data proliferation in design activities; therefore their primary function is the control of product data in terms of integrity, protection and modification.
a second important function regards data availability: user-friendly, simultaneous and multi-point access to product data [20]. plm systems do not limit their function to a database repository, but actively support product data development by tracking design activities, maintaining the configuration of evolving products, and allowing re-use of the data in the development of new products [9]. the enlargement of the scope of product data, which are no longer the property of the design function but become capital of the whole enterprise, requires a redefinition of their accessibility and modifiability with regard to other it systems like erp. the redistribution of responsibilities for product data among enterprise functions represents the most innovative aspect of plm systems. obviously, it is also the most difficult aspect to address in plm implementation, and it explains the lateness of plm diffusion in smes. plm systems have an impact on the entire enterprise and require a structured organization where information flows, interfaces and decision makers are well identified. this is normal in large companies, but it is not a given in smes, where the same person often covers different functions and where decision makers are identified more on the basis of capital share than on the basis of organizational role and technical competence. the intrinsic complexity of plm systems and the informal organization of small companies represent a real difficulty for the implementation of plm in smes [1].

2.1 sme organizational model

the technical literature widely illustrates case studies related to the implementation of plm in large and well-structured organizations.
unfortunately, few experiences are available on plm implementation in smes, where the informal organizational structure does not allow the application of standard methodologies based on the development of a detailed ontological model (sap, uml, ...) for an efficient description of enterprise processes [8], [17], [25]. a preliminary analysis of enterprise organization should rely on the generic product lifecycle functions [2] schematized in figure 1:

• direction: the overall project management; this function is transversal to the others.
• design concept: an idea for a new product or, more often, a product design enhancement is recognized, based on market knowledge.
• market demand analysis: the manufacturer studies the need for the new product design and estimates demand and the feasibility of meeting demand.
• engineering: design engineers create the product design using all available information from the plm system, including after-market factors, manufacturability data, customers' needs/preferences, and more.
• sourcing: procurement carries out the preliminary work required to acquire parts, materials, components, equipment, and anything else required to manufacture the product.
• production: the product is built to the design specifications established by engineering, using parts and materials acquired by sourcing. conformance to specification is checked through quality control/assurance or process control methodologies.
• distribution: the product is shipped either to distributors, who store it until an order is received from a customer, or directly to the final customer.
• after market: products are maintained, serviced, or repaired under warranty or as a value-added service. using a unique repository of after-market data ensures that after-market factors are taken into account in subsequent design projects, increasing the value of the product to the customer.
figure 1: product lifecycle analysis (from aberdeen [1]).

the experimental data used in the paper were collected from an sme which designs, manufactures, installs and maintains industrial choppers all around the world. the enterprise structure, from a functional point of view, adheres to the proposed model.

2.2 sme investigation procedure

there is a widespread commonplace saying that every sme works without any notable organization. actually, the organization exists but is usually not explicitly formalized: it is a natural organization based on significance relationships and spontaneous collaboration. therefore, it makes no sense to look for classical organization devices, like organization charts, information systems, or team management tools (pert, gantt) [23]. there are four basic principles which make the organization of an sme efficient:

• process-based cooperation, without a fixed separation of tasks.
• extended sharing of knowledge, unfortunately transmitted in an informal way.
• small teams with continuous communication and interchange of information.
• potential for outside development by having recourse to networks of enterprises.

these same principles represent the hardest obstacle to an efficient description of the enterprise's data, functions, processes and related managing tools. the researchers adopt a simple data-driven approach focused on the investigation of enterprise documents in order to overcome the obstacle. a set of questionnaires submitted to enterprise people is used to collect information on produced and consulted documents, thus providing an unbiased, objective description of the true organization, processes and hierarchies operating in the enterprise.
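the questionnaire entries described above — which documents a person produces and consults, and in which format each is kept — can be captured in a small record type. the field and document names below are illustrative assumptions, not the authors' schema:

```python
from dataclasses import dataclass, field

# Illustrative record type for one questionnaire entry (names are assumed,
# not the authors' schema): a document, the enterprise function responsible
# for it, the documents it consults, and its storage format.
@dataclass
class DocumentRecord:
    name: str
    function: str                                  # producing enterprise function
    consults: list = field(default_factory=list)   # names of referenced documents
    fmt: str = "paper"                             # e.g. "ms word", "excel", "paper"

# hypothetical entries in the spirit of the case study
records = [
    DocumentRecord("market opportunities analysis", "marketing", [], "excel"),
    DocumentRecord("target selling price", "marketing",
                   ["market opportunities analysis"], "excel"),
    DocumentRecord("prototype targets", "direction",
                   ["target selling price"], "ms word"),
]
```

records of this shape carry exactly the information the dlsm and dism constructions need: dependence pairs and formats.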
the questionnaires play a fundamental role in diminishing the noise involuntarily introduced by the interviewed people: the focus on documents and their management, and not on sensitive data regarding the enterprise organization, defines the search field and the required information. the researchers submitted questionnaires to enterprise personnel during the development of a new project, transforming the enterprise into a living laboratory where the researchers interacted with employees while they filled in the questionnaires, observed the product development process, and analyzed the documents, identifying their dependence relationships as well as their formats.

2.3 sme information reorganization

several operations applied to the information collected by the questionnaires provide a more structured description of the enterprise and its organization, functions, processes and hierarchies:

• analysis and evaluation of plm functionalities in the enterprise context;
• analysis and definition of product data: quantity, relationships, applicability, supporting media;
• definition of enterprise functions and analysis of their role in product data management;
• analysis of the enterprise's acquaintance with informatics;
• development of the dlsm (data lifecycle simulation matrix), representing the lifecycle of product data and their dependencies within the enterprise functions;
• development of the dism (data interchange simulation matrix), representing the interfaces used to communicate the product data among the different functions;
• creation of a metric to evaluate the enterprise's position, according to technological and organizational aspects, against best practice.

the dlsm and dism matrices provide a complete picture of product data management across the enterprise functions and along the product lifecycle. on this basis it is possible to evaluate the organizational and informational structure by detailing the interactions with each enterprise function.
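the single-linkage agglomerative clustering mentioned in the abstract, used to make collaboration links among documents explicit, can be sketched in pure python. the jaccard distance on each document's dependency set is an illustrative assumption; the paper does not prescribe a particular dissimilarity:

```python
# Hedged sketch of single-linkage agglomerative clustering over documents.
# Distance: Jaccard dissimilarity between dependency sets (an assumption).

def jaccard_distance(a, b):
    """1 - |A ∩ B| / |A ∪ B| between two dependency sets."""
    union = a | b
    return 1.0 if not union else 1.0 - len(a & b) / len(union)

def single_linkage(items, deps, threshold):
    """Repeatedly merge the two clusters whose closest member pair is
    nearest, while that distance stays within the threshold."""
    clusters = [{name} for name in items]

    def dist(c1, c2):
        return min(jaccard_distance(deps[x], deps[y]) for x in c1 for y in c2)

    while len(clusters) > 1:
        d, i, j = min((dist(c1, c2), i, j)
                      for i, c1 in enumerate(clusters)
                      for j, c2 in enumerate(clusters) if i < j)
        if d > threshold:
            break
        clusters[i] |= clusters[j]
        del clusters[j]
    return clusters

# toy example: documents and the documents they reference (hypothetical)
deps = {"budget": {"strategy"}, "prices": {"strategy"}, "strategy": set(),
        "cnc program": {"tooling"}, "tooling": set()}
clusters = single_linkage(deps, deps, threshold=0.5)
# "budget" and "prices" share identical dependencies, so they merge
```

the single-linkage criterion (minimum over member pairs) is what lets chains of pairwise-close documents agglomerate into one cluster, which suits the spontaneous, chain-like collaboration links the paper describes.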
moreover, by analyzing the dlsm and dism matrices it is possible to extrapolate useful indicators which address the reorganization of the enterprise.

3 reorganizing the data: the dlsm and dism matrices

the core of the methodology is the building of the dlsm (data lifecycle simulation matrix), a square matrix containing a complete description of the enterprise organization based on its documents. the dlsm matrix represents the dependence relationships among the data/documents inside the different enterprise functions. the rows of the dlsm matrix contain the enterprise documents, classified according to the enterprise function responsible for their production and maintenance. the columns of the dlsm matrix contain the same documents, presented in the same order. the dependence relationships among documents are made explicit by activating the cells at the intersection of the row and of the column corresponding to the associated documents. when a cell is activated, the document along the column makes use of / refers to the document identified in the row. a formal description of the dlsm matrix is:

dlsm(i, ·) = {enterprise documents (source)}
dlsm(·, j) = {enterprise documents (access)}                                 (1)
dlsm(i, j) = 1 when document i is accessed by document j, 0 otherwise.

figure 2 shows a pattern view of the 160x160 sparse dlsm matrix representing the documents required to describe the product lifecycle within the analysed case study. the grey areas placed on the matrix diagonal highlight the enterprise functions: direction, marketing, administration, design, planning, supply, manufacturing and assembling, inventories, after-sale services. figure 3 shows the upper left corner of the dlsm matrix, including some documents produced by the two functions direction and marketing. some interesting results can be extracted from simple operations performed on the dlsm matrix.
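the indicator definition in (1) maps directly onto a boolean matrix; a minimal python sketch follows, with illustrative document names (the case study's matrix covers 160 documents):

```python
# Minimal sketch of building a DLSM-style boolean matrix from document
# dependence pairs, following definition (1). Document names are illustrative.
docs = ["strategy", "target prices", "budget", "bill of materials"]
idx = {d: i for i, d in enumerate(docs)}

# pairs (accessed, accessing): the second document makes use of the first
depends = [("strategy", "target prices"),
           ("strategy", "budget"),
           ("target prices", "bill of materials")]

n = len(docs)
dlsm = [[0] * n for _ in range(n)]
for src, user in depends:
    # dlsm[i][j] = 1 iff document i is accessed by document j
    dlsm[idx[src]][idx[user]] = 1
```

ordering the document list by owning function reproduces the block-diagonal pattern the paper observes: intra-function dependencies cluster around the diagonal.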
the dism (data interchange simulation matrix) is a square matrix having the same structure as the dlsm, but now a cell is activated when the format and the support used to save two inter-dependent documents are the same. the core of the method is the transformation of the dlsm into the dism. for every intersection activated in the dlsm, the document formats and supports are compared: if they agree (i.e. both ms word format, or excel format, or paper support, ...), the cell in the dism is activated. obviously, there are far fewer activated cells in the dism than in the dlsm. figure 4 represents the upper left corner of the dism matrix, including some documents produced by the two functions direction and marketing.

[figures 2-4 are not reproduced here; the column labels of the matrix figures list the enterprise documents, e.g. strategy and market analyses, target specifications, prototype and product drawings, quality control procedures, production plans, supplier and inventory records.]
v o la v o ra z io n e e s te rn a c o d ic e d i p re li e v o v e n d it a c o d ic e r e s o c o o rd in a te d i s to c c a g g io d i o g n i s in g o lo p a rt ic o la re a n a li s i a b c s to ri c o i n te rv e n ti s u i p ro d o tt i s im il i, g ià c o m m e rc ia li z z a ti s to ri c o i n te rv e n ti s u l p ro d o tt o d u ra n te i l p e ri o d o d i a tt iv it à a rc h iv io m o d if ic h e s u i p ro d o tt i s im il i, g ià c o m m e rc ia li z z a ti a rc h iv io m o d if ic h e s u l p ro d o tt o d u ra n te i l p e ri o d o d i a tt iv it à a rc h iv io d if e tt o s it à s u i p ro d o tt i s im il i, g ià c o m m e rc ia li z z a ti a rc h iv io d if e tt o s it à s u l p ro d o tt o d u ra n te i l p e ri o d o d i a tt iv it à s ta ti s ti c h e d e i g u a s ti s u i p ro d o tt i s im il i, g ià c o m m e rc ia li z z a ti s ta ti s ti c h e d e i g u a s ti s u l p ro d o tt o d u ra n te i l p e ri o d o d i a tt iv it à a rc h iv io d e i re c la m i in e re n ti a i p ro d o tt i s im il i, g ià c o m m e rc ia li z z a ti r e p o rt d e i re c la m i in e re n ti a l p ro d o tt o a rc h iv io o s s e rv a z io n i s u i p ro d o tt i s im il i, g ià c o m m e rc ia li z z a ti r e p o rt o s s e rv a z io n i s u l p ro d o tt o ,d e i c li e n ti strategy: markets and competitors analysis market opportunities 1.0 analysis new concepts design 1.0 1.0 analysis target markets 1.0 analysis target prices 1.0 analysis target costs 1.0 1.0 analysis time to market 1.0 1.0 1.0 analysis economical risks 1.0 1.0 1.0 1.0 1.0 1.0 analysis funding sources budgeting allocation analysis prototype targets 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 clients requirements 1.0 1.0 1.0 1.0 reporting client interviews key performance competitor products 1.0 target selling price 1.0 1.0 1.0 demand forecast 1.0 1.0 1.0 historical economical analysis economic forecast 1.0 1.0 1.0 1.0 1.0 products portfolio 1.0 1.0 1.0 1.0 1.0 1.0 1.0 
1.0 1.0 1.0 1.0 market share external logistics price policies products guarantees application 1.0 1.0 1.0 1.0 products selling plan 1.0 portfolio actual orders offering, sales product agreement mto: configuration, time to market advertising materials products economical performance historical economic analysis economic analytical report 1.0 1.0 1.0 1.0 1.0 1.0 economic evaluation of prototypes analysis product costs sensitivity analysis product billing suppliers payments projects historical library projects currently active active projects status qfd analysis for active projects 1.0 1.0 1.0 target functionalities / requirements 1.0 1.0 1.0 1.0 1.0 collection of conceptual alternative solutions 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 bill of materials of conceptual solutions 1.0 1.0 1.0 1.0 manufacturing bom of conceptual solutions 1.0 engineering bom of conceptual solutions 1.0 1.0 functional characteristics of conceptual solutions 1.0 technical characteristics of conceptual solutions 1.0 design prototype 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 final technical specifications 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 final shaping specifications 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 final maintenance specification 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 final reliability specifications 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 final product lifecycle specifications 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 economic analysis stima economica ( di competenza ) dei concetti elaborati 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 piano di condivisione del prodotto 1.0 1.0 1.0 1.0 1.0 disegni 3d particolari del prototipo 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 disegni 2d particolari del prototipo 1.0 1.0 disegni 3d complessivo prototipo 1.0 1.0 disegni 2d complessivo prototipo 1.0 disegni esplosi del prototipo 1.0 materiali particolari prototipo 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 
definizione delle lavorazioni sul prototipo 1.0 1.0 1.0 definizione delle tolleranze particolari prototipo 1.0 1.0 1.0 1.0 1.0 archivio risultati dei test elenco test da effettuare 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 risultati test sul prototipo costi prototipo 1.0 1.0 1.0 1.0 1.0 disegni 3d particolari prodotto 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 disegni 2d particolari prodotto 1.0 1.0 disegni 3d complessivo prodotto 1.0 1.0 1.0 disegni 2d compessivo prodotto 1.0 disegni degli esplosi 1.0 elenco e descrizione modifiche disegni del prodotto 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 definizione delle tolleranze del prodotto codici particolari 1.0 1.0 1.0 codice prodotto denominazione particolari distinta materiali prodotto 1.0 1.0 1.0 1.0 1.0 1.0 dimensioni e peso dei grezzi 1.0 1.0 1.0 dimensione e peso dei finiti 1.0 1.0 1.0 caratteristiche particolari commerciali 1.0 1.0 1.0 1.0 1.0 1.0 trattamenti termici/gavanici/etc. 1.0 1.0 1.0 1.0 1.0 1.0 distinta base elenco test da effettuare sul prodotto ( pre serie ) 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 risultati test sul prodotto ( pre serie ) procedura controllo qualità interna ( particolari / assemblati / prodotto ) 1.0 1.0 1.0 1.0 1.0 1.0 procedure controllo qualità in accettazione ( particolari / assemblati / 1.0 1.0 1.0 1.0 1.0 1.0 procedura controllo qualità presso il fornitore schemi elettrici 1.0 certificazioni del prodotto 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 normative vigenti schema assemblaggio prodotto 1.0 1.0 1.0 1.0 1.0 1.0 1.0 definizione delle configurazioni 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 manuali d'uso e manutenzione 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 distinta e caratteristiche delle macchine utensili da utilizzare 1.0 1.0 1.0 1.0 1.0 1.0 selezione/progettazione delle attrezzature 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 utensili da utilizzare per le lavorazioni 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 parametri di lavorazione 1.0 1.0 1.0 1.0 1.0 1.0 cicli di lavorazione 
1.0 1.0 1.0 1.0 1.0 1.0 1.0 programmi per i cnc 1.0 1.0 1.0 1.0 definizione dei costi di lavorazione 1.0 1.0 1.0 1.0 1.0 1.0 1.0 risultati analisi "make or buy" 1.0 1.0 1.0 costi di produzione 1.0 1.0 budget di produzione 1.0 1.0 1.0 piano principale di produzione ( mps ) 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 pianificazione dei fabbisogni ( tecniche push o pull 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 programmazione operativa ( gantt, etc. ) 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 portafoglio fornitori codice fornitore anagrafica fornitore preventivi fornitori 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 ordine di acquisto 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 documentazioni tecniche dei fornitori costo del particolare commerciale ( concordato con il fornitore ) 1.0 1.0 1.0 1.0 costo del particolare non commerciale ( concordato con il fornitore ) 1.0 1.0 1.0 1.0 lotto minimo di acquisto del particolare ( concordato con il fornitore ) data di consegna riga d'ordine lead time fornitore condizioni di pagamento condizioni per il trasporto condizioni di garanzia eventuali penalità per ritardata consegna tipologia d'imballo sequenza fornitori sui particolari riepilogo solleciti distinta documenti inviati al fornitori analisi tempi di assemblaggio piano assemblaggio commesse piano assemblaggio prodotto scheda avanzamento lavoro scheda rilevamento difettosità bolla di carico bolla di scarico bolla di reso documento di prelievo interno documento di versamento interno quantità disponibile quantità impegnata valore quantità disponibile status del particolare ( controllato-non controllato ) status del materiale ( grezzo-semilavoratolavorato-commerciale ) codice di prelievo lavorazione interna codice di prelievo lavorazione esterna codice di prelievo vendita codice reso coordinate di stoccaggio analisi abc storico interventi sui prodotti simili, già commercializzati storico interventi sul prodotto archivio modifiche sui prodotti simili, 
già commercializzati archivio modifiche sul prodotto archivio difettosità sui prodotti simili, già commercializzati archivio difettosità sul prodotto statistiche dei guasti sui prodotti simili, già commercializzati statistiche dei guasti sul prodotto archivio dei reclami inerenti ai prodotti simili, già commercializzati report dei reclami inerenti al prodotto archivio osservazioni sui prodotti simili, già commercializzati report osservazioni sul prodotto dei clienti dlsm figure 2: overview of the complete dlsm. s tr a te g y : m a rk e ts a n d c o m p e ti to rs a n a ly s is m a rk e t o p p o rt u n it ie s a n a ly s is n e w c o n c e p ts d e s ig n a n a ly s is t a rg e t m a rk e ts a n a ly s is t a rg e t p ri c e s a n a ly s is t a rg e t c o s ts a n a ly s is t im e t o m a rk e t a n a ly s is e c o n o m ic a l ri s k s a n a ly s is f u n d in g s o u rc e s b u d g e ti n g a ll o c a ti o n a n a ly s is p ro to ty p e t a rg e ts c li e n ts re q u ir e m e n ts r e p o rt in g c li e n t in te rv ie w s k e y p e rf o rm a n c e p ro d u c ts t a rg e t s e ll in g p ri c e strategy: markets and competitors analysis market opportunities 1 analysis new concepts design 1 1 analysis target markets 1 analysis target prices 1 analysis target costs 1 1 analysis time to market 1 1 1 analysis economical risks 1 1 1 1 analysis funding sources budgeting allocation analysis prototype targets 1 1 1 1 1 1 1 1 1 clients requirements 1 1 1 reporting client interviews key performance products 1 target selling price 1 1 d i r e c t i o n dlsm m a r k e t i n g figure 3: sample of a portion of dlsm including some documents of the function "direction". 14 d. antonelli, p. chiabert enterprise function "direction". 
For every intersection activated in the DISM, the compatibility of the adopted applications and their publication on the firm's network are verified, and the sums of the occurrences over the rows and the columns are computed in order to assess the degree of integration of the product data flow.

[Figure 4 is a small matrix excerpt, not recoverable from the extraction; its rows include the "Direction" documents from "Strategy: markets and competitors analysis" to "Analysis economical risks".]

Figure 4: Sample of a portion of the DISM including some documents of the function "Direction".

The activated cells in the DISM play a relevant role from a PLM perspective when the document format and its support are electronic. Taking into account only the interfaces based on an electronic format, the distance between the DISM and the DLSM indicates the investments in the informational infrastructure that the enterprise needs.

3.1 Analysing the DLSM matrix

The sums α and β of the occurrences along the rows and the columns of the DLSM make it possible to quantify the functional dependencies among the documents in the information flow. Table 2 presents the α and β values for the document "Analysis target prices", produced by the enterprise function "Direction":

Table 2: Dependencies of the "Analysis target prices" document.

Document       α   β
Price target   1   7

The document is consulted by only one other document ("Analysis of market opportunities", also produced by the function "Direction") and requires the existence of seven other documents belonging to the Direction, Marketing, and Design functions.
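The α/β computation above can be sketched on a toy matrix. This is a minimal illustration with hypothetical data, and the matrix orientation (rows = consulting document, columns = consulted document) is an assumption of this sketch, not stated in the paper:

```python
# Hypothetical 4-document DLSM (not the paper's data), assuming
# DLSM[i][j] = 1 means "document i requires (consults) document j".
DLSM = [
    [0, 1, 1, 1],   # doc 0 requires docs 1, 2, 3
    [0, 0, 1, 0],   # doc 1 requires doc 2
    [0, 0, 0, 0],   # doc 2 requires nothing
    [0, 0, 1, 0],   # doc 3 requires doc 2
]
n = len(DLSM)

# beta[i]: how many documents document i requires (row sum);
# alpha[j]: how many other documents consult document j (column sum).
beta = [sum(row) for row in DLSM]
alpha = [sum(DLSM[i][j] for i in range(n)) for j in range(n)]
```

Under this convention, a document like "Analysis target prices" in Table 2 would show β = 7 (it requires seven upstream documents) and α = 1 (one other document consults it); a document with α = 0 is never consulted by any other.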
It is noteworthy that the same document can be shared among different functions inside the enterprise; therefore another important parameter is the percentage of occurrences of every document (i) inside every function (j), associated with the matrix DLSM(i,j). As an example, Figure 5 shows the percentage use of all the documents generated by the function "Direction". In the collaboration network the function "Direction" behaves as a supplier, and the other enterprise functions are then its clients. The analysis can profitably be reversed, as in Figure 6, which shows the percentage use of documents generated by the other functions within the function "Direction", which now acts as a client of the other functions. Different analyses of the DLSM structure bring unexpected outcomes to light.

Introducing Collaborative Practices in Small Medium Enterprises 15

[Figures 5 and 6 are charts whose panels are not recoverable from the extraction; only the captions and the function labels (Analysis, Engineering, Design, Production, Assembly, Sourcing, Direction) survive.]

Figure 5: Use of Direction documents by the different enterprise functions, including Direction (labels represent the destinations).

Figure 6: Use of enterprise data by the Direction function (labels represent the origins).

4 Clustering the DLSM: multivariate analysis

The scope of this section is to illustrate the motivations behind the choice of the clustering method used to group the project documents. Clustering of the documents is the main application of the DLSM matrix. The produced documents already have a taxonomy, each belonging to one of the product lifecycle functions, which correspond directly to the enterprise departments. Nevertheless, a typical SME structure is by far different, and usually simpler, than what appears from the organizational chart. It is therefore worthwhile to test whether the clustering of the documents based on their correlations matches the official taxonomy.
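The supplier/client percentages of Figures 5 and 6 can be sketched as follows. The matrix, the document-to-function mapping, and the function names are hypothetical illustrations, not the paper's data:

```python
# Hypothetical producer function of each document, and a DLSM where
# DLSM[i][j] = 1 means "document i consults document j".
producer_of = ["direction", "direction", "marketing", "design"]
DLSM = [
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
]

# Consultations of "direction"-produced documents, grouped by the
# function of the consulting document (the "client" functions).
counts = {}
for i, row in enumerate(DLSM):          # i = consulting document
    for j, v in enumerate(row):         # j = consulted document
        if v and producer_of[j] == "direction":
            f = producer_of[i]
            counts[f] = counts.get(f, 0) + 1

total = sum(counts.values())
percent = {f: 100.0 * c / total for f, c in counts.items()}
```

The resulting `percent` dictionary is the kind of breakdown a Figure 5-style chart would display, with "Direction" as the supplier and the other functions as its clients; swapping the roles of `i` and `j` in the test gives the client view of Figure 6.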
We remark that the clustering performed here has nothing to do with the well-known study field of information science that goes under the name of document classification. The latter can be defined in the following way [24]: "Document clustering is a fundamental operation used in unsupervised document organization, automatic topic extraction, and information retrieval. It provides a structure for organizing large bodies of text for efficient browsing and searching". In our case, document clustering based on the DLSM matrix of correlations is only a pretext to analyse the enterprise organization from an unbiased point of view. What it has in common with ordinary document classification is the choice of unsupervised clustering methods. As there are reasonable doubts that the real structure of the collaboration network matches the enterprise structure, it is convenient not to use any knowledge about the existing organization and to leave the classification of the documents to an unsupervised method: statistical cluster analysis.

Cluster analysis investigates groupings by minimizing a suitable distance measure among the data. The distance function should give a way to measure the similarity between two documents. The Euclidean distance is obviously improper for measuring the similarity of the correlations among documents. For the research aims, the best suited is the Hamming distance, which calculates the percentage of attributes that differ between two objects. Given the DLSM, the element DLSM(i,j) represents the attribute j of the document i. The number of documents and of attributes is the same, n. For every two documents r, s the Hamming distance $d_H$ is:

$d_H(r,s) = \frac{1}{n}\sum_{j=1}^{n} \left| DLSM(r,j) - DLSM(s,j) \right|$   (2)

Hierarchical clustering is the most appropriate technique for working with ordinal metrics.
It creates a cluster tree, which is not a single set of clusters but rather a multi-level hierarchy, where the clusters at one level are joined into clusters at the next higher level. The most appropriate level of clustering is found by choosing a threshold on the inconsistency coefficient of each link of the hierarchical cluster tree. There are different agglomerative techniques [3], among which is the nearest-neighbour, or single-linkage, technique: two clusters are joined on the basis of the minimum distance between the two nearest elements among all the existing clusters. If cluster A is made of documents $a_i$ (i = 1,...,m) and cluster B is made of documents $b_j$ (j = 1,...,r), the distance between the two clusters is:

$d_{SL} = \min\left( d_H(a_i, b_j) \right) \quad \forall i, \forall j$   (3)

Single linkage is seldom selected among the available techniques because it has the drawback of creating elongated clusters (in a spatial visualization of the distance measure) when there is a long chain of documents, each one differing from the next by only a few attributes. In the present study, however, the attributes are the dependencies of the documents, and it is reasonable to expect that every document is the starting point for a successive document. Before applying the clustering, a pre-processing step was executed on the DLSM to reduce the search space [5]. After a binary sorting, the empty rows and columns were excluded from the clustering. These represent documents which have been produced independently of all the others. It is possible that most of them are useless documents, requested only by internal bureaucratic procedures. The result of applying the hierarchical clustering to the modified DLSM is the dendrogram represented in Figure 7. The dashed line represents the inconsistency threshold chosen to group the documents into ten clusters, which correspond to the nine divisions actually present in the enterprise.

Figure 7: Dendrogram of the hierarchical tree for the DLSM matrix.
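Equations (2) and (3) can be sketched together in a few lines. This is a naive, hypothetical illustration: the paper uses the inconsistency coefficient of the dendrogram links to cut the tree, while the sketch below uses a plain distance threshold as a stand-in, and the four document rows are invented data:

```python
def hamming_distance(r, s):
    """Eq. (2): fraction of the n binary attributes on which two
    document rows of the DLSM differ."""
    return sum(1 for a, b in zip(r, s) if a != b) / len(r)

def single_linkage(dist, threshold):
    """Naive agglomerative single-linkage clustering (Eq. 3): keep
    merging the two clusters whose nearest members are closest, and
    stop when the minimum inter-cluster distance exceeds `threshold`
    (a stand-in for the paper's inconsistency-coefficient cutoff)."""
    clusters = [[i] for i in range(len(dist))]
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Eq. (3): distance between the two nearest members.
                d = min(dist[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        if d > threshold:
            break
        clusters[a] += clusters[b]
        del clusters[b]
    return clusters

# Four hypothetical document rows (binary dependency attributes):
rows = [
    [0, 1, 1, 0, 1],
    [0, 1, 0, 0, 1],
    [1, 0, 0, 1, 0],
    [1, 0, 1, 1, 0],
]
D = [[hamming_distance(r, s) for s in rows] for r in rows]
groups = single_linkage(D, threshold=0.5)
```

With these rows, documents 0-1 and 2-3 differ in only one attribute within each pair but in most attributes across pairs, so the sketch recovers two clusters; raising the threshold merges everything into one, mimicking a higher cut of the dendrogram.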
A number of considerations can be drawn from the result of this clusterization:

• one cluster (document indices from 1 to 67) is by far larger than the others and includes documents belonging to nearly all the divisions;
• with the exception of the fourth cluster (document indices from 71 to 74), all the others contain only one document;
• these one-document clusters have a high inconsistency value, so they cannot be attributed to a larger cluster by changing the agglomerative technique.

The first consideration describes a positive situation: the majority of the documents have strong reciprocal dependencies. A possible explanation is that the SME implicitly applies a concurrent engineering strategy in the management of new projects. As a matter of fact, every project is carried out by an unstructured working team, usually made up of a few persons if not just one, who are responsible for the production of the key documents used by all the divisions to produce their own documents. The second and third considerations indicate the presence of dead-end documents, i.e. documents that get their data from other documents but are no longer used to produce new ones. This is not necessarily an issue, as it seems natural that every division produces a final report on the project done. The fourth cluster instead represents a small number of documents totally independent from the others. The issue is not the presence of the cluster but the high value of the inconsistency index (measured by the length of the dendrogram branches), which denotes an excessive distance between this cluster and every other. From a PLM perspective, the results of the DLSM clustering suggest the need for investments in the sharing of documents. Under the hypothesis of a working team operating in a concurrent engineering framework, it is essential to increase the accessibility of the documents to all the enterprise functions.
In an SME, the issues related to the sharing of product data are strictly tied to the possibility of accessing the different databases and data formats. Inside the large majority of SMEs, the integration of the enterprise documentation is not considered a critical issue [11].

5 Evaluating the attitude to change

The questionnaires provide a picture of product data management across the enterprise functions and along the product lifecycle. Such information can be reorganized according to the DISM in order to gain a better knowledge of the enterprise organization. On this basis it is possible to evaluate the impact of PLM on the informational structure by detailing the effects on each enterprise function. The evaluation of the attitude to collaborative working in the CE context should be expressed in terms of costs and benefits, which depend on the enterprise's readiness and propensity to accept innovation [19]. The test case focused on the analysis of two attitude indexes: the attitude to organizational changes and the attitude to informational changes. The attitude to innovation assumes different characteristics depending on the enterprise function. The authors extracted from the technical literature a set of criteria to be applied to the most relevant functions, as identified on the basis of the DLSM and the DISM. Different evaluation scales and different weights have been applied according to the nature of each criterion [17]. The results of the evaluation of both the organizational and the informational indexes for the most relevant enterprise functions (P1 Marketing, P2 Design, and P3 Supply) are summarized in Figure 8, which gives a pictorial description of the enterprise's global attitude to innovation.

Figure 8: Global attitude index for the test case.

6 Conclusions

The paper focuses on the development of a method for the evaluation of enterprise readiness for collaborative product and process design, with particular attention to the implementation of PLM in SMEs.
Data collection relies on questionnaires focused on the document management process adopted in the enterprise. The questionnaires are submitted to the enterprise personnel, while the researchers contribute knowledge and analysis of the decisional processes. In order to evaluate the amount of business process management required to upgrade the enterprise organisation to the PLM pre-requisites, the authors developed an analysis tool consisting of a classification of the product data flows and their supporting formats. Subsequent refinements of the gathered information make it possible to benchmark the enterprise's propensity for the implementation of PLM. The simple approach proposed in the paper seems to solve some of the problems related to the introduction of information technology systems in SMEs: insufficient awareness of organizational issues, insufficient involvement of end users, and inadequate training of users.

Bibliography

[1] Aberdeen, "PLM: It's for Any Manufacturer Striving for Product Excellence", An Executive White Paper, Aberdeen Group, Inc., www.aberdeen.com, 2002.
[2] R.W. Bagshaw, S.T. Newman, "Structured Approach to the Design of a Production Data Analysis Facility. Part 1: Conceptual Design and Information Requirements", Int. J. Prod. Res., Vol. 39, No. 10, pp. 2121-2141, 2001.
[3] A. Cevahir, "Clustering of Correlated Documents into a Designated Number of Clusters: A Practical Approach", www.cs.bilkent.edu.tr, 2005.
[4] D. Chen, G. Doumeingts, "Developing a Theory of Design: A Tentative Approach Using Mathematics", Studies in Informatics and Control, Vol. 11, No. 1, 2002, ICI, Bucharest.
[5] G. Czibula, G. Serban, "Hierarchical Clustering Based Design Patterns Identification", Int. J. of Computers, Communications & Control, Vol. III, pp. 248-252, 2008, CCC Publications.
[6] Datamation Limited, "Understanding Product Lifecycle Management", Rep. No. PLM-11, Rev. No. 1.0, September 2002, Cambridge, UK.
[7] European Commission, SME and Cooperation, Observatory of European SMEs, 2003, Enterprise Publications, Brussels.
[8] B. Eynard, T. Gallet, P. Nowak, L. Roucoules, "UML Based Specifications of PDM Product Structure and Workflow", Computers in Industry, Vol. 55, pp. 301-316, 2004.
[9] B. Fagerström, L. Olsson, "Knowledge Management in Collaborative Product Development", Systems Engineering, Vol. 5, No. 4, pp. 274-285, 2002.
[10] F.G. Filip, "Decision Support and Control for Large-Scale Complex Systems", Annual Reviews in Control, Vol. 32, No. 1, pp. 61-70, 2008.
[11] F. Franceschini, Advanced Quality Function Deployment, St. Lucie Press, Boca Raton, 2002.
[12] H. Kaschel, L.M. Sánchez y Bernal, "Importance of Flexibility in Manufacturing Systems", Int. J. of Computers, Communications & Control, Vol. I, No. 2, pp. 53-60, 2006, CCC Publications.
[13] J. Krauth, "Introducing Information Technology in Small and Medium Sized Enterprises", Studies in Informatics and Control, Vol. 8, No. 1, 1999, ICI, Bucharest.
[14] A. Kusiak, Concurrent Engineering: Automation, Tools, and Techniques, Wiley, USA, 1992.
[15] T.W. Malone, K. Crowston, "The Interdisciplinary Study of Coordination", ACM Computing Surveys (CSUR), Vol. 26, No. 1, pp. 87-119, 1994.
[16] S.Y. Nof, G. Morel, L. Monostori, A. Molina, F. Filip, "From Plant and Logistics Control to Multi-Enterprise Collaboration", Annual Reviews in Control, Vol. 30, No. 1, pp. 55-68, 2006.
[17] K. Schmid, I. John, "A Customizable Approach to Full Lifecycle Variability Management", Science of Computer Programming, Vol. 54, pp. 259-284, 2004.
[18] M.G. Singh, A. Titli, Systems Decomposition, Optimisation and Control, Pergamon Press, 1988.
[19] R. Sudarsan, S.J. Fenves, R.D. Sriram, F. Wang, "A Product Information Modelling Framework for Product Lifecycle Management", Computer Aided Design, Vol. 37, pp. 1399-1411, 2005.
[20] J.
Stark, Product Lifecycle Management: 21st Century Paradigm for Product Realisation, 2005, Springer-Verlag, London, UK.
[21] S.N. Talukdar, "Collaboration Rules for Autonomous Software Agents", Decision Support Systems, Vol. 24, pp. 269-278, 1999.
[22] S. Tichkiewitch, D. Brissaud, "Co-ordination Between Product and Process Definitions in a Concurrent Engineering Environment", Annals of the CIRP, Vol. 49, pp. 75-78, 2000.
[23] A. Tiwana, B. Ramesh, "A Design Knowledge Management System to Support Collaborative Information Product Evolution", Decision Support Systems, Vol. 31, pp. 241-262, 2005.
[24] C. Xiaohui, G. Jinzhu, T.E. Potok, "A Flocking Based Algorithm for Document Clustering Analysis", J. of Systems Architecture, Vol. 52, pp. 505-515, 2006, Elsevier.
[25] S. Zina, M. Lombard, L. Lossent, C. Henriot, "Generic Modeling and Configuration Management in Product Lifecycle Management", Int. J. of Computers, Communications & Control, Vol. I, No. 4, pp. 126-138, 2006, CCC Publications.

Dario Antonelli (Italy, 1966) holds an M.S. degree in mechanical engineering from the Politecnico di Torino (1990). He worked at the Fiat Research Center until 1992 and is currently an associate professor at the Department of Production Systems and Economics of the Politecnico di Torino. His scientific activity is mainly related to numeric finite element simulation of metalworking processes, to the experimental identification of process parameters, and to supply chain management.

Paolo Chiabert (b. June 28, 1963) holds an M.Sc. degree in electronic engineering from the Politecnico di Torino (1993). He was an IBM consultant (1994) and a researcher in design and methods of industrial engineering at the Politecnico di Torino (1995-2003), and has been an associate professor in technology and manufacturing systems since 2004. Within the product lifecycle management perspective, his research interests range from design methods to manufacturing technologies, production planning and control, and quality management.
Since 1997 he has been the Italian expert in the ISO Technical Committee 213 "Geometrical Product Specifications and Verification", and he is a member of the GPS technical commission at UNI.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 2, pp. 149-158

Blind Steganalysis: Estimation of Hidden Message Length

Sanjay Kumar Jena, G.V.V. Krishna

Abstract: Steganography is used to hide the occurrence of communication. Discovering and rendering useless such covert messages is the art of steganalysis. The importance of techniques that can reliably detect the presence of secret messages in images is increasing, as images can hide a large amount of malicious code that could be activated by a small Trojan-horse type of virus, and also for the tracking of criminal activities over the Internet. This paper presents an improved blind steganalysis technique. The proposed algorithm reduces the initial bias and estimates the LSB embedding ratio by constructing equations from the statistics of the difference image histogram. Experimental results show that this algorithm is more accurate and reliable than the conventional difference image histogram method. It outperforms other powerful steganalysis approaches for embedding ratios greater than 40% and is comparable with the RS steganalysis technique for shorter hidden message lengths.

Keywords: steganography, steganalysis, hidden message extraction

1 Introduction

Steganography is the art of passing information through apparently innocent files in such a manner that the very existence of the message is unknown. The term steganography literally means "covered writing" in Greek. The innocent files can be referred to as cover text, cover image, or cover audio, as appropriate. After embedding the secret message, the file is referred to as the stego-medium. A stego-key is used to control the hiding process so as to restrict detection and/or recovery of the embedded data.
While cryptography is about protecting the content of messages (their meaning), steganography is about hiding the message so that intermediate persons cannot see it at all. Historically, steganography has been a form of security through obscurity, where the security lies in the fact that only the sender and the receiver know the method by which the message is hidden. This violates Kerckhoffs's principle, which states that the security should lie in the key alone. Steganography can be either "linguistic steganography" or "technical steganography" [1]. The ancient techniques that hide messages physically are called technical steganographic systems; they include microdots, tattoos, invisible inks, and semagrams. Recent techniques belong to linguistic steganography: they hide the message in cover images in digital form. Steganalysis is the process of detecting the existence of steganography in a cover medium and rendering it useless. The current trend in steganalysis [4] seems to suggest two extreme approaches: (a) little or no statistical assumptions are made about the image under investigation, and the statistics are learnt from a large database; (b) a parametric model is assumed for the image, and its statistics are computed for steganalysis detection. The messages embedded into an image are often imperceptible to human eyes, but there exist some detectable artifacts in the images, depending on the steganographic algorithm used [2,5]. The steganalyst uses these artifacts for the detection of steganography. By far the most popular and frequently used steganographic method is least significant bit (LSB) embedding, which works by embedding message bits as the LSBs of randomly selected pixels. Several techniques for the steganalysis of images with LSB embedding exist. Fridrich and Goljan [6,7] proposed a dual-statistics detection method for LSB steganography, named the RS method, based on probability statistics in color or grayscale images.
the basic idea is that the lsb plane seems random in typical cover images, but to some extent the other 7 bit planes can predict it. this method is suitable for reliable detection of non-sequential steganography. pfitzmann and westfeld [8] introduced a method based on statistical analysis of pairs of values (povs) that are exchanged during message embedding. this method, which became known as the χ² attack, is quite general and can be applied to many embedding paradigms besides lsb embedding. it provides very reliable results when the message placement is sequential. fridrich et al. [9] developed a steganalytic method for detecting lsb embedding in 24-bit color images: the raw quick pairs (rqp) method. the method is based on analyzing close pairs of colors created by lsb embedding. it works reasonably well on condition that the number of unique colors in the cover image is less than 30 percent of the number of pixels. when the number of unique colors exceeds about 50 percent of the number of pixels, the results gradually become unreliable. this frequently happens for high-resolution raw scans and images taken with digital cameras stored in an uncompressed format. another disadvantage of the rqp method is that it cannot be applied to grayscale images. there are few papers in the field of detecting which pixels contain the hidden message. ian davidson and goutam paul [10] posed the hidden message location problem as outlier detection using probability/energy measures of images: pixels contributing the most to the energy calculations of an image are deemed outliers. though results for grayscale images are quite accurate, they are not as good as those for color images. the algorithm can be defeated if the steganography algorithm has knowledge of the probability/energy function or if the message is carefully embedded in the high-energy regions of an image.

copyright © 2006-2007 by ccc publications
the difference image histogram method proposed by t. zhang and x. ping [10] suffers from an initial bias. the proposed algorithm constructs the embedding-ratio estimate equations using the difference image histogram and reduces this initial bias. experimental results show that the new algorithm is more accurate than the conventional difference image histogram method and other steganalysis techniques. in the following section we review the principle of the difference image histogram method; in section 3 we describe the improved difference image histogram (idih) algorithm; section 4 shows the experimental results; and section 5 concludes the paper.

2 principles of difference image histogram

tao zhang and xijian ping introduced the difference image histogram method, which uses a measure of the weak correlation between successive bit planes to construct a classifier for discrimination between stego-images and cover images. considering the property of lsb steganography, the difference image histogram is used as a statistical analysis tool. the difference image is defined as

d(i, j) = i(i + 1, j) − i(i, j),   (1)

where i(i, j) denotes the value of the image i at position (i, j). zhang and ping found that there exists a difference between the difference image histograms of a normal image and of the image obtained after a flipping operation on the lsb plane. this fact is utilized to realize the steganalysis technique. to explain the details of the difference image histogram (dih) method, we need to define some notions. let i be the test image, with m × n pixels. the embedding ratio p is defined as the percentage of the embedded message length with respect to the maximum capacity. if the difference image histogram of an image is denoted by h_i, that of the image after flipping all bits in the lsb plane by f_i, and that of the image after setting all bits in the lsb plane to zero by g_i, then the following relationships hold between h_i, f_i and g_i.
h_{2i} = f_{2i} = a_{2i,2i} g_{2i},
h_{2i+1} = a_{2i,2i+1} g_{2i} + a_{2i+2,2i+1} g_{2i+2},   (2)
f_{2i+1} = a_{2i,2i−1} g_{2i} + a_{2i+2,2i+3} g_{2i+2},

in which a_{2i,2i+j} is defined as the transition coefficient from histogram g_i to h_i. when j = 0, 1, −1 then 0 < a_{2i,2i+j} < 1, otherwise a_{2i,2i+j} = 0, and they satisfy

a_{2i,2i−1} + a_{2i,2i} + a_{2i,2i+1} = 1.   (3)

figure 1: the transition diagram from g_i to h_i and f_i

starting from the approximate symmetry of the difference histogram about i = 0, we first get a_{0,1} ≈ a_{0,−1}; from the above equations (2) we obtain the following iterative formulas for calculating the transition coefficients for all positive integers i:

a_{0,1} = a_{0,−1} = (g_0 − h_0) / (2 g_0),
a_{2i,2i} = h_{2i} / g_{2i},   (4)
a_{2i,2i−1} = (h_{2i−1} − a_{2i−2,2i−1} g_{2i−2}) / g_{2i},
a_{2i,2i+1} = 1 − a_{2i,2i} − a_{2i,2i−1}.

assuming the embedded hidden message forms a random bit sequence, for a stego-image with the lsb plane fully embedded (i.e. p = 100%) the lsb plane is independent of the neighboring bit planes. therefore, for such stego-images we have a_{2i,2i−1} ≈ 0.25, a_{2i,2i} ≈ 0.5, a_{2i,2i+1} ≈ 0.25. the histogram value h_{2i+1} consists of two parts: a_{2i,2i+1} g_{2i} and a_{2i+2,2i+1} g_{2i+2}. statistical tests show that for natural images these two parts make an approximately equal contribution to h_{2i+1}, that is,

a_{2i,2i+1} g_{2i} ≈ a_{2i+2,2i+1} g_{2i+2}.   (5)

if α_i = a_{2i+2,2i+1} / a_{2i,2i+1}, β_i = a_{2i+2,2i+3} / a_{2i,2i−1} and γ_i = g_{2i} / g_{2i+2}, then the statistical hypothesis of the steganalytic method is that for a natural image the following equation should be satisfied:

α_i ≈ γ_i,   (6)

while for stego-images with the lsb plane fully embedded

α_i ≈ 1.   (7)

the physical quantity α_i can be viewed as a measure of the weak correlation between the lsb plane and its neighboring bit planes.
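as an illustration, the histograms h_i, f_i, g_i and the transition coefficients of equation (4) can be computed directly. the sketch below is our own illustration (not the authors' code), assuming an 8-bit grayscale image stored as a list of rows; the small histograms at the end are synthetic values hand-built so that relations (2)-(3) hold exactly:

```python
from collections import Counter

def diff_histogram(img):
    """Histogram of the difference image d(i, j) = img[i+1][j] - img[i][j]."""
    return Counter(img[i + 1][j] - img[i][j]
                   for i in range(len(img) - 1) for j in range(len(img[0])))

def flip_lsb(img):
    """Flip every bit in the LSB plane (0<->1, 2<->3, ...)."""
    return [[v ^ 1 for v in row] for row in img]

def zero_lsb(img):
    """Set every bit in the LSB plane to zero."""
    return [[v & ~1 for v in row] for row in img]

def transition_coeffs(h, g, i_max):
    """Transition coefficients a_{2i,2i+j} via the iterative formulas (4)."""
    a = {(0, -1): (g[0] - h[0]) / (2 * g[0])}
    for i in range(i_max + 1):
        t = 2 * i
        a[(t, t)] = h[t] / g[t]                                   # a_{2i,2i}
        if i > 0:
            a[(t, t - 1)] = (h[t - 1] - a[(t - 2, t - 2 + 1)] * g[t - 2]) / g[t]
        a[(t, t + 1)] = 1 - a[(t, t)] - a.get((t, t - 1), a[(0, -1)])
    return a

# tiny image: the three histograms of the text
img = [[10, 11, 12], [11, 13, 12], [14, 13, 15]]
h_img, f_img, g_img = (diff_histogram(im) for im in (img, flip_lsb(img), zero_lsb(img)))

# synthetic histograms (assumed values, chosen for internal consistency)
h = {0: 80, 1: 16, 2: 48}
g = {0: 100, 2: 60}
a = transition_coeffs(h, g, 1)
alpha_0 = a[(2, 1)] / a[(0, 1)]   # alpha_i for i = 0
gamma_0 = g[0] / g[2]             # gamma_i for i = 0
```

for the toy image, g_img contains only even differences (zeroing the lsbs makes every pixel value even); for the synthetic histograms, the coefficients come out as a_{0,±1} = 0.1 and a_{2,2} = 0.8, so alpha_0 = 1 while gamma_0 ≈ 1.67.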
from further experiments they found that for a given i the value of α_i decreases monotonically with increasing length of the embedded secret message (p), and when the embedding ratio p increases to 100%, α_i decreases approximately to 1. figure 2 shows the functional relation between α_i and the embedding ratio p when i = 0 for the "lena" image. the relationship between α_i and the embedding ratio p is modeled using a quadratic equation y = ax² + bx + c through the following four critical points: p1 = (0, γ_i), p2 = (p, α_i), p3 = (1, 1), and p4 = (2 − p, β_i).

figure 2: the functional relation between α_i and p (i = 0); the fitted curve for "lena" is α_i = 0.0624p² − 0.2885p + 1.2366

now the following equation set is obtained:

c = γ_i,
a p² + b p + c = α_i,   (8)
a + b + c = 1,
a (2 − p)² + b (2 − p) + c = β_i.

assume d_1 = 1 − γ_i, d_2 = α_i − γ_i, d_3 = β_i − γ_i; then the above equation set (8) can be simplified to

2 d_1 p² + (d_3 − 4 d_1 − d_2) p + 2 d_2 = 0.   (9)

the embedding ratio p is obtained from the root of the above equation whose absolute value is smaller. if the discriminant is smaller than zero, then p ≈ 1.

3 principles of improved difference image histogram steganalysis

the difference image histogram algorithm is primarily based on the statistical hypothesis that for natural images

α_i ≈ γ_i   (10)

and for stego-images with the lsb plane fully embedded

α_i ≈ 1.   (11)

obviously, the hypotheses given in equations (10) and (11) affect the precision of the difference image histogram method. if these hypotheses carry some initial bias, the estimate obtained via equation (9) will not be reliable. when the embedding ratio is low, the bias of these hypotheses leads to incorrect decisions, and if no message is embedded in the image, the false alarm rate is high. table 1 shows the mean and variance of the ratio of γ_i to α_i.
with the increase in i, the variance increases and the mean begins to deviate from 1.

table 1: statistical data on the ratio of γ_i to α_i for natural images

           i = 0     i = 1     i = 2     i = 3
mean       1.0013    1.0034    1.0079    1.0443
variance   7.6e-04   1.4e-03   2.8e-03   4.9e-03

in some cases the detection leads to the incorrect decision of estimating more than 1% embedding for normal images. figure 3 shows the initial value of the difference between α_i and γ_i for a lena image, and figure 4 shows a close-up of α_i and γ_i near p = 0. this initial deviation may lead to a serious estimation error. the initial bias in detecting a message in a normal image affects the detection of stego-images, since the error present in the normal image will affect the estimation of the hidden message length for the stego-image. if the stego-image created with embedding ratio p is denoted by s_p, and the image created by flipping all bits in the lsb plane of s_p by r_p, the value of α_i can be calculated for the images s_p and r_p (note that the value of α_i for the image r_p is equal to the value of β_i for the image s_p). moreover, we note that in s_p only p/2 of the pixels are flipped by message embedding, while in r_p about 1 − (p/2) of the pixels are flipped. therefore, r_p is equivalent to a "stego-image" with "embedding ratio" 2 − p. so, given a stego-image, we can calculate the values of α_i at p and 2 − p, as the value of β_i at p is equal to the value of α_i at 2 − p.

figure 3: α_i and γ_i versus the embedded message ratio p

let α_i(0) be the initial value of α_i and γ_i(0) be the initial value of γ_i (i.e. their values when the embedding ratio is zero), and let the error ε be the initial bias between γ_i and α_i. so we have

ε = γ_i(0) − α_i(0).   (12)

from the difference image histogram method, γ_i = g_{2i} / g_{2i+2}, where g is the difference image histogram after setting all bits in the lsb plane to zero.
the grayscale values of the pixels in the image will be even once the lsbs are set to zero. when the image is embedded with a hidden message using lsb insertion and the operation of setting all lsb-plane bits to zero is then performed on the stego-image, the values of g_{2i} and g_{2i+2} remain unmodified. hence

γ_i(0) = γ_i, ∀p,   (13)

so the value of the error ε becomes

ε = γ_i − α_i(0).   (14)

figure 4: close-up of α_i and γ_i near p = 0

the difference image histogram method models the relationship between α_i and the embedding ratio p using a quadratic equation y = ax² + bx + c. considering the statistical hypotheses given in equations (10) and (11) to be correct initially, we find that (p, α_i), (1, 1) and (2 − p, β_i) are three points on the curve y = ax² + bx + c. now we obtain the following equation set:

a p² + b p + c = α_i,
a + b + c = 1,   (15)
a (2 − p)² + b (2 − p) + c = β_i.

assume e_1 = 1 − p, e_2 = 1 − α_i, e_3 = 1 − β_i; then the constant c in equation set (15) simplifies to

c = (2 e_1² − 2 e_1 e_2 − (e_2 + e_3)(1 − e_1)) / (2 e_1²).   (16)

the value of c in equation (16) gives α_i(0) for the image. hence subtracting the error from the estimated ratio p removes the initial bias in the image, and the new estimated ratio p_modified is

p_modified = p − ε.   (17)

4 description of the idih algorithm

we now describe our detection algorithm.

input: a set of bmp images for detecting.
output: the embedded-ratio estimate p_modified for each image.

step 1. select one image in the image set;
step 2. obtain the difference image histogram of the image (h_i) and of the image after setting all bits in the lsb plane to zero (g_i);
step 3. do steps 4 to 8 for each value of i = 0, 1, 2;
step 4.
calculate the statistical values for the image, i.e.

α_i = a_{2i+2,2i+1} / a_{2i,2i+1}, β_i = a_{2i+2,2i+3} / a_{2i,2i−1} and γ_i = g_{2i} / g_{2i+2},

where the transition coefficients are estimated using the following equations:

a_{0,1} = a_{0,−1} = (g_0 − h_0) / (2 g_0),
a_{2i,2i} = h_{2i} / g_{2i},
a_{2i,2i−1} = (h_{2i−1} − a_{2i−2,2i−1} g_{2i−2}) / g_{2i},
a_{2i,2i+1} = 1 − a_{2i,2i} − a_{2i,2i−1};

step 5. obtain the value of p from the root of the equation below whose absolute value is smaller:

2 d_1 p² + (d_3 − 4 d_1 − d_2) p + 2 d_2 = 0,

where d_1 = 1 − γ_i, d_2 = α_i − γ_i, d_3 = β_i − γ_i;
step 6. calculate the value α_i(0), which represents the estimate of α_i for zero embedded message length, using the equation

α_i(0) = (2 e_1² − 2 e_1 e_2 − (e_2 + e_3)(1 − e_1)) / (2 e_1²),

where e_1 = 1 − p, e_2 = 1 − α_i and e_3 = 1 − β_i;
step 7. calculate the initial bias ε as ε = γ_i − α_i(0);
step 8. subtract the error ε from p to obtain the modified estimated ratio p_modified(i) = p − ε;
step 9. the average of p_modified(i) for i = 0, 1, 2 gives the final embedded ratio p_modified.

5 experimental results

we selected 150 standard 512 × 512 test images (such as lena, peppers and so on). applying random and sequential lsb replacement to embed messages with ratios p = 0, 10%, 20%, ..., 90%, 100%, we created two databases. we then used the rs method [7], the dih method [10] and the gefr method [11] to estimate the embedding ratio of the secret information. the mask used in the rs method is [1, 0; 0, 1]. the testing results obtained on the test images by the dih method and the proposed method (idih) are shown in table 2. the leftmost column in table 2 is the real embedding ratio, and the columns "idih" and "dih" give the embedding ratios estimated by the improved difference image histogram method (the proposed method) and the difference image histogram method, respectively. it can be seen in table 2 that the estimation precision of idih is clearly higher than that of dih.
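the computational core of steps 5-8 above reduces to solving the quadratic (9) and evaluating equation (16). a compact sketch (our own illustration, not the authors' code); the inputs at the end are synthetic values generated from the quadratic y = 0.05x² − 0.25x + 1.2 with true p = 0.4:

```python
import math

def estimate_p(alpha_i, beta_i, gamma_i):
    """Step 5: root of eq. (9) with the smaller absolute value; p ~ 1 if disc < 0."""
    d1, d2, d3 = 1 - gamma_i, alpha_i - gamma_i, beta_i - gamma_i
    a, b, c = 2 * d1, d3 - 4 * d1 - d2, 2 * d2
    if abs(a) < 1e-12:                       # degenerate case gamma_i ~ 1
        return -c / b if b else 1.0
    disc = b * b - 4 * a * c
    if disc < 0:
        return 1.0
    roots = ((-b + math.sqrt(disc)) / (2 * a),
             (-b - math.sqrt(disc)) / (2 * a))
    return min(roots, key=abs)

def bias_corrected_p(alpha_i, beta_i, gamma_i, p):
    """Steps 6-8: alpha_i(0) via eq. (16), epsilon = gamma_i - alpha_i(0), p - epsilon."""
    e1, e2, e3 = 1 - p, 1 - alpha_i, 1 - beta_i
    alpha_0 = (2 * e1**2 - 2 * e1 * e2 - (e2 + e3) * (1 - e1)) / (2 * e1**2)
    return p - (gamma_i - alpha_0)

# gamma_i = y(0) = 1.2, alpha_i = y(0.4) = 1.108, beta_i = y(1.6) = 0.928
p = estimate_p(1.108, 0.928, 1.2)            # recovers p = 0.4
p_mod = bias_corrected_p(1.108, 0.928, 1.2, p)
```

because these inputs lie exactly on one quadratic passing through (0, γ_i), equation (16) reproduces α_i(0) = γ_i, the bias ε is zero, and the corrected estimate equals the raw one.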
figures 5 and 6 show the mean absolute error of the estimated values versus the embedded message length for random embedding and sequential embedding, respectively. figure 5 indicates that the proposed (idih) algorithm outperforms the other three steganalysis techniques for embedding ratios greater than 40%. the improved difference image histogram algorithm has performance comparable to rs steganalysis for short messages (when p is smaller than 40%). however, because it is harder to detect small messages than large ones, the accuracy of the estimate is far more important for small message embeddings. the proposed algorithm proves to be effective and reliable when the complete range of embedding lengths is considered and compared to the existing algorithms.

table 2: comparison between idih and dih

embedding     random               sequential
ratio (%)     idih      dih        idih      dih
0%            0.3052    1.6855     0.3052    1.6855
10%           14.7804   15.3881    15.6703   16.0368
20%           20.38     20.80      27.98     28.11
30%           20.3764   20.8017    27.9818   28.1124
40%           40.1524   42.9062    44.3258   44.922
50%           48.6793   52.2864    49.7154   48.5228
60%           62.245    63.8       60.5394   56.5979
70%           72.7311   66.67118   69.7919   68.726
80%           84.6388   73.4632    80.8796   72.2582
90%           90.9915   85.8664    84.8516   81.955
100%          98.6088   92.5193    98.6088   92.5193

figure 5: comparison with other steganalytic techniques for random lsb embedding

in the case of sequential embedding (as shown in figure 6), the accuracy is much higher than in the case of random embedding for embedding ratios greater than 40%. the proposed method performs better than all the other steganalytic techniques over the entire range of possible embedding lengths.

6 summary and conclusions

this paper proposes a new detection algorithm, an improvement of the difference image histogram algorithm, and reports tests on a group of raw lossless images.
experimental results show that the improved difference image histogram steganalysis method is more accurate and reliable than the conventional difference image histogram method. the proposed algorithm reduces the mean error by 50% for embedding ratios greater than 40% when compared to the dih algorithm.

figure 6: comparison with other steganalytic techniques for sequential lsb embedding

references

[1] f.a.p. petitcolas, r.j. anderson and m.g. kuhn, "information hiding - a survey", proceedings of the ieee, vol. 87(7), special issue on protection of multimedia content, july 1999, pp. 1062-1078.
[2] n.f. johnson and s. jajodia, "steganalysis of images created using current steganography software", lecture notes in computer science, vol. 1525, springer, berlin, april 1998, pp. 273-289.
[3] n.f. johnson and s. jajodia, "exploring steganography: seeing the unseen", ieee computer, february 1998, pp. 26-34.
[4] r. chandramouli and k.p. subbalakshmi, "current trends in steganalysis: a critical survey", international control, automation, robotics and vision conference 2004, volume 2, december 2004, pp. 964-967.
[5] n.f. johnson and s. jajodia, "steganalysis: the investigation of hidden information", ieee information technology conference, september 1998, pp. 113-116.
[6] j. fridrich, m. goljan and r. du, "detecting lsb steganography in color and grayscale images", ieee multimedia, vol. 8(4), october-december 2001, pp. 22-28.
[7] j. fridrich, m. goljan and r. du, "reliable detection of lsb steganography in grayscale and color images", proceedings of the acm special session on multimedia security and watermarking, ottawa, canada, 2001, pp. 27-30.
[8] a. westfeld and a. pfitzmann, "attacks on steganographic systems", information hiding, third international workshop, ih'99, dresden, germany, 29 september - 1 october 1999.
[9] j. fridrich, r. du and m. long, "steganalysis of lsb encoding in color images", in proceedings of icme 2000, july-august 2000.
[10] t. zhang and x. ping, "reliable detection of lsb steganography based on the difference image histogram", ieee international conference on acoustics, speech, and signal processing, volume 3, april 2003, pp. 545-548.
[11] li zhi and sui ai fen, "detection of random lsb image steganography", ieee 60th vehicular technology conference, vol. 3, september 2004, pp. 2113-2117.
[12] a. pinet, securengine professional v1.0, 2004, http://securengine.isecuqrelabs.com.
[13] cbir image database, university of washington, available at http://www.cs.washington.edu/research/imagedatabse/groundtruth/.
[14] the usc-sipi image database, available at http://sipi.usc.edu/services/databse/database.html.

sanjay kumar jena
national institute of technology, department of computer science and engineering, rourkela, orissa, india, 769008
e-mail: skjena@nitrkl.ac.in

g.v.v. krishna
national institute of technology, department of computer science and engineering, rourkela, orissa, india, 769008
e-mail: gvvkrishna@yahoo.co.in

received: august 18, 2006

dr. s.k. jena was born on 28 april 1954. he received his ph.d. from the indian institute of technology, bombay, and his m.tech. from the indian institute of technology, kharagpur. he joined the national institute of technology as a professor in the department of computer science and engineering in 2002. currently he is working as professor and head of the computer science and engineering department. he has more than 35 publications in international journals and conferences. his research interests are database engineering, distributed computing, parallel algorithms, information security and data compression. mr. g.v. vamsi krishna was born on 17 june 1985.
he received his m.tech. in computer science & engineering from the national institute of technology, rourkela, in 2006 and his b.tech. in computer science & engineering from the sarada institute of science, technology and management, hyderabad, in 2004. currently he is working as a software engineer at ibm india pvt. ltd., bangalore.

international journal of computers communications & control issn 1841-9836, 11(2):259-272, april 2016.

a multi-objective approach for a multihoming load balancing scheme in whn

c. lozano-garzon, m. molina, y. donoso

carlos lozano-garzon*
universidad de los andes, bogotá, colombia, south america & universitat de girona, girona, spain.
*corresponding author: calozanog@ieee.org

miguel molina, yezid donoso
universidad de los andes, bogotá, colombia, south america
mf.molina35@uniandes.edu.co, ydonoso@uniandes.edu.co

abstract: telco operators face challenges related to the need to ensure quality of service to the user in the planning, maintenance and resource allocation of their complex networks. these challenges are directly related to the need to ensure the user's service with a good level of quality in a highly dynamic environment, in terms of changes in the radio access technologies, growth in the number of mobile users, technical requirements of new services and applications, and the possibility of connecting to different networks at the same time, among others. in this paper, we address the problem of allocating users' services to the different feasible networks in order to reduce network overloading. we present a multihoming load balancing scheme that allows the re-allocation of services according to their qos requirements and the availability of network resources. we propose a multi-objective optimization model of this problem, together with an evolutionary algorithm to solve it. through simulation in different scenarios, we show that our algorithm is efficient, sensitive, scalable and provides optimal solutions.
keywords: heterogeneous networks, load balancing, multihoming, multi-objective optimization, multi-objective evolutionary algorithms, vertical handover.

1 introduction

given the continuous advances in network technologies, the growth in the number of mobile users and the increasing demand for new services and applications, mobile network operators are confronted with multiple challenges in the planning, maintenance and operation of their complex infrastructure. in many cases, these networks are composed of multiple radio access technologies, which allows the user to access different services by simultaneously using one or more of the device's network interfaces. in some cases during network operation, some radio access channels may become overloaded because of traffic growth caused by an increase in the number of user services connected to these channels. therefore, mobile operators need mechanisms that allow a balanced distribution of the traffic load over the available networks; note that this process could involve the reallocation of some users. the main goal of such a mechanism is the optimal use of the available network resources whilst the technical requirements of each service are guaranteed. from the viewpoint of the mobile network operators, one of the most appropriate ways to achieve this goal is for the deployed infrastructure to be capable of reallocating a user from one network to another; this process is called vertical handover (vho). because standards such as ieee 802.21 [1] only provide the framework for vho, the decision-making algorithm that establishes the best connection for each user is an open challenge [2]. in these algorithms, the re-allocation must be based on different metrics obtained from the mobile devices and/or from the performance parameters of the available networks [3].

copyright © 2006-2016 by ccc publications
also, it is important that this network-change process be transparent to the user. thanks to recent advances in mobile phones, nowadays these devices can establish connections to multiple networks simultaneously, which is referred to as multihoming [4]. this characteristic facilitates a vho process that is seamless to the user [5] and allows the simultaneous transmission of multiple services across multiple network interfaces [6]. several research projects use the multihoming strategy over a heterogeneous environment in order to achieve load balancing [4], [7–9] or to make a better distribution of the bandwidth load [10,11]. in other studies this strategy was used as a decision tool for the vho process [12–15]. considering the aforementioned problems, we addressed the always best connected (abc) problem in heterogeneous wireless networks (hwn) in [16]. in that work we designed a proactive vertical handover decision algorithm (vho-da) based on user preferences, qos requirements, and network conditions. later, in [17], we presented a load balancing optimization scheme composed of a mathematical model and a two-step algorithm based on the anchor-adjustment heuristic. in this paper, we again address the problem of load balancing across heterogeneous networks from the viewpoint of the operator. we present a multi-objective optimization model to solve the traffic load balancing problem in hwn using a multihoming strategy, and an evolutionary algorithm to solve it. the remainder of this paper is structured as follows. in section 2 we introduce the mathematical model that encodes the multi-objective function in order to obtain a global load balance among hwn. in section 3 the load balancing algorithm, based on the strength pareto evolutionary algorithm, is presented. the experimental results on the performance of our proposal are shown in section 4.
finally, concluding remarks and directions for further research are given in section 5.

2 load balancing mathematical model

as mentioned in [17], load balancing is an important strategy used by mobile operators in order to allocate the available network resources in a fair way. however, this strategy implies, in many cases, the reallocation of mobile devices; therefore, it is necessary to consider the cost of connecting services to the new networks and the energy consumption of the mobile device. considering the above statements and the possibility of the simultaneous use of multiple network interfaces by each mobile device [6], in this section we propose a multi-objective multihoming mathematical model.

2.1 decision variable

by assuming that mobile devices are able to perform multihoming in the network, we define the decision variable x as a binary variable that specifies whether the service s of the mobile k is connected to the network j or not (see figure 1). the variable is represented as follows:

x_j^{k,s} = 1 if the service is connected to the network, 0 otherwise.

figure 1: multihoming cellular system

2.2 objective functions

in order to design an efficient load balancing algorithm, the mathematical model is formulated under the premise of achieving an overall load balance in the wireless heterogeneous networks, whilst the connection cost and the energy consumption are also minimized. this is expressed as min(α, β, γ), where α represents the load balancing function, β the connection cost function, and γ the energy consumption function.

load balancing function

the load balancing function (α) is the main function of this model. it determines the network traffic load by considering the demand of the services of the mobile devices in relation to the theoretical available bandwidth of the network.
for this model, the load function is defined as α = max(α_j), ∀j ∈ n, where j represents the destination network of the mobile device, n the set of access networks and α_j the load of network j. we calculate α_j as the sum of the demanded bandwidth (d_s) of each connected service s of each mobile k, over the theoretical available bandwidth of the network (bw_j):

α_j = Σ_k Σ_s (d_s · x_j^{k,s}) / bw_j, ∀j.

connection cost function

this function determines the maximum monetary cost of the mobile devices that are connected to the network. if a mobile device has at least one service connected to the network, its cost to access the network is taken into account. the connection cost function is expressed as β = max(β_j), ∀j ∈ n, where j represents the network and β_j the maximum cost of the mobile devices connected to this network, with per-network access cost cost_j. β_j is defined as:

β_j = Σ_k max_s(cost_j · x_j^{k,s}), ∀j.

energy consumption function

this function determines the energy consumption of the mobile devices that are connected to the network. if a mobile device has at least one service connected to the network, the consumption it generates for being connected to the network is taken into account in our model. we define the energy function as γ = max(γ_j), ∀j ∈ n, whilst γ_j is determined as the sum over the mobile devices k of the maximum consumption generated by being connected to the j-th network:

γ_j = Σ_k max_s{cons(rss_{k,j}) · x_j^{k,s}}, ∀j.

finally, in order to guarantee consistency in our model, we consider that when a service s is active in the device, it generates a traffic demand d_s. following the work presented in [18], the values of received signal strength are discretized into three levels: low, medium and high. note that the power consumption is inversely proportional to the received signal strength (rss); therefore, a high signal level results in a low power consumption by the mobile radio interface through which communication is established.
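to make the three objectives concrete, the sketch below (our own illustration with made-up parameters, not the authors' code) evaluates α, β and γ for one candidate assignment, where x[k][s] holds the network chosen for service s of mobile k and 0 means the service is inactive:

```python
def objectives(x, d, bw, cost, cons):
    """Evaluate (alpha, beta, gamma) for an assignment x[k][s] = j (0 = inactive)."""
    alpha = {j: 0.0 for j in bw}
    beta = {j: 0.0 for j in bw}
    gamma = {j: 0.0 for j in bw}
    for k, services in x.items():
        used = {j for j in services.values() if j != 0}
        for s, j in services.items():
            if j != 0:
                alpha[j] += d[s] / bw[j]   # load contribution of service s on network j
        for j in used:
            beta[j] += cost[j]             # max_s(cost_j * x) is cost_j once any service uses j
            gamma[j] += cons[k][j]         # same reasoning for the consumption level
    return max(alpha.values()), max(beta.values()), max(gamma.values())

bw = {1: 10.0, 2: 20.0}                    # theoretical bandwidth per network (assumed)
cost = {1: 5, 2: 3}                        # access cost per network (assumed)
d = {1: 2.0, 2: 4.0}                       # demand per service (assumed)
cons = {1: {1: 1, 2: 2}, 2: {1: 3, 2: 1}}  # discretized consumption levels (assumed)
x = {1: {1: 1, 2: 2}, 2: {1: 2, 2: 0}}     # mobile 2's service 2 is inactive
a, b, g = objectives(x, d, bw, cost, cons)
```

with these numbers, network 2 carries two services (loads 0.2 and 0.1) and two paying mobiles, so the model returns α = 0.3, β = 6 and γ = 3, the worst-case values over the networks.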
because multiple network interfaces can be used in a multihoming scenario, it is preferable to be connected to those networks from which better signals are received, i.e. where less energy is consumed for being connected. the consumption levels derived from the rss are modeled in the cons(rss_{k,j}) function:

cons(rss_{k,j}) = 1 if rss_{k,j} > rss_th2; 2 if rss_th1 ≤ rss_{k,j} < rss_th2; 3 if 0 < rss_{k,j} < rss_th1; 0 if rss_{k,j} = 0.

2.3 model constraints

through the model constraints, we intend to guarantee that the model is adjusted to real-life telco networks. in this case, the model only allows a service to connect to those networks that are in the coverage area, comply with the cost the user can assume to connect to the network, offer enough bandwidth to meet the demand of the service, and offer an adequate power consumption according to the battery charge level of the mobile device.

demand constraint

the demand constraint states that a service can only be connected to a network that has enough bandwidth to meet its demand:

d_s · x_j^{k,s} ≤ bw_j, ∀j.

cost constraint

the cost constraint states that the overall cost to access network j (cost_j), i.e. the cost to connect any service to network j, must be less than or equal to the cost assumed by the user in the contract of the mobile device k (cost_k):

cost_j · x_j^{k,s} ≤ cost_k, ∀k.

access constraint

through the access constraint it is ensured that each service s that is active on the mobile device k is connected to a network and can only be connected to one network:

Σ_j x_j^{k,s} = y_{k,s}, ∀k, ∀s.

reach constraint

this constraint ensures that only networks that exceed the defined signal strength threshold (rss_th) are considered in the model assessment:
rssk,j ≥ rssth,∀j,∀k power consumption constraint the power consumption constraint ensures that services only can be connected to those networks that offer lower power consumption, according to the current battery level of the mobile device. cons(rssk,j) ·x k,s j ≤ batk,∀j,∀k where batk is defined as: batk =   1 chargek < batth1 2 batth2 ≤ chargek ≤ batth1 3 chargek > batth2 a service s of a mobile device k is considered active when the constraints are satisfied at least for one of the networks; i.e. if the service can be connected to at least one of the networks available for the device. when the service s is active, it consumes the traffic demand in the network that it is connected. 3 multi-objective evolutionary algorithm in order to solve the multi-objective model, several strategies can be used. one of them is to evaluate each objective function in the model separately (mono-objective approach). the weight sum method is one of these strategies. however, it has several disadvantages, including not finding all optimal solutions if the solution set is not convex, and the need to normalize the functions so that no one predominates over the others [7]. for this reason we propose the use of an multi-objective evolutionary algorithm (moea) to find the best set of solutions for all objective functions at the same time. the chosen algorithm is the elitist type evolutionary algorithm spea (strength pareto evolutionary algorithm) proposed by zitzler and thiele [7,19]. the time complexity of this algorithm is upper bounded by o(nm2) in each generation, where m is the population size and n is the number of objectives. 264 c. lozano-garzon, m. molina, y. 
Algorithm 1: SPEA algorithm pseudo-code
1: generate a random population M
2: while not max number of generations do
3:   evaluate the population according to the objective functions
4:   calculate the fitness of each individual
5:   classify the population based on fitness (M, M')
6:   generate the new population M_{t+1}
7:   apply binary tournament selection
8:   apply the crossover operator
9:   apply the mutation operator
10: end while
11: find the Pareto optimal set.

3.1 Chromosome representation

As a starting point for implementing the evolutionary algorithm it is necessary to define the chromosome, i.e., the data representation of the solutions of the model. In the proposed mathematical model, the solutions can be expressed in a matrix representation (see Figure 3). The rows represent mobile devices and the columns represent services, whilst the cell values represent the network to which the service will be connected; the value zero is given when the service is not active and is not connected to any network.

[Figure 3: chromosome representation of the solution]

The decision variables of the mathematical model can be obtained directly from the chromosome; for example, if service 2 of mobile device 3 is connected to network 4, then x_4^{3,2} = 1 and the variable is 0 for the remaining j networks.

3.2 Genetic operators

The crossover function takes two initial solutions (i.e., chromosomes), called parent solutions, and creates a new one from them. We define a crossover function based on the well-known single-point strategy proposed in [19].

[Figure 4: genetic operators; (a) crossover function, (b) mutation function]
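The chromosome described in Section 3.1, and the recovery of the binary decision variables from it, can be sketched as follows; the matrix contents and the helper name `x` are hypothetical illustrations, not data from the paper.

```python
# Chromosome: devices x services matrix; each cell holds the network id
# the service is connected to, with 0 meaning the service is inactive.
chromosome = [
    [1, 0, 2],   # device 1: service 2 inactive
    [3, 3, 0],   # device 2
    [0, 4, 1],   # device 3: service 2 connected to network 4
]

def x(chrom, k, s, j):
    """Decision variable x_j^{k,s}: 1 iff service s of device k is
    connected to network j (all indices 1-based, as in the model)."""
    return 1 if chrom[k - 1][s - 1] == j else 0
```

With this encoding, `x(chromosome, 3, 2, 4)` is 1 and the variable is 0 for every other network, matching the example in the text.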
The new solutions are generated by combining the first p rows of the first parent solution with the last k − p rows of the second parent solution, and vice versa. Figure 4(a) shows an example of this operator. Since the solution of each mobile device in the parent chromosomes meets the constraints of the model, the mobile-device solutions that are part of the child solutions also meet the constraints, because each mobile device, represented as a row, conserves its signal strength and battery level conditions as it moves from one chromosome to another. Note also that the inactive services are the same in the different solutions, so they are conserved. The mutation strategy is based on a random function that moves a mobile device service from its current network to a new one as long as the constraints are met, i.e., the service will be connected to a feasible network. The function takes into account only the services that are active and have more than one feasible network. Figure 4(b) shows an example of the mutation strategy: service 3 of mobile device 2 is moved from network 3 to network 4.

3.3 Generation of scenario parameters

Once the genetic operators are defined, the algorithm starts by generating pseudo-random parameters for each mobile device. These parameters are the maximum cost the user can assume to connect to a network (Cost_k), the battery charge percentage and level (charge_k and Bat_k), and the signal strength that each mobile device perceives from each network j (RSS_{k,j}), with its corresponding power consumption level parameter (Cons_{k,j}). Once the scenario is created, we proceed to validate all the model constraints for each available network: demand, cost, reach and power consumption. After this validation, we store in a vector the networks that can be used to connect each service of each mobile device, or the zero value ([0])
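The two genetic operators just described can be sketched as below. This is an illustrative implementation under the stated assumptions: solutions are row-per-device matrices, and `feasible[k][s]` lists the feasible networks of each (device, service) pair, with inactive services mapped to `[0]`; all names are hypothetical.

```python
import random

def crossover(parent_a, parent_b, p):
    """Single-point, row-wise crossover: swap the first p device rows.
    Each row carries its own device's signal/battery conditions, so the
    children remain feasible, as argued in the text."""
    child1 = parent_a[:p] + parent_b[p:]
    child2 = parent_b[:p] + parent_a[p:]
    return child1, child2

def mutate(solution, feasible, rng=random):
    """Move one active service that has more than one feasible network
    to a different feasible network."""
    candidates = [(k, s) for k, row in enumerate(solution)
                  for s, net in enumerate(row)
                  if net != 0 and len(feasible[k][s]) > 1]
    if not candidates:
        return solution
    k, s = rng.choice(candidates)
    others = [j for j in feasible[k][s] if j != solution[k][s]]
    mutated = [row[:] for row in solution]
    mutated[k][s] = rng.choice(others)
    return mutated
```

Because both operators only copy whole device rows or move a service inside its precomputed feasible set, no constraint re-validation is needed after applying them.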
The result is a feasibility matrix (Fact_{k,s}). This matrix is used to compute a matrix of active services y_{k,s}, which defines, for each service that can be connected to at least one network, the probability that the mobile service is used. Once the active services are defined, a population of M initial solutions and an M' elite (or external) population are randomly generated from the feasibility and active-services matrices. In our implementation, the values of M and M' are set to 20 and 4, respectively.

4 Experimental results

In order to verify the correct operation of the proposed algorithm, we define four different experimental scenarios. For each of them, we execute 500 iterations of the algorithm with different numbers of networks, mobile devices and services.

4.1 First experimental scenario

The aim of this scenario is to compare the quality of the solutions obtained by our algorithm with the optimal solutions obtained by solving each function separately. The optimal solutions were computed using the GAMS system [20]. For this purpose, the scenario is composed of five mobile devices that will connect three services through three different available access networks. Tables 1 and 2 describe the parameters of bandwidth, cost, distribution of active services and network coverage for each mobile device.

Table 1: Network parameters
Network technology   Theoretical bandwidth (Mbps)   Cost_j (monetary units)
LTE                  70                             80
WiFi g               54                             0
HSPA+                15                             40

Table 2: Bandwidth demand parameters
Service         Voice   Video   Web
Demand (Mbps)   0.1     3       0.5

To solve a multi-objective mathematical model with general optimisation software such as GAMS [20], it is necessary to convert the optimisation problem into a single-objective one. In this case, each objective function was optimised separately, obtaining the solutions shown in Table 3. On the other hand, the proposed algorithm converges rapidly to a set of 4 optimal solutions, three of them unique. The results are presented in Table 4.
Based on the feasible solutions obtained, we plot three different graphics in order to identify the Pareto-optimal front: load (α) vs. cost (β), cost (β) vs. consumption (γ), and load (α) vs. consumption (γ), as shown in Figure 5(a). In the load (α) vs. cost (β) graph the Pareto-optimal front can be seen, because the cost grows as the load decreases. The cost (β) vs. consumption (γ) graph shows that the solutions move among cost values of 80, 120 and 160, and that the zero-cost solution is quite atypical. Finally, the load (α) vs. consumption (γ) graph shows that some optimal solutions lie at power consumption value 4 and others at value 7. Comparing the two sets of solutions obtained (Tables 3 and 4), the proposed algorithm found several intermediate solutions belonging to the Pareto optimal front. These solutions cannot be found with mono-objective approaches because they are clearly not a linear combination of the objective functions.

Table 3: Mathematical model solutions obtained using GAMS
             LTE (load)   WiFi g (load)   HSPA+ (load)   α       β     γ
Solution 1   0.086        0.080           0.080          0.086   160   4
Solution 2   0            0.213           0              0.213   0     7
Solution 3   0.103        0.080           0              0.103   160   4

Table 4: Solution results for the first study case
             LTE (load)   WiFi g (load)   HSPA+ (load)   α       β     γ
Solution 1   0.051        0.115           0.113          0.115   120   7
Solution 2   0.051        0.124           0.080          0.124   80    4
Solution 3   0.086        0.080           0.080          0.086   160   4

Looking at the solutions obtained by GAMS, it can be seen that solution 3 is dominated by solution 1, so solution 3 is not part of the Pareto optimal front. Moreover, it is highly improbable that the zero-cost solution (found with GAMS) can be found by our algorithm, because that solution connects all the services to the same network, which goes against the main objective of our proposal.
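The dominance relation used implicitly above (GAMS solution 3 being dominated by solution 1) can be stated in a few lines. This is a generic sketch for minimised objective vectors (α, β, γ), not code from the paper:

```python
def dominates(a, b):
    """True iff vector a Pareto-dominates vector b for minimisation:
    a is no worse in every objective and strictly better in at least one."""
    return (all(ai <= bi for ai, bi in zip(a, b))
            and any(ai < bi for ai, bi in zip(a, b)))
```

For instance, with (α, β, γ) vectors from Table 3, `dominates((0.086, 160, 4), (0.103, 160, 4))` holds, which is why solution 3 is excluded from the Pareto optimal front.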
Finally, in order to evaluate the quality of the solutions found by the algorithm, a performance metric called spacing was calculated. This metric, as its name implies, analyses the distribution of the solutions along a Pareto front:

S = sqrt( (1/|Q|) · Σ_{i=1}^{|Q|} (d_i − d̄)^2 )

where:

d_i = min_{k ∈ Q, k ≠ i} Σ_{m=1}^{M} |f_m^i − f_m^k|,   d̄ = ( Σ_{i=1}^{|Q|} d_i ) / |Q|

Here d_i is the distance measure, d̄ is the mean value of that distance measure, and f_m^k is the m-th objective function value of the k-th member of the population [7]. For our implementation the value of S obtained was 21.505. This value was most strongly influenced by the differences in the cost values.

4.2 Second experimental scenario

We proceed to run the algorithm with the same population and elite population sizes, M = 20 and M' = 4, but with 20 mobile devices, 5 services per device and 7 access networks. This scenario was randomly generated according to the characteristics presented in Tables 5 and 6. The feasible solutions obtained are presented in Table 7. The solutions obtained show that the services have a tendency to be allocated to the networks with the highest capacity, restricted only by the cost that each mobile device can pay.
Table 5: Network parameters
Network technology             LTE   WiFi n   WiFi g   WiMAX   HSPA+   HSDPA   UMTS
Theoretical bandwidth (Mbps)   70    300      54       15      15      2       0.3
Cost_j (monetary units)        80    0        0        60      40      20      10

Table 6: Bandwidth demand of services for the second study case
Service         Voice   Video   Web   Game   Chat
Demand (Mbps)   0.1     3       0.5   2      0.2

Table 7: Solution results for the second study case
             LTE     WiFi n   WiFi g   WiMAX   HSPA+   HSDPA   UMTS    α       β     γ
Solution 1   0.057   0.205    0.652    0.527   0.447   0.4     0.667   0.667   160   29
Solution 2   0.086   0.205    0.585    0.587   0.493   0.467   0.587   0.587   240   27

The solutions obtained by our algorithm show that, if the model constraints are met, the algorithm tends to allocate the mobile services to the networks with the highest capacity and lowest cost. In addition, we plot the same three graphics as in the previous scenario (see Figure 5(b)). In the first two graphs the Pareto-optimal fronts can be observed. These first two graphs show that, in order to achieve a better load distribution across the networks and a lower power consumption of the mobile devices, the services should be grouped into the higher-cost networks. However, the load variations are small in comparison with the cost difference that can be obtained, so the operator may prefer the solution with the lower cost. The value of the spacing metric S was 35.542, despite three repeated solutions; the difference in the cost and load values shows that the solutions are not equally distant.

4.3 Third experimental scenario

For this scenario, we only change the number of mobile devices to 500 with respect to the previous scenario. The solutions obtained are shown in Table 8. Based on the solutions achieved for this scenario we can assert that our algorithm presents a good degree of sensitivity: small changes in the main parameters are reflected in the service allocation, which means changes in the set of optimal solutions.
In Figure 6(a) the Pareto-optimal fronts are well defined in the first two graphs. This scenario also shows how small changes in the load balancing produce appreciable changes in the cost function. The value of S for this scenario was 30.168, again due to the wide differences in the cost values.

4.4 Fourth experimental scenario

In this last scenario we increased the number of mobile devices to 10000 in order to validate the scalability of the algorithm; the other scenario parameters are kept as in scenario 2. The feasible solutions are shown in Table 9. Figure 6(b) shows the optimal solutions found by our algorithm. The solutions tend to maintain the same power consumption in both graphs, cost (β) vs. consumption (γ) and load (α) vs. consumption (γ). The S metric value was 1.0, so the solutions are fairly well spaced, being unique solutions. The algorithm is fully adaptable to any situation within the proposed mathematical model.

[Figure 5: 2D perspectives of solution distribution; (a) first study case, (b) second study case]

Table 8: Solution results for the third study case
         Solution 1   Solution 2   Solution 3   Solution 4
LTE      1.531        1.517        1.544        1.544
WiFi n   3.161        3.163        3.153        3.165
WiFi g   18.113       18.219       18.302       18.237
WiMAX    16.62        16.58        16.533       16.547
HSPA+    27.247       26.947       26.747       26.747
HSDPA    48.6         48.467       48.6         48.533
UMTS     64.667       64.667       65.333       65
α        64.667       64.667       65.333       65
β        7560         7500         7440         7440
γ        732          734          732          733

After running the algorithm on this scenario, we verified that it maintains its high sensitivity in the search for optimal solutions regardless of the number of mobile devices in the problem; that is, our algorithm is scalable.
Table 9: Solution results for the fourth study case
         Solution 1   Solution 2   Solution 3   Solution 4
LTE      30.46        30.26        30.263       30.463
WiFi n   65.56        65.63        65.62        65.57
WiFi g   358.61       358.64       358.61       358.55
WiMAX    329.43       328.86       328.69       329.6
HSPA+    537.59       537.41       537.79       537.34
HSDPA    934.93       936.67       937.13       934.53
UMTS     1730         1730.33      1729         1731.33
α        1730         1730.33      1729         1731.33
β        152760       152520       152600       152680
γ        15024        15027        15027        15022

[Figure 6: 2D perspectives of solution distribution; (a) third study case, (b) fourth study case]

5 Conclusions and future work

We have presented a multihoming load balancing model for heterogeneous wireless networks based on a multi-objective approach. In this model we take as objective functions the network load, the connection cost and the energy consumption, with the aim of making efficient use of the capacity resources of the available networks. Based on this model, a vertical handover (VHO) algorithm was designed using evolutionary algorithms, specifically the Strength Pareto Evolutionary Algorithm (SPEA). Through the proposed scenarios we validated the correct operation of our algorithm: in the first scenario we validated the exactness of the feasible solutions obtained by our algorithm against the solutions of the mathematical model computed with GAMS; in the other scenarios we validated the sensitivity and the scalability of our evolutionary algorithm. The results obtained by our proposal were satisfactory and provide a starting point for a mobile network operator to run VHO processes in its networks. With this process the operator could make efficient use of its network resources, reduce connection costs, and extend the battery life of mobile devices.
Since we propose a multi-objective optimisation algorithm, the model is open to incorporating additional parameters as objective functions; these parameters could be obtained both from the available access networks and from the mobile devices. As future work we propose to continue this research by introducing the concept of fairness into the load balancing optimisation and by including the concept of quality of experience (QoE) in the objective functions.

6 Acknowledgment

The authors would like to thank the Administrative Department of Science, Technology and Innovation (COLCIENCIAS) for the financial support to Carlos Lozano-Garzon through the 528-2011 National Call for Doctoral Studies in Colombia.

Bibliography

[1] IEEE Computer Society (2008); IEEE 802.21: Media Independent Handover Services, IEEE-SA Standards Board. Available at https://standards.ieee.org/getieee802/download/802.21-2008.pdf.
[2] Yan, X.; Şekercioğlu, Y.A.; Narayanan, S. (2010); An overview of vertical handover techniques: algorithms, protocols and tools, Comput. Netw., ISSN 1389-1286, 54(11):1848-1863.
[3] Marquez-Barja, J.; Calafate, C.T.; Cano, J.C.; Manzoni, P. (2011); A survey of vertical handover decision algorithms in fourth generation heterogeneous wireless networks, Computer Communications, ISSN 0140-3664, 34(8):985-997.
[4] Sousa, B.M.; Pentikousis, K.; Curado, M. (2011); Multihoming management for future networks, Mob. Netw. Appl., ISSN 1383-469X, 16(4):505-517.
[5] Paasch, C.; Detal, G.; Duchene, F.; Raiciu, C.; Bonaventure, O. (2012); Exploring mobile/WiFi handover with multipath TCP, Proceedings of the 2012 ACM SIGCOMM Workshop on Cellular Networks: Operations, Challenges, and Future Design, New York, NY, USA: ACM, 31-36. Available at http://doi.acm.org/10.1145/2342468.2342476.
[6] Hyun-Dong, C.; Hyunjung, K.; Seung-Joon, S. (2013); Flow based 3G/WLAN vertical handover scheme using MIH model, 2013 International Conference on Information Networking (ICOIN), 658-663.
[7] Donoso, Y.; Fabregat, R. (2007); Multi-Objective Optimization in Computer Networks Using Metaheuristics, Auerbach Publications, Boston, MA, USA.
[8] Capela, N.; Sargento, S. (2012); Optimizing network performance with multihoming and network coding, 2012 IEEE Globecom Workshops (GC Wkshps), 210-215.
[9] Yang, R.; Chang, Y.; Sun, J.; Yang, D. (2012); Traffic split scheme based on common radio resource management in an integrated LTE and HSDPA networks, 2012 IEEE Vehicular Technology Conference (VTC Fall), 1-5.
[10] Sungwook, K.; Varshney, P.K. (2002); An adaptive bandwidth reservation algorithm for QoS sensitive multimedia cellular networks, 2002 IEEE Vehicular Technology Conference (VTC Fall), 1475-1479.
[11] Sungwook, K.; Varshney, P.K. (2003); Adaptive load balancing with preemption for multimedia cellular networks, 2003 IEEE Wireless Communications and Networking (WCNC), 1680-1684.
[12] Li, M.; Fei, Y.; Leung, V.; Randhawa, T. (2003); A new method to support UMTS/WLAN vertical handover using SCTP, 2003 IEEE Vehicular Technology Conference (VTC Fall), 1788-1792.
[13] Bin, L.; Boukhatem, N.; Martins, P.; Bertin, P. (2010); Multihoming at layer-2 for inter-RAT handover, 2010 IEEE International Symposium on Personal Indoor and Mobile Radio Communications (PIMRC), 1173-1178.
[14] Eun, K.P.; Si, Y.H.; Hanlim, K.; Jong-Sam, J.; Seong-Choon, L.; Sang-Hong, L. (2008); Seamless vertical handover using multihomed mobile access point, IEEE Global Telecommunications Conference, 2008 (IEEE GLOBECOM 2008), 1-4.
[15] Folstad, E.L.; Helvik, B.E. (2009); Managing availability in wireless inter domain access, International Conference on Ultra Modern Telecommunications Workshops, 2009 (ICUMT '09), 1-6.
[16] Lozano-Garzon, C.; Ortiz-Gonzalez, N.; Donoso, Y.
(2013); A proactive VHD algorithm in heterogeneous wireless networks for critical services, International Journal of Computers, Communications & Control, ISSN 1841-9836, 8(3):425-431.
[17] Donoso, Y.; Lozano-Garzon, C.; Camelo, M.; Vila, P. (2014); A fairness load balancing algorithm in heterogeneous wireless networks using a multihoming strategy, International Journal of Computers, Communications & Control, ISSN 1841-9836, 9(5):555-569.
[18] Mittal, R.; Kansal, A.; Chandra, R. (2012); Empowering developers to estimate app energy consumption, Proceedings of the 18th Annual International Conference on Mobile Computing and Networking, 317-328.
[19] Deb, K. (2001); Multi-Objective Optimization Using Evolutionary Algorithms, John Wiley & Sons, Inc., New York, NY, USA.
[20] Rosenthal, R.E. (2015); GAMS: A User's Guide, GAMS Development Corporation. Available at http://www.gams.com/dd/docs/bigdocs/gamsusersguide.pdf.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 3, pp. 253-262

Mutation Based Testing of P Systems

Florentin Ipate, Marian Gheorghe

Florentin Ipate
The University of Pitesti, Department of Computer Science, Faculty of Mathematics and Computer Science, Str. Targu din Vale 1, 110040 Pitesti
E-mail: florentin.ipate@ifsoft.ro

Marian Gheorghe
The University of Sheffield, Department of Computer Science, Regent Court, Portobello Street, Sheffield S1 4DP, UK
E-mail: m.gheorghe@dcs.shef.ac.uk

Received: April 5, 2009    Accepted: May 30, 2009

Abstract: Although testing is an essential part of software development, until recently P system testing has been completely neglected. Mutation testing (mutation analysis) is a structural software testing method which involves modifying the program in small ways. In this paper, we provide a formal way of generating mutants for systems specified by context-free grammars.
Furthermore, the paper shows how the proposed method can be used to construct mutants for a P system specification.

Keywords: mutation testing, P systems, Kripke structures, context-free grammars

1 Introduction

Membrane computing, the research field initiated by Gheorghe Păun in 1998 [12], aims to define computational models, called P systems, which are inspired by the behaviour and structure of the living cell. Since its introduction in 1998, the P system model has been intensively studied and developed: many variants of membrane systems have been proposed, a research monograph [13] has been published and regular collective volumes are edited annually. Furthermore, a comprehensive bibliography of P systems can be found at [16]. Of the many variants of P systems that have been defined, in this paper we consider cell-like P systems without priority and without membrane dissolving rules [13]. Testing is an essential part of software development, and all software applications, irrespective of their use and purpose, are tested before being released. Testing is not a replacement for a formal verification procedure, when the latter is also present, but rather a complementary mechanism to increase confidence in software correctness [5]. Although formal verification has been applied to different models based on P systems [1], until recently testing has been completely neglected in this context. The main testing strategies involve either (1) knowing the specific function or behaviour a product is meant to deliver (functional or black-box testing) or (2) knowing the internal structure of the product (structural or white-box testing). In black-box testing, test generation is based on a formal specification or model, in which case the process can be automated. A number of recent papers devise black-box testing strategies for P systems based on rule coverage [4], finite state machine [8] and stream X-machine [7] conformance techniques.
In this paper, we propose an approach to P system testing based on mutation analysis. Mutation testing (mutation analysis) is a structural software testing method which involves modifying the program in small ways [14], [9]. The modified versions of the program are called mutants. Consider, for example, the following fragment of a Java program:

if (x ≥ && a) y = y + ; else y = y + ;

Mutants for this code fragment can be obtained by substituting: (i) && with another logic operator, e.g., ||; (ii) ≥ with another comparison operator, e.g., > or =; (iii) + with another arithmetic operator, e.g., −; (iv) one variable (e.g., x) with another one, e.g., y (we assume that the two variables have the same type). Some (not all) mutants of the above code fragment are given below:

if (x ≥ || a) y = y + ; else y = y + ;
if (x > && a) y = y + ; else y = y + ;
if (x ≥ && a) y = y − ; else y = y + ;
if (x ≥ && a) y = y + ; else y = y − ;
if (x ≥ && a) x = y + ; else y = y + ;
if (x ≥ && a) y = y + ; else x = y + ;

A variety of mutation operators (ways of introducing errors into the correct code) for imperative languages are defined in the literature [9], [10] (a few examples are given above); these are called traditional mutation operators. Besides these, there are mutation operators for specialised programming environments, such as object-oriented languages [10]. A popular tool for generating mutants for Java programs is muJava [15], [10]. The underlying idea behind mutation testing is that, in practice, an erroneous program either differs only in a small way from the correct program or, alternatively, a bigger fault can be expressed as the summation of smaller (basic) faults; so, in order to detect the fault, the appropriate mutants need to be generated. If the test suite is able to detect the fault (i.e., one of the tests fails), then the mutant is said to be killed.

Copyright © 2006-2009 by CCC Publications
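The operator-replacement idea above can be illustrated with a toy mutant generator working on a tokenised code fragment. This is a sketch of the general technique only, not muJava's implementation; the operator classes and the sample tokens (`x`, `k`, `a`) are hypothetical.

```python
# For each operator class, the alternatives an occurrence may be replaced by.
OPERATOR_CLASSES = {
    "&&": ["||"], "||": ["&&"],
    ">=": [">", "=="], ">": [">="],
    "+": ["-"], "-": ["+"],
}

def operator_mutants(tokens):
    """Generate all single-operator-replacement mutants of a token list:
    each mutant changes exactly one operator occurrence."""
    mutants = []
    for i, tok in enumerate(tokens):
        for replacement in OPERATOR_CLASSES.get(tok, []):
            mutants.append(tokens[:i] + [replacement] + tokens[i + 1:])
    return mutants
```

For the condition tokens `["x", ">=", "k", "&&", "a"]` this yields three mutants, one per admissible substitution, mirroring the (i) and (ii) substitutions listed above.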
Two kinds of mutation have been defined in the literature: weak mutation requires the test input to cause different program states for the mutant and the original program; strong mutation additionally requires the erroneous state to be propagated to the end of the program. Mutation analysis has been largely used in white-box testing, but only a few tentative attempts to use this idea in black-box testing have been reported in the literature [11]. Offutt et al. propose a general strategy for developing mutation operators for a grammar-based software artefact, but the ideas that outline the proposed strategy for mutation operator development are rather vague and general, and no formalisation is provided. In this paper we provide a formal way of generating mutants for systems specified by context-free grammars. Given such a specification, a derivation (or parse) tree can be associated with it. Based on the tree, we formally describe the process of generating the mutants for the given specification. Furthermore, the paper shows how the proposed method can be used to construct mutants for a P system specification.

2 Preliminaries

For an alphabet V = {a_1, ..., a_p}, V* denotes the set of all strings over V; λ denotes the empty string. For a string u ∈ V*, |u|_{a_i} denotes the number of occurrences of a_i in u. Each string u has an associated vector of non-negative integers (|u|_{a_1}, ..., |u|_{a_p}); this is denoted by Ψ_V(u). The concept of context-free grammar is assumed known; for details we refer to a classical textbook [6]. Only proper context-free grammars, i.e., with no useless symbols and no λ- or renaming productions, will be used in this paper. With any derivation from the start symbol to a string of terminal symbols w, a derivation (or parse) tree is associated whose yield, the string of terminals obtained by concatenating the leaves from left to right, is w.
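The Parikh mapping Ψ_V defined at the start of this section is a one-liner; a minimal sketch, with the alphabet passed as an ordered tuple:

```python
def parikh(u, alphabet):
    """Psi_V(u): the vector (|u|_{a_1}, ..., |u|_{a_p}) of symbol counts
    of the string u, in the fixed order of the alphabet."""
    return tuple(u.count(a) for a in alphabet)
```

This vector view of strings is the one used later when multisets of a P system configuration are treated as vectors of non-negative integers.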
The set of terminal strings derived from the start symbol is called the language generated by the grammar. A grammar is said to be ambiguous if there exists a string that can be generated by more than one leftmost derivation (a derivation in which the leftmost nonterminal is always rewritten), i.e., by more than one derivation (parse) tree. In the sequel, possibly ambiguous grammars will be considered.

2.1 P systems

A basic cell-like P system is defined as a hierarchical arrangement of membranes identifying corresponding regions of the system. With each region there are associated a finite multiset of objects and a finite set of rules; both may be empty. A multiset is denoted either by a string u ∈ V*, in which the order is not considered, or by Ψ_V(u). The following definition refers to one of the many variants of P systems, namely cell-like P systems with non-cooperative transformation and communication rules [13]. We will call these processing rules. From now on we will refer to this model simply as a P system.

Definition 1. A P system is a tuple Π = (V, µ, w_1, ..., w_n, R_1, ..., R_n), where V is a finite set, called alphabet; µ defines the membrane structure, a hierarchical arrangement of n compartments, called regions, delimited by membranes; these membranes and regions are identified by the integers 1 to n; w_i, 1 ≤ i ≤ n, represents the initial multiset occurring in region i; R_i, 1 ≤ i ≤ n, denotes the set of processing rules applied in region i.

The membrane structure µ is denoted by a string of left and right brackets ([ and ]), each with the label of the membrane it points to; µ also describes the position of each membrane in the hierarchy. The rules in each region have the form u → (a_1, t_1) ... (a_m, t_m), where u is a multiset of symbols from V, a_i ∈ V, t_i ∈ {in, out, here}, 1 ≤ i ≤ m.
When such a rule is applied to a multiset u in the current region, u is replaced by the symbols a_i with t_i = here; symbols a_i with t_i = out are sent to the outer region (or outside the system when the current region is the external compartment) and symbols a_i with t_i = in are sent into one of the regions contained in the current one, arbitrarily chosen. In the following definitions and examples all symbols (a_i, here) are written simply as a_i. The rules are applied in the maximally parallel mode, which means that they are used in all regions at the same time and, in each region, all the symbols that can be processed must be. A configuration of the P system Π is a tuple c = (u_1, ..., u_n), where u_i ∈ V* is the multiset associated with region i, 1 ≤ i ≤ n. A derivation of a configuration c_1 to c_2 in the maximally parallel mode is denoted by c_1 =⇒ c_2. In the set of all configurations we distinguish terminal configurations: c = (u_1, ..., u_n) is a terminal configuration if there is no region i such that u_i can be further derived. For the type of P systems investigated in this paper, multiple membranes can be equivalently collapsed into one membrane by properly renaming the symbols in each membrane. Thus, for the sake of convenience, in what follows we only focus on P systems with one membrane.

2.2 Kripke structures

Definition 2. A Kripke structure over a set of atomic propositions AP is a four-tuple M = (S, H, I, L), where S is a finite set of states; I ⊆ S is a set of initial states; H ⊆ S × S is a transition relation that must be left-total, that is, for every state s ∈ S there is a state s' ∈ S such that (s, s') ∈ H; L : S −→ 2^AP is an interpretation function that labels each state with the set of atomic propositions true in that state.

Usually, the Kripke structure representation of a system is obtained by giving values to every variable in each configuration of the system. Suppose var_1, ..., var_n are the system variables, Val_i denotes the set of values for var_i, and val_i is a value from Val_i, 1 ≤ i ≤ n.
Then the states of the system are S = {(val_1, ..., val_n) | val_1 ∈ Val_1, ..., val_n ∈ Val_n}, and the set of atomic predicates is AP = {(var_i = val_i) | 1 ≤ i ≤ n, val_i ∈ Val_i}. Naturally, L will map each state (given by the values of the variables) onto the corresponding set of atomic propositions. Additionally, a halt (sink) state is needed when H is not left-total, and an extra atomic proposition, indicating that the system has reached this state, is added to AP. For convenience, in the sequel AP and L will be omitted from the definition of a Kripke structure.

3 Mutation testing from a context-free grammar

In this section we provide a way of constructing mutants for systems specified by context-free grammars. Given the system specification, in the form of a parse tree, we formally describe the generation of mutants for the given specification. Consider a context-free grammar G = (V, T, P, S) and L(G) the language defined by G. We assume that, for every production rule p : A −→ X_1 ... X_k, we have defined a set Mut(p), called the set of mutants of p. A mutant p' of p is a production rule of the form A −→ X'_1 ... X'_n such that each symbol X'_1, ..., X'_n is either a terminal or is found among X_1, ..., X_k. Furthermore, p' is either a production rule of G itself or has the form A −→ a, a ∈ V; this condition ensures that the yield of the mutated tree is syntactically correct. Among the mutants of p, the following types of mutants can be distinguished:

• A terminal replacement mutant is a production rule of the form A −→ X'_1 ... X'_k if there exists j, 1 ≤ j ≤ k, such that X_j, X'_j ∈ T, X_j ≠ X'_j and X'_i = X_i, 1 ≤ i ≤ k, i ≠ j.
• A terminal insertion mutant is a production rule of the form A −→ w, where w is obtained by inserting one terminal into the string X_1 ... X_k (at any position).
• A string deletion mutant is a production rule of the form A −→ w, where w is obtained by removing one or more symbols from X_1 ... X_k.
• A string reordering mutant is a production rule of the form A −→ w, where w is obtained by reordering the string X_1 ... X_k.

Given any parse tree tr for G, the set of mutants of tr is defined as follows:

• A one-node tree has no mutants.
• Let tr be the tree with root A and subtrees tr_1, ..., tr_k having as roots the nodes X_1, ..., X_k, respectively, and let p ∈ P be the corresponding production rule of G, of the form A −→ X_1 ... X_k. This is denoted by tr = maketree(A, tr_1, ..., tr_k). Let tr' denote a mutant of tr. Then either
  – (subtree mutation) tr' = maketree(A, tr'_1, ..., tr'_k), where there exists j, 1 ≤ j ≤ k, such that tr'_j is a mutant of tr_j and tr'_i = tr_i, 1 ≤ i ≤ k, i ≠ j, or
  – (rule mutation) tr' = maketree(A, tr'_1, ..., tr'_n), where there exists a mutant p' of p of the form A −→ X'_1 ... X'_n such that for every i, 1 ≤ i ≤ n, there exists j_i, 1 ≤ j_i ≤ k, such that tr'_i = tr_{j_i}.

According to [11] these operations can be made so as to keep the result they produce in the same language or in a larger one. In the first case a much simpler approach can be considered, whereby each rule having a certain nonterminal on the left-hand side is replaced by another, different rule having the same nonterminal on the left-hand side. However, the above set of operations provides a two-stage method which generates mutants by considering first the rule level and then the derivation (parse) tree. If these operations are restricted to produce strings in the same language, then we have the following result.

Lemma 3. Every mutant of a parse tree of G is also a parse tree of G.

Proof. Follows by induction on the depth of the tree.

Thus, the yield of any mutant constructed as above belongs to the language described by G, and so only syntactically correct mutants will be generated.
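The two-stage tree mutation just defined can be sketched as a short recursion. This is an illustrative encoding, not the paper's formalism: trees are `(symbol, children)` tuples with leaves as `(symbol, ())`, and `rule_mutants` maps a production `(A, rhs)` to mutant right-hand sides whose elements are either an index `j_i` into the original children (reusing subtree `tr_{j_i}`) or a terminal string (a new leaf).

```python
def yield_of(tree):
    """Concatenate the leaves of the parse tree from left to right."""
    sym, kids = tree
    return sym if not kids else "".join(yield_of(k) for k in kids)

def mutants(tree, rule_mutants):
    sym, kids = tree
    if not kids:
        return []                        # a one-node tree has no mutants
    out = []
    # subtree mutation: mutate exactly one child, keep the others
    for j, kid in enumerate(kids):
        for m in mutants(kid, rule_mutants):
            out.append((sym, kids[:j] + (m,) + kids[j + 1:]))
    # rule mutation at the root: rebuild the children from the mutant rhs
    rhs = tuple(k[0] for k in kids)
    for new_rhs in rule_mutants.get((sym, rhs), []):
        new_kids = tuple(kids[x] if isinstance(x, int) else (x, ())
                         for x in new_rhs)
        out.append((sym, new_kids))
    return out
```

For the tree of "1+2" built from S −→ S + S, a rule mutant (0, "-", 2) expresses the terminal replacement + → −, and (2,) a string deletion keeping only the right subtree; their yields are "1-2" and "2".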
syntactically incorrect mutants are useless (they do not produce test data) and so the complexity of the testing process is reduced by making sure that these are ruled out from the outset.
mutation based testing of p systems 257
let us consider the grammar g = (v, t, p, s) where v = {s}; t = {1, . . . , n} ∪ {+, −}, with n a fixed upper bound; p = {p1, p2} ∪ {p3^i | 1 ≤ i ≤ n}, with p1 : s −→ s + s, p2 : s −→ s − s, p3^i : s −→ i, 1 ≤ i ≤ n. suppose we have the following rule mutants:
• for p1 : s −→ s − s (terminal replacement), s −→ s (string deletion)
• for p2 : s −→ s + s (terminal replacement), s −→ s (string deletion)
• for p3^i : s −→ i − 1 and s −→ i + 1 if 1 < i < n, s −→ 2 if i = 1 and s −→ n − 1 if i = n.
the mutants of p3^i are of terminal replacement type and are based on a technique widely used in software testing practice, called boundary value analysis. according to practical experience, many errors tend to lurk close to boundaries; thus, an efficient way to uncover faults is to look at the neighbouring values. consider the string  +  −  and a parse tree for this string as represented in figure 1 (leaf nodes are in bold). the construction of mutants for the given parse tree is illustrated in figures 2, 3 and 4. thus, the mutated strings are  +  − ,  +  − ,  +  − ,  +  − ,  − ,  − ,  +  − ,  +  − ,  +  + ,  −  − ,  + , . some of these produce the same result as the original string; these are called equivalent mutants. since no input value can distinguish these mutants from the correct string, they will not affect the test suite when strong mutation is considered.
figure 1: example parse tree
4 p system mutation testing
consider a 1-membrane p system π = (v, µ, w, r), where r = {r1, . . . , rm}; each rule ri, 1 ≤ i ≤ m, is of the form ui −→ vi, where ui and vi are multisets over the alphabet v.
in the sequel, we treat the multisets as vectors of non-negative integers, that is, each multiset u is replaced by ψ_v(u) ∈ N^k, where k denotes the number of symbols in v. in order to keep the number of configurations finite we will assume that each component of a configuration u cannot exceed an established upper bound, denoted max. we write u ≤ max if ui ≤ max for every 1 ≤ i ≤ k, and set N^k_max = {u ∈ N^k | u ≤ max}. analogously to [3], the system is assumed to crash whenever u ≤ max does not hold (this is different from normal termination, which occurs when u ≤ max and no rule can be applied). under these conditions, the 1-membrane p system π can be described by a kripke structure. in order to define the kripke structure equivalent of π we use two predicates, maxparal and apply, defined by:
figure 2: 1st level mutants
maxparal(u, u1, v1, n1, . . . , um, vm, nm), u ∈ N^k_max, n1, . . . , nm ∈ N, signifies that a derivation of the configuration u in maximally parallel mode is obtained by applying rules r1 : u1 −→ v1, . . . , rm : um −→ vm, n1, . . . , nm times, respectively; apply(u, v, u1, v1, n1, . . . , um, vm, nm), u ∈ N^k_max, n1, . . . , nm ∈ N, denotes that v is the result of applying rules r1, . . . , rm, n1, . . . , nm times, respectively. then the kripke structure equivalent m = (s, h, i, l) of π is defined as follows: s = N^k_max ∪ {halt, crash} with halt, crash ∉ N^k_max, halt ≠ crash; i = w; h is defined by:
• (u, v) ∈ h, u, v ∈ N^k_max, if ∃n1, . . . , nm ∈ N · maxparal(u, u1, v1, n1, . . . , um, vm, nm) ∧ apply(u, v, u1, v1, n1, . . . , um, vm, nm);
• (u, halt) ∈ h, u ∈ N^k_max, if ¬∃v ∈ N^k_max, n1, . . . , nm ∈ N · apply(u, v, u1, v1, n1, . . . , um, vm, nm);
• (u, crash) ∈ h if ¬∃v ∈ N^k_max ∪ {halt} · (u, v) ∈ h;
• (halt, halt) ∈ h, (crash, crash) ∈ h.
it can be observed that the relation h is left-total.
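treating configurations as vectors, the two predicates admit a small brute-force sketch. the code below is illustrative only: rules are assumed to be given as (lhs, rhs) vector pairs over the alphabet, and the enumeration cap `bound` is an artificial search limit, not part of the definitions above.

```python
from itertools import product

def applicable(lhs, u):
    """a rule with left-hand side vector lhs can fire in configuration u."""
    return all(a <= b for a, b in zip(lhs, u))

def max_parallel(u, rules, bound=3):
    """enumerate tuples (n1, ..., nm) of maximally parallel rule
    applications to configuration u (the 'maxparal' predicate)."""
    for counts in product(range(bound + 1), repeat=len(rules)):
        consumed = [sum(n * lhs[i] for n, (lhs, _) in zip(counts, rules))
                    for i in range(len(u))]
        if any(c > x for c, x in zip(consumed, u)):
            continue  # not enough objects to fire the rules this often
        rest = [x - c for x, c in zip(u, consumed)]
        # maximality: no rule can still fire on the leftover objects
        if any(counts) and not any(applicable(lhs, rest) for lhs, _ in rules):
            yield counts

def apply_rules(u, rules, counts):
    """the configuration v obtained by firing rule i counts[i] times
    (the 'apply' predicate)."""
    v = list(u)
    for n, (lhs, rhs) in zip(counts, rules):
        for i in range(len(v)):
            v[i] += n * (rhs[i] - lhs[i])
    return tuple(v)
```

for instance, with alphabet {a, b} and the single rule a −→ b, the only maximally parallel step from configuration (2, 0) applies the rule twice, yielding (0, 2); a configuration with no applicable rule yields no step, which corresponds to the halt transition above.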
in order to use mutation analysis in p system testing we first have to describe an appropriate context-free grammar, such that the p system specification can be written as a string accepted by this grammar. the parse tree for the string is then generated and the procedure presented in the previous section is used for mutant construction. the grammar definition will depend on the level at which testing is intended to be performed. at a high level (for instance in integration testing) the predicates maxparal and apply will normally be assumed to be correctly implemented and so they will be represented as terminals in the grammar; obviously, they can themselves be described by context-free grammars and appropriate mutants will be generated in a similar fashion. on the other hand, it is possible to incorporate the definitions of the two predicates into the definition of the transition relation h; in this case the corresponding grammar will be much more complex and system testing will be performed in one single step. the following (simplified) example illustrates the above strategy for high-level testing of p systems.
figure 3: 2nd level mutants
example 4. consider a 1-membrane p system with 2 rules r1 : u1 −→ v1, r2 : u2 −→ v2.
then the transition relation of the kripke structure representation of π is given by the formulae:
• (u, v) ∈ h, u, v ∈ N^k_max, if ∃n1, n2 ∈ N · maxparal(u, u1, v1, n1, u2, v2, n2) ∧ apply(u, v, u1, v1, n1, u2, v2, n2);
• (u, halt) ∈ h, u ∈ N^k_max, if ¬∃v ∈ N^k_max, n1, n2 ∈ N · apply(u, v, u1, v1, n1, u2, v2, n2);
• (u, crash) ∈ h if ¬∃v ∈ N^k_max ∪ {halt} · (u, v) ∈ h;
• (halt, halt) ∈ h, (crash, crash) ∈ h.
then such a system can be described by a context-free grammar g = (v, t, p, s) where v = {s, s1, s2, u, v, u1, v1, u2, v2}; t contains (bounded) vectors from N^k, the additional states halt and crash, the predicates maxparal and apply, the "true" logical value, logical operators, quantifiers and other symbols, i.e., t = N^k_max ∪ {halt, crash, maxparal, apply, true, ∧, ∨, ¬, ∃, ∀, n1, n2, ·, (, )}. the set of production rules consists of: p1 : s −→ ¬s; p2 : s −→ s ∧ s; p3 : s −→ s ∨ s; p4 : s −→ true; p5 : s −→ ∃n1 · s; p6 : s −→ ∃n2 · s; p7 : s −→ s1 ∧ s2; p8 : s1 −→ apply(u, v, u1, v1, n1, u2, v2, n2); p9 : s2 −→ maxparal(u, u1, v1, n1, u2, v2, n2); plus rules that transform the nonterminals u, u1, v1, u2, v2 into vectors from N^k.
figure 4: 3rd level mutants
the following mutants can be defined for the rules p1 to p7: p′1 : s −→ s; p′2 : s −→ s ∨ s, p′′2 : s −→ s; p′3 : s −→ s ∧ s, p′′3 : s −→ s; p′4 : s −→ ¬true; p′5 : s −→ ∀n1 · s; p′6 : s −→ ∀n2 · s; p′7 : s −→ s1 ∨ s2, p′′7 : s −→ s1. for p8, mutants can be defined by negating the predicate and by changing parameters such that the obtained formula is syntactically correct, e.g., switching u1 and u2. similarly, mutants for p9 are obtained by negating the predicate and changing parameters such that the obtained formula is syntactically correct. for the remaining rules, mutants are generated by adding 1 to or subtracting 1 from each integer value.
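the last kind of mutant described above, shifting each integer component of a terminal vector by one, can be sketched in a few lines; this is an illustrative reading of the text, not the authors' implementation, and it does not clamp the shifted values to the bounded range.

```python
def integer_shift_mutants(vector):
    """mutants of a terminal vector obtained by adding 1 to or
    subtracting 1 from each component, one component at a time."""
    mutants = []
    for i, x in enumerate(vector):
        for dx in (-1, 1):
            mutants.append(vector[:i] + (x + dx,) + vector[i + 1:])
    return mutants
```

a caller restricting mutants to the language of the grammar would additionally discard vectors that fall outside N^k_max.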
5 conclusions
in many applications based on formal specification methods the test sets are generated directly from the formal models. the same applies to formal models based on grammars. however, the approach presented in [11], although novel and with many practical consequences, lacks a rigorous method of defining the process of generating the mutants. in this paper a formal method is introduced to rigorously define operations with rules and subtrees of derivation trees for context-free grammar formalisms. this is then extended to p systems and some examples are provided to illustrate the approach. in this paper, the mutation operators are applied to the kripke structure equivalent of the p system rather than to the p system itself. the advantage of this approach is that test values can be simply generated using a model checking tool (these are the counterexamples returned by the tool). future work may investigate the application of the mutation operators directly to the p system and the associated test generation process.
acknowledgment. this work is supported by the cncsis grant idei 643/2009 (evomt). the authors are grateful to the reviewers for their comments.
bibliography
[1] f. bernardini, m. gheorghe, j. j. romero-campero, n. walkinshaw, a hybrid approach to modelling biological systems, workshop on membrane computing 2007, lecture notes in computer science, vol. 4860, pp. 138–159, 2007.
[2] g. ciobanu, gh. păun, m. j. pérez-jiménez (eds.), applications of membrane computing, springer, 2006.
[3] z. dang, o. h. ibarra, c. li, g. xie, on the decidability of model-checking for p systems, journal of automata, languages and combinatorics, vol. 11, pp. 279–298, 2006.
[4] m. gheorghe, f. ipate, on testing p systems, workshop on membrane computing, lecture notes in computer science, vol. 5391, pp. 204–216, 2008.
[5] m. holcombe, f. ipate, correct systems: building a business process solution, springer, 1998.
[6] j. e. hopcroft, r.
motwani, j. d. ullman, introduction to automata theory, languages, and computation (2nd edition), addison-wesley, 2001.
[7] f. ipate, m. gheorghe, testing non-deterministic stream x-machine models and p systems, electronic notes in theoretical computer science, vol. 227, pp. 113–126, 2009.
[8] f. ipate, m. gheorghe, finite state based testing of p systems, natural computing, to appear, 2009.
[9] j. offutt, a practical system for mutation testing: help for the common programmer, international test conference, pp. 824–830, 1994.
[10] y.-s. ma, j. offutt, y. r. kwon, mujava: an automated class mutation system, software testing, verification and reliability, vol. 15, pp. 97–133, 2005.
[11] j. offutt, p. ammann, l. (ling) liu, mutation testing implements grammar-based testing, proceedings of the second workshop on mutation analysis, 2006.
[12] gh. păun, computing with membranes, journal of computer and system sciences, vol. 61, pp. 108–143, 2000.
[13] gh. păun, membrane computing: an introduction, springer-verlag, berlin, 2002.
[14] http://en.wikipedia.org/wiki/mutation_testing
[15] http://cs.gmu.edu/ offutt/mujava/
[16] http://ppage.psystems.eu
florentin ipate was born on 4th december 1967 in constanta. fi holds phd and msc degrees from the university of sheffield and a bsc from politehnica university of bucharest, all in computer science. he is now a professor of computer science and phd supervisor with the university of pitesti. he has been awarded the in hoc signo vinces prize for research and publications by the national research council for higher education, romania, 2002, and the copyro publishing prize for computer science, romania, 2000. fi's research interests are in specification and model based testing, formal specification languages for software systems, agile modelling and testing, and modelling and testing biology-inspired computing systems.
his main research results have been published in a research monograph with springer and in high profile journals.
marian gheorghe was born on 2nd february 1953 in bucharest. mg holds a phd and a bsc from the university of bucharest. he is now a senior lecturer with the university of sheffield and head of the verification and testing group. mg's research interests are in formal computational models, verification and testing, modelling biological systems, agent technologies, artificial life, and empirical software engineering. he has published in important international journals and is featured in the main computer science publications database, dblp, with around 60 items.
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iv (2009), no. 4, pp. 374-385
an immuno-genetic hybrid algorithm
e. nabil, a. badr, i. farag
emad nabil, misr university for science and technology, information technology faculty, computer science department. e-mail: emadnabilcs@gmail.com
amr badr, ibrahim farag, cairo university, computers and information faculty, computer science department, 5 dr. ahmed zewail street, postal code: 12613, orman, giza, egypt. e-mail: ruaab@rusys.eg.net, i.farag@fci-cu.edu.eg
abstract: the construction of artificial systems by drawing inspiration from natural systems is not a new idea. the artificial neural network (ann) and genetic algorithms (gas) are good examples of successful applications of the biological metaphor to the solution of computational problems. the study of artificial immune systems is a relatively new field that tries to exploit the mechanisms of the natural immune system (nis) in order to develop problem-solving techniques. in this research, we have combined the artificial immune system with the genetic algorithms in one hybrid algorithm.
we proposed a modification to the clonal selection algorithm, which is inspired by the clonal selection principle and affinity maturation of the human immune response, by hybridizing it with the crossover operator, imported from gas to increase the exploration of the search space. we also introduced adaptability of the mutation rates by applying a degrading function, so that the mutation rates decrease with time as the affinity of the population increases. the hybrid algorithm is used for evolving a fuzzy rule system to solve the well-known wisconsin breast cancer diagnosis problem (wbcd). our evolved system exhibits two important characteristics: first, it attains high classification performance, with the possibility of attributing a confidence measure to the output diagnosis; second, the system has a simple fuzzy rule system and is therefore human-interpretable. the hybrid algorithm outperforms both the gas and the ais: it reached the classification ratio of 97.36%, with only one rule, in earlier generations than the two other algorithms. the learning and memory acquisition of our algorithm was verified through its application to a binary character recognition problem. here, too, the hybrid algorithm outperforms the gas and ais and reaches the convergence point before them.
keywords: genetic algorithms, artificial immune system, fuzzy logic, breast cancer diagnosis, memory acquisition.
1 introduction
computing and engineering have been enriched by the introduction of biological ideas to help develop solutions to various problems. this can be exemplified by artificial neural networks (ann), evolutionary algorithms (ea) [11], artificial life (alife), and cellular automata (ca) [13].
there exist three different approaches. the first is biologically motivated computing, under whose umbrella fall the ea, ann and artificial immune system (ais) [21]. the second is computationally motivated biology, where computing provides models and inspiration for biology (i.e. alife and ca). the third approach is computing with biological mechanisms, which involves the use of the information processing capabilities of biological systems to replace or supplement current silicon-based computers (e.g. membrane computing, quantum computing and dna computing) [8], [9], [14], [18]. our research falls under the umbrella of the first approach.
copyright c© 2006-2009 by ccc publications
in this paper, we combine two methodologies, genetic algorithms and artificial immune systems (ais), so as to automatically produce a fuzzy system for breast cancer diagnosis. the major advantage of fuzzy systems is that they favor interpretability [3], [4] and provide what is called a confidence measure, which means, in our case, the degree of benignity or malignancy. finding good fuzzy systems is quite a hard task. this is where the ga and ais algorithms come in, enabling the automatic production of fuzzy systems based on a database of training cases. in this paper we also verify the ability of memory acquisition and learning of the algorithm by applying it to a binary pattern recognition problem. the paper is organized as follows: in the next two sections we provide an overview of the clonal selection algorithm and the genetic algorithm. in section 4 we present our proposed hybrid algorithm between ga and ais, which is tested on the wisconsin breast cancer diagnosis (wbcd) problem described in section 5. the evolved fuzzy system for the wbcd, the parameter setup and testing are also included in section 5. section 6 covers the learning and memory acquisition of the hybrid algorithm.
the algorithm testing is also delineated in section 6, followed by concluding remarks in section 7.
2 the clonal selection algorithm
the standard clonal selection algorithm clonalg [5], [6], [15], [16], [17] can be summarized as follows.
begin
  t = 0;
  initialize the initial population p(t) randomly;
  identify antigen s;
  evaluate affinity p(t) versus s;
  while (not finished) do
  begin
    t = t + 1;
    select c(t) from p(t-1);
    proportional cloning of c(t) forming c'(t);
    mutation of c'(t) forming c''(t);
    select p(t) from c''(t) and p(t-1);
    select memory cell from p(t);
    metadynamics;
  end.
end.
3 the genetic algorithm (ga)
the standard genetic algorithm [7], [23] can be summarized as follows.
begin
  t = 0;
  initialize the initial population p(t) randomly;
  evaluate structures in p(t);
  while (not finished) do
  begin
    t = t + 1;
    select parents c(t) from p(t-1);
    crossover and mutate structures in c(t) forming c'(t);
    replace c'(t) by p(t-1);
  end.
end.
4 the proposed hybrid algorithm

d = Σ_{i=1}^{l} δ_i, where δ_i = 1 if ab_i ≠ ag_i and δ_i = 0 otherwise.   (1)

the affinities of individuals are measured using the hamming distance depicted in equation 1. the proposed algorithm modifies the clonal selection algorithm's mutation method. mutation in nature occurs at a small percentage value (= 0.002), and this is rational from the computational point of view, to ensure that good solutions are not distorted too much. however, research has shown that an initially large mutation rate that decreases exponentially as a function of the generation number improves convergence speed and accuracy [1]. the initially large mutation rate ensures that a large space is covered, while the mutation rate becomes smaller when the individuals start to converge to the optimum.
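the two ingredients just described, the hamming-distance affinity of equation (1) and the exponentially decaying mutation rate of equation (2), can be sketched in a few lines of python; the values of m0 and tau below are illustrative, not taken from the paper.

```python
import math
import random

def hamming_affinity(ab, ag):
    """equation (1): number of positions where antibody ab and antigen
    ag differ; a lower distance means a higher affinity."""
    assert len(ab) == len(ag)
    return sum(1 for x, y in zip(ab, ag) if x != y)

def mutation_rate(t, m0=0.3, tau=50.0):
    """equation (2): time-decaying mutation rate m(t) = m(0) e^(-t/tau)."""
    return m0 * math.exp(-t / tau)

def mutate(bits, rate, rng=random.Random(0)):
    """flip each bit of a 0/1 list independently with the current rate."""
    return [b ^ (rng.random() < rate) for b in bits]
```

the rate starts at m0 and shrinks toward zero with the generation number t, which matches the exploration-then-exploitation behaviour discussed above.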
this is an accepted solution to the trade-off between exploration and exploitation. we used the time-decaying formula in equation (2) [18], [22], [24], where τ is a positive constant, m(0) is the initial large mutation rate and t is the generation number. the equation is depicted in figure 1. we have imported the crossover operator from the genetic algorithms in order to increase the exploration of the landscape and to add a recombination operator to the clonal selection algorithm.

m(t) = m(0) e^(−t/τ)   (2)

figure 1: the effect of the degrading function on the mutation value
the proposed algorithm can be summarized as follows.
begin
  t = 0;
  initialize the initial population p(t) randomly;
  identify antigen s;
  evaluate fitness p(t) versus s;
  while (not finished) do
  begin
    1. t = t + 1;
    2. select c(t) from p(t-1);
    3. proportional cloning of c(t) forming c'(t);
    4. degraded proportional mutation of c'(t) forming c''(t);
    5. crossover of c''(t) forming c*(t);
    6. select p(t) from c*(t) and p(t-1);
    7. select memory cell from p(t);
    8. metadynamics;
  end.
end.

table 1: the wbcd data representation
case | v1 | v2 | ... | v9 | diagnosis
1    | 1  | 2  | ... | 8  | benign
2    | 2  | 4  | ... | 3  | benign
...  | ...| ...| ... | ...| ...
683  | 4  | 8  | ... | 1  | malignant

the proposed algorithm will be tested on the famous wisconsin breast cancer diagnosis problem (section 5) and on a simple binary pattern recognition problem (section 6) to verify the memory acquisition ability of the algorithm.
5 the wisconsin breast cancer diagnosis problem
in this section, we present the wisconsin breast cancer diagnosis problem [3], which is the test case of our proposed algorithm. breast cancer is the most common cancer among women, excluding skin cancer. the presence of a breast mass is an alert sign of cancer, but it does not always indicate a malignant one. fine needle aspiration (fna) is an outpatient procedure that involves using a small-gauge needle to extract fluid directly from a breast mass.
the fna procedure on breast masses is a cost-effective, non-traumatic, and mostly non-invasive diagnostic test that obtains the information needed to evaluate malignancy. the wisconsin breast cancer diagnosis (wbcd) database [4] is the result of the efforts made at the university of wisconsin hospital for accurately diagnosing breast masses based solely on an fna test. nine visually assessed characteristics of an fna sample considered relevant for diagnosis were identified, and each assigned an integer value between 1 and 10. the measured variables are as follows: 1. clump thickness (v1); 2. uniformity of cell size (v2); 3. uniformity of cell shape (v3); 4. marginal adhesion (v4); 5. single epithelial cell size (v5); 6. bare nuclei (v6); 7. bland chromatin (v7); 8. normal nucleoli (v8); 9. mitosis (v9). the database itself consists of 683 cases. the general form of the database is described in table 1. there exist some previous systems that achieved a high classification ratio, but these systems look like black boxes, with no explanation or interpretation of how the decision was taken. further, the degree of benignity or malignancy is not provided. these two points are covered in this study, besides a high classification ratio.
5.1 evolutionary fuzzy modeling
evolutionary algorithms are used to search large, and often complex, search spaces. they have proven worthwhile on numerous diverse problems and are able to find near-optimal solutions with an adequate performance measure. fuzzy modeling can be seen as an optimization problem where part or all of the parameters of a fuzzy system constitute the search space.
5.2 applying evolution to fuzzy modeling
three of the four types of fuzzy parameters can be used to define targets for evolutionary fuzzy modelling: structural parameters, connective parameters, and operational parameters. logical parameters are usually predefined by the designer based on experience.
the evolutionary algorithm is used to tune the knowledge contained in the fuzzy system by finding the membership function values (the p and d values) and the relevant variables. evolutionary structure learning is carried out by encoding an entire fuzzy system within the genome. this is known as the pittsburgh approach.
figure 2: the proposed diagnosis system. note that the fuzzy subsystem displayed to the left is the fuzzy inference system in figure 3.
figure 3: basic structure of a fuzzy inference system
5.3 evolving fuzzy systems for the wbcd problem
the solution scheme we propose for the wbcd problem is depicted in figure 2; note that the fuzzy subsystem displayed to the left of figure 2 is the fuzzy inference system of figure 3 [10], [20]. the system of figure 2 consists of a fuzzy system and a threshold unit. the fuzzy system computes a malignancy value for a case, based on the input values; the threshold unit then outputs a benign or malignant diagnostic according to the fuzzy system's output. if the malignancy value is less than or equal to 3, the case is considered benign; otherwise, it is diagnosed as malignant.
5.4 fuzzy system parameters
according to information obtained from previous work [3], we have deduced the following points.
• small number of rules: systems with no more than four rules have been shown to obtain high performance [2], [19].
• small number of variables: rules with no more than 4 antecedents have proven to be adequate [2].
• nature of the input variables: higher-valued variables are associated with malignancy.
some fuzzy models forgo interpretability in the interest of improved performance. where medical diagnosis is concerned, interpretability, also called linguistic integrity, is the major advantage of fuzzy systems.
this motivated us to take into account the following semantic criteria, defining constraints on the fuzzy parameters [12]:
• distinguishability: to what extent the system is understandable and interpretable.
• justifiable number of elements: the number of membership functions of a variable should not exceed the limit of 7 ± 2 distinct terms. the same criterion is applied to the number of variables in the rule antecedent; this is to be familiar for humans.
• orthogonality: for each element of the universe of discourse, the sum of all its membership values should be equal to one.
5.5 the fuzzy system setup
logical parameters
• reasoning mechanism: singleton-type fuzzy system, i.e. output membership functions are real values rather than fuzzy ones.
• fuzzy operators: min.
• input membership function type: orthogonal, trapezoidal.
• defuzzification method: weighted average.
structural parameters
• relevant variables: there is insufficient a priori knowledge to define them; therefore, finding them is one of the algorithm's objectives.
• number of input membership functions: two membership functions, denoted low and high.
• number of output membership functions: two singletons are used, corresponding to the benign and malignant diagnostics.
• number of rules: in our approach, this is a user-configurable parameter; here there will be only one rule, and the rule itself is to be found by the genetic algorithm.
connective parameters
• antecedents of rules: to be found by the algorithm.
• consequent of rules: the algorithm finds rules for the benign diagnostic; the malignant diagnostic is an else condition.
• rule weights: active rules have a weight of 1 and the else condition has a weight of 0.25.
operational parameters
• input membership function values: to be found by the evolutionary algorithm.
• output membership function values: following the wbcd database, we used a value of 2 for benign and 4 for malignant.
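the setup above can be sketched in python under stated assumptions: membership functions are the orthogonal trapezoids determined by the p (start point) and d (edge length) parameters, rule activation uses the min operator, and the else condition is given activation 1 - mu with weight 0.25 before weighted-average defuzzification. the exact edge shapes and the else-handling are our reading of the setup, not code from the paper.

```python
def low(x, p, d):
    """'low' membership: 1 up to p, then a linear edge of length d down to 0."""
    if x <= p:
        return 1.0
    if x >= p + d:
        return 0.0
    return (p + d - x) / d

def high(x, p, d):
    """orthogonality: for every x, low and high sum to one."""
    return 1.0 - low(x, p, d)

def rule_activation(values, antecedents, params):
    """min-based activation: antecedent code 1 means 'low', 2 means 'high',
    0 or 3 mean the variable is irrelevant (ignored)."""
    degrees = [1.0]
    for v, a, (p, d) in zip(values, antecedents, params):
        if a == 1:
            degrees.append(low(v, p, d))
        elif a == 2:
            degrees.append(high(v, p, d))
    return min(degrees)

def malignancy(values, antecedents, params,
               benign_out=2.0, malign_out=4.0, else_weight=0.25):
    """singleton fuzzy system with weighted-average defuzzification;
    the benign rule has weight 1, the else condition weight 0.25."""
    mu = rule_activation(values, antecedents, params)
    num = 1.0 * mu * benign_out + else_weight * (1.0 - mu) * malign_out
    den = 1.0 * mu + else_weight * (1.0 - mu)
    return num / den

def diagnose(m):
    """threshold unit (figure 2): benign iff the malignancy value is <= 3."""
    return "benign" if m <= 3 else "malignant"
```

with all antecedents set to 'low' and low input values, the rule fires fully and the output sits at the benign singleton; with high input values the else condition dominates and the output moves to the malignant singleton.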
5.6 the evolutionary algorithm setup
we apply pittsburgh-style structure learning, using our algorithm to search for three parameters: the relevant variables, the input membership function values, and the antecedents of rules. they are constructed as follows:
• membership function parameters. there are nine variables (v1-v9), each with two parameters, p and d, defining the start point and the length of the membership function edges, respectively.
• antecedents. the ith rule has the form: if (v1 is a_1^i) and ... and (v9 is a_9^i) then (output is benign), where a_j^i represents the membership function applicable to variable vj. a_j^i can take the values: 1 (low), 2 (high), or 0 or 3 (other).
• relevant variables are searched for implicitly by letting the algorithm choose non-existent membership functions as valid antecedents; in such a case, the respective variable is considered irrelevant.

table 2: parameters encoding of a genome; total genome length is 54 + 18 = 72 bits
parameter | values | bits | quantity | total bits
p         | 1-8    | 3    | 9        | 27
d         | 1-8    | 3    | 9        | 27
a         | 0-3    | 2    | 9        | 18

the parameters encoded as described in table 2 form a single individual's genome. table 3 shows a sample genome. we used a genetic algorithm with a fixed population size of 200 individuals to evolve the fuzzy inference system, and fitness-proportionate selection. the algorithm terminates when the maximum number of generations is reached. an example of a genome for a rule system is depicted in table 3: the first 18 positions encode the parameters p and d for the nine variables v1-v9; the rest encode the membership function applicable to each of the nine antecedents of the rule. table 4 is an interpretation of the database and the rule base of the rule system encoded in table 3.
5.7 testing
the proposed algorithm has been tested on the wbcd problem. the three algorithms have been implemented and tested on the wisconsin database. the three algorithms have reached a valid classification ratio equal to 97.36%, i.e.
665 valid diagnosis cases out of 683 cases, and the results of the three algorithms are depicted in figure 4. it is clear that the hybrid algorithm reached the maximum classification ratio in earlier generations, before the ga and the ais; the ais also reached it before the ga.

table 3: database
p1 d1 p2 d2 p3 d3 p4 d4 p5 d5 p6 d6 p7 d7 p8 d8 p9 d9
3  5  4  1  2  8  5  1  7  7  2  5  5  5  7  2  4  7
a1 a2 a3 a4 a5 a6 a7 a8 a9
1  1  3  3  3  1  3  1  1

table 4: rule base
rule: if ((v1 is low) and (v2 is low) and (v6 is low) and (v8 is low) and (v9 is low)) then (output is benign)
default: else (output is malignant)

figure 4: the execution of the three algorithms
6 the pattern recognition problem
the learning and memory acquisition was verified through application to a binary character recognition problem. in this case, we assumed that the antigen population was represented by a set of ten binary characters (n = 10) to be learned. each character is represented by a bit string of length l = 121. the population size is 200. the original characters are depicted in figure 5. figure 6 illustrates the initial memory set. figure 7 illustrates the input patterns (i.e. the antigens) for which the learning will take place. figures 8, 9 and 10 represent the maturation of the memory set through 200 generations. the affinity here refers to the degree of matching of the antigens, i.e. the affinity measure is the hamming distance (discussed in section 4) between the antigens and antibodies. note that exact matching is not important for recognition; partial matching is enough. the hybrid algorithm converged at generation 200.
figure 5: the original digits
figure 6: the initial input patterns
figure 7: the input patterns (i.e. the antigens) for which the learning will take place
figure 8: application of the ga to the binary character recognition problem
figure 9: application of the ais to the binary character recognition problem
figure 10: application of the hybrid algorithm to the binary character recognition problem
figures 8, 9 and 10 present the application of the ga, the ais and the hybrid algorithm to the binary character recognition problem, respectively, where (a) presents the memory set after 50 cell generations, (b) after 100 cell generations, (c) after 150 cell generations and (d) after 200 cell generations. it is clear that the hybrid algorithm outperforms the other two algorithms. we have used static mutation values, also proportional to affinity, in the binary recognition problem because results showed that static mutation is better than a dynamic one. figure 11 represents the affinity of the ga, ais and hybrid algorithms in recognition of the pattern zero; note that the affinity refers to the degree of matching with the antigens, i.e. the hamming distance to the antigens.
figure 11: the affinity of ga, ais and the hybrid algorithms in recognition of the pattern zero
7 conclusions
in this research, an artificial immune system is combined with genetic algorithms in one hybrid algorithm. a modification is proposed to the clonal selection algorithm, which is inspired by the clonal selection principle and affinity maturation of the human immune response. the adaptability of the mutation rate is introduced by a simple degrading function. also, crossover is merged into the clonal selection algorithm, with two-point crossover applied after the mutation process, to increase the exploration of the landscape. the hybrid algorithm is combined with fuzzy logic and applied to the well-known wisconsin breast cancer diagnosis problem.
we claim that our evolved system exhibits two important characteristics: first, it attains high classification performance, with the possibility of attributing a confidence measure to the output diagnosis; second, the system has a simple fuzzy rule system and is therefore interpretable. the hybrid algorithm outperforms both the genetic algorithm and the artificial immune system, and reached the highest classification ratio, 97.36%, with only one rule, in earlier generations than the two other algorithms. the proposed system was also applied to a binary character recognition problem. the mutation in the hybrid algorithm was adapted using a degrading function so that the mutation decreases with time, but in the binary character recognition problem the results showed that it is better to keep the mutation value small and static through all generations. the hybrid algorithm outperforms the other two algorithms. the hybrid system can solve the wbcd problem with more than one fuzzy rule, so as to increase the classification accuracy. also, there are many gene representation techniques that can be used instead of the pittsburgh approach, such as the michigan approach, the iterative rule learning approach and hybridizations between them. this can be considered as future work. we claim that our hybrid algorithm is highly effective and better than gas and ais, though certainly not in all cases; future experiments may prove that gas or ais separately are better, but at least for memory acquisition, the wbcd and similar problems we claim that our algorithm is better.
bibliography
[1] a.p. engelbrecht, computational intelligence: an introduction, england, john wiley & sons, 2003.
[2] c.a. pena reyes, m.a. sipper, evolving fuzzy rules for breast cancer diagnosis, proc. nonlinear theory and applications, 2, pp. 369-372, 1998.
[3] c.a. pena reyes, m.a.
sipper, fuzzy-genetic approach to breast cancer diagnosis,artificial intelligence in medicine; vol: 17, num:2, 131-155, 1999. [4] c.j. merz, p.m. murphy, uci repository of machine learning database, http:/www.ics.uci.edu/m̃learn/mlrepository.html, 1996. [5] d. dasgupta , artificial immune systems and their applications, springer-verlag, inc., 1999. [6] d. dasgupta, n. attoh-okine, immunity-based systems, ieee international conference on systems, man, and cybernetics, orlando, florida, pp 363-374, october 12-15,1997. [7] d.a. coley, an introduction to genetic algorithms for scientists and engineers, world scientific publishing co.,inc., 2001. [8] e. gutuleac, descriptive timed membrane petri nets for modelling of parallel computing, international journal of computers, communications & control, vol. i, no. s: suppl. issue, pp. 256-261, 2006. [9] g. ciobanu, a programming perspective of the membrane systems,international journal of computers, communications & control, vol. i, no. s: suppl. issue, pp.13-22, 2006. [10] h. zhang, d. liu, fuzzy modeling and fuzzy control, birkhauser, 2006. [11] j. rennard, genetic algorithm viewer: demonstration of a genetic algorithm, http://www.rennard.org/alife/english/gavgb.pdf, 2000. [12] j.j. espinosa, j. vandewalle, constructing fuzzy models with linguistic integrity, ieee transactions on fuzzy systems; vol. 7, no. 4, pp. 377-393, 1999. [13] l.n. de castro, fundamentals of natural computing: basic concepts, algorithms, and applications, crc press llc; 2007. [14] l.n. de castro, f.j. zuben, artificial immune systems: part i – basic theory and applications, eec/unicamp, campinas, sp, tech. rep. – rt dca 01/99, p. 95. 1999. [15] l.n. de castro, f.j. zuben, learning and optimization using the clonal selection principle ,ieee transactions on evolutionary computation , vol.:6, num.:3, pp 239-251, jun, 2002. an immuno-genetic hybrid algorithm 385 [16] l.n. de castro, f.j. 
zuben, the clonal selection algorithm with engineering applications, artificial immune system workshop, genetic and evolutionary computation conference , a. s. wu (ed.), pp. 36-37, 2000. [17] l.n. de castro, j. timmis, artificial immune systems (a new computational approach) , springer verlag, 2002. [18] l.n. de castro, natural computing,information science and technology, idea group, inc., 2005. [19] r. setiono, extracting rules from pruned neural networks for breast cancer diagnosis,artificial intelligence in medicine, vol. 8, no. 1, pp. 37-51, feb. 1996. [20] r.r. yager, l.a. zadeh, fuzzy sets, neural networks, and soft computing, new york, van nostrand reinhold, 1994. [21] s. forrest, s.a. hofmeyrt, a. somayajit, architecture for an artificial immune system,evolutionary computing, vol. 8, no. 4, pp 443-473, 2000. [22] t. back, d. fogel, z. mechalewicz, glossary, evolutionary computation 1: basic algorithms and operators, institute of physics publishing, bristol and philadelphia, 2000. [23] t. back, the interaction of mutation rate, selection & self-adaptation within a genetic algorithm, in proc. 2nd int. conf. on parallel problem solving from nature, north-holland, amsterdam, pp. 85-94, 1992. [24] w. m. spears, adapting crossover in genetic algorithms, artificial intelligence center internal report aic-94-019, naval research laboratory, washington, dc 20375, 1994. emad nabil received his bsc and msc degrees in computer science from computers and information faculty, cairo university in 2004 and 2008 respectively; his main research interests are in the natural computing area including p systems, ga, ais and ann.now i am working in my ph.d. in p systems and their applications in optimization. amr badr currently he is an associate professor of computer science, computers and information faculty,cairo university. his research interests include computational intelligence, petri nets, bioinformatics and medical imaging. 
he has published about 50 journal research papers in these areas and supervised nearly 50 msc and phd students. he is currently a member of the editorial boards of, and a reviewer for, several journals. ibrahim farag is a professor of computer science at the faculty of computers and information, cairo university. he is the founder of the faculty of computers and information at cairo university and one of the pioneers of computer science in egypt. he has supervised more than 200 msc and phd students at cairo university. international journal of computers, communications & control vol. iii (2008), no. 1, pp. 90-102 a formal description of the systemic theory based e-learning gabriela moise abstract: this paper aims at presenting the systems theory-based approach to the learning (instructional) process. such an approach is necessary if one takes into consideration the complex and holistic nature of the learning process. the presented modeling is based on the definition of an automatic regulation system and on artificial intelligence techniques. for this purpose, an intelligent instructional system (iis) is defined using the concepts: inputs, outputs, perturbation factors, regulation, feedback and ai learning techniques. the contextual environment of learning plays the major role in this system and it establishes the perturbation factors. the regulation relies on ai learning techniques, so the iis is a flexible and adaptive system. the process' behaviour is described using pseudo-boolean and linear equations. this kind of approach enables solving the problems of the complexity and uncertainty of the learning (instructional) systems and, particularly, those of e-learning (online and offline). applications of the proposed approach may be found in e-learning courses for mathematics, computing, architecture, archaeology, cultural heritage, etc. keywords: online learning, system theory, feedback, intelligent instructional system.
copyright © 2006-2008 by ccc publications. 1 introduction the theoretical bases of instructional process design are formed by a set of theories such as the instruction and learning theory, the communication theory and the systems theory. the systems theory and systemic thinking have a major influence on the field of instructional design. the systemic approach to instructional process design was initiated by james finn [7], who applied the systems theory to educational technology. in 1978, walter dick and lou carey published the book entitled "the systematic design of instruction". the model built by dick and carey [5] does not follow a linear approach, each level of the instructional process being revised to make certain that the objectives of the instructional process are achieved. the model is presented in figure 1. figure 1: dick and carey's model. any learning (instructional) form may be tackled from the point of view of the general systems theory, distance learning forms included. a system is defined by a set of elements that interact and work together in order to achieve an objective. moore and kearsley [15] present the importance of the systemic approach to the distance educational process. the distance educational system has to be viewed as a system within which all the elements are interdependent and interconnected. the consequence of this fact is that any change of an element of the system will affect the whole instructional process. moore (1983) defined the concept of transactional distance, in which he rigorously presented the relation between the teacher and the student: the distance between teacher and student is a geographical, an educational and a psychological one [16]. saba [17] refined moore's theory and proved through experiments that the transactional distance is a measure of the relation existing between the teacher and the student.
saba asserts: "transactional distance varies by rate of dialogue and structure", so that when dialogue increases, the structure and the transactional distance decrease; when the structure increases, the transactional distance increases and the dialogue decreases (see figure 2). figure 2: stella modeling of the dynamics of saba's system. transactional distance(t) = transactional distance(t - dt) + (structure - dialogue) * dt. norbert wiener [20] introduced the cybernetic concept in "the human use of human beings": "when we desire a motion to follow a given pattern the difference between this pattern and the actually performed motion is used as a new input to cause the part regulated to move in such a way as to bring its motion closer to that given by the pattern." the notion of instructional system was introduced by robert glaser in 1962. the levels of the system defined by glaser are represented in figure 3. the individualized instruction is viewed as a continuous cycle of diagnosis, specification and evaluation. the output condition is defined as follows: the student has learned all the proposed objectives. figure 3: glaser's instructional model with feedback. banathy (1996) [2] included the educational system among the human activity systems (has). he defined the term "systems view of education": "we learn to think about education as a system, we can understand and describe it as a system, we can put the systems view into practice and apply it in educational inquiry, and we can design education so that it will manifest systemic behavior". the systemic approach of the educational system facilitates the exploration of the process according to its constituent subsystems: teacher, student, institution, administration, etc. banathy (1992) [1] proposed two phases of the systemic development of education, which are considered the principles of systems' design.
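saba's difference equation for the transactional distance (figure 2) can be iterated directly; the sketch below uses constant, illustrative dialogue and structure rates (an assumption), with the sign chosen to match the stated relation that dialogue reduces the distance and structure increases it:

```python
def transactional_distance(td0, dialogue, structure, dt=1.0, steps=4):
    """iterate saba's difference equation with constant (illustrative)
    dialogue and structure rates: more dialogue shrinks the transactional
    distance, more structure grows it."""
    history = [td0]
    for _ in range(steps):
        td0 = td0 + (structure - dialogue) * dt
        history.append(td0)
    return history

# high dialogue, low structure: the transactional distance shrinks
print(transactional_distance(10.0, dialogue=3.0, structure=1.0))
# → [10.0, 8.0, 6.0, 4.0, 2.0]
```

swapping the two rates reverses the trend, reproducing the qualitative behaviour of the stella model.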
(see figure 4) figure 4: the phases of the development of a systemic approach of education (banathy). in 1992, banathy [1] proposed three models of social systems, namely the systems-environmental model, the functions-structure model and the process-behaviour model. these models were used to describe the educational systems as "open, dynamic and complex social systems". the systems-environmental model enables one to see the learning (instructional) systems in the context of the relationships, interactions and interdependencies between the system and the environment. the functions-structure model enables one to describe the goals of the systems and the functions that have to be implemented in order to achieve these goals; it also defines the components of the system and the relations between them, thus defining the very structure of the system. the process-behaviour model enables one to understand the behaviour of the system: which are the inputs of the system and how these inputs are transformed to produce the expected output. also, banathy identified four domains: systems' philosophy, theory, methodology and application, constituting the conceptual system of systems inquiry in educational systems. a detailed report on the systems view of distance learning may be found in [6]. the e-courses have to be designed to incorporate active and cooperative learning techniques in a systematic way that addresses all aspects of the course: delivery, management and assessment [9]. in this paper, the author proposes a new model of the learning (instructional) process based on the systems theory. the model focuses on regulation, so that the system will produce the expected results. the behaviour of the system is described using a mathematical model, based on pseudo-boolean equations and linear equations, which was called a "quasi-boolean system". the concept of the "quasi-boolean system" is introduced to portray the behaviour of the e-learning system.
the genesis of the "quasi-boolean system" is the idea that students are evaluated at the end of a training session and get a grade: a number or a truth value (true or false). in the traditional learning (instructional) process, the teacher takes care of achieving the objectives of the learning (instructional) process. in the e-learning system, a combined regulation is used: a feedback regulator and a predictive regulator. the predictive regulator uses ai learning techniques (q-learning [13]). the remaining part of the paper is organized as follows: in section 2, a systemic model of the learning (instructional) process is presented; in section 3, a mathematical model is given; in section 4, an application is developed; in section 5, the complexity of the instructional (learning) process is discussed; and finally, in section 6, summary and conclusions, the advantages of using the proposed model are presented. 2 a systemic model of the learning (instructional) process the multidisciplinary nature of the systems theory facilitates the implementation of systems thinking in the learning (instructional) process [3]. in this paper, the learning (instructional) process is modeled starting from the definition of an automatic regulation system. definition 1. an automatic regulation system (s, r, a) is defined according to the schema presented in figure 5 [11]. figure 5: an automatic regulation system. y* is the reference value, y is the output value, v represents the perturbation factors, u is the input value and ε is the error between the reference value and the output value. definition 2. an intelligent instructional system (iis) with a feedback regulator is defined according to the schema in figure 6. figure 6: the intelligent instructional system. the instructional system is a system with an inverse (feedback) connection that adjusts its behaviour so as to nullify the error. definition 3.
an intelligent instructional system (iis) with a combined regulation is defined according to the schema presented in figure 7. figure 7: the intelligent instructional system. the input of the system represents the reference value: what the students have to achieve after the learning (instructional) process has finished. the state at step k (the system is sequential) is defined by the students' knowledge and skills at step k. the instruction context may refer to a social, an emotional, a mental, a school, a technological or a knowledge context [14]. in this approach, all the factors that affect the learning (motivation, goals, previous knowledge, interest, teaching styles, learning styles, classroom climate, parents, preoccupations, hobbies, etc.) represent the perturbations. definition 4. an intentional model of instruction is a structure of the form (com, con, cri), where com, con and cri have the following meanings: • com defines the behaviour; • con describes the conditions in which the students prove learning; • cri is a precise setting of acceptable standards and performance. the teacher has to rely on inferences in the learning process, starting from measurable pieces of evidence, since learning cannot be measured directly. the goals of learning are formulated according to the domains of learning: the cognitive, the affective and the psychomotor domain. the teacher and the instructional designer have the major role in defining the goals of learning. definition 5. the input values of the iis are defined by the goals of the instructional process, i.e. the intentional models of behaviour. the goals of a course describe the intention of the teacher with respect to the students who attend the course. the goals define what is expected from the students after the instructional process has finished.
y*_ref = (com*_ref, con*_ref, cri*_ref, grade*_ref), where • com*_ref defines the reference behaviour; • con*_ref describes the reference conditions in which the students prove learning; • cri*_ref is the setting of the reference standards and performance; • grade*_ref is the reference grade (the accepted grade). for example: y*_ref = (behaviour: write a program; conditions: using the c++ language, in 15 minutes; criteria: the program runs without errors and displays correct results; grade: 10), or, more compactly, y*_ref = (write a program; c++ language; 15 minutes; runs without errors, displays correct results; 10), that is: com*_ref = write a program; con*_ref = {language = c++, time = 15 min}; cri*_ref = {runs without errors, displays correct results}; grade*_ref = 10, meaning: at the end of the module, the student will be able to write a program that sums the first n integer numbers, using the c++ language, in 15 minutes; the program will run without errors, the result will be correct and the reference grade is 10. definition 6. the outputs of the iis system are defined as y = (y1, y2, ..., ym), where the yi are defined according to gagné [8]: • declarative knowledge; • procedural knowledge; • cognitive strategies; • attitudes; • psychomotor abilities. the outputs (the students' knowledge) are measured (y*) in a way that allows them to be compared with the reference values. in order to evaluate the outputs of the iis system, a measuring system (ms) is established, ms: y → m, where y is the set of possible outputs. the measuring system has to take into account the goals of the instruction (the reference value of the iis system). the behaviour of the student has to be evaluated in the conditions defined by the goals of the instruction and has to satisfy a set of criteria.
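the reference value, the measuring system and the corrective command of definitions 2-6 fit together in a simple loop; the toy sketch below is an assumption-laden illustration (the function names, the exact teaching rule and the evaluation stub are not from the paper):

```python
def regulate(reference, teach, evaluate, max_steps=10):
    """feedback loop of an iis: evaluate the students' knowledge,
    compare it with the reference value and re-teach while the
    error is not nullified."""
    knowledge = [0] * len(reference)
    for step in range(max_steps):
        measured = evaluate(knowledge)
        error = [r - m for r, m in zip(reference, measured)]
        if not any(error):
            return knowledge, step        # objectives achieved
        knowledge = teach(knowledge, error)
    return knowledge, max_steps

# toy plant: evaluation is exact and teaching fixes the first module in error
result = regulate(
    reference=[1, 1, 1],
    teach=lambda k, e: [int(ki or i == e.index(1)) for i, ki in enumerate(k)],
    evaluate=lambda k: list(k),
)
print(result)  # → ([1, 1, 1], 3)
```

the loop terminates as soon as the measured outputs match the reference, which is exactly the "nullify the error" condition of the feedback regulator.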
the measuring system evaluates the outputs of the system (what the student achieves in the evaluation phase) and produces feedback, so the measuring system establishes an output behaviour model. y* = (com*, con*, cri*, grade*), with com* = write a program; con* = {language = c++, time = 30 min}; cri* = {runs without errors, displays incorrect results}; grade* = 8, meaning: at the end of the module, the student has written a program to sum the first n integer numbers, using the c++ language, in 30 minutes; the program runs without errors, the result is incorrect and the grade is 8. the output of the iis system is evaluated according to the ms and has the value y* = ms(y) = (com, con, cri, grade). the error is defined as the difference between two models (patterns). to determine the difference between two behaviour patterns, a patterns' analyzer (defined through a collection of procedures) is used (see figure 8). figure 8: patterns' analyzer. the analysis of the behaviour patterns implies the following steps: 1. the criteria are decomposed into primitive criteria; 2. the primitive criteria are analyzed according to rule-based procedures; 3. the analysis procedures are updated according to learning techniques taken from artificial intelligence. in the example above: cri*_ref = {runs without errors, displays correct results}, while cri* = {runs without errors, displays incorrect results}. the criteria analysis establishes that the program written by the student has a logical error; the value of the error is: invalid logical program. definition 7. the perturbation factors are defined by the contextual environment of the learning (instructional) process. the contextual learning is built on all the conditions that affect the instructional process.
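steps 1 and 2 of the patterns' analyzer reduce to comparing primitive criteria one by one against their reference values; a simplified sketch (the dictionary encoding and the names are assumptions):

```python
def analyze_patterns(reference_criteria, measured_criteria):
    """decompose the criteria into primitive criteria (here: dictionary
    keys) and report every primitive that differs from its reference."""
    return [name for name, expected in reference_criteria.items()
            if measured_criteria.get(name) != expected]

# the worked example above: the program runs, but the results are wrong
ref = {"runs_without_errors": True, "correct_results": True}
obs = {"runs_without_errors": True, "correct_results": False}
print(analyze_patterns(ref, obs))  # → ['correct_results']
```

the returned list of failed primitives is the raw material from which the analyzer derives an error value such as "invalid logical program".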
cole and griffin have shown the complexity of this concept [4]. the predictive regulator of the iis system decides what kind of instructional techniques and information will be used in the learning process by analyzing the contextual factors. the predictive regulation embeds ai techniques into the instruction performed by the iis system. the predictive commands are rules with the following structure: r = (c1 ∧ c2 ∧ ... ∧ cn) → (a1; a2; ...; ak), where ci, i = 1, ..., n, are the conditions and ai, i = 1, ..., k, are the actions; in the iis system, the conditions are perturbation factors and they are inputs of the system. the feedback regulator of the iis system checks the value of the error and provides a corrective command. since the iis system is not a numeric system, its behaviour cannot be described using equations with real numbers alone. 3 a mathematical model of the instructional system definition 8. a vector of n elements, whose elements are real numbers and logical values, is called a quasi-boolean vector: x = (x1, x2, ..., xk, xk+1, ..., xn), where xi ∈ r for i = 1, ..., k and xi ∈ b2 for i = k+1, ..., n. the real part of x is denoted r(x) and the boolean part is denoted b(x); they are defined as r(x) = (x1, x2, ..., xk) and b(x) = (xk+1, xk+2, ..., xn). definition 9.
it is called a quasi-boolean sequential system a pair of functions (f = (f1, f2, f3), g) defined as: x_k^i = f_1^i(b(x_{k-1}), b(u_{k-1}), b(v_{k-1})) + f_2^i(r(x_{k-1}), r(u_{k-1}), r(v_{k-1})), i = 1, ..., n1; x_k^i = f_3^i(b(x_{k-1}), b(u_{k-1}), b(v_{k-1})), i = n1+1, ..., n; y_k^i = g_1^i(b(x_k), b(u_k)) + g_2^i(r(x_k), r(u_k)), i = 1, ..., p1; y_k^i = g_3^i(b(x_k), b(u_k)), i = p1+1, ..., p, where: x ∈ x = r^n1 × b2^n2, n1 + n2 = n, represents the set of states of the system; u ∈ u = r^m1 × b2^m2, m1 + m2 = m, represents the set of input signals of the system; v ∈ v = r^r1 × b2^r2, r1 + r2 = r, represents the set of perturbations of the system; y ∈ y = r^p1 × b2^p2, p1 + p2 = p, represents the set of output signals of the system; f_1^i, i = 1, ..., n1, are pseudo-boolean functions; f_2^i, i = 1, ..., n1, are linear functions; f_3^i, i = n1+1, ..., n, are boolean functions; g_1^i, i = 1, ..., p1, are pseudo-boolean functions; g_2^i, i = 1, ..., p1, are linear functions; g_3^i, i = p1+1, ..., p, are boolean functions. remarks: the signals of the quasi-boolean system are vectors whose elements are real numbers and boolean values. the complexity of the system is described through linear, pseudo-boolean and boolean functions. the informational state of the system is described using real numbers and truth values. any boolean function can be expressed using a pseudo-boolean function. in order to model the learning (instructional) process, integer numbers from an interval will be used instead of real numbers, so the instructional system becomes a finite-state system.
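to make definitions 8-9 concrete, the sketch below builds a toy quasi-boolean system whose real components follow a linear law and whose boolean components follow the pseudo-boolean law x + u - x·u (logical "or", the same law used in the examples that follow); the particular state semantics are assumptions:

```python
def real_part(x, k):
    """r(x): the first k components are real numbers."""
    return x[:k]

def boolean_part(x, k):
    """b(x): the remaining components are truth values (0/1)."""
    return x[k:]

def step(x, u, k):
    """one transition of a toy quasi-boolean sequential system:
    the real components follow a linear law, the boolean components
    the pseudo-boolean law x + u - x*u (logical 'or')."""
    real = [xi + ui for xi, ui in zip(real_part(x, k), real_part(u, k))]
    boolean = [xi + ui - xi * ui
               for xi, ui in zip(boolean_part(x, k), boolean_part(u, k))]
    return real + boolean

# state: one real component (accumulated study time, an assumed meaning)
# and two boolean ones (modules mastered)
x = [0.0, 0, 1]
x = step(x, [1.5, 1, 0], k=1)
print(x)  # → [1.5, 1, 1]
```

the mixed vector keeps numeric and logical information side by side, which is exactly what makes the system "quasi-boolean" rather than purely numeric or purely boolean.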
using the interpolation formula for pseudo-boolean functions [10], the equations of the system become: x_k^i = Σ_γ c_γ (x_{k-1}^{n1+1})^{α_{n1+1}} ··· (x_{k-1}^n)^{α_n} (u_{k-1}^{m1+1})^{α_{m1+1}} ··· (u_{k-1}^m)^{α_m} (v_{k-1}^{r1+1})^{α_{r1+1}} ··· (v_{k-1}^r)^{α_r} + f_2^i(r(x_{k-1}), r(u_{k-1}), r(v_{k-1})), i = 1, ..., n1, with c_γ = f_1^i(α_{n1+1}, ..., α_n, α_{m1+1}, ..., α_m, α_{r1+1}, ..., α_r); y_k^i = Σ_δ b_δ (x_k^{n1+1})^{δ_{n1+1}} ··· (x_k^n)^{δ_n} (u_k^{m1+1})^{δ_{m1+1}} ··· (u_k^m)^{δ_m} + g_2^i(r(x_k), r(u_k)), i = 1, ..., p1, with b_δ = g_1^i(δ_{n1+1}, ..., δ_n, δ_{m1+1}, ..., δ_m). the equations of the system can thus be expressed using linear polynomials depending on the states, the perturbations and the inputs of the system. if the pseudo-boolean functions are linear pseudo-boolean functions, they can be expressed as t(x) = c1 x1 + c2 x2 + ... + cn xn + d, where c1, c2, ..., cn, d are coefficients (in the system described in this paper, the coefficients are integer numbers). it is not the aim of this paper to give a full treatment of quasi-boolean systems; these systems (in particular cases) are observable and controllable, and regulation laws (feedback regulation and predictive regulation) may be defined for them. methods for solving linear pseudo-boolean equations can be found in [10]. example: one considers an online course with three modules. the initial state is x_0 = (0, 0, 0), i.e. nothing has been taught yet. the inputs (commands) of the system are: u_0 = (0, 0, 0), i.e. there is no teaching process; u_1 = (1, 0, 0), i.e. the system teaches module no. 1; u_2 = (0, 1, 0), i.e. the system teaches module no. 2; u_3 = (0, 0, 1), i.e. the system teaches module no. 3. the perturbations are: (0, 0), i.e. the electrical power is interrupted; (0, 1), i.e. the internet connection failed; (1, 0), i.e.
the computer failed. the equation of the system is: x_{k+1} = (x_k^1 v_k^1 v_k^2 + u_k^1 v_k^1 v_k^2 - x_k^1 u_k^1 v_k^1 v_k^2, x_k^2 v_k^1 v_k^2 + u_k^2 v_k^1 v_k^2 - x_k^2 u_k^2 v_k^1 v_k^2, x_k^3 v_k^1 v_k^2 + u_k^3 v_k^1 v_k^2 - x_k^3 u_k^3 v_k^1 v_k^2). the reference value is y*_ref = (1, 1, 1). if there is no perturbation, the evolution of the system is: x_1 = (0, 0, 0), x_2 = (1, 0, 0), x_3 = (1, 1, 0), x_4 = (1, 1, 1). the regulator compares the obtained value (1, 1, 1) with the reference value (1, 1, 1). if there is a difference between them (considering a perturbation), the output value could be, for instance, (0, 1, 1). the error formula is: for y*_ref = (y_ref^1, y_ref^2, y_ref^3) and y* = (y*^1, y*^2, y*^3), e = (y_ref^1 ⊕ y*^1, y_ref^2 ⊕ y*^2, y_ref^3 ⊕ y*^3). the patterns' analyzer described in section 2 is expressed as a boolean equation. in the former case, the error is e = (0, 0, 0) and the process finished successfully. in the latter case, the error is e = (1, 0, 0) and the feedback regulation orders the system to teach the first module again. 4 an application example the objective of the learning (instructional) process is: at the end of the teaching session, the students will be able to design a simple web page using the html language, and the web page will be written correctly. in order to achieve this goal, the system has to teach: (1) module no. 1: the structure of web pages; (2) module no. 2: the <html>, <head>, <title>, <body> and <p> tags; (3) module no. 3: applications. for each module, the learning (instructional) system has a set of pedagogical resources. the reference value is y*_ref = (1, 1, 1, 10), where the first 1 (true) means that the student has written a web page, the second 1 (true) means that the student has used the html language, and the third 1 (true) means that the structure of the web page is correct. the perturbations considered are defined by the learning styles (visual, auditive, kinesthetic) and the age categories (three categories): (1, 0, 0, 1, 0, 0), i.e. learning style = visual and age category = 1; (0, 1, 0, 1, 0, 0), i.e.
learning style = auditive and age category = 1; (0, 0, 1, 1, 0, 0), i.e. learning style = kinesthetic and age category = 1; (1, 0, 0, 0, 1, 0), i.e. learning style = visual and age category = 2; (0, 1, 0, 0, 1, 0), i.e. learning style = auditive and age category = 2; (0, 0, 1, 0, 1, 0), i.e. learning style = kinesthetic and age category = 2; (1, 0, 0, 0, 0, 1), i.e. learning style = visual and age category = 3; (0, 1, 0, 0, 0, 1), i.e. learning style = auditive and age category = 3; (0, 0, 1, 0, 0, 1), i.e. learning style = kinesthetic and age category = 3. the instructional system has to teach three modules; for each module there is a set of pedagogical resources (denoted prk = pedagogical resource with identification k). the commands of the system are: (0, 0, 0, 0), i.e. invoke a virtual meeting between the teacher and the student; (1, 0, 0, pr1), i.e. teach module no. 1 using pedagogical resource 1; (1, 0, 0, pr2), i.e. teach module no. 1 using pedagogical resource 2; (1, 0, 0, pr3), i.e. teach module no. 1 using pedagogical resource 3; (1, 0, 0, pr4), i.e. teach module no. 1 using pedagogical resource 4; (1, 0, 0, pr5), i.e. teach module no. 1 using pedagogical resource 5; (0, 1, 0, pr6), i.e. teach module no. 2 using pedagogical resource 6; (0, 1, 0, pr7), i.e. teach module no. 2 using pedagogical resource 7; (0, 1, 0, pr8), i.e. teach module no. 2 using pedagogical resource 8; (0, 0, 1, pr9), i.e. teach module no. 3 using pedagogical resource 9; (0, 0, 1, pr10), i.e. teach module no. 3 using pedagogical resource 10; (0, 0, 1, pr11), i.e. teach module no. 3 using pedagogical resource 11; (0, 0, 1, pr12), i.e. teach module no.
3 using pedagogical resource 12. so, there are five pedagogical resources for module no. 1, three for module no. 2 and four for module no. 3. the equations of the system are: x_k^1 = x_{k-1}^1 + u_{k-1}^1 - x_{k-1}^1 u_{k-1}^1; x_k^2 = x_{k-1}^2 + u_{k-1}^2 - x_{k-1}^2 u_{k-1}^2; x_k^3 = x_{k-1}^3 + u_{k-1}^3 - x_{k-1}^3 u_{k-1}^3; x_k^4 = x_{k-1}^4 + u_{k-1}^1 u_{k-1}^4; x_k^5 = x_{k-1}^5 + u_{k-1}^2 u_{k-1}^4; x_k^6 = x_{k-1}^6 + u_{k-1}^3 u_{k-1}^4, where u^4 is the identifier of the pedagogical resource carried by the command. a state of the form (1, 1, 0, 4, 7, 0) means that the student has followed modules no. 1 and 2 using the pedagogical resources no. 4 and 7. the output is: y = (x_k^1 x_k^2 x_k^3, x_k^1 x_k^2 x_k^3, x_k^1 x_k^2 x_k^3, α x_k^1 x_k^2 x_k^3 + β x_k^1 x_k^2 x_k^3 + γ x_k^1 x_k^2 x_k^3), where α + β + γ = 10 are given coefficients. this means that a student may obtain the grade 10 at the evaluation if the behaviour has the value 1, the conditions have the value 1 and the criteria have the value 1. a scenario of the system's behaviour is the following: a new student has to learn web pages. the student has a learning style and an age category. the predictive controller has learned which pedagogical resources fit the profile of the student; this can be realized using ai learning techniques [13] or the algorithm proposed in [18]. the error is: e = y*_ref^4 - y*^4 + α (y*_ref^1 ⊕ y*^1) + β (y*_ref^2 ⊕ y*^2) + γ (y*_ref^3 ⊕ y*^3). if the error is too big, the feedback controller provides a command to invoke a virtual meeting between the teacher and the student. 5 problems of the online instructional systems the major problems connected with online instructional systems are posed by their complexity and uncertainty. generally speaking, uncertainty refers to the impossibility of exact prediction. uncertainty is a feature of human behaviour. the human factor is a part of the online instructional systems and the instructional process acts on humans (figure 9).
so, an online instructional system has to be viewed from the point of view of its complexity and uncertainty. the uncertainty of the system means that the inputs of the system do not determine its outputs exactly. figure 9: inputs and outputs of the instructional process. the outputs of the system have to satisfy the knowledge standards set according to the objectives of the instructional process. the authors of the paper "defining uncertainty: a conceptual basis for uncertainty management in model-based decision support" [19] define the uncertainty concept taking into consideration three dimensions: • the location of the uncertainty; • the level of the uncertainty; • the nature of the uncertainty (the source of the uncertainty: the imperfection of the knowledge, the diversity of the values, etc.). at a macro level, two sources of uncertainty may be distinguished: 1. the variability; 2. the limits of the knowledge. the instructional systems are developing from the new scientific perspective of the analysis of their uncertainty. even if we consider an instructional system whose perturbation factors have null values, the outputs of the system cannot be predicted with complete accuracy. the instructional process is unpredictable and cannot be reduced to linear sequences that produce predictable outputs. considering the perturbation factors and the complexity of the relations between the parts of the system, and between the system and the environment, the instructional system is a complex system. to reach a reference value, the instructional system has to be designed as a complex adaptive system. a complex adaptive system (cas) is a system with many internal and external relations; the system can adapt itself and evolve according to changes in the environment. it is important to see that there is no precise boundary between the system and the environment. the system influences and changes the environment.
an instructional system is linked with other systems, so that changes have to be considered in terms of a co-evolution with those systems. an instructional system has to be designed from the viewpoint of both its internal and its external variables. the system records information about the environment; this information determines the behaviour of the system; the system transmits information to the environment and acts in accordance with it. another viewpoint on the instructional process is the hermeneutic one, in which learning is defined as an act of interpretation, inseparable from the cultural and historical context. jonassen [12] states that instructional design theory has to consider learning as an open system, which receives inputs from many sources, such as individual differences, emotional states, social and economic factors, demographic factors and so forth. instructional designers have to understand the complex and holistic nature of the learning process and must not reduce this kind of systems to closed systems. 6 summary and conclusions from the perspective of the evolution of the whole society, it is necessary to approach the instructional process from the viewpoint of the systems theory. this perspective enables solving the problems of the instructional process, especially considering the distance learning process. practically, the distance instructional process, in particular the online learning process, can be successfully implemented only by using techniques from systems theory and artificial intelligence. in this paper, the author has modeled the learning (instructional) process using the general systems theory. the term "quasi-boolean system" was introduced in order to provide a mathematical model describing the behaviour of the instructional system, and ai techniques are used in the regulation of the system.
Possible applications of the proposed approach may be found in instructional systems for teaching mathematics, computer science, architecture [21], physics, chemistry, etc. This kind of approach makes it possible to address the complexity and uncertainty of instructional systems, especially online instructional systems.

Bibliography

[1] B. H. Banathy, A Systems View of Education: Concepts and Principles for Effective Practice, Englewood Cliffs, NJ: Educational Technology Publications, 1992.
[2] B. H. Banathy, Systems inquiry and its application in education, in D. H. Jonassen (ed), Handbook of Research for Educational Communication and Technology, New York: Macmillan Library Reference, 1996.
[3] D. Chen and W. Stroup, Toward a conceptual framework for science and technology education for all, Journal of Science Education and Technology, Vol. 2, No. 3, 1993.
[4] M. Cole, P. Griffin, Contextual Factors in Education. Information Analyses Report prepared for the Committee on Research in Mathematics, Science, and Technology Education, Commission on Behavioral and Social Sciences and Education, National Research Council. Wisconsin Center for Education Research, Madison, 1987.
[5] W. Dick, L. Carey, The Systematic Design of Instruction, Glenview, IL: Scott, Foresman, 1978.
[6] R. R. Du Mont, Distance Learning: A Systems View. An Assessment and Review of the Literature, research report, grant awarded by the Ohio Learning Network in cooperation with the Research Center for Educational Technology, Kent State University, 2002.
[7] J. D. Finn, Automation and education: I. General aspects, AV Communication Review, 5(1), 1957.
[8] R. Gagné, L. Briggs, W. W. Wager, Principles of Instructional Design (3rd ed.), New York, NY: Holt, Rinehart and Winston, Inc., 1988.
[9] G.
Gonzalez, A systematic approach to active and cooperative learning in CS1 and its effects on CS2, Proceedings of the 37th SIGCSE Technical Symposium on Computer Science Education, 2006.
[10] P. L. Hammer (Ivanescu), S. Rudeanu, Méthodes booléennes en recherche opérationnelle, Dunod, Paris, 1970.
[11] C. Ilas, Teoria sistemelor de reglare automata, Matrix Rom, Bucuresti, 2006.
[12] D. H. Jonassen et al., Certainty, determinism, and predictability in theories of instructional design: lessons from science, Educational Technology, 37(1), 1997.
[13] G. Moise, A software system for online learning applied in the field of computer science, International Journal of Computers, Communications & Control, Vol. II, No. 1, 2007.
[14] G. Moise, A rules based on context methodology to build the pedagogical resources, Proceedings of the 2nd International Conference on Virtual Learning, Bucharest University Press, 2007.
[15] M. G. Moore and G. Kearsley, Distance Education: A Systems View, Belmont, CA: Wadsworth Publishing Co., 1996.
[16] M. G. Moore, Theory of transactional distance, in D. Keegan (ed.), Theoretical Principles of Distance Education, New York: Routledge, 1993.
[17] F. Saba, R. L. Shearer, Verifying key theoretical concepts in a dynamic model of distance education, The American Journal of Distance Education, 8(1), 1994.
[18] A. D. Styliadis, I. D. Karamitsos, D. I. Zachariou, Personalized e-learning implementation: the GIS case, International Journal of Computers, Communications & Control, Vol. I, No. 1, 2006.
[19] W. E. Walker, P. Harremoëes, J. Rotmans, J. P. van der Sluijs, M. B. A. van Asselt, P. Janssen and M. P. Krayer von Krauss, Defining uncertainty: a conceptual basis for uncertainty management in model-based decision support, Integrated Assessment, Vol. 4, No. 1, 2003.
[20] N. Wiener, The Human Use of Human Beings: Cybernetics and Society, second edition revised, Doubleday Anchor, 1954.
[21] A. D.
Styliadis, E-learning documentation of historical living systems with 3D modeling functionality, Informatica, Vol. 18, No. 3, pp. 419-446, 2007.

Gabriela Moise
Petroleum-Gas University of Ploiesti
Computer Science Department
No. 39 Bd. Bucuresti, Ploiesti, Romania
E-mail: gmoise@upg-ploiesti.ro

Received: February 13, 2007

Gabriela Moise (born on February 13, 1969) graduated from the Faculty of Mathematics, specialization Informatics, of the University of Bucharest. She worked in the software industry, developing IT solutions. Since 2003 she has been a lecturer at the Petroleum-Gas University of Ploiesti. Her research fields are: e-learning, graph theory, pedagogical agents, knowledge representation, e-health. She has (co)authored seven books and more than twenty research papers, and has participated in many international conferences in the e-learning and e-business area.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 2, pp. 7-14

Development of a MATLAB® Toolbox for the Design of Grey-Box Neural Models

Gonzalo Acuña, Erika Pinto

Abstract: A MATLAB® toolbox is developed for the design, construction and validation of grey-box neural network models. This toolbox, available at www.diinf.usach.cl/gacuna, has been tested in simulations of a continuously stirred tank reactor (CSTR) process. The grey-box model performs well on validation data with 5% additive Gaussian noise, for both one-step-ahead (OSA) and model-predictive-output (MPO) estimations.

Keywords: grey-box model, neural networks, one-step-ahead estimation, model predictive output, time-variant parameter identification.

1 Introduction

In the development of dynamic system models it is best to take advantage of the a priori knowledge of a process, generally expressed in terms of a set of ordinary differential equations that represent mass or energy balances. In complex biotechnological processes, the most difficult task is the modeling of the time-varying parameters, such as specific kinetics.
In order to address this problem, [1] proposed the use of grey-box models, which combine a priori knowledge, expressed in terms of a phenomenological (white-box) model, with a black-box model such as a neural network. These models have proved satisfactory for dynamic systems: they have better generalization characteristics, and they can be identified from a smaller amount of data [1]. [2] classified grey-box models into two principal categories: those which deliver intermediate values (of parameters or variables) for use in phenomenological models (serial grey-box models), and those placed in parallel with the dynamic model and adjusted to compensate for modeling errors (parallel grey-box models). [3] showed that the serial strategy resulted in grey-box models with superior results. More recently, [4] and [5] have employed and analyzed this type of model, demonstrating its performance and its use in complex processes. MATLAB® is well known as a completely integrated application development environment, oriented to projects that involve complex mathematical calculations and graphic visualizations. This software offers a large variety of toolboxes, specialized packages that carry out different functions according to the area of development, for example optimization, image processing, neural networks, simulation and statistics, among others. Thus, the present work deals with the creation of a group of functions integrated in a MATLAB® toolbox that allows the development of grey-box neural models (GBNMs) for complex systems in general. The present document is organized as follows: Section 2 details some aspects of the GBNMs used; Section 3 examines the design and construction of the toolbox; Section 4 looks at applications of the toolbox; and Section 5 outlines the conclusions reached.
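The serial/parallel distinction can be sketched in a few lines of code. The sketch below is illustrative only (Python rather than MATLAB®, with hypothetical toy stand-ins for the white-box and black-box models; none of these names come from the toolbox itself): a serial grey-box feeds the black box's estimate of an intermediate parameter into the phenomenological model, while a parallel grey-box adds a black-box correction to the phenomenological prediction.

```python
def serial_greybox(white_box, black_box):
    """Serial structure: the black box supplies an intermediate
    parameter that the phenomenological model consumes."""
    return lambda x: white_box(x, black_box(x))

def parallel_greybox(white_box, black_box):
    """Parallel structure: the black box is adjusted to compensate
    for the modeling error of the phenomenological model."""
    return lambda x: white_box(x) + black_box(x)

# Toy stand-ins, purely illustrative (not from the toolbox):
serial = serial_greybox(lambda x, p: p * x,   # white box needs a parameter p
                        lambda x: 2.0)        # black box estimates p
parallel = parallel_greybox(lambda x: x,      # incomplete white box
                            lambda x: 0.5)    # black box corrects the residual
```

Under this composition, the serial model's quality hinges on the black box estimating the intermediate parameter well, which is why the toolbox described below trains that network first.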
2 Grey-box neural models

As mentioned previously, GBNMs take advantage of the combination of the a priori knowledge of a given process, expressed in terms of a set of differential equations representing the first principles that govern the process, with neural networks. The latter are responsible for modeling the interaction between the variables that are relevant to the system and certain time-varying parameters. It is a well-established fact that neural networks are capable of approximating non-linear functions. In particular, it has been demonstrated that perceptrons with only one hidden layer and an adequate number of neurons in that layer are universal approximators [6]. For the purposes of the present work it is important to distinguish between two training modes for the neural networks included in GBNMs. The first, also known as the direct learning mode [5], uses the error generated at the output of the neural network itself to determine its weights (Figure 1: grey-box neural model in its direct learning mode). The second corresponds to an indirect mode, in which the error generated at the output of the whole GBNM is used to train the neural network [5] (Figure 2: grey-box neural model in its indirect learning mode). The toolbox developed in the present work is based on the direct learning mode. The neural networks used are multi-layer perceptrons with a single hidden layer, trained by error backpropagation combined with Levenberg-Marquardt optimization. The results obtained are validated with tests that evaluate the error produced when using the GBNM for one-step-ahead (OSA) prediction and for model predictive output (MPO) prediction [7].

Copyright © 2006 by CCC Publications
Three error indices are used: the root mean square (RMS) error, the relative standard deviation (RSD) and the adequacy index (IA), defined as follows [7]:

RMS = sqrt( Σ_{i=1..N} (O_i − P_i)² / Σ_{i=1..N} O_i² )

RSD = sqrt( Σ_{i=1..N} (O_i − P_i)² / N )

IA = 1 − Σ_{i=1..N} (O_i − P_i)² / Σ_{i=1..N} (|O′_i| + |P′_i|)²

where O_i and P_i are the observed and predicted values at time i, N is the total number of data, and P′_i = P_i − O_m and O′_i = O_i − O_m, with O_m the mean value of the observations.

3 Development of the MATLAB® toolbox

This toolbox consists of two principal parts: a neural network part and a serial grey-box model part, with either OSA or MPO estimates. The methodology used corresponds to the modified cascade model, which has the following stages:

• definition of requirements;
• system design;
• implementation and testing of units;
• integration and testing of the system.

Details of the methodology can be found in [8].

3.1 Neural network

This is the first part of the toolbox. Its principal function is to create and train the network that the serial grey-box model uses for estimating the unknown parameter or parameters, which are then combined with the phenomenological part of the model. For this reason it is necessary to run this part of the program before running the serial grey-box model. The principal functions of the neural network part are the following:

• Verify the existence of data: verifies the existence of the data necessary for the functioning of the neural network.
• Verify dimensions: verifies that the quantities of input and output data are equal.
• Verify layers: verifies that the total number of neurons in the hidden layers is correct.
• Divide data: divides the input and output data entered by the user into training and validation data (70% and 30% respectively).
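As a concrete check of these definitions, the three indices can be computed directly from paired observed/predicted series. This is a sketch in Python rather than MATLAB® (the function name is ours, not the toolbox's), using only the formulas above and taking O_m as the mean of the observations:

```python
import math

def error_indices(observed, predicted):
    """RMS, RSD and IA indices as defined in the text.

    observed, predicted : equal-length sequences of floats.
    Returns the tuple (rms, rsd, ia).
    """
    n = len(observed)
    sq_err = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    # Relative root mean square error.
    rms = math.sqrt(sq_err / sum(o ** 2 for o in observed))
    # Relative standard deviation of the prediction error.
    rsd = math.sqrt(sq_err / n)
    om = sum(observed) / n  # mean of the observations
    # Adequacy index: 1 for a perfect prediction.
    ia = 1.0 - sq_err / sum((abs(o - om) + abs(p - om)) ** 2
                            for o, p in zip(observed, predicted))
    return rms, rsd, ia
```

For a perfect prediction these give RMS = RSD = 0 and IA = 1, consistent with the acceptability thresholds (RSD < 0.1, RMS < 0.1, IA > 0.9) applied to the results in Section 4.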
• Create and train the network: creates and trains the backpropagation neural network from the data input by the user. This is carried out using the MATLAB® Neural Network Toolbox functions; the network is created with the "trainlm" training function, which applies the Levenberg-Marquardt method for optimizing the weights.
• Simulate network outputs: simulates the network outputs for the purpose of validation.
• Graph network inputs: allows visualization of the behaviour of the entered inputs.

3.2 Serial grey-box model

This is the second part of the toolbox, which allows the development of general serial grey-box models with either OSA or MPO estimations. The principal functions of the serial grey-box model, applied to either OSA or MPO estimations, are the following:

• Verify the existence of the files: verifies the existence of the files necessary for the functioning of the serial grey-box model.
• Verify the existence of data: among these data are the number of iterations, the initial values of the state vector, the value of Δt and, where applicable, the input u and the real values of the state-vector variables for each instant of time used.
• Verify dimensions: verifies that the dimension of the input vector is the same as the number of iterations. For OSA estimation, the quantity of real values of the state variables must be equal to the number of iterations; for MPO estimation the toolbox only returns the results for the output of the system and cannot evaluate that output against real data.
• Develop the model: develops the grey-box model, estimating the unknown parameter or parameters with the previously created and trained neural network and evaluating the various equations entered.
• Return graphs: graphs the inputs of the system, each component of the state vector over the total number of iterations, the validation of the unknown parameter or parameters, and the output of the system together with its validation.
• Calculate the error indices: calculates RSD, RMS and IA for the output.
• Display results: returns the last iteration of both the state vector and the output vector, together with the error indices corresponding to the unknown parameter(s) or the output(s) of the system.

4 Applications and results

4.1 Description of the process

The simulation concerns a CSTR process with a first-order exothermic reaction [9], in which the input to the system corresponds to the temperature of the cooling sleeve and the system output corresponds to the degree of completion of the reaction. The system is described by the following state equations:

x′1 = −x1 + Da·(1 − x1)·e^( x2 / (1 + x2/γ) )    (4.1)
x′2 = −x2 + B·Da·(1 − x1)·e^( x2 / (1 + x2/γ) ) + β·(u − x2)    (4.2)
y = x1    (4.3)

where x1 is the degree of completion of the reaction, x2 is the adimensional temperature of the reactor contents, u is the input corresponding to the adimensional flow rate of the heat-transfer fluid through the cooling sleeve, and y is the output of the system.
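The state equations above can be iterated numerically with a simple Euler step, the same scheme the paper applies when discretizing the model over the interval between t and t+1. The following sketch is in Python rather than MATLAB® and is not part of the toolbox; it uses the constant values given in the text (Da = 0.072, B = 8.0, β = 0.3, γ = 20.0), and the exponential factor is the parameter ρ that the serial grey-box model would instead obtain from the trained neural network:

```python
import math

# Model constants from the text.
DA, B, BETA, GAMMA = 0.072, 8.0, 0.3, 20.0

def rho(x2):
    """Hard-to-measure parameter; in the serial grey-box model a trained
    neural network supplies this value instead of the exact expression."""
    return math.exp(x2 / (1.0 + x2 / GAMMA))

def euler_step(x1, x2, u, dt):
    """One Euler step of the discretized state equations."""
    r = rho(x2)
    x1_next = x1 + (-x1 + DA * (1.0 - x1) * r) * dt
    x2_next = x2 + (-x2 + B * DA * (1.0 - x1) * r + BETA * (u - x2)) * dt
    return x1_next, x2_next

def simulate(x1, x2, u, dt, steps):
    """Iterate the model for a constant input u; output y = x1."""
    ys = []
    for _ in range(steps):
        x1, x2 = euler_step(x1, x2, u, dt)
        ys.append(x1)
    return ys
```

Replacing the call to rho() with a neural-network prediction turns this white-box loop into the serial grey-box model the toolbox builds.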
For the purposes of this simulation, the following values are used for the model's constants: Da = 0.072; B = 8.0; β = 0.3; γ = 20.0. Grey-box models are made up of two parts: the phenomenological model, represented by the differential equations, and the empirical model, represented by the neural network. The equations that represent the phenomenological part of the serial grey-box model are the following:

x′1 = −x1 + 0.072·(1 − x1)·ρ    (4.4)
x′2 = −x2 + 8·0.072·(1 − x1)·ρ + 0.3·(u − x2)    (4.5)
y = x1    (4.6)

where ρ is the parameter that is difficult to obtain:

ρ = e^( x2 / (1 + x2/20) )

When equations 4.4 through 4.6 are discretized over the interval between t and t+1, the following equations are obtained:

x1(t+1) = x1(t) + (−x1(t) + 0.072·(1 − x1(t))·ρ)·Δt    (4.7)
x2(t+1) = x2(t) + (−x2(t) + 8·0.072·(1 − x1(t))·ρ + 0.3·(u − x2(t)))·Δt    (4.8)

Thus the equations that constitute the phenomenological part of the serial grey-box model, with either OSA or MPO estimations, are:

x1(t+1) = x1(t) + (−x1(t) + 0.072·(1 − x1(t))·ρ)·Δt    (4.9)
x2(t+1) = x2(t) + (−x2(t) + 8·0.072·(1 − x1(t))·ρ + 0.3·(u − x2(t)))·Δt    (4.10)
y = x1(t+1)    (4.11)

4.2 OSA estimation with 5% noise

The validation obtained for the unknown parameter with 5% noise can be seen in Figure 3 (validation of the unknown parameter with OSA estimation and 5% noise). The validation obtained for the output of the system can be seen in Figure 4 (validation of the output of the system with OSA estimation and 5% noise). The error indices obtained from the serial grey-box model with OSA estimation and 5% noise, for the parameter ρ and the output of the system, are presented in Table 4.1.

Table 4.1. Error indices for the parameter ρ and the output of the system with OSA estimation and 5% noise.
        parameter ρ    system output
RSD     4.368098e-2    2.756748e-3
RMS     1.733075e-2    1.944067e-2
IA      9.111975e-1    9.846065e-1

The error indices obtained are quite acceptable, given that the acceptability thresholds are RSD < 0.1, RMS < 0.1 and IA > 0.9, as explained in Section 2. This indicates that the serial grey-box model developed for the simulation of a CSTR process, with OSA estimation and 5% noise, is quite good.

4.3 MPO estimation with 5% noise

The validation obtained for the unknown parameter with 5% noise can be seen in Figure 5 (validation of the unknown parameter with MPO estimation and 5% noise). The validation obtained for the output of the system can be seen in Figure 6 (validation of the output of the system with MPO estimation and 5% noise). The error indices obtained from the serial grey-box model with MPO estimation and 5% noise, for the parameter ρ and the output of the system, are presented in Table 4.2.

Table 4.2. Error indices for the parameter ρ and the output of the system with MPO estimation and 5% noise.

        parameter ρ    system output
RSD     4.368098e-2    3.441815e-3
RMS     1.733075e-2    2.427178e-2
IA      9.111975e-1    9.764686e-1

The error indices obtained are quite acceptable, given that the acceptability thresholds are RSD < 0.1, RMS < 0.1 and IA > 0.9. This indicates that the serial grey-box model developed for the simulation of a CSTR process, with MPO estimation and 5% noise, is quite good.

5 Conclusions

Grey-box models constitute a real alternative for those real-world processes for which the available a priori knowledge is incomplete, for example a variety of industrial processes.
Since in grey-box models only some of the physical and/or chemical laws that represent the process are known, and there are unknown parameters that must somehow be estimated, multi-layer perceptron neural networks have been employed for their notable capacity to approximate complex functions on the basis of observed data. One advantage of this toolbox, with regard to the creation and training of neural networks, is that the neuron transfer functions of each of the hidden and output layers can be specified by the user, who can then create and train the neural network in a manner suited to the specific problem to be modeled. Concerning the estimation of unknown system parameters, the toolbox can estimate as many parameters as necessary, as long as the necessary data are available. The developed toolbox (available at www.diinf.usach.cl/gacuna) allows the creation of serial grey-box models with either OSA (one-step-ahead) or MPO (model predictive output) estimations, depending on what is to be modeled. With the creation of this MATLAB® toolbox, the end user has access to a simple, trustworthy and fast tool that allows the development and subsequent manipulation of different serial grey-box models, with either OSA or MPO estimations, for solving a particular problem. This tool will become part of the MATLAB® toolboxes, such that users can access it at will simply by downloading MATLAB®. Thus this new tool becomes a real and simple alternative for modeling those real-world processes for which the available a priori knowledge is incomplete. The error indices obtained are good, both for the unknown parameters and for each of the outputs of the system, even when the level of noise applied to the various input data was 5%, which indicates that the models developed are of good quality.

Acknowledgements

The authors wish to thank the partial financing provided by FONDECYT project 1040208.
References

[1] Psichogios, D.; Ungar, L. (1992). A hybrid neural network-first principles approach to process modeling, Computers & Chemical Engineering, 38(10): 1499-1511.
[2] Thompson, M.; Kramer, M. (1994). Modeling chemical processes using prior knowledge and neural networks, Computers & Chemical Engineering, 40(8): 1328-1340.
[3] van Can, H.; Hellinga, C.; Luyben, K.; Heijnen, J. (1996). Strategy for dynamic process modeling based on neural networks in macroscopic balances, AIChE Journal, 42: 3403-3418.
[4] Thibault, J.; Acuña, G.; Pérez-Correa, R.; Jorquera, H.; Molin, P.; Agosin, E. (2000). A hybrid representation approach for modelling complex dynamic bioprocesses, Bioprocess Engineering, 22(6): 547-556.
[5] Acuña, G.; Cubillos, F.; Thibault, J.; Latrille, E. (1999). Comparison of methods for training grey-box neural network models, Computers & Chemical Engineering Supplement, 23: 561-564.
[6] Hornik, K.; Stinchcombe, M.; White, H. (1989). Multilayer feedforward networks are universal approximators, Neural Networks, 2: 359-366.
[7] Billings, S. A.; Jamaluddin, H. B.; Chen, S. (1992). Properties of neural networks with applications to modelling non-linear dynamical systems, Int. J. Control, 55(1): 193-224.
[8] Pinto Armijo, Erika (2004). A MATLAB based application for developing grey-box models, Memoria de Título de Ingeniería Civil Informática, Universidad de Santiago de Chile (in Spanish).
[9] Hernández, E.; Arkun, Y. (1992). Study of the control-relevant properties of backpropagation neural network models of nonlinear dynamical systems, Computers & Chemical Engineering, 16(4): 227-240.

Gonzalo Acuña, Erika Pinto
Universidad de Santiago de Chile
Departamento de Ingeniería Informática
Avda. Ecuador No 3659, Casilla 10233, Santiago, Chile
E-mail: gacuna@usach.cl

International Journal of Computers, Communications & Control, Vol. II (2007), No. 1, pp.
74-83

A Methodology for Providing Individualised Computer-Generated Feedback to Students

Michael Lambiris

Abstract: The traditional way of providing feedback to students after tests or assignments is labour-intensive. This paper explains the concepts and techniques used by the author to build computer-based applications that analyse students' answers and generate individualised, detailed and constructive feedback. The paper explains how the data gathered from a student's answers can be combined with other knowledge about the subject matter being taught, and about the specific test questions, to create computerised routines that evaluate the individual student's performance. This information can be presented in ways that help students assess their progress, both in relation to their acquired knowledge in specified areas of study and with regard to their ability to exercise relevant skills. In this way, appropriate feedback can be provided to large numbers of students quickly and efficiently. The same techniques can be used to provide information to the instructor about the performance of the group as a whole, with a degree of detail and accuracy that exceeds the impressions usually gained through traditional marking. The paper also explains the role of the subject instructor in designing and creating feedback-generating applications. The methodologies described provide insight into the details of the process and are a useful basis for further experimentation and development.

Keywords: teaching technology, computer-generated feedback, methodology and design, teaching large classes

1 Difficulties with providing good feedback

It is widely recognised by educators that detailed, constructive, prompt and individualised feedback is an important aspect of good teaching and effective learning. See [1].
But providing feedback in the traditional form, that is, by reading the students' answers, evaluating them and writing comments, can be very time-consuming, especially with large classes. I teach a subject called Principles of Business Law that attracts enrolments of up to 700 students each semester. Assessment in this subject consists of four computerised tests, each comprising 30 to 40 multiple-choice questions. The tests are done under examination conditions, and scores are posted a day or two afterwards. It was in this context that I wished to provide individualised feedback to students after each test. With classes of this size it is impractical for an instructor to write comments for each student, so a way was needed to produce the feedback by means of a computer program.

2 What should feedback consist of?

One way of providing feedback would be to publish the test questions together with the correct answers. This is often what students expect, but it may not be the best approach to learning. Thirty or forty questions cannot comprehensively test everything a student should know; a test is usually only a sampling of the student's knowledge and skills. When students correctly answer questions in a test, this indicates a probability that they know the relevant subject area well. Similarly, when they answer questions wrongly, this indicates a probability that they have an inadequate grasp of the subject area. If a student's answers demonstrate a weakness, the student will likely need to revise that whole area of study rather than simply be given the correct answers to specific questions. Accordingly, my aim is to provide feedback in the form of general analysis, comment and advice. See [2].
Copyright © 2006-2007 by CCC Publications. Selected paper from ICVL 2006.

3 Extracting useful information from basic data

Instructors who are knowledgeable in their own specialist area may not also be competent computer programmers, and will need to employ specialist help to create application software. But the instructor needs to understand some basic programming concepts and techniques in order to participate effectively in the process of designing and shaping feedback-generating software. In this paper I explain in detail one way in which such software can be created. The starting point is to identify what basic data is available. In each of the assessment tests that I use in teaching my subject, the students answer the questions by selecting a letter that represents their chosen answer (a, b, c, etc.). This letter is recorded in an electronic database, so that the student's record consists of a sequence of 30 to 40 individual letters. A symbol (-) is used to indicate unanswered questions. A typical string of answers looks like this: baabebccaecb-dabbab-abbceabaccaabaeadac. To create effective feedback, techniques are needed to extract more information from such basic data. How can this be done? Essentially, the process involves combining three types of information. The first is the particular answer the student chose for each question. The second is what the instructor knows generally about the subject area and skills being tested. The third is the focus or intent of each particular question in the test. The computer application can be designed to take proper account of these three factors and to draw specified conclusions from them. In this way, it is possible to build a useful picture of how the student has performed, and to identify their particular strengths and weaknesses.
This information, properly presented with comments and advice, forms the basis for individualised feedback. To envisage fully what is possible, it helps to understand the computer processes involved. An easy example is working out whether a particular answer is correct or incorrect. Essentially, the student's answer must be compared with the correct answer, to see if they are the same. A computer program compares data by using variables. Variables can be thought of as electronic slots in which specified information can be stored. To compare a student's chosen answer with the correct answer, the data representing the student's answer can be retrieved from the database where it is permanently stored and temporarily placed in a specified variable. The data representing the correct answer can be placed in another variable. The program then compares the contents of the two variables. If they match, the student has answered the question correctly, and this result can be stored in a third variable, in the form of an increasing score. If there is no match, the student's answer is wrong, and this conclusion can be stored in a fourth variable (or by adjusting the number in the third variable downwards). Using a process like this for all the student's answers, it is possible to work out how many answers were right or wrong. But a difficulty immediately emerges: the results obtained do not disclose why a student chose a particular answer. There can be many reasons for getting an answer wrong. For example, the student may simply have misread the question; or failed to understand the significance of a particular term used in the question; or lacked the necessary knowledge or skill to answer correctly. Similar possibilities exist in respect of correct answers.
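The comparison process just described reduces to a short routine. The sketch below is in Python and is not the author's actual application (the function name and the short example strings are hypothetical); it compares each chosen letter against an answer key and accumulates the right/wrong/unanswered totals the text calls rans, wans and nans:

```python
def grade(answers, key):
    """Compare a student's answer string with the answer key.

    answers : string such as "ba-b", where '-' marks an unanswered question.
    key     : string of the same length holding the correct letters.
    Returns (rans, wans, nans): counts of right, wrong and unanswered.
    """
    rans = wans = nans = 0
    for chosen, right in zip(answers, key):
        if chosen == '-':
            nans += 1        # question left unanswered
        elif chosen == right:
            rans += 1        # chosen answer matches the correct one
        else:
            wans += 1        # chosen answer differs from the correct one
    return rans, wans, nans
```

The total number of questions attempted, tans, is then simply rans + wans.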
Taken individually, therefore, a student's correct and incorrect answers do not provide a sufficiently reliable basis for giving feedback and advice. But, given sufficient data, it is possible to look for significant patterns in a student's right and wrong answers. When all of the student's answers are analysed in the light of the particular knowledge and skills that the various questions are designed to test, distinct patterns emerge that can serve as the basis for providing that student with helpful feedback.

4 Identifying categories of skill and knowledge

Developing a computer-based application that carries out the necessary analysis of the student's answers requires careful thought and planning. The first step is to analyse each of the questions on the test, to identify, describe and name the particular categories of knowledge and skill involved. To do this the instructor must combine subject-matter expertise, teaching experience and examining skills. It may initially seem difficult to categorise each question in a specific and uncompromising way; some questions defy neat classification. But, when classifying questions in one or more specified ways, it quite often happens that fresh insight is gained into what a question is truly attempting to do, and how that question might be improved so that it achieves its objectives more clearly and precisely. This is not a bad thing to happen. Examples taken from specific tests illustrate the way in which categories may be defined. In the first test written by PBL students, analysis shows that each of the 40 questions involves one of three generic skills. One is the ability to recall and apply acquired knowledge. Another is the ability to find specified information in a statute or a law report. The third is the ability to understand, analyse and draw conclusions from specific facts. Each question can also be categorised according to the area of knowledge involved.
In the test being discussed, the areas of knowledge are: (1) constitutional arrangements and the organs of government in Australia; (2) the law-making powers of specified organs of government; (3) the processes and procedures for enacting legislation; (4) the hierarchy of the federal and state court systems; (5) the nature and organisation of law; (6) understanding and appropriate use of legal terms and concepts; (7) the interpretation and application of statutory law; (8) the interpretation and application of case-law; and (9) recognition and understanding of judicial reasoning. When each category of skill and knowledge has been identified, it needs to be given a brief but distinctive name. Using the example above, the three categories of skill can be named qt1, qt2 and qt3. The nine categories of knowledge can be named ch, lmp, nol, cs, lc, leg, sti, cl and jr. These names can be used (with some modification) to identify variables in a computer program. To carry out the analysis, the program needs three separate variables for each named category of skill and knowledge. This allows us to record whether the student answered each question rightly (r) or wrongly (w), or left it unanswered (n). Under this naming regime, the first category above, qt1, gives rise to three variables named qt1r, qt1w and qt1n. Similarly, qt2 becomes qt2r, qt2w and qt2n; and so on. Further named variables are needed to track other important aspects of the results, for example: q1, q2, q3, etc. to hold the student's answer to each question; right to hold the correct answer being considered; wrong for the incorrect answer being considered; result for the result of comparing two variables; rans for the total number of correct answers; wans for the total number of incorrect answers; nans for the total number of unanswered questions; tans for the total number of questions attempted; and score for the final score for the test.
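Rather than declaring three fixed variables per category, the same bookkeeping can be sketched with a dictionary keyed by the names just described (qt1r, qt1w, qt1n, and so on). This is an illustrative Python sketch, not the author's application; the mapping from each question to its category is invented here, whereas in the real test it would come from the instructor's classification of the questions:

```python
def tally_categories(answers, key, categories):
    """Tally right/wrong/unanswered answers per category.

    answers    : student's answer string, '-' marking unanswered questions.
    key        : string of correct letters, same length as answers.
    categories : one category name per question (e.g. "qt1", "lmp").
    Returns a dict with keys such as "qt1r", "qt1w", "qt1n".
    """
    counts = {}
    for chosen, right, cat in zip(answers, key, categories):
        if chosen == '-':
            suffix = 'n'          # unanswered in this category
        elif chosen == right:
            suffix = 'r'          # right answer in this category
        else:
            suffix = 'w'          # wrong answer in this category
        name = cat + suffix
        counts[name] = counts.get(name, 0) + 1
    return counts
```

Because a question carries both a skill category and a knowledge category, the same function can be run twice over the same answers, once with each classification list.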
5 developing routines for analysis to see what sort of information can now be extracted from the basic data requires some understanding of the computer-based processes involved. imagine that we want to begin by analysing a student’s answer to the first question in the test. the computer program begins by finding the particular student’s string of answers in the database. it then selects the answer chosen by that student to the first question, and places the appropriate letter (a, b or c) in the relevant named variable, for example, in q1. next, the program places the letter which represents the correct answer to that question (a, b or c) in the variable right. by comparing the letter stored in q1 with the letter stored in right, the program can decide whether or not the question was correctly answered. this result can then be stored in a third variable where the total number of correct answers is kept: rans. if the question was not answered, this fact can be recorded in the variable that stores the total number of unanswered questions: nans. and if the question was answered wrongly, this conclusion is stored in the variable that stores the total number of incorrect answers: wans. the program can now be made to classify the student’s answer to the first question by reference to a category of skill. for example, assume question 1 tested the student’s ability to understand, analyse, and draw conclusions from specific facts. recall that the relevant variable for this skill was named qt3. if the student got the answer right, the program can store this conclusion in the variable that counts the student’s correct answers in this category: qt3r. alternatively, if the question was answered wrongly, that conclusion can be recorded in the variable qt3w, which shows the total of wrong answers in this category. unanswered questions in this category are recorded in the variable qt3n.
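the per-question routine just described can be sketched as follows. this is a hypothetical reconstruction, not the paper’s actual code: the answer string, answer key, and category names are invented examples, and a space stands in for an unanswered question.

```python
# Minimal sketch of the per-question analysis routine described above.
# `answers` is one student's string of answers ("a"/"b"/"c", " " = blank),
# `key` holds the correct answers; category names follow the paper's scheme.

def analyse_question(q_index, answers, key, skill, knowledge, counters):
    ans = answers[q_index]
    right = key[q_index]
    if ans == " ":                      # question left unanswered
        outcome = "n"
        counters["nans"] += 1
    elif ans == right:                  # correctly answered
        outcome = "r"
        counters["rans"] += 1
    else:                               # answered wrongly
        outcome = "w"
        counters["wans"] += 1
    counters[skill + outcome] += 1      # e.g. qt3r / qt3w / qt3n
    counters[knowledge + outcome] += 1  # e.g. lmpr / lmpw / lmpn

counters = {k: 0 for k in
            ["rans", "wans", "nans", "qt3r", "qt3w", "qt3n",
             "lmpr", "lmpw", "lmpn"]}
# question 1 tests skill qt3 in knowledge area lmp (invented example)
analyse_question(0, "a c", "abc", "qt3", "lmp", counters)
print(counters["rans"], counters["qt3r"], counters["lmpr"])  # 1 1 1
```

running the same routine over all 40 questions, with the appropriate skill and knowledge name per question, produces exactly the category totals the feedback screens draw on.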
the same procedure is followed to classify the student’s answer to this question in relation to the area of knowledge being tested, using the variables lmpr, lmpw or lmpn. in this way, the student’s answer to question one is evaluated in various ways. the same routines are then repeated for each of the remaining questions, with appropriate changes to the variables used to store the conclusions. once these basic routines have been carried out, further processes can be used to derive additional information from the data, or to organise it usefully. for example, the total number of questions answered by the student can be calculated by adding together the number of the student’s correct and incorrect answers (rans + wans), and placing the result in the variable tans (for total answers). similar processes add to the value of the available information. so far, the individual questions have been classified as belonging to one of nine different areas of knowledge. for the purpose of generating feedback, the areas of knowledge can usefully be grouped into a smaller number of broader categories. the point of doing this is that it often helps students to understand where their strengths and weaknesses might lie in general terms, before going on to a more detailed analysis. in the first test written by pbl students, the nine areas of knowledge can be grouped into three broader categories, represented by variables total1, total2 and total3, as shown below: in total1 the broad area of knowledge is organs, powers and processes of government and includes: constitutional arrangements and the organs of government in australia (ch); the law-making powers of specified organs of government (lmp); the processes and procedures for enacting legislation (leg); and the hierarchy of the federal and state court systems (cs).
in total2, the broad area of knowledge is legal concepts and language and includes: the nature and organisation of law (nol); and understanding and appropriate use of legal terms and concepts (lc). in total3 the broad area of knowledge is the interpretation and application of law and includes: the interpretation and application of statutory law (sti); the interpretation and application of case-law (cl); and recognition and understanding of judicial reasoning (jr). the totals in the relevant variables (shown in brackets above) are added together to show how the student has performed in each broad area of knowledge. this is done separately for right answers, wrong answers and unanswered questions. for example, the numbers in the variables chr, lmpr, legr and csr are added together in total1r to show the correct answers in this broad area of knowledge, while chw, lmpw, legw and csw are added together in total1w to show the incorrect answers in this same area. the variables chn, lmpn, legn and csn are added together in total1n to show the unanswered questions in this area. the same type of process can be used to produce data in relation to other specified learning objectives. finally, we can calculate the student’s score for the test and place it in score. this is done by taking the number of correct answers (already contained in the variable rans) and doing whatever arithmetic calculation is needed to express it as a final mark. in the test now being discussed, a mark out of 15 is needed because the test counts for 15 per cent of the overall assessment for the subject. the number in rans is therefore divided by 2.667 and the result placed in score. 6 presenting information as feedback using routines to analyse the basic data and extract additional information in the way described above is only the initial stage of actually providing feedback to a student. the next step is to build an interface that presents this data appropriately.
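the aggregation and scoring steps described above can be sketched in a few lines. the grouping follows the paper (total1–total3 and the divisor 2.667, i.e. 40 questions rescaled to a mark out of 15); the counter values fed in are invented for illustration:

```python
# Sketch of the aggregation step: broad-area totals are sums of the
# per-area counters, and the final score rescales rans to a mark out
# of 15 (40 / 15 = 2.667, as in the text). Counter values are made up.

GROUPS = {
    "total1": ["ch", "lmp", "leg", "cs"],  # organs, powers, processes of govt
    "total2": ["nol", "lc"],               # legal concepts and language
    "total3": ["sti", "cl", "jr"],         # interpretation and application
}

def broad_totals(counters):
    totals = {}
    for group, areas in GROUPS.items():
        for suffix in ("r", "w", "n"):
            totals[group + suffix] = sum(counters[a + suffix] for a in areas)
    return totals

# invented example: one right, one wrong, one unanswered in every area
counters = {a + s: 1 for areas in GROUPS.values() for a in areas for s in "rwn"}
totals = broad_totals(counters)
print(totals["total1r"])            # 4 areas, one right answer each -> 4

rans = 32                           # e.g. 32 correct answers out of 40
score = round(rans / 2.667, 1)      # final mark out of 15
print(score)                        # 12.0
```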
the information available is sufficient to provide quite detailed feedback if it is built into a careful sequence of explanation, coupled with comment and advice. this should be presented in a clear, friendly, constructive and flexible way. one possibility is to follow a traditional web-page design, with a list of contents on the left of the screen to indicate the extent and structure of the available feedback, with direct hyperlinks to the different sections. see figure 2 below. as far as possible, the feedback should be individualised, by displaying the particular student’s own data. in addition, particular comments and advice can be displayed selectively, depending on whether the particular student has a good score, an average score, or a poor score. the screenshots below provide examples. to script a full range of alternative comments and advice requires considerable forethought, but the result is worthwhile. the feedback can also include information about how the individual student’s performance compares to the class as a whole. and it can usefully include information and advice about future tests, for example, what new forms of question will be encountered, and what specific preparation may be needed. students are very receptive to such information in the immediate aftermath of a test. the feedback applications can be made available to students either on a local area network, or by providing a downloadable version, or by running them on-line. 7 providing feedback to the instructor so far, this paper has been concerned with providing feedback to the students, but it is also important that the instructor get feedback on the effectiveness of their teaching, the validity of the questions set in the test, and the extent and accuracy of student learning. traditional marking, which involves reading the answers, provides this feedback because, if a significant number of students make the same mistake, the instructor quickly becomes aware of the problem.
with computer-based testing it is harder to get a clear idea of these matters. the normal output of a computer-based test is a list of final marks, and these do not tell the instructor much about where specific problems might lie. however, it is possible to use the techniques described above (with appropriate modification) to provide an analysis of the group results. for a group analysis, the program begins by finding each student’s string of answers in the database and carrying out the same sort of analysis already described, classifying the answers as right or wrong, and categorising the right and wrong answers in various ways, for example, by area of knowledge or skill, or in relation to specified learning objectives. as each student’s string of answers is analysed, a cumulative total is built up, so that in the end it is known how many students in the entire group got each question right or wrong; what the distribution of marks is; what percentage of the answers were right or wrong in relation to particular areas of law; and what percentage of students satisfactorily demonstrated competency at particular skills. this type of analysis would be time-consuming to do manually, but it is quickly and easily accomplished using the methodologies described. the results give an accurate and clear picture of group performance (for example, see figures 6 and 7 below). if too many students appear to be answering a particular question wrongly, the instructor will quickly notice this and be able to investigate the different possibilities. it may be that the question is badly written; or that the topic is poorly taught; or that the students have prepared inadequately in that area of study. responding appropriately helps to improve the quality of the teaching and learning process.
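the cumulative group analysis described above can be sketched as a loop over every student’s answer string. again this is an illustrative reconstruction; the answer strings and key below are invented:

```python
# Sketch of the group analysis: the per-student classification is run over
# every answer string in the database, accumulating per-question totals
# for the whole class. Answer strings and key are invented examples.

def group_analysis(all_answers, key):
    n_q = len(key)
    right = [0] * n_q
    wrong = [0] * n_q
    for answers in all_answers:          # one answer string per student
        for i in range(n_q):
            if answers[i] == key[i]:
                right[i] += 1
            elif answers[i] != " ":      # a blank counts as unanswered
                wrong[i] += 1
    # percentage of students answering each question correctly
    pct = [100.0 * r / len(all_answers) for r in right]
    return right, wrong, pct

students = ["abc", "abb", "acc", "a c"]  # four invented answer strings
right, wrong, pct = group_analysis(students, "abc")
print(right)  # [4, 2, 3]
print(pct)    # [100.0, 50.0, 75.0]
```

a question with an unusually low entry in `pct` is exactly the signal discussed in the text: a badly written question, a poorly taught topic, or inadequate preparation.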
8 conclusions by using appropriate techniques, and properly coordinating the skills and experience of instructors and computer-programmers, it is possible to automatically generate and deliver very satisfactory individualised feedback for students and instructors. although the examples discussed here use the data obtained from computer-based tests in multiple-choice form, the same ideas could be adapted to tests that are not computer based, or that do not consist of multiple-choice questions. all that is required is to work out a marking scheme where numbers or letters are used to record the marker’s evaluation of what the student has achieved. this data could be digitised and used as the basis for computer-generated analysis and feedback, in much the same way as described in this paper. in essence, therefore, the techniques explained in this paper could find application in a wide range of situations. the screenshots illustrate various aspects of the ideas explained in this paper. they show how the information generated from the basic data can be presented in a constructive, meaningful and readable style, and within a well-contextualised framework. the last two screenshots present an analysis of group data and show how a clear and detailed overview can be gained by the instructor of class performance as a whole.
figure 1: a sample question from a test. this question involves case-law, more specifically the meaning of coded information in case citations (variable cl). the student must interpret and evaluate the significance of that information (variable qt3).
figure 2: in the feedback application, the topics are listed on the left of the screen with hyperlinks to the content of each section. this particular screen explains the scoring process, shows the individual student’s final score and grade, and provides an appropriate comment.
figure 3: this screen provides a detailed analysis of the individual student’s performance in a specified area of law (organs, powers and processes of government) and selectively provides appropriate comment. the feedback is based on the variables total1r, chr, lmpr, legr and csr.
figure 4: this screen uses the variables qt1r, qt2r and qt3r to analyse the individual student’s ability to perform tasks involving specified skills. appropriate comments are also displayed selectively, depending on the values in these variables.
figure 5: this screen summarises all of the available data. presented in tabular form, it gives a concise overview of the student’s performance. it also shows how a substantial amount of meaningful information can be generated from the basic data.
figure 6: using the same variables as devised for the feedback application, the data for the entire group of students can be generated for the instructor. this screen shows how many students in one group answered particular questions correctly or not.
figure 7: group data can also give the instructor an overview of performance in relation to areas of knowledge, or particular skills. this screen shows the percentage of correct answers for the entire group in relation to the eleven areas of knowledge being tested.
references
[1] johnstone r., patterson j. and rubinstein k., improving criteria and feedback in student assessment in law, cavendish publishing, australia, 1998.
[2] east r., effective assessment strategies in law, http://www.ukcle.ac.uk/resources/assessment/effective.html, 2005.
[3] higgins, e. and tatham, l., assessing by multiple choice question (mcq) tests, http://www.ukcle.ac.uk/resources/trns/mcqs/index.html, 2003.
[4] lambiris m., assessment management software, australian law courseware pty ltd, australia, http://www.alcware.com, 2005-2006.
michael lambiris, the university of melbourne, faculty of law, victoria 3010, australia, e-mail: m.lambiris@unimelb.edu.au
received: november 6, 2006
editor’s note about the author: michael lambiris (born january 22, 1950) obtained an llb (hons) from the university of london in 1971, and a phd from rhodes university in 1988. he has held positions at the university of zimbabwe (1976-1982) and at rhodes university, south africa (1982-1991). he is presently an associate professor and reader in the faculty of law, the university of melbourne, victoria, australia. his main fields of teaching and research are commercial law and computer-based legal education. in addition to writing computer-based learning materials, he has developed computer-based testing and feedback software, written various papers and books and presented papers at many international conferences. he is the managing director of australian law courseware (pty) ltd which publishes computer-based learning materials for law students.
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iv (2009), no. 2, pp. 185-197
quality control of statistical learning environments and prediction of learning outcomes through reproducible computing
patrick wessa, k.u.leuven association lessius, dept. of business studies, belgium, e-mail: patrick@wessa.net
abstract: this article introduces a new approach to statistics education that allows us to accurately measure and control key aspects of the computations and communication processes that are involved in non-rote learning within the pedagogical paradigm of constructivism. the solution that is presented relies on a newly developed technology (hosted at www.freestatistics.org) and computing framework (hosted at www.wessa.net) that supports reproducibility and reusability of statistical research results that are presented in a so-called compendium.
reproducible computing leads to responsible learning behaviour, and a stream of high-quality communications that emerges when students are engaged in peer review activities. more importantly, the proposed solution provides a series of objective measurements of actual learning processes that are otherwise unobservable. a comparison between actual and reported data demonstrates that reported learning process measurements are highly misleading in unexpected ways. however, reproducible computing and objective measurements of actual learning behaviour reveal important guidelines that allow us to improve the effectiveness of learning and the e-learning system. keywords: reproducible computing, learning environment, quality control, statistics education, psychometrics 1 introduction in education-related research it is common practice to investigate learning processes through measurements that are based on questionnaires. reported measures often reveal interesting information about a wide variety of aspects of computing-assisted learning such as: computer attitudes [22]; computer emotions and knowledge [17]; learner experiences and satisfaction [34]; etc. the importance of such measurements has been highlighted by many authors from various perspectives ([7], [15], [12]), especially from the perspective of the constructivist pedagogical paradigm ([35], [30], [11], [24]). these reported measures, while intrinsically interesting, may not always provide us with the information we need to assess and improve systems that support e-learning. moreover, the implementation of new learning technologies and data analysis tools opens up a wide array of measurement opportunities which lead to new areas of research. an excellent example is the use of data mining tools in the open source e-learning environment called moodle [28].
even though it seems to be very difficult to measure and empirically prove [25], there is no doubt in my mind that the introduction of computers in homes and classrooms has led to an improvement of overall learning productivity, educational communication mechanisms, social constructivism, and collaboration. however, the use of computers and software in statistics education may unintentionally result in several types of adverse effects, because the complex processes that are required to learn and (truly) understand statistical concepts are often mystified by technicalities and a variety of practical problems that have nothing to do with mathematics or statistics. it is within this context that i argue that a system for quality control should be embedded into the e-learning system, which is not limited to the virtual learning environment but extends to the statistical software, databases, and learning repositories (statistical learning environment). there is an important, additional benefit to implementing such a monitoring and control system: it is directly related to the problem of irreproducible research, which has received a great deal of attention within the statistical computing community ([9], [26], [29], [14], [13], [18], [10]). the most prominent citation about the problem of irreproducible research is called claerbout’s principle ([9]): an article about computational science in a scientific publication is not the scholarship itself, it is merely advertising of the scholarship. the actual scholarship is the complete software development environment and the complete set of instructions that generated the figures... several solutions have been proposed ([5], [10], [19]) but have not been adopted in statistics education because they require students to understand the technicalities of scientific word processing (latex) or statistical programming (r code).
copyright © 2006-2009 by ccc publications
based on a newly developed statistical learning environment (sle) i propose a solution that is feasible for educational purposes and allows us to monitor, research, and control the learning processes based on the dynamics of between-student communication and collaboration. 2 reproducible computing 2.1 r framework the r framework allows educators and scientists to develop new, tailor-made statistical software (based on the r language) within the context of an open-access business model that allows us to create, disseminate, and maintain software modules efficiently and with a very low cost in terms of computing resources and maintenance efforts [36]. the so-called r modules empower students to perform statistical analysis through a web-based interface that does not require them to download or install anything on the client machine. this permits students to focus primarily on the interpretation of the analysis. however, the r framework also allows advanced students and scientists to inspect and change the r code that was written by the original author. this results in the creation of so-called derived r modules that may be better suited for particular purposes. there are several important reasons why the r framework helps in controlling the quality of the statistical learning processes that are supported by the computer: • the r modules are web applications with an advanced session management which includes all aspects of the computations that are executed. in addition, the session manager uses attributes that identify the student and the course in which (s)he is enrolled. therefore all computations that are performed within the context of a statistics course can be associated with an individual student. to implement this feature, the educator only needs to use certain html tags in the hyperlink that is inserted in the virtual learning environment. • every r module is uniquely described by an expandable set of meta data (incl.
the actual statistical code) which can be stored and transmitted. this implies that every computation that is executed can be uniquely defined by the r module’s meta data and additional information about the data and the parameters that have been specified by the user. as a consequence, every computation can be uniquely described and archived with meta data. • the r framework allows other servers (under certain conditions) to send meta data through an ordinary http request which allows it to rebuild and execute the r module with the specified data and parameters in real time. therefore it is possible to remotely store computational objects and send them back to the r framework such that the original computation can be reproduced and reused. • all the processes that are associated with the above items are automatically stored in a so-called process measurement database. this implies that all computer-assisted learning activities are objectively measured and stored for the purpose of analysis. 2.2 compendium platform if a derived r module contains generic improvements, or if a computation needs to be communicated to other students/scientists, then it is necessary to have a simple, transparent mechanism that allows one to permanently store the computation in a repository of computational objects that can be easily retrieved, recomputed, and reused. such a repository was recently created within the oof 2007/13 project of the k.u.leuven association and is called the compendium platform. the main reason for creating the r framework and the compendium platform is that it allows anyone to create and use compendia of reproducible research. a compendium is defined as [37]: a research document where each computation is referenced by a unique url that points to an object that contains all the information that is necessary to recompute it.
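the idea that a computation is fully described by its meta data (code, data, and parameters) can be illustrated with a small conceptual sketch. this is not the actual r framework api: the module name, the stored code, and the `reproduce()` helper are all hypothetical, invented purely to make the archiving idea concrete.

```python
# Conceptual sketch of "computation as meta data" (NOT the r framework's
# real interface): a computation archived as code + data + parameters can
# be rebuilt and re-executed from the meta data alone, at any later time.

meta = {
    "module": "summary_stats",        # hypothetical module identifier
    "code": "def run(data, bins): return [min(data), max(data), bins]",
    "data": [2, 5, 3, 8, 1],          # data supplied by the user
    "params": {"bins": 4},            # parameters specified by the user
}

def reproduce(meta):
    """Rebuild and execute a computation from its meta data alone."""
    scope = {}
    exec(meta["code"], scope)         # rebuild the module's code
    return scope["run"](meta["data"], **meta["params"])

# anyone holding the meta data object recomputes the identical result
assert reproduce(meta) == reproduce(meta)
print(reproduce(meta))                # [1, 8, 4]
```

archiving such objects in a repository, each behind a unique url, is precisely what makes a compendium’s results recomputable by any reader.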
such documents can be easily created (even by students) and permit any reader to (exactly) recompute the statistical results that are presented therein. a few simple clicks are sufficient to have the r framework reproduce the results and to reuse them in derived work [37]. the practical implications of this technology will become obvious in section 3 because the three figures that are presented can be recomputed and reused through the compendium platform. 2.3 communication, feedback, and learning the concept of reproducible computing was implemented in several undergraduate statistics courses in order to thoroughly test the new system and to measure key aspects of the educational activities and experiences. two different student populations were investigated in detail: a group of (academic) bachelor students, and a group of so-called switching students. the second population is of particular interest because it consists of students who obtained a (professional) bachelor degree and decided to make the switch to an academic master which requires them to complete a preparatory year. on the one hand, switching students are highly motivated and more mature than the bachelor students. a priori, one would expect them to prefer practical activities (such as communication and computing) above theory and critical reflection. on the other hand, one might expect the bachelor students to have a more critical (scientific) attitude and better mathematical background than the switching students. students from both populations took a similar statistics course which covered topics from introductory statistics, regression analysis, and introductory time series analysis. the main learning activities in both statistics courses were based on a weekly series of workshops where each student was required to investigate practical, empirical problems. at the end of each week, students submitted their papers electronically. 
during the lecture i proposed a series of solutions and illustrated commonly made mistakes. after the lecture, students had to work on the next assignment and complete a series of peer reviews (assessments) about the work that was submitted the week before. the assessment grades did not count towards the final score; however, each submitted peer review was accompanied by verbal feedback messages. i graded a (quasi random) sample of these messages in order to provide students with a strong incentive to take the review process seriously. there is strong empirical evidence that this approach had beneficial effects on non-rote learning of statistical concepts [38]. 3 objective measurements versus reported data in a recent paper [37] it is illustrated how the compendium platform’s repository supports “technical” quality control of the statistical software and accompanying documentation for students. on the one hand, reproducible computing allows students to accurately communicate computational problems and questions without the need to understand the underlying technicalities. on the other hand, it allows the educator (and creator of the computational software) to analyze the reported problem (based on the detailed, raw output of the r engine that executed the request) and to transparently communicate the solutions to the students. moreover, the measurement of learning activities and experiences is a conditio sine qua non for controlling the “overall” quality of the sle. this will be illustrated, based on the data that have been collected from both student groups. at the same time, the importance of objective (as opposed to reported) measurements is illustrated based on a simple, comparative diagnostic tool. the reported measurements were obtained through questionnaires on a 5-point likert scale and should consequently be treated as ordinal data.
the questions were based on well-known psychological surveys ([12], [8]) and the ibm computer system usability survey [20] which was adapted and extended [27]. useful data was obtained from a total of 111 bachelor students and 129 switching students; the response ratio was very high (between 82.9% and 92%, depending on the questionnaire). all observations of actual learning activities were measured on a ratio scale (the number of archived computations and the number of submitted feedback messages). a total number of 34438 meaningful, verbal feedback communications and 6587 archived computations were registered. in order to compare the actual and reported data, all measurements were converted to ordinal rank orders. in addition, the pearson’s rho correlations and kendall’s tau rank correlations ([1], [2], [16]) that represent the degree of linear association between the properties under investigation were computed (these can be consulted in the archived computations about the figures). in electronic versions of this paper, one can simply (ctrl-)click the hyperlinks below figures 1, 2, and 3 to view the archived computation in the repository. readers of the printed version of this document have to manually enter the respective urls into their internet browser to view the statistical computations that have been stored (at www.freestatistics.org). figure 1 displays the bivariate kernel density [21] between the rank order of the number of feedback messages that have been submitted in peer reviews about the workshops (x-axis) and the rank order of the number of (reproducible) computations that have been archived in the repository (y-axis). the rank orders have been computed within the bachelor population for the top panels, and within the switching population for the bottom panels. this implies that the ranks that are attributed to female and male students are expressed on the same axes and can be compared.
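the rank conversion and the kendall tau computation mentioned above can be sketched as follows. the numbers are invented; the paper’s real data live in the archived computations at www.freestatistics.org, and a plain O(n²) tau without tie correction is used here for simplicity:

```python
# Sketch of the comparison step: actual and reported counts are converted
# to ordinal rank orders, then Kendall's tau is computed over the pairs.
# Invented data; plain O(n^2) tau, no tie correction.

def ranks(values):
    """Rank order, 1 = smallest (ties broken by position, for simplicity)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        r[i] = rank
    return r

def kendall_tau(x, y):
    """(concordant - discordant) / total pairs."""
    n = len(x)
    concordant = discordant = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            if prod > 0:
                concordant += 1
            elif prod < 0:
                discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)

reported = [40, 10, 25, 5, 30]   # what students said they did (invented)
actual   = [12, 30, 20, 28, 15]  # objectively measured counts (invented)
tau = kendall_tau(ranks(reported), ranks(actual))
print(round(tau, 2))   # -0.8: reports poorly track actual behaviour
```

a tau near zero or negative on such rank pairs is exactly the kind of evidence the paper uses to argue that reported measures are misleading.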
figure 1 clearly demonstrates that female bachelor students are much more involved in feedback and computing than their male colleagues. at the same time, female switching students are more computing-oriented whereas the male switching students seem to have a slight preference for feedback communication. this information has important repercussions for controlling the quality of the learning environment and it provides clear guidelines towards actions that should be taken (by me) to improve participatory incentives towards male bachelor students in future courses. would i have been able to gain this insight based on reported measurements alone? the answer is clearly negative (as is illustrated in figures 2 and 3). it is quite obvious that male bachelor students highly over-estimate their performance in terms of feedback submissions (see figure 2) because the rank orders of reported measures (x-axis) are higher than the ranks of actual feedback submissions (y-axis). female bachelor students however, underestimate their involvement (relative to their male colleagues) because they are concentrated above the diagonal line. in the male switching student population several clusters of high density can be detected which leads us to conclude that we cannot treat them as one homogeneous group. in figure 3 the comparison between reported computing measures (x-axis) and actual computing (y-axis) leads to similar conclusions. 
male bachelor students highly exaggerate their efforts, whereas female bachelor and switching students underestimate themselves.
figure 1: submitted feedback versus reproducible computations. the four panels (female/male bachelor students, female/male switching students) plot the number of submitted messages against the number of reproducible computations as bivariate kernel densities. www.freestatistics.org/blog/date/2008/jun/30/t1214840420q0fyankop4x9ebf.htm
the group of male switching students is heterogeneous. overall, the testimony of students is extremely misleading and poorly correlated with actual observations. if we had recomputed figure 1 with reported measures, the conclusions would have been the opposite of what is true. the reader can try out this experiment by simply reproducing the computation of figure 1 with reported measures on both axes. 4 quality control in order to be able to control (and improve) the quality of the sle, it is necessary to estimate the impact of key aspects of the learning processes that are associated with the sle.
The methodology that allows us to do this is based on a mathematical model, described in [40], which relates the learning outcomes to objectively measured activities and reported experiences. Typically, models that predict learning outcomes based on exogenous variables related to the learning (and computing) environment have an extremely low percentage of variance explained. In a recent and extensive study [25], six models were discussed that predicted the statistics subtest scores of the Massachusetts Comprehensive Assessment System; the variance explained ranged between 4% and 7%. It is obvious that any model that is used to control the quality of an SLE should perform much better.

190 Patrick Wessa

[Figure 2: Reported versus actually submitted feedback — density contour plots of reported feedback submissions (x-axis) against actual feedback submissions (y-axis) for female bachelor, male bachelor, female switching, and male switching students. http://www.freestatistics.org/blog/date/2008/jun/30/t12148409608o0dnj2k4s04jil.htm]

There are three important requirements to build high-quality models:

1. high-quality exogenous variables (preferably based on objective measurements) [39];

2. a high-quality endogenous variable (c.q. test scores) based on optimal weights of the individual items (Section 4.1, [40]);

3.
homogeneous sample for which the model is computed.

The third condition refers to the fact that student populations may consist of different types of students with specific learning behaviors. In the aforementioned statistics course there were 4 groups with distinct characteristics, as clearly illustrated in Section 3 and in Figures 1, 2, and 3. Instead of computing separate models (for each of the sub-populations), Section 4.2 presents a comprehensive model with all combinations of interaction effects (male/female and bachelor/switching). This greatly improves the interpretation of the prediction model and allows us to perform differential quality control of the SLE.

4.1 Model

First, a classical regression approach is used to predict the learning outcomes (c.q. exam scores) as a linear function of $(k-1) \in \mathbb{N}_0$ exogenous variables of interest. Let $\vec{y}$ represent an $n \times 1$ vector for all $n \in \mathbb{N}$ students (with $n > k$), containing the weighted sum of $g$ item scores (c.q. scores on individual exam questions): $\vec{y} \equiv \sum_{j=1}^{g} \omega_j \vec{y}_j$ with initial unit weights $\omega_j \equiv 1$. In addition, define an $n \times k$ matrix $X$ that represents all exogenous variables (including a one-valued column which represents the constant), and a $k \times 1$ parameter vector $\vec{b}$ that represents the weights of the linear combination of all columns in $X$ that is used to describe $\vec{y}$. The complete model is denoted M1 and is defined by $\vec{y} = X\vec{b} + \vec{e}$, where $\vec{e} \sim \text{i.i.d. } N(\vec{0}, \sigma_e^2)$ represents the prediction error.
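A minimal sketch of estimating M1 by ordinary least squares via the normal equations, $\hat{\vec{b}} = (X'X)^{-1}X'\vec{y}$, with pure-Python helpers and made-up data (a two-column X: constant plus one regressor):

```python
# OLS for a two-column design matrix via the normal equations; illustrative only.

def transpose(m):
    return [list(row) for row in zip(*m)]

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def inv2(m):
    """Inverse of a 2x2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def ols(X, y):
    """b = (X'X)^{-1} X'y for a two-column X."""
    Xt = transpose(X)
    Xty = matmul(Xt, [[v] for v in y])
    b = matmul(inv2(matmul(Xt, X)), Xty)
    return [row[0] for row in b]

# Hypothetical data generated from y = 1 + 2x with zero error, so OLS
# recovers the intercept and slope exactly.
X = [[1.0, x] for x in [0.0, 1.0, 2.0, 3.0]]
y = [1.0, 3.0, 5.0, 7.0]
b = ols(X, y)
```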
[Figure 3: Reported versus actual reproducible computing — density contour plots for female bachelor, male bachelor, female switching, and male switching students. http://www.freestatistics.org/blog/date/2008/jun/30/t1214841152sn6jlyhgseclgqm.htm]

In the second model M2, the prediction of the first model is specified by a linear combination of the individual items (questions) that made up the total exam score. Let $Y$ represent the $n \times g$ matrix that contains all $g$ item scores; then it is possible to define the model $\hat{\vec{y}} = Y\vec{c} + \vec{a}$, where $\vec{a} \sim \text{i.i.d. } N(\vec{0}, \sigma_a^2)$. Note that there is no constant term in this model. The third model (M3) simply combines M1 and M2 by relating $\hat{\hat{\vec{y}}}$ to $X$ in the regression model $\hat{\hat{\vec{y}}} = X\vec{f} + \vec{u}$. The estimator for $\vec{f}$ can be shown to be $\hat{\vec{f}} = (X'X)^{-1}X'\hat{\hat{\vec{y}}} = (X'X)^{-1}X'Y(Y'Y)^{-1}Y'X(X'X)^{-1}X'\vec{y}$ ([40]). M3 is likely to yield different results from M1 unless the estimated parameters of M2 are (nearly) equal to the original weights: $\hat{\vec{c}} = (\hat{c}_1, \hat{c}_2, \hat{c}_3, \ldots, \hat{c}_g)' \simeq (\hat{\omega}_1, \hat{\omega}_2, \hat{\omega}_3, \ldots, \hat{\omega}_g)'$. From a statistical point of view it is not possible to test the improvement that is induced by the objective exam score transformations.
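The chained estimator for $\vec{f}$ can be checked numerically against the equivalent two-step route: fit M1, project the fitted values onto the item-score columns (M2), then regress the doubly fitted values on $X$ (M3). A sketch with hypothetical toy matrices:

```python
# Verify f = (X'X)^-1 X'Y (Y'Y)^-1 Y'X (X'X)^-1 X'y against the two-step
# route through M1 and M2. All data are made up; pure-Python linear algebra.

def T(m):                    # transpose
    return [list(r) for r in zip(*m)]

def mm(a, b):                # matrix product
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def inv2(m):                 # 2x2 inverse
    (p, q), (r, s) = m
    det = p * s - q * r
    return [[s / det, -q / det], [-r / det, p / det]]

X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]   # n=4, k=2 regressors
Y = [[2.0, 1.0], [1.0, 2.0], [3.0, 1.0], [2.0, 3.0]]   # g=2 item-score columns
y = [[3.0], [4.0], [6.0], [9.0]]                       # total exam scores

XtXi = inv2(mm(T(X), X))
YtYi = inv2(mm(T(Y), Y))

# One-shot chained estimator.
f_direct = mm(XtXi, mm(T(X), mm(Y, mm(YtYi, mm(T(Y), mm(X, mm(XtXi, mm(T(X), y))))))))

# Two-step route: M1 fit, M2 projection of the fitted values, M3 regression on X.
y_hat = mm(X, mm(XtXi, mm(T(X), y)))          # M1: X (X'X)^-1 X' y
y_hathat = mm(Y, mm(YtYi, mm(T(Y), y_hat)))   # M2: project onto columns of Y
f_twostep = mm(XtXi, mm(T(X), y_hathat))      # M3
```

Both routes produce the same $\hat{\vec{f}}$, which is exactly the algebraic identity stated above.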
The reason for this is that the traditional F-test assumes that the endogenous variables in the two models to be compared (M1 and M3) are identical. Therefore it is necessary to use an auxiliary model (M3*) which is based on M3 and includes $\vec{y}$ as an explanatory variable. This extended model, $\hat{\hat{\vec{y}}} = X\vec{f} + \vec{y}g + \vec{u}$, can be shown to be equivalent to $\left( Y(Y'Y)^{-1}Y'X(X'X)^{-1}X' - gI_n \right)\vec{y} = X\vec{f} + \vec{u}$, such that it can be concluded that M3* is equal to M1 with a transformed endogenous variable. The interesting aspect of this auxiliary regression is the limiting case when $g \to 0$ and $Y(Y'Y)^{-1}Y'X(X'X)^{-1}X' \to I_n$, because it leads to M1 with $\vec{f} = \vec{b}$ and $\vec{u} = \vec{e}$. This result is important because it is now easy to test whether it is necessary to apply the transformation to the endogenous variable. The null hypothesis is simply $H_0: g = 0$ versus $H_1: g \neq 0$, which can be tested with the conventional t-test. In other words, if the null hypothesis is rejected, then the transformation is necessary and the estimated parameters $\hat{\vec{c}}$ and $\hat{\vec{f}}$ are interpretable. The usefulness of this modeling approach is illustrated in the next subsection.

192 Patrick Wessa

4.2 Empirical evidence

The data that was collected from the implemented SLE (as described in Section 2.3) contained the following exogenous variables:

• bcount: actual computations
• gender: 0 = female / 1 = male
• future: intention to use
• pop: 0 = bachelor / 1 = switching
• nnzfg: actually submitted feedback messages in peer review
• reflection: reported feedback messages in peer review

Table 1 presents the empirical results of two models (M1 and M3). The endogenous variable in M1 is the sum of all exam questions with unit weights, whereas M3 is based on objective exam score transformations (optimal weights of individual questions).

Table 1: Empirical results

variable | estimate M1 | estimate M3
(intercept) | 6.935987 * | 6.333557 ***
bcount | 0.033281 | 0.035939 ***
gender | -2.166419 | -1.465320
pop | -4.616769 | -0.494553
nnzfg | 0.027379 * | 0.030161 ***
future | 0.625812 . | 0.639711 ***
reflection | -0.167591 | -0.167980 **
bcount:gender | -0.027786 | -0.036090 **
bcount:pop | 0.018510 | -0.007901
gender:pop | 3.699211 | 2.116220
gender:nnzfg | -0.001449 | -0.013074 **
pop:nnzfg | -0.020088 | -0.024768 ***
gender:future | -0.274359 | -0.354713 *
pop:future | -0.038318 | -0.143013
gender:reflection | 0.161042 | 0.225622 *
pop:reflection | 0.289011 | 0.160574 .
bcount:gender:pop | 0.019236 | 0.021735
gender:pop:nnzfg | -0.002538 | 0.009896 .
gender:pop:future | -0.248325 | -0.157158
gender:pop:reflection | -0.128991 | -0.160408
residual standard error | 3.446 | 0.9593
degrees of freedom | 179 | 179
adj. R-squared | 0.1607 | 0.6626
F-statistic | 2.995 *** | 21.47 ***

Significance codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

From the results in Table 1 it is clear that M3, unlike M1, provides a lot of interesting information about the relationship between optimally weighted exam scores and the exogenous variables which are under the control of the educator. The percentage of variance explained (adjusted R²) in M3 is more than 66%, which allows us to make much better predictions than what is usually reported in otherwise excellent academic articles [25]. As explained before, the traditional F-test cannot be used to test the significance of the improvement. However, the auxiliary regression's null hypothesis H0: g = 0 is rejected even if an extremely low type I error is chosen (the p-value is 3.23 × 10^-11). This implies that M3 performs significantly better and that the objective exam score transformations are necessary. In addition, several diagnostic tests of the final model (M3) are shown in Figure 4; they indicate no statistical inadequacies. The most interesting aspects of this analysis are the estimated parameters of M3.
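The auxiliary regression rejects H0: g = 0 with a conventional t-test. A minimal sketch of turning a t statistic into a two-sided p-value, using the standard normal approximation to the t distribution (reasonable at 179 degrees of freedom); the statistic value below is hypothetical, not the one behind the reported p-value:

```python
# Two-sided p-value for a t statistic under the normal approximation.
import math

def two_sided_p(t_stat):
    """p = 2 * (1 - Phi(|t|)), with Phi the standard normal CDF via erf."""
    return 2.0 * (1.0 - 0.5 * (1.0 + math.erf(abs(t_stat) / math.sqrt(2.0))))

# With ~179 degrees of freedom the t distribution is close to N(0, 1), so a
# hypothetical t statistic of 2.0 gives p of roughly 0.046: significant at 5%.
p = two_sided_p(2.0)
```

A very large t statistic drives the p-value toward zero, which is how an extremely small p-value such as the one reported above arises.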
With regard to quality control of the SLE, the following conclusions can be made:

• There is a positive effect of performing reproducible, statistical computations (bcount). This effect is significant at the 0.1% type I error level and cannot be measured without optimal weights (M1). However, this effect is only relevant for female students, because the parameter that is associated with bcount:gender is also significant and has a negative sign.

• Submitting feedback messages (in peer review) is very beneficial and improves exam scores (p-value < 0.01%). This effect is about twice as large for female students as for males (the gender:nnzfg parameter partially offsets the effect for male students). In addition, students from the switching population benefit less from feedback submissions.

• The reported "intention to use" (as measured in the usability survey) positively affects exam scores. This effect is strongest for female students. Note that previous research has shown that intention is mainly related to the student's perception of the comparative advantage (of the software system) for learning statistics as compared to other alternatives (such as textbooks) [27].

• Females who report a high number of submitted feedback messages have significantly lower exam scores. On the other hand, male students who exaggerate their efforts are not in danger of having lower exam scores. This implies that the female exaggeration bias is small but harmful, whereas the male exaggeration bias is big and harmless.

Based on these empirical results it is now possible to control (improve) the quality of the SLE:

• Female students should be encouraged to generate more reproducible computations.

• Peer review (based on reproducible computing) is highly beneficial for learning statistics, especially when it requires students to engage in submitting feedback messages to their peers. Male students need to (at least) double their efforts (compared to females) in order to obtain the same effect.
• Students from the switching population also need more feedback submissions than bachelor students.

• It is important to explain the SLE to students, emphasizing the comparative advantages of the system and the potentially improved exam scores. However, male students need more (or better) arguments before they accept the new technology and exhibit an increased degree of "intention to use."

• Female students who exaggerate their reported efforts should receive accurate feedback about their real performance, based on objective measurements. Self-assessment and reflection about students' actual efforts (as compared to perceived efforts) should be an integral part of the SLE.

[Figure 4: Diagnostics of M3 — residuals vs. fitted values, normal Q-Q plot, scale-location plot, and residuals vs. leverage (with Cook's distance).]

5 Summary and conclusions

The good news is that we now have a technology and methodology to assess actual and reported learning activities for any student population that makes use of the new compendium technology. Ultimately, this allows us to take control of and improve the SLE, which includes the e-learning environment, the statistical software, the course materials, and the overall learning experiences of all students.

Bibliography

[1] Arndt S., Turvey C., Andreasen N. (1999), Correlating and predicting psychiatric symptom ratings: Spearman's r versus Kendall's tau correlation, Journal of Psychiatric Research, 33, 97-104

[2] Arndt S., Magnotta V.
(2001), Generating random series with known values of Kendall's tau, Computer Methods and Programs in Biomedicine, 65, 17-23

[3] Attitudes to Thinking and Learning Survey, (n.d.), retrieved December 22, 2004, from www.moodle.org

[4] Benson J. (1989), Structural components of statistical test anxiety in adults: an exploratory study, Journal of Experimental Education, 57, 247-261

[5] Buckheit J., and Donoho D. L. (1995), Wavelets and statistics, Springer-Verlag, editor: Antoniadis, A.

[6] Chambers J. M., Cleveland W. S., Kleiner B. and Tukey P. A. (1983), Graphical Methods for Data Analysis, Wadsworth & Brooks/Cole

[7] Chen Z. (2008), Learning about learners: system learning in virtual learning environment, International Journal of Computers, Communications & Control, Vol. III, No. 1, pp. 33-40

[8] Constructivist On-Line Learning Environment Survey, (n.d.), retrieved December 22, 2004, from www.moodle.org

[9] de Leeuw J. (2001), Reproducible research: the bottom line, Department of Statistics Papers, 2001031101, Department of Statistics, UCLA, URL http://repositories.cdlib.org/uclastat/papers/2001031101

[10] Donoho D. L., and Huo X. (2005), BeamLab and reproducible research, International Journal of Wavelets, Multiresolution and Information Processing, 2(4), 391-414

[11] Eggen P., and Kauchak D. (2001), Educational Psychology: Windows on Classrooms (5th ed.), Upper Saddle River, NJ: Prentice Hall

[12] Galotti K. M., Clinchy B. M., Ainsworth K., Lavin B. and Mansfield A. F. (1999), A new way of assessing ways of knowing: the Attitudes Towards Thinking and Learning Survey (ATTLS), Sex Roles, 40(9/10), 745-766

[13] Gentleman R. (2005), Applying reproducible research in scientific discovery, BioSilico, URL http://gentleman.fhcrc.org/fld-talks/rgrepres.pdf

[14] Green P. J.
(2003), Diversities of gifts, but the same spirit, The Statistician, 52(4), 423-438

[15] Hilton S., Schau C., Olsen J. (2004), Survey of Attitudes Toward Statistics: factor structure invariance by gender and by administration time, Structural Equation Modeling, 11(1)

[16] Hollander M., and Wolfe D. A. (1973), Nonparametric Statistical Inference, New York: John Wiley & Sons, 185-194 (Kendall and Spearman tests)

[17] Kay R. H. (2008), Exploring the relationship between emotions and the acquisition of computer knowledge, Computers & Education, 50, 1269-1283

[18] Koenker R., and Zeileis A. (2007), Reproducible econometric research (a critical review of the state of the art), Research Report Series, Department of Statistics and Mathematics, Wirtschaftsuniversität Wien

[19] Leisch F. (2003), Sweave and beyond: computations on text documents, Proceedings of the 3rd International Workshop on Distributed Statistical Computing

[20] Lewis J. R. (1993), IBM computer usability satisfaction questionnaires: psychometric evaluation and instructions for use, IBM Corporation, Technical Report 54.786

[21] Lucy D., Aykroyd R. G. and Pollard A. M. (2002), Non-parametric calibration for age estimation, Applied Statistics, 51(2), 183-196

[22] Meelissen M. R. M., Drent M. (2008), Gender differences in computer attitudes: does the school matter?, Computers in Human Behavior, 24, 969-985

[23] Miller J. B., (n.d.), Examining the interplay between constructivism and different learning styles, retrieved October 20, 2005 from http://www.stat.auckland.ac.nz/~iase/publications/1/8a4_mill.pdf

[24] Mvududu N. (2003), A cross-cultural study of the connection between students' attitudes toward statistics and the use of constructivist strategies in the course, Journal of Statistics Education, 11(3)

[25] O'Dwyer L. M., Russell M., Bebell D., Seeley K.
(2008), Examining the relationship between students' mathematics test scores and computer use at home and at school, Journal of Technology, Learning, and Assessment, 6(5)

[26] Peng R. D., Dominici F., and Zeger S. L. (2006), Reproducible epidemiologic research, American Journal of Epidemiology, 163(9), 783-789

[27] Poelmans S., Wessa P., Milis K., Bloemen E., and Doom C. (2008), Usability and acceptance of e-learning in statistics education, based on the compendium platform, Proceedings of the International Conference of Education, Research and Innovation, International Association of Technology, Education and Development

[28] Romero C., Ventura S., Garcia E. (2008), Data mining in course management systems: Moodle case study and tutorial, Computers & Education, 51, 368-384

[29] Schwab M., Karrenbach N., and Claerbout J. (2000), Making scientific computations reproducible, Computing in Science & Engineering, 2(6), 61-67

[30] Smith E. (1999), Social constructivism, individual constructivism and the role of computers in mathematics education, Journal of Mathematical Behavior, 17(4)

[31] Statistical computations at FreeStatistics.org (2008a), Office for Research Development and Education, retrieved Mon, 30 Jun 2008, URL http://www.freestatistics.org/blog/date/2008/jun/30/t1214840420q0fyankop4x9ebf.htm

[32] Statistical computations at FreeStatistics.org (2008b), Office for Research Development and Education, retrieved Mon, 30 Jun 2008, URL http://www.freestatistics.org/blog/date/2008/jun/30/t12148409608o0dnj2k4s04jil.htm

[33] Statistical computations at FreeStatistics.org (2008c), Office for Research Development and Education, retrieved Mon, 30 Jun 2008, URL http://www.freestatistics.org/blog/date/2008/jun/30/t1214841152sn6jlyhgseclgqm.htm

[34] Sun P., Tsai R. J., Finger G., Chen Y., Yeh D. (2008), What drives a successful e-learning?
An empirical investigation of the critical factors influencing learner satisfaction, Computers & Education, 50, 1183-1202

[35] von Glasersfeld E. (1987), Learning as a constructive activity, Problems of Representation in the Teaching and Learning of Mathematics, Hillsdale, NJ: Lawrence Erlbaum Associates, 3-17

[36] Wessa P. (2008a), A framework for statistical software development, maintenance, and publishing within an open-access business model, Computational Statistics, www.springerlink.com (DOI 10.1007/s00180-008-0107-y)

[37] Wessa P. (2008b), Learning statistics based on the compendium and reproducible computing, Proceedings of the International Conference on Education and Information Technology, Berkeley, San Francisco, USA

[38] Wessa P. (2008c), How reproducible research leads to non-rote learning within a socially constructivist e-learning environment, Proceedings of the 7th European Conference on E-Learning, Cyprus

[39] Wessa P. (2008d), Measurement and control of statistics learning processes based on constructivist feedback and reproducible computing, Proceedings of the 3rd International Conference on Virtual Learning, Constanta, Romania

[40] Wessa P. (2009a), Discovering computer-assisted learning processes based on objective exam score transformations, Proceedings of the World Congress on Educational Sciences, Cyprus

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 2, pp.
104-117

A Novel Fuzzy ARTMAP Architecture with Adaptive Feature Weights Based on Onicescu's Informational Energy

Răzvan Andonie, Lucian Mircea Sasu, Angel Caţaron

Răzvan Andonie: Computer Science Department, Central Washington University, Ellensburg, USA, and Department of Electronics and Computers, Transylvania University of Braşov, Romania. E-mail: andonie@cwu.edu

Angel Caţaron: Department of Electronics and Computers, Transylvania University of Braşov, Romania. E-mail: cataron@vega.unitbv.ro

Lucian Mircea Sasu: Applied Informatics Department, Transylvania University of Braşov, Romania. E-mail: lmsasu@unitbv.ro

Abstract: Fuzzy ARTMAP with Relevance factor (FAMR) is a Fuzzy ARTMAP (FAM) neural architecture with the following property: each training pair has a relevance factor assigned to it, proportional to the importance of that pair during the learning phase. Using a relevance factor adds more flexibility to the training phase, allowing ranking of sample pairs according to the confidence we have in the information source or in the pattern itself. We introduce a novel FAMR architecture: FAMR with Feature Weighting (FAMRFW). In the first stage, the training data features are weighted. In our experiments, we use a feature weighting method based on Onicescu's informational energy (IE). In the second stage, the obtained weights are used to improve FAMRFW training. The effect of this approach is that category dimensions in the direction of relevant features are decreased, whereas category dimensions in the direction of non-relevant features are increased. Experimental results, performed on several benchmarks, show that feature weighting can improve the classification performance of the general FAMR algorithm.

Keywords: Fuzzy ARTMAP, feature weighting, LVQ, Onicescu's informational energy.

1 Introduction

The FAM architecture is based upon the adaptive resonance theory (ART) developed by Carpenter and Grossberg [7].
FAM neural networks can analyze and classify noisy information with fuzzy logic, and can avoid the plasticity-stability dilemma of other neural architectures. The FAM paradigm is prolific and there are many variations of Carpenter et al.'s [7] initial model: ART-EMAP [9], dARTMAP [8], Boosted ARTMAP [27], Fuzzy ARTVar [12], Gaussian ARTMAP [28], PROBART [21], PFAM [20], Ordered FAM [11], and µARTMAP [14]. The FAM model has been incorporated in the MIT Lincoln Lab system for data mining of geospatial images because of its computational capabilities for incremental learning, fast stable learning, and visualization [25].

(Copyright © 2006-2009 by CCC Publications)

One way to improve the FAM algorithm is to generalize the distance measure between vectors [10]. Based on this principle, we introduced in previous work [2] a novel FAM architecture with distance measure generalization: FAM with Feature Weighting (FAMFW). Feature weighting is a feature importance ranking algorithm where weights, not only ranks, are obtained. In our approach, training data feature weights were first generated. Next, these weights were used by the FAMFW network, generalizing the distance measure. Potentially, any feature weighting method can be used, and this makes the FAMFW very general.

Feature weighting can be achieved, for example, by LVQ-type methods. Several such techniques have been recently introduced; they combine LVQ classification with feature weighting. In one of these approaches, RLVQ (Relevance LVQ), feature weights were determined to generalize the LVQ distance function [16]. A modification of the RLVQ model, GRLVQ (Generalized RLVQ), has been proposed in [18]. The SRNG (Supervised Relevance Neural Gas) algorithm [17] combines the NG (Neural Gas) algorithm [22] and the GRLVQ.
NG [22] is a neural model applied to the task of vector quantization, using a neighborhood cooperation scheme and a soft-max adaptation rule similar to the Kohonen feature map. In [1], we introduced the Energy Supervised Relevance Neural Gas (ESRNG) feature weighting algorithm. The ESRNG is based on the SRNG model. It maximizes Onicescu's IE as a criterion for computing the weights of input features. The ESRNG is the feature weighting algorithm we used in [2], in combination with our FAMFW algorithm.

FAMR is a FAM incremental learning system introduced in our previous work [4]. During the learning phase, each sample pair is assigned a relevance factor proportional to the importance of that pair. The FAMR has been successfully applied to classification, probability estimation, and function approximation. In FAMR, the relevance factor of a training pair may be user-defined, or computed, and is proportional to the importance of the respective pair in the learning process.

In the present paper, we focus on the FAMR neural network, the ESRNG feature weighting algorithm, and the distance measure generalization principle. We contribute the following:

1. We introduce a novel FAMR architecture with distance measure generalization: FAMR with Feature Weighting (FAMRFW), adapting the FAMFW model for the FAMR case.

2. Compared to [2], we include new experiments on standard benchmarks.

We first introduce the basic FAM and FAMR notations (Section 2) and the ESRNG feature weighting algorithm (Section 3). In Section 4, we describe the new FAMRFW algorithm, which uses a weighted distance measure. Section 5 contains experimental results performed with the FAMRFW method. Section 6 contains the final remarks.

2 A brief description of the FAMR

We will summarize the standard FAM architecture and the FAMR learning mechanism, which differentiates it from the standard FAM.

2.1 The FAM architecture

A detailed FAM description can be found in Carpenter et al.'s
seminal paper [7], but more simplified presentations are given in [26] and [19].

[Figure 1: Fuzzy ARTMAP architecture [7].]

The FAM architecture consists of a pair of fuzzy ART modules, ARTa and ARTb, connected by an inter-ART module called MapField (see Fig. 1). ARTa and ARTb are used for coding the input and output patterns, respectively, and MapField allows mapping between inputs and outputs.

The ARTa module contains the input layer $F_1^a$ and the competitive layer $F_2^a$. A preprocessing layer $F_0^a$ is also added before $F_1^a$. Analogous layers appear in ARTb. The initial input vectors have the form $\mathbf{a} = (a_1, \ldots, a_n) \in [0,1]^n$. A data preprocessing technique called complement coding is performed by the $F_0^a$ layer in order to avoid node proliferation. Each input vector $\mathbf{a}$ produces the normalized vector $A = (\mathbf{a}, \mathbf{1} - \mathbf{a})$ whose $L_1$ norm is constant: $|A| = n$.

Let $m_a$ be the number of nodes in $F_1^a$ and $n_a$ be the number of nodes in $F_2^a$. Due to the preprocessing step, $m_a = 2n$. The weight vector between $F_1^a$ and $F_2^a$ is $w^a$. Each $F_2^a$ node represents a class of inputs grouped together, denoted as a category. Each $F_2^a$ category has its own set of adaptive weights stored in the form of a vector $w_j^a$, $j = 1, \ldots, n_a$, whose geometrical interpretation is a hyper-rectangle inside the unit box. Similar notations are used for the ARTb module. For a classification problem, the class index is the same as the category number in $F_2^b$; thus ARTb can be substituted with a vector.

The MapField module allows FAM to perform associations between ARTa and ARTb categories. The number of nodes in MapField is equal to the number of nodes in $F_2^b$. Each node $j$ from $F_2^a$ is linked to each node from $F_2^b$ via a weight vector $w_j^{ab}$. The learning algorithm is sketched below. For each training pattern, the vigilance parameter $\rho_a$ is set equal to its baseline value, and all nodes are uninhibited.
For each (preprocessed) input $A$, a fuzzy choice function is used to get the response of each $F_2^a$ category:

$$T_j(A) = \frac{|A \wedge w_j^a|}{\alpha_a + |w_j^a|}, \quad j = 1, \ldots, n_a \quad (1)$$

Let $J$ be the node with the highest value computed as in (1). If the resonance condition from Eq. (2) is not fulfilled:

$$\rho(A, w_J^a) = \frac{|A \wedge w_J^a|}{|A|} \geq \rho_a, \quad (2)$$

then the $J$th node is inhibited such that it will not participate in further competitions for this pattern, and a new search for a resonant category is performed. This might lead to the creation of a new category in ARTa.

A similar process occurs in ARTb; let $K$ be the winning node from ARTb. The $F_2^b$ output vector is set to:

$$y_k^b = \begin{cases} 1, & \text{if } k = K \\ 0, & \text{otherwise} \end{cases} \quad k = 1, \ldots, n_b \quad (3)$$

An output vector $x^{ab}$ is formed in MapField: $x^{ab} = y^b \wedge w_J^{ab}$. A MapField vigilance test controls the match between the predicted vector $x^{ab}$ and the target vector $y^b$:

$$\frac{|x^{ab}|}{|y^b|} \geq \rho_{ab} \quad (4)$$

where $\rho_{ab} \in [0,1]$ is a MapField vigilance parameter. If the test in (4) is not passed, then a sequence of steps called match tracking is initiated (the vigilance parameter $\rho_a$ is increased and a new resonant category is sought in ARTa); otherwise, learning occurs in ARTa, ARTb, and MapField:

$$w_J^{a(\text{new})} = \beta_a \left( A \wedge w_J^{a(\text{old})} \right) + (1 - \beta_a) w_J^{a(\text{old})} \quad (5)$$

(and analogously in ARTb), and $w_{Jk}^{ab} = \delta_{kK}$, where $\delta_{ij}$ is Kronecker's delta. With respect to $\beta_a$, there are two learning modes: i) fast learning, with $\beta_a = 1$ for the entire training process; and ii) fast-commit and slow-recode learning, which corresponds to setting $\beta_a = 1$ when creating a new node and $\beta_a < 1$ for subsequent learning.

2.2 The FAMR learning mechanism

The main difference between the FAMR and the original FAM is the updating scheme of the $w_{jk}^{ab}$ weights.
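Before turning to the FAMR-specific update, the ARTa search of equations (1)-(5) — fuzzy AND as the component-wise minimum, the choice function, the resonance test, and fast learning — can be sketched as follows, with hypothetical category weights:

```python
# Sketch of the ARTa dynamics in equations (1)-(5); all vectors are made up.

def fuzzy_and(u, v):
    return [min(x, y) for x, y in zip(u, v)]

def norm1(u):
    return sum(u)

def choice(A, w, alpha=0.001):                       # eq. (1)
    return norm1(fuzzy_and(A, w)) / (alpha + norm1(w))

def resonates(A, w, rho):                            # eq. (2)
    return norm1(fuzzy_and(A, w)) / norm1(A) >= rho

def learn(A, w, beta=1.0):                           # eq. (5); beta=1 is fast learning
    return [beta * m + (1.0 - beta) * x for m, x in zip(fuzzy_and(A, w), w)]

A = [0.2, 0.9, 0.8, 0.1]          # complement-coded input (hypothetical)
w1 = [0.3, 0.8, 0.9, 0.2]         # two hypothetical categories
w2 = [0.9, 0.1, 0.1, 0.9]
best = max([w1, w2], key=lambda w: choice(A, w))     # w1 wins the competition
w_new = learn(A, best)            # fast learning shrinks the category toward A ∧ w1
```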
The FAMR uses the following iterative updating [4]:

$$w_{jk}^{ab(\text{new})} = \begin{cases} w_{jk}^{ab(\text{old})}, & j \neq J \\ w_{jk}^{ab(\text{old})} + \frac{q_t}{q_J^{\text{new}}}\left(1 - w_{jk}^{ab(\text{old})}\right), & j = J,\; k = K \\ w_{jk}^{ab(\text{old})}\left(1 - \frac{q_t}{q_J^{\text{new}}}\right), & j = J,\; k \neq K \end{cases} \quad (6)$$

where $q_t$ is the relevance assigned to the $t$th input pattern ($t = 1, 2, \ldots$) and $q_J^{\text{new}} = q_J^{\text{old}} + q_t$. The relevance $q_t$ is a real, positive, finite number directly proportional to the importance of the experiment considered at step $t$. This $w_{jk}^{ab}$ approximation is a correct biased estimator of the posterior probability $P(k|j)$, the probability of selecting the $k$th ARTb category after having selected the $j$th ARTa category [4].

Let $q$ be the vector $[q_1 \ldots q_{n_a}]$; initially, each $q_j$ ($1 \leq j \leq n_a$) has the same initial value $q_0$. $n_a$ and $n_b$ are the numbers of categories in ARTa and ARTb, respectively; these are initialized at 0. For incremental learning of one training pair, the FAMR MapField learning scheme is described by Algorithm 1. The vigilance test is:

$$n_b\, w_{JK}^{ab} \geq \rho_{ab} \quad (7)$$

For a clearer presentation, and to avoid confusion between vector relevancies and feature weights, we will assume in all our following experiments that relevancies are set to a constant positive value. Since we do not actually use relevancies, is this FAMR equivalent to the standard FAM model, as introduced in [7]? The answer is no because, unlike the standard FAM: i) the FAMR accepts one-to-many relationships; and ii) the FAMR is a conditional probability estimator, with an estimated convergence rate computed in [4].

Algorithm 1: The $t$-th iteration of the FAMR MapField algorithm [4].

Step 1. Accept the $t$-th vector pair $(a, b)$ with relevance factor $q_t$.

Step 2. Find a resonant category in ARTb or create a new one.
if $|b \wedge w_k^b|/|b| < \rho_b$ for $k = 1, \ldots, n_b$ then
    $n_b = n_b + 1$  {add a new category to ARTb}
    $K = n_b$
    if $n_b > 1$ then
        $w_{jK}^{ab} = \frac{q_0}{n_b q_j}$, for $j = 1, \ldots, n_a$  {append a new component to $w_j^{ab}$}
        $w_{jk}^{ab} = w_{jk}^{ab} - \frac{w_{jK}^{ab}}{n_b - 1}$, for $k = 1, \ldots$
, $K - 1$; $j = 1, \ldots, n_a$  {normalize}
    end if
else
    let $K$ be the index of the ARTb category passing the resonance condition and having the maximum activation function.
end if

Step 3. Find a resonant category in ARTa or create a new one.
if $|a \wedge w_j^a|/|a| < \rho_a$ for $j = 1, \ldots, n_a$ then
    $n_a = n_a + 1$  {add a new category to ARTa}
    $J = n_a$
    $q_J = q_0$  {append a new component to $q$}
    $w_{Jk}^{ab} = 1/n_b$, for $k = 1, \ldots, n_b$  {append a new row to $w^{ab}$}
else
    let $J$ be the index of the ARTa category passing the resonance condition and having the maximum activation function.
end if

Step 4. $J$, $K$ are winners or newly added nodes. Check if match tracking applies.
if vigilance test (7) is passed then  {learn in MapField}
    $q_J = q_J + q_t$
    $w_{JK}^{ab} = w_{JK}^{ab} + \frac{q_t}{q_J}\left(1 - w_{JK}^{ab}\right)$
    $w_{Jk}^{ab} = w_{Jk}^{ab}\left(1 - \frac{q_t}{q_J}\right)$, for $k = 1, \ldots, n_b$, $k \neq K$
else
    perform match tracking and restart from Step 3
end if

3 The ESRNG feature weighting algorithm

We use the ESRNG feature weighting algorithm to compute the generalized distance measure in the FAMRFW. Details of the ESRNG algorithm can be found in [1]. It is based on Onicescu's IE and approximates the unilateral dependency of random variables by Parzen windows. Before outlining the principal steps of the ESRNG method, we review the basic properties of the IE.

3.1 Onicescu's informational energy

For a discrete random variable $X$ with probabilities $p_k$, the IE was introduced in 1966 by Octav Onicescu [24] as $E(X) = \sum_{k=1}^{n} p_k^2$. For a continuous random variable $Y$, the IE was defined by Silviu Guiaşu [15]:

$$E(Y) = \int_{-\infty}^{+\infty} p^2(y)\,dy,$$

where $p(y)$ is the probability density function. For a continuous random variable $Y$ and a discrete random variable $C$, the conditional IE is defined as:

$$E(Y|C) = \int_y \sum_{m=1}^{M} p(c_m)\, p^2(y|c_m)\,dy.$$

In order to study the interaction between two random variables $X$ and $Y$, the following measure of unilateral dependency was introduced by Andonie et al.
[3]:

$$o(Y, X) = E(Y|X) - E(Y)$$

with the following properties:
1. $o$ is not symmetrical with respect to its arguments;
2. $o(Y, X) \ge 0$, and the equality holds iff $Y$ and $X$ are independent;
3. $o(Y, X) \le 1 - E(Y)$, and the equality holds iff $Y$ is completely dependent on $X$.

this measure quantifies the unilateral dependence characterizing $Y$ with respect to $X$ and corresponds to the amount of information detained by $X$ about $Y$.

3.2 the feature weighting procedure

esrng is an online algorithm which adapts a set of lvq reference vectors by minimizing the quantization error. at each iteration, it also adapts the input vector feature weights. the core of the method is based on the maximization of the $o(Y, C)$ measure. to connect input vector $x_i$ with its class $j$, represented by vector $w_j$, we use a simple transform. we consider a continuous random variable $Y$ with its samples $y_i = \lambda I (x_i - w_j)$, $i = 1, \dots, n$, where:
• $\lambda$ is the vector of weights;
• $x_i$, $i = 1, \dots, n$, are the training vectors, each of them from one of the classes $c_1, c_2, \dots, c_M$;
• $w_j$, $j = 1, \dots, p$, are the lvq determined class prototypes.

assuming that the $M$ class labels are samples of a discrete random variable denoted by $C$, we can use gradient ascent to iteratively update the feature weights by maximizing $o(Y, C)$:

$$\lambda^{(t+1)} = \lambda^{(t)} + \alpha \sum_{i=1}^{n} \frac{\partial o(Y, C)}{\partial y_i}\, I\, (x_i - w_j).$$

from the definition of $o(Y, X)$, we obtain:

$$o(Y, C) = E(Y|C) - E(Y) = \sum_{p=1}^{M} \frac{1}{p(c_p)} \int_Y p^2(y, c_p)\, dy - \int_Y p^2(y)\, dy. \qquad (8)$$

this expression involves a considerable computational effort. therefore, we approximate the probability densities from the integrals using the parzen windows estimation method. the multidimensional gaussian kernel is [13]:

$$G(y, \sigma^2 I) = \frac{1}{(2\pi)^{\frac{d}{2}} \sigma^d}\, e^{-\frac{y^T y}{2\sigma^2}} \qquad (9)$$

where $d$ is the dimension of the definition space of the kernel, $I$ is the identity matrix, and $\sigma^2 I$ is the covariance matrix.
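as a concrete illustration, eq. (8) can be estimated from a labelled one-dimensional sample using the gaussian kernel (9). the following is a minimal numpy sketch for a fixed weight vector (it does not perform the gradient update); the function names `gauss`, `pairwise_kernel_sum` and `o_estimate` are ours, not from [1]:

```python
import numpy as np

def gauss(y, var):
    # multidimensional gaussian kernel G(y, var*I), eq. (9), applied along the last axis
    d = y.shape[-1]
    return np.exp(-(y * y).sum(-1) / (2 * var)) / ((2 * np.pi) ** (d / 2) * var ** (d / 2))

def pairwise_kernel_sum(a, b, var):
    # sum over all pairs of G(a_k - b_l, var*I) -- the building block of the parzen estimates
    diff = a[:, None, :] - b[None, :, :]
    return gauss(diff, var).sum()

def o_estimate(samples, labels, sigma):
    """parzen-window estimate of o(Y, C) = E(Y|C) - E(Y), eq. (8)."""
    n = len(samples)
    # E(Y): (1/n^2) sum_{k,l} G(y_k - y_l, 2*sigma^2 I)
    e_y = pairwise_kernel_sum(samples, samples, 2 * sigma ** 2) / n ** 2
    # E(Y|C): sum over classes of (1/p(c_p)) * (1/n^2) sum over within-class pairs
    e_y_c = 0.0
    for c in np.unique(labels):
        grp = samples[labels == c]
        p_c = len(grp) / n
        e_y_c += pairwise_kernel_sum(grp, grp, 2 * sigma ** 2) / (n ** 2 * p_c)
    return e_y_c - e_y
```

for two well-separated classes the estimate is strictly positive, in line with property 2 of $o(Y, X)$.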
we approximate the probability density $p(y)$ by replacing each data sample $y_i$ with a gaussian kernel, and averaging the obtained values:

$$p(y) = \frac{1}{n} \sum_{i=1}^{n} G(y - y_i, \sigma^2 I).$$

we denote by $m_p$ the number of training samples from class $c_p$. we have:

$$\int_Y p^2(y, c_p)\, dy = \frac{1}{n^2} \sum_{k=1}^{m_p} \sum_{l=1}^{m_p} G(y_{pk} - y_{pl}, 2\sigma^2 I)$$

and

$$\int_Y p^2(y)\, dy = \frac{1}{n^2} \sum_{k=1}^{n} \sum_{l=1}^{n} G(y_k - y_l, 2\sigma^2 I),$$

where $y_{pk}$, $y_{pl}$ are two training samples from class $c_p$, whereas $y_k$, $y_l$ represent two training samples from any class. equation (8) can be rewritten, and we obtain the final esrng update formula of the feature weights:

$$\lambda^{(t+1)} = \lambda^{(t)} - \alpha \frac{1}{4\sigma^2}\, G(y_1 - y_2, 2\sigma^2 I) \cdot (y_2 - y_1) I \cdot (x_1 - w_{j(1)} - x_2 + w_{j(2)}),$$

where $w_{j(1)}$ and $w_{j(2)}$ are the closest prototypes to $x_1$ and $x_2$, respectively. the esrng algorithm has the following general steps:
1. update the reference vectors using the srng scheme.
2. update the feature weights.
3. repeat steps 1 and 2 for all training set samples.

this algorithm uses a generalized euclidean distance. the updating formula for the reference vectors can be found in [1]; we will not explicitly use this formula in the present paper. the esrng algorithm generates numeric values assigned to each input feature, quantifying their importance in the classification task: the most relevant feature receives the highest numeric value. we use these factors as feature weights in the famrfw algorithm.

4 famrfw – a novel neural model

the famrfw is a famr architecture with a generalized distance measure. for an arta category $w_j$, we define its size $s(w_j)$:

$$s(w_j) = n - |w_j| \qquad (10)$$

and the distance to a normalized input $a$:

$$dis(a, w_j) = |w_j| - |a \wedge w_j| = \sum_{i=1}^{n} d_{ji}, \qquad (11)$$

where $(d_{j1}, \dots, d_{jn}) = w_j - a \wedge w_j$.
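the size (10) and distance (11) are straightforward to compute with the fuzzy and (componentwise minimum). a minimal sketch, with our own function names, assuming the input $a$ and category weight $w_j$ are given as numpy arrays:

```python
import numpy as np

def size(w):
    # s(w_j) = n - |w_j|, eq. (10), where |.| is the l1 norm
    return len(w) - w.sum()

def dis(a, w, lam=None):
    # dis(a, w_j) = |w_j| - |a ^ w_j| = sum_i d_ji, eq. (11);
    # an optional per-feature weight vector lam turns the sum into a weighted one
    d = w - np.minimum(a, w)      # fuzzy and = componentwise minimum
    if lam is None:
        lam = np.ones_like(d)     # all-ones weights recover the plain distance
    return float((lam * d).sum())
```

with all weights equal to one, the weighted sum reduces to the plain famr distance.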
in [10] it is shown that:

$$T_j(a) = \frac{n - s(w_j) - dis(a, w_j)}{n - s(w_j) + \alpha_a} \qquad (12)$$

$$\rho(a, w_j^a) = \frac{n - s(w_j) - dis(a, w_j)}{n} \qquad (13)$$

a generalization of $dis(a, w_j)$ is the weighted distance:

$$dis(a, w_j; \lambda) = \sum_{i=1}^{n} \lambda_i d_{ji}, \qquad (14)$$

where $\lambda = (\lambda_1, \dots, \lambda_n)$, and $\lambda_i \in [0, n]$ is the weight associated with the $i$th feature. we impose the constraint $|\lambda| = n$. for $\lambda_1 = \dots = \lambda_n = 1$, we obtain in particular the famr. charalampidis et al. [10] used the following weighted distance:

$$dis(x, w_j | \lambda, ref) = \sum_{i=1}^{n} \frac{(1 - \lambda) l_j^{ref} + \lambda}{(1 - \lambda) l_{ji} + \lambda}\, d_{ji}, \qquad (15)$$

where $l_j^{ref}$ is a function of category $j$'s hyper-rectangle side lengths, and $\lambda$ is a scalar in $[0, 1]$. in our case, the function $dis(a, w_j; \lambda)$ does not depend on the sides of the category created during learning, but on the computed feature weights. this makes our approach very different from the one in [10]. the effect of using distance $dis(a, w_j; \lambda)$ for a bidimensional category is depicted in fig. 2(a). the hexagonal shapes represent the points situated at constant distance from the category. these shapes are flattened in the direction of the feature with a larger weight and elongated in the direction of the feature with a smaller weight. this is in accordance with the following intuition: the category dimension in the direction of a relevant feature should be smaller than the category dimension in the direction of a non-relevant feature. hence, we may expect that more categories will cover the relevant directions than the non-relevant ones.

figure 2: geometric interpretation of constant distance when using $dis(a, w_j; \lambda)$ for bidimensional patterns. (a) bounds for constant weighted distance $dis(a, w_j; \lambda)$ for various values of $\lambda$; the rectangle in the middle represents a category. (b) bounds for constant distance $dis(a, w_j; \lambda)$ for the null feature weight; the rectangle in the middle represents the category.

for a null feature weight (fig. 2(b)), the bounds are reduced to parallel lines on both sides of the rectangle representing the category.
in this extreme case, the discriminative distance is the one along the remaining feature dimension. this is another major difference between our approach and the one in [10], where, while using function $dis(x, w_j | \lambda, ref)$, the contours of a constant weighted distance are inside some limiting hexagons. in our method, the contour is insensitive to the actual value of the null weighted feature.

5 experimental results

we test the famrfw on several standard classification tasks, all from the uci machine learning repository [5]. the experiments are performed on the famr and the famrfw architectures. the two famrfw stages are: i) the λ feature weights are obtained by the esrng algorithm; ii) these weights are used both for training and testing the famr. a nice feature of the fam architectures and the esrng algorithm is the on-line (incremental) learning capability, i.e., the training set is processed only once. this type of learning is especially useful when dealing with very large datasets, since it can significantly reduce the computational overhead. for famr training and for both famrfw stages we use on-line learning.

5.1 methodology

for each experiment, we use three-way data splits (i.e., the available dataset is divided into training, validation, and test sets) and random subsampling. random subsampling is a faster, simplified version of k-fold cross validation:
1. the dataset is randomized.
2. the first 60% of the dataset is used for training and the next 20% for validation (i.e., for tuning the model parameters). the following parameters are optimized using a simple "grid-search" for $\rho_a, \rho_{ab} \in \{0, 0.1, \dots, 0.9\}$ and $\beta_a \in \{0, 0.1, \dots, 1\}$. the goal is to allow both fast learning and fast-commit slow-recode. the optimal parameter values are the ones producing the highest pcc and the lowest number of arta categories.
3. the network with optimal parameters is trained with the joint training + validation data.
4. the last 20% of the dataset is used for testing. as a result, the percent of correct classification (pcc) and the number of generated arta categories are computed.
5. repeat this procedure six times.

the $\rho_a$ value, optimized during training/validation, controls the number of generated arta categories. after training/validation, this number does not change. for $\rho_a > 0$, some test vectors may be rejected (i.e., not classified). in all our experiments, after the arta categories were generated, we set $\rho_a = 0$ for testing. this has the following positive effects:
• all test vectors are necessarily classified.
• we obtain experimentally better classification results, both for the famr and the famrfw, compared to the ones with optimized $\rho_a$ values. this is shown in table 1, for all considered classification tasks. the feature weight values in the famrfw are the ones mentioned in the following sections.

table 1: average pcc test set results using the optimized ρa (computed in the validation phase) vs. using ρa = 0.

                    famr                        famrfw
                    optimized ρa    ρa = 0      optimized ρa    ρa = 0
breast cancer       86.54%          91.22%      91.22%          91.22%
balance scale       75.86%          76.53%      75.92%          78.13%
wine recognition    83.33%          84.72%      83.79%          89.35%
ionosphere          85.44%          88.96%      85.91%          89.43%

5.2 breast cancer classification

this dataset (formally called wisconsin diagnostic breast cancer) includes 569 instances. the instances are described by 30 real attributes. the given features are computed from a digitized image of a fine needle aspirate (fna) of a breast mass. the famrfw generated weights are: [0.784, 0.816, 0.795, 2.847, 0.784, 0.784, 0.784, 0.784, 0.784, 0.784, 0.784, 0.784, 0.785, 0.808, 0.784, 0.784, 0.784, 0.784, 0.784, 0.784, 0.784, 0.829, 0.828, 5.047, 0.784, 0.784, 0.784, 0.784, 0.784, 0.784].
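the three-way random subsampling protocol of section 5.1 can be sketched in a few lines. this is a simplified outline with hypothetical names; `evaluate` stands in for training the network with the given parameter triple and returning its validation pcc:

```python
import random
from itertools import product

def three_way_split(data, seed=0):
    """60/20/20 train/validation/test split by random subsampling."""
    data = data[:]                        # keep the caller's list intact
    random.Random(seed).shuffle(data)
    n = len(data)
    i, j = int(0.6 * n), int(0.8 * n)
    return data[:i], data[i:j], data[j:]

def grid_search(evaluate):
    """grid over rho_a, rho_ab in {0, 0.1, ..., 0.9} and beta_a in {0, 0.1, ..., 1}."""
    grid = product([k / 10 for k in range(10)], [k / 10 for k in range(10)],
                   [k / 10 for k in range(11)])
    return max(grid, key=lambda p: evaluate(*p))
```

each of the six repetitions would reshuffle the data with a different seed before splitting.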
in table 2, we observe that the average pcc for the famr and the famrfw is the same, but the famrfw has far fewer arta categories than the famr.

table 2: classification performance for the breast cancer problem.

test no.   famr                          famrfw
           arta categories   pcc         arta categories   pcc
1          61                93.85%      24                87.71%
2          7                 90.35%      7                 93.85%
3          10                95.61%      8                 91.22%
4          39                85.08%      6                 88.59%
5          6                 92.98%      6                 94.73%
6          6                 89.47%      5                 91.22%
average    21.5              91.22%      9.33              91.22%

5.3 balance scale classification

this dataset was generated to model psychological experimental results. each example is classified as having the balance scale tip to the right, tip to the left, or be balanced. the attributes are the left weight, the left distance, the right weight, and the right distance. the correct way to find the class is the greater of (left-distance * left-weight) and (right-distance * right-weight); if they are equal, it is balanced. the set contains 625 patterns, with an uneven distribution of the three classes; each input pattern has 4 features. the esrng generated feature weights are λ = [1.002, 1.113, 0.827, 1.058]. the famrfw has better classification accuracy and fewer arta categories than the famr (table 3).

5.4 wine recognition

the wine recognition data are the results of a chemical analysis of wines grown in the same region in italy, but derived from three different cultivars. the analysis determined the quantities of 13 constituents found in each of the 3 types of wines. the dataset contains 178 instances.

table 3: classification performance for the balance scale problem.

test no.   famr                          famrfw
           arta categories   pcc         arta categories   pcc
1          95                74.4%       53                75.2%
2          70                80.0%       39                80.0%
3          22                78.4%       54                81.6%
4          75                75.2%       44                85.6%
5          125               71.2%       69                72.0%
6          62                80.0%       107               74.4%
average    74.83             76.53%      61                78.13%

the esrng algorithm produced the weights λ = [0.900, 0.757, 0.659, 1.668, 2.349, 0.702, 1.028, 0.668, 0.774, 0.874, 0.666, 0.701, 1.253].
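the balance scale labeling rule described in section 5.3 can be written as a one-line comparison; `balance_class` is a hypothetical helper, shown only to make the class definition concrete:

```python
def balance_class(lw, ld, rw, rd):
    """label rule: compare left-weight*left-distance with right-weight*right-distance."""
    left, right = lw * ld, rw * rd
    if left > right:
        return "L"      # scale tips to the left
    if right > left:
        return "R"      # scale tips to the right
    return "B"          # balanced
```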
the famrfw classification results are better, with fewer generated arta categories (table 4).

table 4: classification performance for the wine recognition problem.

test no.   famr                          famrfw
           arta categories   pcc         arta categories   pcc
1          10                88.88%      6                 86.11%
2          15                97.22%      10                97.22%
3          32                69.44%      11                86.11%
4          17                83.33%      11                86.11%
5          55                80.55%      39                94.44%
6          12                88.88%      8                 86.11%
average    23.5              84.71%      14.16             89.35%

5.5 ionosphere

this binary classification problem starts from collected radar datasets. the data come from 16 high-frequency antennas, targeting the free electrons in the ionosphere. "good" radar returns are those showing evidence of some type of structure in the ionosphere. "bad" returns are those passing through the ionosphere. there are 351 instances and each input pattern has 34 features. the esrng generated λ vector is: [0.551, 0.520, 1.179, 1.168, 1.301, 1.180, 0.940, 1.272, 1.024, 0.903, 0.843, 0.976, 0.870, 0.844, 0.807, 0.877, 0.893, 1.012, 0.994, 1.012, 0.964, 1.061, 1.029, 1.227, 0.978, 1.020, 0.943, 1.027, 1.087, 1.032, 0.978, 1.117, 0.999, 1.374]. on average, the famrfw produced far fewer arta categories than the famr. this time, the famr produced a slightly better pcc (table 5).

table 5: classification performance for the ionosphere problem.

test no.   famr                          famrfw
           arta categories   pcc         arta categories   pcc
1          28                81.69%      8                 90.14%
2          20                81.69%      8                 85.91%
3          17                91.54%      7                 83.09%
4          9                 94.36%      8                 88.73%
5          5                 90.14%      5                 94.36%
6          9                 94.36%      5                 94.36%
average    14.66             88.96%      6.83              89.43%

6 conclusions

according to our experiments, using the feature relevances and the generalized distance measure may improve the classification accuracy of the famr algorithm. in addition, the famrfw uses fewer arta categories, which is an important factor. the number of categories controls the generalization capability and the computational complexity of a fam architecture.
this generalization is a trade-off between overfitting and underfitting the training data. it is good to minimize the number of categories if this does not decrease the classification accuracy too much. the esrng feature weighting algorithm can be replaced by other weighting methods. we have not tested the function approximation capability of the famrfw neural network because the esrng weighting algorithm is presently restricted to classification tasks. lvq methods can be extended to function approximation [23] and we plan to adapt the esrng algorithm in this sense. this would enable us to test the famrfw + esrng procedure on standard function approximation and prediction benchmarks. our approach is at the intersection of two major computational paradigms:
1. carpenter and grossberg's adaptive resonance theory, an advanced distributed model where parallelism is intrinsic to the problem, not just a means to speed up [6].
2. onicescu's informational energy and the unilateral dependency measure. to the best of our knowledge, we are the only ones using onicescu's energy in neural processing systems.

bibliography

[1] r. andonie and a. caţaron. feature ranking using supervised neural gas and informational energy. in proceedings of the ieee international joint conference on neural networks (ijcnn2005), montreal, canada, july 31 – august 4, 2005.
[2] r. andonie, a. caţaron, and l. sasu. fuzzy artmap with feature weighting. in proceedings of the iasted international conference on artificial intelligence and applications (aia 2008), innsbruck, austria, february 11-13, 2008, 91–96.
[3] r. andonie and f. petrescu. interacting systems and informational energy. foundation of control engineering, 11, 1986, 53–59.
[4] r. andonie and l. sasu. fuzzy artmap with input relevances. ieee transactions on neural networks, 17, 2006, 929–941.
[5] a. asuncion and d. j. newman. uci machine learning repository, 2007.
university of california, irvine, school of information and computer sciences, http://www.ics.uci.edu/∼mlearn/mlrepository.html
[6] i. dziţac and b. e. bărbat. artificial intelligence + distributed systems = agents. international journal of computers, communications and control, 4, 2009, 17–26.
[7] g. a. carpenter, s. grossberg, n. markuzon, j. h. reynolds, and d. b. rosen. fuzzy artmap: a neural network architecture for incremental supervised learning of analog multidimensional maps. ieee transactions on neural networks, 3, 1992, 698–713.
[8] g. a. carpenter, b. l. milenova, and b. w. noeske. distributed artmap: a neural network for fast distributed supervised learning. neural networks, 11, 1998, 793–813.
[9] g. a. carpenter and w. ross. art-emap: a neural network architecture for learning and prediction by evidence accumulation. ieee transactions on neural networks, 6, 1995, 805–818.
[10] d. charalampidis, g. anagnostopoulos, m. georgiopoulos, and t. kasparis. fuzzy art and fuzzy artmap with adaptively weighted distances. in proceedings of the spie, applications and science of computational intelligence, aerosense, 2002.
[11] i. dagher, m. georgiopoulos, g. l. heileman, and g. bebis. an ordering algorithm for pattern presentation in fuzzy artmap that tends to improve generalization performance. ieee transactions on neural networks, 10, 1999, 768–778.
[12] i. dagher, m. georgiopoulos, g. l. heileman, and g. bebis. fuzzy artvar: an improved fuzzy artmap algorithm. in proceedings of the ieee world congress on computational intelligence (wcci'98), anchorage, 1998, 1688–1693.
[13] j. c. principe et al. information-theoretic learning. in s. haykin, editor, unsupervised adaptive filtering. wiley, new york, 2000.
[14] e. gomez-sanchez, y. a. dimitriadis, j. m. cano-izquierdo, and j. lopez-coronado. µartmap: use of mutual information for category reduction in fuzzy artmap. ieee transactions on neural networks, 13, 2002, 58–69.
[15] s. guiaşu. information theory with applications. mcgraw-hill, new york, 1977.
[16] b. hammer, d. schunk, t. bojer, and t. k. von toschanowitz. relevance determination in learning vector quantization. in proceedings of the european symposium on artificial neural networks (esann 2001), bruges, belgium, 2001, 271–276.
[17] b. hammer, m. strickert, and t. villmann. supervised neural gas with general similarity measure. neural processing letters, 21, 2005, 21–44.
[18] b. hammer and t. villmann. generalized relevance learning vector quantization. neural networks, 15, 2002, 1059–1068.
[19] c. p. lim and r. harrison. art-based autonomous learning systems: part i: architectures and algorithms. in l. c. jain, b. lazzerini, and u. halici, editors, innovations in art neural networks. springer, 2000.
[20] c. p. lim and r. f. harrison. an incremental adaptive network for on-line supervised learning and probability estimation. neural networks, 10, 1997, 925–939.
[21] s. marriott and r. f. harrison. a modified fuzzy artmap architecture for the approximation of noisy mappings. neural networks, 8, 1995, 619–641.
[22] t. m. martinetz, s. g. berkovich, and k. j. schulten. neural-gas network for vector quantization and its application to time-series prediction. ieee transactions on neural networks, 4, 1993, 558–569.
[23] s. min-kyu, j. murata, and k. hirasawa. function approximation using lvq and fuzzy sets. in proceedings of the ieee international conference on systems, man, and cybernetics, tucson, az, 2001, 1442–1447.
[24] o. onicescu. théorie de l'information. énergie informationnelle. c. r. acad. sci. paris, ser. a–b, 263, 1966, 841–842.
[25] o. parsons and g. a. carpenter. artmap neural networks for information fusion and data mining: map production and target recognition methodologies. neural networks, 16, 2003, 1075–1089.
[26] m.-t. vakil-baghmisheh and n. pavešić.
a fast simplified fuzzy artmap network. neural processing letters, 17, 2003, 273–316.
[27] s. j. verzi, g. l. heileman, m. georgiopoulos, and m. j. healy. boosted artmap. in proceedings of the ieee world congress on computational intelligence (wcci'98), 1998, 396–400.
[28] j. williamson. gaussian artmap: a neural network for fast incremental learning of noisy multidimensional maps. neural networks, 9, 1996, 881–897.

international journal of computers communications & control, issn 1841-9836, 9(4):510-523, august, 2014.

study on directed trust graph based recommendation for e-commerce system
h.d. zhong, s. zhang, y. wang, y. shu

haidong zhong, institute of modern logistics, zhejiang wanli university, no. 8 south qianhu rd., ningbo, zhejiang, 315100, pr china, zhong_hai_dong@163.com
shaozhong zhang, institute of electronics and information, zhejiang wanli university, no. 8 south qianhu rd., ningbo, zhejiang, 315100, pr china, dlut_z88@163.com
yanling wang, institute of modern logistics, zhejiang wanli university, no. 8 south qianhu rd., ningbo, zhejiang, 315100, pr china, yanling_wang1@yahoo.com.cn
yonggang shu*, school of tourism and city management, zhejiang gongshang university, xuezheng street, hangzhou, 310018, pr china
*corresponding author: zhd_1981@163.com

abstract: automated recommender systems have played a more and more important role in marketing and ever increasingly booming e-commerce systems. they provide useful predictions and personalized recommendations according to customers' characteristics and a variety of large and complex product offerings. among these recommendation technologies, collaborative filtering (cf) has proven to be one of the most successful recommendation methods, and has been widely used in many e-commerce systems. the success of cf recommendation depends mainly on locating similar neighbors to get recommendation items.
however, many scholars have found that the process of finding similar neighbors often fails, due to some inherent weaknesses of cf based recommendation. in view of this, we propose a trust feedback recommendation algorithm based on a directed trust graph (dtg), which is able to propagate trust relationships. in our approach, there is no need to compute similarity between users; instead, the trust relations between them are used to conduct the prediction calculation. based on the analysis of human trust perception, we incorporate this process into our recommendation algorithm. experimental evaluation on real-life epinions datasets shows the effectiveness and practicability of our approach.
keywords: trust, recommendation, graph, e-commerce, feedback.

1 introduction

with the fast development of networking systems, an ever-increasing number of merchants are attracted to swarm into e-commerce all over the world [1]. accordingly, the internet is changing from a generally simple information exchange and extraction tool into the biggest virtualized market space, presenting a great number of commercial services, ranging from electronic web-stores, on-line booking and service centers to other social services [2]. with the popularization of the internet, the amount of available information is growing exponentially, and e-commerce system structures become more complicated as they provide more and more choices for users [3,4]. under these circumstances, to perform the transaction tasks typically involved in e-commerce, customers often have to spend a large amount of time navigating among thousands of web pages to explore desired products and make their purchases.
on the other hand, electronic business suppliers face the problem of promoting their commodities to their potential customers in an effective way, considering their preferences, habits and other personalized characteristics. to deal with this challenge, researchers have advanced recommendation approaches which automatically analyze and mine e-commerce system visitors' trading and browsing data to filter web page information, classify newsgroup messages, and recommend valuable merchandise items [5]. recommender systems, one of the most important computer-based intelligent approaches for finding the most appropriate services or goods among a large number of products, have proved to be important tools that overcome information overload by sifting through large sets of data and recommending information relevant to the user [6–10]. typically, in an e-commerce environment a recommender system analyzes trading data between consumers, sellers and items to find associations among them, and the items bought by similar users are presented as recommendations. using this technology, some e-commerce systems, such as amazon.com, ebay.com and netflix.com, are reported to have enhanced e-commerce sales by transforming e-commerce system browsers into buyers, increasing cross-selling and building customer loyalty [11]. as early as the early 1990s, intensive studies were conducted on recommender systems, and many scholars regard them as knowledge discovery in database (kdd) systems or electronic agent systems [12–14]. up to now, the existing recommendation methods can be generally classified as content-based, collaborative, knowledge-based, demographic, and utility-based [15–17], among which collaborative filtering based personalized recommendation has proved to be one of the most successfully used technologies [15].
in the newer, narrower sense, collaborative filtering builds correlations between pairs of items by making automatic predictions (filtering) about the interests of a user, and then figures out recommendations by finding items with high similarity to many users (collaborating). the tacit assumption of the collaborative filtering approach is that if a person a has the same opinion as a person b on an issue, a is more likely to have b's opinion on a different issue t than to have the opinion on t of a person chosen randomly. for example, a collaborative filtering recommendation system for laptop preferences could make forecasts about which laptop a user should like given a partial list of that user's preferences (likes or dislikes) [10]. however, there are two main inherent weaknesses in collaborative filtering based recommendation systems [8]: (1) it is a challenge to find similar users, because the probability that two random users have rated any items in common is very small, and hence they are hardly comparable. (2) if ad hoc user profiles are created with the goal of being considered similar to the target user, cf based recommendation can be easily attacked and recommendation precision can be greatly influenced. in order to overcome these shortcomings, massa p. and avesani p. proposed a trust-aware recommendation approach [8]: make use of trust propagation to search for trustable users based on the trust network, instead of finding similar users as collaborative filtering does. the products bought or preferred by these users are then recommended to the active user over the trust network. three years later, ray s. and mahanti a. put forward a new idea to improve prediction accuracy for trust aware recommender systems by removing all the trust statements that fall below a threshold correlation value to reconstruct the trust network [18].
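to make the similarity-based process (and its weaknesses) concrete, a minimal user-based cf predictor can be sketched as follows. this is a generic textbook scheme, not the algorithm of [8] or [18], and the function names are ours:

```python
import math

def cosine_sim(u, v):
    """cosine similarity between two users' rating dicts (item -> rating)."""
    common = set(u) & set(v)            # items rated by both users
    if not common:
        return 0.0                       # no overlap: the users are not comparable
    num = sum(u[i] * v[i] for i in common)
    den = (math.sqrt(sum(r * r for r in u.values()))
           * math.sqrt(sum(r * r for r in v.values())))
    return num / den

def predict(ratings, user, item):
    """similarity-weighted average of neighbors' ratings for one item."""
    num = den = 0.0
    for other, r in ratings.items():
        if other != user and item in r:
            s = cosine_sim(ratings[user], r)
            num += s * r[item]
            den += abs(s)
    return num / den if den else None   # None: no usable neighbor found
```

when no neighbor has rated the item, or no neighbor shares rated items with the target user, the prediction fails — exactly the sparsity weakness (1) discussed above.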
they assume that a trust statement passed between two users should imply that the similarity between both users will be relatively high, and generally utilize all the trust statements present in the data of similar users, calculated based on ratings, to make predictions. experiments on epinions datasets show that their method has better performance and effectiveness than the original approach for different levels of trust propagation and threshold correlation values [18]. however, a trust statement passed between users does not always imply that the correlation between them will also be high, because one user may pass trust statements to another user on the basis of the perceived notion that his (or her) predilections match the other's, while similarity calculated based on ratings may show that they are different. in our opinion, calculating user similarity is not necessary in trust based recommendation methods. we propose an approach where we construct a directed trust graph of users without considering the similarity between them. based on the dtg we present a trust feedback recommendation algorithm (tfbra) to make recommendations for a user. it shows substantially good performance for generating predictions through experimental evaluation on epinions datasets [19]. the rest of the paper is organized as follows. in section 2 we review some existing techniques which are related to our work. in section 3 the approach of trust based recommendation is explained in detail. section 4 presents verifying experiments using real-life epinions datasets, and the results of our evaluations are discussed. the last section draws conclusions and points out related possible future work.

2 related work

2.1 trust

trust came into being with the birth of human beings and human social interactions, and almost every aspect of a person's life is based on some form of trust.
undoubtedly, trust is positive and vital to humanity, since it is part of love and friendship, and meaningful relationships depend upon it. presently, trust is a research interest of many disciplines, including management [18], marketing [21] and information systems [22]. however, scholars have difficulty in reaching a consensus on what exactly trust is, and they disagree even on the basic definitions. generally, definitions of trust indicate a situation with the following characteristics [23–25]: one party (trustor) is willing to depend on the actions of another party (trustee); the situation is directed to the future. moreover, the trustor (voluntarily or forcedly) abandons control over the actions performed by the trustee. as a consequence, the trustor is uncertain about the outcome of the other's actions; he can only develop and evaluate expectations. the uncertainty involves the risk of failure or harm to the trustor if the trustee does not behave as desired. the rapid expansion of e-commerce brings the research of trust in social science up against new challenges, at the same time increasing the importance of trust and the urgency of knowing what drives customers' shopping decisions or preferences [26]; for example, the interpersonal relationship between customers and sellers has been disintermediated, and has to be compensated for. additionally, e-commerce systems should be well designed, or be made to imply that the sellers are trustable, even if the seller's actual trustworthiness is not high. many researchers have intensively studied the structure and formation mechanisms of trust from both the individual and the organizational aspects, and have identified five types of antecedents to consumer trust, including institution-based, personality-based and calculation-based sources for trust building [27–30].
2.2 recommendation systems

since the appearance of the first paper on collaborative filtering in the mid-1990s, recommender systems have attracted many scholars' attention and become an important research area [31]. over the last decade, new approaches have been proposed to improve the efficiency and practicability of recommender systems, both in industry and academia. at present, recommender system related research is still a popular issue, because it constitutes a problem-rich research area with practical use requirements, helping potential buyers to cope with information overload by providing personalized recommendations and useful services according to their characteristics. over the past decades a lot of research work has been conducted on recommendation technologies (or algorithms), using a wide range of statistical, artificial intelligence, information science and other techniques. these studies have observably improved the state of the art in comparison to the previous recommender systems, which used collaborative and content-based heuristics. up to now, the algorithmic means adopted in recommender systems can be divided into (1) content-based, collaborative, or hybrid recommendation, and (2) heuristic-based or model-based recommendation, according to the types of recommendation approaches used for the rating evaluation. some of these methods are utilized in industrial-strength recommender systems, e.g. the ones deployed at amazon [32], movielens [33] and versifi technologies. however, although there are many recommendation methods, the currently used recommender systems still need to be intensively improved, including better approaches for representing the recommended product items, more advanced and efficient recommendation algorithms or methods, and the utilization of multi-criteria ratings, to make recommendation methods more effective and applicable.
In recent years, scholars have proposed a new recommendation method, trust-based recommendation [8, 34, 35], and have shown it to be more robust against shilling attacks and more capable of generating effective recommendations for visitors to e-commerce systems, although these methods still need to calculate user similarity. Trust-based recommender systems have been shown to make more accurate recommendations than traditional systems because they exploit a new concept: trust propagation over a trust network. In [8] it was shown experimentally that a trust-based recommender system outperforms traditional recommendation methods on a dataset from epinions.com.

3 Trust-based recommendation algorithm

In this section we start by introducing basic notation for the trust cognitive process and the concept of the DTG. Based on an analysis of human trust perception, we incorporate this process into our recommendation algorithm, and we then present the logical architecture of the trust-based recommendation approach.

3.1 Trust cognitive analysis from the viewpoint of recommendation

In human society it is easy to observe that a person is usually more inclined to trust recommendations from an "acquaintance" than from a "stranger". In e-commerce recommender systems all users can be connected by trust relationships, which can be denoted as a DTG. As shown in Figure 1(a), a solid line represents a direct trust relationship between users ($n_1$ trusts $n_2$, $n_2$ trusts $n_3$). Through the trust relationships of $n_2$ and $n_3$, $n_1$ trusts $n_3$ indirectly. If there is no trust relation between $n_2$ and $n_3$, the trust relationship between $n_1$ and $n_3$ cannot be created (as shown in Figure 1(b)).
Here we first give some basic trust-related definitions in graph-theoretic terms.

Definition 1 (direct trust). Direct trust is a triple $\langle i, j, DT_{i,j}\rangle$ standing for the directed edge from node $i$ to node $j$; the direct trust value $DT_{i,j}$ of node $i$ in node $j$ is a discrete integer in $[1, 5]$, where a greater value represents a deeper degree of trust.

Definition 2 (indirect trust). Indirect trust is a triple $\langle i, j, IDT_{i,j}\rangle$, where $i$ and $j$ are nodes in the trust graph and $i$ can reach $j$ through a limited number of hops $h$ ($h > 1$, $h \in \mathbb{N}$); the indirect trust value $IDT_{i,j}$ of node $i$ in node $j$ is a discrete integer in $[1, 5]$, where a greater value represents a deeper degree of trust.

H.D. Zhong, S. Zhang, Y. Wang, Y. Shu

Figure 1: Trust process in human activity

Theorem 1 (direct trust preference principle). For any node $i$ in the trust graph, its direct trust nodes are more credible than its indirect trust nodes, where there are trust edges from $i$'s direct trust nodes to its indirect trust nodes. For example, in Figure 1(a), $DT_{n_1 n_2}$ is more credible than $IDT_{n_1 n_3}$ (i.e., $DT_{n_1 n_2} > IDT_{n_1 n_3}$).

3.2 DTG-based trust feedback

On the basis of the trust perception process presented above, we propose a DTG-based trust feedback algorithm (the general principle is shown in Figure 2). In Figure 2 the edges represent the trust relationships between different nodes. When recommendations are requested for $u_0$, that node simply sends a trust query to its direct trust nodes ($u_1$, $u_3$, $u_{15}$ and $u_{16}$), and these nodes likewise forward the query to the nodes they trust, until the trust-inquiry control process terminates; all queried nodes then feed back their trust values for the products concerned. Through this trust feedback process, a comprehensive score is obtained for the items recommended for $u_0$ by its trusted nodes (both direct and indirect trust nodes).
3.3 DTG establishment

As described above, direct trust nodes and indirect trust nodes are used to represent trust relationships in graph-theoretic terms. The steps of establishing a DTG are introduced in detail below.

Definition 3 (direct trust node, DTN). A DTN of a given node is any node that has a direct trust relationship with it.

Definition 4 (indirect trust node, IDTN). An IDTN of a given node is any node that has an indirect trust relationship with it.

In this paper we adopt a five-level scale (1, 2, 3, 4, 5) to represent the trust value (direct and indirect) between nodes; a higher trust value implies deeper trust. In our framework the relationship between any node and its neighbours falls into three categories: direct trust relationship, indirect trust relationship, and irrelevant nodes. For example, in Table 1, $u_0$ has 4 DTNs. Figure 3 shows the process of establishing a DTG.

3.4 DTG-based feedback trust recommendation fusion calculation

According to the transitivity of trust, recommended items for a specific node can be calculated on the basis of feedback trust in the DTG. If node $n_i$ trusts node $n_j$, then $n_i$ is more likely to accept items to which $n_j$ gives a high rating; and highly rated items of the DTNs of $n_i$ account for a larger proportion of the recommended items than those of its IDTNs.

Figure 2: DTG-based trust feedback principle

Table 1: Relationship between u0 and its neighbours

    peer   neighbour   DTN
    u0     u1          yes
    u0     u3          yes
    u0     u15         yes
    u0     u16         yes

Figure 3: The process of establishing a DTG

Definition 5. The recommended items $PRV(i)$ of a node $n_i$ can be obtained by a comprehensive evaluation of the feedback trust values and the highly rated items of its IDTNs and DTNs.
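Definitions 3 and 4 amount to a breadth-first traversal of the DTG: nodes one hop away are DTNs, nodes further away (within a hop limit) are IDTNs. The following Python sketch illustrates this classification; the function name, the hop limit, and the toy edge list are our own illustrative assumptions, not part of the paper's prototype:

```python
from collections import deque

def classify_neighbors(trust_edges, source, max_hops=3):
    """Split the nodes reachable from `source` in a directed trust graph
    into DTNs (exactly 1 hop, Definition 3) and IDTNs (more than 1 hop
    up to `max_hops`, Definition 4)."""
    adj = {}
    for i, j in trust_edges:
        adj.setdefault(i, set()).add(j)
    dtns = set(adj.get(source, ()))
    idtns, seen = set(), {source} | dtns
    frontier, hops = deque(dtns), 1
    while frontier and hops < max_hops:
        next_frontier = deque()
        for u in frontier:
            for v in adj.get(u, ()):
                if v not in seen:
                    seen.add(v)
                    idtns.add(v)
                    next_frontier.append(v)
        frontier, hops = next_frontier, hops + 1
    return dtns, idtns

# The four DTNs of u0 from Table 1, plus two hypothetical further trust edges.
edges = [("u0", "u1"), ("u0", "u3"), ("u0", "u15"), ("u0", "u16"),
         ("u1", "u2"), ("u3", "u7")]
dtns, idtns = classify_neighbors(edges, "u0")
```

With these edges, u1, u3, u15 and u16 come out as DTNs of u0, while u2 and u7 are IDTNs.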
Recommended items from the trusted nodes (DTNs and IDTNs) of $n_i$ are given by the following formula:

$$PRV(i) = \frac{\sum_{l=1}^{level} w(l,\alpha) \sum_{v \in TNL_l} ts_v \cdot rv_i}{\sum_{l=1}^{level} w(l,\alpha) \sum_{v \in TNL_l} ts_v} \qquad (1)$$

In this formula $level$ denotes the distance from node $n_i$, and $w(l,\alpha)$ denotes the weighted control factor function of the trust feedback value, which depends on the level and on the attenuation factor $\alpha$. In our view $w(l,\alpha)$ should be defined as follows:

$$w(l,\alpha) = \begin{cases} \alpha, & level = 1 \\[4pt] \alpha \prod_{l=1}^{level} \dfrac{1}{l}, & level > 1 \end{cases} \qquad (2)$$

where $\alpha$ denotes the trust attenuation factor (initial trust factor).

3.5 Trust feedback recommendation algorithm

As stated above, in e-commerce trading people are more likely to accept recommendations from directly trusted persons. In order to realize this trust-based recommendation idea, we designed the trust feedback recommendation algorithm (TFBRA); its pseudo-code is as follows.

    // TFBRA pseudo-code
    TrustFeedbackRecommendationAlgorithm
    Input:  s, maxlevel, α
            /* s: the active user asking for recommendations,
               maxlevel: maximum search level,
               α: attenuation factor */
    Output: PR  /* list of recommended items with high predicted rating values */
    Begin
        l := 1
        while l <= maxlevel do
            search the DTNs at the l-th level of the DTG
            if a searched DTN of s is not yet in the trust node list TNL_l then
                put that node into TNL_l
            endif
            l := l + 1
        endwhile
        /* calculate the predicted rating value PRV(i) of each item i rated
           by the users in TNL, according to formula (1) */
        put the items with the highest predicted rating values (PRV) into PR
    End

4 Experimental evaluation and result discussion

4.1 Experiment dataset

In order to examine the effectiveness of our algorithm, we perform our experiment on a dataset from epinions.
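The TFBRA pseudo-code and formulas (1)–(2) of Section 3 can be sketched in Python roughly as follows. This is an illustrative reimplementation under our own assumptions (dictionary-based graph encoding, function names), not the authors' actual prototype code:

```python
from math import prod  # Python 3.8+

def w(l, alpha):
    """Weighted control factor of formula (2): alpha at level 1,
    alpha * (1/1 * 1/2 * ... * 1/l) at deeper levels, so that trust
    feedback attenuates with distance from the active user."""
    if l == 1:
        return alpha
    return alpha * prod(1.0 / k for k in range(1, l + 1))

def tfbra(trust, ratings, source, maxlevel, alpha):
    """Level-by-level search of the DTG from `source` (the while-loop of
    the pseudo-code), then the weighted average PRV(i) of formula (1).
    `trust` maps user -> {trusted user: trust value ts_v in [1, 5]};
    `ratings` maps user -> {item: rating rv_i in [1, 5]}."""
    levels, seen, frontier = {}, {source}, [source]
    for l in range(1, maxlevel + 1):
        nxt = []
        for u in frontier:
            for v, ts in trust.get(u, {}).items():
                if v not in seen:  # record each trusted node once, at its first level
                    seen.add(v)
                    levels.setdefault(l, []).append((v, ts))
                    nxt.append(v)
        frontier = nxt
    num, den = {}, {}
    for l, members in levels.items():
        for v, ts in members:
            for item, rv in ratings.get(v, {}).items():
                num[item] = num.get(item, 0.0) + w(l, alpha) * ts * rv
                den[item] = den.get(item, 0.0) + w(l, alpha) * ts
    return {item: num[item] / den[item] for item in num}

# Toy DTG: u0 directly trusts u1 (level 1), u1 trusts u2 (level 2).
prv = tfbra({"u0": {"u1": 5}, "u1": {"u2": 4}},
            {"u1": {"A": 4}, "u2": {"A": 2}},
            "u0", maxlevel=2, alpha=0.8)
```

In the toy run, u1's level-1 rating of item A carries weight w(1, 0.8)·5 while u2's level-2 rating carries only w(2, 0.8)·4, so the predicted rating for A sits between the two but closer to u1's value.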
The epinions dataset was collected by Paolo Massa in a 5-week crawl (November/December 2003) of the epinions.com web site. The dataset contains 49,290 users who rated a total of 139,738 different items at least once, writing 664,824 reviews, and who issued 487,181 trust statements. Users and items are represented by anonymized numeric identifiers [19]. The epinions dataset consists of two files: (1) ratings_data.txt.bz2 (2.5 megabytes), which contains the ratings given by users to items; in this file every line has the format "user_id item_id rating_value", where user_id is an integer in [1, 49290], item_id is an integer in [1, 139738] and rating_value is an integer in [1, 5]; (2) trust_data.txt.bz2 (1.7 megabytes), which contains the trust statements issued by users; in this file every line has the format "source_user_id target_user_id trust_statement_value", where source_user_id and target_user_id are integers in [1, 49290] and trust_statement_value is always 1 (the dataset contains only positive trust statements and no negative ones (distrust)).

4.2 Evaluation measures

To use the epinions dataset more flexibly, we imported the two files (ratings_data.txt and trust_data.txt) into Microsoft SQL Server 2005 as two tables (rating_data and trust_data). We also added a trust_value column to the trust_data table and set it to a random integer in [1, 5] to represent the trust value between two users. Evaluation of the approach put forward in Section 3 was conducted in our self-developed recommendation prototype system, implemented on the Microsoft Visual Studio 2008 platform under Windows 7 Ultimate, on an Intel Core i3-2310 2.1 GHz (4 CPUs) processor with 6 gigabytes of memory. The main interface of the prototype system is shown in Figure 4.

Figure 4: Main interface of the self-developed recommendation prototype system

4.3 Result and discussion

All the results below are based on the self-developed trust-based recommendation system.
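For reference, the whitespace-separated line format of the raw epinions files described above can be parsed into the nested-dictionary structure used in the sketches here. The function name and the skip-malformed-lines policy are our own assumptions:

```python
def parse_ratings(lines):
    """Parse lines of the form 'user_id item_id rating_value' into a
    nested dict {user_id: {item_id: rating_value}}; malformed lines
    are skipped rather than raising."""
    ratings = {}
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue
        user, item, value = (int(p) for p in parts)
        ratings.setdefault(user, {})[item] = value
    return ratings

# A few lines in the stated "user_id item_id rating_value" format.
sample = ["1 100 5", "1 101 3", "2 100 4"]
ratings = parse_ratings(sample)
```

The trust_data file has the same three-column shape, so the same parser (with the third column read as the trust statement value) applies to it as well.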
There are thousands of combinations of the trust attenuation factor (α) and the maximum search level (maxlevel), and it is genuinely hard to determine how α and maxlevel should be chosen. In order to demonstrate the effectiveness of our recommendation algorithm, two values (0.8 and 0.6) were chosen for the parameter α, and ten recommended items for a randomly selected user were calculated according to TFBRA. Figure 5 shows the graphs for trust attenuation factor values of 0.8 and 0.6 respectively. The error between the recommended value and the actual rating value is shown in Table 2.

Figure 5: Ten recommended items for a randomly selected user, calculated according to TFBRA with α set to 0.8

In this paper the mean absolute error (MAE) [36] is used to evaluate the practical applicability of the trust feedback recommendation algorithm; a smaller MAE value means higher accuracy. The MAE is calculated as

$$MAE = \frac{1}{N}\sum_{i=1}^{N} |e_i| \qquad (3)$$

where $e_i$ is the difference between the recommended rating value and the actual rating value of the same item (product). If a node requesting recommendations has very few (or no) DTNs, the MAE may be relatively large (as can be seen from Figure 5 and Figure 6 when maxlevel = 1). As maxlevel increases, the number of available feedback trust calculation nodes (DTNs and IDTNs) grows rapidly, which leads to more accurate recommendations. Although there is a great variety of combinations of α and maxlevel, Table 2 clearly shows that the MAE decreases markedly as maxlevel increases.

Table 2: Error between recommended rating value and actual rating value

    MAE        maxlevel=1   maxlevel=2   maxlevel=4   maxlevel=5
    α = 0.8    0.954        0.957        0.41         0.221
    α = 0.6    0.757        0.49         0.4          0.331
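Formula (3) is straightforward to compute; a minimal sketch, assuming predictions and ground truth are given as item-to-rating dictionaries (our own representation, not the paper's):

```python
def mae(predicted, actual):
    """Mean absolute error of formula (3): the average of |e_i| over the
    items present in both the predictions and the actual ratings."""
    common = [i for i in predicted if i in actual]
    if not common:
        raise ValueError("no overlapping items to compare")
    return sum(abs(predicted[i] - actual[i]) for i in common) / len(common)

# |4.5 - 5| = 0.5 and |3.0 - 2| = 1.0, so the MAE is 0.75.
err = mae({"A": 4.5, "B": 3.0}, {"A": 5, "B": 2})
```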
Figure 6: Ten recommended items for a randomly selected user, calculated according to TFBRA with α set to 0.6

5 Conclusions

Recommendation is an important issue in information science and a hot topic in e-commerce systems. When computing recommendations for users in e-commerce systems, most existing research focuses on finding recommendations from similar users. However, many scholars have shown that finding similar neighbours poses many challenges. This research tries to overcome the difficulties of traditional recommendation methods and puts forward a new idea: obtaining more reasonable recommendations from trusted users. Based on basic notions about the trust cognitive process and the DTG, which is constructed from the trust links between users, the paper proposes a trust feedback recommendation algorithm. Experiments on secondarily processed epinions datasets highlight the relative advantages of the different algorithms, and the experimental results indicate the good effectiveness of our trust-based recommendation approach. One limitation of our method is that it has been tested and verified on only one dataset, the epinions dataset. In future we would like to validate the effectiveness of our trust feedback recommendation algorithm on trust datasets whose characteristics differ from those of the epinions dataset. Three important research questions that we would like to examine are: (1) the effect on recommendation accuracy when the way of setting trust values between two users varies; (2) the relation between the weighted control factor function and the accuracy of our trust feedback recommendation algorithm; (3) the effectiveness of our approach on other datasets, comparing the practicability of our method with analogous methods. We believe our approach can improve the coverage of recommender systems; however, much work remains to be done.
Acknowledgements

This work is supported by the National Science Foundation of China (Grant No. 71373244), the Ningbo Natural Science Foundation (Grant Nos. 2013A610068 and 2010A610119), the Zhejiang Province Department of Education Project (Grant No. Y201329219), the Science Foundation of Zhejiang Province (Grant No. LY13G030033), the Modern Port Service Industry and Culture Research Center of the Key Research Base of Philosophy and Social Sciences of Zhejiang Province, the Science and Technology Innovation Team of Ningbo (Grant Nos. 2013B82009 and 2012B82003) and the Social Development Project of Ningbo (Grant No. 2012C50045).

Bibliography

[1] Papazoglou, M. P. (2001); Agent-oriented technology in support of e-business: enabling the development of "intelligent" business agents for adaptive, reusable software, Communications of the ACM, ISSN 0001-0782, 44(4): 71-77.
[2] Levy, A. Y.; Weld, D. S. (2000); Intelligent internet systems, Artificial Intelligence, ISSN 0004-3702, 118(1-2): 1-14.
[3] Peiguang, L.; Feng, Y.; Xiao, Y.; Qun, X. (2008); Personalized e-commerce recommendation based on ontology, in Proc. of the International Conference on Internet Computing in Science and Engineering (ICICSE '08), 201-206.
[4] Siping, H.; Meiqi, F. (2008); Personalized recommendation based on ontology inference in e-commerce, in Proc. of the International Conference on Management of e-Commerce and e-Government, 192-195.
[5] Resnick, P.; Varian, H. R. (1997); Recommender systems, Communications of the ACM, ISSN 0001-0782, 40(3): 56-58.
[6] Huang, Z.; Chung, W. Y.; Chen, H. C. (2004); A graph model for e-commerce recommender systems, Journal of the American Society for Information Science and Technology, ISSN 1532-2882, 55(3): 259-274.
[7] Chen, L.; L'Abbate, M.; Thiel, U.; Neuhold, E. J.
(2005); The layer-seeds term clustering method: enabling proactive situation-aware product recommendations in e-commerce dialogues, Information Systems Frontiers, ISSN 1387-3326, 7(4/5): 405-419.
[8] Massa, P.; Avesani, P. (2007); Trust-aware recommender systems, in Proc. of the ACM Recommender Systems Conference, 17-24.
[9] Nan, R.; Qin, L. (2009); Research on the trust model based on the groups' internal recommendation in e-commerce environment, Journal of Software Engineering and Applications, ISSN 1945-3116, 2(4): 283-287.
[10] Konstan, J. A.; Riedl, J. (2012); Recommender systems: from algorithms to user experience, User Modeling and User-Adapted Interaction, ISSN 0924-1868, 22(1-2): 101-123.
[11] Zhang, W. (2007); A novel trust model based on recommendation for e-commerce, in Proc. of the International Conference on Service Systems and Service Management, 1-4.
[12] Ben Schafer, J.; Konstan, J.; Riedl, J. (1999); Recommender systems in e-commerce, in Proc. of the ACM Conference on Electronic Commerce, 158-166.
[13] Dzitac, I.; Barbat, B. E. (2009); Artificial intelligence + distributed systems = agents, International Journal of Computers Communications & Control, ISSN 1841-9836, 4(1): 17-26.
[14] Zmaranda, D.; Silaghi, H.; Gabor, G.; Vancea, C. (2013); Issues on applying knowledge-based techniques in real-time control systems, International Journal of Computers Communications & Control, ISSN 1841-9836, 8(1): 166-175.
[15] Zuo, M.; Wang, K.; Li, J. (2005); The application of collaborative filtering for trust management in P2P communities, Lecture Notes in Computer Science, ISSN 0302-9743, 3758: 383-394.
[16] Huang, Z.; Zeng, D. D. (2011); Why does collaborative filtering work? Transaction-based recommendation model validation and selection by analyzing bipartite random graphs, Journal on Computing, ISSN 2010-2283, 23(1): 138-152.
[17] Walter, F. E.; Battiston, S.; Schweitzer, F.
(2008); A model of a trust-based recommendation system on a social network, Autonomous Agents and Multi-Agent Systems, ISSN 1387-2532, 16(1): 57-74.
[18] Ray, S.; Mahanti, A. (2010); Improving prediction accuracy in trust-aware recommender systems, in Proc. of the 43rd Hawaii International Conference on System Sciences (HICSS), 1-9.
[19] http://www.trustlet.org/wiki/downloaded_epinions_dataset
[20] Mayer, R. C.; Davis, J. H.; Schoorman, F. D. (1995); An integrative model of organizational trust, Academy of Management Review, ISSN 0363-7425, 20(3): 709-734.
[21] Morgan, R. M.; Hunt, S. D. (1994); The commitment-trust theory of relationship marketing, Journal of Marketing, ISSN 0022-2429, 58(3): 20-38.
[22] Wang, Y. D.; Emurian, H. H. (2005); An overview of online trust: concepts, elements, and implications, Computers in Human Behavior, ISSN 0747-5632, 21(1): 105-125.
[23] Schoorman, F. D.; Mayer, R. C.; Davis, J. H. (2007); An integrative model of organizational trust: past, present, and future, Academy of Management Review, ISSN 0363-7425, 32(2): 344-354.
[24] McKnight, D. H.; Chervany, N. L. (1996); The meanings of trust, University of Minnesota.
[25] Bamberger, W. (2010); Interpersonal trust - attempt of a definition, Technische Universität München.
[26] McKnight, D. H.; Chervany, N. L. (2001); Conceptualizing trust: a typology and e-commerce customer relationships model, in Proc. of the 34th Annual Hawaii International Conference on System Sciences, 36-44.
[27] McKnight, D. H.; Cummings, L. L.; Chervany, N. L. (1998); Initial trust formation in new organizational relationships, The Academy of Management Review, ISSN 0363-7425, 23(3): 473-490.
[28] Li, X.; Hess, T. J.; Valacich, J. S. (2008); Why do we trust new technology? A study of initial trust formation with organizational information systems, The Journal of Strategic Information Systems, ISSN 0963-8687, 17(1): 39-71.
[29] Gefen, D.; Karahanna, E.; Straub, D. W.
(2003); Trust and TAM in online shopping: an integrated model, MIS Quarterly, ISSN 0276-7783, 27(1): 51-90.
[30] Kim, D. J.; Ferrin, D. L.; Rao, H. R. (2008); A trust-based consumer decision-making model in electronic commerce: the role of trust, perceived risk, and their antecedents, Decision Support Systems, ISSN 0167-9236, 44(2): 544-564.
[31] Resnick, P.; Iakovou, N.; Sushak, M.; Bergstrom, P.; Riedl, J. (1994); GroupLens: an open architecture for collaborative filtering of netnews, in Proc. of the Computer Supported Cooperative Work Conference, 175-186.
[32] Linden, G.; Smith, B.; York, J. (2003); Amazon.com recommendations: item-to-item collaborative filtering, IEEE Internet Computing, ISSN 1089-7801, 7(1): 76-80.
[33] Miller, B. N.; Albert, I.; Lam, S. K.; Konstan, J. A.; Riedl, J. (2003); MovieLens unplugged: experiences with an occasionally connected recommender system, in Proc. of the International Conference on Intelligent User Interfaces, 263-266.
[34] Massa, P.; Avesani, P. (2004); Trust-aware collaborative filtering for recommender systems, in Meersman, R.; Tari, Z.; van der Aalst, W.; Bussler, C.; Gal, A.; Cahill, V.; Vinoski, S.; Vogels, W.; Catarci, T.; Sycara, K. (eds.), On the Move to Meaningful Internet Systems, Springer-Verlag, Berlin.
[35] Yuan, W.; Shu, L.; Chao, H. C.; Guan, D.; Lee, Y. K.; Lee, S. (2010); iTARS: trust-aware recommender system using implicit trust networks, IET Communications, ISSN 1751-8628, 4(14): 1709-1721.
[36] Herlocker, J. L.; Konstan, J. A.; Terveen, L. G.; Riedl, J. T. (2004); Evaluating collaborative filtering recommender systems, ACM Transactions on Information Systems, ISSN 1046-8188, 22(1): 5-53.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 2, pp. 171-178

Environmental Policy and Science Management: Using a Scientometric-Specific GIS for E-Learning Purposes

N.D. Hasanagas, A.D. Styliadis, E.I.
Papadopoulou

Nikolaos D. Hasanagas, Athanasios D. Styliadis
Kavala Institute of Technology, Department of Landscape Architecture
City of Drama, Greece, GR 66100
E-mail: styliadis@ath.forthnet.gr, nikolaos.hasanagas@gmail.com

Eleni I. Papadopoulou
Aristotle University of Thessaloniki, Faculty of Agricultural Science
Department of Agricultural Economics, City of Thessaloniki, GR 54124
E-mail: epapa@agro.auth.gr

Abstract: Who is the "good scientist" in rural-environmental policy? This is not as self-evident as in the case of private high-tech industry. Developing an e-learning system for environmental science management is a challenging task in the area of forest and general rural development policy. Who determines the most "important" scientific information and who controls it? There are algorithms for measuring centrality in information networks. The concepts of closeness and betweenness centrality are used as basic metadata for categorizing the communication type in rural-environmental policy networks. This paper discusses the development of a GIS-based model which includes region-based scientometrics regarding policy field communication.

Keywords: region-based scientometrics, GIS e-learning, forest policy, integrated rural development policy, environmental policy, complete network analysis, policy making.

1 Introduction

It is evident that in the high-tech policy sector the "good scientist" is private industry, since impressive products like mobile phones are usually produced by industry rather than by universities or public agencies. However, this is not the case in rural-environmental policy, where the main issues concern "multiple-use forestry", "nature protection", "integrated rural development", "sustainability" etc.; in these abstract issues, success normally means persuading others about who the "trouble-maker" and who the "rescuer" is, not finding concrete solutions.
Moreover, despite the common belief that universities and research institutes are the most influential suppliers and controllers of "scientific" information, this is not necessarily the rule in rural-environmental policy either. This paper presents the possibility of developing a scientometric-specific GIS-based model applicable in e-learning for various target groups (e.g. students specializing in forest policy and rural policy analysis, lobbyists, policy makers). In the context of this paper, "scientometric" differs from the classical measurement of citation rates or use of academic bibliography: it is defined as the management of information which is regarded as "scientific" by lobbyists, public administrators and academics in the networks of forest policy issues that serve as case studies in this paper. The complete network analysis conducted here with the visone software showed that the "scientific" information regarded as most "important" in a network is often supplied by practitioners, such as environmental and forest agencies, landowner organizations etc., and not necessarily by actors officially labelled "scientific", such as universities or research centres. This implies that the "scientific" is rather a feeling induced by factors other than the official orientation of an actor. The analysis pointed out that forest policy, environmental issues and rural development are inseparable policy fields, strongly integrated with each other. The actors who produce "scientific" information that is considered "important" can disseminate the information favourable to them in order to support their socio-political positions. Thus, a "scientifically important" actor may be an informal but influential decision-maker. Additionally, the network will collapse if an actor who participates in many information paths, and thus controls the flow of information, suddenly abandons the network.
In this case the actor is merely a crucial "postman" but not necessarily a "decision-maker". In policy planning, implementation and evaluation, it is useful to distinguish the "decision-maker" from the "postman" in order to focus lobbying activity on the former rather than the latter. This is especially useful in integrated rural development, including forest policy, as there are numerous complex, informal procedures that differ among countries, and unclear indicators (RuDI research project findings). Therefore, there is a point in developing an adaptive GIS e-learning system on region-specific information networks in order to make lobbyists, public administrators and students more familiar with science management in the policy arena [14].

Copyright © 2006-2010 by CCC Publications

2 Selecting the GIS learning object and the learner target group

Adaptive learning systems are supposed to require numerous, elaborate rules [5, 6, 18]. However, the existence of tested algorithms which detect the position of an actor in the formal or informal hierarchy of information in a network simplifies the rule and metadata requirements, even in complex social systems [6, 8, 11, 16, 19]. Two algorithms, closeness centrality (CC) and betweenness centrality (CB), can be used for classifying the actors. The only question is which type of actor interests each learner group [2, 11, 17, 6, 9]. We can distinguish two aggregated and seven specific learner groups: a) aggregated learner groups: 1) state actor employees (civil servants of agencies); 2) private actor employees (lobbyists, managers, experts and employees of interest groups or enterprises).

Figure 1: Learner target groups
b) Specific learner target groups (Figure 1): these emerge from the combination of three dimensions [12, 13, 15]: 1) the legal character of the actor (public or private); 2) the orientation of the actor (economic or non-profit-seeking with regard to the management of natural resources, or science orientation); and 3) the degree of dependence on the liberal market: i. individual enterprises, which are directly exposed to competition; ii. economic interest groups of enterprises, which only represent their interests in the policy network; iii. non-profit-seeking interest groups, such as cultural associations for the renovation of villages, forest recreation associations, and environmental NGOs, which depend only on national and EU funds, or private universities and research centres, which can be flexible to market needs; and iv. state actors, which are totally independent of the market, such as agencies, public universities and research units (RuDI findings). The GIS learning objects (Figure 2) are the seven actor types described in Figure 1 ("Archimedes" research project findings), in each category of information management function (CC and CB). An enterprise characterized by high CC (a strong "decision-maker") in a certain region-specific policy network is one case of learning object, while the same enterprise characterized by a low CB (a weak "postman") is a different case of learning object.

Figure 2: Processing model framework

Each learner can thus examine whether the actor he/she belongs to is a strong "decision-maker" or "postman" in comparison with the other actors, so as to plan his/her lobbying strategy more effectively. In this way, instead of a standardized instructional design, the learner-practitioner (or policy analysis student) can be flexibly oriented by using CC and CB as indicators.
3 Visualizing "decision-makers" and "postmen"

Various socio-informatics software products have been used for measuring and visualizing the invisible structures of policy networks, which very often include informal relationships [1, 3]. The INSNA web site ("International Network for Social Network Analysis", www.insna.org) offers a remarkable variety of such software. However, the implementation and use of the outputs of this software in e-learning through a region-specific GIS learning management system is still an open challenge, where close cooperation within a multi-disciplinary group of social scientists [13, 15], software engineers, forest policy analysts, economists and practitioners such as lobbyists, foresters, agronomists etc. is necessary. Informal relations are sometimes much more decisive for the policy output than formal ones [3, 4, 7, 14]. A usual relation type in such region-specific networks is the exchange of information [8, 9]. The visone software is appropriate for depicting and analyzing both formal and informal information exchange; the algorithms of CC and CB are used for this purpose. The CC is an indicator of the importance of a policy actor, while the CB indicates the capacity to control the paths of information exchange and thereby the dissemination of "scientific" information [13]. CC is based on the distance d (i.e. the smallest number of links) between two actors. If, for example, the Royal Scottish Forestry Society gives information to the Friends of Loch Lomond and the latter to the National Trust for Scotland (and there is no direct information link from the Royal Scottish Forestry Society to the National Trust for Scotland), then the distance between the Royal Scottish Forestry Society and the National Trust for Scotland is d = 2 (links). The sum of all distances from an actor i to every other actor is the closeness of actor i, and the closeness centrality of i is then defined as its inverse closeness [8, 9]:
$$CC(i) = \Big[\sum_j d(j,i)\Big]^{-1} \qquad (1)$$

The fewer links needed to connect i to any other actor, the higher its CC. If an actor possesses information that is regarded as crucial by the other actors, then one can expect this actor to have a high CC; for this reason, CC is an indicator of information importance. CB quantifies the information control potential of an actor i and is defined as the sum, over pairs of other actors, of the fractions of shortest paths that actor i sits on:

$$CB(i) = \sum_{j \neq k} \frac{|P_i(j,k)|}{|P(j,k)|} \qquad (2)$$

where $P(j,k)$ and $P_i(j,k)$ are the set of all shortest paths between j and k, and the subset of those shortest paths passing through i, respectively. Thus, an actor with high CB plays the role of the "go-between" for many other actors in terms of shortest paths, and in this way controls the dissemination of information. If the actor is also aware of this capability, it can influence decision-making processes. CB points out the most crucial "postman" of information in a network, while CC shows who is the most important "sender". The "sender" is an information imposer and thereby a "decision-maker" who can directly influence policy making. In everyday life, the "sender" (e.g. a bank which makes a decision) may often be clearly differentiated from the person who brings the letter from the bank to the interested client. In politics, however, it is often unclear who the "sender" and who the "postman" is, as these roles are informal. Using the two indicators, CB and CC, this difference can be detected. Three examples of region-specific networks of "scientific" information flow are presented in Figure 3. These networks concern issues such as UK1: management of the Loch Lomond and Trossachs National Park; UK2: the Scottish Forestry Strategy; and Greece: the amendment of the forest-related Article 24 of the national constitution.
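Formulas (1) and (2) can be sketched by plain shortest-path enumeration, which is adequate for small policy networks of the kind discussed here (visone's actual implementation differs; the function names, the zero-closeness convention for disconnected actors, and the toy network are our own assumptions):

```python
from collections import deque
from itertools import permutations

def all_shortest_paths(adj, s, t):
    """Enumerate all shortest directed paths from s to t via BFS."""
    dist, parents, q = {s: 0}, {}, deque([s])
    while q:
        u = q.popleft()
        for v in adj.get(u, ()):
            if v not in dist:
                dist[v] = dist[u] + 1
                parents[v] = [u]
                q.append(v)
            elif dist[v] == dist[u] + 1:
                parents[v].append(u)
    if s == t or t not in dist:
        return []
    def build(v):
        if v == s:
            return [[s]]
        return [p + [v] for u in parents[v] for p in build(u)]
    return build(t)

def closeness_centrality(adj, i, nodes):
    """CC(i) = 1 / sum_j d(j, i), formula (1): inverse of the summed
    distances TO actor i; returns 0.0 if i is unreachable from some actor
    (a convention chosen here for disconnected networks)."""
    total = 0
    for j in nodes:
        if j == i:
            continue
        paths = all_shortest_paths(adj, j, i)
        if not paths:
            return 0.0
        total += len(paths[0]) - 1
    return 1.0 / total

def betweenness_centrality(adj, i, nodes):
    """CB(i), formula (2): for every ordered pair of other actors (j, k),
    the fraction of shortest j->k paths that pass through i."""
    score = 0.0
    for j, k in permutations(nodes, 2):
        if i in (j, k):
            continue
        paths = all_shortest_paths(adj, j, k)
        if paths:
            score += sum(1 for p in paths if i in p) / len(paths)
    return score

# The chain from the text: RSFS -> FoLL -> NTS, so d(RSFS, NTS) = 2.
adj = {"RSFS": ["FoLL"], "FoLL": ["NTS"]}
nodes = ["RSFS", "FoLL", "NTS"]
cc_nts = closeness_centrality(adj, "NTS", nodes)      # 1 / (2 + 1)
cb_foll = betweenness_centrality(adj, "FoLL", nodes)  # sole "postman" on the only path
```

In this toy network the Friends of Loch Lomond node gets the maximal betweenness (it sits on the only RSFS-to-NTS path), matching the "go-between" reading of CB above.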
The actors are landowners' associations, forest services, agricultural directorates, cultural associations (related to nature aesthetics and cultural landscapes), angling and hunting associations, environmental NGOs, technical chambers, tourist enterprises, forest and agricultural industries, municipalities, etc. (Figure 1 can be used as a legend to the nodes in the depicted networks). The number of actors, the number of links and the network density (%) appear in each table cell of Figure 3. In the first row, the whole region-specific networks appear (two from Scotland and one from Greece). The networks of "scientific" information, shaped according to the Cc and Cb of the actors, are presented in the second and third rows respectively. The difference in density (%) between the whole contact networks (first row) and the "scientific" information networks is remarkable: the density of "scientific" information is drastically lower than the "general contact" density, even though issues like forest-environment and integrated rural development policy are supposed to be multi-disciplinary areas. Not only are the "science"-interested actors relatively few, but the lower density also shows that "scientific" communication is much weaker than the total activities (e.g. lobbying and institutional pressure, exchange of material support, etc.) (RUDI findings). The analysis suggests that in some networks, economic or profit-seeking actors, private or state, appear to be the most "important" actors in the "scientific" arena, while universities or research institutes rank quite low in importance. This is particularly true for a science-oriented actor (a university) in the UK1 issue: this university is totally excluded from the "scientific" information network. Thus, which information is regarded as "scientific" is quite subjective (RUDI findings).
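The density comparison above can be reproduced with the usual definition of network density (realized links over possible links); this definition is standard in social network analysis rather than given in the text, and the actor and link counts in the example are invented for illustration.

```python
def density_pct(n_actors: int, n_links: int, directed: bool = True) -> float:
    """Network density as a percentage: realized links over possible links."""
    possible = n_actors * (n_actors - 1)
    if not directed:
        possible //= 2  # each undirected link counted once
    return 100.0 * n_links / possible

# Hypothetical counts: a general contact network is typically much denser
# than the "scientific" information network over the same actors.
print(density_pct(30, 120))  # contact network
print(density_pct(30, 25))   # "scientific" information network
```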
Moreover, comparing the Cc network with the Cb network of the UK2 issue makes it obvious that the most influential "decision-maker" is not necessarily the best "postman": a forest service possesses the highest Cc, while an environmental protection agency possesses the highest Cb. In Figure 4, a comparative view of region-specific scientometric data is provided: each learner target group can focus its research work on the respective learning object (actor type and position in the information hierarchy). The learners can examine and interpret the quantitative results: e.g. why the most important "scientist" and "science" controller is the state, why the science-oriented actors are of disputable importance and much weaker in intermediating "science", etc. The e-learning system functions as an "adaptor" which connects academic knowledge with practical and "local" knowledge; the synergy between field expert experience and academic knowledge can thereby be optimized. Possibilities of coalition building or cross-sectoral bridging through science management [12, 14], as well as chances for effective coordination through information control [7, 10, 15], can be discussed and analyzed by all learners. Such an e-learning system could be useful in research projects which aim at a better understanding of rural-environmental policy design, delivery, monitoring and evaluation (Figure 3: region-specific networks of "scientific" information flow; "ARCHIMEDES" and RUDI findings). In particular, it enables the implementation of forest policy as a holistic "scientific" toolbox for socio-economic planning, financing and development of forestry, evaluation of the protective and economic role of forests, prediction of supply and demand of forest products, and forest administration.
4 Summary and conclusions

There are different science management patterns among region-specific policy networks. Science-oriented actors are not necessarily the leading ones in the information networks that are considered "scientific" by their participants. A scientometric-specific GIS based on complete network analysis helps all learners understand that forest policy, integrated rural development and environmental policy constitute a multifaceted and cohesive politico-administrative system, in which the potential influence over information use and distribution becomes quite complex and makes the policy output unpredictable. Through the presented GIS e-learning system, all learner target groups become more familiar with a policy design which takes into account the needs not only of the agricultural population but of a whole integrated rural territory, developed in the framework of a new type of urban-countryside relationship (RUDI findings). The quantified learning objects of the integrated rural policy arena should be further interpreted through a participative and qualitative approach. Thereby, the socio-economic and ecological driving forces which determine the quantitative results will be better understood by the learner target groups, independently of any political interests or personal views. (Figure 4: scientometric-embedded GIS.)

Acknowledgements

The research initiative proposed by this paper has been supported by the EU-funded research project "ARCHIMEDES" (Department of Landscape Architecture, Kavala Institute of Technology, Drama, Greece), by the EU-funded research project "RUDI: Rural Development Impacts: Assessing the Impact of Rural Development Policies, incl.
LEADER" (Department of Agricultural Economics, Faculty of Agricultural Science, Aristotle University of Thessaloniki, Greece), and by the Institute of Forest Policy and Nature Conservation of Goettingen University (Germany).

Bibliography

[1] K.A. Bakar, B.S. Doherty, Evaluation of the recorded state mechanism for protecting agent integrity against malicious hosts. Int. J. of Computers, Communications & Control, 3(1), 60-68, 2008.
[2] A.D. Styliadis, I.I. Akbaylar, D.A. Papadopoulou, N.D. Hasanagas, S.A. Roussa, L.A. Sexidis, Metadata-based heritage sites modeling with e-learning functionality. Journal of Cultural Heritage, 10, 296-312, 2009.
[3] M.A. Rajan, M. Girish Chandra, L.C. Reddy, P. Hiremath, Concepts of graph theory relevant to ad-hoc networks. Int. J. of Computers, Communications & Control, 3(S), 465-469, 2008.
[4] P. Borne, M. Benrejeb, On the representation and the stability study of large scale systems. Int. J. of Computers, Communications & Control, 3(S), 55-66, 2008.
[5] A.D. Styliadis, D.G. Konstantinidou, K.A. Tyxola, eCAD system design applications in architecture. Int. J. of Computers, Communications & Control, 3(2), 204-214, 2008.
[6] A.D. Styliadis, Digital documentation of historical buildings with 3-D modeling functionality. Automation in Construction, 16, 498-510, 2007.
[7] N.D. Hasanagas, Lobbying management: principles and methods of a more effective practice in the environmental labyrinth of the European Union. Scottish Forestry (The Journal of the Royal Scottish Forestry Society), 57(1), 28-32, 2003.
[8] M. Baur, M. Benkert, U. Brandes, S. Cornelsen, M. Gaertler, B. Koepf, J. Lerner, D. Wagner, Visone: software for visual social network analysis. Proc. 9th Intl. Symp. Graph Drawing, Lecture Notes in Computer Science 2265, 463-464, Springer, 2002.
[9] A.D. Styliadis, Historical photography-based computer-aided architectural design: demolished buildings information modeling with reverse engineering functionality.
Automation in Construction, 18, 51-69, 2008.
[10] N. Zarour, S. Bouzidi, Coalition formation for cooperative information agent-based systems. Int. J. of Computers, Communications & Control, 1(3), 85-92, 2006.
[11] A. Andreatos, Virtual communities and their importance for informal learning. Int. J. of Computers, Communications & Control, 2(1), 39-47, 2007.
[12] M. Krott, N.D. Hasanagas, Measuring bridges between sectors: causative evaluation of cross-sectorality. Forest Policy and Economics, 8, 555-563, 2006.
[13] M. Krott, Catalyst for innovation in European forest policy sciences. Evaluation of the EFI Research Program 3: policy analysis. Forest Policy and Economics, 5(2), 123-134, 2003.
[14] S. Shortall, Are rural development programmes socially inclusive? Social inclusion, civic engagement, participation, and social capital: exploring the differences. Journal of Rural Studies, 24(4), 450-457, 2008.
[15] D.J.A. Douglas, The restructuring of local government in rural regions: a rural development perspective. Journal of Rural Studies, 21(2), 231-246, 2005.
[16] A.D. Styliadis, I.D. Karamitsos, D.I. Zachariou, Personalized e-learning implementation: the GIS case. Int. J. of Computers, Communications & Control, 1(1), 59-67, 2006.
[17] A.D. Styliadis, E-learning documentation of historical living systems with 3-D modeling functionality. Informatica, 18(3), 419-446, 2007.
[18] A.D. Styliadis, P.G. Patias, N.C. Zestas, 3-D computer modeling with intra-component, geometric, quality and topological constraints. Informatica, 14(3), 375-392, 2003.
[19] A.D. Styliadis, M.Gr. Vassilakopoulos, A spatio-temporal geometry-based model for digital documentation of historical living systems. Information & Management, 42, 349-359, 2005.

Nikolaos D. Hasanagas, born in 1974. Assistant professor in environment-related subjects at the Kavala Institute of Technology, Drama, Greece. BSc and MSc eq.
in environmental sciences (Aristotle Univ. of Thessaloniki, Greece), BA and MA eq. in social sciences, PhD in environmental policy analysis (Goettingen Univ., Germany).

Athanasios D. Styliadis, born in 1956. Professor of digital architecture and design computing at the Department of Landscape Architecture, Kavala Institute of Technology, Drama, Greece. Diploma in surveying engineering, MSc in computer science (Dundee Univ., Scotland), PhD in CAAD and GIS (Aristotle Univ. of Thessaloniki, Greece).

Eleni I. Papadopoulou, born in 1957. Assistant professor in rural policy at the Faculty of Agricultural Science, Aristotle University of Thessaloniki, Greece. BSc in agriculture engineering, MSc in agricultural economics (Univ. of Reading, UK), PhD (Aristotle University of Thessaloniki).

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 825-836

A Genetic Algorithm for Multiobjective Hard Scheduling Optimization

E. Niño, C. Ardila, A. Perez, Y. Donoso

Elías Niño, Carlos Ardila: Department of Computer Science, Universidad del Norte, Km 5, Vía Pto. Colombia, Barranquilla, Colombia. E-mail: {enino,cardila}@uninorte.edu.co
Alfredo Perez: Department of Computer Science and Engineering, University of South Florida, 4202 E. Fowler Ave., Tampa, Florida. E-mail: ajperez4@cse.usf.edu
Yezid Donoso: Department of Computing and Systems Engineering, Universidad de los Andes, Cra 1 No. 18A-12, Bogotá, Colombia. E-mail: ydonoso@uniandes.edu.co

Abstract: This paper proposes a genetic algorithm for multiobjective scheduling optimization, based on an object-oriented design, with constraints on delivery times, process precedence and resource availability. Initially, the programming algorithm (PA) was designed and implemented, taking all the constraints mentioned into account. This algorithm's main objective is, given a sequence of production orders, products and processes, to calculate their total programming cost and time.
Once the programming algorithm was defined, the genetic algorithm (GA) was developed to minimize two objectives: delivery times and total programming cost. The stages defined for this algorithm were selection, crossover and mutation. During the first stage, the individuals composing the next generation are selected using a strong dominance test. Given the strong restrictions of the model, the crossover stage utilizes a process level structure (PLS), in which processes are grouped by their levels in the product tree. Finally, during the mutation stage, the solutions are modified in two different ways (selected at random): changing the selection of the resources of one process, or ordering the processes of a level by their execution time. In order to obtain more variability in the solutions found, the production orders and the products are organized with activity planning rules such as EDD, SPT and LPT. Within each level, the processes are ordered by processing time from lower to higher (PLU), from higher to lower (PUL), randomly (PR), or by local search (LS). As local search strategies, three algorithms were implemented: tabu search (TS), simulated annealing (SA) and the exchange deterministic algorithm (EDA). The purpose of the local search is to order the processes in such a way that the total execution time of the level is minimized. Finally, Pareto fronts are used to show the results obtained by applying each of the specified strategies; the results are analyzed and compared.
Keywords: scheduling, process, genetic algorithm, local search, Pareto front.
Copyright © 2006-2010 by CCC Publications.

1 Introduction

Genetic algorithms (GA) are a powerful tool for solving combinatorial problems.
Nowadays there exist many algorithms inspired by GA for solving real problems, such as the design of vehicle suspensions [1], product deployment in telecom services [2], the design of flexible multi-body vehicle suspension models based on skeleton implementation [3], job-shop scheduling [4], economic dispatch of generators with prohibited operating zones [5], multi-project scheduling [6], inversion analysis of permeability coefficients [7], path planning in unstructured mobile robot environments [8], rough mill component scheduling [9] and power plant control system design [10]. In production systems the assignment of resources is critical: a product has processes, and the processes require resources. The order in which the execution of the processes is programmed affects the overall cost and time of the products. It is therefore very important to do the planning and scheduling in the best possible way, and this can be accomplished with a genetic algorithm.

2 Preliminaries

2.1 Local search

Local search techniques search for solutions within a set of solutions, always trying to improve the current solution through perturbations. A perturbation is a simple way of changing a solution, and it depends on how solutions are represented. For instance, Figure 1 shows a binary representation of a solution; in this case a perturbation can be done by changing ones (1) to zeros (0) and vice versa. On the other hand, Figure 2 shows a non-binary representation of a solution (a tour of the traveling salesman problem [11], for example); in this case a perturbation can be done by swapping two elements of the tour.

Figure 1: a binary representation of solutions. Here x = 001001 represents the decimal number 9. The perturbation changes the 4th bit from 0 to 1, creating a new solution x' = 001101, which represents the decimal number 13.
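The two perturbation styles of Figures 1 and 2 can be sketched in a few lines of Python; the function names are illustrative.

```python
def flip_bit(x: str, i: int) -> str:
    """Perturb a binary-encoded solution by flipping bit i (as in Figure 1)."""
    b = list(x)
    b[i] = '1' if b[i] == '0' else '0'
    return ''.join(b)

def swap_two(tour: list, i: int, j: int) -> list:
    """Perturb a permutation-encoded solution, e.g. a TSP tour (Figure 2)."""
    t = tour[:]  # copy so the original solution is kept
    t[i], t[j] = t[j], t[i]
    return t

print(flip_bit("001001", 3))         # "001101": 9 becomes 13, as in the text
print(swap_two([1, 2, 3, 4], 1, 2))  # [1, 3, 2, 4]
```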
Figure 2: a non-binary representation of solutions. Here x = 1-2-3-4 represents a tour in the TSP. The perturbation swaps the 2 and the 3 in the string, creating a new solution x' = 1-3-2-4.

There exist many local search algorithms, such as tabu search (TS) [12], simulated annealing (SA) [13] and the exchange deterministic algorithm (EDA) [14]. In all of them it is necessary to find a good representation of the solutions, which obviously depends on the problem to be solved.

2.2 Genetic algorithm

A genetic algorithm (GA) is a search technique used for solving optimization problems. The most important element of a GA is the design of the chromosome, the representation of the feasible solutions. Consequently, the behavior of the GA depends on the chromosome: a bad chromosome implies bad behavior of the GA, while a good chromosome may imply good behavior. The framework of a GA can be seen in Figure 3.

Genetic Algorithm
  S = create initial solutions
  do
    selection(S)
    crossover(S)
    mutation(S)
  until stop condition = true
End Genetic Algorithm

Figure 3: the framework of a genetic algorithm.

A GA has three important steps. First, it selects the solutions for the crossover and mutation steps; this selection can be done using a metric, for example the inverted generational distance (IGD) [15]. Second, it takes pairs of solutions for crossing, which can be done at random. Crossover consists in creating new solutions from parts of two solutions: the two original solutions are named parents (father and mother) and the two new solutions sons; each son is created with half from the father and half from the mother. Lastly, it takes some sons and mutates them. Mutation is a step that allows creating new solutions, and it can be done using a perturbation or a local search. The three steps of a GA can be seen in Figure 4.
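The loop of Figure 3 can be sketched as follows. The bit-string encoding, the single-point crossover, and the fitness function (the decimal value of the string, echoing Figure 1) are illustrative assumptions only, not part of GAMHSO.

```python
import random

def genetic_algorithm(pop_size=20, n_bits=8, generations=50, seed=1):
    """Minimal sketch of Figure 3: select, cross, mutate, repeat.
    Fitness (decimal value of the bit string) is purely illustrative."""
    rng = random.Random(seed)
    fitness = lambda s: int(s, 2)
    pop = [''.join(rng.choice('01') for _ in range(n_bits)) for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: keep the better half of the population as parents.
        pop.sort(key=fitness, reverse=True)
        parents = pop[:pop_size // 2]
        # Crossover: single-point split, half from father and half from mother.
        children = []
        while len(parents) + len(children) < pop_size:
            f, m = rng.sample(parents, 2)
            cut = rng.randrange(1, n_bits)
            children.append(f[:cut] + m[cut:])
        # Mutation: perturb some children by flipping one random bit.
        for i, c in enumerate(children):
            if rng.random() < 0.2:
                j = rng.randrange(n_bits)
                children[i] = c[:j] + ('1' if c[j] == '0' else '0') + c[j + 1:]
        pop = parents + children
    return max(pop, key=fitness)

best = genetic_algorithm()
print(best, int(best, 2))  # best string found and its decimal value
```

Because the parents survive each generation, the best individual never worsens; with a meaningful fitness the population drifts toward better solutions.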
Figure 4: steps of a genetic algorithm. Parents such as 0 1 2 3 and 1 0 3 2 are selected and crossed into sons, which are then mutated by perturbations such as swap(1, 2) and swap(0, 3).

3 A genetic algorithm for multiobjective hard scheduling optimization

We state a new genetic algorithm for multiobjective hard scheduling optimization (GAMHSO). This algorithm works in scenarios with the following characteristics. There are production orders that are composed of a set of products. Each product is described by a product tree that contains all the processes needed to build that product; for a product to be ready, the execution of all processes belonging to its tree is required. A process may need the execution of another process (precedence) or of another subpart before it can be executed. The execution of some processes can only be done at certain times (schedules). To execute a process, a group of resources is required; the defined resources are machinery, employees and vehicles, and it is not necessary for a group of resources to contain all types of resources. Finally, if a subcomponent is required, it has to be constructed by a set of processes, or it can be modeled by a process that indicates an idle state (the subcomponent has not yet arrived in the system). This scenario is shown as a domain model in Figure 5.

Figure 5: domain model for the scenario of GAMHSO.

Formally, GAMHSO is defined in Figure 6. The two objectives of GAMHSO are to find a set of nondominated solutions for the programming of the processes, minimizing the overall time and cost. The initial solutions are created with heuristics that organize the production orders according to rules; GAMHSO selects the heuristic at random. The available heuristics for creating the initial solutions are earliest due date (EDD), longest processing time (LPT) and shortest processing time (SPT).
EDD orders the production orders from lower to higher due date. SPT orders the production orders from lower to higher with respect to the sum of the processing times of their products; LPT is the contrary of SPT. Once the initial solutions are created, the selection step selects solutions from the overall set of solutions. The selected solutions are named candidates; they are the solutions that can be crossed to create new solutions, and their number is defined by the candidate rate Cr. The definition of the chromosome is very important for the crossover step. For instance, consider the product of Figure 7. Two feasible solutions could be represented by programming the execution of the processes in the order 8 11 9 10 6 7 5 4 1 2 3 or 11 10 6 5 4 2 8 9 7 1 3. The program indicates a feasible order for executing the processes. Some processes can be executed in parallel, so the order indicates the priority in the utilization of the resources: for instance, processes 8 and 9 can be executed at the same time, but in the first solution, if 8 uses a machine that 9 requires, 9 cannot be executed until 8 frees the resource. The problem with this representation is that the crossover step could create unfeasible solutions. For example, if we split the two solutions mentioned in the middle and then cross them, we obtain the solutions 8 11 9 10 6 2 8 9 7 1 3 and 11 10 6 5 4 7 5 4 1 2 3; obviously, these are unfeasible.

Inputs: Cr, ρr
Output: a set (S) of nondominated solutions.
GAMHSO
  S = create initial solutions(EDD, LPT, SPT)
  do
    S' = selection of candidates(S, Cr)
    C = crossover of candidates(S', ρr)
    x = generate an integer in [0, 2] with a uniform distribution
    switch(x)
      case 0: C = TabuSearch(C)
      case 1: C = SimulatedAnnealing(C)
      case 2: C = ExchangeDeterministicAlgorithm(C)
    end
    S = S ∪ C
    S = remove dominated solutions(S)
  until stop condition = true
End GAMHSO

Figure 6: the framework of GAMHSO.

4 Definition of the chromosome

The objectives of GAMHSO are the optimization of overall cost and time, achieved through the programming order of the execution of the processes, so it is very important to provide a good representation of the solutions. Consider again the product of Figure 7. It has subparts because it models reality. But what does a subpart mean? It means that the process that contains the subpart cannot be executed until the processes that build the subpart have been executed. In other words, there exists a precedence constraint between the process that contains the subpart and the processes that build it. Applying this to Figure 7, we obtain the tree of Figure 8, in which no subpart exists: each subpart is replaced by the precedence between the processes. Once the tree is ready, we need to create a chromosome that allows the representation of the programming.

Figure 7: tree of product A. Product A requires the execution of processes 1, 2 and 3. Process 1 requires the execution of process 7. Process 7 requires subpart 1, which requires the execution of processes 8 and 9. Process 2 does not require other processes. Process 3 requires the execution of processes 4 and 5. Process 5 requires the execution of process 6. Process 6 requires subpart 2, which requires the execution of process 10, and process 10 requires the execution of process 11.

First, we group the processes by level.
This means that the start time of a process at a superior level depends on the finalization times of its parents at a lower level. Formally:

p.start_time = max_i (parent_i.finalization_time)   (4.1)

For instance, if process 5 finishes its execution at time 20 and process 4 finishes its execution at time 40, process 3 will start its execution at time 40. On the other hand, process 2 can be executed from time 0. Once the levels of the processes have been identified, they are grouped in a level structure, shown in Figure 9; each process knows its children at the superior level. The resulting chromosome for GAMHSO is shown in Figure 10. The production orders are executed from left to right (from top to bottom). The products are built from left to right (from top to bottom). The processes are executed from right to left (from bottom to top) and processed from left to right (from top to bottom). The crossover step consists in selecting two solutions, splitting them in the middle and crossing them; this can be done by levels, and an example can be seen in Figure 11.

Figure 8: a view of the precedence tree of product A grouped by level. Level 0: processes 1, 2 and 3; level 1: processes 4, 5 and 7; level 2: processes 6, 8 and 9; level 3: process 10; level 4: process 11.

Figure 9: a representation of product A in a level structure (levels 0 to 4).

The number of solutions that can be crossed is specified by the crossover rate (ρr). Once a solution is created, it is necessary to schedule its processes in order to obtain the time and cost of the solution.
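The level rule of equation (4.1) can be sketched as a small recursive computation; the dictionary layout and the helper name `resolve` are hypothetical, not from the paper.

```python
def start_times(predecessors: dict, duration: dict) -> dict:
    """Earliest start per Eq. (4.1): start(p) = max over parents of their
    finalization time; a process with no parents may start at time 0."""
    start, finish = {}, {}

    def resolve(p):
        if p in finish:
            return finish[p]
        start[p] = max((resolve(q) for q in predecessors.get(p, [])), default=0)
        finish[p] = start[p] + duration[p]
        return finish[p]

    for p in duration:
        resolve(p)
    return start

# Fragment of product A from the text: process 3 needs processes 4 and 5.
pred = {3: [4, 5]}
dur = {3: 10, 4: 40, 5: 20}
s = start_times(pred, dur)
print(s[3])  # 40: process 4 finishes at 40, process 5 at 20
```

This reproduces the example in the text: process 3 cannot start before the later of its two parents finishes.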
The programming algorithm (PA) takes a solution and programs all the processes of all the products of the production orders. It verifies whether a process can be executed with three validations: first, it verifies the process precedence; second, it verifies whether the resources are available for the execution of the process; lastly, it verifies whether the process can be executed within the working day. Once a process completes its execution, it sets the initial time of its children to its finalization time. PA can be seen in Figure 12. The mutation step of GAMHSO consists in the improvement of the solutions through local search (LS). GAMHSO works with three LS strategies: tabu search (TS), simulated annealing (SA) and the exchange deterministic algorithm (EDA).

5 Experimental settings

We tested GAMHSO on a computer with an AMD Turion 64 processor, 2 GB of RAM and a 120 GB hard disk. The test consisted in building 1000 products of type A (Figure 7). The parameters were Cr = 0.4, ρr = 0.5.

Figure 10: the chromosome for GAMHSO. Each solution in the set S has the same set of production orders, but sorted by EDD, SPT or LPT; each order has a set of products sorted by EDD, SPT or LPT; each product has a set of levels; and each level has a set of processes.

We tested the performance of GAMHSO with EDA, TS and SA. For a fair comparison, we use the inverted generational distance (IGD) metric [15], defined as follows: given a reference set A*, the IGD value of a set A ⊂ R^m is

IGD(A, A*) = (1 / |A*|) ∑_{v ∈ A*} min_{u ∈ A} d(u, v)   (5.1)

6 Results of GAMHSO

The Pareto fronts for each LS can be seen in Figure 13. The IGD values between the LSs can be seen in Table 1, and the running times of GAMHSO for each LS in Table 2.

Table 1: the IGD-metric values for each LS (EDA, SA, TS) against the rest of the LSs; an LS measured against itself gives 0.
1137.0605   7485.2654   6157.3669
6388.229    0           0
0           1134.2321   7367.1584
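Equation (5.1) can be implemented directly; the two-objective (cost, time) points below are invented for illustration. A front compared against itself gives IGD = 0, which is why an LS measured against itself contributes a zero in Table 1.

```python
import math

def igd(approx, reference):
    """Inverted generational distance, Eq. (5.1): average, over each point v
    of the reference set A*, of the distance from v to its nearest point of A."""
    def dist(u, v):
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
    return sum(min(dist(u, v) for u in approx) for v in reference) / len(reference)

# Hypothetical bi-objective fronts.
ref = [(0.0, 1.0), (0.5, 0.5), (1.0, 0.0)]
print(igd(ref, ref))                       # 0.0: the front covers itself
print(igd([(0.0, 1.0), (1.0, 0.0)], ref))  # > 0: the middle point is missed
```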
Table 2: running times of GAMHSO with each LS, in seconds. EDA: 32; TS: 57; SA: 128.

Figure 11: an example of the crossover step of GAMHSO. Solution 1 (e-d-g-h-c-f-a-b) and solution 2 (e-h-d-g-f-c-b-a) are split by level and crossed into the new solution 3 (e-h-d-g-c-f-a-b).

7 Conclusions and future works

We designed a new genetic algorithm for multiobjective hard scheduling optimization (GAMHSO). It works in very difficult scenarios of production systems. We also defined a chromosome for GAMHSO that avoids the creation of unfeasible solutions; consequently, it is not necessary to verify whether a solution produced by the crossover step is feasible, and the performance of the algorithm is satisfactory in comparison with the size of the feasible solution space. On the other hand, we state a new programming algorithm (PA) for the scheduling of a set of production orders. PA is a flexible algorithm that allows the incorporation of new restrictions on the processes, and it allows calculating the overall time and cost of a set of production orders. We made a fair comparison of the behavior of GAMHSO with local search strategies such as the exchange deterministic algorithm (EDA), tabu search (TS) and simulated annealing (SA); the best performance of GAMHSO was obtained with EDA and SA. Lastly, we will investigate a new chromosome that allows crossover between production orders.

Programming Algorithm(O as a set of production orders)
  cost = 0
  time = 0
  for each o in O (in order)
    for each P in o.products
      for each L in P.levels
        for each p in L.processes
          t = p.initialTime
          do
            while p cannot be executed at time t
              t = t + 1
            end while
            it = t
            while p can be executed at time t
              t = t + 1
            end while
            ft = t
            for all resources r used by p: r.programTask(it, ft)
            cost = cost + r.getCost
          until t = 0
          for all child processes pc of p:
            pc.initialTime = max(pc.initialTime, t)
          if t > time then
            time = t
          end if
        end for
      end for
    end for
  end for
  return time, cost
End Programming Algorithm

Figure 12: the framework of the programming algorithm (PA) for obtaining the overall cost and time of the programming of a set of production orders.

Figure 13: Pareto fronts of GAMHSO for each LS. Notice that TS is dominated by SA and EDA; GAMHSO shows similar behavior for SA and EDA.

Bibliography

[1] Jingjun Zhang, Yanhong Zhang, Ruizhen Gao, "Genetic algorithms for optimal design of vehicle suspensions", Engineering of Intelligent Systems, 2006 IEEE International Conference on, pp. 1-6, 2006. URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1703182&isnumber=35938
[2] Murphy, L., Abdel-Aty-Zohdy, H.S., Hashem-Sherif, M., "A genetic algorithm tracking model for product deployment in telecom services", Circuits and Systems, 2005, 48th Midwest Symposium on, pp. 1729-1732, Vol. 2, 7-10 Aug. 2005. URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1594454&isnumber=33557
[3] Guangyuan Liu, Jingjun Zhang, Ruizhen Gao, Yang Sun, "A coarse-grained genetic algorithm for the optimal design of the flexible multi-body model vehicle suspensions based on skeletons implementing", Intelligent Networks and Intelligent Systems, 2008, ICINIS '08, First International Conference on, pp. 139-142, 1-3 Nov. 2008. URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=4683187&isnumber=4683146
[4] Wu Ying, Li Bin, "Job-shop scheduling using genetic algorithm", Systems, Man, and Cybernetics, 1996, IEEE International Conference on, Vol. 3, pp. 1994-1999, 14-17 Oct. 1996.
url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=565434&isnumber=12283 [5] orero, s.o.; irving, m.r., “economic dispatch of generators with prohibited operating zones: a genetic algorithm approach”, generation, transmission and distribution, iee proceedings, vol.143, no.6, pp.529-534, nov 1996. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=556730&isnumber=12146 [6] zhao man; tan wei; li xiang; kang lishan, “research on multi-project scheduling problem based on hybrid genetic algorithm”, computer science and software engineering, 2008 international conference on , vol.1, no., pp.390-394, 12-14 dec. 2008. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=4721769&isnumber=4721668 [7] xianghui deng, “application of adaptive genetic algorithm in inversion analysis of permeability coefficients”, genetic and evolutionary computing, 2008. wgec ’08. second international conference on , vol., no., pp.61-65, 25-26 sept. 2008. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=4637395&isnumber=4637374 [8] yanrong hu; yang, s.x.; li-zhong xu; meng, q.-h., “a knowledge based genetic algorithm for path planning in unstructured mobile robot environments”, robotics and biomimetics, 2004. robio 2004. ieee international conference on , vol., no., pp.767-772, 22-26 aug. 2004. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1521879&isnumber=32545 [9] siu, n.; elghoneimy, e.; yunli wang; gruver, w.a.; fleetwood, m.; kotak, d.b., “rough mill component scheduling: heuristic search versus genetic algorithms” systems, man and cybernetics, 2004 ieee international conference on , vol.5, no., pp. 4226-4231 vol.5, 10-13 oct. 2004. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=1401194&isnumber=30426 [10] lee, k.y.; mohamed, p.s., “a real-coded genetic algorithm involving a hybrid crossover method for power plant control system design”, evolutionary computation, 2002. cec ’02. 836 e. niño, c. ardila, a. perez, y. 
donoso proceedings of the 2002 congress on , vol.2, no., pp.1069-1074, 2002. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10043914&isnumber=21687 [11] pepper, j.w.; golden, b.l.; wasil, e.a., “solving the traveling salesman problem with annealing-based heuristics: a computational study”, systems, man and cybernetics, part a: systems and humans, ieee transactions on , vol.32, no.1, pp.72-77, jan 2002. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=995530&isnumber=21479 [12] blesa, m.j.; hernandez, l.; xhafa, f., “parallel skeletons for tabu search method”, parallel and distributed systems, 2001. icpads 2001. proceedings. eighth international conference on , vol., no., pp.23-28, 2001. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=934797&isnumber=20222 [13] rose, j.; klebsch, w.; wolf, j., “temperature measurement and equilibrium dynamics of simulated annealing placements”, computer-aided design of integrated circuits and systems, ieee transactions on , vol.9, no.3, pp.253-259, mar 1990. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=46801&isnumber=1771 [14] nińo, e.;ardila, j. , algoritmo basado en automatas finitos deterministas para la obtención de óptimos globales en problemas de naturaleza combinatoria. revista de ingeniería y desarrollo. no 25. pp 100 114. issn 0122 3461. [15] minzhong liu; xiufen zou; yu chen; zhijian wu, “performance assessment of dmoea-dd with cec 2009 moea competition test instances”, evolutionary computation, 2009. cec ’09. ieee congress on , vol., no., pp.2913-2918, 18-21 may 2009. url: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=4983309&isnumber=4982922 [16] h. li and j.d. landa-silva, evolutionary multi-objective simulated annealing with adaptive and competitive search direction, proceedings of the 2008 ieee congress on evolutionary computation (cec 2008), ieee press, pp. 3310-3317, 01-06 june, 2008, hong kong. international journal of computers, communications & control vol. 
i (2006), no. 1, pp. 47-57
information aggregation in intelligent systems using generalized operators
imre j. rudas, jános fodor
abstract: aggregation of information represented by membership functions is a central matter in intelligent systems where a fuzzy rule base and a reasoning mechanism are applied. typical examples of such systems include, but are not limited to, fuzzy control, decision support and expert systems. since the advent of fuzzy sets a great number of fuzzy connectives and aggregation operators have been introduced. some families of such operators (like t-norms) have become standard in the field. nevertheless, it also became clear that these operators do not always follow the real phenomena being modeled. therefore, there is a natural need for finding new operators to develop more sophisticated intelligent systems. this paper summarizes the research results of the authors that have been carried out in recent years on the generalization of conventional operators.
keywords: t-norm, t-conorm, uninorm, entropy- and distance-based conjunctions and disjunctions.
1 introduction
information aggregation is one of the key issues in the development of intelligent systems. although fuzzy set theory provides a host of attractive aggregation operators for integrating the membership values representing uncertain information, the results do not always follow the modeled real phenomena, and it has been shown that in some situations some operations may work better than others. since the pioneering work of zadeh, basic research has been oriented towards the investigation of the properties of t-norms and t-conorms and towards finding new ones satisfying the axiom system. as a result, a great number of t-operators of various types have been introduced, all accepting the axiom system as a fixed, unchangeable skeleton. until the last few years no strong efforts were devoted to generalizing t-operators by weakening this axiom system.
on one hand, their sound theoretical foundation as well as their wide variety have given t-norms and t-conorms an almost exclusive role in different theoretical investigations and practical applications. on the other hand, people are inclined to use them also as a matter of routine. the following observations support this statement. when one works with binary conjunctions and there is no need to extend them to three or more arguments, associativity is an unnecessarily restrictive condition. the same is valid for commutativity if the two arguments have different semantic backgrounds and it makes no sense to interchange one with the other. these observations, which are very often left out of consideration, motivate our study and have urged us to revise definitions and properties of operations for information aggregation and reasoning.
2 traditional operations
the original fuzzy set theory was formulated in terms of zadeh's standard operations of intersection, union and complement. the axiomatic skeletons used for characterizing fuzzy intersection and fuzzy union are known as triangular norms (t-norms) and triangular conorms (t-conorms), respectively. for more details we refer to the book [9].
2.1 triangular norms and conorms
definition 1. a non-increasing function n : [0, 1] → [0, 1] satisfying n(0) = 1, n(1) = 0 is called a negation. a negation n is called strict if n is strictly decreasing and continuous. a strict negation n is said to be a strong negation if n is also involutive: n(n(x)) = x for all x ∈ [0, 1].
the standard negation is simply ns(x) = 1 − x, x ∈ [0, 1]. clearly, this negation is strong. it plays a key role in the representation of strong negations. we call a continuous, strictly increasing function ϕ : [0, 1] → [0, 1] with ϕ(0) = 0, ϕ(1) = 1 an automorphism of the unit interval. copyright © 2006 by ccc publications
definition 2.
a triangular norm (shortly: a t-norm) is a function t : [0, 1]² → [0, 1] which is associative, increasing and commutative, and satisfies the boundary condition t(1, x) = x for all x ∈ [0, 1].
definition 3. a triangular conorm (shortly: a t-conorm) is an associative, commutative, increasing function s : [0, 1]² → [0, 1] with boundary condition s(0, x) = x for all x ∈ [0, 1].
notice that continuity of a t-norm and a t-conorm is not taken for granted. in what follows we assume that t is a t-norm, s is a t-conorm and n is a strict negation. clearly, for every t-norm t and strong negation n, the operation s defined by
s(x, y) = n(t(n(x), n(y))), x, y ∈ [0, 1] (1)
is a t-conorm. in addition, t(x, y) = n(s(n(x), n(y))) (x, y ∈ [0, 1]). in this case s and t are called n-duals. in the case of the standard negation (i.e., when n(x) = 1 − x for x ∈ [0, 1]) we simply speak about duals. obviously, equality (1) expresses de morgan's law in the fuzzy case. generally, for any t-norm t and t-conorm s we have
tw(x, y) ≤ t(x, y) ≤ tm(x, y) and sm(x, y) ≤ s(x, y) ≤ ss(x, y),
where tm(x, y) = min(x, y), sm(x, y) = max(x, y), tw is the weakest t-norm, and ss is the strongest t-conorm. these inequalities are important from a practical point of view as they establish the boundaries of the possible range of the mappings t and s.
2.2 uninorms and nullnorms
uninorms
uninorms were introduced by yager and rybalov [19] as a generalization of t-norms and t-conorms. for uninorms, the neutral element is not forced to be either 0 or 1, but can be any value in the unit interval.
definition 4. [19] a uninorm u is a commutative, associative and increasing binary operator with a neutral element e ∈ [0, 1], i.e., for all x ∈ [0, 1] we have u(x, e) = x.
t-norms do not allow low values to be compensated by high values, while t-conorms do not allow high values to be compensated by low values. uninorms may allow values separated by their neutral element to be aggregated in a compensating way.
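as a quick numerical illustration of the duality relation (1), the following sketch (the function names are illustrative, not from the paper) builds the n-dual of the product t-norm under the standard negation and checks that it coincides with the probabilistic sum x + y − xy:

```python
# a minimal numeric check of relation (1): for a t-norm T and a strong
# negation N, S(x, y) = N(T(N(x), N(y))) is a t-conorm. with the standard
# negation N(x) = 1 - x and the product t-norm, the dual is the
# probabilistic sum x + y - xy.

def t_product(x, y):
    return x * y

def n_standard(x):
    return 1.0 - x

def dual_conorm(t, n):
    """build the n-dual t-conorm of t via S(x, y) = N(T(N(x), N(y)))."""
    return lambda x, y: n(t(n(x), n(y)))

s = dual_conorm(t_product, n_standard)

grid = [i / 10 for i in range(11)]
for x in grid:
    # boundary condition of a t-conorm: S(0, x) = x
    assert abs(s(0.0, x) - x) < 1e-12
    for y in grid:
        # the dual of the product is the probabilistic sum
        assert abs(s(x, y) - (x + y - x * y)) < 1e-12
```

the same helper applied to min yields max, its dual under the standard negation.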
the structure of uninorms was studied by fodor et al. [11]. for a uninorm u with neutral element e ∈ ]0, 1], the binary operator tu defined by
tu(x, y) = u(ex, ey)/e
is a t-norm; for a uninorm u with neutral element e ∈ [0, 1[, the binary operator su defined by
su(x, y) = (u(e + (1−e)x, e + (1−e)y) − e)/(1−e)
is a t-conorm. the structure of a uninorm with neutral element e ∈ ]0, 1[ on the squares [0, e]² and [e, 1]² is therefore closely related to t-norms and t-conorms. for e ∈ ]0, 1[, we denote by φe and ψe the linear transformations defined by φe(x) = x/e and ψe(x) = (x−e)/(1−e). to any uninorm u with neutral element e ∈ ]0, 1[, there corresponds a t-norm t and a t-conorm s such that:
(i) for any (x, y) ∈ [0, e]²: u(x, y) = φe⁻¹(t(φe(x), φe(y)));
(ii) for any (x, y) ∈ [e, 1]²: u(x, y) = ψe⁻¹(s(ψe(x), ψe(y))).
on the remaining part of the unit square, i.e. on e = [0, e[ × ]e, 1] ∪ ]e, 1] × [0, e[, it satisfies min(x, y) ≤ u(x, y) ≤ max(x, y), and could therefore partially show a compensating behaviour, i.e. take values strictly between minimum and maximum. note that any uninorm u is either conjunctive, i.e. u(0, 1) = u(1, 0) = 0, or disjunctive, i.e. u(0, 1) = u(1, 0) = 1.
representation of uninorms
in analogy to the representation of continuous archimedean t-norms and t-conorms in terms of additive generators, fodor et al. [11] have investigated the existence of uninorms with a similar representation in terms of a single-variable function. this search leads back to dombi's class of aggregative operators [7]. this work is also closely related to that of klement et al. on associative compensatory operators [15]. consider e ∈ ]0, 1[ and a strictly increasing continuous [0, 1] → r mapping h with h(0) = −∞, h(e) = 0 and h(1) = +∞.
the binary operator u defined by u(x, y) = h⁻¹(h(x) + h(y)) for any (x, y) ∈ [0, 1]² \ {(0, 1), (1, 0)}, and either u(0, 1) = u(1, 0) = 0 or u(0, 1) = u(1, 0) = 1, is a uninorm with neutral element e. the class of uninorms that can be constructed in this way has been characterized [11]. consider a uninorm u with neutral element e ∈ ]0, 1[; then there exists a strictly increasing continuous [0, 1] → r mapping h with h(0) = −∞, h(e) = 0 and h(1) = +∞ such that u(x, y) = h⁻¹(h(x) + h(y)) for any (x, y) ∈ [0, 1]² \ {(0, 1), (1, 0)} if and only if
(i) u is strictly increasing and continuous on ]0, 1[²;
(ii) there exists an involutive negator n with fixpoint e such that u(x, y) = n(u(n(x), n(y))) for any (x, y) ∈ [0, 1]² \ {(0, 1), (1, 0)}.
the uninorms characterized above are called representable uninorms. the mapping h is called an additive generator of u. the involutive negator corresponding to a representable uninorm u with additive generator h, as mentioned in condition (ii) above, is denoted nu and is given by
nu(x) = h⁻¹(−h(x)). (2)
clearly, any representable uninorm comes in a conjunctive and a disjunctive version, i.e. there always exist two representable uninorms that only differ in the points (0, 1) and (1, 0). representable uninorms are almost continuous, i.e. continuous except in (0, 1) and (1, 0), and archimedean, in the sense that (∀x ∈ ]0, e[)(u(x, x) < x) and (∀x ∈ ]e, 1[)(u(x, x) > x). clearly, representable uninorms are not idempotent. the classes umin and umax do not contain representable uninorms. a very important fact is that the underlying t-norm and t-conorm of a representable uninorm must be strict and cannot be nilpotent. moreover, given a strict t-norm t with decreasing additive generator f and a strict t-conorm s with increasing additive generator g, we can always construct a representable uninorm u with the desired neutral element e ∈ ]0, 1[ that has t and s as underlying t-norm and t-conorm.
it suffices to consider as additive generator the mapping h defined by
h(x) = −f(x/e) if x ≤ e, and h(x) = g((x−e)/(1−e)) if x ≥ e. (3)
on the other hand, the following property indicates that representable uninorms are in some sense also generalizations of nilpotent t-norms and nilpotent t-conorms: (∀x ∈ [0, 1])(u(x, nu(x)) = nu(e)). this claim is further supported by studying the residual operators of representable uninorms in [6]. as an example of the representable case, consider the additive generator h defined by h(x) = log(x/(1−x)); then the corresponding conjunctive representable uninorm u is given by u(x, y) = 0 if (x, y) ∈ {(1, 0), (0, 1)}, and
u(x, y) = xy / ((1−x)(1−y) + xy)
otherwise, and has 1/2 as neutral element. note that nu is the standard negator: nu(x) = 1 − x. the class of representable uninorms contains famous operators, such as the functions for combining certainty factors in the expert systems mycin (see [18, 5]) and prospector [5]. the mycin expert system was one of the first systems capable of reasoning under uncertainty [2]. to that end, certainty factors were introduced as numbers in the interval [−1, 1]. essential in the processing of these certainty factors is the modified combining function c proposed by van melle [2]. the [−1, 1]² → [−1, 1] mapping c is defined by
c(x, y) = x + y(1−x) if min(x, y) ≥ 0; c(x, y) = x + y(1+x) if max(x, y) ≤ 0; c(x, y) = (x + y)/(1 − min(|x|, |y|)) otherwise.
the definition of c is not clear in the points (−1, 1) and (1, −1), though it is understood that c(−1, 1) = c(1, −1) = −1. rescaling the function c to a binary operator on [0, 1], we obtain a representable uninorm with neutral element 1/2 and with the product and the probabilistic sum as underlying t-norm and t-conorm. implicitly, these results are contained in the book of hájek et al. [14], in the context of ordered abelian groups.
nullnorms
definition 5.
[3] a nullnorm v is a commutative, associative and increasing binary operator with an absorbing element a ∈ [0, 1], i.e. (∀x ∈ [0, 1])(v(x, a) = a), and that satisfies
(∀x ∈ [0, a])(v(x, 0) = x) (4)
(∀x ∈ [a, 1])(v(x, 1) = x) (5)
the absorbing element a corresponding to a nullnorm v is clearly unique. by definition, the case a = 0 leads back to t-norms, while the case a = 1 leads back to t-conorms. in the following theorem, we show that the structure of a nullnorm is similar to that of a uninorm. in particular, it can be shown that it is built up from a t-norm, a t-conorm and the absorbing element [3].
theorem 6. consider a ∈ [0, 1]. a binary operator v is a nullnorm with absorbing element a if and only if
(i) if a = 0: v is a t-norm;
(ii) if 0 < a < 1: there exist a t-norm tv and a t-conorm sv such that v(x, y) is given by
φa⁻¹(sv(φa(x), φa(y))) if (x, y) ∈ [0, a]², ψa⁻¹(tv(ψa(x), ψa(y))) if (x, y) ∈ [a, 1]², and a elsewhere; (6)
(iii) if a = 1: v is a t-conorm.
recall that for any t-norm t and t-conorm s it holds that t(x, y) ≤ min(x, y) ≤ max(x, y) ≤ s(x, y) for any (x, y) ∈ [0, 1]². hence, for a nullnorm v with absorbing element a it holds that (∀(x, y) ∈ [0, a]²)(v(x, y) ≥ max(x, y)) and (∀(x, y) ∈ [a, 1]²)(v(x, y) ≤ min(x, y)). clearly, for any nullnorm v with absorbing element a it holds for all x ∈ [0, 1] that
v(x, 0) = min(x, a) and v(x, 1) = max(x, a). (7)
notice that, without the additional conditions (4) and (5), it cannot be shown that a commutative, associative and increasing binary operator v with absorbing element a behaves as a t-conorm and a t-norm on the squares [0, a]² and [a, 1]². nullnorms are a generalization of the well-known median studied by fung and fu [13], which corresponds to the case t = min and s = max. for a more general treatment of this operator, we refer to [10]. we recall here the characterization of that median as given by czogala and drewniak [4].
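a numeric sketch of the construction in theorem 6, under the illustrative choices tv = min and sv = max; the helper name make_nullnorm is hypothetical, not from the paper:

```python
# sketch of theorem 6: a t-conorm rescaled to [0, a]^2, a t-norm rescaled
# to [a, 1]^2, and the absorbing element a everywhere else. here tv = min
# and sv = max are illustrative choices.

def make_nullnorm(a, tv=min, sv=max):
    def v(x, y):
        if x <= a and y <= a:        # rescaled t-conorm part on [0, a]^2
            return a * sv(x / a, y / a)
        if x >= a and y >= a:        # rescaled t-norm part on [a, 1]^2
            return a + (1 - a) * tv((x - a) / (1 - a), (y - a) / (1 - a))
        return a                     # absorbing element elsewhere
    return v

v = make_nullnorm(0.5)
assert abs(v(0.2, 0.9) - 0.5) < 1e-12   # mixed arguments absorb to a
assert abs(v(0.7, 0.5) - 0.5) < 1e-12   # a is absorbing: v(x, a) = a
assert abs(v(0.2, 0.0) - 0.2) < 1e-12   # condition (4): v(x, 0) = x on [0, a]
assert abs(v(0.8, 1.0) - 0.8) < 1e-12   # condition (5): v(x, 1) = x on [a, 1]
```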
firstly, they observe that an idempotent, associative and increasing binary operator o has absorbing element a ∈ [0, 1] if and only if o(0, 1) = o(1, 0) = a. then the following theorem can be proven.
theorem 7. [4] consider a ∈ [0, 1]. a continuous, idempotent, associative and increasing binary operator o satisfies o(0, 1) = o(1, 0) = a if and only if it is given by
o(x, y) = max(x, y) if (x, y) ∈ [0, a]², o(x, y) = min(x, y) if (x, y) ∈ [a, 1]², and o(x, y) = a elsewhere.
nullnorms are also a special case of the class of t-s aggregation functions introduced and studied by fodor and calvo [12].
definition 8. consider a continuous t-norm t and a continuous t-conorm s. a binary operator f is called a t-s aggregation function if it is increasing and commutative, and satisfies the boundary conditions
(∀x ∈ [0, 1])(f(x, 0) = t(f(1, 0), x))
(∀x ∈ [0, 1])(f(x, 1) = s(f(1, 0), x)).
when t is the algebraic product and s is the probabilistic sum, we recover the class of aggregation functions studied by mayor and trillas [17]. rephrasing a result of fodor and calvo, we can state that the class of associative t-s aggregation functions coincides with the class of nullnorms with underlying t-norm t and t-conorm s.
2.3 the role of commutativity and associativity
one possible way of simplifying the axiom skeletons of t-norms and t-conorms is not to require these operations to be commutative and associative. non-commutative and non-associative operations are widely used in mathematics, so why do we restrict our investigations by keeping these axioms? what are the requirements of the most typical applications? from a theoretical point of view the commutative law is not required, while the associative law is necessary to extend the operation to more than two variables.
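the last remark can be made concrete: associativity is exactly what allows a binary operation to be folded over any number of arguments without ambiguity. a minimal sketch with the product t-norm, using illustrative helper names t_product and extend:

```python
# associativity lets a binary operation be extended unambiguously to
# n arguments: the grouping of the fold does not change the result.

from functools import reduce

def t_product(x, y):
    return x * y

def extend(t, values):
    """n-ary extension t(x1, ..., xn) of an associative binary operation."""
    return reduce(t, values)

xs = [0.9, 0.5, 0.8, 0.6]
left = extend(t_product, xs)                          # ((x1 t x2) t x3) t x4
right = t_product(xs[0], extend(t_product, xs[1:]))   # x1 t (x2 t (x3 t x4))
assert abs(left - right) < 1e-12                      # same value either way
```

for a non-associative operation the two groupings above would in general differ, which is why the n-ary extension is then ill-defined.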
in applications such as fuzzy logic control, fuzzy expert systems and fuzzy systems modeling, a fuzzy rule base and a fuzzy inference mechanism are used, where the information aggregation is performed by operations. the inference procedures do not always require the commutative and associative laws of the operations used in these procedures. these properties are not necessary for conjunction operations used in the simplest fuzzy controllers with two inputs and one output. for rules with a greater number of inputs and outputs these properties are also not required if the sequence of variables in the rules is fixed. moreover, the non-commutativity of conjunction may in fact be desirable for rules because it can reflect different influences of the input variables on the output of the system. for example, in fuzzy control, the positions of the input variables, the "error" and the "change in error", in rules are usually fixed, and these variables have different influences on the output of the system. in application areas of fuzzy models where the sequence of operands is not fixed, the property of non-commutativity may not be desirable. later some examples will be given of parametric non-commutative and non-associative operations.
3 generalized conjunctions and disjunctions
the axiom systems of t-norms and t-conorms are very similar to each other except for the neutral element, i.e. the type is characterized by the neutral element. if the neutral element is equal to 1 then the operation is of conjunction type, while if the neutral element is zero the disjunction operation is obtained. by using these properties we introduce the concepts of conjunction and disjunction operations [1].
definition 9. let t be a mapping t : [0, 1] × [0, 1] → [0, 1]. t is a conjunction operation if t(x, 1) = x for all x ∈ [0, 1].
definition 10. let s be a mapping s : [0, 1] × [0, 1] → [0, 1]. s is a disjunction operation if s(x, 0) = x for all x ∈ [0, 1].
conjunction and disjunction operations may also be obtained one from another by means of an involutive negation n: s(x, y) = n(t(n(x), n(y))), and t(x, y) = n(s(n(x), n(y))). it can be seen easily that conjunction and disjunction operations satisfy the following boundary conditions: t(1, 1) = 1, t(0, x) = t(x, 0) = 0, s(0, 0) = 0, s(1, x) = s(x, 1) = 1. by fixing these conditions, new types of generalized operations are introduced.
definition 11. let t be a mapping t : [0, 1] × [0, 1] → [0, 1]. t is a quasi-conjunction operation if t(0, 0) = t(0, 1) = t(1, 0) = 0, and t(1, 1) = 1.
definition 12. let s be a mapping s : [0, 1] × [0, 1] → [0, 1]. s is a quasi-disjunction operation if s(0, 1) = s(1, 0) = s(1, 1) = 1, and s(0, 0) = 0.
it is easy to see that conjunction and disjunction operations are quasi-conjunctions and quasi-disjunctions, respectively, but the converse is not true. omitting t(1, 1) = 1 and s(0, 0) = 0 from the definitions, a further generalization is obtained.
definition 13. let t be a mapping t : [0, 1] × [0, 1] → [0, 1]. t is a pseudo-conjunction operation if t(0, 0) = t(0, 1) = t(1, 0) = 0.
definition 14. let s be a mapping s : [0, 1] × [0, 1] → [0, 1]. s is a pseudo-disjunction operation if s(0, 1) = s(1, 0) = s(1, 1) = 1.
theorem 15. assume that t and s are non-decreasing pseudo-conjunctions and pseudo-disjunctions, respectively. then 0 and 1 are absorbing elements, i.e. t(x, 0) = t(0, x) = 0 and s(x, 1) = s(1, x) = 1.
3.1 entropy-based conjunction and disjunction operators
the question of how fuzzy a fuzzy set is has been one of the issues associated with the development of fuzzy set theory. in accordance with a current terminological trend in the literature, a measure of uncertainty is referred to as a measure of fuzziness, or fuzzy entropy [16].
throughout this part the following notations will be used: x is the universal set, f(x) is the class of all fuzzy subsets of x, ℜ+ is the set of non-negative real numbers, ā is the fuzzy complement of a ∈ f(x) and |a| is the cardinality of a.
definition 16. let x be a universal set and a a fuzzy subset of x with membership function µa. a fuzzy entropy is a mapping e : f(x) → ℜ+ which satisfies the following axioms:
ae 1: e(a) = 0 if a is a crisp set.
ae 2: if a ≺ b then e(a) ≤ e(b), where a ≺ b means that a is sharper than b.
ae 3: e(a) assumes its maximum value if and only if a is maximally fuzzy.
ae 4: e(a) = e(ā) for all a ∈ f(x).
let ep be the equilibrium of the fuzzy complement c, and specify ae 2 and ae 3 as follows:
aes 2: a is sharper than b in the following sense: µa(x) ≤ µb(x) for µb(x) ≤ ep and µa(x) ≥ µb(x) for µb(x) ≥ ep, for all x ∈ x.
aes 3: a is maximally fuzzy when µa(x) = ep for all x ∈ x.
let a be a fuzzy subset of x and define the function fa : x → [0, 1] by
fa(x) = µa(x) if µa(x) ≤ ep, and fa(x) = c(µa(x)) if µa(x) > ep. (8)
denote by φa the fuzzy set having fa as its membership function.
theorem 17. g(|φa|) is an entropy, where g : ℜ → ℜ is a monotonically increasing real function with g(0) = 0.
definition 18. let a be a fuzzy subset of x. fa is said to be an elementary fuzzy entropy function if the cardinality of the fuzzy set φa = {(x, fa(x)) | x ∈ x, fa(x) ∈ [0, 1]} is an entropy of a.
it is obvious that fa is an elementary entropy function. now we introduce some operations based on entropy. for more details we refer to [1].
definition 19. let a and b be two fuzzy subsets of the universe of discourse x and denote by ϕa and ϕb their elementary entropy functions, respectively. the minimum entropy conjunction operation is defined as i∗ϕ = i∗ϕ(a, b) = {(x, µi∗ϕ(x)) | x ∈ x, µi∗ϕ(x) ∈ [0, 1]}, where
µi∗ϕ(x) = µa(x) if ϕa(x) < ϕb(x), µb(x) if ϕb(x) < ϕa(x), and min(µa(x), µb(x)) if ϕa(x) = ϕb(x).
(9)
definition 20. let a and b be two fuzzy subsets of the universe of discourse x and denote by ϕa and ϕb their elementary entropy functions, respectively. the maximum entropy disjunction operation is defined as u∗ϕ = u∗ϕ(a, b) = {(x, µu∗ϕ(x)) | x ∈ x, µu∗ϕ(x) ∈ [0, 1]}, where
µu∗ϕ(x) = µa(x) if ϕa(x) > ϕb(x), µb(x) if ϕb(x) > ϕa(x), and max(µa(x), µb(x)) if ϕa(x) = ϕb(x). (10)
figure 1: entropy-based conjunction operator (left) and entropy-based disjunction operator (right).
figure 2: the construction of i∗ϕ (left) and the construction of u∗ϕ (right).
the geometrical representation of the minimum fuzziness conjunction and the maximum fuzziness disjunction operators can be seen in fig. 1. several important properties of these operations as well as their construction can be found in [1]. here we present only two figures about the construction. notice also that i∗ϕ is a quasi-conjunction, u∗ϕ is a quasi-disjunction operation, and u∗ϕ is a commutative semigroup operation on [0, 1] [1].
3.2 a parametric family of quasi-conjunctions
let us cite the following result, which is the basis of the forthcoming parametric construction, from [1].
theorem 21. suppose t1, t2 are quasi-conjunctions, s1 and s2 are pseudo-disjunctions and h, g1, g2 : [0, 1] → [0, 1] are non-decreasing functions such that g1(1) = g2(1) = 1. then the following functions
t(x, y) = t2(t1(x, y), s1(g1(x), g2(y))) (11)
t(x, y) = t2(t1(x, y), g1(s1(x, y))) (12)
t(x, y) = t2(t1(x, y), s2(h(x), s1(x, y))) (13)
are quasi-conjunctions.
by the use of this theorem the simplest parametric quasi-conjunction operations can be obtained as follows:
t(x, y) = x^p y^q, (14)
t(x, y) = min(x^p, y^q), (15)
t(x, y) = (xy)^p (x + y − xy)^q, (16)
where p, q ≥ 0.
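the boundary conditions of definition 11 can be checked directly for the three parametric families (14)-(16); the following sketch (the function names t14-t16 are illustrative only) also shows that these operations are non-commutative whenever p ≠ q:

```python
# the parametric quasi-conjunctions (14)-(16), checked against the
# quasi-conjunction boundary conditions of definition 11:
# t(0,0) = t(0,1) = t(1,0) = 0 and t(1,1) = 1, for p, q > 0.

def t14(x, y, p, q):
    return x**p * y**q

def t15(x, y, p, q):
    return min(x**p, y**q)

def t16(x, y, p, q):
    # product t-norm raised to p times probabilistic sum raised to q
    return (x * y)**p * (x + y - x * y)**q

for t in (t14, t15, t16):
    for p, q in [(1.0, 2.0), (0.5, 3.0)]:
        assert t(0, 0, p, q) == 0 and t(0, 1, p, q) == 0 and t(1, 0, p, q) == 0
        assert t(1, 1, p, q) == 1

# non-commutativity whenever p != q:
assert t14(0.4, 0.9, 1.0, 2.0) != t14(0.9, 0.4, 1.0, 2.0)
```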
4 distance-based operations
let e be an arbitrary element of the closed unit interval [0, 1] and denote by d(x, y) the distance of two elements x and y of [0, 1]. the idea of the definitions of the distance-based operators comes from the reformulation of the definitions of the min and max operators as follows:
min(x, y) = x if d(x, 0) ≤ d(y, 0), and y if d(x, 0) > d(y, 0);
max(x, y) = x if d(x, 0) ≥ d(y, 0), and y if d(x, 0) < d(y, 0).
based on this observation the following definitions can be introduced, see [1].
definition 22. the maximum distance minimum operator with respect to e ∈ [0, 1] is defined as
minmax_e(x, y) = x if d(x, e) > d(y, e), y if d(x, e) < d(y, e), and min(x, y) if d(x, e) = d(y, e). (17)
definition 23. the maximum distance maximum operator with respect to e ∈ [0, 1] is defined as
maxmax_e(x, y) = x if d(x, e) > d(y, e), y if d(x, e) < d(y, e), and max(x, y) if d(x, e) = d(y, e). (18)
definition 24. the minimum distance minimum operator with respect to e ∈ [0, 1] is defined as
minmin_e(x, y) = x if d(x, e) < d(y, e), y if d(x, e) > d(y, e), and min(x, y) if d(x, e) = d(y, e). (19)
definition 25. the minimum distance maximum operator with respect to e ∈ [0, 1] is defined as
maxmin_e(x, y) = x if d(x, e) < d(y, e), y if d(x, e) > d(y, e), and max(x, y) if d(x, e) = d(y, e). (20)
4.1 the structure of distance-based operators
it can be proved by simple computation that if the distance of x and y is defined as d(x, y) = |x − y|, then the distance-based operators can be expressed by means of the min and max operators as follows.
minmax_e(x, y) = max(x, y) if y > 2e − x, min(x, y) if y < 2e − x, and min(x, y) if y = 2e − x;
minmin_e(x, y) = min(x, y) if y > 2e − x, max(x, y) if y < 2e − x, and min(x, y) if y = 2e − x; (21)
maxmax_e(x, y) = max(x, y) if y > 2e − x, min(x, y) if y < 2e − x, and max(x, y) if y = 2e − x;
maxmin_e(x, y) = min(x, y) if y > 2e − x, max(x, y) if y < 2e − x, and max(x, y) if y = 2e − x. (22)
the structures of the maxmin_e and the minmin_e operators are illustrated in fig. 3.
figure 3: maximum distance minimum operator (left) and minimum distance minimum operator (right).
5 summary and conclusions
in this paper we summarized some of our contributions to the theory of non-conventional aggregation operators. further details and other classes of aggregation operators can be found in [1].
references
[1] i. batyrshin, o. kaynak, and i. rudas, fuzzy modeling based on generalized conjunction operations, ieee transactions on fuzzy systems, vol. 10, no. 5 (2002), pp. 678–683.
[2] b. buchanan and e. shortliffe, rule-based expert systems, the mycin experiments of the stanford heuristic programming project, addison-wesley, reading, ma, 1984.
[3] t. calvo, b. de baets and j. fodor, the functional equations of frank and alsina for uninorms and nullnorms, fuzzy sets and systems 120 (2001) 385–394.
[4] e. czogala and j. drewniak, associative monotonic operations in fuzzy set theory, fuzzy sets and systems 12 (1984), 249–269.
[5] b. de baets and j. fodor, van melle's combining function in mycin is a representable uninorm: an alternative proof, fuzzy sets and systems 104 (1999) 133–136.
[6] b. de baets and j. fodor, residual operators of uninorms, soft computing 3 (1999), 89–100.
[7] j. dombi, basic concepts for the theory of evaluation: the aggregative operator, european j. oper. res. 10 (1982), 282–293.
[8] j.c. fodor, a new look at fuzzy connectives, fuzzy sets and systems 57 (1993) 141–148.
[9] j.c.
fodor and m. roubens, fuzzy preference modelling and multicriteria decision support, kluwer, dordrecht, 1994.
[10] j. fodor, an extension of fung–fu's theorem, internat. j. uncertain. fuzziness knowledge-based systems 4 (1996), 235–243.
[11] j. fodor, r. yager and a. rybalov, structure of uninorms, internat. j. uncertain. fuzziness knowledge-based systems 5 (1997) 411–427.
[12] j. fodor and t. calvo, aggregation functions defined by t-norms and t-conorms, aggregation and fusion of imperfect information (b. bouchon-meunier, ed.), physica-verlag, 1998, pp. 36–48.
[13] l. fung and k. fu, an axiomatic approach to rational decision-making in a fuzzy environment, fuzzy sets and their applications to cognitive and decision processes (k. tanaka, l. zadeh, k. fu and m. shimura, eds.), academic press, new york, san francisco, london, 1975, pp. 227–256.
[14] p. hájek, t. havránek and r. jiroušek, uncertain information processing in expert systems, crc press, 1992.
[15] e.-p. klement, r. mesiar and e. pap, on the relationship of associative compensatory operators to triangular norms and conorms, internat. j. uncertain. fuzziness knowledge-based systems 4 (1996), 129–144.
[16] j.g. klir and t.a. folger, fuzzy sets, uncertainty, and information, prentice-hall international editions, usa, 1988.
[17] g. mayor and e. trillas, on the representation of some aggregation functions, proc. internat. symposium on multiple-valued logic, 1986, pp. 110–114.
[18] a. tsadiras and k. margaritis, the mycin certainty factor handling function as uninorm operator and its use as a threshold function in artificial neurons, fuzzy sets and systems 93 (1998), 263–274.
[19] r. yager and a. rybalov, uninorm aggregation operators, fuzzy sets and systems 80 (1996) 111–120.
imre j.
rudas, jános fodor
budapest tech, institute of intelligent engineering systems
address: bécsi út 96/b, h-1034 budapest, hungary
e-mail: {rudas,fodor}@bmf.hu
the research has been supported in part by otka t046762.
editor's note about the authors:
imre j. rudas graduated from bánki donát polytechnic, budapest in 1971, received the master's degree in mathematics from the eötvös loránd university, budapest, the ph.d. in robotics from the hungarian academy of sciences in 1987, and the doctor of science degree from the hungarian academy of sciences. he received his first doctor honoris causa degree from the technical university of kosice, slovakia and his second honorary doctorate from university polytechnica timisoara, romania. he is active as a full university professor and head of the department of intelligent engineering systems. he has served as the rector of budapest tech since august 1, 2003, for a period of four years. he is a fellow of ieee, an administrative committee member of the ieee industrial electronics society, and chairman of the hungarian chapters of the ieee computational intelligence and ieee systems, man and cybernetics societies. he is also a registered expert of the united nations industrial development organization and the eu. he is the president of the hungarian fuzzy association and a steering committee member of the hungarian robotics association and the john von neumann computer society. he is a member of the editorial boards of several leading international scientific journals and founder of the ieee international conference series on intelligent engineering systems and the ieee international conference on computational cybernetics, as well as some regional symposia. his present areas of research activity are: robotics with special emphasis on robot control, soft computing, computer-aided process planning, fuzzy control and fuzzy sets. he has published one book and more than 300 papers in various journals and international conference proceedings.
Information Aggregation in Intelligent Systems Using Generalized Operators

János Fodor obtained his master's degree in mathematics in 1981 and his university doctorate in 1984. In 2000 he habilitated at Roland Eötvös University, Budapest, Hungary. He defended his CSc and DSc degrees in 1991 and 2004, respectively, at the Hungarian Academy of Sciences. In 1997 he won the "Széchenyi Professorship" in the first phase of this award. At the beginning of his research activity he was involved in stochastic problems of operations research, and later in the fields of fuzzy logics, preference modeling, decision making, computational intelligence, intelligent control, and the mathematical foundations of the management of uncertainties. He is the author of two monographs (issued by Kluwer and Springer) and of about 200 other scientific publications. He has served as the editor of two scientific books, and serves as the editor of various journal issues. His works have received more than 700 independent citations. He is a member of the editorial boards of various leading international scientific journals. János Fodor is co-chairman of the Hungarian Fuzzy Association and of the EUROFUSE EURO Working Group on Fuzzy Sets. He is a member of the Operations Research Committee of the Hungarian Academy of Sciences, and of the jury of Electronics and Electrical Sciences of the Hungarian Research Fund (OTKA). He regularly leads research projects supported by diverse funds and resources. He has many times served as an invited professor at various universities in Italy, Spain, Belgium, and France.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 3, pp. 71-83

Multiobjective Optimization Scheduling Problems by Pareto-Optimality in Agro-Alimentary Workshop

Fatma Tangour, Ihsen Saad

Abstract: This paper deals with the multiobjective optimization problem of an agro-alimentary production workshop.
Three criteria are considered in addition to the initial cost of production: the cost of the out-of-date products, the cost of the distribution discount and the makespan, and a new coding is proposed for this type of workshop. The adopted approach consists in generating optimal solutions diversified in the search space, helping the decision maker to make a good decision with respect to the quoted criteria when he cannot give a particular preference to one of the objective functions.

Keywords: agro-alimentary workshop, scheduling problems, genetic algorithms, Pareto-optimality, multiobjective optimization, production cost, makespan.

1 Introduction

Multi-objective optimization aims to optimize several components of an objective function vector. Contrary to the mono-objective case, a multi-objective problem usually does not have a single solution optimizing all the concerned criteria, but a set of solutions, known as the set of Pareto-optimal solutions. Any solution of this set is optimal in the sense that no improvement can be made on one component without degrading at least one other component of the vector [15]. Given that a solution chosen by one decision maker may not be acceptable to another, it proves useful to offer several alternatives for the choice of a Pareto-optimal solution [18]. In this article, the scheduling problems of agro-alimentary production workshops are dealt with [6]. The principal objective is to search for a feasible schedule minimizing the makespan, the cost of the out-of-date products and the cost of the distribution discount. Transformation methods turning multi-objective problems into mono-objective problems are applied [2]. This article is organized as follows. The one-machine scheduling problem is formulated in Section 2; the resolution approach suggested for this problem is described in Section 3. The effectiveness of this approach is tested on some examples in Section 4.
2 Problem Formulation

The problem is to build a multi-objective one-machine scheduling problem adapted to agro-alimentary industries. Among the constraints and criteria specific to the agro-food industry, the out-of-dating of products and the distribution discount can be distinguished. The objective is then to select, among the feasible schedules, the one presenting the best compromise between the various criteria [7]. The goal of this study is then to minimise the following criteria:

• C1: the cost of the out-of-date products,
• C2: the cost of the distribution discount,
• C3: the makespan.

Copyright © 2006 by CCC Publications

The basic production cost of the one-machine problem is supposed independent of the schedule. The data of the considered case are as follows: we have a set N of operations; each operation is characterised by its earliest starting time, its effective starting time, its processing time and its effective completion time.

Notations:
• t_i: effective starting time of operation O_i,
• r_i: earliest starting time of operation O_i,
• γ_i: effective completion time of operation O_i,
• p_i: processing time of operation O_i,
• P_i: finished product of operation O_i,
• c_ik: k-th component of the component set of operation O_i,
• v_ik: validity limit date of component c_ik,
• cp_i: completion time of product P_i,
• dliv_Pi: delivery date of product P_i,
• dv_Pi: lifespan of product P_i,
• dr_Pi: return delay of product P_i,
• prev_ik: cost price of component c_ik of product P_i,
• pven_Pi: unit selling price of product P_i,
• cstk_Pi: cost of storage per unit of time of one unit of product P_i.

2.1 Criteria Formulations

Three criteria are considered. The first two are criteria specific to agro-alimentary production workshops [16]. The last criterion is a traditional one, used for the optimization of scheduling problems in a traditional production workshop.
The considered objectives relate to minimization:

• C1: the cost of the out-of-date products

  C1 = Σ_i Σ_k prev_ik · max(0, t_i − v_ik) / (t_i − v_ik)   (1)

• C2: the cost of the distribution discount

  C2 = Σ_i max(0, dliv_Pi − cp_i) × ( pven_Pi / (dv_Pi − dr_Pi) + cstk_Pi )   (2)

• C3: the makespan

  C3 = Σ_i (t_i + p_i)   (3)

2.2 Lower Bound Formulations

Proposition 1. C_i ≥ 0, ∀i ∈ {1, 2}, and Cb_i = 0, where Cb_i represents the lower bound of criterion C_i.

Proposition 2. The lower bound of the makespan, Cb_3, is defined as follows:

  Cb_3 = Σ_i min(r_i + p_i)   (4)

Proof. Since C3 = max_{1≤i≤n} γ_i and t_i ≥ r_i, we have

  Cb_3 = min [ Σ_i (t_i + p_i) ]   (5)

  Cb_3 = Σ_i min(r_i + p_i)   (6)

3 Genetic Algorithms Application for the Scheduling Problems

3.1 Principle

Various approaches have been proposed to solve scheduling optimization problems; among them the genetic algorithm (GA) approach can be distinguished. This approach has been largely adopted in recent years [11], [14]. The use of GAs in many fields has proved reliable, in particular for combinatorial problems such as scheduling problems [3], [4], [12]. Other hybrid algorithms have also been proposed [1], [8]. The main difficulty in the resolution of these problem types lies in their algorithmic representation, which constitutes the most significant point in genetic search. Several representation approaches and various standard GA operators have been proposed to solve these problems, among them the representation based on priority rules [5]. The principle of a simple genetic algorithm is shown in Figure 1.

3.2 Proposed Genetic Algorithm Coding

The coding proposed for the application case is the Ordered Operations Coding List ("OOCL"), Table 1. Inspired by the CLO (operation list coding) [9] and the CPM (parallel machines coding) [10], it consists in proposing ordered lists for the product line.
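As a concrete illustration, the three criteria of eqs. (1)-(3) and the makespan lower bound of eq. (4) can be sketched in Python. This is only a sketch: the dictionary field names (`t`, `p`, `r`, `components`, `dliv`, `cp`, `pven`, `dv`, `dr`, `cstk`) are illustrative choices, not notation fixed by the paper.

```python
def c1(ops):
    """Eq. (1): cost of out-of-date products. The ratio
    max(0, t_i - v_ik) / (t_i - v_ik) equals 1 when t_i > v_ik and 0
    otherwise, so prev_ik is charged only for components already past
    their validity date when the operation starts."""
    total = 0.0
    for op in ops:
        for prev_ik, v_ik in op["components"]:  # (cost price, validity limit date)
            if op["t"] > v_ik:
                total += prev_ik
    return total

def c2(products):
    """Eq. (2): cost of the distribution discount."""
    return sum(max(0.0, pr["dliv"] - pr["cp"])
               * (pr["pven"] / (pr["dv"] - pr["dr"]) + pr["cstk"])
               for pr in products)

def c3(ops):
    """Eq. (3): makespan criterion as written, the sum of t_i + p_i."""
    return sum(op["t"] + op["p"] for op in ops)

def cb3(ops):
    """Eq. (4): lower bound of C3, using t_i >= r_i."""
    return sum(op["r"] + op["p"] for op in ops)
```

Note that `c1` exploits the fact that the fraction in eq. (1) acts as a step function: it contributes `prev_ik` exactly when the operation starts after the component's validity date.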
The proposed coding defines the order, the starting time and the completion time of the operations. These dates are calculated and updated by the "dates calculation algorithm", Table 2.

Table 1: OOCL coding
position: 1 | 2 | 3 | 4 | ...
entry: (O2, t2, γ2) | (O1, t1, γ1) | (O4, t4, γ4) | ...

Table 2: Dates calculation algorithm
t_i: effective starting time of operation O_i, t_i = max(r_i, γ_j)
γ_i: completion time of operation O_i, γ_i = max(r_i, γ_j) + p_i
where γ_j represents the completion time of the operation O_j that precedes O_i.

The operators used for this coding are: mutation, one-point crossover and two-point crossover. The mutation operator chooses two points of the same individual (list) to generate another individual, Table 3. The one-point crossover operator chooses two parent individuals to generate two child individuals starting from a single point, Table 4, and the two-point crossover operator chooses two individuals to generate two other individuals starting from two points, Table 5.

Figure 1: Genetic algorithm principle (selection, evaluation, then crossover with probability pc and mutation with probability pm lead from population generation i to generation i+1).

Table 3: Mutation algorithm
Beginning
1. Choose two positions i and j of the same individual; to each position corresponds an operation O_i and O_j,
2. Permute the operations O_i and O_j to obtain the child,
3. Update the child,
4. Calculate C1, C2, C3 of the new individual according to the "dates calculation algorithm".
End

3.3 Multi-Objective Evaluation Approach

Generally, the considered criteria present nonlinear and complex relations and do not have the same importance from the point of view of the decision maker. Many considerations can be retained to take account of all these difficulties. With this intention, a fuzzy evaluation method is proposed.
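The dates calculation algorithm (Table 2) and the mutation operator (Table 3) above can be sketched as follows, assuming a single machine so that each operation's predecessor is simply the previous element of the ordered list; the function and variable names are illustrative.

```python
import random

def schedule_dates(order, r, p):
    """Table 2: t_i = max(r_i, gamma_j) and gamma_i = max(r_i, gamma_j) + p_i,
    where gamma_j is the completion time of the preceding operation."""
    t, gamma = {}, {}
    gamma_prev = 0  # no predecessor before the first operation in the list
    for i in order:
        t[i] = max(r[i], gamma_prev)
        gamma[i] = t[i] + p[i]
        gamma_prev = gamma[i]
    return t, gamma

def mutate(order, rng=random):
    """Table 3: swap the operations at two positions of one individual;
    the dates (and hence C1, C2, C3) are then recomputed for the child."""
    child = list(order)
    i, j = rng.sample(range(len(child)), 2)
    child[i], child[j] = child[j], child[i]
    return child
```

Decoding an OOCL list with `schedule_dates` and then evaluating the criteria on the resulting `(t, gamma)` pair is exactly the "update the child, calculate C1, C2, C3" step shared by the mutation and crossover algorithms.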
This method is based on the following steps [13]. For each objective function a lower bound is calculated such that:

  C_i(x) ≥ Cb_i, ∀x ∈ S, 1 ≤ i ≤ nc   (7)

where S represents the space of feasible solutions and nc the number of objective functions. The fuzzification is applied by the functions described in Figure 2.

Figure 2: Fuzzy application in the resolution of the scale problem (membership µ over the interval [Cb_i, Ch_i + ε], subsets B_i and M_i).

For each feasible solution x, a vector C(x) is associated, C(x) ∈ [Cb_1, +∞) × ... × [Cb_nc, +∞), with C(x) = (C_1(x), ..., C_nc(x))^T. For each vector C(x), a fuzzification of its components is proposed and

Table 4: Crossover I algorithm
Beginning
1. Choose two individuals P1 and P2, and a crossover point,
2. Go through all the operations:
   while i < n do
   • if j < i then
     – copy the operations of P1 into child1,
     – copy the operations of P2 into child2,
   • else copy from P2 (respectively P1), at the same position, the missing operations into child1 (respectively child2),
   • end if
   end while
3. Finish the construction of child1 (respectively child2) with the missing operations (respecting the order),
4. Update child1 and child2,
5. Calculate C1, C2, C3 of the two new individuals according to the "dates calculation algorithm".
End

Table 5: Crossover II algorithm
Beginning
1. Choose two individuals P1 and P2 and two crossover points i and k,
2.
Go through all the operations:
   while i < n do
   • copy the operations of P1 which precede the first crossover point and which follow the second crossover point into child1,
   • copy the operations of P2 which precede the first crossover point and which follow the second crossover point into child2,
   • copy, at the same position, the missing operations of P2 into child1,
   • copy, at the same position, the missing operations of P1 into child2,
   end while
3. Finish the construction of child1 (respectively child2) with the missing operations (respecting the order),
4. Update child1 and child2,
5. Calculate C1, C2, C3 of the two new individuals according to the "dates calculation algorithm".
End

considered as two subsets B_i and M_i, Figure 2. If C_i(x) ∈ [Cb_i, Ch_i + ε] then

  µ_Bi(C_i(x)) = (Ch_i − C_i(x) + ε) / (Ch_i − Cb_i + ε), else µ_Bi(C_i(x)) = 0   (8)

where Ch_i represents the maximum value of the solution given by a considered heuristic according to the i-th objective function. µ_Bi(C_i(x)) is considered as the fuzzy measurement of C_i(x) in the subset B_i. Then the quality of each solution is characterized by the vector Cb(x), all of whose components are homogeneous since they belong to the same interval and are all dimensionless:

  Cb(x) = (a_1, ..., a_nc)^T, a_i = µ_Bi(C_i(x)), ∀i = 1, 2, ..., nc   (9)

For the multi-objective evaluation, the objective function Cg(x) is reduced to the minimization of the balanced sum of the criteria, relating to the use of the OWA aggregation operator [17]:

  Cg(x) = Σ_{i=1}^{nc} w_i a_i   (10)

A set of Pareto-optimal solutions is built without privileging a particular search direction, to help the decision maker when he cannot clearly give a particular preference to one objective function. This approach is based on an algorithm in which the objective function Cg(.), defined in relation (10), is used for the evaluation of solutions.
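The fuzzy measure of eq. (8) and the aggregated evaluation of eqs. (9)-(10) can be sketched as follows. This is a sketch only: the heuristic reference value Ch_i and ε are passed in explicitly, and a plain weighted sum stands in for the OWA-based aggregation of [17].

```python
def mu_b(c, cb, ch, eps=1e-6):
    """Eq. (8): fuzzy measurement of criterion value c in subset B_i.
    Equals 1 at the lower bound cb and decreases linearly towards ch + eps;
    it is 0 outside the interval [cb, ch + eps]."""
    if cb <= c <= ch + eps:
        return (ch - c + eps) / (ch - cb + eps)
    return 0.0

def cg(c_values, bounds, weights, eps=1e-6):
    """Eqs. (9)-(10): dimensionless components a_i aggregated by a
    weighted sum; bounds is a list of (cb_i, ch_i) pairs."""
    a = [mu_b(c, cb, ch, eps) for c, (cb, ch) in zip(c_values, bounds)]
    return sum(w * ai for w, ai in zip(weights, a))
```

Because each a_i lies in [0, 1], criteria with very different scales (costs versus makespan) become directly comparable before aggregation, which is the point of the fuzzification step.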
Weights w_i (1 ≤ i ≤ nc) are calculated by using a fuzzy rule. The idea is to measure the average quality of the solutions according to each criterion at each iteration and to calculate the various weights according to the degree of this quality. The goal is to study the possible gains and improvements of the solutions by giving priority to the optimization of the objective functions whose average values are far from the lower bound. This approach is called the aggregative approach with dynamic search direction. Let c̄_i^k be the average of the solutions of the i-th objective function found at the k-th iteration:

  c̄_i^k = Σ_{x∈Pk} C_i^k(x) / card(Pk)   (11)

where Pk represents the population of solutions at this iteration.

Figure 3: Criteria membership function (subsets "P" and "L" over the interval [Cb_i, c̄_i^0 + ε′]).

For each vector C(x), a fuzzification is applied to its components C_i(x) according to their positions in the interval [Cb_i, c̄_i^0 + ε′], where ε′ is a small positive value introduced to avoid division by zero: if c̄_i^0 = Cb_i then ε′ = 0.1 Cb_i, else ε′ = 0.

The evaluation of the solution quality is done by using the membership functions defined in Figure 3, relating to the two fuzzy subsets "P" and "L" of the lower bound. The membership functions can thus be formulated as follows:

  if c̄_i^k ∈ [Cb_i, c̄_i^0 + ε′] then µ_Li^k(c̄_i^k) = (c̄_i^k − Cb_i) / (c̄_i^0 − Cb_i + ε′), else µ_Li^k(c̄_i^k) = 1   (12)

The calculation of the various weights is carried out by using the two following fuzzy rules:

• if (c̄_i^k is "P" from Cb_i) then (w_i^{k+1} decreases),
• if (c̄_i^k is "L" from Cb_i) then (w_i^{k+1} increases),

which lead to the following expression:

  w_i^k = µ_Li^k(c̄_i^k) / Σ_{j=1}^{nc} µ_Lj^k(c̄_j^k), ∀i, ∀k   (13)

where 1 ≤ i ≤ nc and 2 ≤ k ≤ q, with q the total number of iterations and "L" the index relating to the fuzzy subset.
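The dynamic weight update of eqs. (12)-(13) can be sketched as below, together with the uniform first-iteration weights w_i^1 = 1/nc used by the method. The ε′ handling follows the rule quoted above; the small fallback for the degenerate case where both c̄_i^0 and Cb_i are zero is an assumption added here, not something stated in the paper.

```python
def mu_l(cbar_k, cb, cbar0, eps_p):
    """Eq. (12): membership in subset L ("far from the lower bound")."""
    if cb <= cbar_k <= cbar0 + eps_p:
        return (cbar_k - cb) / (cbar0 - cb + eps_p)
    return 1.0

def update_weights(cbar_k, cb, cbar0):
    """Eq. (13): w_i^k = mu_L_i / sum_j mu_L_j. Criteria whose averages
    sit far from their lower bound receive larger weights."""
    mus = []
    for ck, b, c0 in zip(cbar_k, cb, cbar0):
        eps_p = 0.1 * b if c0 == b else 0.0
        if c0 - b + eps_p == 0.0:
            eps_p = 0.1  # assumed fallback to avoid division by zero
        mus.append(mu_l(ck, b, c0, eps_p))
    s = sum(mus)
    return [m / s for m in mus]

def initial_weights(nc):
    """Uniform weights at the first iteration, w_i^1 = 1/nc."""
    return [1.0 / nc] * nc
```

The normalization in `update_weights` keeps the weights summing to one at every iteration, so eq. (10) always aggregates over a convex combination of the criteria.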
w_i^1 corresponds to the first iteration, defined as follows:

  w_i^1 = 1/nc, ∀i = 1, ..., nc   (14)

The various weighting vectors (W^1, W^2, ..., W^q) are gradually calculated from the k-th generation Pk to the generation Pk+1, according to the distance between the lower bounds and the average of the k-th generation individuals, represented by a black circle in Figure 4. The objective is to improve the solutions by giving priority to the optimization of the objective functions whose average values are far from the lower bound. Indeed, by using a fuzzy rule, it is possible to control the search direction in order to build a final set of solutions approaching the optimal values as much as possible. This method can be used when the decision maker cannot give a particular preference to an objective function; it also makes it possible to generate the weights of the different criteria from one iteration to another in a dynamic way, according to the average of the solutions.

4 Simulation

To illustrate the effectiveness and performance of the proposed approach, six representative examples based on practical data have been selected for computation. These examples deal with 5 to 10 operations. The proposed approach is applied to them to optimize the three criteria of eqs. (1)-(3). For example, the data relating to the examples with 10 operations and with 5 operations are represented in Table 6 and Table 7, respectively. By application of the proposed approach, the experimental results of Table 8 are obtained. The different results show that the solutions obtained are generally acceptable and satisfactory. The values of the different objective functions show the efficiency of the suggested approach, Table 8. Moreover, the proposed method enables us to obtain good results in polynomial computation time. In fact, the various values of the criteria given by the multiobjective optimization method by Pareto-optimality show its effectiveness, Table 8.
The values of the criteria on the Pareto border are in the neighbourhood of the lower bounds. Indeed, such an approach makes it possible to generate Pareto-optimal solutions of good quality.

Figure 4: Search direction (solutions of the populations P1, ..., Pk, ..., Pq, the lower bound (Cb_1, Cb_2) and the barycentre of population Pk in the (C1, C2) plane, with weight vectors W^1, ..., W^{k+1}).

Table 6: Data relating to 10 operations

            O1  O2  O3  O4  O5  O6  O7  O8  O9  O10
  r_k        0   1   2   3   4   1   3   2   1   3
  p_k        1   2   4   2   1   2   1   3   2   4
  v_i1      13  14  13  13  12   7   7  13   9   9
  v_i2      15  14   5  14  13  15  15  16  15  15
  v_i3      12  13  12  11  14  14  12  14  14
  prev_i1    2   3   4   3   2   1   1   2   2   2
  prev_i2    1   2   2   4   3   2   2   1   4   1
  prev_i3    4   3   2   2   1   3   2   3   3
  dv_Pk     35  32  35  33  35  36  31  34  36  31
  dr_Pk     14  10   9  11   8  12   7   9  11  10
  dliv_Pk   21  22  25  22  21  20  21  24  26  22
  cstk_Pk    3   2   2   5   2   3   4   3   1   2
  pven_Pk    4   6   6   8   7   5   6   8   3   5

Table 7: Data relating to 5 operations

            O1  O2  O3  O4  O5
  r_k        2   3   1   4   3
  p_k        1   2   4   2   3
  v_i1       3   3   4   6   5
  v_i2       4   1   2   4   3
  v_i3       2   3   4
  prev_i1    1   2   1   2   4
  prev_i2    3   1   2   4   2
  prev_i3    4   3   1
  dv_Pk     14  16  10  11  14
  dr_Pk      5   6   4   7   5
  dliv_Pk   10  13  14  16  12
  cstk_Pk    1   2   3   4   5
  pven_Pk    8   3   8   5   4

Table 8: Experimental results

   N   Scheduling                       C1  C2  C3  Cg(.)
  10   O1 O3 O5 O9 O4 O2 O7 O6 O8 O10  14   4  24  0.915
   9   O1 O2 O3 O4 O5 O7 O6 O8 O9       9   4  20  0.95
   8   O1 O7 O5 O2 O3 O8 O4 O6         14   1  18  0.963
   7   O1 O3 O6 O7 O4 O2 O5            12   9  17  0.752
   6   O1 O4 O3 O5 O2 O6                4  10  14  0.977
   5   O1 O4 O5 O3 O2                  12  10  14  0.53

5 Conclusion

A new approach, based on hybridization with Pareto-optimality, for solving multiobjective problems in an agro-alimentary workshop has been presented. The approach developed in this work provides the possibility to determine an optimal schedule among several feasible ones; this optimal solution minimizes the objective function (10). Besides, the proposed approach uses Pareto dominance to estimate and to classify the obtained decisions.
Indeed, we can avoid the preemption of certain components, the cost of the out-of-date products, the cost of the distribution discount and the completion time (makespan). The proposed hybrid approach can be considered effective from the standpoint of computational complexity.

References

[1] S. Cavalieri, P. Gaiardelli, Hybrid genetic algorithms for a multiple-objective scheduling problem, Journal of Intelligent Manufacturing, Vol. 9, pp. 361-367, 1998.
[2] Y. Collette, P. Siarry, Optimisation multiobjectif, Editions Eyrolles, Paris, 2002.
[3] L. Davis, Job shop scheduling with genetic algorithm, Proceedings of the First International Conference on Genetic Algorithms, Lawrence Erlbaum Associates, pp. 136-140, 1985.
[4] F. Della Croce, R. Tadei, G. Volta, A genetic algorithm for the job shop problem, Computers and Operations Research, Vol. 22, No. 1, pp. 15-24, 1995.
[5] U. Dorndorf, E. Pesch, Evolution based learning in a job shop environment, Computers and Operations Research, Vol. 22, pp. 25-40, 1995.
[6] E. Gargouri, S. Hammadi, A distributed scheduling for agro-food manufacturing problems, IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, Vol. 33, No. 2, 2003.
[7] E. Gargouri, S. Hammadi, P. Borne, New constraints of agro-food industry scheduling problem, Proc. of IFDICON'2001, European Workshop on Intelligent Forecasting, Diagnostic and Control, June 24-28, pp. 73-80, Santorini, 2001.
[8] A. K. Jain, H. A. El Maraghy, Single process plan scheduling with genetic algorithm, Production Planning and Control, Vol. 8, No. 4, pp. 363-376, 1997.
[9] I. Kacem, S. Hammadi, P. Borne, Pareto-optimality approach for flexible job-shop scheduling problems: hybridization of evolutionary algorithms and fuzzy logic, Mathematics and Computers in Simulation, 60, pp. 245-276, 2002.
[10] K.
Mesghouni, Application des algorithmes évolutionnistes dans les problèmes d'optimisation en ordonnancement de la production, PhD thesis, Université des Sciences et Technologies de Lille, Lille, 1999.
[11] M. Mori, C. Tseng, Genetic algorithms for multimode resource constrained project scheduling problem, European Journal of Operational Research, Vol. 100, No. 1, pp. 134-141, 1997.
[12] R. Nakano, T. Yamada, Conventional genetic algorithm for job shop problems, Proceedings of the 4th International Conference on Genetic Algorithms, University of California, pp. 474-479, 1991.
[13] I. Saad, M. Benrejeb, Optimisation multicritère par Pareto-optimale des problèmes d'ordonnancement en tenant compte du coût de production, Revue Sciences et Technologies de l'Automatique, e-STA, Vol. 3, No. 1, 2006.
[14] M. Sakawa, Genetic Algorithms and Fuzzy Multiobjective Optimization, Dordrecht: Kluwer Academic.
[15] E. G. Talbi, Métaheuristiques pour l'optimisation combinatoire multiobjectif, Tutorial, Journées Évolutionnaires Trimestrielles, Paris, 1999.
[16] F. Tangour, S. Hammadi, P. Borne, M. Benrejeb, Ordonnancement dynamique dans un atelier de production agroalimentaire, Séminaire d'Automatique-Industrie, SAI'06, Matmata, 2006.
[17] R. R. Yager, On weighted median aggregation operators in multicriteria decision making, IEEE Transactions on Systems, Man and Cybernetics, 18, pp. 183-190, 1988.
[18] E. Zitzler, L. Thiele, Multi-objective evolutionary algorithms: a comparative case study and the strength Pareto approach, IEEE Transactions on Evolutionary Computation, Vol. 3, No. 4, pp. 257-271, November 1999.
Fatma Tangour(1,2), Ihsen Saad(1,2)
(1) École Nationale d'Ingénieurs de Tunis, Unité de Recherche LARA-Automatique, BP 37, Le Belvédère, 1002 Tunis, Tunisie
(2) École Centrale de Lille, Cité Scientifique, Laboratoire d'Automatique, Génie Informatique et Signal, BP 48, 59651 Villeneuve d'Ascq Cedex, France
E-mail: {fatma.tangour, ihsen.saad}@enit.rnu.tn

Editor's note about the authors:

Fatma Tangour was born in 1969 in Nabeul, Tunisia. She graduated from the École Normale Supérieure de l'Enseignement Technique and obtained a master's degree in automatic control and signal processing in 2004 at the École Nationale d'Ingénieurs de Tunis. She is currently preparing a Ph.D. degree in automatic control and computer science within the framework of the LAGIS-EC-Lille and LARA-ENIT cooperation. Her research interests are in the area of optimization methods for discrete event systems, computer science and operational research.

Ihsen Saad was born in Monastir, Tunisia, in 1977. He received the engineering diploma in electrical automatic control engineering from the École Nationale d'Ingénieurs de Gabès, Tunisia, in 2002. He obtained a master's degree in automatic control and signal processing in 2004 at the École Nationale d'Ingénieurs de Tunis. He is currently preparing a Ph.D. degree in automatic control and computer science within the framework of the LAGIS-EC-Lille and LARA-ENIT cooperation. His research is related to evolutionary optimization methods for discrete event systems, computer science and operational research.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. VI (2011), No. 1 (March), pp. 166-174

Human Intervention and Interface Design in Automation Systems

P. Ponsa, R. Vilanova, B. Amante

Pere Ponsa (corresponding author)
Automatic Control Department, Technical University of Catalonia
EPSEVG, Av.
Víctor Balaguer, 08800 Vilanova i la Geltrú, Barcelona, Spain
E-mail: pedro.ponsa@upc.edu

Ramón Vilanova
Departament de Telecomunicació i Enginyeria de Sistemes, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
E-mail: ramon.vilanova@uab.cat

Beatriz Amante
Project Department, Technical University of Catalonia
ETSEIAT, Edifici TR5, C. Colom 11, 08222 Barcelona, Spain
E-mail: beatriz.amante@upc.edu

Abstract: Human-machine interfaces are without doubt one of the constitutive parts of an automation system. However, it is only recently that they have received appropriate attention, as greater concern has been gained for aspects related to maintenance, safety, operator awareness, etc. Even though there are software solutions on the market that allow for the design of efficient and complex interaction systems, a rational design of the overall interface system is not widespread, especially for large-scale systems where the monitoring and supervision systems may include hundreds of interfacing screens. It is in this respect that this communication provides an example of such a development, also showing how to include the automation-level operational modes in the interfacing system. Another important aspect is how the human operator can enter the control loop in different ways; such interaction needs to be considered as an integral part of the automation procedure, as does the communication of the automation device. In this paper the application of design and operational-mode guidelines in automation is presented within an educational flexible manufacturing system case study.
Keywords: human-automation interaction, process control, display design.

1 Introduction

Automation can refer to open-loop operation on the environment or closed-loop control.
The human intervention adopts diverse possibilities: human in the loop (human intervention), human out of the loop (controller intervention) and human on the loop (supervisory control over the controlled process). Basic automation replaces human manual control by an automatic controller; however, in highly automated systems human beings are necessary for supervision, adjustment, maintenance, expansion and improvement. Automation increases complexity. It is difficult to maintain operational skills in an automated environment in the presence of an abnormal situation when the operator is required to intervene. The complexity of industrial human process supervision makes it necessary to supplement the human-computer interaction approach with cross-disciplinary cooperation in order to integrate knowledge and methods from other fields, especially automation and artificial intelligence [1]. Our view is that complete control systems engineering must encompass all these approaches. This increasing complexity of production systems has also translated to the automation level. The need to handle larger amounts of information and the capability to interact with other subsystems of the production chain require the application of appropriate modelling methodologies. In that respect it is worth mentioning that, on the academic side, the authors have developed different tools to tackle such problems. What is proposed in this paper, and presented through its application to a laboratory-scale flexible manufacturing system (FMS), is a complete integrated approach for the design of the HMI (human-machine interface) system. The development follows a top-down approach where the different screens that constitute the overall system are conceived, and particular methods are used to ensure appropriate ergonomic usage within each one of the designed interfaces.
One of the most interesting points is the introduction, at the automation level, of considerations related to the different start and stop modes of the process units. The inclusion of such considerations at HMI level will help in achieving a more solid and helpful interaction system in its relation with safety and maintenance operations. The structure of the paper is as follows. The second section is a brief introduction to the computer architecture, the plant layout of the flexible manufacturing system, and human-systems interaction. In the next section, the translation from the physical layout to the global design of the HMI is proposed: the human can use the HMI inside a control room or near the machine/process. Section four explains the relationship between automatic control and human intervention in manual mode. The paper ends by drawing some concluding remarks.

2 Human Role in the Flexible Manufacturing System

The application of the proposed integrated approach for HMI design is performed within a concrete educational frame based on two subjects from the automatics and electronics engineering degree program: Modelling and Simulating Systems and Integrated Production Systems, at the Technical University of Catalonia (UPC), Spain, and Automatic Control and Industrial Informatics at the Universitat Autònoma de Barcelona (UAB), Spain. The subjects from the technical engineering program put a special emphasis on technical topics of programmable logic controllers (PLC) and industrial handling robots. In the remaining subjects of the engineering program, Petri net modelling, simulation by means of the Arena® discrete-event simulation software [2], physical distribution of flexible manufacturing systems, and production system modelling and simulation are the main topics.
Here, we focus on training the students in system behaviour analysis and simulation software so that they will be able to apply all these techniques in their professional life. This software allows flow-shop, job-shop and flexible manufacturing system simulation [3]. Practical exercises in the laboratory complement the theory and, for this purpose, a laboratory-scale FMS has been built, configured and kept operative during the last two years, which allows the comparison between simulated models and real plant performance. This paper does not describe further technical details of the role-playing methodology and the project-based learning approach; readers interested in more detail about these techniques are directed to prior work by Ponsa et al. [4]. Another step in the foreseeable future is the integration of supervision tools (supervisory control and data acquisition, SCADA, software), management tools (manufacturing execution systems, MES, and enterprise resource planning, ERP) as well as production planning tools (planning policy analysis). In this sense it would also be advisable to add a new subject on automated production management to the study program, integrating MES and ERP systems at the highest level of the computer integrated manufacturing (CIM) pyramid. Figure 1 shows the physical distribution of stations in the above-mentioned educational FMS, the computer architecture and the human intervention in the FMS system: at the top level, human-computer interaction and the human in the supervisory task; at the bottom level, human-machine interaction and the human in the manual-mode task. The FMS is composed of electropneumatic units controlled by PLCs and PCs.
The main purpose here consists of emulating current manufacturing systems: object mechanization and supply, transfer, product assembly, quality control, checking and storage; technologies such as pneumatics, robotics, PLCs, monitoring and production supervision come together. A total of 5 stations constitute the FMS.

Figure 1: Human intervention in the flexible manufacturing system: high level (supervisory task), low level (machine operation).

The cell manufactures two types of products: product 1 goes across stations 1, 2, 3 and 4 (palletization); product 2 goes from station 4 (assembly) to 3 and comes back to station 4 (palletization). The FMS system has product and process flexibility. Product 1 has 9 different variations (size, colour); product 2 has 8 different variations (size, colour). The FMS can produce 17 different products. From the point of view of process flexibility, the system can produce only product 1, only product 2, or both; in this last case station 4 performs two tasks (assembly and palletization) but has only one robot, so production planning policy is important in order to prevent an excessive workload in this station. In contrast with other educational manufacturing systems, which use pallets as functional trays for the same purposes, piece transport is carried out directly, item by item, over the conveyor system from one station to the next. At cell control level, we make use of different programmable logic controllers (PLCs), such as the CPM2A and CJ1M models from Omron; industrial communication protocols (RS-232, RS-422, PC-Link, DeviceNet, Ethernet); and a PLC industrial network with S7-314 from Siemens and communication protocols (RS-485,
regarding the programming software which links the local control level with monitoring and supervision, we are testing products such as the intouch software and the system platform technology from wonderware [5]. in the future, the authors want to increase the number of stations and add an as/rs storage station after station 4 [6].

3 human machine interface design

the structure of the hmi application is a distributed interface with six parts: the main application, plus one application for each station [7]. the main application has information about the behavior of the flexible manufacturing system (type of items, petri net algorithm, coordination between stations, maintenance, and safety). the programmable controllers of each station have devicenet modules and send this information to the master plc. each station has an autonomous mode of functioning and an integrated mode of functioning. in the first mode, the global design of the hmi is formed by a set of five autonomous hmi applications. in the near future, one of the problems to solve is the connection between the single hmi applications so that the whole fms works in integrated mode. the hmi application for station 1 has 17 screens, while the global hmi design has more than 100 screens. control engineering students have been working on the development of the hmi application for each station. in this sense, expert students (who today are working in air traffic control) act as tutors of the novice students. through the use of project-based learning and cooperative learning, the students have formed focus groups in order to define a clear and global structure for all the hmi applications. figure 2: station_i display design. example of a cyclic menu structure, from the main screen to the monitoring and supervisory control screens. in order to achieve an effective hmi application, some display design guidelines need to be used.
here, the ergonomic display design guideline called gedis has been employed [8]. the gedis guide is a method that seeks to cover all the aspects of interface design, starting from strategies for effective human-computer interaction applied to supervision tasks in an industrial control room [9]. the gedis guide offers design recommendations at the time of creating the interface, as well as recommendations for the improvement of already created interfaces. the gedis guide is composed of two parts: the description of ten indicators and the measurement of those ten indicators. the indicators have been defined from concepts extracted from other generic human factors guidelines and from aspects of interface design in human-computer interaction. the procedure for using the gedis guide is: analyze each indicator, measure it, obtain the global evaluation index and finally offer recommendations for improvement. for the correct use of the gedis guide, collaboration with human factors technicians is necessary, since in some cases an expert's opinion is needed to analyze an indicator. this paper does not describe further details of gedis; readers interested in more detail about this guideline are directed to the prior work of ponsa and díaz [10]. for each station we have developed an interface with the same template. the architecture of this interface has two parts: monitoring and supervisory control (see fig. 2). for monitoring, a set of screens (graphical layout of the station, automatic control, communication with the programmable controller plc) has been defined. the aim of this design is to consider the automatic control loop and the human control. in human supervisory control, a set of high level screens (start and stop modes guideline, alarm system, fault detection, diagnostic) has been defined. the aim of this design is to consider the supervisory control loop, a high level over the automatic control loop.
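as a rough sketch of the "measure the indicators, then obtain the global evaluation index" step, a weighted average over ten indicator scores could look as follows; the indicator names, the 0-5 scale and the uniform default weights are our own assumptions for illustration, not the actual gedis definitions, which are given in [8] and [10]:

```python
# hedged sketch of a gedis-style global evaluation index (assumed scoring scheme)
INDICATORS = [  # hypothetical indicator names, for illustration only
    "architecture", "distribution", "navigation", "color", "text-font",
    "device-status", "process-values", "graphs-tables", "data-entry", "alarms",
]

def global_index(scores, weights=None):
    """Weighted average of per-indicator scores (assumed 0..5 scale)."""
    if weights is None:
        weights = {name: 1.0 for name in scores}   # uniform weights by default
    total_w = sum(weights[n] for n in scores)
    return sum(scores[n] * weights[n] for n in scores) / total_w

# e.g. every indicator rated 4 out of 5 by the human factors expert
scores = {name: 4.0 for name in INDICATORS}
print(global_index(scores))   # 4.0
```

a low index would then trigger the "recommendations for improvement" step of the guide.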
often, the monitoring interface is associated with the activities of the human operator near the plant or the industrial machine (automation level). this task is related to the allocation of physical interface functions and human sensory-motor functions (choosing manual or automatic control, activating or deactivating devices, using the teach pendant of the robot, etc.). with the object-animation properties of the scada it is possible to develop a screen with the devices of the station (pneumatic actuators, motors, conveyors, sensors, etc.). in this interface, knowledge of the behavior of the devices is important. in this sense, the monitoring interface of station 4 deserves attention: it is a complex interface because it is necessary to develop an animation screen with the activities of the industrial robot (palletization or handling). another important screen within the monitoring interface is the panel screen, where the human operator can select manual mode or automatic mode. this screen is the one that permits a clear interaction between the human and the automatic controller plc. on the other hand, the supervisory control interface is associated with the activities of the human operator inside the industrial control room (supervisory control level). the human supervisory task is related to the allocation of interface functions and human cognitive functions (information processing, planning). with respect to the interface, this part develops a set of screens: historical trends, alarm screen, and fault detection and diagnosis screen. it is necessary to apply a general framework to define the correct interface representation of a risky situation. however, each station has specific devices, so a specific framework is necessary in order to define a possible anomaly of the station.
4 human intervention in the automation cycle

petri nets (pn) have proved to be a successful approach over a broad range of applications [11], [12], [13]. there is, however, one point that it is not clear how to deal with using the pn formalism: the introduction of the human operator. effectively, in every automation problem the fully automated part is just one part of the solution. it is customary that the operator can enter the loop in different ways, for example by tuning a pid controller [14], and such interaction needs to be considered as an integral part of the automation procedure, as does the communication of the automation device (usually a plc) with the operator. a human machine interface (a pc display, an industrial panel, etc.) provides the connection between the human operator action and the input to the control algorithm inside the controller device (pc-based or plc-based control). figure 3: the human operator can intervene inside the automation cycle (grafcet transition) with the use of an industrial panel; in this panel the human operator follows the functional operational modes of the gemma guide. on the industrial side, however, approaches other than pn are currently in use. effectively, even though pn allow tackling really large and complex problems, other approaches like sequential function charts (sfc) or the state-transition graph (grafcet) can be considered the primary automation design tool found in industry. in fact, some plcs allow direct programming using grafcet [15]. therefore it seems there is a gap between both of these approaches. what we would like to point out here is that both disciplines should be combined and used. the important point with grafcet (a simpler approach compared with pn) is that it has a close and clear connection with the design guide for start and stop modes called gemma [16].
even though the gemma guide came into use twenty-eight years ago and has been introduced to engineering students in several places, in our opinion insufficient attention has been paid to it. this is the reason this section focuses on the advantages of using the gemma guide and on using it inside the hmi application explained in the previous section. although grafcet is very useful in describing the detailed operation of a sequential control performed by a plc, it does not provide a general approach for the operation of an automated machine. it is still necessary to define general operational modes and conditions. this is usually done at the specifications definition stage, and gemma is a recommended tool for this task [17]. on the other hand, it has always been difficult, in terms of vocabulary, to clearly and precisely explain how to start with a manual mode, a semi-automatic mode or an automatic mode. also, what are the consequences of an emergency stop, a safety stop or a fast stop for a production machine? usually, these questions are answered by each designer in relation to his own experience and knowledge, but can seldom be related to a systematic analysis, except in large companies which have been able to define design guides and standards for control and instrumentation. from the previous observations, the gemma guide is a valuable tool to introduce automation in a more general setting than only the automation of the process production cycle. it is really important to understand the role played by the operator and how he interacts with the automated system [18]. the different operation modes need to be established and interrelated in a clear and well defined way. for example, in normal conditions the automation cycle is represented by three gemma modes: from a1 (initial stop state), to f1 (normal production), then a2 (requested stop at the end of the cycle) and finally back to a1.
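the normal production cycle just described, together with the emergency cycle, can be sketched as a small mode state machine; the transition table below is a minimal illustration built only from the gemma modes named in the text (the event names are our own assumption), not a complete gemma implementation:

```python
# minimal sketch of gemma operational modes as a state machine (illustrative only)
TRANSITIONS = {
    # normal cycle: a1 -> f1 -> a2 -> a1
    ("a1", "start"):          "f1",  # initial stop state -> normal production
    ("f1", "request_stop"):   "a2",  # requested stop at the end of the cycle
    ("a2", "cycle_done"):     "a1",
    # emergency cycle: f1 -> d1 -> a5 -> a6 -> a1
    ("f1", "emergency_stop"): "d1",  # emergency stop
    ("d1", "ack"):            "a5",  # preparation to restart after a failure
    ("a5", "reset"):          "a6",  # production reset to the initial state
    ("a6", "done"):           "a1",
}

def run(mode, events):
    """Apply a sequence of operator/plant events to the current gemma mode."""
    for ev in events:
        mode = TRANSITIONS[(mode, ev)]   # raises KeyError on an illegal transition
    return mode

# normal production cycle returns to the initial stop state
print(run("a1", ["start", "request_stop", "cycle_done"]))               # a1
# emergency: stop, prepare restart, reset, and back to a1
print(run("a1", ["start", "emergency_stop", "ack", "reset", "done"]))   # a1
```

keeping the transitions in one table is what makes it easy to display the active mode on the hmi screen.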
in an abnormal situation the automation cycle includes the management of an emergency: from a1 (initial stop state), to f1 (normal production), d1 (emergency stop), a5 (preparation to restart after a failure), a6 (production reset to the initial state) and finally a1 again. the main contribution of this section is to advance one step towards an integral conception of the automation process: the design of the automation system has to include considerations on operational modes, and these have to be reflected in the human interface. with this aim, fig. 4 shows the gemma graphical representation inside the hmi application. figure 4: gemma graphical representation inside the hmi application; the human operator can observe the changes between production, stop or failure procedures. whether the system is in a normal state or in an anomaly, the human operator can see the active mode on screen, which facilitates situation awareness. in the hmi application there is a clear relationship between the maintenance and safety services of the system. inside the plc, an emergency grafcet has been designed with instructions in case of emergency. the maintenance recommendations are used in order to reduce the stops of the station and of the production. the translation of the gemma guide inside the plc requires the use of the grafcet representation. figure 5: when the emergency stop is activated, the emergency grafcet forces the stop of the other grafcets; in the affected grafcet we have automatic control or human manual control; the production grafcet is the set of machine operations. when a problem appears in station 3, for example, the human operator can activate the emergency stop: at this moment stations 1, 2 and 3 stop their production, while station 4 and the transport unit keep functioning. when the human operator solves the problem, he can proceed to activate the reset of the system, drive the system to a safe state and begin production again in stations 1, 2 and 3.

5 conclusions

in this work we raised the need for a top-down integrated design of human automation systems, with special emphasis on the use of appropriate design and operational guidelines (gedis, gemma). different actuation levels are first identified, going from the flexible manufacturing system level to the automation level at each station. while the interfacing system at the station or global level has to cover, mostly, high level monitoring actions, it is at the automation level where more complex situations may arise. at this point the authors propose the use of the start and stop modes guide (gemma guide) to build up a convenient representation of the operational stages of the production machines. by designing the automation level hmi along these lines, its integration with the automation control system becomes almost natural. the gemma guide approach is therefore a recommended taxonomy for the introduction of the human operator into the automation cycle in complex academic/industrial domains. it provides a natural relationship between the design of the automation system and the operational modes that are to be considered from the industrial panel point of view.

acknowledgment

this work has received financial support from an internal grant of the technical university of catalonia (project: human centred design in human supervisory control, 2008) and from the spanish cicyt program under grant dpi2007.63356.

bibliography

[1] t. sheridan and r. parasuraman, "human-automation interaction," reviews of human factors and ergonomics, vol. 1, pp. 89-129, 2006.
[2] rockwell, "arena simulation software," url: http://www.arenasimulation.com, 2009.
[3] h. boukef, m. benrejeb, and p. borne, "a proposed genetic algorithm coding for flow-shop scheduling problems," international journal of computers, communications and control, no. 3, pp. 229-240, 2007.
[4] p. ponsa, b. amante, j. roman, s. oliver, m. díaz, and j. vives, "higher education challenges: introduction of active methodologies in engineering curricula," international journal of engineering education, vol. 25, no. 4, pp. 799-813, 2009.
[5] wonderware, "wonderware intouch hmi," url: http://global.wonderware.com, 2009.
[6] f. véliz and g. lefranc, "improvement and extension of virtual reality for flexible systems of manufacture," international journal of computers, communications and control, no. 2, pp. 95-101, 2006.
[7] p. ponsa, r. vilanova, and b. amante, "towards integral human-machine system conception: from automation design to usability concerns," hsi 2009, 2nd international conference on human system interaction, catania, italy, 2009.
[8] p. ponsa, r. vilanova, m. díaz, and a. gomŕ, "application of a guideline for design improvement in supervisory control," international journal on human-computer interaction, vol. 1, pp. 21-36, 2007.
[9] b. shneiderman, designing the user interface: strategies for effective human-computer interaction, addison-wesley, third edition, 1997.
[10] p. ponsa and m. díaz, "application of an ergonomic guideline for sugar mill control room interface design," faim, 17th international conference on factory automation and intelligent manufacturing, philadelphia, usa, 2007.
[11] k. kurihara, s. takigawa, n. nishiuchi, and m. kitaoka, "factory automation control software designing method based on petri nets," international journal of production research, vol. 40, no. 15, pp. 3605-3625, 2002.
[12] e. gutuleac, "descriptive time membrane petri nets for modeling of parallel computing," international journal of computers, communications and control, no. 3, pp. 33-39, 2006.
[13] f. córdova, l. canete, l. quezada, and f. yanine, "an intelligent supervising system for the operation of an underground mine," international journal of computers, communications and control, no. 4, pp. 259-269, 2008.
[14] r. vilanova and o. arrieta, "balanced pid tuning application to series cascade control systems," international journal of computers, communications and control, no. 4, pp. 521-525, 2008.
[15] afcet-adepa, "le grafcet," second edition, toulouse, cépaduès, 1995.
[16] adepa, "le gemma. guide d'etude des modes de marches et d'arrets," montrouge, adepa, 1981.
[17] p. ponsa and r. vilanova, automatización de procesos mediante la guía gemma, edicions upc, no. 105, 2005.
[18] k. li, s. thompson, p. wieringa, and j. xeng, "a study on the role of human operators in supervisory automation systems and its implications," proceedings of the 4th world congress on intelligent control and automation, pp. 3288-3293, 2002.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 3, pp. 336-350

node availability for distributed systems considering processor and ram utilization for load balancing

a. menendez lc, h. benitez-perez

antonio menendez leonel de cervantes, hector benitez perez
universidad nacional autonoma de mexico
instituto de investigaciones en matematicas aplicadas y sistemas
departamento de ingenieria de sistemas computacionales y automatizacion
ciudad universitaria, mexico d.f.
e-mail: toniomlc@gmail.com, hector@uxdea4.iimas.unam.mx

abstract: node-availability is a new metric that, based on processor utilization, free ram and the number of processes queued at a node, compares the different workload levels of the nodes participating in a distributed system. dynamic scheduling and load-balancing in distributed systems can be achieved using the node-availability metric as the decision criterion, even without previously knowing the execution time of the processes, or other information about them such as process communication requirements.
this paper also presents a case study which shows that the metric is feasible to implement in conjunction with a dynamic load-balancing algorithm, obtaining an acceptable performance. keywords: node-availability, load balancing, distributed systems, high-performance.

1 introduction

load-balancing is a technique often used to distribute computational load among processors or other resources in order to get a better performance (i.e. optimal resource utilization and small processing time). when performing load-balancing (lb) for a distributed system (ds), the resources (particularly the processors) are expected to be evenly used, thereby obtaining a general increase in system performance [4]. several studies have been carried out in terms of performance [10], task allocation [8], communication media [5], dynamic networking [1], mobile performance [5] and so on. however, these strategies depend on previous measures, such as the execution time of a process or its communication requirements, or on standard metrics (e.g. the number of processes queued at a node or the processor idle time percentage) where availability (i.e. the capacity of a node to process a job) is not observed. in any case, the need to measure the performance of a ds with similar and almost periodic processes is not directly addressed. in this paper a new metric named node-availability is introduced; it takes advantage of existing metrics, such as processor and ram utilization, the number of processes queued at a node and process communications, and composes them to create the new metric. by including several existing metrics in its calculation, node-availability provides more information about a node than any of the existing metrics alone.
it compares the different workload levels at two or more nodes participating in a distributed system, providing a decision criterion to be implemented in conjunction with a common workload algorithm. dynamic scheduling and load-balancing in distributed systems is achieved through the node-availability metric, even without previously knowing the execution time of the processes or other information about them, such as process communication requirements. the objective of this paper is to present the node-availability metric [9], how it is constructed and how it allows a ds to execute a set of processes in a balanced manner, obtaining a fair utilization of the overall system. one of the advantages of using the node-availability metric resides in its ability to perform the load-balancing of a ds without previously knowing the execution times of the processes involved; if the processing times were known, the scheduling and execution of the processes could be done using proven algorithms [3], [6]. the rest of this document is organized as follows: the metric is described in section 2, the lb algorithm used is presented in section 3, a case study is in section 4, and the conclusions and future work are in section 5.

2 node-availability

the execution time of a process in a ds is a function determined by the complexity of the process, the communication strategy and the resources available within the ds. since this execution time cannot be easily observed when a ds implementation is performed, another strategy needs to be followed. for instance, secondary measurements, such as the used memory at each node or the communication load amongst processors and processes, can be followed. the decision of which is the most suitable measure depends entirely on the implementing resources.
considering that a metric is a quantitative and periodic measurement interpreted in the context of a series of previous equivalent measurements, the metric to estimate node availability is presented, first from the node resources point of view (2.1), followed by the task load (2.2) and the communication costs (2.3).

2.1 availability

one of the most used metrics in terms of distributed computing is availability, defined as the capacity of a node to process a job at a specific time; it can be obtained from several secondary measures, like the time consumed by each node (processor) or the communication performance per process. a ds can be considered as a set of nodes communicating with each other through a network, where a node is defined as an autonomous processing unit consisting of one processor and ram (random access memory). the processes executed in a ds generally demand the processor and/or memory and are not characterized by a high input-output demand, so common measurements within nodes are the processor idle time or the percentage of free memory available. when a process demands a memory space larger than the physical ram, the operating system provides virtual memory to it, causing the total execution time of the process to increase. the percentage of processor and memory available (figure 1) during a time sample makes it possible to see how the operating system (os) behaves in terms of resource allocation to a process. the os tries to allocate all the (ram) memory that a process demands; it also tries to assign the processor, all the possible time, to the process being executed. figure 1 shows a process execution which takes about 70 seconds; during this time, the operating system assigns the processor to it, resulting in a 0% processor availability. on the other hand, the ram demand is lower than the capacity, so its availability is between 60% and 100%. figure 2
shows a process demanding an amount of memory larger than the physical memory (ram) of the node; the operating system (os) assigns to the process all the available ram and then provides virtual memory to fulfill the memory demand. as can be seen in figure 2, the percentage of memory available during the execution of the process is 0%, while the processor availability oscillates at the beginning and towards the end of the process execution. figures 1 and 2 show the same process executed by the same node. it takes longer to execute when it demands the use of virtual memory (approx. 10 seconds longer, figure 2) than when it is restricted to using ram (approx. 70 seconds, figure 1). figure 1: memory demand within the limits of ram. figure 2: demanding virtual memory. the first step in our proposal is to determine the availability of a node in terms of its idle processor time and its free ram. the highest values for these metrics indicate the most available node and the lowest values indicate the busiest node. these two metrics are multiplied for two reasons. the first one is that, since they are percentages, the values cannot be added, because the outcome would exceed 100%; being percentages, the outcome of their product is also a percentage. the second reason is that if the values are averaged instead of multiplied, it is possible to obtain unreal outcomes; for example, a node with 100% of idle processor time and 0% of available ram averages 50%, which in terms of availability should be 0%, because if any of the two resources (processor and memory) is not available, no process can be processed. with the proposed approach, on the other hand, the availability is real, since its value in this same example is 0%. so, as a first step, the availability of node i is calculated by:

ai = αi βi (1)

where i ∈ {1, 2, . . . , n} is the node identifier, αi is the idle processor time percentage of node i, and βi is the free ram percentage of node i. as αi and βi are percentages, they can be multiplied to find the availability of the node, giving a value between 0 and 1.

2.2 number of processes queued at a node

the availability of a node depends not only on its respective α and β values, but also on the number of processes queued, so equation (1) must reflect this situation, considering that the number of processes affects the availability of a node in an exponential way. so equation (1) becomes:

bi = αi βi / e^(εi) (2)

where εi is the number of processes queued at node i.

2.3 processes communications

from the communications point of view, a process can communicate with another process in the same node where it is being executed, with a process being executed in a different node, or have no communications at all. the availability of communication resources and the execution time are different in these three scenarios; therefore it is necessary to differentiate the processes queued at a node based on their communication requirements. so two new variables are introduced: κi is the number of processes queued at node i communicating (internally) with processes also queued at node i; and νi is the number of processes queued at node i communicating (externally) with processes queued at any node different from i. as stated previously, the execution time of a process depends (among other things) on its communication requirements; the communication time can be approximated in terms of the type of communication (i.e. internal or external) that a process performs. as each type of communication has a different time impact, two constants are defined: ρ is the internal communication time and τ is the external communication time.
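equations (1) and (2) are straightforward to compute; the sketch below follows the text's definitions (α and β as fractions in [0, 1], ε the queue length) and reproduces the multiply-versus-average argument:

```python
import math

def availability(alpha, beta):
    """Equation (1): a_i = alpha_i * beta_i, with alpha, beta in [0, 1]."""
    return alpha * beta

def availability_with_queue(alpha, beta, eps):
    """Equation (2): b_i = alpha_i * beta_i / e^eps_i, eps_i = queued processes."""
    return alpha * beta / math.exp(eps)

# a node with a fully idle processor but no free ram is not available at all:
print(availability(1.0, 0.0))            # 0.0 -- the product, unlike the 0.5 average
# queued processes reduce availability exponentially:
print(availability_with_queue(0.8, 0.5, 2))
```

the example values (0.8, 0.5, 2 queued processes) are illustrative, not taken from the paper's case study.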
the equation of node-availability (equation 2) now becomes:

ci = αi βi / e^(εi + ρ κi + τ νi) (3)

from the process communication requirements point of view, all the nodes of the ds are different. a process tends to finish its execution earlier if it communicates with processes running in the same node, therefore depending mostly on that node's availability; if the processes are in different nodes, they depend not only on the availability of the different nodes, but also on the network speed and available bandwidth.

2.4 number of samples taken and periodicity

our proposal is to determine the availability of a node in terms of its idle processor time, its free ram memory, the number of processes queued at the node and their communications. these measures are taken periodically and, when δ samples have been read, the node availability is calculated as the arithmetic average of the readings. so the final equation of node-availability, or φ, is:

φi,j = (1/δ) ∑_{j=1}^{δ} αi,j βi,j / e^(εi,j + ρ κi,j + τ νi,j) (4)

where: i ∈ {1, 2, . . . , n} is the node identifier, j ∈ {1, 2, . . . , δ} is the sample number, αi,j is the idle processor time percentage of node i at the jth sample, βi,j is the free ram memory percentage of node i at the jth sample, εi,j is the number of processes queued at node i at the jth sample, κi,j is the number of processes queued at node i with internal communications at the jth sample, νi,j is the number of processes queued at node i with external communications at the jth sample, ρ is the internal communication time, τ is the external communication time and δ is the number of samples taken of αi,j, βi,j and εi,j before sending the data to the lb process.

2.5 optimization characteristics of φ

in this section we show that the metric φ presents a global minimum dependent on local values.
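before turning to the optimization, equation (4) can be sketched directly from the per-sample readings; the sample data and the ρ, τ values below are illustrative assumptions:

```python
import math

def phi(samples, rho, tau):
    """Equation (4): average of alpha*beta / e^(eps + rho*kappa + tau*nu)
    over delta samples; each sample is (alpha, beta, eps, kappa, nu)."""
    delta = len(samples)
    return sum(
        a * b / math.exp(e + rho * k + tau * v)
        for (a, b, e, k, v) in samples
    ) / delta

# delta = 3 illustrative samples for one node: (alpha, beta, eps, kappa, nu)
samples = [(0.9, 0.8, 1, 1, 0), (0.7, 0.6, 2, 1, 1), (0.8, 0.5, 1, 0, 1)]
print(phi(samples, rho=0.1, tau=0.15))   # a single availability value in (0, 1)
```

the lb process then compares these φ values across nodes to pick the most available one.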
the metric φ is characterized per node, where the discrete variables α, β, ε, κ and ν play the role of representing the system behavior based on the samples taken. ρ and τ are the communication time factors, which can be distinguished as bounded variables and delimited through a local linear optimization strategy. the time values of ρ and τ help to determine some network characteristics, such as the type of protocol or the required speed specifications. the first parameter to be defined is the load-balancing factor of the distributed system (ds), defined as:

µj = φmin,j / φmax,j (5)

where φmin,j is the node-availability value corresponding to the least available node and φmax,j is the value corresponding to the most available node within the ds at sample j. it can be noticed that, when φmin,j and φmax,j are similar, µj tends to one and the system is balanced; on the other hand, the ds is as unbalanced as µj tends to zero. based upon this approximation, the ds error at sample j can be defined as the ds level of unbalance ω, given by:

ωj = (1/2) (1 − µj)² (6)

to minimize this error it is necessary to balance the ds around the local loads, having that if µj → 1 then ωj → 0. in order to minimize this error, the use of the partial derivatives of this equation is pursued in terms of the communication times ρ and τ, as shown next:

∂ωj/∂ρmax,j , ∂ωj/∂ρmin,j , ∂ωj/∂τmax,j , ∂ωj/∂τmin,j (7)

where ρmax,j and τmax,j are the communication times at the most available node, and ρmin,j and τmin,j are the communication times at the least available node, both at sample j.
now, taking into account the related node and sample values, the global error ω is expressed as follows:

ωj = (1/2) (1 − φmin,j/φmax,j)² = (1/2) (1 − (αmin,j βmin,j)/(αmax,j βmax,j) · e^(εmax,j − εmin,j + ρmax,j κmax,j − ρmin,j κmin,j + τmax,j νmax,j − τmin,j νmin,j))² (8)

where max corresponds to the most available node, min corresponds to the least available node and j is the sample number. reordering this expression in terms of λ(1)j and λ(2)j, as follows:

λ(1)j = (αmin,j βmin,j)/(αmax,j βmax,j)
λ(2)j = εmax,j − εmin,j + ρmax,j κmax,j − ρmin,j κmin,j + τmax,j νmax,j − τmin,j νmin,j (9)

ω can be expressed as:

ωj = (1/2) (1 − λ(1)j e^(λ(2)j))² (10)

the partial derivatives are as follows:

∂ωj/∂ρmin,j = (1 − λ(1)j e^(λ(2)j)) (λ(1)j e^(λ(2)j)) (κmin,j)
∂ωj/∂ρmax,j = (1 − λ(1)j e^(λ(2)j)) (λ(1)j e^(λ(2)j)) (−κmax,j)
∂ωj/∂τmin,j = (1 − λ(1)j e^(λ(2)j)) (λ(1)j e^(λ(2)j)) (νmin,j)
∂ωj/∂τmax,j = (1 − λ(1)j e^(λ(2)j)) (λ(1)j e^(λ(2)j)) (−νmax,j) (11)

the communication times ρ and τ, defined in terms of the next sampling period (j+1), are expressed as:

ρmin,j+1 = ρmin,j + η ∂ωj/∂ρmin,j
ρmax,j+1 = ρmax,j + η ∂ωj/∂ρmax,j
τmin,j+1 = τmin,j + η ∂ωj/∂τmin,j
τmax,j+1 = τmax,j + η ∂ωj/∂τmax,j (12)

where η is a design factor with which the metric balances the performance of each node depending on the relations amongst α, β and ε.

2.6 metric optimization examples

in order to show the effectiveness of this technique, two examples are carried out in which the metric φ is evaluated without performing any load-balancing. the first example has a medium-profile processor utilization and is called the "relaxed example"; the second has a high-profile processor utilization and is named the "restrictive example".
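the update rule of equations (8)-(12) in the previous subsection amounts to a small gradient step on ρ and τ for the most and least available nodes; the sketch below follows those formulas, with the initial values and η chosen only for illustration:

```python
import math

def update(rho_min, rho_max, tau_min, tau_max, mins, maxs, eta):
    """One step of equations (8)-(12).
    mins/maxs are (alpha, beta, eps, kappa, nu) for the least/most available node."""
    a_min, b_min, e_min, k_min, v_min = mins
    a_max, b_max, e_max, k_max, v_max = maxs
    lam1 = (a_min * b_min) / (a_max * b_max)                        # eq. (9)
    lam2 = (e_max - e_min + rho_max * k_max - rho_min * k_min
            + tau_max * v_max - tau_min * v_min)
    mu = lam1 * math.exp(lam2)                                      # = phi_min / phi_max
    g = (1 - mu) * mu                                               # common factor in eq. (11)
    return (rho_min + eta * g * k_min,                              # eq. (12)
            rho_max + eta * g * (-k_max),
            tau_min + eta * g * v_min,
            tau_max + eta * g * (-v_max))

# one illustrative step starting from equal communication times
print(update(0.1, 0.1, 0.1, 0.1,
             mins=(0.3, 0.4, 3, 2, 1), maxs=(0.9, 0.8, 1, 2, 1), eta=0.05))
```

note the signs: the times at the least available node grow while those at the most available node shrink, pulling µ towards 1.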
This processor utilization is calculated with the well-known metric of processor utilization by a set of periodic tasks, the "processor utilization factor" [16]. In both examples a set of 40 periodic tasks is evenly distributed through 10 nodes (i.e. 4 tasks per node). The theoretical workload that a set of periodic tasks imposes on a processor can be calculated using equation (13), in which the "utilization" of a processor is a value under one:

U = Σ_{i=1}^{n} c_i / p_i    (13)

where U is the processor utilization, c_i the time consumed by task i, p_i the period of task i, and n the number of tasks. In both examples the purpose is to obtain the optimized values for ρ and τ, which are used in the next section, where the metric φ is implemented within a load-balancing algorithm. The setup for the two examples consists of 10 nodes with 4 periodic tasks per node; the periods and consumption times of the tasks differ between the two cases. Table 1 shows the parameters used for the relaxed example, and Table 3 gives the periods and consumption times for the tasks in the restrictive example. Notice that the first three tasks (i.e. tasks numbered 1 to 3) are identical in all the nodes; the difference in the workloads of the nodes resides in the fourth task, whose period is modified. As can be seen in Table 1, the fourth task at every node has a period equal to 8 times the node number (e.g. the period of task 4 at node 1 is 8, the period of task 4 at node 2 is 16, and so on). The number of samples taken during both examples is 1000, and every 8 samples (i.e. δ = 8) the value of φ is calculated using equation (4). The number of tasks per node with internal communications is 2, and one task has external communications.
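Equation (13) and the Table 1 task set can be checked directly; the computation below reproduces the utilization column of Table 2 (the helper names are illustrative):

```python
def utilization(tasks):
    # equation (13): U = sum of c_i / p_i over the node's tasks
    return sum(c / p for p, c in tasks)

def relaxed_node(node):
    # Table 1: three fixed tasks plus a fourth whose period is 8 * node number
    return [(8, 1), (16, 2), (32, 3), (8 * node, 1)]

utilizations = [utilization(relaxed_node(n)) for n in range(1, 11)]
# node 1 yields 1/8 + 2/16 + 3/32 + 1/8 = 0.46875, matching Table 2
```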
Table 1: Task parameters (relaxed example)

Task number | Period (p) | Consumption time (c)
1 | 8 | 1
2 | 16 | 2
3 | 32 | 3
4 | 8 times the node number | 1

The set of tasks assigned to each node according to Table 1 yields the processor utilization per node listed in Table 2. This example is called "relaxed" because the maximum utilization of a processor corresponds to node number 1 and is roughly 0.5 (as can be seen in Table 2). Following the optimization procedure of ρ and τ explained in the previous section, their final values (final ρ and final τ) after 1000 samples with δ = 8 are shown in Table 2.

A restrictive example is also presented, in which the processor utilization is between 70% and 81%, as can be seen in Table 4. The difference between the "relaxed" and "restrictive" examples resides in the duration of the periods of the first three tasks at every node: as shown in Table 3, these periods last half the time in the restrictive case, thus imposing a greater workload on the processors (Table 4). The number of samples taken is again 1000, with δ = 8. The number of tasks with internal communications is 2 and the number of tasks with external communications is 1.

Table 2: Processor utilization and final values of ρ and τ per node (relaxed example)

Node | Utilization factor | Final ρ time | Final τ time
1 | 0.4688 | 0.0095 | 0.0127
2 | 0.4062 | 0.0935 | 0.0673
3 | 0.3854 | 0.1033 | 0.1165
4 | 0.375 | 0.1081 | 0.1406
5 | 0.3688 | 0.111 | 0.1548
6 | 0.3646 | 0.1128 | 0.1642
7 | 0.3616 | 0.1142 | 0.1708
8 | 0.3594 | 0.1152 | 0.1758
9 | 0.3576 | 0.1159 | 0.1797
10 | 0.3562 | 0.1166 | 0.1828

As stated earlier, the main difference with the "relaxed" example resides in the "utilization factor" of all the nodes, which is around 0.75, as shown in Table 4.
Based upon these two examples, the final values of ρ and τ represent the communication characteristics, or guarantees, that the DS must provide in order to have a balanced system: when a node has more processor utilization, it needs to take less time in its communications.

Table 3: Task parameters (restrictive example)

Task number | Period (p) | Consumption time (c)
1 | 4 | 1
2 | 8 | 2
3 | 16 | 3
4 | 8 times the node number | 1

Even though the value of τ can be seen as a local parameter, it is more common to guarantee a global communication time for all the nodes participating in a network, so a unique communication time for the whole network must be considered. If the value chosen for τ is the minimum among the τ values of all the nodes, then the network is more restrictive and the external communications need to be faster. On the other hand, choosing the maximum value of τ means that the communications are relaxed with respect to the time they take, but the counterpart is that they do not help to balance the DS, since the least available nodes need faster external communications in order to reach node-availability values equivalent to those of the most available nodes. Clearly, in order to provide communications at the speed required by the value of τ, the network specifications play an important role. The same reasoning applies to the values of ρ for internal communications. The ranges of values for ρ and τ listed in Table 2 for the relaxed example and in Table 4 for the restrictive example indicate, respectively, the time that internal (ρ) and external (τ) communications must take in order to have a balanced system. Furthermore, they show the benefits of using φ as a metric, convenient not only to perform a task such as load balancing, but also useful to determine the optimal duration of process communications and, in the case of τ, to provide the speed specifications for the communications network.
Whether to choose the minimum, maximum or average values from the restrictive or relaxed case for ρ and τ depends strictly on the particular implementation (i.e. network protocol and processor utilization factor).

Table 4: Processor utilization per node (restrictive example)

Node | Utilization factor | Final ρ time | Final τ time
1 | 0.8125 | 0.0167 | 0.0348
2 | 0.75 | 0.1491 | 0.3455
3 | 0.7292 | 0.172 | 0.4599
4 | 0.7188 | 0.1828 | 0.5138
5 | 0.7125 | 0.189 | 0.5452
6 | 0.7083 | 0.1931 | 0.5657
7 | 0.7054 | 0.196 | 0.5802
8 | 0.7031 | 0.1982 | 0.591
9 | 0.7014 | 0.1999 | 0.5993
10 | 0.7 | 0.2012 | 0.606

3 The High-Low (HiLo) algorithm

In order to perform load balancing (LB) and load distribution using the presented node-availability metric, a simple and well-known algorithm, here named High-Low (HiLo), is used. The underlying principle in HiLo is to determine the most available node and the least available one. The knowledge of these nodes is used by the algorithm in its two main methods: the periodic method named Balance and the event-triggered method named Distributor (see Figure 3). These two methods are nested depending on the arrival of new processes, as shown in Figure 4: the periodic Balance method is executed every period, while Distributor is executed only when a new process arrives.

Figure 3: Flow diagram of HiLo methods Balance and Distributor

The Balance method works as follows. The most available node is found by calculating the node-availability of all the nodes participating in the distributed system and selecting the one with the maximum φ value; the most available node is thus max(φ) and, analogously, the least available node is min(φ). In order to balance a DS, the HiLo algorithm also requires an aimed balance level, named ξ, with 0 < ξ ≤ 1.
With these three values (max(φ), min(φ) and ξ), the periodic Balance method calculates the actual "load-balancing factor" µ using equation (5). If the obtained value of µ is under ξ, then the Balance method performs the actual load balancing, which is as simple as removing one process from the queue of the node min(φ) and migrating it to the node max(φ). The second method, Distributor, is responsible for assigning a node to any new process arriving at the distributed system: once a new process arrives, Distributor sends it to the node max(φ).

Figure 4: HiLo methods: Balance and Distributor

3.1 Pseudo-code of HiLo

The HiLo algorithm has two methods: the first is the periodic method "Balance"; the second, executed every time a new process arrives at the distributed system, is "Distributor".

HiLo:
    periodically_execute Balance
    if new process then Distributor

The Balance method:

Balance:
    if min(φ) / max(φ) < ξ then
        remove process from min(φ) and send it to max(φ)

When activated, the Distributor method sends the arriving (new) process to the node max(φ):

Distributor:
    add_queue max(φ) new process

3.2 Simulation

The HiLo algorithm, as well as the metric φ, are tested both on a 16-node cluster (the case study in the next section) and on a simulation using Matlab. Figure 5 shows the simulation of this process: 500 samples are taken and the processes are generated between samples 50 and 250, using a Poisson distribution to simulate both the number of processes ready to be executed and the duration of each one. It can be seen that the system reaches an absolute balance around sample 400; moreover, during the whole execution of the set of processes no single node is over-occupied or idle.

Figure 5: Simulation of an LB process within a cluster; 500 samples are taken and the processes are generated between samples 50 and 250.
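The pseudo-code above can be sketched as ordinary functions; the dictionary-based representation of the queues and φ values is an illustrative assumption, not the paper's implementation:

```python
def balance(phi, queues, xi):
    # periodic method: if mu = min(phi)/max(phi) drops under xi,
    # migrate one process from the least to the most available node
    lo = min(phi, key=phi.get)
    hi = max(phi, key=phi.get)
    if phi[lo] / phi[hi] < xi and queues[lo]:
        queues[hi].append(queues[lo].pop(0))

def distributor(phi, queues, process):
    # event-triggered method: queue any arriving process at max(phi)
    hi = max(phi, key=phi.get)
    queues[hi].append(process)
```

With φ = {'n1': 1.0, 'n2': 0.2} and ξ = 0.5, a call to balance moves one process from n2 to n1, and distributor sends any new arrival to n1.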
The parameters used for the simulation are: number of nodes n = 16, total samples (j) = 500, and the sampling interval δ. The internal communication factor ρ and the external communication factor τ are obtained by averaging the minimum respective values from Table 2 and Table 4. The average consumption time of each process is given by an exponential random distribution with mean proportional to δ, and an aimed balance level ξ is fixed. The initial value of α_i,j is one; this value is updated every time a process is queued at node i, decreasing by 0.1% per process, and the approximate inverse procedure is performed when the process has finished. For β_i,j (available memory) a similar procedure is performed, with a decrement value of 0.12%. These increments/decrements follow the availability behavior presented in Section 2.1. The value of ε_i,j is calculated every sample, based on how many processes are queued per node, and the values of κ_i,j and ν_i,j are random numbers between 0 and ε_i,j. The impact that φ has on the DS load balance can be seen in Figure 5, which outlines that the workload was evenly distributed amongst the 16 nodes during the whole simulation.

4 Case study

The case study is based on a real geology-specific application, which consists of several similar processes distributed over a cluster. These processes perform a different number of operations locally. The case study is executed seven times, each time with a different number of processes, as shown in Table 5. Each of these 7 sets of processes is executed with the HiLo algorithm and with two common load distribution algorithms: random and round-robin [12-14]. The processes become ready to be executed according to a Poisson distribution, independent for each case study. The implementation details are presented in Subsection 4.1 and the results in Subsection 4.2.
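The arrival pattern of the simulation (Poisson-distributed process counts between samples 50 and 250, exponentially distributed durations) can be sketched as follows. The rate λ = 2 and the mean duration of 32 samples are hypothetical stand-ins, since the paper's exact values are not legible in this copy:

```python
import math
import random

def poisson(lam, rng):
    # Knuth's algorithm for drawing a Poisson-distributed count
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1

rng = random.Random(7)
# new processes appear only between samples 50 and 250
arrivals = [poisson(2.0, rng) if 50 <= j < 250 else 0 for j in range(500)]
# each process gets an exponentially distributed duration
durations = [rng.expovariate(1.0 / 32.0) for _ in range(sum(arrivals))]
```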
4.1 Implementation

The case study is implemented on a dedicated cluster of 16 nodes with the following configuration: one master node with 2 Xeon processors at 2.6 GHz, 1.5 GB RAM and Linux kernel 2.6.8, and 15 nodes with a Pentium IV processor at 2.6 GHz, 512 MB RAM and Linux kernel 2.6.12. The master node performs the distribution and load-balancing functions of the cluster. The case studies are configured as shown in Table 5.

Table 5: Number of processes per case study

Case study | Number of processes
1 | 100
2 | 200
3 | 300
4 | 400
5 | 500
6 | 1000
7 | 1500

These processes are independent of each other, and a Poisson distribution is used to simulate when a process becomes ready to be executed. Every process performs a random number of local sums and string concatenations; both random numbers are generated globally using an exponential distribution for each case study [6], [15]. Furthermore, as both numbers differ, the demands on processor and memory are also different for every process and case study. In these cases κ and ν are equal to zero, since there is no communication between processes. Each set of processes is executed using the random, round-robin and HiLo algorithms to distribute the load. The random algorithm uses a uniform distribution to select the node in which the arriving process is going to be queued. The round-robin algorithm sends the arriving process to the nodes in a round-robin manner. The HiLo algorithm sends the arriving process to the most available node, max(φ). HiLo uses the parameter values described earlier in this paper, selected (by way of example) in a heuristic manner, with n = 16.

4.2 Results

The total execution times of the seven sets of processes (listed in Table 5) were obtained by executing them on the cluster, using the previously listed algorithms for load distribution. The presented node-availability metric, implemented in the HiLo algorithm, outperforms the other two, as can be seen in
Figure 6 (algorithm comparison). Taking the execution time of HiLo as 100%, the other two algorithms (round-robin and random) take more time to complete the execution of the same seven sets of processes; this extra time goes from 10% to 65% (i.e. the execution time with the round-robin or random algorithms ranges from 110% to 165% of HiLo's 100%). It can also be noticed in Figure 6 that with the smallest set of processes (i.e. 100) the percentage gain of HiLo is larger, meaning that the algorithm is efficient even when the set has a relatively small number of processes.

Figure 6: Time efficiency of the algorithms as a percentage, with HiLo = 100%.

Every time the condition µ < ξ was fulfilled, a balance was performed. The total numbers of balances that HiLo performed are shown in Table 6. The total execution time is expressed in minutes:seconds.

Table 6: Number of balances and total execution time per process set

# processes | # balances | Total time
100 | 31 | 01:32.0
200 | 86 | 03:17.8
300 | 137 | 04:50.0
400 | 186 | 06:27.3
500 | 226 | 07:52.0
1000 | 456 | 15:36.0
1500 | 676 | 22:59.9

Figure 7 shows the execution times of the process sets listed in Table 5 using the random, round-robin and HiLo algorithms; in every case HiLo outperforms the other two.

5 Conclusions and future work

The node-availability metric φ allows performing efficient LB without knowing in advance the execution times of the processes or their communication requirements. The metric takes into account processor and memory availability at every sample, together with the estimation of the related communication times per processor and process, respectively. An optimization procedure based on the communications protocol performance is carried out in order to guarantee the suitability of this metric.
The time values of ρ and τ obtained after this optimization procedure provide the speed specifications of the communications network and the type of protocol required. Based upon this information, the proposed metric φ and HiLo achieve an efficient response, as shown in Figure 6.

Figure 7: Total time algorithm comparison.

The results presented here give a clear idea of the impact of φ as the criterion metric for performing a load-balancing procedure. For the case study considered, the load-balancing algorithm has a cost in terms of time which is negligible and is included in the "total time" column of Table 6. In terms of the HiLo algorithm, the ξ factor gives the possibility of balancing in either a very relaxed or a strict manner. For instance, if the desired balance factor ξ is decreased significantly, the number of balances decreases towards zero; on the other hand, when the value of ξ tends to 1, the system performs a load-balancing process every δ samples.

Bibliography

[1] J. Bahi, R. Couturier, F. Vernier, Synchronous distributed load balancing on dynamic networks, Journal of Parallel and Distributed Computing, 65, pp. 1397-1405, Elsevier, 2005.

[2] D. Bertsekas, Constrained Optimization and Lagrange Multiplier Methods, Academic Press Inc., USA, 1992.

[3] J. Chiasson, Z. Tang, J. Ghanem, C. T. Abdallah, D. Birdwell, M. M. Hayat, H. Jérez, The effect of time delays on the stability of load balancing algorithms for parallel computations, IEEE Transactions on Control Systems Technology, Vol. 13, No. 6, November 2005, pp. 932-942.

[4] R. F. de Mello, L. J. Senger, L. T. Yang, A routing load balancing policy for grid computing environments, Proceedings of the 20th International Conference on Advanced Information Networking and Applications, 1550-445X/06, IEEE, 2006.

[5] P. Ghosh, N. Roy, S. K. Das, K.
Basu, A pricing strategy for job allocation in mobile grids using a non-cooperative bargaining theory framework, Journal of Parallel and Distributed Computing, 65, pp. 1366-1383, Elsevier, 2005.

[6] D. Grosu, A. Chronopoulos, Algorithmic mechanism design for load balancing in distributed systems, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, Vol. 34, No. 1, February 2004, pp. 77-84.

[7] O. Lee, M. Anshel, I. Chung, Design of an efficient load balancing algorithm on distributed networks by employing symmetric balanced incomplete block design, IEE Proceedings - Communications, Vol. 151, No. 6, December 2004.

[8] L. Keqin, Job scheduling and processor allocation for grid computing on metacomputers, Journal of Parallel and Distributed Computing, 65, pp. 1406-1418, Elsevier, 2005.

[9] A. Menendez, H. Benitez-Perez, Node availability for distributed systems considering processor and RAM utilization, Eighth Mexican International Conference on Computer Science (ENC'07), pp. 131-137, DOI: 10.1109/enc.2007.24, 2007.

[10] B. Parhami, Swapped interconnection networks: topological, performance, and robustness attributes, Journal of Parallel and Distributed Computing, 65, pp. 1443-1452, Elsevier, 2005.

[11] M. Perez, A. Sanchez, J. Peña, V. Robles, A new formalism for dynamic reconfiguration of data servers in a cluster, Journal of Parallel and Distributed Computing, 65, pp. 1134-1145, Elsevier, 2005.

[12] H. Sit, K. Ho, H. V. Leong, W. P. R. Luk, L. Ho, An adaptive clustering approach to dynamic load balancing, Proceedings of the 7th International Symposium on Parallel Architectures, Algorithms and Networks (ISPAN'04), 1087-4089, IEEE, 2004.

[13] D. Takemoto, S. Tagashira, S. Fujita, Distributed algorithms for balanced zone partitioning in content-addressable networks, Proceedings of the Tenth International Conference on Parallel and Distributed Systems (ICPADS'04), 1521-9097, IEEE, 2004.
[14] TORQUE Resource Manager, http://www.clusterresources.com/pages/products/torque-resourcemanager.php, 2006.

[15] Z. Zeng, B. Veeravalli, Rate-based and queue-based dynamic load balancing algorithms in distributed systems, Proceedings of the Tenth International Conference on Parallel and Distributed Systems, 1521-9097/04, IEEE, 2004.

[16] Jane W. S. Liu, Real-Time Systems, Prentice Hall, USA, 2000.

Antonio Menéndez LC is a computer science engineer from the Universidad La Salle and currently a PhD candidate at the Universidad Nacional Autónoma de México (UNAM). He has more than 25 years of experience in the computer and technology industries, leading multi-million projects, as well as in the academic world. Devoted to research in recent years, he has participated in international congresses on real time, convergence, hybrid systems, and computer science.

Héctor Benítez Pérez is a full-time researcher at IIMAS UNAM (México). He obtained his BSc in electronic engineering at the Engineering Faculty, UNAM, in 1994 and his PhD at Sheffield University, UK, in 1999. His areas of interest are real-time control and fault diagnosis.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 3, pp. 268-279

Adding Lifetime to Objects and Membranes in P Systems

B. Aman, G. Ciobanu

Bogdan Aman, Gabriel Ciobanu
Romanian Academy, Institute of Computer Science
and A.I.Cuza University of Iaşi, Romania
E-mail: baman@iit.tuiasi.ro, gabriel@info.uaic.ro

Abstract: Membrane systems are computing devices inspired by the functioning of the living cell. A feature of membrane systems is that objects and membranes are persistent. In fact, this is not quite true in the real world: cells and intracellular proteins have a well-defined lifetime. Inspired by these biological facts, we define a model of membrane systems in which each membrane and each object has an attached lifetime. Some results show that this model is at least as powerful as the usual one.
1 Introduction to membrane computing

Membrane systems are essentially parallel and nondeterministic computing models inspired by the compartments of eukaryotic cells and their biochemical reactions. The structure of the cell is represented by a set of hierarchically embedded regions, each one delimited by a surrounding boundary (called a membrane), and all of them contained inside an external special membrane called the skin. The molecular species (ions, proteins, etc.) floating inside cellular compartments are represented by multisets of objects described by means of symbols or strings over a given alphabet. The objects can be modified or communicated between adjacent compartments. Chemical reactions are represented by evolution rules which operate on the objects, as well as on the compartmentalized structure (by dissolving, dividing, creating, or moving membranes).

A membrane system can perform computations in the following way: starting from an initial configuration, defined by the multiset of objects initially placed inside the membranes, the system evolves by applying the evolution rules of each membrane in a nondeterministic and maximally parallel manner. A rule is applicable when all the objects which appear in its left-hand side are available in the region where the rule is placed. The maximally parallel way of using the rules means that in each step, in each region of the system, a multiset of rules is chosen which is maximal and applicable, namely a multiset of rules such that no further rule can be added to it. A halting configuration is reached when no rule is applicable. The result is represented by the number of objects in a specified membrane.

Several variants of membrane systems are inspired by different aspects of living cells (symport- and antiport-based communication through membranes, catalytic objects, membrane charge, etc.).
Their computing power and efficiency have been investigated using the approaches of formal languages, grammars, register machines and complexity theory. Membrane systems (also called P systems) are presented together with many variants and examples in [21]. Several applications of these systems are presented in [12]. An updated bibliography can be found at the P systems web page [22].

For an alphabet V = {a_1, ..., a_n}, we denote by V* the set of all strings over V; λ denotes the empty string and V+ = V* \ {λ}. A multiset over V is represented by a string over V (together with all its permutations), and each string precisely identifies a multiset. A language (over V) is any subset of V*, usually denoted by L. Given a language L, we define the set Ps(L) = {Ψ_V(x) | x ∈ L}, called the Parikh image of L. If FL is a family of languages, then PsFL denotes the family of Parikh images of languages in FL.

Copyright © 2006-2010 by CCC Publications

Definition 1. A P system of degree n ≥ 1 is a construct

Π = (V, T, C, H, µ, w_1, ..., w_n, (R_1, ρ_1), ..., (R_n, ρ_n), i_o)

where:

1. V is an alphabet of symbols; its elements are called objects;
2. T ⊆ V is the terminal (or output) alphabet;
3. C ⊆ V, C ∩ T = ∅, is the alphabet of catalysts;
4. H is a set of membrane labels;
5. µ ⊆ H × H is a tree that describes the membrane structure, such that (i, j) ∈ µ denotes that the membrane labelled by j is contained in the membrane labelled by i;
6. w_i ∈ V*, for each 1 ≤ i ≤ n, is a multiset of objects initially assigned to membrane i;
7. R_i, for all 1 ≤ i ≤ n, is a finite set of evolution rules associated with membrane i; an evolution rule is a multiset rewriting rule of the form u → v, with u ∈ V+, either v = v′ or v = v′δ, v′ ∈ ((V × {here, out}) ∪ (V × {in_j | 1 ≤ j ≤ n}))*, and δ a special symbol not appearing in V;
8.
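The maximally parallel step described above can be sketched for a single region; the Counter-based multiset encoding is an illustrative assumption:

```python
from collections import Counter
import random

def applicable(lhs, multiset):
    # a rule is applicable when its whole left-hand side is available
    return all(multiset[obj] >= n for obj, n in lhs.items())

def maximal_parallel_step(multiset, rules, rng):
    # keep choosing applicable rules nondeterministically until none fits:
    # the applied multiset of rules is then maximal; products only become
    # visible at the end of the step
    produced = Counter()
    while True:
        live = [(l, r) for l, r in rules if applicable(l, multiset)]
        if not live:
            break
        lhs, rhs = rng.choice(live)
        multiset.subtract(lhs)
        produced.update(rhs)
    multiset.update(produced)
    return multiset
```

Applying the single rule a → b to the multiset a³ consumes every a and yields b³; the step then halts because no further rule is applicable.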
ρ_i, for all 1 ≤ i ≤ n, is a partial order relationship defined over the rules in R_i, specifying a priority relation between these rules;
9. i_o is the label of an elementary membrane of µ that identifies the output region.

Therefore, a P system of degree n ≥ 1 consists of a membrane structure µ containing n ≥ 1 membranes, where each membrane i is assigned a finite multiset of objects w_i and a finite set of evolution rules R_i. An evolution rule is a multiset rewriting rule which consumes a multiset of objects from V and produces a multiset of pairs (a, t), with a ∈ V and t ∈ {here, out} ∪ {in_j | 1 ≤ j ≤ n} a target specifying where to move the objects after the application of the rule. In addition, an evolution rule can produce the special object δ to specify that, after the application of the rule, the membrane containing δ has to be dissolved. After dissolving a membrane, all objects and membranes previously present in it become elements of the immediately upper membrane, while the rules of the dissolved membrane are removed. We use the name "P systems without lifetimes" instead of "P systems" in order to make a clear distinction from the P systems with lifetimes introduced in what follows.

2 P systems with lifetimes

The evolution of complicated real systems frequently involves various interdependencies among components. Some mathematical models of such systems combine both discrete and continuous evolutions on multiple time scales, differing by many orders of magnitude. For example, in nature the molecular operations of a living cell can be thought of as such a dynamical system: the molecular operations happen on a wide range of time scales, and proceed in ways which depend on populations of molecules of widely varying sizes. Molecular biologists have used formalisms developed in computer science (e.g.
hybrid Petri nets) to obtain simplified models of portions of these transcription and gene regulation processes. According to Molecular Cell Biology [18]: (i) "The life span of intracellular proteins varies from as short as a few minutes for mitotic cyclins, which help regulate passage through mitosis, to as long as the age of an organism for proteins in the lens of the eye." (ii) "Most cells in multicellular organisms ... carry out a specific set of functions over periods of days to months or even the lifetime of the organism (nerve cells, for example)."

It is obvious that lifetimes play an important role in biological evolution. We use an example from the immune system.

Example 1 ([18]). T-cell precursors arriving in the thymus from the bone marrow spend up to a week differentiating there before they enter a phase of intense proliferation. In a young adult mouse the thymus contains a large, steady population of thymocytes. Many new cells are generated each day; however, only a small fraction of these will leave the thymus each day as mature T cells. Despite the disparity between the number of T cells generated daily in the thymus and the number leaving, the thymus does not continue to grow in size or cell number, because the great majority of the thymocytes which develop in the thymus also die within the thymus.

Inspired by these biological facts, we add lifetimes to objects and membranes. We use a global clock to simulate the passage of time in a membrane system.

Definition 2. A P system with lifetimes of degree n ≥ 1 is a construct

Π = (V_t, T, C, H_t, µ_t, w_1, ..., w_n, (R_1, ρ_1), ..., (R_n, ρ_n), i_o)

where:

1. V_t = V × (ℕ ∪ {∞}) is a set of pairs of the form (a, t_a), where a ∈ V is an object (as in Definition 1) and t_a ∈ ℕ ∪ {∞} is the lifetime of the object a;
2. T and C are as in Definition 1;
3.
H_t = H × (ℕ ∪ {∞}) is a set of pairs of the form (h, t_h), where h ∈ H is a membrane label (as in Definition 1) and t_h ∈ ℕ ∪ {∞} is the lifetime of the membrane h;
4. µ_t ⊆ H_t × H_t is a tree that describes the membrane structure, such that ((i, t_i), (j, t_j)) ∈ µ_t denotes that the membrane labelled by j, with lifetime t_j, is contained in the membrane labelled by i, with lifetime t_i;
5. w_i ⊆ (V_t)* is a multiset of pairs from V_t assigned initially to membrane i;
6. R_i, for all 1 ≤ i ≤ n, is a finite set of evolution rules associated with membrane i, of the following forms:
(a) u → v, with u ∈ V_t+, either v = v′ or v = v′δ, v′ ∈ ((V_t × {here, out}) ∪ (V_t × {in_j | 1 ≤ j ≤ n}))*; δ is a special symbol not appearing in V;
(b) (a, t) → (a, t − 1), for all a ∈ V and t > 0: if an object a is not involved in a rule of type (a) and has a lifetime t > 0, then its lifetime is decreased;
(c) (a, 0) → λ, for all a ∈ V: if an object a has lifetime 0, then the object is replaced with the empty multiset λ, thus simulating the degradation of proteins;
(d) [ ]_(i,t) → [ ]_(i,t−1), for all 1 ≤ i ≤ n: in each evolution step the lifetime of each membrane of the membrane structure is decreased by one;
(e) [ ]_(i,0) → [δ]_(i,0), for all 1 ≤ i ≤ n: if the lifetime of a membrane reaches 0, the membrane is dissolved.
7. ρ_i, for all 1 ≤ i ≤ n, is a partial order relationship defined over the rules in R_i, specifying a priority relation between these rules;
8. i_o is the label of an elementary membrane of µ_t that identifies the output region.

These rules are applied according to the following principles:

1. All the rules are applied in parallel: in a step, the rules are applied to all objects and to all membranes; an object can be used by only one rule, nondeterministically chosen (there is no priority among rules), but any object which can evolve by a rule of any form should evolve.
2.
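Rules (b)-(e) describe one tick of the global clock. A minimal sketch (the list/dict encodings are illustrative), using Python's float infinity so that ∞ − 1 = ∞ holds automatically:

```python
INF = float('inf')

def tick(objects, mem_lifetimes):
    # rules (b)/(d): decrement every lifetime; INF - 1 == INF, as required
    objects = [(a, t - 1) for a, t in objects]
    mem_lifetimes = {h: t - 1 for h, t in mem_lifetimes.items()}
    # rule (c): objects whose lifetime reached 0 degrade to lambda
    objects = [(a, t) for a, t in objects if t > 0]
    # rule (e): membranes whose lifetime reached 0 are marked for dissolution
    dissolved = {h for h, t in mem_lifetimes.items() if t <= 0}
    return objects, mem_lifetimes, dissolved
```

An object entering a tick with lifetime 1 reaches 0 and degrades, while objects and membranes with infinite lifetime (such as the skin) persist unchanged.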
If a membrane is dissolved, then all the objects in its region are left free in the region immediately above it. Because all rules are associated with membranes, the rules of a dissolved membrane are no longer available at the next step.
3. The skin membrane has a lifetime equal to ∞, so it can never be dissolved.
4. If a membrane or object has a lifetime equal to ∞, when applying the rules simulating the passage of time we use the equality ∞ − 1 = ∞.

The computation stops when the membrane system contains only objects and membranes whose lifetime is equal to ∞.

Example 2. The concentration of a molecule can be adjusted quickly only if the lifetime of the molecule is short [1]. It is natural to think of signaling systems in terms of the changes produced when a signal is delivered, but it is just as important to consider what happens when a signal is withdrawn. During development, transient signals often produce lasting effects: they can trigger a change that persists indefinitely, through cell memory mechanisms. But in most cases, especially in adult tissues, when a signal ceases, the response fades. The signal acts on a system of molecules that is undergoing continual turnover, and when the signal is shut off, the replacement of the old molecules by new ones wipes out the traces of its action. It follows that the speed of reaction to shutting off the signal depends on the rate of turnover of the molecules that the signal affects. It may not be as obvious that this turnover rate also determines the promptness of the response when the signal is turned on. Figure 1 shows the predicted relative rates of change in the intracellular concentrations of molecules with differing turnover times when their rates of synthesis are suddenly increased by a factor of 10.
The concentrations of those molecules that are normally being rapidly degraded in the cell (red lines) change quickly, whereas the concentrations of those that are normally being slowly degraded (green lines) change proportionally more slowly. The numbers (in blue) on the right-hand side are the half-lives assumed for each of the different molecules. Consider, for example, two intracellular molecules X and Y, both of which are normally maintained at a concentration of 1000 molecules per cell. Molecule X has a slow turnover rate: it is synthesized and degraded at a rate of 10 molecules per second, so that each molecule has an average lifetime in the cell of 100 seconds. Molecule Y turns over 10 times as quickly: it is synthesized and degraded at a rate of 100 molecules per second, with each molecule having an average lifetime of 10 seconds. If a signal acting on the cell boosts the rates of synthesis of both X and Y tenfold without any change in the molecular lifetimes, at the end of 1 second the concentration of Y will have increased by nearly 900 molecules per cell (10 × 100 − 100), while the concentration of X will have increased by only 90 molecules per cell. In fact, after its synthesis rate has been either increased or decreased abruptly, the time required for a molecule to shift halfway from its old to its new equilibrium concentration is equal to its normal half-life, that is, the time that would be required for its concentration to fall by half if all synthesis were stopped (Figure 1). The same principles apply to proteins as well as to small molecules, and to molecules in the extracellular space as well as to those in cells. Many rapidly degraded intracellular proteins have short half-lives, some surviving less than 10 minutes; in most cases these are proteins with key regulatory roles, whose concentrations are rapidly regulated in the cell by changes in their rates of synthesis. Likewise, any covalent modifications of proteins that occur as part of a rapid signaling process (most commonly the addition of a phosphate group to an amino acid side chain) must be continuously removed at a rapid rate to make such signaling possible.

B. Aman, G. Ciobanu

Figure 1: The importance of rapid turnover [1]

Example 3. The scenario presented in Example 2 can be modelled using P systems with lifetimes. Consider the membrane configuration [(xs, ∞)(x, 1)^10 . . . (x, 100)^10 (ys, ∞)(y, 1)^100 . . . (y, 10)^100](cell,∞), describing the structure of the cell when in equilibrium. xs and ys represent the molecules from which x and y are synthesized, while (x, t) and (y, t) represent the fact that there are 10 molecules x with the lifetime equal to t and 100 molecules y with the lifetime equal to t. The rules describing the evolution of this system are:
1. (x, 0) → λ: a molecule x which is at the end of its lifetime is degraded.
2. (y, 0) → λ: a molecule y which is at the end of its lifetime is degraded.
3. (xs, ∞) → (xs, ∞)(x, 100)^10: each second 10 new x molecules are synthesized.
4. (ys, ∞) → (ys, ∞)(y, 10)^100: each second 100 new y molecules are synthesized.
5. (x, t) → (x, t − 1), t > 0: the lifetime of a molecule x which is not involved in any reaction is decreased.
6. (y, t) → (y, t − 1), t > 0: the lifetime of a molecule y which is not involved in any reaction is decreased.
After applying the rules in a maximal manner, after each second we reach again the initial configuration [(xs, ∞)(x, 1)^10 . . . (x, 100)^10 (ys, ∞)(y, 1)^100 . . . (y, 10)^100](cell,∞). If two signals enter the cell (we model the signals by the pairs (cx, tc) and (cy, tc)), then we consider two new rules:
7. (cx, tc)(xs, ∞) → (cx, tc − 1)(xs, ∞)(x, 100)^100: if an object cx is present in the cell, then the rate of synthesis of x is increased 10 times.
8.
(cy, tc)(ys, ∞) → (cy, tc − 1)(ys, ∞)(y, 10)^100: if an object cy is present in the cell, then the rate of synthesis of y is increased 10 times.
When adding these rules we also add some priorities between the rules, namely 7 > 3 and 8 > 4. After one second the concentration of y increases by nearly 900 molecules per cell (10 × 100 − 100), while the concentration of x increases by only 90 molecules per cell.

Figure 2: A P system with lifetimes

Figure 2 shows an example of a P system with lifetimes. Graphically, the boxes represent the membranes and their nesting reflects the hierarchy. Membrane 1 represents the skin membrane. Both membranes have a lifetime equal to ∞, meaning that no dissolving rule is necessary. Inside the skin membrane we have the initial multiset of pairs, the evolution rules and some priorities between them. In the evolution rules we omit the subscript here for the objects in the products that remain in the same membrane. The defined P system with lifetimes computes the least common multiple of n1 and n2, namely lcm(n1, n2). The idea is to put a pair (a, n1 − 1) and a pair (b, n2 − 1) at the beginning of the computation, and to produce a pair (c, ∞), which is sent into the inner membrane, each time the lifetime of the object a is decreased. The first time the objects a and b appear together with the lifetime 0 is exactly after lcm(n1, n2) time units. At this moment the inner membrane contains lcm(n1, n2) objects c, which represent the output of the system.

3 Systems with and without lifetimes

The following results describe certain relationships between P systems with lifetimes and P systems without lifetimes, and between similar P systems with lifetimes.

Proposition 4. For every P system without lifetimes there exists a P system with lifetimes providing the same output by performing an equal number of steps.

Proof (sketch): It is easy to see that the class of P systems with lifetimes includes the class of P systems without lifetimes, since we can assign ∞ to all lifetimes appearing in the membrane structure and in the evolution rules. □

A somewhat surprising result is that P systems with lifetimes can be simulated by P systems without lifetimes.

Proposition 5. For every P system with lifetimes there exists a P system without lifetimes providing the same output by performing an equal number of steps.

Proof: We use the notation rhs(r) to denote the multiset of pairs which appear in the right-hand side of a rule r. This notation is extended naturally to multisets of rules: given a multiset of rules R, the right-hand side rhs(R) of the multiset is obtained by adding the right-hand sides of the rules in the multiset, considered with their multiplicities. Each object a ∈ V from a P system with lifetimes has a maximum lifetime (we denote it by lifetime(a)), which can be calculated as follows:

lifetime(a) = max({t | (a, t) ∈ wi, 1 ≤ i ≤ n} ∪ {t | (a, t) ∈ rhs(Ri), 1 ≤ i ≤ n}).

In what follows we present the steps required to build a P system without lifetimes starting from a P system with lifetimes, such that both provide the same output.
1. A membrane structure from a P system with lifetimes is translated into a membrane structure of a P system without lifetimes in the following way: the lifetimes of elements from the P system with lifetimes are simulated using the membranes 1, . . ., k in the corresponding P system without lifetimes, as shown in the next steps of the translation. The value of k is the maximum over the finite lifetimes of the objects and of the surrounding membrane mem, namely k = max({lifetime(a) | a ∈ V, lifetime(a) ≠ ∞} ∪ {tmem | tmem ≠ ∞}). If an object or the surrounding membrane has the lifetime equal to ∞ in the P system with lifetimes, then we do not need to count the passage of time, namely to use the membranes 1, . . ., k in the P system without lifetimes for the corresponding object or membrane. The object mem placed inside the membrane labelled 1 is used to simulate the passage of time for the membrane. The initial multiset of objects wT from membrane mem in the P system with lifetimes is translated into the multiset w, which is added into membrane 1 inside membrane mem in the corresponding P system without lifetimes, since all objects from the initial multiset are just starting their life.
2. The rules (a, t) → (a, t − 1), for all a ∈ V, t > 0 and t ≠ ∞, from the P system with lifetimes can be simulated in the P system without lifetimes using the following rules:
(a) a → (a oai, out), placed inside membrane i, for all 1 ≤ i ≤ lifetime(a) − 1: the object oai is used to keep track of the units of time that have passed from the lifetime of the object a;
(b) a oaj → (a, inj+1), placed inside membrane mem, for all 1 ≤ j ≤ lifetime(a) − 1 and a ∈ V: these rules, together with the previous ones, simulate the passage of a unit of time from the lifetime of object a in the P system with lifetimes, by moving the object a from a membrane j to a membrane j + 1, for 1 ≤ j ≤ lifetime(a) − 1, in the P system without lifetimes.
3. The rules (a, 0) → λ, for all a ∈ V, from the P system with lifetimes can be simulated in the P system without lifetimes using the following rule:
(a) a → λ, placed inside membrane lifetime(a): if the object a reaches the membrane labelled with lifetime(a) in the P system without lifetimes, it means that the lifetime of object a is 0 in the corresponding P system with lifetimes, so it is replaced by λ.
4. The rules uT → vT from the P system with lifetimes can be simulated in the P system without lifetimes using the following rules:
(a) u ouj → (v, in1): the multiset ouj contains objects of the form oaj, where a ∈ u and 1 ≤ j ≤ lifetime(a) − 1.
When replacing the multiset u with the multiset v, we also remove the objects oaj that keep track of the life of the objects appearing in u, since we do not need them anymore. We send the newly obtained multiset v into membrane 1, since all objects from this multiset are just starting their life.
5. The rules [ ](mem,t) → [ ](mem,t−1), for all t > 0 and t ≠ ∞, from the P system with lifetimes can be simulated in the P system without lifetimes using the following rules:
(a) mem → (mem omem,i, out), placed inside membrane i, for all 1 ≤ i ≤ lifetime(mem) − 1: the object omem,i is used to keep track of the units of time that have passed from the lifetime of the membrane mem;
(b) mem omem,j → (mem, inj+1), placed inside membrane mem, for all 1 ≤ j ≤ lifetime(mem) − 1: these rules, together with the previous ones, simulate the passage of a unit of time from the lifetime of membrane mem in the P system with lifetimes, by moving the object mem from a membrane j to a membrane j + 1, for 1 ≤ j ≤ lifetime(mem) − 1, in the P system without lifetimes.
6. A rule [ ](mem,0) → [δ](mem,0) from the P system with lifetimes can be simulated in the P system without lifetimes using the following rules:
(a) mem → (oδ, out), placed inside membrane lifetime(mem) − 1: if the object mem reaches the membrane labelled with lifetime(mem) − 1 in the P system without lifetimes, it means that the lifetime of membrane mem is about to reach 0 in the corresponding P system with lifetimes, so the membrane needs to be dissolved.
(b) oδ → δ(δ, in1) . . . (δ, ink): once oδ is created by the previous rule, an object δ is created inside membrane mem and an object δ is sent into each membrane j, 1 ≤ j ≤ k. This means that all these membranes are dissolved, all their rules are deleted, and all objects are sent into the parent membrane. The dissolving of the membranes takes place after applying all other possible rules. At the moment of the dissolution the only existing objects are found in membrane mem.
For each object a there exists an object oaj that keeps track of the life of a, which makes it possible to continue counting the lifetime of a in the parent membrane. The output membrane of the P system with lifetimes is translated into the output membrane of the P system without lifetimes. After performing the same number of evolution steps in both systems, the output membranes contain the same multisets of objects. □

We are now able to establish the computational power of P systems with lifetimes. We denote by NtPm(coo, tar) the family of sets of natural numbers generated by P systems with lifetimes of degree at most m ≥ 1, using cooperative rules and communication of objects through membranes. We also denote by NRE the family of all sets of natural numbers generated by arbitrary grammars.

Proposition 6. NtPm(coo, tar) = NRE, for all m ≥ 1.

Proof (sketch): Since the outcome of each P system with lifetimes can be obtained by a P system without lifetimes, we cannot get more than the computational power of P systems. Therefore, according to Theorem 3.3.3 from [21], the family NtPm of sets of natural numbers generated by P systems with lifetimes is the same as the family NRE of sets of natural numbers generated by arbitrary grammars. □

Remark 3.1. Consider a membrane system Π = [[ ](2,∞)(a, t1)(a, t2)](1,∞), with a set of rules R that rewrites the pairs (a, t1) and (a, t2) and sends objects (c, ∞) into the inner membrane. Since the membranes have the lifetime ∞, it is not necessary to consider rules for decreasing the lifetimes of the membranes. If we rewrite this system as Π′ = [[ ](2,∞)(a, t1)(a, t2)(d, t)](1,∞), with R′ = R ∪ {(d, i) → (d, i − 1); (d, 0) → λ}, then after t units of time the membrane system Π has the same evolution as the membrane system Π′.
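The object-level timing behaviour used in the remark above can be checked with a small sketch of the lifetime rules (a, t) → (a, t − 1) and (a, 0) → λ. All names and the encoding of ∞ are ours; note that in this encoding a degraded object disappears one step after its lifetime reaches 0:

```python
INF = float('inf')  # lifetime ∞; conveniently, INF - 1 == INF, as required

def step(multiset):
    # one evolution step: apply (a,t) -> (a,t-1) to every live pair
    # and (a,0) -> lambda to every pair whose lifetime has run out
    out = []
    for (a, t) in multiset:
        if t == INF:
            out.append((a, t))       # infinite lifetime: unchanged
        elif t > 0:
            out.append((a, t - 1))   # lifetime decreased by one
        # t == 0: the object is replaced by the empty multiset
    return out

w = [('a', INF), ('d', 3)]           # the system with one extra timed pair
for _ in range(4):
    w = step(w)
print(w)  # [('a', inf)] -- only the infinite-lifetime objects remain
```

After enough steps the extra finite-lifetime objects are consumed and the two configurations coincide, which is the content of the remark (and of Proposition 7 below).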
In automata theory, the problem of minimizing a finite-state machine, that is, finding the machine with the minimum number of states that performs the same function, was addressed by the Myhill-Nerode theorem; a fast algorithm for doing this is Hopcroft's minimization algorithm [16]. Using a similar approach, we want to optimize a given P system with lifetimes. This can be realized through the passage of time, namely all objects and membranes which are not used in rewriting rules and have a finite lifetime are eliminated.

Proposition 7. Let Π and Π′ be two P systems with lifetimes such that:
1. Π = (VT, T, C, HT, µT, w1, . . . , wn, (R1, ρ1), . . . , (Rn, ρn), io);
2. Π′ = (VT, T, C, HT, µT, w′1, . . . , w′n, (R′1, ρ′1), . . . , (R′n, ρ′n), i′o);
3. i′o = io: the output membrane is the same for the two membrane systems;
4. wi ⊆ w′i, for all 1 ≤ i ≤ n: the initial multiset of membrane i of Π′ contains the same objects as the initial multiset of membrane i of Π, plus some other objects together with their initial lifetimes;
5. R′i = Ri ∪ {(a, t) → (a, t − 1), (a, 0) → λ | a ∈ w′i \ wi}: the set of rules R′i contains all the rules of Ri and some additional rules to simulate the passage of time for all the objects appearing in w′i \ wi;
6. ρ′i = ρi, for all 1 ≤ i ≤ n: the priority orders are the same for the two membrane systems.
Then the P systems with lifetimes Π and Π′ have the same membrane structure and evolution after max{t | (a, t) ∈ w′i \ wi, 1 ≤ i ≤ n} units of time.

Proof (sketch): After max{t | (a, t) ∈ w′i \ wi, 1 ≤ i ≤ n} units of time, all objects which appeared in w′i \ wi in the description of the membrane system Π′ are consumed. In this case the contents of the membranes of Π are the same as the contents of the membranes of Π′, and the applicable rules for the two membrane systems are only the rules of Ri, 1 ≤ i ≤ n.
□

4 Related work and conclusion

We introduce a new class of P systems, namely P systems with lifetimes. Lifetimes are assigned to each membrane and to each object. This new feature is inspired from biology, where cells and intracellular proteins have a well-defined lifetime. In order to simulate the passage of time, we use rules of the form (a, t) → (a, t − 1) for objects, and [ ](i,t) → [ ](i,t−1) for membranes. If the lifetime of an object reaches 0, then the object is consumed by applying a rule of the form (a, 0) → λ, while if the lifetime of a membrane i reaches 0, then the membrane is marked for dissolution by applying a rule of the form [ ](i,0) → [δ](i,0). After dissolving a membrane, all objects and membranes previously contained in it become elements of the immediately upper membrane. We do not obtain a more powerful formalism by adding lifetimes to objects and membranes in a P system: according to Propositions 4, 5 and 6, P systems with lifetimes and P systems without lifetimes have the same computational power. However, P systems with lifetimes are able to describe more naturally some biological phenomena involving timing, as in Example 3. A similar idea appears in the framework of spiking neural P systems: considering a duration of life for spikes, but not for cells [15]; if a spike is not used within a number of steps given by its lifetime, then it is removed. There are also some papers using time in the context of membrane computing in a manner different from this paper. In [8] a timed P system is introduced by associating with each rule a natural number representing the time of its execution. A P system which always produces the same result, independently of the execution times of the rules, is called a time-independent P system. The notion of time-independent P systems tries to capture the class of systems which are robust against the influence of the environment on the execution times of the rules of the system.
Other types of time-free systems are considered in [7, 9]. Time can also be used to "control" the computation, for instance by appropriate changes in the execution times of the rules during a computation; this possibility has been considered in [11]. Moreover, timed P automata have been proposed and investigated in [5], where ideas from timed automata have been incorporated into timed P systems. Frequency P systems have been introduced and investigated in [19]. In frequency P systems each membrane is clocked independently of the others, and each membrane operates at a certain frequency which may change during the execution. The dynamics of such systems have been investigated. If one supposes the existence of two scales of time (an external time of the user, and an internal time of the device), then it is possible to implement accelerated computing devices which have more computational power than Turing machines. This approach has been used to construct accelerated P systems, where acceleration is obtained either by decreasing the size of the reactors or by speeding up the communication channels [6]. In [10, 17] the time of occurrence of certain events is used to compute numbers. If specific events (such as the use of certain rules, or the entering/exiting of certain objects into/from the system) can be freely chosen, then it is easy to obtain computational completeness results. However, if the lengths (numbers of steps) are considered as the result of the computation, non-universal systems can be obtained. Time is considered in [17, 20] as the result of the computation, by using special "observable" configurations taken in regular sets (with the time elapsed between such configurations considered as output). The authors of the current paper have also considered time to "control" the computation in two other formalisms: mobile ambients [2-4] and distributed π-calculus [13, 14]. Timers define timeouts for various resources, making them available only for a determined period of time. The passage of time is given by a discrete global time progress function.

Acknowledgments. This work was partially supported by CNCSIS research grants IDEI 402/2007 and CNMP Parteneriate D1/1052/2007.

Bibliography

[1] B. Alberts, A. Johnson, J. Lewis, M. Raff, K. Roberts, P. Walter. Molecular Biology of the Cell, Fifth Edition. Garland Science, Taylor & Francis Group, 2008.
[2] B. Aman, G. Ciobanu. Timers and proximities for mobile ambients. Lecture Notes in Computer Science, vol. 4649, 33–43, 2007.
[3] B. Aman, G. Ciobanu. Mobile ambients with timers and types. Lecture Notes in Computer Science, vol. 4711, 50–63, 2007.
[4] B. Aman, G. Ciobanu. Timed mobile ambients for network protocols. Lecture Notes in Computer Science, vol. 5048, 234–250, 2008.
[5] R. Barbuti, A. Maggiolo-Schettini, P. Milazzo, L. Tesei. Timed P automata. Electronic Notes in Theoretical Computer Science, vol. 227, 21–36, 2009.
[6] C.S. Calude, Gh. Păun. Bio-steps beyond Turing. BioSystems, vol. 77, 175–194, 2004.
[7] M. Cavaliere, V. Deufemia. Further results on time-free P systems. International Journal of Foundations of Computer Science, vol. 17, 69–89, 2006.
[8] M. Cavaliere, D. Sburlan. Time-independent P systems. Lecture Notes in Computer Science, vol. 3365, 239–258, 2005.
[9] M. Cavaliere, D. Sburlan. Time and synchronization in membrane systems. Fundamenta Informaticae, vol. 64, 65–77, 2005.
[10] M. Cavaliere, R. Freund, A. Leitsch, Gh. Păun. Event-related outputs of computations in P systems. Journal of Automata, Languages and Combinatorics, vol. 11, 263–278, 2006.
[11] M. Cavaliere, C. Zandron. Time-driven computations in P systems. Proceedings of the Fourth Brainstorming Week on Membrane Computing, 133–143, 2006.
[12] G. Ciobanu, Gh. Păun, M.J. Pérez-Jiménez (Eds.). Applications of Membrane Computing. Springer, Natural Computing Series, 2006.
[13] G. Ciobanu, C. Prisacariu. Timers for distributed systems. Electronic Notes in Theoretical Computer Science, vol. 164(3), 81–99, 2006.
[14] G. Ciobanu, C. Prisacariu. Coordination by timers for channel-based anonymous communications. Electronic Notes in Theoretical Computer Science, vol. 175(2), 3–17, 2007.
[15] R. Freund, M. Ionescu, M. Oswald. Extended spiking neural P systems with decaying spikes and/or total spiking. International Journal of Foundations of Computer Science, vol. 19, 1223–1234, 2008.
[16] J.E. Hopcroft. An n log n algorithm for minimizing the states in a finite automaton. The Theory of Machines and Computations, Academic Press, 189–196, 1971.
[17] O.H. Ibarra, A. Păun. Computing time in computing with cells. Lecture Notes in Computer Science, vol. 3892, 112–128, 2006.
[18] H. Lodish, A. Berk, P. Matsudaira, C. Kaiser, M. Krieger, M. Scott, L. Zipursky, J. Darnell. Molecular Cell Biology, Sixth Edition. Freeman, 2008.
[19] D. Molteni, C. Ferretti, G. Mauri. Frequency membrane systems. Computing and Informatics, vol. 27(3), 467–479, 2008.
[20] H. Nagda, A. Păun, A. Rodríguez-Patón. P systems with symport/antiport and time. Lecture Notes in Computer Science, vol. 4361, 463–476, 2006.
[21] Gh. Păun. Membrane Computing. An Introduction. Springer, 2002.
[22] Web page of the P systems: http://ppage.psystems.eu.

International Journal of Computers Communications & Control, ISSN 1841-9836, 11(2):292-304, April 2016.

Numerical P Systems with Thresholds

Z. Zhang, L. Pan

Zhiqiang Zhang, Key Laboratory of Image Information Processing and Intelligent Control, School of Automation, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China, zhiqiangzhang@hust.edu.cn

Linqiang Pan*, Key Laboratory of Image Information Processing and Intelligent Control, School of Automation, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China. *Corresponding author: lqpan@mail.hust.edu.cn

Abstract: Numerical P systems are a class of P systems inspired both by the structure of living cells and by economics. In this work, a control on the use of evolution programs is introduced into numerical P systems: a threshold is considered, and a program can be applied only when the values of the variables involved in the production function of the program are greater than or equal to (lower-threshold) or smaller than or equal to (upper-threshold) the threshold. The computational power of numerical P systems with lower-thresholds or upper-thresholds is investigated. It is proved that numerical P systems with a lower-threshold, with one membrane and linear production functions, working both in the all-parallel mode and in the one-parallel mode, are universal. The result is also extended to numerical P systems with an upper-threshold, by proving the equivalence of numerical P systems with lower- and upper-thresholds.

Keywords: membrane computing, numerical P system, computation power, universality, register machine.

1 Introduction

Membrane computing is a branch of natural computing, inspired by the structure and functioning of living cells. The computing devices considered in membrane computing are called P systems. They are parallel, distributed and non-deterministic computational models.
According to their membrane structure, there are two main classes of P systems: cell-like P systems, with a hierarchical arrangement of membranes [7], and tissue-like or neural-like P systems, with a net of processor units placed in the nodes of a directed graph [3, 5]. The present work deals with a class of cell-like P systems, called numerical P systems [10]. Numerical P systems are motivated by the cell structure and by economic reality. Numerical variables are placed in the regions of a membrane structure. These variables evolve by means of programs, which are composed of two components: a production function and a repartition protocol. A production value of the region at a given step is computed by means of the production function. This value is distributed to variables from the region where the program resides, and to variables in its upper and lower neighbors, according to the repartition protocol. By a synchronized use of production functions, followed by the repartition of the obtained values, a transition is defined between system configurations. The values assumed by a distinguished variable during a computation form the set of numbers computed by the system. Many computational properties of numerical P systems have been investigated, both at the theoretical level and at the application level [4, 9, 10, 12-17]. Several strategies of using production-repartition programs have been considered: sequential (at each step, in each region, only one program can be applied), all-parallel (all programs in a region of the membrane structure are used simultaneously, with each variable participating in all programs where it appears), and one-parallel (the programs to be used in parallel are chosen in such a way that each variable participates in only one of the chosen programs). Using a threshold is an interesting strategy for controlling the use of production-repartition programs.
The idea was introduced in numerical P systems in [12], under the name of enzymatic control: a distinguished variable, called an enzyme, is associated with each program, and the program is applied only if the current value of the enzyme is not smaller than the smallest value of the variables involved in the production function of the program. The "enzymatic control" is useful in designing robot controllers based on numerical P systems [13-15]. We here introduce a related but different strategy, similar to the threshold control used in [18] for spiking neural P systems: rules can be used according to the result of comparing the number of spikes in the neuron with a constant, which corresponds to the fact that a neuron can fire when its potential is greater than or equal to its threshold. In our case, a constant is associated with the numerical P system and is used as a control threshold in two natural ways: a program can be applied only when the values of the variables involved in the production function are not smaller than (the lower-threshold case), respectively not greater than (the upper-threshold case), the constant. The computational power of such P systems is investigated. Specifically, it is proved that universality results can be obtained for such P systems with one membrane and linear production functions, working both in the all-parallel mode and in the one-parallel mode. The proof is given (by simulating register machines) only for lower-thresholds; the result is then extended to the case of upper-thresholds by proving that numerical P systems with upper-thresholds can simulate systems with lower-thresholds. The possible usefulness of the threshold control remains to be examined for applications (in robot control).
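The two activation conditions can be contrasted in a short sketch (function and variable names are ours):

```python
def enzymatic_applicable(values, prod_vars, enzyme):
    # enzymatic control [12]: the enzyme must not be smaller than the
    # smallest value among the production-function variables
    return values[enzyme] >= min(values[x] for x in prod_vars)

def lower_threshold_applicable(values, prod_vars, T):
    # lower-threshold control: every production-function variable is >= T
    return all(values[x] >= T for x in prod_vars)

v = {'x1': 4, 'x2': 7, 'e': 5}
print(enzymatic_applicable(v, ['x1', 'x2'], 'e'))      # True  (5 >= min(4, 7))
print(lower_threshold_applicable(v, ['x1', 'x2'], 5))  # False (x1 = 4 < 5)
```

Note the difference: the enzymatic condition compares against the current values of the variables themselves, while the threshold condition compares every variable against one fixed constant of the system.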
For the sake of applications, it could be useful to consider stronger versions of this control, such as taking different thresholds for different membranes or even for different programs in the system (maybe also mixing the ways of using the thresholds, in the lower or upper sense).

2 Preliminaries

Readers are assumed to be familiar with basic elements of membrane computing, e.g., from [7, 8, 11]. Here we only mention some notions and notations which are used in this paper. We denote by N the set of natural numbers, and by R the set of real numbers. The family of all recursively enumerable sets of k-dimensional vectors of non-negative integers is denoted by PS(k)RE. Since numbers can be seen as one-dimensional vectors, we can replace PS(1) by N in the notation, thus obtaining NRE. An n-register machine is a construct M = (n, P, m), where n > 0 is the number of registers, P is a finite sequence of instructions bijectively labeled with the elements of the set {0, 1, . . . , m}, 0 is the label of the first instruction to be executed, and m is the label of the halt instruction of P. Registers contain non-negative integer values. The instructions of P have the following forms:
• j : (inc(r), k, l), with 0 ≤ j < m, 0 ≤ k, l ≤ m, and 1 ≤ r ≤ n. This instruction, labeled with j, increments the value contained in register r, then non-deterministically jumps either to instruction k or to instruction l.
• j : (dec(r), k, l), with 0 ≤ j < m, 0 ≤ k, l ≤ m, and 1 ≤ r ≤ n. If the value contained in register r is positive, then decrement it by 1 and jump to instruction k. If the value of r is zero, then jump to instruction l (without altering the content of the register).
• m : halt.
A deterministic register machine is a register machine in which all inc instructions have the form j : (inc(r), k, k); we write these instructions simply as j : (inc(r), k).
A register machine M generates a set N(M) of numbers in the following way: the machine starts with all registers empty (i.e., storing the number zero); it applies the instruction with label 0 and continues to apply instructions as indicated by the labels (and made possible by the contents of the registers); if it reaches the halt instruction, then the number present in register 1 at that time is said to be generated by M. If the computation does not halt, then no number is generated. It is known that register machines generate all sets of numbers which are Turing computable, hence they characterize NRE [6]. A register machine can also be used to compute functions. A function f : N^α → N^β is computed by a register machine M if, when starting with n1 to nα in registers 1 to α, and f(n1, . . . , nα) = (r1, . . . , rβ), then M halts in the final label m with registers 1 to β containing r1 to rβ and all other registers empty; if f(n1, . . . , nα) is undefined, then the final label of M is never reached. A register machine can also be used as an accepting device. A set N of numbers is accepted by a deterministic register machine M if, when starting with x ∈ N in register 1, M halts in the final label m with all registers empty. The following propositions concerning the computational power of register machines are essential for the main results established in this work [1, 2, 6].

Proposition 1. For any partial recursive function f : N^α → N^β (α, β > 0), there exists a deterministic register machine M with (max{α, β} + 2) registers computing f.

Proposition 2. For any recursively enumerable set N ⊆ PS(α)RE of vectors of non-negative integers, there exists a deterministic register machine M with (α + 2) registers accepting N.

Proposition 3. For any recursively enumerable set N ⊆ PS(β)RE of vectors of non-negative integers, there exists a non-deterministic register machine M with (β + 2) registers generating N.
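A register machine in the sense above can be sketched as a small interpreter. The program below (labels and encoding are ours) computes f(n) = 2n by repeatedly decrementing register 1 and incrementing register 2 twice:

```python
def run_register_machine(program, registers, start=0):
    """program maps label -> ('inc', r, k, l), ('dec', r, k, l) or ('halt',);
    for a deterministic machine the inc branches satisfy k == l."""
    j = start
    while program[j][0] != 'halt':
        op = program[j]
        if op[0] == 'inc':
            registers[op[1]] += 1
            j = op[2]              # deterministic: always take branch k
        else:  # 'dec'
            if registers[op[1]] > 0:
                registers[op[1]] -= 1
                j = op[2]          # register was positive: jump to k
            else:
                j = op[3]          # register was zero: jump to l
    return registers

prog = {0: ('dec', 1, 1, 3),       # 0: (dec(1), 1, 3)
        1: ('inc', 2, 2, 2),       # 1: (inc(2), 2)
        2: ('inc', 2, 0, 0),       # 2: (inc(2), 0)
        3: ('halt',)}              # 3: halt
print(run_register_machine(prog, {1: 5, 2: 0}))  # {1: 0, 2: 10}
```

A generating machine would additionally pick one of the two inc branches non-deterministically; the universality proofs in Section 4 simulate exactly this instruction set.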
3 Numerical P systems with thresholds

We introduce the class of numerical P systems to be investigated in this work. The definition is given in its general form, for the computing case. A numerical P system with a threshold is a construct

Π = (m, H, µ, T, (Var_1, Pr_1, Var_1(0)), ..., (Var_m, Pr_m, Var_m(0)), Var_in, Var_out),

where:

• m ≥ 1 is the number of membranes;
• H is an alphabet of labels for the membranes in µ;
• µ is a rooted tree with m nodes labeled with the elements of H;
• T is a constant, called the threshold;
• Var_i, 1 ≤ i ≤ m, is the set of variables in region i;
• Var_i(0), 1 ≤ i ≤ m, is the set of initial values of the variables in region i;
• Pr_i, 1 ≤ i ≤ m, is the set of programs in region i; each program has the form

f_{l,i}(x_{1,i}, ..., x_{k_i,i}) |_T → c_{l,i,1}|v_{l,i,1} + ··· + c_{l,i,l_i}|v_{l,i,l_i},

where f_{l,i}(x_{1,i}, ..., x_{k_i,i}) is the production function and c_{l,i,1}|v_{l,i,1} + ··· + c_{l,i,l_i}|v_{l,i,l_i} is the repartition protocol of the program;

• Var_in and Var_out are the sets of input and of output variables, respectively.

The programs allow the system to evolve the values of the variables during computations. Each program is composed of two parts: a production function and a repartition protocol. The former can be any function using variables from the region that contains the program; only polynomial functions are considered here. By using the production functions in each region, the system computes a production value from the values of its variables at that time. This value is distributed to variables from the region where the program resides, and to variables in its upper (parent) and lower (children) compartments, as specified by the repartition protocol. The programs are applied under the control of the threshold T, according to two strategies: bounding the values of the variables from below (lower-threshold) or bounding them from above (upper-threshold).
More precisely, in the lower-threshold case a program can be applied only when the current value of each variable in its production function is greater than or equal to the threshold T. Dually, in the upper-threshold case, a program can be applied only when the current value of each variable in its production function is smaller than or equal to T. The repartition of the "production" takes place as follows. For a repartition protocol rp_{l,i}, the variables v_{l,i,1}, ..., v_{l,i,l_i} come from the membrane i where the program resides, from its parent membrane, and from its children membranes; formally,

{v_{l,i,1}, ..., v_{l,i,l_i}} ⊆ Var_i ∪ Var_{par(i)} ∪ (∪_{ch ∈ ch(i)} Var_ch),

where par(i) is the parent of membrane i and ch(i) is the set of children of membrane i. The coefficients c_{l,i,1}, ..., c_{l,i,l_i} are natural numbers (they may also be 0, in which case the terms "+0|x" are omitted), which specify the proportion of the current production value distributed to each variable v_{l,i,1}, ..., v_{l,i,l_i}. At time t, if we denote by

C_{l,i} = Σ_{s=1}^{l_i} c_{l,i,s}

the sum of all coefficients of the repartition protocol, and by

q_{l,i}(t) = f_{l,i}(x_{1,i}(t), ..., x_{k_i,i}(t)) / C_{l,i}    (1)

the "unitary portion", then the value ad_{l,i,r}(t) = q_{l,i}(t) · c_{l,i,r} represents the value added to variable v_{l,i,r}. If variable v_{l,i,r} appears in several repartition protocols, say rp_{l_1,i_1}, ..., rp_{l_k,i_k}, all the values ad_{l_1,i_1,r}(t), ..., ad_{l_k,i_k,r}(t) are added to it. After computing the production function value, the variables involved in the production function are reset to zero. So, if at time t variable v_{l,i,r} is involved in at least one production function, its value at time t+1 is

v_{l,i,r}(t+1) = Σ_{s=1}^{k} ad_{l_s,i_s,r}(t);

otherwise,

v_{l,i,r}(t+1) = v_{l,i,r}(t) + Σ_{s=1}^{k} ad_{l_s,i_s,r}(t).
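The repartition arithmetic of equation (1) can be followed on a concrete instance. The numbers below are a hypothetical program of our own (not one from the paper): production function f(x1, x2) = 2·x1 + x2 with repartition protocol 1|v1 + 3|v2 and lower threshold T = 1.

```python
# Worked example of the repartition arithmetic for one program.
x1, x2 = 3, 2              # current values; both >= T = 1, so the program fires
f_value = 2 * x1 + x2      # production value: 8
c_total = 1 + 3            # sum C of the repartition coefficients
q = f_value // c_total     # unitary portion q = 2 ('div' type: 8 divisible by 4)
add_v1 = 1 * q             # v1 gains 1 * q = 2
add_v2 = 3 * q             # v2 gains 3 * q = 6
x1, x2 = 0, 0              # variables used in the production are reset to zero
```

If v1 or v2 also received contributions from other applied programs, those would simply be added on top, as in the update rule above.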
Such a system evolves in the all-parallel mode (at each step, in each membrane, all programs that can be applied are applied, allowing more than one applied program to share the same variable) or in the one-parallel mode (programs are applied as in the all-parallel mode, with the restriction that each variable can appear in only one of the applied programs). A configuration consists of the values of all the system's variables at a given computation step. Initially, the variables have the values specified by Var_i(0), 1 ≤ i ≤ m. Using the programs in the way described above, a transition of the system from one configuration to the next is defined; a sequence of such transitions forms a computation. If no program in any region can be applied, we say that the system has reached a halting configuration. In this way, a numerical P system can compute a function f : N^α → N^β (α, β ≥ 0): the α argument values are introduced in the system as the initial values of the variables in Var_in, and the β-vector of the function value is obtained in the variables of Var_out in the halting configuration of the system. If the system never reaches a halting configuration, then no result is obtained. By ignoring the input variables, (non-deterministic) numerical P systems with thresholds can also be used in the generating mode, whereas by ignoring the output variables they can be used (deterministically or non-deterministically) in the accepting mode. Note that the values q_{l,i}(t) are integers only if the value of the production function f_{l,i}(x_{1,i}(t), ..., x_{k_i,i}(t)) is divisible by the corresponding sum C_{l,i}. If at every step all the values of the production functions are divisible by the respective sums, we associate with this kind of system the notation div.
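For linear production functions in a single membrane, one all-parallel, lower-threshold step is easy to implement. The sketch below is our own illustration (the function `step` and the toy programs are not from the paper); it checks applicability, computes productions with the old values, resets every variable that occurs in an applied production, and then adds the contributions:

```python
def step(vals, programs, t=1):
    """One all-parallel step of a lower-threshold numerical P system.

    programs: list of dicts with
      "prod": {var: coefficient}       (linear production function)
      "rep":  [(coefficient, var), ...] (repartition protocol)
    Returns the next configuration, or None at a halting configuration.
    """
    fired = [p for p in programs if all(vals[v] >= t for v in p["prod"])]
    if not fired:
        return None
    gain = {v: 0 for v in vals}
    reset = set()
    for p in fired:
        value = sum(c * vals[v] for v, c in p["prod"].items())
        q = value // sum(c for c, _ in p["rep"])  # exact for 'div'-type systems
        for c, v in p["rep"]:
            gain[v] += q * c
        reset.update(p["prod"])                   # production variables are zeroed
    return {v: (0 if v in reset else vals[v]) + gain[v] for v in vals}

# Two toy programs in one membrane:
progs = [
    {"prod": {"x1": 2}, "rep": [(1, "x1"), (1, "x2")]},  # 2*x1 |_1 -> 1|x1 + 1|x2
    {"prod": {"x2": 1}, "rep": [(1, "x1")]},             # x2   |_1 -> 1|x1
]
vals = {"x1": 2, "x2": 0}
vals = step(vals, progs)   # only the first program fires: {"x1": 2, "x2": 2}
vals = step(vals, progs)   # both fire: {"x1": 4, "x2": 2}
```

The one-parallel mode would additionally require choosing the applied programs so that no variable is shared between them.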
If a current production value is not divisible by the associated sum of coefficients, then one of the following decisions can be taken [10]: (i) the remainder is lost (the production which is not immediately distributed is lost); (ii) the remainder is added to the production obtained in the next step (the non-distributed production is carried over); (iii) the system simply stops and aborts, and no result is associated with that computation. We denote these three cases by lost, carry, and stop, respectively. The numerical P systems with thresholds constructed in this paper are of the div type. The set of natural numbers generated or accepted in the way described above by a system Π is denoted by Nα(Π), α ∈ {gen, acc}. We use NαTγP^D_m(poly^n(r), β) to denote the family of all sets Nα(Π) of numbers computed by systems Π working in mode α, with at most m membranes, production functions which are polynomials of degree at most n with integer coefficients and at most r variables each, using the rules in mode β ∈ {all, one}, where all stands for all-parallel and one for one-parallel, and with the threshold used in the γ ∈ {L, U} way, with L indicating the lower-threshold case and U the upper-threshold case; the superscript D indicates the use of deterministic systems (it is removed when the systems may also be non-deterministic). If one of the parameters m, n, r is not bounded, we replace it by ∗.

4 The universality of numerical P systems with lower-thresholds

In this section we investigate the computational power of numerical P systems with lower-thresholds working in the all-parallel mode and in the one-parallel mode.

Theorem 4. Each partial recursive function f : N^α → N^β (α > 0, β > 0) can be computed by a deterministic numerical P system with a lower-threshold, with only one membrane, using linear production functions with at most three variables each, and working in the all-parallel mode.
Proof: Let M = (n, P, m) be a deterministic register machine with n registers computing the function f; the initial instruction of M has the label 0 and the machine halts only when the instruction with label m is reached. According to Proposition 1, n = max{α, β} + 2 registers are enough. Before the computation starts, the values of the first α registers are assumed to be r_1, ..., r_α; when the computation halts, the values stored in registers 1, ..., β are the values of f(r_1, ..., r_α). We construct the following numerical P system with a lower-threshold:

Π_M = (1, {0}, [₀ ]₀, 1, (Var_0, Pr_0, Var_0(0)), Var_in, Var_out),

where

• Var_0 = {x_{i,1}, x_{i,2}, p_j | 1 ≤ i ≤ n, 0 ≤ j ≤ m};
• Var_0(0) is the vector of initial values of the variables, with:
– x_{i,1} = x_{i,2} = r_i, for all 1 ≤ i ≤ α;
– x_{i,1} = x_{i,2} = 0, for all α + 1 ≤ i ≤ n;
– p_j = 0, for all 0 ≤ j ≤ m, with the exception of p_0 = 1;
• Pr_0 = {3p_j |_1 → 1|x_{i,1} + 1|x_{i,2} + 1|p_k, for all instructions j : (inc(i), k) ∈ P}
∪ {p_j |_1 → 1|p_l,
x_{i,1} − x_{i,2} − p_j |_1 → 1|p_l,
x_{i,1} − x_{i,2} + p_j |_1 → 1|p_k,
2(x_{i,1} − p_j) |_1 → 1|x_{i,1} + 1|x_{i,2},
for all instructions j : (dec(i), k, l) ∈ P};
• Var_in = {x_{i,1}, x_{i,2} | 1 ≤ i ≤ α};
• Var_out = {x_{1,1}, ..., x_{β,1}}.

Note that the threshold is equal to 1. The value of register i (1 ≤ i ≤ n) is encoded by the variables x_{i,1} and x_{i,2}, whose values are always equal. The input values r_1, ..., r_α are set as the initial values of the variables x_{i,1}, x_{i,2}, 1 ≤ i ≤ α. The variables p_0, ..., p_m indicate the instruction to be simulated: during the computation their values are 1 or 0, at most one of them is equal to 1 in each step, and this indicates that the system starts to simulate the corresponding instruction of M. The increment instruction j : (inc(i), k) is simulated in one step by the program 3p_j |_1 → 1|x_{i,1} + 1|x_{i,2} + 1|p_k.
When p_j = 0, the program cannot be applied, because the value of p_j is smaller than the threshold 1. When p_j = 1, the program can be applied, since the value of p_j equals the threshold. After the application of this program, each of the variables x_{i,1}, x_{i,2}, p_k receives a portion 1, and variable p_j is reset to zero. The value p_k = 1 indicates that the instruction labeled k will be simulated at the next step, and the increment of x_{i,1}, x_{i,2} corresponds to increasing the number stored in register i by 1. So the increment instruction j : (inc(i), k) is correctly simulated. The decrement instruction j : (dec(i), k, l) is simulated in one step by the following four programs:

p_j |_1 → 1|p_l, (2)
x_{i,1} − x_{i,2} − p_j |_1 → 1|p_l, (3)
x_{i,1} − x_{i,2} + p_j |_1 → 1|p_k, (4)
2(x_{i,1} − p_j) |_1 → 1|x_{i,1} + 1|x_{i,2}. (5)

When the simulation of a decrement instruction j : (dec(i), k, l) starts, which means that p_j = 1, there are two cases.

• p_j = 1, x_{i,1} = x_{i,2} = 0. In this case, only program (2) satisfies the threshold condition. After applying program (2), variable p_j is set to zero, and variable p_l receives a contribution 1, which indicates the next instruction to be simulated. Programs (3)–(5) cannot be applied, since the variables x_{i,1} and x_{i,2} are zero (smaller than the threshold 1). Hence the values of x_{i,1} and x_{i,2} are unchanged, and the computation continues with the simulation of instruction l of the register machine M.

• p_j = 1, x_{i,1} = x_{i,2} ≥ 1. In this case all four programs satisfy the threshold condition, so all of them are applied. Program (4) transfers the production value 1 to variable p_k, which indicates the next instruction k to be simulated. By program (5), the variables x_{i,1} and x_{i,2} are decreased: their values are first zeroed, and each of them receives a contribution equal to its former value minus one. The role of program (3) is to cancel the effect of program (2).
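The net effect of programs (2)–(5) in both cases can be tallied directly. The helper below is our own check (not part of the paper's construction): with x = x_{i,1} = x_{i,2}, it returns the new register value and the contributions landing on p_k and p_l after one all-parallel step with threshold 1.

```python
def dec_step(x, pj):
    """Net effect of programs (2)-(5) for one decrement simulation step."""
    if pj == 1 and x == 0:
        # only program (2) fires: p_l gets 1, x untouched
        return 0, 0, 1
    if pj == 1 and x >= 1:
        pl = 1 + (x - x - 1)          # (2) gives +1, (3) gives x - x - 1 = -1
        pk = x - x + 1                 # (4) gives +1
        new_x = (2 * (x - 1)) // 2     # (5): q = x - 1 to each of the two copies
        return new_x, pk, pl
    return x, 0, 0                     # pj = 0: nothing fires

# register holds 5: decrement to 4, continue at label k
case_pos = dec_step(5, 1)    # -> (4, 1, 0)
# register holds 0: unchanged, continue at label l
case_zero = dec_step(0, 1)   # -> (0, 0, 1)
```

This matches the two bullet cases: p_l ends at 0 (or 1 in the zero case), p_k signals the next instruction, and x drops by exactly one when positive.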
Program (2) transfers the value of p_j to p_l, so p_l gets a contribution 1, which is canceled by program (3), which sends a contribution −1 to p_l. Hence the values of the variables x_{i,1} and x_{i,2} are decremented by one, and the next instruction to be simulated is the one labeled with k. After the simulation of any instruction of M, the values of both x_{i,1} and x_{i,2} equal the contents of register i (1 ≤ i ≤ n), while exactly one of the variables p_0, ..., p_m is equal to 1, indicating the next instruction of M to be simulated. When M reaches the halt instruction, the value of variable p_m becomes 1. Since no program contains p_m in its production function, Π_M reaches a halting configuration; the result of the computation is given by the values of the variables x_{1,1}, ..., x_{β,1}. □

According to Proposition 2, for any recursively enumerable set N ∈ Ps(α)RE of vectors of non-negative integers there exists a deterministic register machine M with α + 2 registers accepting N. For this register machine M, following the proof of Theorem 4, we can construct a numerical P system with a lower-threshold that accepts N. So the following corollary holds.

Corollary 5. Ps(α)RE = NaccT_LP^D_1(poly^1(3), all).

For numerical P systems with lower-thresholds working in the one-parallel mode, similar results hold.

Theorem 6. Each partial recursive function f : N^α → N^β (α, β > 0) can be computed by a one-membrane numerical P system with a lower-threshold working in the one-parallel mode, having linear production functions with at most five variables each.

Proof: We proceed as in the proof of Theorem 4, with the difference that here we simulate both deterministic and non-deterministic register machines. Let M = (n, P, m) be a non-deterministic register machine with n = max{α, β} + 2 registers computing the function f. As usual, the input values r_1, . . .
, r_α are stored in the first α registers before the computation starts, with all other registers empty. When the computation halts, the values f(r_1, ..., r_α) are found in registers 1, ..., β. The numerical P system with a lower-threshold simulating the register machine M is constructed as follows:

Π_M = (1, {0}, [₀ ]₀, 1, (Var_0, Pr_0, Var_0(0)), Var_in, Var_out),

where

• Var_0 = {p_{j,g}, x_{i,g} | 1 ≤ g ≤ 5, 1 ≤ i ≤ n, 0 ≤ j ≤ m};
• Var_0(0) is the vector of initial values of the variables, with:
– x_{i,g} = r_i, for all 1 ≤ i ≤ α, 1 ≤ g ≤ 5;
– x_{i,g} = 0, for all α + 1 ≤ i ≤ n, 1 ≤ g ≤ 5;
– p_{j,g} = 0, for all 0 ≤ j ≤ m, 1 ≤ g ≤ 5, with the exception of p_{0,g} = 1, for all 1 ≤ g ≤ 5;
• Pr_0 = {2Σ_{g=1}^{5} p_{j,g} |_1 → Σ_{g=1}^{5} 1|x_{i,g} + Σ_{g=1}^{5} 1|p_{k,g},
2Σ_{g=1}^{5} p_{j,g} |_1 → Σ_{g=1}^{5} 1|x_{i,g} + Σ_{g=1}^{5} 1|p_{l,g},
for all instructions j : (inc(i), k, l) ∈ P}
∪ {5(x_{i,1} − p_{j,1}) |_1 → Σ_{g=1}^{5} 1|x_{i,g},
8(x_{i,2} − x_{i,3} + p_{j,2}) |_1 → Σ_{g=1}^{5} 1|p_{k,g} + Σ_{g=1}^{3} 1|p_{j,g},
−5(x_{i,4} − x_{i,5} + p_{j,3}) |_1 → Σ_{g=1}^{5} 1|p_{l,g},
5p_{j,4} |_1 → Σ_{g=1}^{5} 1|p_{l,g},
−3p_{j,5} |_1 → Σ_{g=1}^{3} 1|p_{j,g},
for all instructions j : (dec(i), k, l) ∈ P};
• Var_in = {x_{i,g} | 1 ≤ i ≤ α, 1 ≤ g ≤ 5};
• Var_out = {x_{1,1}, ..., x_{β,1}}.

In order to ensure that at each step a variable appears in the production function of at most one applied program, the value of register i (1 ≤ i ≤ n) is stored in five variables x_{i,g}, 1 ≤ g ≤ 5, and the system uses five variables p_{j,g}, 1 ≤ g ≤ 5, to control the simulation of the instruction with label j of the register machine M (in what follows, for brevity, x_{i,g} and p_{j,g} stand for all five variables, 1 ≤ g ≤ 5). During the computation, the variables x_{i,g} are always equal to each other, and the same holds for the variables p_{j,g}. The input values r_i (1 ≤ i ≤ α) are introduced into the system as the initial values of the variables x_{i,g} (1 ≤ i ≤ α). When instruction j is simulated, all five variables p_{j,g} are equal to 1, while all the others are zero.
The simulation of an increment instruction j : (inc(i), k, l) is done in one step by the following two programs:

2Σ_{g=1}^{5} p_{j,g} |_1 → Σ_{g=1}^{5} 1|x_{i,g} + Σ_{g=1}^{5} 1|p_{k,g}, (6)
2Σ_{g=1}^{5} p_{j,g} |_1 → Σ_{g=1}^{5} 1|x_{i,g} + Σ_{g=1}^{5} 1|p_{l,g}. (7)

If p_{j,g} = 0, then programs (6) and (7) cannot be executed, since the values of the variables p_{j,g} = 0 are smaller than the threshold 1. If p_{j,g} = 1, then only one of programs (6) and (7) can be applied, because their production functions share the same variables (the system works in the one-parallel mode). If program (6) (resp., program (7)) is applied, then each variable x_{i,g} is increased by one, the variables p_{k,g} (resp., p_{l,g}) are set to 1, so the system starts to simulate instruction k (resp., instruction l), and the variables p_{j,g} are reset to zero. If M is deterministic, then the simulation of an instruction j : (inc(i), k) is performed by program (6) alone; in this case no competition occurs between the programs, and the simulation is deterministic. The simulation of a decrement instruction j : (dec(i), k, l) is done in one step by the following five programs:

5(x_{i,1} − p_{j,1}) |_1 → Σ_{g=1}^{5} 1|x_{i,g}, (8)
8(x_{i,2} − x_{i,3} + p_{j,2}) |_1 → Σ_{g=1}^{5} 1|p_{k,g} + Σ_{g=1}^{3} 1|p_{j,g}, (9)
−5(x_{i,4} − x_{i,5} + p_{j,3}) |_1 → Σ_{g=1}^{5} 1|p_{l,g}, (10)
5p_{j,4} |_1 → Σ_{g=1}^{5} 1|p_{l,g}, (11)
−3p_{j,5} |_1 → Σ_{g=1}^{3} 1|p_{j,g}. (12)

If p_{j,g} = 0, then programs (8)–(12) cannot be applied, because the values p_{j,g} = 0 are smaller than the threshold; so, when p_{j,g} = 0, no undesirable simulation steps can occur. If p_{j,g} = 1 and x_{i,g} = 0, then the values of x_{i,g} should remain unchanged and the computation should jump to the simulation of instruction l, which is realized by programs (11) and (12) in one step. (Note that in this case programs (8)–(10) cannot be applied, since the values of the variables x_{i,g} are smaller than the threshold.)
The effect of program (11) is to reset variable p_{j,4} to zero and to give a contribution 1 to each of the variables p_{l,g}, whose values become 1 after the application of this program, thus correctly simulating the passage to instruction l. At the same time, program (12) is applied, whose role is to set all the variables p_{j,g} (g ≠ 4) to zero: variable p_{j,5} appears in the production function, so its initial value is canceled, and since it receives no contribution, its final value is zero; the variables p_{j,1}, p_{j,2}, p_{j,3} have initial value 1 and receive a contribution −1, hence their final values are zero, as needed. If p_{j,g} = 1 and x_{i,g} ≥ 1, then the values of x_{i,g} should be decremented and the computation should proceed to the simulation of instruction k. In this case, all five programs (8)–(12) can be applied. Programs (8) and (9) decrement the values of x_{i,g} and increment the values of p_{k,g} by 1; the other programs have auxiliary roles. Note that all the variables p_{j,g} and x_{i,g} appear in the production functions of programs (8)–(12), so their values are first reset to zero, and their final values are the sums of the contributions they receive. The variables x_{i,g} appear only in the repartition protocol of program (8), which gives each of them a contribution equal to its initial value minus 1, thus correctly decrementing their values by one. The variables p_{j,1}, p_{j,2}, p_{j,3} receive a contribution 1 from program (9) and a contribution −1 from program (12), so their final values equal 0. The variables p_{j,4} and p_{j,5} do not appear in any repartition protocol of programs (8)–(12), so their final values are zero. The role of program (10) is to cancel the effect of program (11): program (11) sends a contribution 1, and simultaneously program (10) sends a contribution −1, to each of the variables p_{l,g}, whose final values are hence equal to 0. Each of the variables p_{k,g} receives a contribution 1, so their final values are 1, as needed.
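The contribution bookkeeping for the case p_{j,g} = 1, x_{i,g} = v ≥ 1 can be tallied mechanically. The helper below is our own check (not part of the paper): it sums, program by program, the unitary portions q that programs (8)–(12) distribute; since every variable occurring in some production function is reset first, the final values are exactly these sums.

```python
def dec_case_contributions(v):
    """Net contributions of programs (8)-(12) when p_{j,g}=1 and x_{i,g}=v>=1."""
    contrib = {"x": 0, "pk": 0, "pl": 0, "pj123": 0, "pj4": 0, "pj5": 0}
    # (8): production 5(v - 1), coefficients sum to 5, so q = v - 1 per x copy
    contrib["x"] += (5 * (v - 1)) // 5
    # (9): production 8(v - v + 1) = 8, coefficients sum to 5 + 3 = 8, q = 1
    contrib["pk"] += 1
    contrib["pj123"] += 1
    # (10): production -5(v - v + 1) = -5 over 5 coefficients, q = -1
    contrib["pl"] += -1
    # (11): production 5 * 1 = 5 over 5 coefficients, q = 1
    contrib["pl"] += 1
    # (12): production -3 * 1 = -3 over 3 coefficients, q = -1
    contrib["pj123"] += -1
    return contrib

tally = dec_case_contributions(4)
# x copies end at v - 1 = 3, p_{k,g} at 1, everything else at 0
```

This reproduces the argument in the text: the register drops by one, p_{k,g} signals instruction k, and all cancellations come out to zero.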
After the simulation of each instruction of M, all the variables x_{i,g} are equal to the contents of register i (1 ≤ i ≤ n), while the variables p_{j,g} (0 ≤ j ≤ m) correctly indicate the next instruction of M to be simulated: when the program counter of M reaches a value k, the variables p_{k,g} are equal to 1. In particular, when the program counter reaches m, the variables p_{m,g} are equal to 1; since no program contains the variables p_{m,g} in its production function, Π_M reaches a halting configuration, and the result of the computation is given by the values of the variables x_{1,1}, ..., x_{β,1}. □

According to Propositions 2 and 3, for any recursively enumerable set N ∈ Ps(α)RE of vectors of non-negative integers there exists a deterministic (respectively, non-deterministic) register machine M with α + 2 registers accepting (respectively, generating) N. For this register machine M, following the proof of Theorem 6, we can construct a deterministic (respectively, non-deterministic) numerical P system with a lower-threshold that accepts (respectively, generates) N.

Corollary 7. Ps(α)RE = NgenT_LP_1(poly^1(5), one) = NaccT_LP^D_1(poly^1(5), one).

In conclusion, we obtain the following characterizations of NRE.

Theorem 8. NRE = NaccT_LP^D_1(poly^1(3), all) = NgenT_LP_1(poly^1(5), one) = NaccT_LP^D_1(poly^1(5), one).

Proof: The first equality follows from Corollary 5 with α = 1; similarly, taking α = 1 in Corollary 7 yields the last two equalities. □

5 The universality of numerical P systems with upper-thresholds

In this section we prove that the computational power of numerical P systems with upper-thresholds (UTNP systems, for short) is equivalent to that of numerical P systems with lower-thresholds (LTNP systems, for short).

Lemma 9.
For any numerical P system with a lower-threshold Π_L, there is a numerical P system Π_U with an upper-threshold, with the same variables, such that the corresponding variables of Π_L and Π_U have equal values but of opposite sign.

Proof: Let Π_L be a numerical P system with a lower-threshold of the form considered in the previous sections. We construct a numerical P system with an upper-threshold Π_U in the following way. Π_U has the same membrane structure as Π_L, and in each membrane the two systems have the same variables. The initial values of the variables in Π_U are those of Π_L multiplied by −1, and similarly for the thresholds of the two systems (they are equal, but of opposite signs). For each program

f_L(x_1, ..., x_n) |_T → c_1|v_1 + ··· + c_l|v_l

in Π_L, we introduce in Π_U the program

f_U(x_1, ..., x_n) |_{−T} → c_1|v_1 + ··· + c_l|v_l,

where f_U(x_1, ..., x_n) is constructed as follows:

• if the production function f_L(x_1, ..., x_n) is odd, that is, f_L(−x_1, ..., −x_n) = −f_L(x_1, ..., x_n), then f_U(x_1, ..., x_n) = f_L(x_1, ..., x_n);

• if f_L(x_1, ..., x_n) is even, that is, f_L(−x_1, ..., −x_n) = f_L(x_1, ..., x_n), then f_U(x_1, ..., x_n) = −f_L(x_1, ..., x_n);

• if f_L(x_1, ..., x_n) is neither even nor odd, then it can be written as the sum of an even function and an odd function,

f_L(x_1, ..., x_n) = [f_L(x_1, ..., x_n) + f_L(−x_1, ..., −x_n)]/2 + [f_L(x_1, ..., x_n) − f_L(−x_1, ..., −x_n)]/2,

and then

f_U(x_1, ..., x_n) = −[f_L(x_1, ..., x_n) + f_L(−x_1, ..., −x_n)]/2 + [f_L(x_1, ..., x_n) − f_L(−x_1, ..., −x_n)]/2 = −f_L(−x_1, ..., −x_n).

(In all three cases, f_U(x_1, ..., x_n) = −f_L(−x_1, ..., −x_n).)
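The construction can be tried on a sample production function. The polynomial below is our own (hypothetical) example, neither even nor odd, and the check confirms the key identity: f_U applied to the negated values equals the negation of f_L applied to the original values.

```python
# Sample production function of Lemma 9's mixed case: 2*x1*x2 is even in
# (x1, x2) jointly negated, 3*x1 is odd, and -5 is even.
def f_l(x1, x2):
    return 2 * x1 * x2 + 3 * x1 - 5

def f_u(x1, x2):
    # the closed form derived in the lemma: f_U(x) = -f_L(-x)
    return -f_l(-x1, -x2)

# If Pi_U holds the negated values -x1, -x2, its production mirrors Pi_L's:
pairs = [(3, 4), (0, 7), (-2, 5)]
mirror_ok = all(f_u(-a, -b) == -f_l(a, b) for a, b in pairs)
```

The same sign-mirroring holds for the threshold test: x_i ≥ T in Π_L is equivalent to −x_i ≤ −T in Π_U, so corresponding programs fire simultaneously.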
Based on the previous construction of the UTNP system Π_U, we get that, if the two systems are deterministic and work in the all-parallel mode, then at any step a program in Π_L and its corresponding program in Π_U are either both applicable or both inapplicable, and the two production functions have equal values but of opposite sign. Thus, at any step, the variables of the two systems receive equal contributions of opposite sign; this is true also for the initial values, hence the values of the variables are always equal but of opposite sign, and the two systems halt simultaneously. In conclusion, the lemma holds whether Π_L, Π_U work in the computing mode or in the generating mode. □

Corollary 10. Ps(α)RE = NaccT_UP^D_1(poly^1(3), all).

Proof: According to Proposition 2, for any recursively enumerable set N ∈ Ps(α)RE of vectors of non-negative integers there exists a deterministic register machine M with α + 2 registers accepting N. For this register machine M, following the proof of Theorem 4, we can construct a numerical P system with a lower-threshold Π_M that accepts N. For Π_M, according to Lemma 9, we can construct a UTNP system Π_U with "contrary" configurations (equal values of the variables, but of opposite signs). We now add the programs

1 + p_m − x_{i,1} |_{−1} → 1|x_{i,1}, 1 ≤ i ≤ β, (13)

to Π_U, thus obtaining a new UTNP system Π′_U. The initial value of p_m is 0, hence programs (13) cannot be applied; as long as p_m = 0, there is no difference between the functioning of Π′_U and Π_U. When p_m becomes equal to −1, Π_U reaches a halting configuration, while Π′_U continues by executing programs (13). The effect of programs (13) is to transform the variables x_{i,1} ≤ −1 into their contraries, so the system Π′_U produces the same output as Π_M. □

In a similar way to the proof of Corollary 7, we can prove the following corollary.

Corollary 11. Ps(α)RE = NgenT_UP_1(poly^1(5), one) = NaccT_UP^D_1(poly^1(5), one).

Setting α = 1 in Corollary 10 and Corollary 11, we obtain the following characterizations.
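The arithmetic of the flipping programs (13) is worth spelling out. The helper below is our own sketch of one such program in isolation (not the paper's notation): with upper threshold −1, the program is applicable exactly when both p_m and x_{i,1} are at most −1, and it then replaces x_{i,1} = −v by the production value 1 + p_m − x_{i,1} = 1 + (−1) + v = v.

```python
def flip(pm, x, t=-1):
    """One application of program (13) to a single output variable x."""
    if pm <= t and x <= t:          # upper-threshold applicability test
        production = 1 + pm - x     # = v when pm = -1 and x = -v
        return production * 1       # protocol 1|x: x is reset, then receives v
    return x                        # program not applicable; x is unchanged

# p_m = -1 in the halting configuration of Pi_U, so x = -6 flips to 6:
flipped = flip(-1, -6)
```

One application suffices per variable: after it, x_{i,1} is positive (or was 0 and never satisfied the threshold), so the program cannot fire again on it.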
Theorem 12. NRE = NaccT_UP^D_1(poly^1(3), all) = NgenT_UP_1(poly^1(5), one) = NaccT_UP^D_1(poly^1(5), one).

6 Conclusions and discussions

In this work, we have introduced thresholds into numerical P systems and investigated the computational power of the resulting systems. Specifically, we proved that universality is obtained for such P systems with one membrane and linear production functions, working both in the all-parallel mode and in the one-parallel mode. The rules of the numerical P systems with thresholds constructed in Section 4 are applied in the all-parallel mode and in the one-parallel mode, respectively; it remains open what the computational power of numerical P systems with thresholds working in the sequential mode is. The polynomial production functions used here have at most 3 variables for all-parallel systems and 5 variables for one-parallel systems; it is a natural question whether these numbers of variables can be decreased. The thresholds are used in the sense of lower bounds and upper bounds; other ways of using them could be of interest, for example, applying a program only if the values of all (or some of) its variables are strictly greater (or smaller) than the threshold. Numerical P systems and enzymatic numerical P systems have already been used in robot control [4, 10, 16, 17]; it remains to check whether numerical P systems with thresholds are also useful for such applications.

Acknowledgements. This work was supported by the National Natural Science Foundation of China (61320106005 and 61472154), the Ph.D. Programs Foundation of the Ministry of Education of China (2012014213008), the Natural Science Foundation of Hubei Province (2011CDA027), and the Innovation Scientists and Technicians Troop Construction Projects of Henan Province (154200510012).

Bibliography

[1] Freund, R.; Oswald, M. (2002); GP systems with forbidding context.
Fundamenta Informaticae 49(1-3), 81–102.
[2] Freund, R.; Păun, G. (2001); On the number of non-terminal symbols in graph-controlled, programmed and matrix grammars. In: Machines, Computations, and Universality, 3rd Internat. Conf., MCU, Lecture Notes in Computer Science, vol. 2055, Springer, Berlin, 214–225.
[3] Ionescu, M.; Păun, G.; Yokomori, T. (2006); Spiking neural P systems. Fundamenta Informaticae 71(2-3), 279–308.
[4] Leporati, A.; Porreca, A.E.; Zandron, C.; Mauri, G. (2013); Improving universality results on parallel enzymatic numerical P systems. Proc. 11th Brainstorming Week on Membrane Computing, Sevilla, 4–8.
[5] Martín-Vide, C.; Pazos, J.; Păun, G.; Rodriguez-Paton, A. (2003); Tissue P systems. Theoretical Computer Science 296(2), 295–326.
[6] Minsky, M.L. (1967); Computation: Finite and Infinite Machines. Prentice-Hall, Inc., Englewood Cliffs, New Jersey.
[7] Păun, G. (2000); Computing with membranes. Journal of Computer and System Sciences 61(1), 108–143.
[8] Păun, G. (2002); Membrane Computing – An Introduction. Springer-Verlag, Berlin.
[9] Păun, G. (2013); Some open problems about numerical P systems. Proc. 11th Brainstorming Week on Membrane Computing, Sevilla, 245–252.
[10] Păun, G.; Păun, R. (2006); Membrane computing and economics: numerical P systems. Fundamenta Informaticae 73(1), 213–227.
[11] Păun, G.; Rozenberg, G.; Salomaa, A. (eds.) (2010); The Oxford Handbook of Membrane Computing. Oxford University Press, New York.
[12] Pavel, A.B.; Arsene, O.; Buiu, C. (2010); Enzymatic numerical P systems – a new class of membrane computing systems. In: IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA), 1331–1336.
[13] Pavel, A.B.; Buiu, C. (2012); Using enzymatic numerical P systems for modeling mobile robot controllers. Natural Computing 11(3), 387–393.
[14] Pavel, A.B.; Vasile, C.I.; Dumitrache, I. (2012); Robot localization implemented with enzymatic numerical P systems.
In: Biomimetic and Biohybrid Systems, Springer, 204–215.
[15] Pavel, A.B.; Vasile, C.I.; Dumitrache, I. (2013); Membrane computing in robotics. In: Beyond Artificial Intelligence, Springer, Berlin, 125–135.
[16] Vasile, C.I.; Pavel, A.B.; Dumitrache, I. (2013); Universality of enzymatic numerical P systems. International Journal of Computer Mathematics 90(4), 869–879.
[17] Vasile, C.I.; Pavel, A.B.; Dumitrache, I.; Păun, G. (2012); On the power of enzymatic numerical P systems. Acta Informatica 49(6), 395–412.
[18] Wang, J.; Hoogeboom, H.J.; Pan, L.; Păun, G.; Pérez-Jiménez, M.J. (2010); Spiking neural P systems with weights. Neural Computation 22(10), 2615–2646.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 3, pp. 229-240

A Proposed Genetic Algorithm Coding for Flow-Shop Scheduling Problems

Hela Boukef, Mohamed Benrejeb, Pierre Borne

Abstract: A new genetic algorithm coding is proposed in this paper to solve flow-shop scheduling problems. To show the efficiency of the considered approach, two examples, in the pharmaceutical and agro-food industries, are considered, with the minimization of the different costs related to each problem as the scope. Multi-objective optimization is thus used and its performance demonstrated.

Keywords: genetic algorithm, operations coding, flow-shop problems, multi-objective optimization, pharmaceutical industries, agro-food industries.

1 Introduction

In shop scheduling, there are three basic models classified according to the structure of the processing route:
• the job-shop scheduling problem is an operation sequencing problem on multiple machines subject to precedence constraints among the operations;
• the flow-shop scheduling problem involves a set of jobs that flow through multiple stages in the same order;
• the open-shop scheduling problem is a problem where the workshop has several resources and the routing of all the operations is free.
In pharmaceutical and agro-food industries, the execution of tasks needs the use of several resources in a single order: it is thus a flow-shop problem. Flow-shop problems have received considerable attention from researchers during the last decades, and the scheduling criterion most frequently used has been the maximum completion time [3], [7], [4], [21], [15], [26]. However, the analysis of the performance of a schedule often involves more than one aspect and therefore requires a multi-objective treatment [8], [17], [25]. The aim of multi-objective optimization is to deal with many criteria at the same time. This paper is focused on minimizing the total costs related to the manufacturing and delivery processes. To solve this kind of problem, exact methods and approximate methods can be applied. Exact methods, such as branch and bound [1] and linear programming methods [23], are suited to small-size problems, in contrast to approximate methods, such as tabu search [10], simulated annealing [16], genetic algorithms [13] and ant colony methods [6], which address large-size ones [14]. In this paper, we focus on the use of the genetic algorithm method. The principal scope of this method, based on the natural selection mechanism, is the improvement of robustness and of the balance between cost and performance [11]. Genetic algorithms became famous due to their efficiency in solving combinatorial optimization problems [19]. Their application fields are very broad, ranging from complex real applications, such as pipeline flow control or robot path planning, to theoretical combinatorial problems. The paper is organized as follows. First, the notations are introduced. Section 2 deals with the presentation and formulation of pharmaceutical and agro-food scheduling problems. Section 3 tackles the choice of the multi-objective evaluation.
section 4 presents genetic algorithms and the proposed structured coding of list operations. in section 5, two examples, concerning agro-food and pharmaceutical industries scheduling, are treated by using this algorithm.
copyright © 2006-2007 by ccc publications
notations:
pi : finished product after operation oi
pik : conditioning time of oi on machine k
cpik : ending processing time of pi on machine k
dlivpi : delivery date of pi
cstkpi : storage cost by time unit of pi
for pharmaceutical industries:
t pik : preparation time on machine k before oi
t arrik : stoppage time during oi on machine k
ctotprod : total production cost
cuk : production cost by time unit on machine k
donettik : cleaning operations time on machine k
doch fik : size changing operations time on machine k
crdi : costs of distribution delays of pi
c fi : manufacturing cost of pi
for agro-food industries:
ti : effective start time of manufacturing oi
ri : earliest start time of oi
γi : effective end time of oi
cik : kth component from the whole components of oi
vik : limit validity date of a component cik
dvpi : lifespan of pi
drpi : back delay of pi
previk : income of a component cik
pvenpi : unit sale price of pi
2 presentation and formulation of pharmaceutical and agro-food scheduling problem
2.1 pharmaceutical industries scheduling case
in pharmaceutical industries, many problems can appear in the production workshop. several cleaning and format-changing operations have to be managed jointly with the manufacturing operations. the unproductive times generated by these operations are rather significant, given that the launch time of a product's manufacturing depends on the product that precedes it. from one product to another, the change involves certain modifications at the level of each machine. these problems can delay the start of production and postpone its end.
moreover, stoppage times due to machine breakdowns and production times are as many factors that breed important production costs [2]. costs of distribution delays can also be calculated taking into account storage costs, date of production and end delivery date. let us consider the objective functions f1 and f2. they represent the minimization of the total cost of production, ctotprod, and the minimization of the distribution delays cost of the product pi, crdi. the total cost of production has the following global expression: ctotprod = ∑ 1< j
time disturbances and filtering of sensors signals in tolerant multi-product job-shops with time constraints
definition 1. a t-time petri net is a pair < r; is′ >, where r is a petri net and is′ : t → (q+)×(q+ ∪ +∞) [10].
definition 2. an interpreted t-time petri net is a t-time petri net including an operative part whose state is defined by a set of variables. this state is modified by the operations associated to the places. it determines the value of the conditions (predicates) which are associated to the transitions. the watchdog mechanism is easily represented by an interpreted t-time petri net. for example, figure 2 shows the detection of a normal state and an abnormal one. if the sensor signal arrives in [tm, tm[, the system is in a normal state. if the sensor signal arrives at the instant tm, the system is in an abnormal one.
figure 2: watchdog mechanism with interpreted t-time petri net
within the framework of the robustness integration in the supervision of manufacturing systems with time constraints, we define (figure 3) five time intervals, namely: i1i = [∆tm′′/cri , ∆tm′/cri [, i2i = [∆tm′/cri , ∆tm/cri [, i3i = [∆tm/cri , ∆tm/cri [, i4i = [∆tm/cri , ∆tm′/cri [ and i5i = [∆tm′/cri , ∆tm′′/cri [. the margin of passive robustness is available in (i2i ∪ i5i) whereas the margin of active robustness is in (i1i ∪ i5i).
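assuming six explicit boundary values (the marks distinguishing the lower and upper bounds of ∆tm, ∆tm′ and ∆tm′′ were lost in extraction, so the sketch uses hypothetical names), the classification of a sensor-signal arrival time into the five intervals above can be sketched as:

```python
def classify(arrival, t_act_adv, t_pas_adv, t_min, t_max, t_pas_del, t_act_del):
    """classify a sensor-signal arrival time against the six boundaries
    t_act_adv < t_pas_adv < t_min < t_max < t_pas_del < t_act_del
    (hypothetical names for the bounds written ∆tm''/cri, ∆tm'/cri and
    ∆tm/cri in the text)."""
    if arrival < t_act_adv:
        return "control freezing"              # before i1i: too early to recover
    if arrival < t_pas_adv:
        return "active robustness (advance)"   # i1i: the control must be changed
    if arrival < t_min:
        return "passive robustness (advance)"  # i2i: absorbed without control change
    if arrival < t_max:
        return "normal functioning"            # i3i
    if arrival < t_pas_del:
        return "passive robustness (delay)"    # i4i
    if arrival < t_act_del:
        return "active robustness (delay)"     # i5i
    return "control freezing"                  # at or after the last bound
```

the two outer returns correspond to the freezing cases of the filtering mechanism described next.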
from a functional point of view, there are three intervals of use in which it is possible to prove the validity: the interval of normal functioning, the interval of passive robustness and the interval of active robustness. in the case of an abnormal functioning, there is always a duality of advance and delay scenarios. the adopted filtering mechanism is described by the interpreted t-time petri net of figure 4. several cases can arise [11].
• if there is absence of the order (not ai) and presence of cri, the control is frozen and a symptom s2i is generated (place p3).
• if the sensor signal cri arrives in the time interval [0, ∆tm′′/cri [, the control is frozen and a symptom s2i is generated (place p3).
• if the sensor signal cri arrives in the time interval i1i = [∆tm′′/cri , ∆tm′/cri [, the control is changed (active robustness to an advance) and a symptom s2i is memorized (place p4).
• if the sensor signal cri arrives in the time interval i2i = [∆tm′/cri , ∆tm/cri [, there is no change of the control (passive robustness to an advance) but only a memorizing of a symptom s2i (place p5).
• if the sensor signal cri arrives in the time interval i3i = [∆tm/cri , ∆tm/cri [, the behavior of the system is normal (place p6).
nabil jerbi, simon collart dutilleul, etienne craye, mohamed benrejeb
figure 3: robustness integration in the operation associated model
• at the instant ∆tm/cri (transition t7), a symptom s1i is automatically memorized (place p7).
• if the sensor signal cri arrives in the time interval i4i = [∆tm/cri , ∆tm′/cri [, it is a case of passive robustness to a delay (place p8). the symptom s1i is already memorized (place p7).
• if the sensor signal cri arrives in the time interval i5i = [∆tm′/cri , ∆tm′′/cri [, a change of the control is necessary (active robustness to a delay, place p9).
• at the instant ∆tm′′/cri (transition t10), the control is frozen (place p10).
3 localization of time disturbances in a given topology
when, for example, the filtering mechanism executes a control freezing, it is necessary to know where the initial disturbance occurred. this task is performed on a model of the workshop which uses a p-time petri net in order to integrate the staying time constraints in its structure. this aspect is presented in the following section.
3.1 controlled p-time petri net
the formal definition of a p-time petri net is given by a pair < r; is >, where [12]:
• r is a marked petri net,
• is : p → (q+ ∪ 0)×(q+ ∪ +∞), pi → isi = [ai, bi] with 0 ≤ ai ≤ bi.
isi defines the static interval of the staying time of a mark in the place pi belonging to the set of places p (q+ is the set of positive rational numbers). a mark in the place pi is taken into account for transition validation when it has stayed in pi at least a duration ai and no longer than bi. after the duration bi, the token is dead.
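the staying-time rule of the definition above (a mark takes part in transition validation between a and b, and dies after b) can be sketched as a small class; the class and method names are illustrative:

```python
class PTimePlace:
    """place of a p-time petri net with static interval [a, b]: a token takes
    part in transition validation only after staying at least a, and is dead
    once it has stayed longer than b (the definition above)."""
    def __init__(self, a, b=float("inf")):
        assert 0 <= a <= b
        self.a, self.b = a, b

    def token_state(self, sojourn):
        if sojourn < self.a:
            return "waiting"    # not yet usable for firing
        if sojourn <= self.b:
            return "available"  # inside the static interval
        return "dead"           # stayed beyond b
```

with an interval such as [5, 12], a token that has stayed 7 time units is available, while after 13 it is dead; an unbounded place like [8, +∞] never kills its token.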
using [15], a controlled p-time petri net is defined as a quadruplet rpc = (rp, ϕ, u, u0) such that:
• rp is a p-time petri net which describes the open-loop system,
figure 4: robustness integration in the filtering mechanism of sensors signals
• ϕ is an application from the set of places (p) toward the set of operations (γ): ϕ : p → γ,
• u is the external control of the set of transitions (t), built on predicates using the occurrence of internal or external observable events of the system: u : t → {0, 1},
• u0 is the initial value of the predicate vector.
let us denote by:
• to : the set of observable transitions,
• tuo : the set of non observable transitions,
• ts : the set of synchronization transitions,
• tns : the set of non synchronization transitions,
• tp : the set of parallelism transitions,
• t◦i (resp. ◦ti) : the output (resp. the input) places of the transition ti,
• p◦i (resp. ◦pi) : the output (resp. the input) transitions of the place pi,
• qie : the expected sojourn time of the token in the place pi,
• ste(n) : the nth expected firing instant of the transition t,
• st(n) : the nth effective firing instant of the transition t.
3.2 functional decomposition
a workshop in repetitive functioning mode is modeled by a strongly connected event graph (sceg) [13].
the performances of a sceg running in mono-periodic functioning mode are proved to be the same as when using k-periodic functioning [13]. consequently, a mono-periodic functioning is used in order to decrease the complexity of the supervisory problem [14]. in this case, for each transition t, ste(n + 1) = ste(n) + π0, where π0 is the period of the periodic functioning of the given discrete event system. in this paper, the scheduling task is assumed to be already done. therefore, the sceg corresponding to the system is provided. moreover, the setting of the transitions firing instants is fixed too. as the sojourn times in places do not have the same functional signification when they are included in the sequential process of a product as when they are associated to a free resource, a decomposition of the p-time petri net model into four sets is made using [15]. the assumption of multi-product job-shops without assembling tasks, as established in [16], is used:
• ru is the set of places representing the used machines,
• rn corresponds to the set of places representing the free machines which are shared between manufacturing circuits,
• transc is the set of places representing the loaded transport resources,
• transnc is the set of places representing the unloaded transport resources (or the interconnected buffers).
figure 5 shows a p-time petri net (g) modeling a system composed of two sequential processes go1 and go2 with two shared machines (m1, m2), where: ru = {p2, p4, p11, p13, p15}, rn = {p6, p7, p8, p9}, transc = {p1, p3, p10, p12, p14}, transnc = {p5, p16}, go1 = (t12, p10, t6, p11, t7, p12, t8, p13, t9, p14, t10, p15, t11) and go2 = (t5, p1, t1, p2, t2, p3, t3, p4, t4).
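the mono-periodic rule ste(n + 1) = ste(n) + π0 determines every expected firing instant from the first one; a minimal sketch (the function names are illustrative):

```python
def expected_firing(st1, period, n):
    """n-th expected firing instant under mono-periodic functioning:
    ste(n) = ste(1) + (n - 1) * pi0, the iterated form of
    ste(n + 1) = ste(n) + pi0."""
    return st1 + (n - 1) * period

def expected_schedule(initial_instants, period, n):
    """expected firing instants of every transition in cycle n."""
    return {t: expected_firing(s, period, n) for t, s in initial_instants.items()}
```

with, e.g., ste(1) = 15 and π0 = 40, the third expected firing instant is 15 + 2·40 = 95.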
the intervals (isi) and the expected staying times (qie) associated to the places (pi) are: is1 = [30, 50], q1e = 38, is2 = [5, 12], q2e = 7, is3 = [10, 20], q3e = 15, is4 = [5, 20], q4e = 10, is5 = [1, +∞], q5e = 10, is6 = [0, +∞], q6e = 5, is7 = [0, +∞], q7e = 8, is8 = [8, +∞], q8e = 13, is9 = [8, +∞], q9e = 15, is10 = [5, 15], q10e = 12, is11 = [15, 20], q11e = 17, is12 = [3, 7], q12e = 6, is13 = [2, 20], q13e = 5, is14 = [2, 7], q14e = 5, is15 = [15, 20], q15e = 16, is16 = [1, +∞] and q16e = 19. the initial expected firing instants of each transition are: st1e(1) = 15, st2e(1) = 22, st3e(1) = 37, st4e(1) = 7, st5e(1) = 17, st6e(1) = 12, st7e(1) = 29, st8e(1) = 35, st9e(1) = 0, st10e(1) = 5, st11e(1) = 21 and st12e(1) = 0. the repetitive functioning mode is characterized by the period π0 = 40.
definition 3. a mono-synchronized subpath is a path containing one and only one synchronization transition, which is its last node.
definition 4. an elementary mono-synchronized subpath is a mono-synchronized subpath beginning with a place p such that ◦p is a synchronization transition.
in figure 5, there are eight elementary mono-synchronized subpaths constituting a partition of g: lp1 = (p13, t9, p14, t10, p15, t11, p16, t12, p10, t6), lp2 = (p13, t9, p9, t1), lp3 = (p2, t2, p3, t3), lp4 = (p2, t2, p8, t8), lp5 = (p4, t4, p5, t5, p1, t1), lp6 = (p4, t4, p6, t6), lp7 = (p11, t7, p7, t3) and lp8 = (p11, t7, p12, t8).
property 1. a place pmp belonging to a sequential process represents a shared machine if and only if p◦mp ∈ tp or ◦pmp ∈ ts.
property 2. the first node of an elementary mono-synchronized subpath is a place belonging to ru and representing a shared machine.
3.3 time disturbances localization
let us recall some definitions.
definition 5. a time disturbance is detectable if, when it occurs, there exists at least one transition t ∈ to such that st(n) ≠ ste(n).
definition 6. a time disturbance is quantifiable if its value can be analytically known.
definition 7. a time disturbance is localizable when its occurrence node can be identified.
definition 8. a time disturbance is partially localizable when its occurrence node location can be proved to belong to a given subset of p.
definition 9. a time disturbance is observable when it is detectable, quantifiable and localizable.
definition 10. the time passive rejection capacity interval of a path lp is rc(lp) = [ca(lp), cr(lp)] where:
ca(lp) = ∑_{pi ∈ (lp ∩ (rn ∪ transnc))} (qie − bi),   (1)
cr(lp) = ∑_{pi ∈ (lp ∩ (rn ∪ transnc))} (qie − ai).   (2)
figure 5: a hillion-like model with functional decomposition
ca(lp) (resp. cr(lp)) is called the time passive rejection capacity for an advance (resp. a delay) time disturbance occurrence.
definition 11. let δ be a time disturbance and sn a set of nodes belonging to a p-time petri net. δ ∈ sn (resp. δ /∈ sn) means that the occurrence of δ is (resp. is not) in a node of sn.
used notations:
• cse is the set of elementary mono-synchronized subpaths.
• in(lp) is the first node of the path lp.
• out(lp) is the last node of the path lp.
• lp(t∗, t) is the oriented subpath of lp beginning with t∗ and ending with t.
• mn−1(lp(t∗, t)) is the number of tokens in lp(t∗, t) after the completion of cycle (n−1).
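equations (1) and (2) sum the slack of the free-resource places along a path; a direct sketch (the place names and numeric values used in the example are illustrative, not the figure's data):

```python
def rejection_capacity(path_places, interval, q_expected, free_places):
    """time passive rejection capacity rc(lp) = [ca(lp), cr(lp)]: only the
    places of the path that model free resources (rn ∪ transnc) contribute,
    with ca = sum(qie - bi) for an advance and cr = sum(qie - ai) for a
    delay, as in equations (1) and (2)."""
    contrib = [p for p in path_places if p in free_places]
    ca = sum(q_expected[p] - interval[p][1] for p in contrib)
    cr = sum(q_expected[p] - interval[p][0] for p in contrib)
    return ca, cr
```

for a path crossing two free places with intervals [1, 9] and [2, 4] and expected sojourn times 5 and 3, the capacity interval is [-5, 5]; places of ru and transc on the path contribute nothing.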
• given a time disturbance δ, δrt(n) is the resulting residue quantified at the transition t, which is fired at st(n).
• ec(in◦(lp), t) is the set of oriented paths connecting the node in◦(lp) of the path lp to the transition t.
• h(in◦(lp), t) = min_{li ∈ [ec(in◦(lp),t) \ lp(in◦(lp),t)]} (cr(li)) + δrt(n).
• h′(in◦(lp), t) = min_{li ∈ ec(in◦(lp),t)} (cr(li)) + δrt(n).
lemma 12. let lp ∈ cse, t ∈ (lp ∩ to ∩ tns), t∗ ∈ (lp ∩ to) and δ a time disturbance having a residue δrt(n) ≠ 0 quantified at the transition t. the following results are established [17]:
δrt∗(n − mn−1(lp(t∗,t))) = 0 =⇒ δ ∈ [lp(t∗,t) \ {t∗}],   (3)
δrt∗(n − mn−1(lp(t∗,t))) ≠ 0 =⇒ δ /∈ [lp(t∗,t) \ {t∗}].   (4)
this lemma discusses the case of two observable transitions, t and t∗, such that t is not a synchronization one. when a disturbance is detected at a downstream transition t and is not detected at t∗, it was generated between these two transitions. otherwise, the disturbance occurrence is outside the restriction of the considered path that connects t∗ to t.
lemma 13. let lp ∈ cse, t ∈ (lp ∩ to), tp ∈ (lp ∩ tp), ilp = {li ∈ cse / out(li) = ◦in(lp)} and δ a time disturbance having a residue δrt(n) > 0 quantified at the transition t. the following assertion is true [17]:
δrtp(n − mn−1(lp(tp,t))) < h′(tp,t) =⇒ δ /∈ { ⋃_{li ∈ ilp} { li \ {in(li), in◦(li)} } ∪ {◦tp, tp} }.   (5)
in other words, when the residue of the disturbance at the parallelism transition tp does not justify the residue at the transition t, necessarily the disturbance has not crossed tp.
lemma 14. let lp ∈ cse, t ∈ (lp ∩ to ∩ ts), t∗ ∈ (lp ∩ to) and δ a time disturbance having a residue δrt(n) > 0 quantified at the transition t.
the following results are established [17]:
δrt∗(n − mn−1(lp(t∗,t))) = 0 =⇒ δ /∈ [lp(in◦(lp),t∗) \ {in◦(lp)}],   (6)
{ 0 ≤ cr(lp(in◦(lp),t∗)) < h(in◦(lp),t) and δrt∗(n − mn−1(lp(t∗,t))) = 0 } =⇒ { δ /∈ [(lp \ lp(t∗,t)) ∪ {t∗}] and δrin◦(lp)(n − mn−1(lp(in◦(lp),t))) < h(in◦(lp),t) },   (7)
{ δrt∗(n − mn−1(lp(t∗,t))) ≠ 0 and δrt(n) + cr(lp(t∗,t)) ≠ δrt∗(n − mn−1(lp(t∗,t))) } =⇒ δ /∈ [lp(in◦(lp),t) \ {in◦(lp)}].   (8)
the above lemma discusses the case of two observable transitions, t and t∗, such that t is a synchronization one. several results are given. if the residue at the transition t∗ is equal to zero, the disturbance does not belong to the restriction of lp between its only parallelism transition in◦(lp) and t∗. if the disturbance has crossed the parallelism transition of lp (in◦(lp)) and if its residue at in◦(lp) is greater than the passive rejection capacity of the restriction of lp between in◦(lp) and t∗, the residue at t∗ must be different from zero. otherwise, the disturbance has not crossed in◦(lp). if the residue at t∗ is different from zero and if it does not justify the residue at the transition t, the occurrence of the disturbance is not in the restriction of lp between in◦(lp) and t.
lemma 15. let lp ∈ cse, tp ∈ (lp ∩ tp ∩ tuo), t ∈ (lp ∩ to) and cr(lp(tp,t)) the time passive rejection capacity of lp between tp and t for a delay occurrence. let us call dif(tp) the set of paths beginning with tp. let us denote difn(tp) the restriction of dif(tp) such that: ∀lp′ ∈ difn(tp), ∀t′ ∈ lp′, we have st′(n + mt′) < st(n), where mt′ = mn−1(lp′(tp,t′)) − mn−1(lp(tp,t)). now, let lp′ ∈ difn(tp), t∗ ∈ (lp′ ∩ to) and cr(lp′(tp,t∗)) the passive rejection capacity of lp′ between tp and t∗.
given a delay time disturbance δ, the following results are true [17]:
{ (t /∈ ts) ∧ (δrt(n) > 0) and δrt(n) + cr(lp(tp,t)) − cr(lp′(tp,t∗)) > 0 and δrt∗(n + mt∗) = 0 } =⇒ δ ∈ [lp(tp,t) \ {tp}],   (9)
{ (t /∈ ts) ∧ (δrt(n) > 0) and δrt∗(n + mt∗) ≠ 0 } =⇒ δ /∈ [(lp(tp,t) ∪ lp′(tp,t∗)) \ {tp}],   (10)
{ (t ∈ ts) ∧ (δrt(n) > 0) and cr(lp′(tp,t)) < h′(tp,t) and δrt∗(n + mt∗) = 0 } =⇒ { δ /∈ {◦tp, tp} and δrtp(n − mn−1(lp(tp,t))) < h′(tp,t) },   (11)
{ (t ∈ ts) ∧ (δrt(n) > 0) and δrt∗(n + mt∗) ≠ 0 } =⇒ δ /∈ [lp(tp,t) \ {tp}].   (12)
when tp is a non observable parallelism transition, the following assertion may be used: if a disturbance modifies the tp firing instant, it must be seen downstream of tp. consequently, when the value of the residual effect of the disturbance is greater than the rejection capacity of a given path, a residual variation has to be observed. the different lemmas formulated constitute a tool aiming to define the set of nodes where the disturbance may occur and the subset where it is proved that it did not occur. then the question of using the above lemmas in order to make them collaborate has to be tackled. in other words, it remains to establish an algorithm using these lemmas while testing all mono-synchronized subpaths of the given p-time petri net model.
4 conclusions
this paper deals with supervision in time-critical manufacturing job-shops. in such systems, operation times lie between a minimum and a maximum value. a filtering mechanism of sensors signals integrating the robustness values is described. it avoids control freezing when the time disturbance is within the robustness intervals. therefore, it makes it possible to continue the production in a degraded mode while providing guarantees of quality and safety.
it should be noted that the knowledge of the robustness intervals is a significant parameter in the proposed mechanism. the assumptions formulated here are very restrictive. it is natural to consider different scenarios where the temporal specifications of the process are not fulfilled but the production can nevertheless continue. it is necessary to introduce a finer classification of abnormal functionings and of their impact on the considered systems. in this context, fuzzy logic can be used. when a symptom of an abnormal functioning is raised by the filtering mechanism, it is imperative to localize the time disturbance occurrence. based upon controlled p-time petri nets as a modeling tool, a series of lemmas is quoted in order to build a theory dealing with the localization problem. this is quite useful for the maintenance task. in the near future, it is essential to develop an algorithm using the lemmas' results and providing localization of time disturbances.
references
[1] s. calvez, p. aygalinc, and p. bonhomme, proactive/reactive approach for maintenance tasks in time critical systems, ieee international conference on emerging technologies and factory automation (etfa’2005), catane, vol. 1, pp. 947-953, september 2005.
[2] s. collart dutilleul, and e. craye, performance and tolerance evaluation, safeprocess’03, ifac symp. on fault detection, supervision and safety for technical processes, washington, june 2003.
[3] a. boufaied, a. subias, and m. combacau, chronicle modeling by petri nets for distributed detection of process failures, ieee conference on systems, man, and cybernetics (smc’02), hammamet, october 2002.
[4] p. declerck, and m. k. didi alaoui, modelling and analysis of p-time event graphs in the (min, max, +) algebra, ieee conference on systems, man, and cybernetics (smc’04), the hague, vol. 2, pp. 1807-1812, october 2004.
[5] n. jerbi, s. collart dutilleul, e. craye, and m.
benrejeb, observability of tolerant multi-product job-shops in repetitive functioning mode, imacs’05, paris, july 2005.
[6] a. e. k. sahraoui, contribution à la surveillance et à la commande d’ateliers, ph.d. thesis, université paul sabatier, toulouse, 1987.
[7] a. toguyeni, e. craye, and j. c. gentina, a method of temporal analysis to perform on-line diagnosis in the context of flexible manufacturing system, iecon’90, vol. 1, pp. 445-450, pacific grove, california, november 1990.
[8] m. nourelfath, extension de la théorie de la supervision à la commande des systèmes à événements discrets: application à la sécurité opérationnelle des systèmes de production, ph.d. thesis, insa de lyon, france, july 1997.
[9] a. toguyeni, surveillance et diagnostic en ligne dans les ateliers flexibles de l’industrie manufacturière, ph.d. thesis, université des sciences et technologies de lille, november 1992.
[10] m. diaz, les réseaux de petri : modèles fondamentaux, ed. hermès, paris, 2001.
[11] n. jerbi, s. collart dutilleul, e. craye, and m. benrejeb, intégration de la robustesse dans la supervision de systèmes manufacturiers à contraintes de temps, conférence internationale francophone d’automatique (cifa’06), bordeaux, may 2006.
[12] w. khansa, p. aygalinc, and j. p. denat, structural analysis of p-time petri nets, computational engineering in systems applications (cesa’96), lille, pp. 127-136, july 1996.
[13] s. laftit, j. m. proth, and x. xie, optimisation of invariant criteria for event graph, ieee trans. on automatic control, vol. 37, no. 5, pp. 547-555, may 1992.
[14] s. collart dutilleul, j. p. denat, and w. khansa, use of periodic controlled petri net for discrete event dynamical system control synthesis, ecc’95, rome, pp. 2060-2065, september 1995.
[15] j. long, and b.
descotes-genon, flow optimization method for control synthesis of flexible manufacturing systems modeled by controlled timed petri nets, ieee international conference on robotics and automation, atlanta, georgia, vol. 1, pp. 598-603, may 1993.
[16] h. p. hillion, and j. m. proth, performance evaluation of job-shop systems using timed event graphs, ieee trans. on automatic control, vol. 34, no. 1, pp. 3-9, 1989.
[17] n. jerbi, s. collart dutilleul, e. craye, and m. benrejeb, localization of time disturbances in tolerant multiproduct job-shops without assembling tasks, computational engineering in systems applications (cesa’06), beijing, pp. 45-50, october 2006.
nabil jerbi1,2, simon collart dutilleul1, etienne craye1, mohamed benrejeb2
1ecole centrale de lille, laboratoire d’automatique, génie informatique et signal, cité scientifique, bp 48, 59651 villeneuve d’ascq, france
2ecole nationale d’ingénieurs de tunis, unité de recherche lara-automatique, bp 37, le belvédère, 1002 tunis, tunisie
e-mail: nabil.jerbi@isetso.rnu.tn, simon.collart_dutilleul@ec-lille.fr, etienne.craye@ec-lille.fr, mohamed.benrejeb@enit.rnu.tn
received: november 11, 2006
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. v (2010), no. 5, pp. 675-683
towards open agent systems through dynamic incorporation
c. cubillos, m. donoso, n. rodríguez, f. guidi-polanco, d. cabrera-paniagua
claudio cubillos, makarena donoso, nibaldo rodríguez, franco guidi-polanco
pontificia universidad católica de valparaíso, av. brasil 2241, valparaíso, chile
e-mail: {claudio.cubillos,nibaldo.rodriguez,fguidi}@ucv.cl, makarena.donoso@gmail.com
daniel cabrera-paniagua
universidad de valparaíso, valparaíso, chile
e-mail: daniel.cabrera@uv.cl
abstract: this work tackles the problem of providing a mechanism and infrastructure that allow a given multiagent system (mas) to become open, enabling newly incoming agents to join and participate within the existing society.
for this, a conceptual analysis of the so-called conciliation problem is presented, covering the diverse levels and issues involved in such a process. our dynamic incorporation architecture is presented, which implements an infrastructure allowing the participation of external agents in a specific multiagent system by incorporating the appropriate behaviours upon arrival. our multiagent architecture for dynamic incorporation covers three levels: semantics, communication and interaction, and has been applied in a book-trading e-market scenario.
keywords: multiagent system, dynamic incorporation architecture, open agent systems, passi.
1 introduction
software agents are defined as autonomous entities capable of flexible behavior denoted by reactiveness, pro-activeness and social ability [1]. multiagent systems (mas) consist of diverse agents that communicate and coordinate, generating synergy to pursue a common goal. at present, multiagent systems arise as a key paradigm for the development of next-generation software systems, which are required to be distributed, intelligent (autonomous, proactive), open and dynamic. while much work has been done by the research community on solving distribution and intelligence issues, little effort has been devoted to openness and dynamicity in mas settings. in the medium-to-long-term future we will see open multi-agent systems spanning multiple application domains and involving heterogeneous participants developed by diverse design teams. agents seeking to participate in these systems will be able to incorporate and learn the appropriate behavior for participation in the course of doing so, rather than having to prove adherence before entry (as happens today) [2]. however, up to now agent systems have typically been closed systems with ad-hoc designs and predefined communication protocols.
in recent years, agent systems have evolved toward the use of agreed protocols and languages thanks to a huge standardization effort (fipa, omg, w3c). nowadays agent-system openness is limited to the participation of any agent able to satisfy publicly-advertised standards. moreover, communication protocols, languages (acls) and the domain knowledge model (ontology) are typically defined by the design team prior to any agent interactions.
copyright © 2006-2010 by ccc publications
therefore much work needs to be done for the above scenario to become true. the current work presents a conceptual analysis of the so-called conciliation problem and then presents the design of a multiagent architecture devoted to facilitating the dynamic participation of external agents in a specific multiagent system by incorporating the appropriate behavior upon arrival. a book-trading e-market has been used as a case study to validate the multiagent implementation. this work gives continuity to our previous research in [4] [5].
2 related work
relevant research has been developed around coalition formation, the process of forming a group of agents to solve a problem via cooperation [4]. some works on dynamic coalition formation (dcf) [5] tackle the issue of dynamically building beneficial coalitions (coalition algorithms) among agents that can cope with environmental changes without restarting the negotiation process. however, at present coalition formation for virtual organizations is limited, with such organizations being largely static. all of the existing work has been devoted to optimizing, for a given agent, the decision of when to participate or not in a coalition (conformation/disband), but not to providing an infrastructure supporting such dynamic coalition conformations and their agents’ heterogeneities.
the novelty of our work lies in 1) presenting a conceptual approach for conciliating agents’ heterogeneities in an open mas setting, 2) presenting a solution based on dynamic behavior loading rather than on a mediated-architecture approach, and 3) implementing a solution that solves heterogeneities at two levels, semantics and interaction (behavior), while leaving the implementation of the communication level (related services and conciliation) as further work.
3 the three conciliation levels
for the dynamic formation of agent systems to become true, a set of diverse issues needs to be solved first. these concern: 1) the mechanism needed to provide the incorporation of a foreign agent into the society (agent system), 2) the ways to allow the agent to incorporate upon arrival the society's common knowledge (ontology) in order to interact, and 3) the means allowing the foreign agent to learn or incorporate upon arrival the communication protocols and languages used by the society (aips [6], acls [7], etc.) and the inherent business model. each of the above issues regards a different aspect of the communication infrastructure used by the agent society to interact, and each one is covered by different areas of informatics and computer science. in general terms, the problem of conciliating the existing divergences between the communication capabilities of the entering agent and the existing society can be organized into three levels: firstly, the interaction level, involving the diverse agent interaction protocols (aips) that formalize the conversations among agents. then the communication level, tackling the heterogeneity at message level, that is, the protocol and language used in the messages. finally, the semantic level, conciliating the possible divergences between the symbols used in the messages and the underlying knowledge models of the new agent and the mas society. each of these is further explained in the following.
3.1 interaction level
the highest level is the interaction one, tackling the conversations among pairs of agents through agent interaction protocols (aips) that specify the underlying coordination / cooperation mechanism. examples can be the contract-net protocol (cnp) [11], the different auctions (e.g. english, dutch, vickrey, etc.), tuple-based negotiations, among others. these specific protocols are grounded in interaction diagrams that specify the roles of the participants and the expected messages to be sent and received, under which conditions. in practical terms, the new agent will need to incorporate the behavior needed to perform a certain role within the mas society. for example, turning back to the cooper e-market, the new agent will need to add or load the manager role for initiating a contract-net, which will allow him to make the call for proposals, evaluate and select the best proposal and award it. all these tasks will need to be incorporated at run-time by simply instantiating the corresponding behavior classes and adding them to the agent. having solved this, the agent will still need to understand the parameters required by each of the tasks it has loaded, moving us to the semantic level. another alternative is to have the agent with the correct interaction protocol and role (e.g. manager of a contract-net) but with divergences in the message format or language used in the content, moving us to the communication level. both are described below.
3.2 communication level
nowadays agent systems use communication protocols of their choice that define the message structure or syntax and the language used to express the content of the message. examples can be fipa acl [8], kqml, an ad-hoc xml-based message envelope, or even the simple but effective concatenation of the different data in a specific order (e.g. as in low-level protocols).
It can happen that the new agent knows the FIPA ACL message format but not the Prolog language used in the content of the message, or that it knows neither. When considering the newly incoming agent, how can we enable the incorporation of those message formats and content languages upon entry? One scenario is when the new agent already has the adequate roles and behavior to interact (interaction level) but uses a message format and content language different from the ones used within the agent society. In this case, a mediation service can be provided by establishing the mapping between the slots used in the two message formats. A more efficient approach is the use of a meta-format (a meta-ontology for communication) as message interchange format [9]. In this way, each new format must provide its mapping to the meta-format and vice versa, instead of providing its mapping to all the other existing formats. The other scenario is when the new agent has incorporated neither the interaction role nor the corresponding behaviors. In this case, the message format and language used by the society are directly adopted when incorporating the behaviors, that is, when solving the problems at the interaction level.

3.3 Semantic Level

Existing coordination strategies rely on standard interaction and communication protocols, assuming that all participants (agents) understand the shared domain knowledge, usually modelled in terms of an ontology. This assumption is no longer valid in open systems, in which an incoming agent (usually developed by another party and at another moment) needs to interact with a certain agent society to obtain some service (pursuing a specific goal) and does not know in advance the domain knowledge used to interact. Some questions to answer are: how can the agent request or obtain that ontology from the agent society and process or understand it? Or how can the agent society provide the incoming agent with all the required knowledge?
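To make the questions above concrete, the toy sketch below shows the minimal artifact a semantic-level service must deliver: an alignment table between the society's vocabulary and the foreign agent's vocabulary. Real ontology alignment is computed by matching techniques; here the table is given by hand, and every concept name is invented for illustration.

```python
# Toy sketch of semantic-level conciliation: translating concept names
# between the MAS society's ontology and a foreign agent's ontology
# through an explicit alignment table. All concept names are invented.

ALIGNMENT = {  # society concept -> foreign-agent concept
    "Book": "Publication",
    "Price": "Cost",
    "Seller": "Vendor",
}

def to_foreign(concept):
    """Map a society concept into the foreign agent's vocabulary."""
    try:
        return ALIGNMENT[concept]
    except KeyError:
        raise KeyError("no alignment for concept %r" % concept)

def to_society(concept):
    """Inverse mapping: foreign vocabulary -> society vocabulary."""
    inverse = {v: k for k, v in ALIGNMENT.items()}
    return inverse[concept]
```

The point of the sketch is only that, once such a table exists (however it is obtained), translation in both directions is mechanical; the hard problem is producing the table.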
The basic problem here is how to reconcile the concepts of the new agent with the ones used by the society. Therefore, the entering agent will need to be provided with a service capable of aligning its knowledge models with the one of the MAS society, requiring ontology alignment and mapping techniques to solve it.

4 Dynamic Participation Architecture

This project used PASSI (Process for Agent Societies Specification and Implementation) as the development methodology, which uses UML as modelling language. For a detailed description please refer to [3]. Figure 1 shows our conceived solution in terms of the conciliation levels described before.

Figure 1: The 3-tier architecture for conciliation

First, the foreign agent (FA) is the one that wants to participate in a specific MAS to which it does not belong beforehand, thus requiring a conciliation process. A coordinator manages the diverse steps in the agent incorporation, from the initial request until the agent is ready to start interacting within the MAS society. Another type of agent is the mediator, one existing for each of the three levels of possible divergence (semantic, communication and interaction). In the first two cases, each one will have subscribed a set of translator agents providing specific bridging services between a couple of ontologies, or between a pair of message formats, in correspondence with the level. In the case of the interaction mediator, it will have associated a set of behavior-provider agents, each one containing the code for the roles in diverse interaction protocols (e.g., manager/bidder in the contract net, auctioneer/auctionee in auctions, etc.). Figure 2 shows the scenario in which a new agent requests incorporation into the book-trading MAS.
For this, the FA sends a conciliation request to the coordinator containing: the MAS domain name the FA wants to contact (book trading in our case); the identifier of the message protocol that the FA uses; the identifier of the ontology that the FA knows; and the name of the role that the FA wants to perform within the book-trading MAS. Such a request is evaluated by the coordinator, which checks whether the FA needs conciliation services at any of the three considered levels, that is, whether it must be provided with appropriate behaviors to participate (interaction), message-protocol translators (communication) or ontology translators (semantic). For this, it compares the protocols and semantics used by the agent and by the domain MAS, searching for heterogeneities. The coordinator delegates to the interaction mediator the search for the requested role and the related coded behaviors. In turn, the mediator contacts specific behavior providers following the Contract-Net Protocol [11]. A similar process is carried out for the communication mediator and the semantic mediator: in the first case, the coordinator sends the identifiers of the message protocols used by the FA and by the domain MAS, while in the second it sends the ontology identifiers of the domain MAS and the FA.

Figure 2: Scenario showing a foreign agent requesting conciliation

In all three cases, what is received is a list of possible providers and their proposals (bids), as the coordinator is the one in charge of selecting the final providers, composing the work team and announcing it to the FA. For selecting the appropriate members of the work team, the coordinator should consider diverse aspects, such as the cost, quality of service, reliability, etc. of each candidate, according to its utility function. In its actual form, the coordinator selects the providers based on the cost of each service. Adding more variables to the objective function, together with other selection schemes (e.g., foreign agent selection or a mixed approach), remains a matter of future work. The conciliation process may result in the need for services at any combination of the three levels, or even require no services in the case of a perfect match. In this way, the coordinator has the responsibility of structuring an appropriate work team for the FA. Usually, the coordinator will select an OntologyTranslator (OT), a MessageTranslator (MT) and a BehaviorProvider (BP) agent. It is important to mention that each role the FA wants to perform needs a specific work team; therefore, roles are tackled as different conciliation requests. However, nothing prevents a service provider from participating in diverse work teams, even for the same foreign agent. Together with requesting and selecting an appropriate work team for the foreign agent, another important process is the actual enforcement of those services. In practical terms, this means sending the behaviors from the provider to the FA, plus the possible translation services at the communication and semantic levels that could be needed.

Figure 3: Scenario showing a foreign agent in the contract-enforcing process

Figure 3 shows the services' enforcement, beginning with the foreign agent, which decomposes the received work team. It gets the behaviors from the selected behavior provider while obtaining the corresponding codecs from the message and ontology translators. Upon receiving the behaviors that compose the role, the FA takes the list of parameters required by the behaviors and translates it with the ontology codec. This codec translates the concepts from the domain-MAS ontology into the FA ontology. In this way, the FA can map the values of the behavior parameters to its own attributes and provide the necessary arguments to them.
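Loading a role at run time, as described above, essentially means shipping executable behavior code to the foreign agent, which instantiates and attaches it. The paper's implementation uses JADE behavior classes in Java; the Python sketch below only mimics the mechanism, and every class, method and bid value is invented for illustration.

```python
# Minimal sketch of dynamic behavior loading: a behavior provider ships a
# role implementation (here, a Python class object) that the foreign agent
# instantiates and attaches at run time. All names and values are invented.

class Behaviour:
    def run(self, agent):
        raise NotImplementedError

class CNPManagerBehaviour(Behaviour):
    """Contract-net 'manager' role: call for proposals, pick the cheapest bid."""
    params = ["title"]   # parameters the behavior needs, in the society's vocabulary

    def run(self, agent):
        bids = agent.call_for_proposals(agent.args["title"])
        return min(bids, key=lambda bid: bid[1])   # bid = (seller, price)

class ForeignAgent:
    def __init__(self, args):
        self.args = args
        self.behaviours = []

    def load_behaviour(self, behaviour_cls):
        # run-time incorporation: instantiate the shipped class and attach it
        self.behaviours.append(behaviour_cls())

    def call_for_proposals(self, title):
        # stand-in for the real message exchange with seller agents
        return [("seller1", 80), ("seller2", 65), ("seller3", 72)]

fa = ForeignAgent({"title": "Moby Dick"})
fa.load_behaviour(CNPManagerBehaviour)   # as shipped by a behavior provider
winner = fa.behaviours[0].run(fa)        # ("seller2", 65): the cheapest bid
```

In the real architecture the loaded behavior's parameter list (`params` here) is what gets passed through the ontology codec before the role can execute.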
In turn, the communication codec is used to translate all the inbound/outbound messages between the message protocols used inside the behavior and by the domain MAS. Regarding the work-team composition, in the most general case there will be several ways of composing types of BPs, CMs and STs to conciliate the same FA-to-target-MAS situation. However, due to the combinatorial character of the problem, a simplification has been made by considering only one possible combination of provider types. Furthermore, the FA will usually need conciliation at two levels, interaction and semantics, as the behaviors provided will usually already use the communication protocols employed by the target MAS.

5 Book-Trading Scenario

The case study is based on the well-known book-trading MAS environment, in which seller and buyer agents pursue their goals through the Contract-Net Protocol (CNP) [11]. The book-trading example included in the JADE distribution has been used as baseline. Buyer agents initiate the interaction with a call for bids, adopting the manager role of the CNP. The call includes the book title the buyer is looking for. On the other side, seller agents act as potential contractors, processing the calls and making bids for the requested title. In our implementation, the seller looks in its database for the title and sends a bid if it has stock. The price within the bid is specific to each seller. The buyer collects the answers and selects the cheapest one.

Figure 4: Performance comparison of the mediation-based and conciliation-based architectures

5.1 Experiments and Results

The objective of the case study was to prove the feasibility of our dynamic participation architecture and to evaluate the associated costs, mainly in terms of performance. For this, the experiments focused on comparing a traditional book-trading market (like the one above) with a mediation-based approach and with our proposal.
In the first case, buyers and sellers interact through a mediator agent, while in the latter case buyer agents correspond to foreign agents willing to dynamically participate in the book-trading MAS by adopting, upon arrival, the required behavior. In this particular case, the role requested for conciliation is manager of the CNP, with its diverse tasks (e.g., call for proposals, bid selection, task awarding, etc.). The test scenario has been generated through a random generation of book requests from a list of 30 titles. The number of sellers has been fixed at 10, each of which has one unit of each book in the list, at a price uniformly distributed U[50, 120]. A total of 10 sets of 100 requests has been generated. The simulation considers a main agent devoted to managing the creation of buyer agents, seller agents and the MAS for dynamic participation. The generation and arrival of buyer agents follow a Poisson process; hence the time between arrivals is exponentially distributed, E(λ), with λ = 2 requests per second. For more details on the architecture design and the PASSI-UML diagrams please refer to [5]. Regarding the distributed environment considered, the simulations were carried out on PCs with an Intel Pentium 4 at 2 GHz and 256 MB RAM, connected through a 10/100 Mb router. Figure 4 shows the service mean times for the 10 datasets, measuring the time required for the buyer agents to buy their books within the book-trading e-market. For our conciliated approach, the graph shows the time spent in the contract-enforcing part only; the time spent in the first part of the process (obtaining a work team) was around 6 seconds. The mediated approach does not consider the time spent looking for a mediator, as the agents already know it.
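The arrival model above (Poisson arrivals of rate λ = 2 requests per second, i.e., exponential inter-arrival times with mean 0.5 s) can be sketched in a few lines; the seed and sample count are arbitrary choices for the example.

```python
import random

def arrival_times(n, lam, seed=0):
    """Generate n arrival instants of a Poisson process of rate lam
    (requests per second): inter-arrival gaps are Exponential(lam)."""
    rng = random.Random(seed)
    t, times = 0.0, []
    for _ in range(n):
        t += rng.expovariate(lam)   # exponential gap with mean 1/lam
        times.append(t)
    return times

times = arrival_times(100, 2.0)        # lambda = 2 requests/s, as in the experiments
mean_gap = times[-1] / len(times)      # should be close to 1/lambda = 0.5 s
```

Each buyer agent would be created at one of these instants; with 100 requests per dataset the whole arrival stream spans roughly 50 seconds.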
These two aspects were not considered in the comparison, as a roaming agent will usually carry out many transactions with the same MAS before leaving; hence these initial processes happen only once. In [4], a comparison of a typical closed contract-net-based MAS and our proposal is presented.

6 Conclusions and Future Works

An agent-based software architecture allowing the dynamic participation of foreign agents in an existing MAS has been described. Additionally, an implementation has been carried out in the book-trading domain, giving an insight into the viability of the proposal. The use of a mechanism based on dynamic behavior loading (conciliating differences at the communication and semantic levels) arises as a feasible approach for obtaining an open MAS. Further work considers extending to other levels of conciliation (e.g., the communication protocol stack), different ways of service delivery (e.g., codec, mediation, tuple-based) and applying our solution to other application domains such as transportation, robotics and supply chains.

7 Acknowledgement

This work has been partially funded by CONICYT through FONDECYT project No. 11080284 and by the Pontificia Universidad Católica de Valparaíso (www.pucv.cl) through Nucleus Project No. 037.115/2008 "Collaborative Systems".

Bibliography

[1] G. Weiss. Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence, MIT Press, Massachusetts, USA, 1999.
[2] M. Luck, P. McBurney, C. Preist. Agent Technology: Enabling Next Generation Computing. A Roadmap for Agent Based Computing, AgentLink II, 2003.
[3] P. Burrafato, M. Cossentino. Designing a multiagent solution for a bookstore with the PASSI methodology. In Fourth International Bi-Conference Workshop on Agent-Oriented Information Systems (AOIS-2002).
[4] C. Cubillos, M. Donoso. Towards open agent systems: a book-trading study case.
In Fourth International Conference on Computer Sciences and Convergence Information Technology (ICCIT '09), pp. 950-953, 2009.
[5] C. Cubillos, M. Donoso, D. Cabrera-Paniagua. Designing an open agent system for book-trading. Third International Symposium on Intelligent Information Technology Application (IITA 2009), pp. 578-581, 2009.
[6] H. Lee, C. Chen. Multi-agent coalition formation for long-term task or mobile network. Int. Conf. on Computational Intelligence for Modelling, Control and Automation / Int. Conf. on Intelligent Agents, Web Technologies and Internet Commerce, pp. 52-57, Nov. 2006.
[7] M. Klusch, A. Gerber. Dynamic coalition formation among rational agents. IEEE Intelligent Systems, 17(3):42-47, 2002.
[8] Foundation for Intelligent Physical Agents (FIPA). FIPA Interaction Protocols (IPs) Specifications, 2002. Available at: http://www.fipa.org/repository/ips.php3.
[9] Y. Labrou, T. Finin, Y. Peng. Agent communication languages: the current landscape. IEEE Intelligent Systems, 14(2):45-52, 1999.
[10] Foundation for Intelligent Physical Agents (FIPA). FIPA ACL Message Structure Specification. Doc. No. SC00061G, 03/12/2002. Available at: http://www.fipa.org/specs/fipa00061/.
[11] M. Uschold, R. Jasper, P. Clark. Three approaches for knowledge sharing: a comparative analysis. In Proceedings of the 12th Knowledge Acquisition, Modelling and Management Workshop, KAW'99, Banff, Canada, October 1999.
[12] FIPA Contract Net Interaction Protocol Specification. Available at: www.fipa.org/specs/fipa00029/sc00029h.pdf.

International Journal of Computers Communications & Control
ISSN 1841-9836, 11(4):522-537, August 2016.

Data-Driven Control of the Activated Sludge Process: IMC plus Feedforward Approach

J.D. Rojas, O. Arrieta, M. Meneses, R. Vilanova

J.D. Rojas, O. Arrieta
Escuela de Ingeniería Eléctrica, Universidad de Costa Rica,
San José, 11501-2060 Costa Rica.
Tel: +506-2511-3892, Fax: +506-2511-3920
{jdrojas, oarrieta}@eie.ucr.ac.cr

M. Meneses, R. Vilanova*
Departament de Telecomunicació i d'Enginyeria de Sistemes,
Escola Tècnica Superior d'Enginyeria, ETSE,
Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
{montse.meneses, ramon.vilanova}@uab.cat
*Corresponding author: ramon.vilanova@uab.cat

Abstract: In the work presented in this paper, data-driven control is used to tune an internal model controller. Despite the fact that it may seem contradictory to apply a model-free method to a model-based controller, this methodology has been successfully applied to an activated sludge process (ASP) based wastewater treatment plant. In addition, a feedforward controller acting on the influent substrate concentration was also computed using the Virtual Reference Feedback Tuning and applied to the same wastewater process, to observe the effect on the dissolved oxygen and on the substrate concentration at the effluent.
Keywords: activated sludge process, data-driven control, internal model control, wastewater treatment plants

1 Introduction

Data-driven control is a rather new control approach that does not attempt to find a model of the plant to control; instead, it uses experimental data to directly find a controller which, generally, is meant to minimize some control performance criterion. Some of the most remarkable methods within this approach are the Iterative Feedback Tuning (IFT) [1, 2], the windsurfer approach [3, 4], the correlation approach [5, 6] and the Virtual Reference Feedback Tuning (VRFT) [7-9]. While the IFT and the windsurfer approach are iterative methods (that is, several experiments on the plant have to be performed in order to find the controller), the correlation approach and the VRFT are one-shot methods (only one set of data is needed to find the controller).
IFT computes an unbiased gradient of a performance index to iteratively improve the tuning of the parameters of a reduced-order discrete-time controller; at each iteration, three different experiments are performed on the system and, based on these data, the gradient is computed. In the windsurfer approach, the objective is to find a better model of the plant (and subsequently a better controller) using closed-loop data and Internal Model Control (IMC) design [10], in such a way that, with every iteration, the closed-loop bandwidth can be increased. Data-driven control using the correlation approach is a one-shot methodology that attempts to find the values of a restricted-order controller minimizing the correlation between the closed-loop error of the system (based on a desired closed-loop behavior) and the reference for the process output. The VRFT translates the model-reference control problem into an identification problem, the controller being the transfer function to identify, based on some "virtual signals" computed from a batch of data taken directly from an open-loop experiment. In this work, the VRFT approach is used and extended in order to be applied to a wastewater treatment plant (WWTP). WWTPs are an important case of study within the process control area, as well as an active research area that involves other disciplines such as chemistry, biology and instrumentation. Moreover, by themselves, WWTPs have a deep impact on the quality of life in big cities. That is why the constraints on the level of pollution of the treated water before discharging it into the receiving waters are becoming more stringent [11] and, because of that, a correct control and operation of WWTPs is one of the top priorities for both industry and academia.
Among the types of WWTP, the activated sludge process (ASP) is one of the most popular and most studied [12, 13]. This is also true from the automatic control perspective. For example, in [14] a parameter and state nonlinear estimator is used in an adaptive linearizing control of the dissolved oxygen and substrate concentration of an ASP, under the assumption that only the dissolved oxygen is available for measurement. In [15], several multivariable PI control methods are applied to the ASP by linearizing the nonlinear model and the results are presented, as well as the combination of some of these methods. In [16], predictive control is used to maintain a low concentration of substrate at the output by controlling the dissolved oxygen through the dilution rate; the internal model of the predictive controller is a three-layer neural network. In [17], the control of the substrate concentration is achieved using an estimation based on the dissolved oxygen measurements, a dynamic controller that copes with the change in reference, and a PID controller that corrects the steady-state error produced by the use of a linearized model in the first controller. In [18], a decentralized PI approach is presented to show that simple, well-tuned PI controllers can achieve a performance similar to more complex methodologies in the ASP case. Some other strategies have been proposed recently, such as in [19], where an I-P control system with pole-placement design is proposed. In all the cases, some sort of model (nonlinear or linearized) is used to compute the controller, and in several cases it is assumed that some parameters are known, which may not be the case for a real plant. The contribution of this paper is to apply a data-driven approach to the tuning of discrete-time, restricted-order linear controllers in a decentralized scheme, to obtain good performance for both reference tracking and disturbance rejection in an ASP-based WWTP.
Without explicitly computing a model of the process, an internal model control approach is used in conjunction with the VRFT methodology. Moreover, the effect of the influent concentration disturbance is taken into account by computing a feedforward controller, using the VRFT as well. It was found that this methodology provides excellent results when compared with a PI approach. The rest of the paper is divided in two parts: in Section 2, a short overview of the VRFT is presented, as well as the mentioned extensions for IMC control; in Section 3, the results of the application of this data-driven method are presented and compared with a two-degrees-of-freedom PI controller. The conclusions are presented in the final section.

2 Virtual Reference Feedback Tuning Extensions

In this section, an overview of the VRFT is presented, as well as some results that extend the capabilities of the VRFT to different control strategies and controller structures.

2.1 Virtual Reference Feedback Tuning Overview

The Virtual Reference Feedback Tuning (VRFT) is a one-shot, data-based method for the design of feedback controllers. The original idea was presented in [7], and then formalized by Lecchini, Campi, Savaresi and Guardabassi (see [8, 9]).

Figure 1: The VRFT setup. The dashed lines represent the "virtual" part of the method

In [8], the method is presented for the tuning of a feedback controller.
If the controller belongs to the controller class $\{C(z;\theta)\}$ given by $C(z;\theta) = \beta^T(z)\theta$, where $\beta(z) = [\beta_1(z) \cdots \beta_n(z)]^T$ is a known vector of transfer functions and $\theta = [\theta_1\ \theta_2 \cdots \theta_n]^T$ is the vector of parameters, then the control objective is to minimize the model-reference criterion given by:

$$J_{MR}(\theta) = \left\| \left( \frac{P(z)C(z;\theta)}{1 + P(z)C(z;\theta)} - M(z) \right) W(z) \right\|_2^2 \qquad (1)$$

Starting from a batch of open-loop data $\{u(t), y(t)\}$, a "virtual" reference signal is computed in such a way that, if the closed-loop system were fed with this virtual signal and the controller in the loop were the ideal controller achieving a predefined target transfer function, then the input and output signals of the plant in closed loop would be the same as the batch of open-loop data. The output of the controller should be equal to $u(t)$; therefore, this controller can be found by identifying the transfer function which yields the output $u(t)$ when the input $\bar{r}(t) - y(t)$ is applied, as depicted in Fig. 1. The original VRFT algorithm, as presented by the authors in [8], is as follows. Given a set of measured I/O data $\{u(t), y(t)\}$, $t = 1, \ldots, N$:

1. Calculate:
   - a virtual reference $\bar{r}(t)$ such that $y(t) = M(z)\bar{r}(t)$, and
   - the corresponding tracking error $e(t) = \bar{r}(t) - y(t)$.

2. Filter the signals $e(t)$ and $u(t)$ with a suitable filter $L(z)$:

$$e_L(t) = L(z)e(t), \qquad u_L(t) = L(z)u(t)$$

3. Select the controller parameter vector $\hat{\theta}_N$ that minimizes the following criterion:

$$J_{VR}^N(\theta) = \frac{1}{N} \sum_{t=1}^{N} \left( u_L(t) - C(z;\theta)e_L(t) \right)^2 \qquad (2)$$

If $C(z;\theta) = \beta^T(z)\theta$, criterion (2) can be written as

$$J_{VR}^N(\theta) = \frac{1}{N} \sum_{t=1}^{N} \left( u_L(t) - \varphi_L^T(t)\theta \right)^2 \qquad (3)$$

with $\varphi_L(t) = \beta(z)e_L(t)$, and the parameter vector $\hat{\theta}_N$ is given by

$$\hat{\theta}_N = \left[ \sum_{t=1}^{N} \varphi_L(t)\varphi_L(t)^T \right]^{-1} \sum_{t=1}^{N} \varphi_L(t)u_L(t) \qquad (4)$$

The authors also showed that the filter $L(z)$ should be chosen so that criterion (2) approximates criterion (1).
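Setting aside the prefilter for a moment ($L(z) = 1$ is exact when the controller class contains the ideal controller and the data are noise-free), steps 1-3 can be sketched numerically. The first-order plant, target model and PI-type controller class below are toy assumptions chosen for illustration, not values from the paper; the plant parameters are used only to generate the data, never by the tuning itself.

```python
import random

# Toy VRFT run. Data-generating plant: P(z) = b z^-1 / (1 - a z^-1).
# Target model: M(z) = (1-m) z^-1 / (1 - m z^-1) (unit DC gain).
a, b, m = 0.9, 0.1, 0.8
N = 400
rng = random.Random(1)
u = [rng.uniform(-1.0, 1.0) for _ in range(N)]
y = [0.0] * N
for t in range(N - 1):
    y[t + 1] = a * y[t] + b * u[t]          # open-loop experiment

# Step 1: virtual reference from y = M(z) rbar, i.e.
# y(t+1) = m y(t) + (1-m) rbar(t)  =>  rbar(t) = (y(t+1) - m y(t)) / (1-m)
rbar = [(y[t + 1] - m * y[t]) / (1.0 - m) for t in range(N - 1)]
e = [rbar[t] - y[t] for t in range(N - 1)]   # virtual tracking error

# Step 3 (L(z) = 1): controller class C(z; th) = th0 + th1 / (1 - z^-1),
# a discrete PI, so the regressor is [e(t), running sum of e up to t].
s, phi = 0.0, []
for t in range(N - 1):
    s += e[t]
    phi.append((e[t], s))

# Least squares th = (Phi^T Phi)^-1 Phi^T u, solved as a 2x2 system.
a11 = sum(p[0] * p[0] for p in phi)
a12 = sum(p[0] * p[1] for p in phi)
a22 = sum(p[1] * p[1] for p in phi)
g1 = sum(phi[t][0] * u[t] for t in range(N - 1))
g2 = sum(phi[t][1] * u[t] for t in range(N - 1))
det = a11 * a22 - a12 * a12
th0 = (a22 * g1 - a12 * g2) / det
th1 = (a11 * g2 - a12 * g1) / det
# For this plant the ideal controller lies in the class:
# th0 = a(1-m)/b = 1.8 and th1 = (1-a)(1-m)/b = 0.2, recovered exactly.
```

With noisy data the recovery would no longer be exact, and the filter of (5) (or an instrumental-variable variant) becomes necessary.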
This filter should be designed to satisfy the constraint:

$$|L|^2 = |1 - M|^2 \, |M|^2 \, |W|^2 \, \frac{1}{\Phi_u} \qquad (5)$$

where $\Phi_u$ is the spectral density of $u(t)$. The VRFT framework has been used in several applications and has even been extended to the MIMO case and used for PID tuning; see for example [20-24].

2.2 Internal Model Control Using the Virtual Reference Feedback Framework

Internal Model Control (IMC) is a popular control method that incorporates the model of the process into the controller [10]. The standard structure is depicted in Fig. 2. $P(z)$ represents the plant, while $\bar{P}(z)$ is its model; $Q(z)$ is the IMC controller. If the output of the model and the output of the plant are the same, and there is no disturbance, the control system behaves as if it were in open loop. In that case, to achieve perfect tracking, $Q(z)$ must try to cancel the dynamics of the plant. On the other hand, if there is a mismatch between the plant and its model, or if a disturbance acts on the system, the feedback loop comes into play. This characteristic leads to the well-known property that an IMC system is nominally internally stable if $Q(z)$ is stable, in case the model is equal to the plant. Of course, finding a perfect model is rarely achievable and, even if it were, $Q(z)$ may not equal the inverse of this model, due to physical limitations or because the inverse of the plant may lead to an unstable controller. In [10], a two-step design is proposed for this kind of controller:

1. Solve the nominal performance criterion given, for example, by

$$\min_{\bar{Q}(z)} \left\| \left( 1 - \bar{P}(z)\bar{Q}(z) \right) W(z) \right\|_p \qquad (6)$$

where $W$ is a filter chosen to give more importance to certain frequencies and $\|\cdot\|_p$ is a given norm that defines the performance criterion.
The optimal solution to this problem yields a sensitivity function given by $S^*(z) = 1 - \bar{P}(z)\bar{Q}(z)$ and a complementary sensitivity function given by $M^*(z) = \bar{P}(z)\bar{Q}(z)$; that is, the response to a change in the reference is as if the system were in open loop, while the response to a disturbance is in closed loop. Of course, this response is not achievable in general, and therefore the model $\bar{P}(z)$ should be divided into an invertible and a non-invertible part to be able to approximate the optimal controller.

2. To introduce robustness, the complementary sensitivity has to be rolled off at high frequencies; therefore, it is necessary to add a low-pass filter $F(z)$ to the controller $\bar{Q}(z)$, obtaining the final controller $Q(z) = \bar{Q}(z)F(z)$. Suppose that the multiplicative uncertainty is bounded by a frequency-dependent function $\bar{l}_m(\omega)$,

$$\left| \frac{P(e^{j\omega}) - \bar{P}(e^{j\omega})}{\bar{P}(e^{j\omega})} \right| \le \bar{l}_m(\omega)$$

Then the closed-loop system is robustly stable if and only if

$$|F(e^{j\omega})| < \frac{1}{\left| \bar{P}(e^{j\omega})\bar{Q}(e^{j\omega})\bar{l}_m(\omega) \right|} \quad \forall \omega \qquad (7)$$

Figure 2: Standard structure of the IMC. $\bar{P}$ represents the plant model and $Q$ is the IMC controller

Figure 3: Disposition for the VRFT experiment using the IMC topology. The dashed lines represent the virtual signals and components

One of the reasons why IMC control has become very popular is that finding the controller and the conditions for robust stability can be cast in a very simple form. Using the VRFT framework, these constraints are not really necessary, since the methodology does not need any modeling step. Readers interested in IMC control can find more information in [10, 25]. It is possible to find an IMC controller using the VRFT framework without worrying about the modeling of the system. In Fig. 3, the experimental setup for the VRFT applied to the IMC topology is depicted.
If the target complementary sensitivity function is given by $M(z)$, then the virtual reference $\bar{r}(t)$ is computed as:

$$\bar{r}(t) = M^{-1}(z)\,y(t) \qquad (8)$$

If the ideal controller were in the loop, then one would have $\bar{P}(z) = P(z)$, the input to the controller $Q(z,\theta)$ would be $\bar{r}(t)$, and its corresponding output would be $u(t)$, in order to have $y(t)$ as the output of the closed-loop system. From Fig. 3, it can be found that the ideal controller would be given by

$$Q_0(z) = M(z)P(z)^{-1}, \qquad \bar{P}_0(z) = M(z)Q_0(z)^{-1} \qquad (9)$$

where $\bar{P}_0(z)$ is the ideal plant model derived from the ideal controller. This basic idea leads to the following optimization problem, which gives the set of optimal parameters $\theta^*$ (in a least-squares sense):

$$\min_\theta J(\theta) = \min_\theta \sum_{i=1}^{N} \left( u(i) - Q(z,\theta)\bar{r}(i) \right)^2 \qquad (10)$$

Once $Q(z,\theta^*)$ has been determined, it is easy to compute the approximation of the plant model from (9):

$$\bar{P}(z,\theta) = M(z)Q(z,\theta)^{-1} \qquad (11)$$

It is important to note that $\bar{P}(z,\theta)$ is seen just as an "instrumental model" that results from the determination of the optimal controller. In fact, it can be seen simply as the part of the IMC controller that results from the optimization. Of course, if a robustness check is performed with the obtained controller, this approximation of the plant can be used as if it were the nominal model. In that case, the controller and the nominal model are found at once. The filter for robust operation presented in (7) is already included in the determination of $Q(z,\theta^*)$, given the desired $M(z)$.

Figure 4: Virtual reference setup for feedforward plus IMC controller. Solid lines are for "real" components and signals, while dashed lines are for "virtual" components and signals

2.3 VRFT Approach to Feedforward Control

Sometimes it is possible to measure disturbances that affect the process output.
In those cases, it is desirable to use a feedforward controller that acts before the effects of these disturbances reach the output of the plant. In [26], the idea of using a VRFT controller in conjunction with a one-degree-of-freedom controller was presented; the main difference is that the disturbance is assumed to be available for measurement and is used in the optimization problem. In this paper, this idea was implemented in conjunction with the VRFT-IMC controller. Suppose that the control system can be represented by the diagram in Fig. 4, where $P_1(z)$ and $P_2(z)$ represent the unknown dynamics of the plant from the input $u(t)$ and from the disturbance $d(t)$ to the output $y(t)$, respectively. These three signals are measured in an open-loop experiment. The idea of using the feedforward control plus the IMC controller is to cope with both measurable and non-measurable disturbances. $Q(z)$, $P_m(z)$ and $C_f(z)$ are the controllers to be found. The "virtual" components and signals (presented with dashed lines in Fig. 4) are:

- $M(z)$, the target closed-loop dynamics from the reference signal to the output of the controlled system.
- $F(z)$, the target closed-loop dynamics from the measured disturbance to the output.
- $\bar{r}_v$, the virtual reference computed from the data obtained in the open-loop experiment and the closed-loop target functions.
- $\bar{y}_d$, the ideal disturbed output in closed loop, if the virtual reference were applied in closed loop and the ideal controllers were in place.
- $d$, the measurable disturbance signal, which is assumed to be available in the open-loop experiment.

The virtual reference signal $\bar{r}_v$ has to be computed according to the ideal relationships and the measured and virtual signals:

$$\bar{y}_d = M\bar{r}_v + Fd \qquad (12)$$
If one is able to find the ideal controllers, then $\bar{y}_d = y$. Since this is exactly what is needed, the virtual signal is computed from (12) as:

$$\bar{y}_d = y, \qquad \bar{r}_v = M^{-1}\left( \bar{y}_d - Fd \right) \qquad (13)$$

The transfer function of the controlled system is

$$y = \frac{P_1 Q}{1 + (P_1 - P_m)Q}\,r + \frac{(1 - P_m Q)(P_1 C_f + P_2)}{1 + (P_1 - P_m)Q}\,d \qquad (14)$$

Note in (14) that the input signals do not have a bar, denoting that these signals are not virtual but are actually entering the system. Comparing (12) and (14), one is able to find the ideal controllers that would, theoretically, drive the system to the desired dynamics (if the transfer functions of the plant were known):

$$Q_o = \frac{M}{P_1 - M(P_1 - P_m)}, \qquad C_{fo} = \frac{1}{P_1}\left( \frac{F\left(1 - (P_1 - P_m)Q\right)}{1 - P_m Q} - P_2 \right) \qquad (15)$$

Once $Q(z)$ and $C_f(z)$ have been obtained by optimization, the best approximation of $P_m(z)$ is $P_m(z) = M(z)Q^{-1}(z)$, just as in (9), where one expects that $P_m(z) \approx P_1(z)$, since the virtual reference was computed to achieve this relationship. Following the paths in the diagram of Fig. 4 that lead from the measured and virtual inputs to the $u(t)$ signal, it is straightforward to find that the cost function to optimize is given by:

$$J(\theta) = \frac{1}{N} \sum_{i=0}^{N-1} \left[ u(i) - \left( Q(z,\theta)\bar{r}_v(i) + C_f(z,\theta)d(i) \right) \right]^2 \qquad (16)$$

Solving this optimization problem, one is able to find directly the two controllers and the instrumental model, using only one batch of input-output data and without any iterative scheme.

3 Application to an ASP-Based WWTP

In this section, a practical example of the IMC-VRFT method exposed above is presented. The plant considered in this paper is the WWTP given in [27].
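Before detailing the plant, the joint optimization (16) can be checked on a toy numerical example. All numbers below are illustrative assumptions, not the WWTP model: a first-order plant with one measurable disturbance generates the data, the targets are a first-order $M(z)$ and $F(z) = 0$ (perfect rejection of the measured disturbance), $Q$ is parameterized with the denominator of $M$ fixed, and $C_f$ is a static gain. These simplifications are chosen so that the ideal controllers of (15) lie in the parameterized class and the least-squares fit recovers them exactly.

```python
import random

# Data-generating plant: y = P1 u + P2 d with
# P1 = b1 z^-1/(1 - a z^-1) and P2 = b2 z^-1/(1 - a z^-1) (shared pole).
# Targets: M(z) = (1-m) z^-1/(1 - m z^-1), F(z) = 0.
a, b1, b2, m = 0.9, 0.1, 0.05, 0.8
N = 500
rng = random.Random(2)
u = [rng.uniform(-1, 1) for _ in range(N)]
d = [rng.uniform(-1, 1) for _ in range(N)]
y = [0.0] * N
for t in range(N - 1):
    y[t + 1] = a * y[t] + b1 * u[t] + b2 * d[t]    # open-loop experiment

# (13) with F = 0: rv = M^-1 y.
rv = [(y[t + 1] - m * y[t]) / (1 - m) for t in range(N - 1)]

# Fit u = Q rv + Cf d with Q = (q0 + q1 z^-1)/(1 - m z^-1) and Cf = c0.
# Multiplying through by (1 - m z^-1) makes the problem linear in (q0, q1, c0):
#   u(t) - m u(t-1) = q0 rv(t) + q1 rv(t-1) + c0 (d(t) - m d(t-1))
rows, rhs = [], []
for t in range(1, N - 1):
    rows.append([rv[t], rv[t - 1], d[t] - m * d[t - 1]])
    rhs.append(u[t] - m * u[t - 1])

# Normal equations solved by Gaussian elimination with partial pivoting.
A = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
g = [sum(rows[k][i] * rhs[k] for k in range(len(rows))) for i in range(3)]
for i in range(3):
    p = max(range(i, 3), key=lambda r: abs(A[r][i]))
    A[i], A[p] = A[p], A[i]
    g[i], g[p] = g[p], g[i]
    for r in range(i + 1, 3):
        f = A[r][i] / A[i][i]
        for c in range(i, 3):
            A[r][c] -= f * A[i][c]
        g[r] -= f * g[i]
theta = [0.0, 0.0, 0.0]
for i in (2, 1, 0):
    theta[i] = (g[i] - sum(A[i][c] * theta[c] for c in range(i + 1, 3))) / A[i][i]
q0, q1, c0 = theta
# Ideal values here: q0 = (1-m)/b1 = 2, q1 = -a(1-m)/b1 = -1.8,
# c0 = -b2/b1 = -0.5. The instrumental model M Q^-1 = (1-m) z^-1/(q0 + q1 z^-1)
# then equals P1, as expected from (11).
```

With noise-free data and the ideal controllers in the class, one batch of data yields both controllers and the instrumental model at once, which is the point of the one-shot scheme.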
it comprises an aerated tank, where microorganisms act on the organic matter by biodegradation, and a settler, where the solids are separated from the wastewater and a proportional part is then recycled to the aerator in order to maintain a certain amount of biomass in the system. the layout is shown in fig. 5.

data-driven control of the activated sludge process: imc plus feedforward approach 529

table 1: initial conditions
biomass                    x(0)    = 217.79 mg/l
substrate                  s(0)    = 41.23 mg/l
dissolved oxygen           do(0)   = 6.11 mg/l
recycled biomass           xr(0)   = 435.58 mg/l
influent substrate         sin(0)  = 200.00 mg/l
influent dissolved oxygen  doin(0) = 0.50 mg/l

table 2: kinetic parameters
β = 0.2      kc = 2 mg/l
r = 0.6      ks = 100 mg/l
α = 0.018    kdo = 0.5
y = 0.65     dos = 0.5 mg/l
µmax = 0.15 h^{-1}

the component balances for the substrate, biomass, recycled biomass and dissolved oxygen provide the following set of non-linear differential equations:

\frac{dx(t)}{dt} = \mu(t)x(t) - d(t)(1 + r)x(t) + r\,d(t)\,x_r(t)   (17)

\frac{ds(t)}{dt} = -\frac{\mu(t)}{y} x(t) - d(t)(1 + r)s(t) + d(t)\,s_{in}   (18)

\frac{d\,do(t)}{dt} = -k_0 \frac{\mu(t)}{y} x(t) - d(t)(1 + r)\,do(t) + k_{la}(do_s - do(t)) + d(t)\,do_{in}   (19)

\frac{dx_r(t)}{dt} = d(t)(1 + r)x(t) - d(t)(\beta + r)x_r(t)   (20)

\mu(t) = \mu_{max} \frac{s(t)}{k_s + s(t)} \frac{do(t)}{k_{do} + do(t)}   (21)

k_{la} = \alpha\, w(t)   (22)

where x(t) is the biomass, s(t) the substrate, do(t) the dissolved oxygen, dos the maximum dissolved oxygen, xr(t) the recycled biomass, d(t) the dilution rate, w(t) the aeration rate, sin and doin the substrate and dissolved oxygen concentrations in the influent, y the biomass yield factor, µ the biomass growth rate in a monod-like form [28], µmax the maximum specific growth rate, ks and kdo saturation constants, kla the oxygen mass transfer coefficient, α the oxygen transfer rate, k0 a model constant, and r and β the ratios of the recycled and waste flows to the influent flow. the influent concentrations are set to sin = 200 mg/l and doin = 0.5 mg/l. the control strategy is a decentralized control as in [18], where the multivariable process is treated as two separate single-variable processes. the strategy is depicted in fig. 6.
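the balance equations (17)–(22) can be simulated directly. the sketch below uses a forward-euler step with the values of tables 1 and 2, but note two loudly-flagged assumptions: k0 is taken to be the table's kc entry, and dos is set to 8 mg/l (an assumed saturation value, since the extracted table entry is ambiguous); the constant inputs d and w are chosen here only so that the table 1 state is an equilibrium:

```python
import numpy as np

# kinetic parameters from table 2; k0 is assumed to be the table's "kc" entry
# and do_s is an assumed saturation value of 8 mg/l (see lead-in)
mu_max, ks, kdo, Y = 0.15, 100.0, 0.5, 0.65
alpha, beta, r, k0, do_s = 0.018, 0.2, 0.6, 2.0, 8.0
s_in, do_in = 200.0, 0.5

def growth(s, do):
    # eq. (21): monod-type specific growth rate
    return mu_max * s / (ks + s) * do / (kdo + do)

def rhs(state, d, w):
    x, s, do, xr = state
    mu, kla = growth(s, do), alpha * w            # eq. (22): kla = alpha * w
    return np.array([
        mu * x - d * (1 + r) * x + r * d * xr,                      # eq. (17)
        -mu / Y * x - d * (1 + r) * s + d * s_in,                   # eq. (18)
        -k0 * mu / Y * x - d * (1 + r) * do
            + kla * (do_s - do) + d * do_in,                        # eq. (19)
        d * (1 + r) * x - d * (beta + r) * xr,                      # eq. (20)
    ])

state0 = np.array([217.79, 41.23, 6.11, 435.58])  # table 1
x0, s0, do0, xr0 = state0
mu0 = growth(s0, do0)
d0 = mu0 / (1 + r - r * xr0 / x0)                 # holds the biomass balance at zero
w0 = (k0 * mu0 / Y * x0 + d0 * (1 + r) * do0 - d0 * do_in) \
     / (alpha * (do_s - do0))                     # picked so that d(do)/dt = 0

# forward-euler integration over 24 h with a 0.01 h step
state, dt = state0.copy(), 0.01
for _ in range(int(24 / dt)):
    state = state + dt * rhs(state, d0, w0)
print(state)   # stays close to the table 1 values at this operating point
```

around this operating point the state barely moves over 24 h, which is a cheap sanity check of the implemented balances; any controller test would perturb d(t) and w(t) around these values.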
with respect to the control problem definition, the dissolved oxygen do(t) and the substrate s(t) are considered the controlled outputs of the plant, whereas the dilution rate d(t) and the aeration rate w(t) are the two manipulated variables. the control of do provides a way to maintain the necessary amount of biomass in the system, while controlling s keeps the pollution at the effluent at an acceptable level [14].

figure 5: wastewater treatment process

figure 6: control strategy for the wwtp

the initial conditions and kinetic parameters are taken as in [18, 27] and presented in tables 1 and 2. the settings of the vrft controller are as follows: for both control loops, the sampling time was selected as ts = 0.5 min, and the imc controller q(z) has the following parameterization:

q(z) = \frac{\alpha_1 + \alpha_2 z^{-1} + \alpha_3 z^{-2}}{\beta_1 + \beta_2 z^{-1} + \beta_3 z^{-2}}   (23)

the target transfer function for the do loop is:

m_{do}(z) = \frac{0.02357 z^{-1}}{1 - 0.9764 z^{-1}}   (24)

which represents a first-order transfer function with a time constant of approximately 20 min. for the s loop (controlled by manipulating d(t)), the target closed-loop dynamics is a first-order transfer function with a time constant of approximately 40 min, given by:

m_s(z) = \frac{0.01382 z^{-1}}{1 - 0.9862 z^{-1}}   (25)

the input data was selected as an additive random signal of zero mean, with variance 90 for w(t) and variance 7.5e-4 for d(t), around the operating points given in table 1. the resulting controllers were found as:

q_{do}(z) = \frac{40.69 - 19.35 z^{-1} - 19.65 z^{-2}}{1 - 0.4683 z^{-1} - 0.4792 z^{-2}}, \qquad q_s(z) = \frac{0.01236 - 0.006155 z^{-1} - 0.006158 z^{-2}}{1 - 0.4863 z^{-1} - 0.4924 z^{-2}}   (26)

the imcff-vrft version was also implemented, considering the influent substrate concentration sin as the measurable disturbance.
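the stated time constant of the do target (24) can be verified numerically: with ts = 0.5 min, the pole 0.9764 corresponds to τ = −ts / ln(0.9764) ≈ 21 min. a plain-python check via the step response of the difference equation y(k) = 0.9764 y(k−1) + 0.02357 u(k−1):

```python
ts, b1, a1 = 0.5, 0.02357, 0.9764      # sampling time (min) and m_do(z) coefficients
n = 400
y = [0.0]
for _ in range(1, n):
    y.append(a1 * y[-1] + b1 * 1.0)    # unit-step input
dc_gain = b1 / (1 - a1)                # close to 1, as expected for a tracking target
tau = next(k * ts for k, yk in enumerate(y) if yk >= 0.632 * dc_gain)
print(round(dc_gain, 3), tau)          # gain ~1.0, 63.2% rise time ~21 min
```

the same two lines of arithmetic applied to (25) give τ ≈ 36 min for the s loop, consistent with the "approximately 40 min" design target.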
the q controller has the same parameterization as in (23). for the feedforward controller, the parameterization is:

c_f(z) = \frac{\gamma_0 + \gamma_1 z^{-1}}{1 - \sigma_1 z^{-1}}   (27)

the sampling time is the same as for the q controllers, and the desired target transfer function is f(z) = 0, which is normally what is desired from feedforward control. the experimental data was slightly changed, because the dynamics from the disturbance to the output are slower: the data changes slowly, and it was taken into account that a portion of the output data was affected only by the input, another portion only by the disturbance and, finally, another portion by both. this helps to identify both controllers correctly at the same time. the resulting controllers are:

q_{do}(z) = \frac{40.57 - 80.37 z^{-1} + 39.8 z^{-2}}{1 - 1.974 z^{-1} + 0.9739 z^{-2}}, \qquad c_{f,do}(z) = \frac{-0.0002002 + 0.001138 z^{-1}}{1 - 0.9976 z^{-1}}

q_s(z) = \frac{0.01233 - 0.02416 z^{-1} + 0.01183 z^{-2}}{1 - 1.947 z^{-1} + 0.9479 z^{-2}}, \qquad c_{f,s}(z) = \frac{-0.0006154 + 0.0004481 z^{-1}}{1 - 0.7261 z^{-1}}   (28)

the results of these controllers are compared with the two-degrees-of-freedom, continuous-time pi controller of [18]. two different tests were performed: a change in the references, and a disturbance on the influent substrate sin, where it is considered that every 24 h an increase of 10% of the value of sin takes place during 1 h. for the change in reference (sref(t) for the substrate concentration reference and doref(t) for the dissolved oxygen reference), the results are given in figs. 7 and 8. a step change of 10 mg/l is applied to sref(t) at time t = 10 h, while a step change of −2 mg/l in doref(t) is applied at time t = 100 h. the effect of a change in one loop on the other loop, due to the process interaction, can be observed as well. in both cases, the imc-vrft and imcff-vrft controllers achieve a better performance, both for the reference tracking and for the disturbance rejection when the other loop changes.
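each of the identified controllers in (26)–(28) is just a ratio of polynomials in z^{-1}, i.e. a difference equation. the following minimal realization (the class is ours, a generic direct-form filter, not code from the paper) runs the do-loop q(z) of (26) against a constant input:

```python
class DiscreteTF:
    """realize h(z) = (b0 + b1 z^-1 + ...) / (a0 + a1 z^-1 + ...) sample by sample."""
    def __init__(self, num, den):
        self.b = [c / den[0] for c in num]       # normalize so that a0 = 1
        self.a = [c / den[0] for c in den[1:]]
        self.xs = [0.0] * len(self.b)            # current + past inputs
        self.ys = [0.0] * len(self.a)            # past outputs

    def step(self, u):
        self.xs = [u] + self.xs[:-1]
        out = sum(b * x for b, x in zip(self.b, self.xs)) \
            - sum(a * y for a, y in zip(self.a, self.ys))
        self.ys = [out] + self.ys[:-1]
        return out

# do-loop imc block from eq. (26)
q_do = DiscreteTF([40.69, -19.35, -19.65], [1, -0.4683, -0.4792])
u = [q_do.step(1.0) for _ in range(200)]
print(u[0], round(u[-1], 2))   # first sample is b0 = 40.69; then it settles
                               # near the dc gain 1.69/0.0525 ≈ 32.19
```

the same class realizes the feedforward blocks of (27)–(28) simply by passing their coefficient lists.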
in table 3, the values of the integral of the squared error (ise) and the total variation (tv), which measures the aggressiveness of the control effort, are presented. ise and tv are computed as:

ise = \int_0^{t} e(t)^2\, dt, \qquad tv = \sum_{i=1}^{n} |u(i) - u(i-1)|   (29)

where e(t) is the error signal (the reference minus the measured output), u(i) is the output of the controller sampled every hour, and n is the total number of samples. in the column “reference tracking” it can be seen that, for the s loop, the application of the imc-vrft controller greatly reduces the ise (by nearly 57%) with almost the same tv. the do loop is improved in both ise and tv, as can also be seen in figs. 7 and 8; the zoomed section details the change in the do, and it is clear that the response of the pi controller is much worse than the response of the imc-vrft, which has almost no overshoot.

figure 7: step change in the s reference at time t = 10h

table 3: comparison of the results between the imc-vrft, imcff-vrft and the pid control

                      reference tracking      disturbance rejection
                      s        do             s        do
ise   pi              77.23    2.94           12.05    0.0021
      imc-vrft        32.85    0.64           5.56     6e-005
      imcff-vrft      33.25    0.65           0.15     0.0083
tv    pi              0.091    89.10          0.15     7.02
      imc-vrft        0.083    67.33          0.15     4.69
      imcff-vrft      0.082    65.01          0.16     13.07

in fig. 9, the plot of the control signals is presented for both the dilution rate and the air flow rate. as expected, the performances of the imc-vrft and the imcff-vrft are similar for the reference tracking, since no disturbance is present in this simulation, despite the fact that the controllers were found using different optimization methods (in the case of the imc-vrft a simple linear least-squares problem can be cast, while for the imcff-vrft the output-error (oe) method was applied [29]).
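both indices in (29) are easy to compute from logged signals; in the numpy sketch below, the decaying error and the rising control signal are synthetic examples (not the paper's data), chosen so that the exact values are known:

```python
import numpy as np

def ise(t, e):
    # trapezoidal approximation of the integral of e(t)^2, as in eq. (29)
    e2 = np.asarray(e, dtype=float) ** 2
    return float(((e2[:-1] + e2[1:]) / 2 * np.diff(t)).sum())

def tv(u):
    # total variation of the sampled control signal, eq. (29)
    u = np.asarray(u, dtype=float)
    return float(np.abs(np.diff(u)).sum())

t = np.linspace(0.0, 10.0, 1001)
e = np.exp(-t)          # error decaying after a step: integral of e^2 is ~0.5
u = 1.0 - np.exp(-t)    # monotone control move: tv = u(10) - u(0), ~1
print(round(ise(t, e), 3), round(tv(u), 3))
```

note that tv penalizes every control move, so an aggressive but fast controller can win on ise while losing on tv, which is exactly the trade-off table 3 exposes.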
for the disturbance in the substrate concentration of the influent, the responses are presented in fig. 10, fig. 11 and fig. 12. the pi control is faster in rejecting the disturbance (fig. 10), but its overshoot is larger. the imcff-vrft controller performs much better than the imc-vrft controller (the ise is 97% lower) with almost the same tv. the response of the do is greatly improved, with a reduction of almost 97% of the ise for the imc-vrft, but the imcff-vrft performs much worse than the pi controller (the ise is almost 4 times bigger, with a tv 2 times greater). it is clear that a feedforward strategy is not adequate for the do loop, since the effect of the change in sin is much slower than the effect of the aeration.

figure 8: step change in the do reference at time t = 100h

figure 9: control effort during the change in the reference

figure 10: effect over the substrate concentration when the substrate input is disturbed

figure 11: effect over the dissolved oxygen when the substrate input is disturbed

figure 12: control effort during the disturbance in the substrate concentration of the influent

conclusions

in this paper, the vrft method has been studied and applied within the imc framework, and a feedforward extension to the imc-vrft controller was also presented. both methodologies were successfully applied to a wwtp process, substantially improving, in the case of the reference tracking, the results of a continuous-time two-degrees-of-freedom pi controller, while using a restricted-order discrete-time controller. for the disturbance rejection, the feedforward controller greatly improved the performance for the substrate loop, but for the dissolved oxygen loop it was found that the feedforward component degrades the performance. the difference in time constants, and the small effect that the influent substrate concentration has on the dissolved oxygen, may be the reason for this poor performance. data-driven control is a powerful tool that can easily be applied to several control problems and extended to several control structures.
how to choose the closed-loop target functions without any knowledge of the plant (only data), and how to guarantee the stability and robustness of the resulting controller, are subjects that still need further research in this control area.

acknowledgment

the financial support from the university of costa rica, under the grants 322-b4-218 and 731-b4-213, is greatly appreciated. this work has also received financial support from the ministerio de economia e innovacion of spain under project dpi2013-47825-c3-1-r.

bibliography

[1] h. hjalmarsson, m. gevers, s. gunnarsson, o. lequin (1998), iterative feedback tuning: theory and applications, ieee control systems magazine, doi: 10.1109/37.710876, 18(4): 26–41.
[2] m. gevers (2002), a decade of progress in iterative process control design: from theory to practice, journal of process control, doi: 10.1016/s0959-1524(01)00018-x, 12(4): 519–531.
[3] w. s. lee, b. d. o. anderson, i. m. y. mareels, r. l. kosut (1995), on some key issues in the windsurfer approach to adaptive robust control, automatica, doi: 10.1016/0005-1098(95)00092-b, 31(11): 1619–1636.
[4] b. anderson (2002), windsurfing approach to iterative control design, in: p. albertos, a. sala (eds.), iterative identification and control: advances in theory and applications, springer verlag, 142–166.
[5] a. karimi, l. miskovic, d. bonvin (2003), iterative correlation-based controller tuning with application to a magnetic suspension system, control engineering practice, doi: 10.1016/s0967-0661(02)00191-0, 11(9): 1069–1078.
[6] a. karimi, k. van heusden, d. bonvin (2007), noniterative data-driven controller tuning using the correlation approach, in: european control conference, kos island, greece.
[7] g. guardabassi, s. savaresi (2000), virtual reference direct design method: an off-line approach to data-based control system design, ieee transactions on automatic control, doi: 10.1109/9.855559, 45(5): 954–959.
[8] m. c.
campi, a. lecchini, s. m. savaresi (2002), virtual reference feedback tuning: a direct method for the design of feedback controllers, automatica, doi: 10.1016/s0005-1098(02)00032-8, 38(8): 1337–1346.
[9] a. lecchini, m. campi, s. savaresi (2002), virtual reference feedback tuning for two degree of freedom controllers, international journal of adaptive control and signal processing, doi: 10.1002/acs.711, 16(5): 355–371.
[10] m. morari, e. zafiriou (1989), robust process control, prentice-hall international.
[11] u. jeppsson (1996), modelling aspects of wastewater treatment processes, ph.d. thesis, department of industrial electrical engineering and automation (iea), lund institute of technology (lth), http://www.iea.lth.se/ielulf/publications/phd-thesis/phd-thesis.pdf
[12] m. henze, p. harremoës, e. arvin, j. la cour jansen (1997), wastewater treatment: biological and chemical processes, 2nd edition, environmental engineering, springer verlag, new york, usa (series editors: u. förstner, r. j. murphy, w. h. rulkens).
[13] m. henze, w. gujer, t. mino, m. van loosdrecht (2002), activated sludge models asm1, asm2, asm2d and asm3, 1st edition, scientific and technical report, iwa publishing, london, uk.
[14] f. nejjari, b. dahhou, a. benhammou, g. roux (1999), non-linear multivariable adaptive control of an activated sludge wastewater treatment process, international journal of adaptive control and signal processing, 13(5): 347–365.
[15] n. a. wahab, r. katebi, j. balderud (2006), multivariable pid tuning of activated sludge processes, proc. of the international control conference (icc2006).
[16] s. caraman, m. sbarciog, m. barbu (2007), predictive control of a wastewater treatment process, international journal of computers communications & control, 2(2): 132–142.
[17] f. koumboulis, n. kouvakas, r. king, a.
stathaki (2008), two-stage robust control of substrate concentration for an activated sludge process, isa transactions, 47(3): 267–278.
[18] r. vilanova, r. katebi, v. alfaro (2009), multi-loop pi-based control strategies for the activated sludge process, ieee international conference on emerging technologies and factory automation.
[19] a. d. kotzapetros, p. a. paraskevas, a. s. stasinakis (2015), design of a modern automatic control system for the activated sludge process in wastewater treatment, chinese journal of chemical engineering, 23(8): 1340–1349.
[20] m. nakamoto (2004), an application of the virtual reference feedback tuning for a mimo process, sice 2004 annual conference, sapporo, japan, 3: 2208–2213.
[21] f. previdi, t. schauer, s. savaresi, k. hunt (2004), data-driven control design for neuroprotheses: a virtual reference feedback tuning (vrft) approach, ieee transactions on control systems technology, doi: 10.1109/tcst.2003.821967, 12(1): 176–182.
[22] f. previdi, m. ferrarin, s. m. savaresi, s. bittanti (2005), closed-loop control of fes supported standing up and sitting down using virtual reference feedback tuning, control engineering practice, doi: 10.1016/j.conengprac.2004.10.007, 13(9): 1173–1182.
[23] a. sala, a. esparza (2005), extensions to “virtual reference feedback tuning: a direct method for the design of feedback controllers”, automatica, doi: 10.1016/j.automatica.2005.02.008, 41(8): 1473–1476.
[24] y. kansha, y. hashimoto, m.-s. chiu (2008), new results on vrft design of pid controller, chemical engineering research and design, doi: 10.1016/j.cherd.2008.02.018, 86(8): 925–931.
[25] d. e. rivera, m. morari, s. skogestad (1986), internal model control: pid controller design, industrial & engineering chemistry process design and development, doi: 10.1021/i200032a041, 25(1): 252–265.
[26] g. guardabassi, s.
savaresi (1997), data-based simultaneous design of composite feedback-feedforward controllers: a virtual input direct design approach, 4th european control conference (ecc97), brussels, belgium.
[27] f. nejjari, g. roux, b. dahhou, a. benhammou (1999), estimation and optimal control design of a biological wastewater treatment process, mathematics and computers in simulation, doi: 10.1016/s0378-4754(98)00158-x, 48(3): 269–280.
[28] j. monod (1949), the growth of bacterial cultures, annual review of microbiology, 3(1): 371–394.
[29] l. ljung (1999), system identification: theory for the user, 2nd edition, prentice hall.

special issue on membrane computing

seventh brainstorming week on membrane computing

the present volume contains a selection of papers resulting from the seventh brainstorming week on membrane computing (bwmc7), held in sevilla from february 2 to february 6, 2009. the meeting was organized by the research group on natural computing (rgnc) from the department of computer science and artificial intelligence of sevilla university. the previous editions of this series of meetings were organized in tarragona (2003) and sevilla (2004 – 2008).
after the first bwmc, a special issue of natural computing – volume 2, number 3, 2003 – and a special issue of new generation computing – volume 22, number 4, 2004 – were published; papers from the second bwmc have appeared in a special issue of journal of universal computer science – volume 10, number 5, 2004 – as well as in a special issue of soft computing – volume 9, number 5, 2005; a selection of papers written during the third bwmc has appeared in a special issue of international journal of foundations of computer science – volume 17, number 1, 2006; after the fourth bwmc a special issue of theoretical computer science was edited – volume 372, numbers 2-3, 2007; after the fifth edition, a special issue of international journal of unconventional computing was edited – volume 5, number 5, 2009; finally, a selection of papers elaborated during the sixth bwmc has appeared in a special issue of fundamenta informaticae – volume 87, number 1, 2008.

membrane computing is an area of natural computing which studies models of computation inspired by the structure and functioning of living cells, and by the organization of cells in tissues and other structures. the resulting models (called p systems) are distributed parallel computing devices, processing multisets in compartments defined by membranes. most classes of p systems are computationally universal and, if an exponential working space can be produced in polynomial time (e.g., by membrane division), then they are able to solve computationally hard problems in a feasible time. a series of applications were recently reported, especially in biology and medicine, but also in computer graphics, cryptography, linguistics, economics, approximate optimization, etc. several simulation programs (useful in applications) are available by now. comprehensive information about this research area (considered in 2003 by isi as a “fast emerging research front in computer science”) can be found at the website http://ppage.psystems.eu.
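the multiset-processing step just described can be sketched in a few lines; the toy below is a single-membrane simplification in which a fixed rule order stands in for true nondeterministic maximal parallelism (so it illustrates the flavour, not a faithful p-system semantics). the doubling rule a → aa shows how an exponential workspace grows in linearly many steps:

```python
from collections import Counter

def step(ms, rules):
    # one maximally parallel step: each rule fires as many times as the
    # remaining objects allow (rules tried in the given, fixed order)
    ms, produced = Counter(ms), Counter()
    for lhs, rhs in rules:
        need = Counter(lhs)
        times = min(ms[s] // n for s, n in need.items())
        for s, n in need.items():
            ms[s] -= n * times
        for s, n in Counter(rhs).items():
            produced[s] += n * times
    ms.update(produced)
    return +ms                      # drop zero counts

ms = Counter("a")
for _ in range(5):
    ms = step(ms, [("a", "aa")])
print(ms["a"])                      # 2^5 = 32 copies after 5 steps
```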
at this web address one can also find the volumes published after each bwmc, with the papers resulting from these meetings, including the volume with all papers related to bwmc7. for the present volume we have selected only a few of these papers; they have been thoroughly reworked after the meeting and then they went through the standard refereeing procedure of the journal. the selection was also intended to provide a good image of the research in membrane computing, so that the volume contains both theoretical and applicative papers, dealing with computing power, computational complexity, “classic” cell-like p systems and the recently introduced spiking neural p systems, programming, simulation of biological processes, and so on.

*

as mentioned above, the meeting was organized by the research group on natural computing from sevilla university (http://www.gcn.us.es), and all the members of this group were enthusiastically involved in this (not always easy) work. the meeting was supported from various sources: (i) proyecto de excelencia de la junta de andalucía, grant tic 581, (ii) proyecto de excelencia con investigador de reconocida valía, de la junta de andalucía, grant p08 – tic 04200, (iii) proyecto del ministerio de educación y ciencia, grant tin2006 – 13425, (iv) iv plan propio de la universidad de sevilla, (v) consejería de innovación, ciencia y empresa de la junta de andalucía, as well as by the department of computer science and artificial intelligence of sevilla university.

*

the present volume is dedicated to the 60th birthday anniversary of professor mario de jesús pérez-jiménez, the head of the research group on natural computing from sevilla university and one of the most active researchers in membrane computing.
a really unique combination of enthusiasm and mathematical talent, of scientific devotion and altruism, mario has not only contributed a lot to the research in membrane computing – with fundamental results especially related to computational complexity issues and to many other theoretical research questions, to applications in biology and eco-systems, to programming, etc. – but he has also created one of the strongest research groups in natural computing in general, and in membrane computing in particular; the organization, for already several years in a row, of the brainstorming week on membrane computing also deserves to be mentioned – all these making sevilla a place of current “pilgrimage” of researchers in membrane computing, from europe, asia, america. for all those who know mario personally, it is hard to believe that he is already six decades old: he is so active, enthusiastic and hard-working that he looks as young as decades ago, and the theoretical possibility to retire (according to spanish regulations, this is possible for mario) looks like a nonsensical joke... and, for all who know mario personally, it is impossible not to owe him a lot, from science to daily life (one of the sayings which circulate around is that if you have a need, it is wiser not to tell it loudly, because mario will immediately try to help you...). happy birthday, mario, and many happy returns!

guest editors:
giancarlo mauri, milan, italy
gheorghe păun, bucharest, romania
agustín riscos-núñez, seville, spain
(sevilla, june 2009)

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. iv (2009), no. 1, pp. 17-26

artificial intelligence + distributed systems = agents

ioan dzitac, boldur e. bărbat

ioan dzitac, “aurel vlaicu” university of arad, faculty of exact sciences, department of mathematics-informatics, str. elena dragoi, nr. 2, complex universitar m (micalaca, zona iii), arad, romania, e-mail: idzitac@gmail.com

boldur e.
bărbat, “lucian blaga” university of sibiu, “hermann oberth” faculty of engineering, department of research, 10, victoriei bd, sibiu, 550024, românia, e-mail: bbarbat@gmail.com

every established religion was once a heresy (henry buckle, “essays”)

abstract: the connection with wirth’s book goes beyond the title, albeit confining the area to modern artificial intelligence (ai). whereas thirty years ago, to devise effective programs, it became necessary to enhance the classical algorithmic framework with approaches applied to limited and focused subdomains, in the context of broad-band technology and semantic web – applications running in open, heterogeneous, dynamic and uncertain environments – current paradigms are not enough, because of the shift from programs to processes. besides the structure as position paper, to give more weight to some basic assertions, results of recent research are abridged and commented upon in line with the new paradigms. among the conclusions: a) nondeterministic software is unavoidable; its development entails not just new design principles but new computing paradigms. b) agent-oriented systems, to be effectual, should merge conventional agent design with approaches employed in advanced distributed systems (where parallelism is intrinsic to the problem, not just a means to speed up).

keywords: open, heterogeneous, dynamic and uncertain environments (ohdue); computer-aided decision-making; non-algorithmic software; bounded rationality; agent-oriented software engineering (aose).

1 introduction. adapting to the time(s) of change

three dominant features of the post-industrial society relevant here are: a) the growing speed of change (due to the intense positive feedback entailed by moore’s law outcomes: internet, broad-band technology, semantic web, google, [9] etc.); b) the growing complexity (architectural, cognitive, structural [4, 5, 10]); c) globalization (expressed in the it context mainly through the modern enterprise paradigms).
berners-lee, who also coined the term ggg (giant global graph) to describe the semantic web as a new stage of the www, puts it very pointedly: “i have gone from using a 300 baud connection on one of those telephone couplers to a 3 million baud connection, so that is a 10,000 factor. so the technology underneath this has tremendously increased in terms of speed and functionality, and the web technology had happened on top” [http://dig.csail.mit.edu/, 2007].

copyright © 2006-2009 by ccc publications

thus, modern it environments, except for simple applications, move towards being open and heterogeneous (resources are unalike and their availability is not warranted), dynamic (the pace of exogenous and endogenous changes is high) and uncertain (both information and its processing rules are revisable, fuzzy, and uncertain). most situations to be controlled are complex and uncertain, and involve parallel processes. thus, the applications developed to deal with them – which must be intelligent [7] (to manage complexity and uncertainty) and distributed (to handle parallelism) – are intrinsically non-deterministic, and end-users have to interact with them in manners they are not used to. moreover, a second (side-effect) vicious circle comes out of the interaction between the difficulty to adapt and the follow-on frustration (for instance, the claims regarding “digital manipulation”). in brief, adapting to the speed of change is mandatory, and the target of this paper is to show that it entails adopting modern it paradigms. the approach is based on the homomorphism of the addition in the paper title to that in the famous book of wirth [22]. accordingly, section 2 presents the rationale, analysing and supporting the evolution from programs to agents and emphasizing the temporal dimension. deepening the investigation, section 3 explores the unavoidable paradigm shift (here, many and diverse paradigms are x-rayed).
section 4 includes the core: the misleading term “distributed” is thoroughly revisited, because distribution was mostly part of the solution and is now recognized as a main part of the problem too. just to give more weight to some assertions in the preceding sections, models, methods, and mechanisms based on the paradigms endorsed before are abridged after earlier papers and commented upon in section 5. conclusions and intentions for future development close the paper (section 6).

2 rationale. from programs to agents

like all changes induced by systems with intense positive feedback, the shift from programs to agents is at once a long way (conceptually, since it involves multiple paradigm changes), a swift leap (historically, since as it beings algorithms are active, whereas agents are teenagers) and a hard chaotic fight (epistemically, since ai as a whole is at the same time feared and ridiculed, overrated and denied). misinterpretations are eased because, confusing program architecture with code structure, some professionals still consider that, x-rayed, programs are nothing more than implemented algorithms, since in binary form data and instructions are indiscernible. moreover, in the case of ai, the argument about the difference between programs and agents is complicated by two conceptual inflations due to overstated advertising: a) too many programs are labelled as intelligent; b) most such “intelligent” programs are renamed as agents (with or without minor reshaping). thus, the notion of agent (and the very metaphor behind it) is blurred.
object orientation as almost the one and only software engineering paradigm adds two more hindrances: a) already in the 1990s, real-time programming showed that even the conceptual equivalence “program = object” exposed intellectual difficulties (e.g., objects like “mutex”, “event”, or “exception”, the state “start”, the method “execute”); b) almost as a corollary, by transitivity, it is commonsensical that the equivalence “agent = object” is awful, since nobody could be happy to be considered “intelligent like an object”. besides, objects have at most a very primitive temporal dimension. though, this dimension is fundamental, because no software entity lacking it could be able to (inter)act in the e-world with other entities, neither artificial (its peers), nor natural (its human end-users). tellingly enough, time entered the software universe via data structures like dynamic data (e.g., lists in pascal) or data types for concurrency (from monitors in concurrent pascal to tasks in ada). likewise, in ai, a genuine temporal dimension was entailed by distributed systems (to handle the parallelism involved, albeit with architectural inefficiency due to approaches based on “light” multithreading and time-sharing). thus, the homomorphism suggested in the paper title is even deeper than supposed at first. on the other hand, at least in countries similar to romania, it curricula are obviously lagging behind the state of the art. aspects relevant for this paper are: a) ai syllabi show a long-lasting flavour of “gofai” (good old-fashioned ai); b) even when rigid planning-based ai is replaced by “bic” (biologically inspired computing), the focus is not on modelling inspired by biological (sub-symbolic) paradigms, but merely on simulating biological behaviour; c) software engineering syllabi are entirely object-oriented; thus, agents – if considered – are designed as objects (e.g., using jade).
d) distributed systems are approached in a still more conventional manner: the only concern is to boost speed, not to reflect real-world parallelism. hence, the rationale is threefold, depending on the perspective: a) ai (intelligent software must be process-based, not program-based); b) software engineering (intelligent applications should be agent-oriented, not object-oriented); c) it curricula (ai and distributed systems should be merged, based on agent-orientation).

3 paradigm shift. what is certain?

here “paradigm” means: “thought pattern in any scientific discipline or other epistemological context. the merriam-webster dictionary defines broadly: a philosophical or theoretical framework of any kind. [...] perhaps the greatest barrier to a paradigm shift, in some cases, is the reality of paradigm paralysis, the inability or refusal to see beyond the current models of thinking” [http://en.wikipedia.org/wiki/paradigm#paradigm_shifts]. a lot of fundamental scientific concepts, inside and outside it, changed their intension dramatically since it began to be the dominant “novum organon” of post-industrial technological development. they have been investigated from an agent-oriented perspective in [4, 5] and lately in [6], where the manifold paradigmatic shift they contributed to engender was labelled “from kelvin to zadeh” (figure 1), because the focus was on precision (unnatural in the real world, hence inadequate in software) and the shift described there referred to the swap from the conventional “computing with numbers” (based mainly on measurements) to the modern “computing with words” (based mainly on perceptions).
in addition, figure 1 tries to remind, or at least to suggest, that: • moore’s law, epitomised as a feedback loop in the way usual in automation and electronics, emphasises not only the growing psychological difficulty to adapt to an unprecedented speed of change but also its known side effects: instability, distortion, complexity (mainly cognitive), frustration (e.g., the irritation about “digital manipulation”). figure 1: manifold paradigm shift (adapted and extended from [6]) • “manifold” as “diverse”: in line with common usage, “paradigm” regards also specific areas within a discipline (e.g., “programming paradigm”, mainly when it “leverages a computer language’s power” [21]). in figure 1 the paradigmatic level lowers from a wide-ranging one to a narrow, specialised one, usual in software engineering. thus, the “kelvin paradigm” suggests that it must stay firmly based on mathematical precision, while the “zadeh paradigm” replies that it should shift towards semiotic flexibility [23]. on the other hand, the shift from “client-server” to “computing as interaction” [1, 24] has a different focus, not involving (im)precision, (non)determinism, (un)certainty, etc. (besides, another dimension of this diversity was recently shown: the ethnographical one [17].) • “manifold” as “many”: after redressing the balance (i.e., accepting all kinds of paradigms, not just gofai), ai was inundated by sub-symbolic paradigms; among the best known are artificial neural networks (ann) and genetic algorithms (ga). the most nihilist and powerful, i.e., the ethological paradigm (based on the physical-grounding hypothesis [4]), seems to be still in vogue for developing agents (above all since the agent is accredited as process by a formal standard [12]). • “manifold” as “non-exhaustive”: to save space, some important paradigm shifts of a rather philosophical/epistemological nature are skipped over.
just one example: can humans communicate only among themselves, or with machines too? here, the problem could be circumvented by replacing “communication” with “interaction”, but the epistemological facet is still there: it has to be admitted (or rejected) that, in the framework of “computing as interaction”, their connotations are very similar. perhaps the definition given by sieckenius de souza could help: “communication is the process through which, for a variety of purposes, sign producers (i.e., signification system users in this specific role) choose to express intended meanings by exploring the possibilities of existing signification systems or, occasionally, by resorting to non-systematized signs, which they invent or use in unpredicted ways” [19]. • bounded rationality. the term is used as defined, explained and endorsed in [20, 18, 14]. since it is a very rich concept, often applied as a principle, there are many “models of bounded rationality” [20], including a nobel prize winning one (“psychology for behavioural economics” [16]). it entails that almost any human undertaking, to be effective as regards the time required, must be imperfect. in the ai context, perfection suggests the ideal of mathematicians - and of conventional software developers too - to achieve algorithm-based optimisation. just one unquestionable example: “the problem is neither to admit that for any medical act (and for even stronger reasons as regards nursing) “just in time” is a sine qua non condition, nor that bounded rationality is the only practical means to achieve it [13]. nevertheless, there is a double hindrance, due to a yet prevalent mentality: a) therapeutic decision-making is an exclusively human attribute; b) non-algorithmic software is - if not nonsensical - applicable at most to toy problems” [5]. hence, bounded rationality explodes in a fascicle of interrelated, versatile, and highly application-dependent features.
examples: learning or negotiation strategies; most features meant to reduce complexity (like the “zero-overhead rule” in generative programming). • time enters the picture threefold: a) from left to right, it represents the very essence of paradigm shift in the sense of kuhn (for some conservatives, unable to adapt, the time to accomplish the shift can attain infinity; on the other hand, children have no problem to “communicate” with their artificial playmates). b) explicitly, through the need to manage “just in time” rapidly changing situations. c) implicitly, by handling real-world parallelism. here, parallelism is intrinsic to any interaction (even between an interface agent and its owner), because interactants coexist in time. of course, distribution adds new facets to an already complex temporal dimension. in short, to be effectual in the e-world, both users and software developers have to pass the mental rubicon separating programs from agents. indeed, agents are here to stay (even if not yet very intelligent); “affective computing”, “semantic web”, “ambient intelligence” and so on are more than slogans (they are recognized as main it development directions by eu-promoted acts [1]). thus, “what is certain?” in the section title goes beyond the necessary enquiry of uncertainty, questioning the very essence of nowadays’ agent-oriented application development. for instance, even when blending vigorous, well-established paradigms, research trends are in line with the suggestions in the right part of figure 1: “interpretability is considered to be the main advantage of fuzzy systems over alternatives like neural networks, statistical models, etc. [...] in the recent years, research has started to focus on the trade-off between interpretability and accuracy [...]. analysis of the model interpretability and comprehensibility is always convenient, and it is a necessity when accuracy is not a model feature” [15]. 4 “distributed” revisited.
a solution becomes problem. to distribute means “to divide and dispense in portions”, implying a previous entirety able to be divided [http://www.thefreedictionary.com/distributed]. paradoxically, the epistemic trouble with distributed systems is that they are conceived, designed and implemented corresponding to the common connotation of “distributed” [11, 8]. thus, “in distributed computing a program is split up into parts that run simultaneously on multiple computers communicating over a network. [...] the main goal of a distributed computing system is to connect users and resources in a transparent, open, and scalable way. [...] distributed computing implements a kind of concurrency. it interrelates tightly with concurrent programming so much that they are sometimes not taught as distinct subjects [...] if not planned properly, a distributed system can decrease the overall reliability” [http://en.wikipedia.org/wiki/distributed_computing]. “distribution”, as “the act of distributing or the condition of being distributed; apportionment” [http://www.thefreedictionary.com/distribution], is related to resource management (if something is plenty, there is no need to apportion). in short - and maybe oversimplified - there are four kinds of circumstances where, within the it treatment of the case, distribution in its conventional meaning can help (inside parentheses are examples) [4]: a) in space. spatial distribution is the oldest and most familiar type of resource apportionment (equipment components, process phases, networking, credit-card terminals). b) in time. time-sharing preceded its name (learning in schools, delegating authority, or reading on a ride are much older than unix-like operating systems or parallel buses). c) in organization. any organism is based on distributed order (human body, company, state, flower). for virtual enterprises it becomes a major raison d’être. d) in problem solving.
“divide et impera” was always a foremost strategy to fight complexity, chiefly cognitive complexity (most reductionist theories, most methodologies from euclid’s algorithm to structured programming). the difficulties began when “distributed” and “parallel” were perceived as quasi-synonymic in the syntagm “distributed computing”. moreover, epistemic confusion escalates when other debatable (semantically antinomic) concepts generated even more doubtful pairs of antonyms: “sequential”/“simultaneous” (instead of “parallel”), “holistic”/“analytical” (instead of “reductionist”). a relevant case is related to the basic metaphor of ann: despite the same distributed neural network structure in both brain hemispheres, an antinomic pair of functions is stated as “linear algorithmic processing” vs. “holistical algorithmic processing” [http://en.wikipedia.org/wiki/cerebral_hemispheres]. here the reluctance to accept the paradigmatic shift - that processing could be also non-algorithmic (even in the left brain) - generated a quasi-pleonasm (could a step-by-step procedure be nonlinear?) and a quasi-contradiction (could a deterministic procedure be holistical?). as regards processing, the real opposition is “sequential”/“parallel”, where parallelism involves distribution. for instance, “as regards the learning process as such - prefixed with “e-” or not - the viewpoint is that human learning is best described by the information-processing approach in cognitive psychology, in line with the ideas endorsed in [2]: “most modern information-processing theories are “learning-by-doing” theories which imply that learning would occur best with a combination of abstract instruction and concrete illustrations”. learning should be considered - in both humans and agents - as a process where most effectiveness is reached through a blend of symbolic (“left-hemisphere”-like) and subsymbolic (“right-hemisphere”-like) modi operandi” [5].
though, confusion became delusion when “concurrency” and “distribution” were perceived as conceptually close enough that agent-oriented applications - concurrent par excellence - could be effective if designed using mechanisms conceived (and used successfully) for distributed systems. since that is a central claim of this paper, it must be elaborated a bit. when designing distributed systems - the examples above show it clearly - distribution was mostly part of the solution, not part of the problem. indeed, in most cases, the problem was a whole and, iff such a problem can be split into subproblems, the entirety is disaggregated, the subproblems are solved and, finally, the partial solutions are re-aggregated. at the programming level they run in parallel and, when accessing shared resources, need mutual exclusion. on the contrary, applications devised in line with the “computing as interaction” paradigm - above all agent-based applications - entail at least two interactants (the interface agent and its owner), evolving in parallel, autonomously but not independently (since they interact as in any normal communication process: inform, wait, interrupt each other, etc.). briefly, they are concurrent processing threads; their programming entails multithreading. the crucial software engineering problem is that, while an api (application programming interface) able to support multithreading covers all requirements for mutual exclusion, the opposite is true just in very simple cases. worse, in most cases, designing concurrent applications with an api intended for distributed systems results in severe ineffectiveness. hence, acknowledging the difference between distribution and concurrency is paramount not just at the epistemic level, but at the engineering level too.
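the claim that interactants are concurrent processing threads whose programming entails multithreading can be sketched minimally (a sketch only, not the authors’ code; the two thread names and the shared log are invented for illustration): two interactants run in parallel and a lock from the multithreading api provides the mutual exclusion their shared state needs.

```python
# minimal sketch (invented names): two "interactants" as concurrent threads;
# a lock - the multithreading api's mutual-exclusion primitive - guards shared state.
import threading

shared_log = []                 # resource shared by both interactants
lock = threading.Lock()

def interactant(name: str, messages: int) -> None:
    for i in range(messages):
        with lock:              # only one thread mutates the log at a time
            shared_log.append((name, i))

agent = threading.Thread(target=interactant, args=("agent", 1000))
owner = threading.Thread(target=interactant, args=("owner", 1000))
agent.start(); owner.start()    # both evolve in parallel, autonomously
agent.join(); owner.join()

assert len(shared_log) == 2000  # no updates lost under mutual exclusion
```

note the asymmetry the paragraph describes: the lock covers everything this concurrent pair needs, whereas message-passing machinery built for distributed systems would add cost without adding safety here.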
a relevant step towards diminishing confusions was the different name given in c# to an instruction existing in java, without changing its semantics: “synchronize” is now called “lock”, expressing what it really does (tellingly, almost the opposite of what its previous name claimed to do, since - to preserve coherence - it reduces parallelism). 5 models, methods, mechanisms. besides the structure as position paper, to give more weight to the assertions regarding the paradigm shifts symbolised in figure 1, results of recent research (in line with the paradigms endorsed above) are abridged and commented upon. all software entities mentioned below are models, methods, and mechanisms, but they are abridged without grouping them according to their kind, since they are commented upon in software engineering papers, as well as in [6, 5] and papers referred to there. there are three categories of mechanisms developed for affordable non-algorithmic agent-based applications in ohdue: a) innovative mechanisms dedicated to “computer-aided x”, where x stands for almost any intellectual activity, within the software engineering toolbox agorithm (agent-oriented interactive time-sensitive heuristic mechanisms). so far they are implemented - or in earlier development stages, but only for solving toy problems or even in simple experimental models - for x = decision, learning, semiosis. b) existing mechanisms employed in previous research (before 2005, mainly in experimental models for captologic virtual therapists, carried out as pseudoavatars). their structure is based on common api functions callable from a customary java development environment. c) mechanisms conceptualized within the framework of some phd theses in preparation. to increase paradigmatic relevance they are ordered in relation to figure 1, that is, focusing on the missing “l” in the toolbox name. (of course, a bijection is out of the question.): • decision-making with future contingents.
domino (decision-oriented mechanism for “if” as non-deterministic operator). developed primarily for decision making (typical application: managing overbooking), it is meant to deal with undecidability due to any kind of future contingents. it is a “three-output if”, where the semantics of the third value is a blend of a łukasiewicz “i” interpreted as “unknowable” and a kleene “u” interpreted as “temporary lack of knowledge”. (in fact, the semantics of “undecidable” is refined to “undecidable in the time span given”.) • analog input. scrollbar input is proposed for all kinds of data: uncertain knowledge, intrinsically fuzzy, roughly estimated, etc. • dynamic priorities. these are applied for: a) fine-tuning agent priorities (mainly the features of multi-agent systems) to manage situations “just in time”; b) fading out retention in “thick time”; c) boosting the response of exception handlers. • exception-driven reactivity. prompt response to asynchronous events must be mostly stimulus-driven, because of the interaction between basically reactive entities. to respond promptly, interrupts are reflected asynchronously in exceptions with dynamic propagation. • antientropic self-cloning. developed to implement “strange loops” (via gödelian self-reference) as a first step in investigating agent self-awareness, it means spawning an agent identical to itself, preserving self-representation (its “i”), but with an enriched world model (via lamarckian evolution). • flexible cloning. to reach efficient polymorphism the copies are purposely imperfect, spawning an agent quasi-identical to its parent; differences between clones become extensive only after recurring cloning (a clone is just a “slightly altered alter ego”). unfortunately, the mechanisms listed above have - beside lacking validation in vivo (some of them not even in ovo) - a double vulnerability: they are either incremental as regards the “kelvin way of thinking” or too loosely linked to new paradigms.
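the flavour of a “three-output if” refined to “undecidable in the time span given” can be conveyed by a small sketch (purely illustrative and invented here, not domino itself: the function name, the sentinel value, and the thread-with-deadline strategy are all assumptions): a condition is evaluated under a deadline, and if no verdict arrives in time the third value is returned.

```python
# illustrative sketch only (not the domino implementation): a condition evaluated
# under a deadline yields True, False, or a third value meaning "undecidable in
# the time span given". all names here are invented.
import threading
import time

UNDECIDED = object()   # the third output of the "three-output if"

def three_way_if(condition, deadline_s: float):
    """evaluate condition() in a worker thread; give up after deadline_s seconds."""
    result = {}
    worker = threading.Thread(
        target=lambda: result.setdefault("v", bool(condition())),
        daemon=True,
    )
    worker.start()
    worker.join(timeout=deadline_s)           # wait only for the time span given
    return result["v"] if "v" in result else UNDECIDED

fast_true  = lambda: True
slow_check = lambda: (time.sleep(1.0), True)[1]   # needs more time than granted

print(three_way_if(fast_true, 0.5))                  # a verdict in time: True
print(three_way_if(slow_check, 0.05) is UNDECIDED)   # no verdict in time
```

the design point mirrors the paper’s refinement: the third value does not assert that the condition is unknowable forever, only that knowledge was not available within the deadline.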
thus, what is their relevance? to break the vicious circle - since there is no “methodology for paradigm shift” to leave behind the 3rd-order ignorance [3] - software should be considered “not a product, but rather a medium for the storage of knowledge. [...] the other knowledge storage media being, in historical order: dna, brains, hardware, and books. [...] software development is not a product-producing activity, it is a knowledge-acquiring activity.” (that is neither fact, nor proof; it is just expectation.) 6 conclusions and intentions a) non-deterministic software is unavoidable; its development entails not just new design principles but new computing paradigms. b) agent-oriented systems, to be effectual, should merge conventional agent design with approaches employed in advanced distributed systems (where parallelism is intrinsic to the problem, not just a means to speed up). c) the agorithm toolbox - still not sufficient as technological infrastructure for agent-oriented software - is a good framework to go ahead. d) the paradigm shift “from kelvin to zadeh” becomes urgent to keep pace with a rapidly changing e-world. e) almost as a corollary, these paradigm shifts entail also attitudinal ones: shifting from multi-, through inter-, towards genuine trans-disciplinarity. as regards the prospects of non-algorithmic agent-oriented software, short-term intentions include enhancing the agorithm toolbox with two mini-ontologies: a) dynamic: i (agent), you (master), and the rest of the world. b) visual: visual rules should simulate “the arrow of time”. acknowledgement. this work was partially supported by the ministry of education and research through contract no. 12 092/2007. bibliography [1] agentlink iii. agent based computing. agentlink roadmap: overview and consultation report. university of southampton. http://www.agentlink.org/roadmap/al3rm.pdf, 2005. [2] j.r. anderson, l.m. reder, h.a. simon.
applications and misapplications of cognitive psychology to mathematics education. texas educational review, 2000. [3] p.g. armour, the five orders of ignorance. comm. acm, 43 (10), 17-20, 2000. [4] b.e. bărbat, agent-oriented intelligent systems. romanian academy publ. house, bucharest, 2002 (in romanian; “grigore moisil” prize of the romanian academy). [5] b.e. bărbat, e-maieutics. rationale and approach. international journal of computers, communications & control, 3, supplement: suppl. s, 40-54, 2008. [6] b.e. bărbat, natural time for artificial agents. abstracts of icccc papers, băile felix, may 15-17, 27-27, 2008. (invited paper.) [7] i. dziţac, artificial intelligence. ed. house “aurel vlaicu” university, 2008 (in romanian). [8] i. dziţac, parallel and distributed methods for algebraic systems resolution, ccc publications, agora university publishing house, 2006 (in romanian). [9] i. dziţac, icccc 2008 & ewnlc 2008 celebrates bardeen’s centenary and welcomes professor zadeh, international journal of computers communications & control, 3 (suppl. s): 16-25, 2008. [10] i. dziţac, i. moisil, advanced ai techniques for web mining, proc. of the 10th wseas international conference on mathematical methods, computational techniques and intelligent systems (mamectis ’08), corfu, greece, 343-346, 2008. [11] i. dzitac, g. moldovan, distributed systems: information models, agora university publishing house, 2006 (in romanian). [12] fipa tc agent management. fipa agent management specification. standard sc00023k (2004/18/03). http://www.fipa.org/specs/fipa00023/sc00023k.pdf [13] g. gigerenzer, a. edwards. simple tools for understanding risks: from innumeracy to insight. british medical journal, 327, 741-744, 2003. [14] g. gigerenzer, r. selten. bounded rationality. mit press, cambridge, 2002. [15] f. herrera, genetic fuzzy systems: taxonomy, current research trends and prospects. evol. intel., 1, 27-46, 2008. [16] d.
kahneman, maps of bounded rationality: psychology for behavioral economics. lecture (when receiving the nobel prize; revised version). stockholm, nobel foundation, 2002. [17] e. livingston, ethnographies of reason. ashgate, aldershot, uk, 2008. [18] a. rubinstein, modeling bounded rationality. mit press, cambridge, 1998. [19] c. sieckenius de souza, the semiotic engineering of human-computer interaction. the mit press, 2005. [20] h.a. simon, models of bounded rationality. mit press, cambridge, 1997. [21] d. spinellis, rational metaprogramming. ieee software, 25, 1, 78-79, jan/feb, 2008. [22] n. wirth, algorithms + data structures = programs. prentice hall, 1978. [23] l.a. zadeh, d. tufis, f.g. filip, i. dzitac (eds.), from natural language to soft computing: new paradigms in artificial intelligence, romanian academy publishing house, 2008. [24] f. zambonelli, a. omicini. challenges and research directions in agent-oriented software engineering. autonomous agents and multi-agent systems, 9, 253-283, kluwer academic publishers, 2004. ioan dzitac received an m.sc. in mathematics (1977) and a ph.d. in information science (2002) from “babes-bolyai” university of cluj-napoca. his current research interests include different aspects of artificial intelligence and parallel and distributed computing. he has edited 6 conference proceedings, published 16 books and more than 50 scientific papers in journals and conference proceedings. he was a member of the program committees of more than 30 international conferences. boldur e. bărbat received an m.sc. in electronic engineering, a postgraduate specialisation in programming, and a ph.d. in digital computers (“politehnica” university of bucharest). he has been with “lucian blaga” university of sibiu since 1997 (full professor, faculty of engineering, faculty of sciences) and with “politehnica” university of timişoara since 2005 (faculty of automation and computers, advisor for doctoral studies in computer science).
he has published 25 books (a monograph received the romanian academy it prize, 2002) and about 50 papers in english since 2001. current research interests: time in artificial intelligence; non-algorithmic decision support systems; computer-aided semiosis; self-awareness of bodiless agents; agent-oriented software engineering; logics for agents; emergence in agent-based systems. int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. iii (2008), no. 2, pp. 172-182 analytical model for a multiprocessor with private caches and shared memory angel vassilev nikolov abstract: we develop an analytical model of a multiprocessor with private caches and shared memory and obtain the following results: the instantaneous state probabilities and the steady-state probabilities of the system. both transient behaviour and equilibrium can be studied and analyzed. we show that the results can be applied to determine the output parameters for both blocking and non-blocking caches. keywords: invalidate cache-coherence protocol, queuing system, discrete transform 1 introduction shared memory multiprocessors are widely used as platforms for technical and commercial computing [2]. performance evaluation is a key technology for design in computer architecture. the continuous growth in complexity of systems is making this task increasingly difficult [7]. in general, the problem of developing effective performance evaluation techniques can be stated as finding the best trade-off between accuracy and speed. the most common approach to estimate the performance of a superscalar multiprocessor is through building a software model and simulating the execution of a set of benchmarks. since processors are synchronous machines, however, simulators usually work at cycle level and this leads to enormous slowdown [9]. it might take hours, even days, to simulate. for memory structures, relatively accurate analytical models were developed [3, 7, 9, 10] through extensive use of various queuing systems.
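the accuracy/speed trade-off can be felt even in a toy setting (my illustration, not the paper’s model: a plain m/m/1 queue stands in for the memory hierarchy): the mean waiting time has a closed form that is instantaneous to evaluate, while estimating the same number by simulation needs many samples of lindley’s recursion.

```python
# illustration only (not the paper's model): closed-form queueing result vs.
# simulation, for an m/m/1 queue with arrival rate lam and service rate mu.
# lindley's recursion: W_{k+1} = max(0, W_k + S_k - A_{k+1}).
import random

lam, mu = 0.5, 1.0                      # rho = lam/mu = 0.5
analytic_wq = lam / (mu * (mu - lam))   # m/m/1 mean waiting time in queue

random.seed(42)
w, total, n = 0.0, 0.0, 200_000
for _ in range(n):
    s = random.expovariate(mu)          # service time of current customer
    a = random.expovariate(lam)         # inter-arrival time to next customer
    w = max(0.0, w + s - a)             # lindley's recursion
    total += w
sim_wq = total / n

print(analytic_wq, round(sim_wq, 2))    # the simulation converges to the formula
```

the closed form costs one division; the simulation costs hundreds of thousands of random draws for a noisy estimate of the same quantity, which is the slowdown argument in miniature.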
an open queue system with poisson arrivals and exponential service times is considered quite good for the description of memory hierarchies [7]. our focus is on the impact of the cache-coherence protocols on the overall system performance. the most commonly used technique for this purpose is mean value analysis (mva) [3, 5, 7, 8, 9]. it allows the total number of customers to be fixed (closed queue system), and this seems to be a more adequate representation of the processes of self-blocking requestors [5]. calculations of output parameters such as residency times, waiting times and utilization are shown in [3, 8, 9]. mva is based on the forced-flow law, which means that in equilibrium the output rate equals the input rate. however, instantaneously, we can have an input rate different from the output rate, so that the instantaneous probabilities could be different from the equilibrium ones [7]. mva offers no possibility to study transient effects. moreover, the assumption of exponential service times is not realistic; in fact, all bus access times and memory access times are constants. it will be seen later in this paper that the state probabilities depend on the server’s service-time density function. we use the technique of markov processes to describe the behaviour of a multiprocessor implementing cache-coherence protocols. 2 definition and analysis of the model a multiprocessor consists of several processors connected to a shared main memory by a common complete-transaction bus. each processor has a private cache. when a processor issues a request to its cache, the cache controller examines the state of the cache and takes suitable action, which may include generating a bus transaction to access main memory. coherence is maintained by having all cache controllers "snoop" on the bus and monitor the transactions. snoopy cache-coherence protocols fall into two major categories: invalidate and update [2, 3, 10].
invalidating protocols are studied here, but the concepts can be applied, with some modifications, to updating protocols too. transactions may or may not include the memory block and the shared bus. a typical transaction that does not include the memory block is “invalidate cache copy”, which occurs when a processor requests writing in the cache. all other processors simply change the status bit(s) of their own copies to invalid. if the memory block is uncached or not clean it can be uploaded from the main memory, but in today’s multiprocessors it is rather uploaded from another cache designated as owner (o) (cache-to-cache transfer). memory-to-cache transfer occurs when the only clean copy is in the main memory. a cache block is written back (wb) to the main memory (the bus is used) when a dirty copy is evicted [6]. the bus and the main memory are also used when synchronization procedures are executed [2]. apparently the bus can be considered the bottleneck of the system. in terms of queuing theory, processors can be viewed as customers (clients) and the bus can be viewed as a server. inter-arrival times are exponentially distributed with parameter λ. this assumption is adequate for most applications [7]. requests are served on a first come first served (fcfs) basis. immediately after issuing a request for cache-to-cache transfer or a synchronization procedure, the customer blocks itself. the service time for a blocking request has a density function f1(x). when service is completed, the processor (customer) resumes processing with probability p, or resumes processing and generates a new request with probability q (p+q=1). details on how to obtain the input parameters are given in [2, 3, 8, 9]. this new request has a different density function f2(x) and corresponds to a wb transaction.
it does not block the customer but the server is held until completion of wb transaction therefore adding to the queue. the system can be in one of the following states: 1) n: all n customers are doing internal processing; 2) j, 1: j customers are doing internal processing (n-j are blocked respectively) and all requests are of type 1(0≤j≤n-1), 3) j ,2: j customers are doing internal processing , the server is serving request of type 2, and n-j customers are waiting in the queue for service of type 1 (0≤j≤n). the transitions between these states are illustrated in fig. 1. throughout this paper we use the following notations pn (t) probability[all n customers are doing internal processing at time t] p j,i(t,x) probability[j customers are doing internal processing, n-j are in the queue and/or in the server, and the server is busy doing service of type i at time t and the elapsed service time lies between x and x+dx ] p j,i(x) probability[in the equilibrium state j customers are doing internal processing, n-j are in the queue and/or in the server, the server is busy doing service of type i and the elapsed service time lies between x and x+dx ] p j,i(t) probability[j customers are doing internal processing, n-j are in the queue or in the server, the server is busy doing service of type i at time t] pn , p j,i steady-state probabilities. pn = limt→∞ pn (t), pj,i = ∫ ∞ 0 pj,i(x)dx βi = jλ fi(x) cumulative distribution function (c.d.f.) of the service time of type i ; i=1,2 fi(x) probability density function (p.d.f.) of the service time of type i ; i=1,2 δ m,n kronecker delta 1 µi = ∫ ∞ 0 xfi(x)dx i=1,2 hi(x) = fi(x) 1−fi(x) service rate for type i; i=1,2 fi(s), fi(s+βn), fi(βn) laplace transforms (lt) of fi(x) t.u. time unit viewing the nature of the system, we obtain the following set of integro-differential equations [ d dt + βn ] pn = p ∫ t 0 pn−1(t, x)h1(x)dx + ∫ t 0 pn,2(t, x)h2(x)dx (1) 174 angel vassilev nikolov figure 1: state-transition diagram of the model. 
1 ≤ j ≤ n [ d dt + ∂ ∂ t + βn−1 + h1(x) ] pn−1,1(t, x) = 0 (2) [ d dt + ∂ ∂ t + βn + h2(x) ] pn,2(t, x) = 0 (3) [ d dt + ∂ ∂ t + β j + hi(x) ] pj,i(t, x) = β j+1pj+1,i(t, x) (4) for i = 1, 1 ≤ j ≤ n −1; i = 2, 1 ≤ j ≤ n [ d dt + ∂ ∂ t + hi(x) ] p0,i(t, x) = β1p1,i(t, x) (5) for i=1,2 having the following boundary and initial conditions pj,1(t, 0) = (1−δ j,0)p ∫ ∞ 0 pj−1,1(t, x)h1(x)dx + ∫ ∞ 0 pj,2(t, x)h2(x)dx + δ j,n−1βn pn (t) (6) for 0 ≤ j ≤ n −1 pj,2(t, 0) = q ∫ ∞ 0 pj−1,1(t, x)h1(x)dx (7) for 1 ≤ j ≤ n pn (0) = 1, p0,2(t, 0) = 0, pj,i(0, 0) = 0 (8) for i = 1, 1 ≤ j ≤ n −1; i = 2, 1 ≤ j ≤ n by using laplace transform and discrete transform [4, 8] the above equations are transformed as follows (s + βn) pn = 1 + p ∫ ∞ 0 pn−1(s, x)h1(x)dx + ∫ ∞ 0 pn,2(s, x)h2(x)dx (9) analytical model for a multiprocessor with private caches and shared memory 175 [ s + d dx + β j + hi(x) ] u j,i(s, x) = 0 (10) for i = 1, 1 ≤ j ≤ n −1; i = 2, 1 ≤ j ≤ n [ s + d dx + +hi(x) ] p0,i(s, x) = β1p1,i(s, x) (11) for i=1,2 where u j,1(s, x) = ∑n−1n= j ( n j ) pn,1(s, x), pj,1(s, x) = ∑n−1n= j (−1)n− j ( n j ) un,1(s, x) for 1 ≤ j ≤ n − 1, and u j,2(s, x) = ∑nn= j ( n j ) pn,2(s, x), pj,2(s, x) = ∑nn= j(−1)n− j ( n j ) un,2(s, x) for 1 ≤ j ≤ n. let v j,i(s, x) = u j,i(s,x) 1−fi(x) and p ′ 0,1(s, x) = p0,1(s,x) 1−fi(x) . then from (10 and 11) we have after some transformations [ s + d dx + βi ] v j,i(s, x) = 0 for i = 1, 1 ≤ j ≤ n −1; i = 2, 1 ≤ j ≤ n and [ s + d dx ] p ′ 0,i(s, x) = β1p1,i(s, x) for i = 1, 2. hence the solutions of (9-11) are u j,i(s, x) = [1−fi(x)]u j,i(s, 0)e−(s+βi)x (12) pn (s) = 1 + p f1(s + βn−1)un−1,1(s, 0) + f2(s + βn )un,2(s, 0) s + βn (13) p0,1(s, x) = [1−f1(x)]β1e−sx [ p0,1(s, 0) + n−1 ∑ n=1 (−1)n−1n 1−e −βnx βn un,1(s, 0) ] (14) p0,2(s, x) = [1−f2(x)]β1e−sx [ n ∑ n=1 (−1)n−1n 1−e −βnx βn un,2(s, 0) ] . 
(15) by integrating (12, 14, and 15) we obtain the lt of the instantaneous probabilities pj,1(s) = n−1 ∑ n= j (−1)n− j ( n j ) [ 1− f1(s + βn) s + βn ] un,1(s, 0) (16) for 1 ≤ j ≤ n −1 pj,2(s) = n ∑ n= j (−1)n− j ( n j ) [ 1− f2(s + βn) s + βn ] un,2(s, 0) (17) for 1 ≤ j ≤ n p0,1(s) = p0,1(s, 0) [ 1− f1(s) s ] + β1 n−1 ∑ n=1 (−1)n−1n [ 1− f1(s) s − 1− f1(s + βn) s + βn ] un,1(s, 0) βn (18) 176 angel vassilev nikolov p0,2(s) = β1 n ∑ n=1 (−1)n−1n [ 1− f2(s) s − 1− f2(s + βn) s + βn ] un,2(s, 0) βn . (19) taking lt of (6-7) and using (8 and 12-15) we get after some transformations the following system of linear equations n−1 ∑ n= j (−1)n− j ( n j ) un,1(s, 0) = p n−1 ∑ n= j (−1)n− j+1 ( n j −1 ) f1(s + βn)un,1(s, 0) (20) + n ∑ n= j (−1)n− j ( n j ) f2(s + βn)un,2(s, 0) + δ j,n−1βn pn for 2 ≤ j ≤ n −1 n ∑ n= j (−1)n− j ( n j ) un,2(s, 0) = q n−1 ∑ n= j−1 (−1)n− j+1 ( n j −1 ) f1(s + βn)un,1(s, 0) (21) for 2 ≤ j ≤ n n−1 ∑ n=1 (−1)n−1 ( n j ) un,1(s, 0) = pp0,1(s, 0) f1(s) + pβ1 [ n−1 ∑ n=1 (−1)n−1n f1(s)− f1(s + βn) βn un,1(s, 0) ] (22) n ∑ n=1 (−1)n−1 ( n j ) un,2(s, 0) = qp0,1(s, 0) f1(s) + qβ1 [ n−1 ∑ n=1 (−1)n−1n f1(s)− f1(s + βn) βn un,1(s, 0) ] (23) coefficients u j,i(s,0) can now be determined from the above equations. we can apply the final-value theorem to (16-19) to obtain the steady-state probabilities but it will require use of the l’hopital rule and seems difficult and impractical [11]. instead we set the following differential equations βnpn = p ∫ ∞ 0 pn−1(x)h1(x)dx + ∫ ∞ 0 pn,2(x)h2(x)dx (24) [ d dx + βn−1 + h1(x) ] pn−1,1(x) = 0 (25) [ d dx + βn + h2(x) ] pn,2(x) = 0 (26) [ d dx + β j + hi(x) ] pj,i(x) = β j+1pj+1,i(x) (27) for i = 1, 1 ≤ j ≤ n −1; i = 2, 1 ≤ j ≤ n [ d dx + hi(x) ] p0,i(x) = β1p1,i(x) (28) for i=1,2. 
equations (24)-(28) are to be solved under the following boundary conditions and normalizing condition
\[p_{j,1}(0)=(1-\delta_{j,0})\,p\int_0^\infty p_{j-1,1}(x)h_1(x)\,dx+\int_0^\infty p_{j,2}(x)h_2(x)\,dx+\delta_{j,N-1}\beta_N p_N\tag{29}\]
for 0 ≤ j ≤ N−1
\[p_{j,2}(0)=q\int_0^\infty p_{j-1,1}(x)h_1(x)\,dx\tag{30}\]
for 1 ≤ j ≤ N−1
\[p_{0,2}(0)=0\tag{31}\]
\[p_N+\sum_{j=0}^{N-1}p_{j,1}+\sum_{j=0}^{N}p_{j,2}=1.\tag{32}\]
the solutions of (29)-(32) are
\[p_N=\frac{1+p\,F_1(\beta_{N-1})\,u_{N-1,1}(0)+F_2(\beta_N)\,u_{N,2}(0)}{\beta_N}\tag{33}\]
\[p_{j,1}=\sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}\Big[\frac{1-F_1(\beta_n)}{\beta_n}\Big]u_{n,1}(0)\tag{34}\]
for 1 ≤ j ≤ N−1
\[p_{j,2}=\sum_{n=j}^{N}(-1)^{n-j}\binom{n}{j}\Big[\frac{1-F_2(\beta_n)}{\beta_n}\Big]u_{n,2}(0)\tag{35}\]
for 1 ≤ j ≤ N−1
\[p_{0,1}=\frac{p_{0,1}(0)}{\mu_1}+\sum_{n=1}^{N-1}(-1)^{n-1}n\Big[\frac{1}{\mu_1}-\frac{1-F_1(\beta_n)}{\beta_n}\Big]u_{n,1}(0)\tag{36}\]
\[p_{0,2}=\sum_{n=1}^{N}(-1)^{n-1}n\Big[\frac{1}{\mu_2}-\frac{1-F_2(\beta_n)}{\beta_n}\Big]u_{n,2}(0)\tag{37}\]
for u_{j,i}(0) and p_{0,1}(0) we have
\[\sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}u_{n,1}(0)=p\sum_{n=j}^{N-1}(-1)^{n-j+1}\binom{n}{j-1}F_1(\beta_n)u_{n,1}(0)+\delta_{j,N-1}\beta_N p_N\tag{38}\]
for 2 ≤ j ≤ N−1
\[\sum_{n=j}^{N-1}(-1)^{n-j}\binom{n}{j}u_{n,2}(0)=q\sum_{n=j}^{N-1}(-1)^{n-j+1}\binom{n}{j-1}F_1(\beta_n)u_{n,1}(0)\tag{39}\]
for 2 ≤ j ≤ N−1
\[p_{0,1}(0)=\beta_1\sum_{n=1}^{N}(-1)^{n-1}n\Big[\frac{1-F_2(\beta_n)}{\beta_n}\Big]u_{n,2}(0)\tag{40}\]
\[\sum_{n=1}^{N-1}(-1)^{n-1}n\,u_{n,1}(0)=p\,p_{0,1}(0)+p\beta_1\sum_{n=1}^{N-1}(-1)^{n-1}n\Big[\frac{1-F_1(\beta_n)}{\beta_n}\Big]u_{n,1}(0)+\sum_{n=1}^{N}n\,F_2(\beta_n)\,u_{n,2}(0)\tag{41}\]
\[\sum_{n=1}^{N}(-1)^{n-1}n\,u_{n,2}(0)=q\,p_{0,1}(0)+q\beta_1\sum_{n=1}^{N-1}(-1)^{n-1}n\Big[\frac{1-F_1(\beta_n)}{\beta_n}\Big]u_{n,1}(0)\tag{42}\]
the coefficients u_{j,i}(0) can be determined from (32) and (38)-(42).

3 examples

in order to obtain the transient state probabilities we first have to determine p_N(s) and p_{j,i}(s) from (16)-(19) and (20)-(23), and then apply the inverse laplace transform to them. we used the packages of maple 8 on a standard pc platform under windows xp for these computations [12]. results were produced and printed in less than a second. for N = 4 the instantaneous probabilities are listed in appendix a. various performance characteristics can be computed using the steady-state probabilities.
for example, the average number of blocked customers (anbc) in the case of blocking caches is given by
\[\mathrm{anbc}=\sum_{i=1}^{2}\sum_{j=0}^{N}(N-j)\,p_{j,i}.\tag{43}\]
in the case of non-blocking caches the anbc is
\[\mathrm{anbc}=\sum_{j=0}^{N}(N-j-1+k)\,p_{j,1}+\sum_{j=0}^{N-1}(N-j)\,p_{j,2},\tag{44}\]
where k is the ratio of average memory stall time [2]. k depends strongly on the application. (1−k) is in fact the fraction of time the processor is consuming data while a cache-to-cache or memory-to-cache transfer is in progress. in appendix b we list the anbc for two popular service time distributions, exponential and erlangian [1], for blocking and fully non-blocking caches (k = 0). the time needed to solve (33)-(42) and to calculate the anbc was negligibly short.

4 concluding remarks

this work presented a model for a shared-bus, shared-memory multiprocessor with private caches; it captures the whole spectrum of invalidate-type cache coherence protocols. although we started with a fairly sophisticated set of integro-differential equations, the output of the model is a small set of linear equations from which the state probabilities can be determined. the approach eliminates the main drawbacks of the most commonly used mva analysis: the inability to deal with transients and the constraint on the service time distribution. the model gives insights into the transient behaviour of the system. moreover, the assumption of exponentially distributed service times can be dropped; any continuous distribution can be used. the ease of obtaining performance measures in negligible time makes the incorporation of the model into a multiprocessor design tool very feasible.

bibliography
[1] s. k. bose, introduction to queuing systems, kluwer/plenum publishers, 2001
[2] j. l. hennessy, d. a. patterson, computer architecture: a quantitative approach, pearson publishers, 2003
[3] m. c. chiang, memory system design for bus based multiprocessor, phd thesis, university of wisconsin, 1991
[4] t. itoi, t. nishida, m. kodama and e. ohi, n-unit parallel redundant system with correlated failures and single repair facility, microelectronics and reliability, vol. 17, pp. 279-285, 1978
[5] e. lazowska, j. zahorjan, g. graham, and k. sevcik, quantitative system performance: computer system analysis using queuing network models, prentice-hall, englewood cliffs, nj, may 1984
[6] a. louri, a. k. kodi, an optical interconnection network and a modified snooping protocol for the design of large-scale symmetric multiprocessors (smps), ieee transactions on parallel and distributed systems, vol. 15, no. 12, dec. 2004, pp. 1093-1104
[7] r. e. matick, comparison of analytic performance models using closed mean-value analysis versus open-queuing theory for estimating cycles per instruction of memory hierarchies, ibm journal of research and development, jul 2003
[8] d. j. sorin et al., a customized mva model for ilp multiprocessors, technical report no. 1369, university of wisconsin-madison, 1998
[9] d. j. sorin et al., evaluation of shared-memory parallel systems with ilp processors, proc. 25th int'l symp. on computer architecture, june 1998, pp. 180-191
[10] j. sustersic, a. hurson, coherence protocols for bus-based and scalable multiprocessors, internet and wireless distributed computing environments: a survey, advances in computers, vol. 59, 2003, pp.
211-278
[11] j. l. schiff, the laplace transform, springer, 1999
[12] waterloo maple inc., introduction to maple 8, 2002

appendix a

for N = 4, λ = 0.001 [1/t.u.], f1(x) = 0.1exp(-0.1x), and f2(x) = 0.01exp(-0.01x) the instantaneous probabilities are

p4(t) = 0.9211361286 + 0.8058476879e-2*exp(-0.1248619627*t) + 0.8535072295e-2*exp(-0.1089825679*t) + 0.9049529656e-2*exp(-0.9494144284e-1*t) + 0.9696074769e-2*exp(-0.8072343638e-1*t) + 0.1774027054e-3*exp(-0.1510201407e-1*t) + 0.1728181365e-2*exp(-0.1398702636e-1*t) + 0.5618533851e-2*exp(-0.1256085210e-1*t) + 0.1211910345e-1*exp(-0.1067946234e-1*t) + 0.2388149701e-1*exp(-0.8161235321e-2*t),

p31(t) = 0.3792913471e-1 - 0.1093143496e-1*exp(-0.1248619627*t) - 0.1007354818e-1*exp(-0.1089825679*t) - 0.9271731350e-2*exp(-0.9494144284e-1*t) - 0.8405607569e-2*exp(-0.8072343638e-1*t) + 0.2658572506e-2*exp(-0.1510201407e-1*t) - 0.2212663963e-5*exp(-0.1398702636e-1*t) - 0.3015750621e-3*exp(-0.1256085210e-1*t) - 0.6739486112e-3*exp(-0.1067946234e-1*t) - 0.9276492013e-3*exp(-0.8161235321e-2*t),

p21(t) = 0.1420742616e-2 + 0.2324288902e-2*exp(-0.1248619627*t) + 0.2986557798e-3*exp(-0.1089825679*t) - 0.1243034332e-2*exp(-0.9494144284e-1*t) - 0.2544948528e-2*exp(-0.8072343638e-1*t) - 0.5329230583e-2*exp(-0.1510201407e-1*t) + 0.6737760872e-2*exp(-0.1398702636e-1*t) + 0.4442015626e-3*exp(-0.1256085210e-1*t) - 0.5688666731e-3*exp(-0.1067946234e-1*t) - 0.1539483112e-2*exp(-0.8161235321e-2*t),

p11(t) = 0.7684028624e-4 - 0.2290986748e-3*exp(-0.1248619627*t) + 0.3160128043e-3*exp(-0.1089825679*t) + 0.2148505581e-3*exp(-0.9494144284e-1*t) - 0.3252577041e-3*exp(-0.8072343638e-1*t) + 0.3775725072e-2*exp(-0.1510201407e-1*t) - 0.8400171763e-2*exp(-0.1398702636e-1*t) + 0.4974831098e-2*exp(-0.1256085210e-1*t) + 0.5143708805e-3*exp(-0.1067946234e-1*t) - 0.9181578239e-3*exp(-0.8161235321e-2*t),

p01(t) = 0.9242283829e-5*exp(-0.1248619627*t) - 0.3513395647e-4*exp(-0.1089825679*t) + 0.4257071327e-4*exp(-0.9494144284e-1*t) - 0.1688587212e-4*exp(-0.8072343638e-1*t) - 0.9200675435e-3
*exp(-0.1510201407e-1*t) + 0.2810623081e-2*exp(-0.1398702636e-1*t) - 0.3183584754e-2*exp(-0.1256085210e-1*t) + 0.1611954912e-2*exp(-0.1067946234e-1*t) - 0.3239077071e-3*exp(-0.8161235321e-2*t) + 0.5218152790e-5,

p42(t) = 0.2709223908e-1 + 0.9859983387e-3*exp(-0.1248619627*t) + 0.1060558367e-2*exp(-0.1089825679*t) + 0.1145474465e-2*exp(-0.9494144284e-1*t) + 0.1259769099e-2*exp(-0.8072343638e-1*t) - 0.2412466943e-1*exp(-0.1510201407e-1*t) - 0.1705507775e-2*exp(-0.1398702636e-1*t) - 0.2095511260e-2*exp(-0.1256085210e-1*t) - 0.2029637013e-2*exp(-0.1067946234e-1*t) - 0.1588776483e-2*exp(-0.8161235321e-2*t),

p32(t) = -0.2421204825e-3*exp(-0.1248619627*t) - 0.7509940526e-4*exp(-0.1089825679*t) + 0.9576676158e-4*exp(-0.9494144284e-1*t) + 0.3013803504e-3*exp(-0.8072343638e-1*t) + 0.7126069503e-1*exp(-0.1510201407e-1*t) - 0.6135152996e-1*exp(-0.1398702636e-1*t) - 0.8971987351e-2*exp(-0.1256085210e-1*t) - 0.5950006752e-2*exp(-0.1067946234e-1*t) - 0.4494935895e-2*exp(-0.8161235321e-2*t) + 0.9428952497e-2,

p22(t) = 0.2421271696e-2 + 0.2626333487e-4*exp(-0.1248619627*t) - 0.3154175021e-4*exp(-0.1089825679*t) - 0.2945613244e-4*exp(-0.9494144284e-1*t) + 0.3412946115e-4*exp(-0.8072343638e-1*t) - 0.8108903801e-1*exp(-0.1510201407e-1*t) + 0.1349032466*exp(-0.1398702636e-1*t) - 0.4071010637e-1*exp(-0.1256085210e-1*t) - 0.9622074403e-2*exp(-0.1067946234e-1*t) - 0.5904604182e-2*exp(-0.8161235321e-2*t),

p12(t) = -0.1765308077e-5*exp(-0.1248619627*t) + 0.4800731626e-5*exp(-0.1089825679*t) - 0.4448905932e-5*exp(-0.9494144284e-1*t) + 0.1599282603e-5*exp(-0.8072343638e-1*t) + 0.4177917201e-1*exp(-0.1510201407e-1*t) - 0.9973555226e-1*exp(-0.1398702636e-1*t) + 0.7256040480e-1*exp(-0.1256085210e-1*t) - 0.9747995399e-2*exp(-0.1067946234e-1*t) - 0.5300997812e-2*exp(-0.8161235321e-2*t) + 0.4449749927e-3,

p02(t) = -0.4618227199e-6*exp(-0.1248619627*t) - 0.2203030325e-6*exp(-0.1089825679*t) + 0.4881890483e-7*exp(-0.9494144284e-1*t) - 0.1699392257e-7*exp(-0.8072343638e-1*t) - 0.8188760719e-2*exp(-0.1510201407e-1*t) + 0.2501502200e-1*exp(-0.1398702636e-1*t)
-0.2833447693e-1*exp(-0.1256085210e-1*t) + 0.1434663085e-1*exp(-0.1067946234e-1*t) - 0.2882912592e-2*exp(-0.8161235321e-2*t) + 0.4449749927e-4.

in the above expressions e-i stands for 10^(-i), i = 1, ..., 7.

appendix b

table 1: N = 8, f1(x) = 0.1exp(-0.1x), f2(x) = 0.01exp(-0.01x)

λ [1/t.u.] | p   | anbc for blocking caches | anbc for fully non-blocking caches
0.001      | 0.9 | 0.154099881194466        | 0.075640880006411
0.002      | 0.9 | 0.441552853804251        | 0.290383910880334
0.003      | 0.9 | 0.822750601431095        | 0.607433119474025
0.004      | 0.9 | 1.253944990222998        | 0.984102789831906
0.001      | 0.8 | 0.230012889507952        | 0.152313018403034
0.002      | 0.8 | 0.729883782777377        | 0.584481458432927
0.003      | 0.8 | 1.382033782478873        | 1.183494795953230
0.004      | 0.8 | 2.063720956300253        | 1.826269794552253

table 2: N = 8, f1(x) = 0.1³x²exp(-0.1x)/2!, f2(x) = 0.01³x²exp(-0.01x)/2!

λ [1/t.u.] | p   | anbc for blocking caches | anbc for fully non-blocking caches
0.001      | 0.9 | 0.384839057891723        | 0.211437492029451
0.002      | 0.9 | 1.313451009452606        | 0.582993712839022
0.003      | 0.9 | 2.390481400874492        | 1.782339618354729
0.004      | 0.9 | 3.691834116720534        | 2.882438452093385
0.001      | 0.8 | 0.614956120345239        | 0.400820549913285
0.002      | 0.8 | 2.611487230549326        | 1.722034656332087
0.003      | 0.8 | 4.062557145097248        | 3.429652938504840
0.004      | 0.8 | 5.899361833023557        | 5.394204692051840

angel vassilev nikolov
national university of lesotho
department of mathematics and computer science
roma 180, lesotho
e-mail: av.nikolov@nul.ls

received: december 17, 2007

angel vassilev nikolov received the beng degree in electronic and computer engineering from the technical university of budapest, hungary, in 1974 and the phd degree in computer science from the bulgarian academy of sciences in 1982, where he worked as a research associate. in 1989 he was promoted to associate research professor in bulgaria.
dr nikolov also served as a lecturer in computer science at the national university of science and technology, bulawayo, zimbabwe, and at the grande prairie regional college, alberta, canada, and as an associate professor at sharjah college, united arab emirates. his research interests include computer architecture, performance evaluation of multiprocessors, and reliability modeling. he has published numerous journal and conference articles and holds four patents on the above topics.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 744-754

complex computer simulations, numerical artifacts, and numerical phenomena

d.-a. iordache, p. sterian, f. pop, a.r. sterian

dan-alexandru iordache, paul sterian, andreea rodica sterian
physics department, university politehnica of bucharest, 313 splaiul independentei, bucharest 060042, romania
e-mail: {daniordache2003,paul.sterian,andreea_rodica_sterian}@yahoo.com

florin pop
lecturer, computer science department, university politehnica of bucharest, 313 splaiul independentei, bucharest 060042, romania
e-mail: florin.pop@cs.pub.ro

abstract: the study of some typical complex computer simulations presenting one or more complexity features, such as a) symmetry breaking, b) nonlinear properties, c) dissipative processes, d) high logical depth, e) self-organizing processes, etc., allows us to point out several numerical artifacts, namely: (i) distortions, (ii) scattering, (iii) pseudo-convergence, (iv) instability, (v) misleading (false) symmetry-breaking simulations, and others. the detailed analysis of these artifacts allowed clarifying the numerical mechanisms of some of them, which can in the following be called numerical phenomena, because their basic features can be exactly predicted.
keywords: computer simulations, numerical artifacts, numerical phenomena, self-organizing processes.
1 introduction

we live in a computerized world, our civilization being a "civilization of computers". taking into account that computers control the work of ever-present complex installations and devices, the appearance (due to some numerical phenomena) of important distortions of the simulated processes usually leads to major failures of the technical installations. in particular, the events referring to the erroneous computer (numerical) simulation and design of the flight of the patriot missile, which failed (with disastrous results) during the gulf war in 1991 to stop a scud missile [1], and the self-destruction of the european space agency's ariane 5 rocket 37 seconds after its launch [2], were both assigned to computer errors [3] and to their associated numerical artifacts/phenomena. given that computer simulations are considerably cheaper than experimental studies and that they allow the prediction of the behavior of physical systems even in inaccessible conditions, computer simulations are widely used in technical studies. because modern (optimized) technical systems are complex [4], the computer simulations are also complex, and for this reason they generate specific numerical artifacts. in fact, there is a huge number of publications reporting such phenomena (usually related to some complex numerical simulations), as can easily be found by consulting some search systems, e.g. the google system. as results from table 1, even when the "parasitic" published works entitled numerical simulations of ... boiling phenomena, etc. are eliminated by the use of quotation marks ("), very large numbers of published works remain in these fields.
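the patriot clock-drift error mentioned above can be reproduced arithmetically. the sketch below (plain python; the figures used, a 0.1 s tick chopped to 23 fractional bits of a 24-bit register and 100 hours of continuous operation, are the commonly cited ones from the public post-failure analyses and are assumed here only for illustration) accumulates the truncation error of the binary representation of 0.1:

```python
from fractions import Fraction

# 0.1 s has no finite binary expansion; the patriot software multiplied an
# integer tick count by 0.1 stored in fixed point with 23 fractional bits,
# so every tick carried the same small chopping error.
tick_exact = Fraction(1, 10)
tick_chopped = Fraction(int(0.1 * 2**23), 2**23)   # 0.1 truncated to 23 bits

error_per_tick = tick_exact - tick_chopped          # about 9.5e-8 s per tick
ticks = 100 * 3600 * 10                             # 100 hours of 0.1 s ticks
drift = float(error_per_tick * ticks)               # accumulated clock error [s]

print(f"error per tick: {float(error_per_tick):.3e} s")
print(f"drift after 100 h: {drift:.3f} s")          # about 0.34 s
```

at a scud closing speed of roughly 1.7 km/s, a clock error of about a third of a second displaces the tracking gate by several hundred metres, which is consistent with the reported failure to intercept.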
this work deals with the study of the possibilities to discover the mechanisms of the artifacts intervening in some complex simulations and to predict quantitatively the basic parameters of the computer-generated errors, transforming so the observed numerical artifacts/artefacts into the so-called numerical phenomena.

copyright c⃝ 2006-2010 by ccc publications

topics               | no quotation marks | with quotation marks, e.g. "numerical artifacts"
numerical artifacts  | 1,180,000          | 9,370
numerical artefacts  | 126,000            | 2,440
numerical phenomena  | 4,160,000          | 1,340
complex simulations  | 3,000              |

table 1: numbers of published papers found by the google search system (beginning of 2009)

we have to underline from the beginning that the discovery of these mechanisms belongs to the field of number theory and that many problems in the field of number theory are extremely difficult. e.g., the statement of (pierre de) fermat's last (greatest) theorem was published (after his death, by his eldest son clément samuel fermat) in 1670 [4], but its solution was found only in 1995 [5] by professor andrew wiles [6]. we will focus mainly on the study of the main features of the classical [7], [8] and of the newly found [9], [10] numerical phenomena associated with the finite differences (fd) simulations [11] of pulse propagation through media with sharp interfaces and attenuative character, as well as of other numerical methods (the random walk method, the gradient one, etc.), applied to the study of different physical processes, such as diffusion [12], [13], solitary wave propagation, applications to the evaluation of the parameters of some physical systems, etc.
2 symmetry breaking in some computing programs

2.1 symmetry breaking of the wave equation in ideal media

the finite differences (fd) discretization of the wave equation in ideal media:
\[\frac{\partial^2 w}{\partial \tilde t^2}=v_\varphi^2\,\frac{\partial^2 w}{\partial x^2}\;\rightarrow\;\frac{w_{t+1}+w_{t-1}-2w}{\tau^2}=v_\varphi^2\,\frac{w_{i+1}+w_{i-1}-2w}{\epsilon^2}\tag{1}\]
is symmetrical relative to the space steps i−1, i, i+1 and the time steps t−1, t, t+1 if the fd velocity and the wave propagation one are equal:
\[v_{fd}=\frac{\epsilon}{\tau}=v_\varphi.\tag{2}\]
defining courant's number [7] by means of the relation:
\[c=\frac{v_\varphi}{v_{fd}}\tag{3}\]
one finds easily that if: (i) c > 1, instabilities intervene, because the fd scheme does not use all the information received at the observation point (v_φ > v_fd); (ii) c < 1, distortions intervene, because for v_φ < v_fd the fd scheme uses more information than it receives, and this additional information acts as jamming; (iii) c = 1, we have an ideal fd scheme (stable and convergent), because this scheme has all the necessary physical information and nothing more! one finds that the symmetry breaking for values of the courant number different from 1 leads to the "classical" numerical phenomena.

2.2 symmetry breaking of the smoothing model of a sharp interface

consider a sharp interface between 2 homogeneous elastic media. if the method of finite differences (fd) is used, then in order to avoid the use of the dirac function a certain smoothing of the sharp interface is necessary, spreading it over 2 or 3 fd nodes, whose indices are denoted as i−1, i and i+1 (see figure 1).

figure 1: smoothing models of the sharp 1-d interfaces (see also [14]-a).

then the differential equation \(\rho(x)\frac{\partial^2 w}{\partial\tilde t^2}=\frac{\partial}{\partial x}\big[S(x)\frac{\partial w}{\partial x}\big]\) of the elastic pulse propagation through an inhomogeneous medium becomes:
\[\tilde\rho_i\,\frac{w_{t+1}+w_{t-1}-2w}{\tau^2}=\Big\langle\frac{\partial S}{\partial x}\Big\rangle_i\,\frac{w_{i+1}-w_{i-1}}{2\epsilon}+\tilde S_i\,\frac{w_{i+1}+w_{i-1}-2w}{\epsilon^2},\tag{4}\]
where \(\tilde\rho_i=\langle\rho\rangle_i\), \(\tilde S_i=\langle S\rangle_i\) and \(\langle\partial S/\partial x\rangle_i\) are the chosen average values around the fd node i.
the chosen expressions of these average values (see table 2) may or may not succeed in restoring the (apparent) symmetry of the propagation medium in the frame of the fd simulation; e.g., one finds from the examination of table 2 that all required expressions are symmetrical around the sites i−1, i and i+1 for model 1, and around the sites i and i+1 for model 2a, while this general symmetry is not kept for all average expressions of models 2b, 3a and 3b. that is why, while the smoothing models 1 and 2a always ensure stable and convergent numerical simulations, for the smoothing models 2b, 3a and 3b, respectively, the basic types of usual numerical artifacts appear [7, 8, 14]: a) the instability, b) the pseudo-convergence. the plots of these numerical artifacts for the above-indicated 5 types of studied finite differences (fd) smoothing schemes (models), intended for the simulation of certain elastic pulse propagation through complex materials, are presented in figures 2 and 3 below (see also [15]).

model | ρ̃_{i-1} | ρ̃_i       | ρ̃_{i+1}   | S̃_{i-1}   | S̃_i       | S̃_{i+1}   | ⟨∂S/∂x⟩_{i-1} | ⟨∂S/∂x⟩_i  | ⟨∂S/∂x⟩_{i+1}
1     | ρ       | (ρ+ρ′)/2  | ρ′        | s         | (s+s′)/2  | s′        | 0            | (s′−s)/ε   | 0
2a    | ρ       | (3ρ+ρ′)/4 | (ρ+3ρ′)/4 | s         | (3s+s′)/4 | (s+3s′)/4 | 0            | (s′−s)/(2ε)| (s′−s)/(2ε)
2b    | ρ       | ρ         | ρ′        | s         | s         | s′        | 0            | (s′−s)/(2ε)| (s′−s)/(2ε)
3a    | ρ       | (ρ+ρ′)/2  | ρ′        | s         | (s+s′)/2  | s′        | (s′−s)/(4ε)  | (s′−s)/ε   | (s′−s)/(4ε)
3b    | ρ       | (ρ+ρ′)/2  | ρ′        | (7s+s′)/8 | (s+s′)/2  | (s+7s′)/8 | (s′−s)/(4ε)  | (s′−s)/ε   | (s′−s)/(4ε)

table 2: expressions of the average values of the main elastic parameters for the basic smoothing models of one-dimensional (1-d) interfaces (see also [14]-a)

figure 2: plots of numerical simulations corresponding to different fd schemes (those of models 3a and 3b are pseudo-convergent and unstable, respectively).

figure 3: convergent numerical simulations corresponding to models 1 and 2a, and pseudo-convergent ones for models 2b, 3a.
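the stability alternatives of section 2.1 can be checked with a minimal numerical experiment. the sketch below (python/numpy; the gaussian pulse, the grid size and the periodic boundaries are arbitrary illustrative choices, not taken from the paper) advances the fd scheme (1) and monitors the largest displacement: for c ≤ 1 the pulse stays bounded, while for c > 1 the rounding errors feed an exponentially growing mode:

```python
import numpy as np

def simulate(courant, n=200, steps=400):
    """leap-frog fd scheme for the 1-d ideal wave equation; returns max |w|."""
    w_prev = np.exp(-0.05 * (np.arange(n) - n // 2) ** 2)   # gaussian pulse
    w = w_prev.copy()                                        # zero initial velocity
    for _ in range(steps):
        lap = np.roll(w, 1) + np.roll(w, -1) - 2.0 * w       # periodic boundaries
        w_next = 2.0 * w - w_prev + courant**2 * lap         # scheme (1)
        w_prev, w = w, w_next
    return np.max(np.abs(w))

for c in (0.5, 1.0, 1.01):
    amp = simulate(c)
    print(f"c = {c:4.2f}  max|w| = {amp:.3e}  {'stable' if amp < 10 else 'UNSTABLE'}")
```

note how small the symmetry breaking is: a courant number only 1% above 1 already destroys the simulation within a few hundred steps.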
one finds that while the instabilities can be easily detected and eliminated, the pseudo-convergence is considerably more "dangerous", because: a) the pseudo-convergent simulations have the right shape, while b) the corresponding wrong displacement values are considerably more difficult to observe; hence the pseudo-convergent simulations could easily be misleading.

3 nonlinear properties of the propagation medium

it is well known [7] that the computer rounding errors are amplified considerably in the frame of some nonlinear equations, such as those corresponding to certain solitary waves, leading to some instability numerical artifacts (see figure 4). particularly, the korteweg-de vries equation:
\[\frac{\partial u}{\partial t}=-v_{00}u'-nuu'-d_1 u'''\tag{5}\]
can be discretized as:
\[f(i)=p(i)-\gamma[a(i+1)-a(i-1)]+\alpha\,a(i)[a(i-1)-a(i+1)]+\beta[a(i-2)-a(i+2)].\tag{6}\]
the numerical artifacts corresponding to the 2 main types of nonlinear solitary waves (which can propagate keeping their shapes) were studied: the bell-shaped waves (or breathers) and the kink-shaped waves [16]. figures 4 and 5 present the basic numerical artifacts intervening in the fd simulations of the breathers' propagation (see also [17]).

figure 4: fd simulation of a korteweg-de vries (kdv) solitary wave (breather) propagation

while the artifacts intervening in the simulations of the kdv breathers' propagation reduce to a monotonic increase of distortions up to the appearance of the instability (figure 4), the use of the discrete version (dnls) of the cubic nonlinear schrödinger (nls) equation to describe wave-guide arrays with saturable nonlinearity leads in certain conditions to the artefact corresponding to the merging of two breathers with symmetry breaking (see figure 5 and [17]).
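a scheme of the family (6) can be exercised numerically. the sketch below uses the classical zabusky-kruskal leap-frog discretization of u_t + u u_x + δ²u_xxx = 0 (a standard concrete instance of the scheme (6), not the authors' own code; the grid size, δ = 0.022 and the cosine initial condition are illustrative assumptions) and shows the sharp onset of instability when the time step exceeds a vliegenthart-type bound of the kind discussed in section 6:

```python
import numpy as np

def zk_step(u, u_prev, tau, eps, delta2):
    """one leap-frog step of the zabusky-kruskal scheme for
    u_t + u u_x + delta2 * u_xxx = 0 on a periodic grid."""
    up1, um1 = np.roll(u, -1), np.roll(u, 1)
    up2, um2 = np.roll(u, -2), np.roll(u, 2)
    nonlin = (tau / (3 * eps)) * (up1 + u + um1) * (up1 - um1)
    disp = (delta2 * tau / eps**3) * (up2 - 2 * up1 + 2 * um1 - um2)
    return u_prev - nonlin - disp

def run(tau, steps=500, n=128, delta=0.022):
    eps = 2.0 / n
    x = eps * np.arange(n)
    u_prev = np.cos(np.pi * x)
    u = u_prev.copy()                  # crude start-up (first step is first order)
    for _ in range(steps):
        u, u_prev = zk_step(u, u_prev, tau, eps, delta**2), u
        if not np.all(np.isfinite(u)) or np.abs(u).max() > 1e6:
            return np.inf              # blown up
    return np.abs(u).max()

# a vliegenthart-type bound: tau_max ~ eps / (|u|_max + 4*delta^2/eps^2)
print("tau = 1.0e-3 ->", run(1.0e-3))  # below the bound: bounded solution
print("tau = 4.0e-3 ->", run(4.0e-3))  # above the bound: instability
```

as in the paper's figure 4, the unstable run does not degrade gradually: the high-wavenumber error modes, once seeded by rounding, grow geometrically and overwhelm the solution within tens of steps.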
figure 5: the symmetry-breaking artefact intervening in the merging (bound state formation) of 2 symmetric snls breathers

4 dissipative media

because the modulus of the first solution of the attenuation-dispersion relation [15] is larger than 1, \(|g_1|=e^{e\epsilon}>1\), the fd schemes used for the simulation of the acoustic pulse propagation in dissipative media are always unstable. there were also pointed out: (i) the instability of the attenuated wave simulation, even for absolutely exact initial conditions, due to the generation of the amplified wave (mathematically possible, but without a physical meaning) by the stochastic local accumulation of some local "rounding" inaccuracies of the exact values corresponding to such waves, acting as a self-organising process in the computing program run, as well as (ii) the extremely strong acceleration of the amplified wave generation when complex wave-functions are used (stability and convergence radii of the magnitude order of 1 db or even smaller). the introduction of some (i) corrective measures (particularly the use of some analytical expressions of some partial derivatives) and (ii) properly chosen effective parameters allows the weakening of these unpleasant numerical phenomena, ensuring stability and convergence radii of the magnitude order of 100 db [13], which represent sufficiently high values for accurate descriptions, by means of the finite difference method, of the cases of technical interest.

5 high logical depth

as results from equation (1), the fd scheme of wave propagation in ideal media is not too intricate, even if the courant number is less than 1: c < 1. the repeated use [by a large number (n > 10⁵) of successive iterations, e.g.
for simulations of the ultrasonic non-destructive examination of some industrial components] of this equation leads, however, to the high-logical-depth complexity feature of the used computer program, and consequently to the several numerical artifacts indicated in the frame of figure 6. the expressions of the main limits are: \(x_e=ct-2\sqrt[3]{t}\), \(x_p=1+c(t-1)\), \(x_{p'}=n+c(t-1)\), \(x_n=ct+n+2\sqrt[3]{t}\), while for c ≈ 0.5 and n ≈ 71 the relative (to the incoming pulse one) amplitudes of the echo pulses have the magnitude orders: 0.1 for the rectangular pulse, 0.01 for the sine pulses, and 0.001 for the gaussian pulses.

figure 6: structure of fd simulations of the propagation of pulses of different shapes

6 from the numerical artifacts to the numerical phenomena

6.1 difficulties and main methods used to study the numerical artifact mechanisms

because many problems in the field of number theory are extremely difficult, the aim of this study is to point out the main features of the mechanisms leading to some numerical artifacts intervening in the computer simulations of pulse propagation. the accomplished analysis pointed out that the main methods to study these numerical artifact mechanisms are the methods of: a) the fd transfer coefficients [18], b) the fourier representation of the exact solutions of the discretized wave equation [19].

6.2 the method of transfer coefficients

in order to explain the results corresponding to the linear fd schemes, a partition of the incoming pulse into n components of amplitudes (in the order of their arrival at the studied material) s₁, s₂, ..., s_n is considered. the amplitudes of the same components in the previous time step are denoted by s′₁, s′₂, ..., s′_n.
the transfer coefficients k_{ti} are defined by means of the expressions (4) of the displacement w_{it} corresponding to the space site i at the moment (time step) t:
\[w_{it}=\sum_{j=1}^{n}k_{t,n+2+t-i-j}\cdot s_j-\sum_{j=1}^{n}k_{t,n+t-i-j}\cdot s'_j\tag{7}\]
a simplified definition of the transfer coefficients corresponds to the fd simulations with equal values of the real phase speed v_φ and of the fd one, v_fd = ε/τ (i.e. for the value 1 of the courant number [7]: c = v_φ/v_fd), because then the pulse partition components at successive time steps coincide: s′_i = s_i (for any i = 1, 2, ..., n). in the particular case of a sharp 1-d interface located at the site i, the transfer coefficients describing the transmitted wave are defined as [14]-a:
\[w_{i+1,t}=\sum_{j=1}^{t-2}k_j s_{t-j-1}\tag{8}\]

6.3 the fourier representation method of the exact solutions of the discretized wave equation

the exact discrete solution of the wave equation is written by means of its fourier expansion as:
\[w_{j,t}=\sum_{k=-\infty}^{\infty}c_k\cdot[g(k)]^t\cdot e^{ik\cdot j\epsilon}\tag{9}\]
where g(k) is named the amplification factor. introducing this expression in the wave equation, one obtains the "attenuation-dispersion" relation:
\[g-2+\frac{1}{g}=f(k\cdot\epsilon,v_{fd},w)\tag{10}\]
where f(k·ε, v_fd, w) is a specific function of the considered wave. according to von neumann's theorem ([19]-a, p. 42), the considered fd scheme will be stable if both solutions of the algebraic equation (10) fulfil the requirement |g_{1,2}| ≤ 1; otherwise this scheme will be unstable.

6.4 applications to the study of the mechanisms of some numerical artifacts

the method of transfer coefficients. applying this method to the problem of sharp interfaces (see section 2.2 and figures 1-3), we find that the above-indicated numerical artifacts belong to the class of numerical phenomena, because they can be identified and described starting from the values of the roots of the characteristic equation [15]:
\[\xi^2-(t_{-1}t_1+2t_{-1}t_2+t_0t_1)\xi-t_{-1}t_2=0.\]
(11)
where \(t_i=a_{i-1}b_i-1\), while \(a_i\) and \(b_i\) are the coefficients of the fd wave equation (4): \(w_{i,t+1}=a_i w_{i+1,t}+b_i w_{i-1,t}-w_{i,t-1}\). one finds so that, as |ξ| > 1 or |ξ| < 1, the used fd scheme is unstable or stable, respectively.

the method of fourier representation. using the above-presented method of the fourier representation of the exact solutions for the problem of the korteweg-de vries solitons (section 3), one obtains the following expression [19]-b,c of the upper threshold (for the numerical scheme stability) of the time step:
\[\tau_{max}=\frac{\epsilon}{\alpha a+\dfrac{4\beta}{\epsilon^2}}\tag{12}\]
our study [15] pointed out both the validity of the vliegenthart condition (12) and the monotonic improvement of the fd simulations' accuracy as the representative point of the fd steps ε, τ tends to the vliegenthart border of the stability and instability regions. one finds so that the stability field of the fd simulations of kdv solitons' propagation is rather broad, while the size and borders of the stability domain depend on: a) the strong (e.g., in the case of some exponential dependencies) or weak (as in the above-studied case) character of the nonlinear dependence, b) the number of interacting system components.

7 stability and convergence radii of different numerical schemes

the accomplished numerical studies [20], [21] have pointed out that, for given values of the wave frequency (or wavelength) and of the tangent of mechanical losses, beginning from a certain number of space (or time) steps x_lim one usually finds the appearance of large oscillations of the simulated displacements, which lead quickly to instability.
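the von neumann criterion of section 6.3 can also be evaluated directly. the sketch below (an illustrative python computation, assuming the standard symbol f(kε) = −4c²sin²(kε/2) of the ideal-medium scheme of section 2.1) solves the quadratic (10) for the amplification factor g over all fourier modes and recovers the courant condition numerically:

```python
import numpy as np

def max_amplification(courant, n_modes=400):
    """largest |g| over all fourier modes for the leap-frog scheme of the
    ideal wave equation: g - 2 + 1/g = -4 c^2 sin^2(k*eps/2)."""
    theta = np.linspace(0.0, np.pi, n_modes)          # theta = k * eps
    f = -4.0 * courant**2 * np.sin(theta / 2.0) ** 2
    # roots of g^2 - (2 + f) g + 1 = 0 (the product of the roots is 1)
    b = 2.0 + f
    disc = np.sqrt((b / 2.0) ** 2 - 1.0 + 0j)         # complex sqrt
    g1, g2 = b / 2.0 + disc, b / 2.0 - disc
    return max(np.abs(g1).max(), np.abs(g2).max())

for c in (0.8, 1.0, 1.05):
    print(f"c = {c:4.2f}  max|g| = {max_amplification(c):.4f}")
```

because the product of the two roots of (10) is exactly 1, either both lie on the unit circle (|g| = 1, marginal stability, c ≤ 1) or one of them leaves it (|g| > 1, instability, c > 1); there is no strictly dissipative regime for this scheme.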
because the instability is determined by the value of the factor \(e^{ex}\), with \(e=\frac{k\tan\delta}{2}\), while the wave intensity is proportional to the square of the displacement, \(I\propto w^2\), one finds that the measure (in decibels) of the intensity level corresponding to the stability field is:
\[\langle L_{I,stab}\rangle_{db}=2\langle L_{w,stab}\rangle_{db}=20\,e\,x_{lim}=20\,k\,x_{lim}\,\frac{\tan\delta}{2}=40\pi\,\frac{x_{lim}}{\lambda}\cdot\frac{\tan\delta}{2}.\tag{13}\]
of course, the decrease of the wave intensity corresponding to the stability field (limit) is:
\[\frac{I_{lim}}{I_0}=e^{-2e\,x_{lim}}=e^{-\langle L_{I,stab}\rangle_{db}/10}.\tag{14}\]
table 3 synthesizes the obtained numerical results.

type of the wave equation              | no. of numerical interactions | ⟨L_I,stab⟩_db | t_lim (stability life, steps)
complex stiffness equation             | 12                            | 0.0321        | 102
complex stress relaxation time equation| 10                            | 4             | 12732
complex wave-vector equation           | 8                             | 20            | 63622
space evolution equation               | 6                             | 40.476        | 128839
real wave equation                     | 5                             | 80.130        | 255062

table 3: stability radii and mean life of different numerical simulations [15].

8 analysis of the obtained results for different studied numerical schemes and physical processes

the obtained results (tables 1 and 2) concerning the stability and convergence radii of different numerical schemes intended for the computer simulation of certain physical processes (acoustic pulse propagation, diffusion with drift, absorption, etc.) indicate the "accessible" logical depths [22] of the specific studied physical problems, for each of the used numerical schemes. these results also present a considerable importance for the choice and optimization of the numerical schemes [23]. certain numerical schemes, e.g.
that corresponding to the complex stiffness \(\bar S\) symmetric wave equation of the propagation in dissipative media:
\[\rho\,\frac{\partial^2\bar w}{\partial t'^2}=\bar S\,\frac{\partial^2\bar w}{\partial x^2}\tag{15}\]
allow multiple solutions; using the fd descriptions \(t'=t\tau\) and \(x=i\epsilon\) (in terms of the time step τ and space step ε) of the real time and space coordinate, these solutions can be written as:
\[\bar w_{i,t}=A\cdot e^{\pm i\omega t\tau}\cdot e^{\pm(e+ik)i\epsilon}\tag{16}\]
even if the initial conditions launch only the "direct" wave:
\[\bar w^{dir}_{i,t}=A\cdot e^{-e\,i\epsilon}\cdot e^{i(\omega t\tau+k\,i\epsilon)}\tag{17}\]
some random accumulations of the rounding errors intervening in the evaluation of the partial derivatives produce a local ("spontaneous") generation of the inverse wave:
\[\bar w^{inv}_{i,t}=A'\cdot e^{e\,i\epsilon}\cdot e^{i(\omega t\tau+k\,i\epsilon)}\tag{18}\]
leading to the sudden apparition of instabilities. one finds so that the numerical simulations of wave propagation through dissipative media lead to a typical problem of self-organizing systems, with a spontaneous symmetry breaking. this symmetry breaking corresponds to the "spontaneous" local generation of the inverse wave, launched by the random accumulation of the "garbage" rounding errors and followed by the transition between the attenuated wave and the apparently amplified wave, corresponding to the "inverse" wave. the accomplished study (see tables 1 and 2) points out that the "speed" of this self-organization process crucially depends on the number and intensity of the numerical "interactions" between the components (the values w_{i,t} of the displacement in different sites i, t of the fd grid) of the simulation process. because such numerical "interactions" are achieved mainly by the fd approximate expressions of the partial derivatives, the "spontaneous" breaking of the symmetry appears quicker for (in decreasing order of importance):
• a) large numbers of displacement components involved in the expressions of the partial derivatives, e.g.
when their expressions with 2 previous time steps (instead of those using only one previous time step) are used¹:

ḟ(0) = [−f(2τ) + 8f(τ) − 8f(−τ) + f(−2τ)] / (12τ),
f̈(0) = [−f(2τ) + 16f(τ) − 30f(0) + 16f(−τ) − f(−2τ)] / (12τ²),

when the instabilities appear after only a few tens of iterations,
• presence and repeated "mixing" of the values of both the real and the pure imaginary parts of the complex wave function (displacement) w̄,
• more parasitic solutions,
• more partial derivatives involved in the expression of the differential equation of the acoustic pulse propagation.

¹the formulae in more points are considerably more accurate for rather small numbers of iterations, but they give rise later to spurious solutions and instability (see table 1).

complex computer simulations, numerical artifacts, and numerical phenomena

for these reasons, the highest "accessible" logical depth [22] is reached (for the simulations of the acoustic pulse propagation through attenuative media) for the numerical scheme using the real wave function equation (see table 2), with the usual fd approximations of the first two derivatives:

ḟ(0) = [f(τ) − f(−τ)] / (2τ), f̈(0) = [f(τ) − 2f(0) + f(−τ)] / τ². (19)

9 conclusions and future works

the obtained results concerning the different numerical phenomena associated with complex computer simulations are of considerable importance for the choice and optimization of these numerical schemes [23]. it was also found that some numerical simulations (e.g., those of the acoustic pulse propagation through attenuative media) allow the study of some features of self-organizing systems (the "spontaneous" symmetry breaking, the influence of the interactions between the system components on the "accessible" logical depth, etc.).

acknowledgments

the authors acknowledge the financial support from the national center for programs management (cnmp) of the romanian ministry of education, research, youth and sports, under contract no. d11-044/2007-quantgrid.
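the trade-off described above can be illustrated with a small sketch comparing the 3-point central difference of eq. (19) with the 5-point formula quoted in the footnote; the test function sin and the step value are assumptions chosen for illustration:

```python
import math

# sketch comparing the 3-point central difference of eq. (19) with the
# 5-point (two previous time steps) first-derivative formula; the test
# function sin and the step tau are assumptions for illustration
def d1_3pt(f, t, tau):
    return (f(t + tau) - f(t - tau)) / (2 * tau)

def d1_5pt(f, t, tau):
    return (-f(t + 2*tau) + 8*f(t + tau) - 8*f(t - tau) + f(t - 2*tau)) / (12 * tau)

tau = 0.1
err3 = abs(d1_3pt(math.sin, 0.0, tau) - 1.0)  # exact derivative cos(0) = 1
err5 = abs(d1_5pt(math.sin, 0.0, tau) - 1.0)
```

per step the 5-point stencil is indeed more accurate, which is consistent with the footnote's remark that its drawback appears only later, through spurious solutions and earlier instability in long runs.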
bibliography

[1] r. skeel, siam news, 25(4), p. 11, 1992.
[2] siam news, 29(8), pp. 1, 123, 13, 1996, http://www.siam.org/siamnews/general/ariane.htm
[3] a) d. w. mcclure, "computer errors", in d. a. iordache, d. w. mcclure, selected works of computer aided applied sciences, vol. 2, printech publishing house, bucharest, 2002, p. 535; b) d. w. mcclure, "computer errors", basic notions (chapter 9) and applications (chapter 10), in e. bodegom, d. w. mcclure et al. (d. iordache, fl. pop, c. roşu, eds.), "computational physics guide", politehnica press, bucharest, 2009.
[4] cl. s. fermat, "diophantus' arithmetica containing (48) observations by p. de fermat", toulouse, 1670.
[5] a. wiles, "modular elliptic curves and fermat's last theorem", annals of mathematics, 142, 443-551, 1995.
[6] s. singh, "fermat's enigma: the epic quest to solve the world's greatest mathematical problem", walker publishing company, new york, 1997.
[7] r. courant, k. friedrichs, h. lewy, math. ann., 100, 32, 1928.
[8] p. p. delsanto, t. whitcombe, h. h. chaskelis, r. b. mignogna, wave motion, 16, 65, 1992.
[9] d. iordache, p. delsanto, m. scalerandi, "pulse distortions in the fd simulations of elastic wave propagation", mathl. comp. modelling, 25(6), 31-43, 1997.
[10] a) d. iordache, m. scalerandi, c. rugină, v. iordache, "study of the stability and convergence of fd simulations of ultrasound propagation through non-homogeneous classical (zener's) attenuative media", romanian reports on physics, 50(10), 703-716, 1998; b) d. a. iordache, m. scalerandi, v. iordache, romanian journal of physics, 45(9-10), 685, 2000.
[11] j. c. strikwerda, "finite difference schemes and partial difference equations", wadsworth-brooks, 1989.
[12] p. p. delsanto, g. kaniadakis, m. scalerandi, d. iordache, comp. math. applic., 27(6), 51-61, 1994.
[13] p. p. delsanto, g. kaniadakis, m. scalerandi, d. iordache, mathl. comp. modelling (uk), 19(9), 1-8, 1994.
[14] a) p.
p. delsanto, d. iordache, c. iordache, e. ruffino, "analysis of stability and convergence in fd simulations of the 1-d ultrasonic wave propagation", mathl. comp. modelling, 25(6), 19-29, 1997; b) d. iordache, şt. puşcă, c. toma, "numerical analysis of some typical fd simulations of the waves propagation through different media", lecture notes in computer science, 3482, 614-620, 2005.
[15] d. iordache, "contributions to the study of numerical phenomena intervening in the computer simulations of some physical processes", credis printing house, bucharest, 2004.
[16] a. v. porubov, m. g. velarde, "strain kinks in an elastic rod embedded in a viscoelastic medium", wave motion, 35, 189-204, 2002.
[17] j. cuevas, j. c. eilbeck, "discrete soliton collisions in a waveguide array with saturable nonlinearity", physics letters a, 358(1), 15-20, 2006.
[18] d. iordache, m. scalerandi, c. iordache, "mechanisms of some numerical phenomena specific to the finite differences simulations of the ultrasound propagation", proc. 25th congress of the american-romanian science academy, cleveland (us), 2000, pp. 263-266.
[19] a) j. c. strikwerda, "finite difference schemes and partial difference equations", wadsworth-brooks, 1989; b) a. c. vliegenthart, j. eng. math., 3, 81-94, 1969; c) a. c. vliegenthart, j. eng. math., 5, 137-155, 1971.
[20] a) p. p. delsanto, m. scalerandi, v. agostini, d. iordache, il nuovo cimento b, 114, 141326, 1999; b) d. iordache, m. scalerandi, c. rugină, v. iordache, romanian reports on physics, 50(10), 703-716, 1998.
[21] d. iordache, m. scalerandi, v. iordache, romanian j. of physics, 45(9-10), 685-704, 2000.
[22] m. gell-mann, europhysics news, 33(1), 17-20, 2002.
[23] d. iordache, v. iordache, romanian journal of physics, 48(5-6), 697-704, 2003.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 919-928

a metrics-based diagnosis tool for enhancing innovation capabilities in smes

j. sepulveda, j.
gonzalez, m. camargo, m. alfaro

juan sepulveda, javier gonzalez, miguel alfaro
department of industrial engineering, university of santiago of chile
3769 ecuador ave., santiago, chile. po box 10233

mauricio camargo
nancy-université / erpi (équipe de recherche des processus innovatifs)
8, rue bastien lepage, 54010 nancy cedex, france

abstract: innovation doubtless represents a main strategic lever for the development of small and medium enterprises (smes) in many industrial sectors; it comprises new techniques, new products and new processes, as well as new services which lead to better customer service and revenue. however, the basic question of how well the company is equipped with the necessary practices, methodologies, people, and beliefs is far from being completely answered. in this paper, a metrics-based diagnosis tool for measuring and enhancing the innovation capabilities of smes is presented, along with a set of preliminary results from case-based studies in local industry. we propose a new method that studies the competences of smes in concepts tied to innovation, using a specified framework. as a first step, all of the necessary information is compiled by using questionnaires with verbal-scale evaluations. second, we use a non-compensatory flow-based sorting method with central profiles to identify the current level and to classify the company into predefined levels. third, a detailed analysis of the obtained values is performed in order to make a personalized recommendation.

keywords: technological innovation, multicriteria decision making, classification methods.

1 introduction

1.1 innovation process

the innovation literature is a fragmented corpus, due to the contributions of many scholars with diverse disciplinary backgrounds who adopt different ontological and epistemological positions to investigate and analyze this complex and multidimensional phenomenon.
thus, a variety of approaches [1], [2] and many different measurement methods [3], [4], [5] can be found. chiesa et al. in [6] describe process and performance as the two foci of innovation management measures; they overlay core processes with a set of enabling processes, the latter describing the deployment of resources and the effective use of appropriate systems and tools, governed by top management leadership and direction. a close link exists between product and process innovations: the majority of the articles address both types of innovation, and only a few articles consider process innovations alone [7]. we can observe this relationship in the definition by cormican et al. [8], who describe product innovation as a continuous and cross-functional process involving and integrating a growing number of different competences inside the organization. in the area of the variables related to innovation, we can consider important works on frameworks. adams et al. [3], in a survey, develop a synthesized framework of the innovation management process consisting of seven categories: inputs management, knowledge management, innovation strategy, organizational culture and structure, portfolio management, project management and commercialization. boly [9], based on the literature [7], [8], identified the practices most used by innovative enterprises and classified them into 13 categories or groups. according to the author, these practices constitute the principal actions performed by enterprises to define their strategy, to guide and impel the innovation processes, and to make the organization or its working methods evolve; enterprises develop these practices completely or partially, in a formal or informal way, and the level of use of these practices allows one to classify enterprises according to their innovation potential.

copyright © 2006-2010 by ccc publications
1.2 measuring innovation by assessment of practices

corona in [10] defined an index of potential innovation (iip), which is calculated by using multicriteria decision making (mcdm) tools and uses as criteria the 13 innovation practices defined by boly [9]. these practices are the concrete actions executed by enterprises to define their strategy, to guide and impel the innovation processes, and to make the organization or its working methods evolve. the index allows one to obtain a classification according to the attitudes and strategies adopted by these enterprises. based on [11], we can classify companies as proactive, preactive, reactive or passive. on the other hand, morel et al. [12] propose the use of choquet's integral to account for the interaction between the different innovation practices, defining an aggregated index of potential innovation (apii). finally, assielou [13], besides adding two new innovation practices, made modifications to the treatment of the practices. current index-based methods are actually sorting procedures and present limitations in correctly classifying enterprises in this field. in this paper we propose a new method to evaluate the innovation levels of small and medium enterprises by using a specified framework. as a first step, all the necessary information is collected by using questionnaires with verbal-scale evaluations. then, a non-compensatory flow-based sorting method with limiting profiles is applied. the third step consists of a detailed analysis of the values obtained in each evaluation of an enterprise, to produce a personalized recommendation for each company about the areas to be improved. an example of its application is given.

2 construction of the gathering tool and reference profiles

2.1 gathering tool

based on the works by assielou [13], corona [10] and camargo [14], we use categories of innovation practices ci, each grouping practices belonging to a similar class, as shown below. i.
creation / concept generation (c1)
1.1 use of tools to increase creativity
1.2 integration of clients and suppliers in the conception process
1.3 organization, compilation and management of information from the exterior
ii. conception activities (c2)
2.1 use of tools to aid conception
2.2 existence of a methodology to aid conception
2.3 hardware equipment
iii. human resources management (c3)
3.1 management of the competences and skills of the company
3.2 innovation stimulation
iv. strategy (c4)
4.1 strategy integrated to favor innovation
4.2 network operation
4.3 client importance
4.4 financing
v. project management (c5)
5.1 project administration
5.2 management of the project portfolio
5.3 organization of tasks tied to innovation
vi. capitalization of ideas and concepts (c6)
6.1 continuous improvement of the innovation process
6.2 intellectual property management policy
6.3 knowledge capitalization

in order to measure every concept in each category, verbal scales mapped onto five numerical levels are defined, with values {0; 0.25; 0.5; 0.75; 1}, where 0 and 1 are the lowest and highest levels, respectively. each level was defined in detail for each concept, in order to avoid ambiguity and make the evaluation easier for the interviewer; a survey with 18 evaluations grouped into six categories is then applied.

2.2 reference profiles

by observing the classification established by godet [11] and the six categories above, it is possible to establish intervals into which we can classify the enterprises; passive enterprises have the lowest values in every category, whereas the proactive ones obtain the highest values.
the division by intervals in each category allows us to establish segments of values for each innovation profile for the characteristics to be measured, taking into consideration that the values included in an interval belong to the levels expected for a company of that profile. this suggests that it is possible to determine reference profiles, where for example a proactive company will have all its evaluation values in the highest intervals. thus, each of the four divisions represents a reference company with passive, reactive, preactive or proactive characteristics (table 1). it is possible to see that these reference profiles represent companies that have homogeneous development levels in each characteristic, which is not always the case; for instance, there exist companies with high levels of development in innovation stimulation but, at the same time, poor levels in knowledge capitalization, or a company may obtain proactive values in human resources management and reactive values in capitalization of ideas and concepts. we cannot expect that the majority of companies will be homogeneous; therefore it is necessary to find a method that allows us to establish a correct classification within the four profiles using a non-compensatory mathematical tool.

3 flowsort method

based on the ranking methodology of promethee, a new sorting method developed by nemery and lamboray [15] is proposed for assigning actions to completely ordered categories;

category | passive | reactive | preactive | proactive
i. creation / concept generation | [0, a2[ | [a2, a3[ | [a3, a4[ | [a4, 1]
ii. conception activities | [0, b2[ | [b2, b3[ | [b3, b4[ | [b4, 1]
iii. hr management | [0, c2[ | [c2, c3[ | [c3, c4[ | [c4, 1]
iv. strategy | [0, d2[ | [d2, d3[ | [d3, d4[ | [d4, 1]
v. project management | [0, e2[ | [e2, e3[ | [e3, e4[ | [e4, 1]
vi.
capitalization of ideas | [0, f2[ | [f2, f3[ | [f3, f4[ | [f4, 1]

table 1: interval distribution of category values

these categories are defined either by limiting profiles or by central profiles (also named centroids). the assignment of an action to a category is based on the relative position of this action with respect to the defined reference profiles, in terms of incoming or outgoing net flows. we denote by a = (a1, ..., an) the set of n actions to be sorted. these actions are evaluated on q criteria gj (j = 1, ..., q) that have to be maximized. we denote the categories to which the actions must be assigned by c1, c2, ..., ck. these categories are either delimited by two boundaries, in the case of limiting profiles, or by centroids in the case of central profiles. the categories are ordered as c1 ≻ ... ≻ ch ≻ ... ≻ ck, where ch ≻ cl, with h < l, denotes that category ch is preferred to category cl. we denote by r = (r1, ..., rk+1) the set of limiting profiles, in the case when each category ch is defined by an upper profile rh and a lower profile rh+1. on the other hand, when a category is defined by one central profile, the centroid set is denoted by r̃ = (r̃1, ..., r̃k), where r̃h is the centroid of category ch. we also define π(x, y) as the preference of action x over action y, which is used in the same way as in promethee. thus, on the basis of these preference degrees, the positive, negative and net flows of each action x of ṙi are computed by equations (3.1), (3.2), (3.3), where ṙi = r ∪ {ai}:

ϕ+_ṙi(x) = [1/(|ṙi| − 1)] Σ_{y∈ṙi} π(x, y) (3.1)
ϕ−_ṙi(x) = [1/(|ṙi| − 1)] Σ_{y∈ṙi} π(y, x) (3.2)
ϕ_ṙi(x) = ϕ+_ṙi(x) − ϕ−_ṙi(x) (3.3)

here we use ṙi when no difference needs to be made between a set of limiting profiles and a set of centroids. the flow-based assignment rules differ in the use of limiting profiles and central profiles.
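a minimal sketch of eqs. (3.1)-(3.3), assuming a per-criterion "usual" preference function as a stand-in for the promethee preference degree π (the paper does not spell out its π; with the table-3 profiles and the table-4 evaluation of e1, this simple choice happens to reproduce the rounded π values of table 5):

```python
# illustrative flowsort flows for eqs. (3.1)-(3.3); the "usual" per-criterion
# preference below is an assumed stand-in for the promethee degree pi(x, y)
def preference(x, y):
    # fraction of criteria on which x strictly beats y
    return sum(xc > yc for xc, yc in zip(x, y)) / len(x)

def flows(action, profiles):
    # R_i = R union {a_i}; eqs. (3.1)-(3.3) with |R_i| - 1 in the denominator
    ri = profiles + [action]
    n = len(ri) - 1
    plus = {tuple(x): sum(preference(x, y) for y in ri if y is not x) / n for x in ri}
    minus = {tuple(x): sum(preference(y, x) for y in ri if y is not x) / n for x in ri}
    net = {key: plus[key] - minus[key] for key in plus}
    return plus, minus, net

profiles = [[1.0]*6, [0.75]*6, [0.5]*6, [0.25]*6, [0.0]*6]  # limiting profiles, table 3
e1 = [0.31, 0.32, 0.12, 0.13, 0.0, 0.0]                      # enterprise e1, table 4
plus, minus, net = flows(e1, profiles)
```

with these inputs, preference(e1, r4) and preference(r4, e1) come out as 1/3 and 2/3, matching the rounded 0.33 and 0.67 of table 5, and the positive flow of r4 matches the worked value 0.334.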
in the case of limiting profiles, the positive- and negative-flow assignment rules are defined as follows:

cϕ+(ai) = ch, if ϕ+_ri(rh) ≥ ϕ+_ri(ai) > ϕ+_ri(rh+1) (3.4)
cϕ−(ai) = ch, if ϕ−_ri(rh) < ϕ−_ri(ai) ≤ ϕ−_ri(rh+1) (3.5)

if we want to strictly impose the assignment to one category, using the net flow we can define the assignment rule by (3.6):

cϕ(ai) = ch, if ϕ_ri(rh) ≥ ϕ_ri(ai) > ϕ_ri(rh+1) (3.6)

in the case of central profiles, the flow-based assignment rules for the positive and negative flows are defined by (3.7) and (3.8):

c̃ϕ+(ai) = ch, if [ϕ+_r̃i(r̃h) + ϕ+_r̃i(r̃h+1)]/2 < ϕ+_r̃i(ai) ≤ [ϕ+_r̃i(r̃h) + ϕ+_r̃i(r̃h−1)]/2 (3.7)
c̃ϕ−(ai) = ch, if [ϕ−_r̃i(r̃h) + ϕ−_r̃i(r̃h+1)]/2 ≥ ϕ−_r̃i(ai) > [ϕ−_r̃i(r̃h) + ϕ−_r̃i(r̃h−1)]/2 (3.8)

here, we can also strictly impose the assignment to one category using the net flows, with the assignment rule (3.9):

c̃ϕ(ai) = ch, if [ϕ_r̃i(r̃h) + ϕ_r̃i(r̃h+1)]/2 < ϕ_r̃i(ai) ≤ [ϕ_r̃i(r̃h) + ϕ_r̃i(r̃h−1)]/2 (3.9)

4 application

in this section the method is explained in four main steps, using information obtained from seven smes ej (j = 1, ..., 7) from the metalworking industry located in santiago, chile, as follows.

e1: appliance manufacturing, such as refrigerators, gas and kerosene heaters.
e2: appliance manufacturing, such as gas and electric stoves and heaters.
e3: appliance manufacturing, such as home boilers and sinks.
e4: faucet and gas valve manufacturing.
e5: vehicle transformation, such as for ambulances and safety vehicles.
e6: safety deposit box manufacturing with electronic controls.
e7: vending machine refurbishment and adaptation for industrial utilization.

4.1 first step: determination of the weights and reference profiles.
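the net-flow rule (3.6) can be sketched as a direct scan over the ordered profiles; the flows passed in below are taken from table 6 for enterprises e1 and e6, while the function itself is an illustrative implementation, not the authors' code:

```python
# illustrative net-flow assignment, eq. (3.6), with limiting profiles;
# category c_h is delimited above by profile r_h and below by r_{h+1}
def assign_by_net_flow(net_action, net_profiles, categories):
    # net_profiles: net flows of r1, ..., r_{k+1}, in decreasing preference order
    for h, cat in enumerate(categories):
        if net_profiles[h] >= net_action > net_profiles[h + 1]:
            return cat
    return None

categories = ["proactive", "preactive", "reactive", "passive"]
e1_class = assign_by_net_flow(-0.571, [1, 0.6, 0.2, -0.313, -0.916], categories)
e6_class = assign_by_net_flow(0.246, [1, 0.464, -0.11, -0.6, -1], categories)
```

with the table-6 values, e1 lands between the net flows of r4 and r5 (passive), while e6 lands between those of r2 and r3 (preactive), matching the classes reported there.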
by observing the weights used for each of the practices in [13], we constructed our own weight distribution, considering that our definition of categories and concepts is a grouping, and in some cases a division, of the innovation practices. this weight distribution can be observed in table 2, where the local weight of each concept within a category is also indicated. to establish the reference profiles, we define four areas for each of the six categories using the construction of limiting profiles; this allows us to establish min and max values in every category ci. in this case we constructed a symmetrical division in each of the categories, because the necessary information to establish central profiles was not available (table 3).

4.2 second step: survey application and data processing.

in this step we work with the information collected from the surveys applied to each enterprise. according to the evaluation of each of the concepts, an evaluation for each category ci is made by the weighted sum Σj eij·wij, where eij is the evaluation in [0, 1] of the j-th concept in the i-th category, and wij is the local weight (table 4).

category (ci) | local concept weights (wij) | global category weight (wi)
i. creation / concept generation | {0.26; 0.33; 0.41} | 0.175
ii. conception activities | {0.43; 0.19; 0.38} | 0.107
iii. human resources management | {0.47; 0.53} | 0.068
iv. strategy | {0.05; 0.51; 0.27; 0.17} | 0.232
v. project management | {0.01; 0.47; 0.52} | 0.194
vi.
capitalization of ideas and concepts | {0.43; 0.29; 0.37} | 0.224

table 2: local and global weights of the concepts

limiting profile | c1 | c2 | c3 | c4 | c5 | c6
r1 | 1 | 1 | 1 | 1 | 1 | 1
r2 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75
r3 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5
r4 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25
r5 | 0 | 0 | 0 | 0 | 0 | 0

table 3: limiting profiles defined in the flowsort method

a | c1 | c2 | c3 | c4 | c5 | c6
e1 | 0.31 | 0.32 | 0.12 | 0.13 | 0 | 0
e2 | 0.36 | 0.59 | 0.40 | 0.86 | 0.15 | 0.19
e3 | 0.28 | 0.16 | 0.40 | 0.58 | 0 | 0.10
e4 | 0.17 | 0.22 | 0.12 | 0.43 | 0.11 | 0.19
e5 | 0.26 | 0.42 | 0.24 | 0.34 | 0.15 | 0
e6 | 0.52 | 0.78 | 0.52 | 0.98 | 0.64 | 0.42
e7 | 0.17 | 0.2 | 0.18 | 0.29 | 0.34 | 0.11

table 4: evaluation results

4.3 third step: flowsort application

by defining the set of actions for the seven enterprises as a = {e1, e2, e3, e4, e5, e6, e7}, which have been evaluated on the six criteria already defined, and the four classification categories {passive, reactive, preactive, proactive} defined by the five limiting profiles of table 3, we start by calculating the preference degrees, shown in table 5, between the reference profiles and the seven enterprises, in order to obtain the positive and negative flows. with these calculations we can compute the positive, negative, and net flows for each enterprise by using equations (3.1), (3.2), and (3.3). the calculation of all the flows for the enterprises is shown in table 6, where for example the positive flow of enterprise 1 with respect to limiting profile r4 is calculated as

ϕ+_ṙ1(r4) = [Σj π(r4, rj) + π(r4, e1)] / (|ṙ1| − 1) = (1 + 0.67)/(6 − 1) = 0.334.

the assignments to each category in table 6 were obtained by equations (3.4) and (3.5); for example, for assigning enterprise 1, the flow lies between the profile limits r4 and r5, as indicated by ϕ+_ṙ1(r4) ≥ ϕ+_ṙ1(e1) > ϕ+_ṙ1(r5) and ϕ−_ṙ1(r4) < ϕ−_ṙ1(e1) ≤ ϕ−_ṙ1(r5); thus enterprise 1 is classified as passive. in the cases when the positive- and negative-flow assignments differ, for example for enterprises 3 and 5, we must apply equation (3.6) to obtain a unique classification.
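the weighted sum Σj eij·wij of section 4.2 can be sketched as follows; the concept evaluations used in the example are hypothetical, while the local weights are those of table 2 for category c3 (human resources management):

```python
# sketch of the category evaluation of section 4.2: weighted sum of the
# concept evaluations e_ij by the local weights w_ij; the concept scores
# below are hypothetical, the weights are table 2's c3 local weights
def category_score(concept_evals, local_weights):
    return sum(e * w for e, w in zip(concept_evals, local_weights))

w_c3 = [0.47, 0.53]   # local weights of c3 (hr management), table 2
e_c3 = [0.5, 0.25]    # hypothetical verbal-scale evaluations in [0, 1]
score = category_score(e_c3, w_c3)
```

each category score produced this way is the row of table 4 fed into the flowsort step.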
| r1 | r2 | r3 | r4 | r5
π(e1, rj) | 0 | 0 | 0 | 0.33 | 0.67
π(rj, e1) | 1 | 1 | 1 | 0.67 | 0
π(e2, rj) | 0 | 0.17 | 0.33 | 0.67 | 1
π(rj, e2) | 1 | 0.83 | 0.67 | 0.33 | 0
π(e3, rj) | 0 | 0 | 0.17 | 0.50 | 0.83
π(rj, e3) | 1 | 1 | 0.83 | 0.50 | 0
π(e4, rj) | 0 | 0 | 0 | 0 | 1
π(rj, e4) | 1 | 1 | 1 | 1 | 0
π(e5, rj) | 0 | 0 | 0 | 0.50 | 0.83
π(rj, e5) | 1 | 1 | 0 | 0.50 | 0
π(e6, rj) | 0 | 0 | 0 | 0.50 | 0.83
π(rj, e6) | 1 | 1 | 1 | 0.50 | 0
π(e7, rj) | 0 | 0 | 0 | 0.33 | 1
π(rj, e7) | 0 | 0 | 0 | 0.67 | 0

table 5: preference degrees between the reference profiles and the actions

| r1 | r2 | r3 | r4 | r5 | ei | class
e1 ϕ+ | 1 | 0.8 | 0.6 | 0.344 | 0 | 0.173 | passive
e1 ϕ− | 0 | 0.2 | 0.4 | 0.656 | 0.916 | 0.744 | passive
e1 ϕnet | 1 | 0.6 | 0.2 | −0.313 | −0.916 | −0.571 | passive
e2 ϕ+ | 1 | 0.754 | 0.532 | 0.284 | 0 | 0.431 | reactive
e2 ϕ− | 0 | 0.246 | 0.468 | 0.716 | 1 | 0.569 | reactive
e2 ϕnet | 1 | 0.507 | 0.064 | −0.433 | −1 | −0.139 | reactive
e3 ϕ+ | 1 | 0.8 | 0.554 | 0.305 | 0 | 0.303 | passive
e3 ϕ− | 0 | 0.2 | 0.446 | 0.695 | 0.961 | 0.659 | reactive
e3 ϕnet | 1 | 0.6 | 0.107 | −0.39 | −0.961 | −0.356 | reactive
e4 ϕ+ | 1 | 0.8 | 0.6 | 0.354 | 0 | 0.246 | passive
e4 ϕ− | 0 | 0.2 | 0.4 | 0.646 | 1 | 0.754 | passive
e4 ϕnet | 1 | 0.6 | 0.2 | −0.293 | −1 | −0.507 | passive
e5 ϕ+ | 1 | 0.8 | 0.6 | 0.297 | 0 | 0.258 | passive
e5 ϕ− | 0 | 0.2 | 0.4 | 0.703 | 0.955 | 0.697 | reactive
e5 ϕnet | 1.0 | 0.6 | 0.2 | −0.406 | −0.955 | −0.439 | passive
e6 ϕ+ | 1.0 | 0.732 | 0.445 | 0.2 | 0 | 0.623 | preactive
e6 ϕ− | 0.0 | 0.268 | 0.555 | 0.8 | 1 | 0.377 | preactive
e6 ϕnet | 1.0 | 0.464 | −0.11 | −0.6 | −1 | 0.246 | preactive
e7 ϕ+ | 1.0 | 0.8 | 0.6 | 0.315 | 0 | 0.285 | passive
e7 ϕ− | 0.0 | 0.2 | 0.4 | 0.685 | 1 | 0.715 | passive
e7 ϕnet | 1.0 | 0.6 | 0.2 | −0.37 | −1 | −0.43 | passive

table 6: flowsort results

the assignment of an enterprise into one of the four enterprise profiles can be easily displayed using the positive and negative flow diagrams. figure 1, which shows the flows of enterprise 1, makes it possible to see that there is no ambiguity in the classification of this enterprise into the passive profile, since the positive and negative flows allocate this enterprise to the same profile.
in the other case, figure 2, which shows the flows of enterprise 3, we can observe that the positive flow classifies the enterprise into the passive profile and the negative flow classifies it into the reactive profile, but the final net flow classifies this enterprise into the reactive profile. this final classification can be explained by considering the proximity of the positive and negative action scores to the reactive classification area rather than to the passive one.

figure 1: flow diagram of enterprise 1

figure 2: flow diagram of enterprise 3

for enterprise 6, identified as preactive by this method, the net flow of the action obtains a positive value, as opposed to the net values obtained by the enterprises classified as passive, which have the most negative values (figure 3). it is important to note that the profile areas where no ambiguity exists depend only on the individual comparison between an enterprise and all of the reference profiles. this explains the variety of these areas seen across the flow diagrams.

4.4 fourth step: analysis of the results.

the analysis of the results and the search for possible innovation-progress alternatives in these enterprises can be performed by observing the detailed net score flows for each category, which show the net significance of all six categories in obtaining the final net flow value for every enterprise. we can observe that in most cases the category that contributes the highest negative flows is capitalization of ideas, which is responsible for a third of the total net flow. the second category with the most negative flows is project management. in most enterprises, these two categories grouped together represent between 40% and 50% of the final significance in the net flow (table 7).
figure 3: flow diagram of enterprise 6

this can give us the idea that any improvement in the values obtained in these categories can be critical to obtaining a better evaluation. in other words, any improvement in any of the concepts that belong to the above-mentioned categories may produce an advance of the innovation process. thus, these enterprises would be better qualified to move to a higher profile. on the other hand, in the enterprises classified as reactive and preactive, the values associated with the category strategy have positive net flow values, with a positive significance in the total net flow; thus we can say that enterprises with good evaluations in the concepts concerning strategy can present features of a reactive or preactive enterprise profile.

| c1 | c2 | c3 | c4 | c5 | c6
e1 | 6.13% | 3.75% | 7.15% | 24.39% | 27.19% | 31.39%
e2 | 7.61% | 4.65% | 2.96% | 30.26% | 25.30% | 29.22%
e3 | 7.80% | 14.30% | 3.03% | 10.34% | 34.58% | 29.95%
e4 | 20.70% | 12.66% | 8.04% | 9.15% | 22.95% | 26.50%
e5 | 7.97% | 4.87% | 9.29% | 10.56% | 26.50% | 40.80%
e6 | 10.43% | 19.13% | 4.05% | 41.48% | 11.56% | 13.35%
e7 | 24.44% | 14.94% | 9.50% | 10.80% | 9.03% | 31.28%

table 7: detailed net flow significance

5 conclusion

in this paper a new enterprise classification method using a non-compensatory flow-based sorting method with limiting profiles has been proposed. the method, called flowsort, was used to identify the current level of an enterprise and to classify it into one of four predefined levels of innovation: passive, reactive, preactive, and proactive. the aim of the method, besides classifying the enterprise into a profile, is to provide new ideas for formulating an improvement strategy that allows the company to increase its innovation performance.
throughout this work we established several observations: (a) the importance of defining a precise framework that allows us to evaluate all the characteristics and concepts in a precise and structured form; (b) the correct construction of the data-gathering tool, since this is a key element for obtaining the necessary information input; (c) the use of a mathematical tool that allows a comparison against an established profile; the tool is independent of the universe of enterprises to be measured, so that it can be applied to a reduced or a large number of enterprises without changing its effectiveness; (d) the possibility of establishing the parameters of an innovation-improvement strategy for each enterprise individually, according to its obtained values and the analysis of the significance of each of the categories and concepts. a limitation of the application used in the example above is the highly arbitrary definition of the four reference profiles, since we used four homogeneous zones for the six categories when using limiting profiles, as shown in table 3. more accurate knowledge of a company may allow different values for the profiles, or, better, the use of central profiles. the latter may be continuously refined as the analysis is repeated in the medium and long term. a classification into new innovation levels can be made by using the proposed method; in such a case, the use of either central or limiting profiles will depend exclusively on the certainty about the characteristics of each new level. another comment is related to the interpretation of the results and the creation of incentive policies towards better innovation performance levels; these aspects are left to the policy makers at each company, since they cannot be predefined in a standardized way. however, an empirical study with a broader universe of companies using the method proposed here could give a better answer on the suitability of the best practices of this field.

bibliography

[1] kerssens-van drongelen, i. c., bilderbeek, j., "r&d performance measurement: more than choosing a set of metrics", r&d management, vol. 29, no. 1, pp. 35-46, 1999.
[2] koberg, c., detienne, d., heppard, k., "an empirical test of environmental, organizational, and process factors affecting incremental and radical innovation", journal of high technology management research, 14, pp. 21-45, 2002.
[3] adams, r., bessant, j., phelps, r., "innovation management measurement: a review", international journal of management reviews, vol. 8, no. 1, pp. 21-47, 2006.
[4] bremser, w., barsky, n., "utilizing the balanced scorecard for r&d performance measurement", r&d management, vol. 34, no. 3, pp. 229-238, 2004.
[5] wang, c., lu, i., chen, i., "evaluating firm technological innovation capability under uncertainty", technovation, vol. 28, pp. 349-, 2008.
[6] chiesa, v., coughlan, p., voss, c. a., "development of a technical innovation audit", journal of product innovation management, 13(2), pp. 105-136, 1996.
[7] becheikh, n., landry, r., amara, n., "lessons from innovation empirical studies in the manufacturing sector: a systematic review of the literature from 1993-2003", technovation, vol. 26, no. 5-6, pp. 644-664, 2005.
[8] cormican, k., sullivan, d., "auditing best practice for effective product innovation management", technovation, vol. 24, pp. 819-829, 2004.
[9] boly, v., ingénierie de l'innovation : organisation et méthodologies des entreprises innovantes, lavoisier, paris, france, 2004.
[10] corona, josé ramón, innovation et métrologie : une approche en termes d'indice d'innovation potentielle, thèse de doctorat, institut national polytechnique de lorraine, février 2005.
[11] godet, m., manuel de prospective stratégique, tome 2 : l'art et la méthode, ed. dunod, paris, france, 1997.
[12] morel, l., camargo, m., "comparison of multicriteria analysis techniques to improve the innovation process measurement", iamot 2006, beijing, china, may 22-26, 2006.
[13] assielou, g., métrologie des processus d'innovation, thèse de doctorat, institut national polytechnique de lorraine, 2008.
[14] camargo, m., morel, l., fonteix, c., "evolutionary based methodology to integrate product innovation degree on a firm technological strategy", iamot 2007 proceedings, 2007.
[15] nemery, p., lamboray, c., "flowsort: a flow-based sorting method with limiting or central profiles", top, 16:90-113, springer-verlag, 2008.

international journal of computers, communications & control vol. i (2006), no. 2, pp. 15-22

an automatic grading system for panels surfaces using artificial vision

cristhian aguilera, mario ramos, gabriel roa

abstract: this work describes an automatic grading system using artificial vision to improve the quality of wood panel surfaces. the objective is to control stains on the surface. artificial vision techniques like thresholding and the watershed transform are applied. quantitative measures of the defects found on the surface are also presented, in particular quantity, area, intensity and distribution.

keywords: vision, image processing, quality control, plywood

1 introduction

the need of manufacturing companies to maintain high product quality requires exhaustive production control during and at the end of the process. when control relies on human visual inspection, product control is not completely reliable and does not guarantee total quality control. however, the development of new technologies, and especially of image analysis systems (artificial vision), has markedly improved the quality control process. since it is a non-invasive technology capable of inspecting 100% of the production, artificial vision offers an advantage compared with manufactured-product quality control techniques like human inspection and sampling.
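the thresholding step mentioned in the abstract can be sketched on a toy gray-level image; the threshold value and the image below are assumptions for illustration (the paper's actual pipeline also applies the watershed transform, which is not reproduced here):

```python
# minimal thresholding sketch in pure python; the threshold value 100 and
# the toy image are assumptions, not taken from the paper
def threshold_stains(image, thresh):
    # binary mask: 1 where the pixel is darker than the threshold
    return [[1 if px < thresh else 0 for px in row] for row in image]

def stain_pixel_count(mask):
    return sum(sum(row) for row in mask)

image = [
    [200, 210, 205, 198],
    [199,  40,  35, 202],   # a dark "stain" region
    [201,  38, 204, 207],
]
mask = threshold_stains(image, 100)
```

counting the marked pixels of the mask gives a first measure of stain area, which is one of the quantitative measures (quantity, area, intensity, distribution) the paper proposes.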
In the wood industry, and specifically in the panels industry, efforts have been made to improve each and every one of the production stages. Two main characteristics determine panel quality: mechanical resistance and appearance. Artificial vision (Garrido, 2003) is a solution for appearance quality control. Panels such as plywood are used in the carpentry industry, and decorative paneling is the reason why surface quality must be controlled, since the objective of these products is both functional and esthetic. The main objective of this paper is to propose an analytical, artificial-vision-based method to classify panels according to the quantity, area, intensity, and distribution of the stains on the surface. Among these characteristics, it includes quantitative methods to measure the stain distribution on the panels. Factors that permit the quantification of these parameters, and especially of the stain distribution, are proposed in this paper.

2 Panels

2.1 Definition

Panels can be defined as follows (CORMA, 2003). Structural panels: those employed as structural elements in the construction and packing industries, including plywood. Non-structural panels: those employed in the furniture industry, including decorative plywood, hard panels, and medium-density fiberboard (MDF).

2.2 Classification

Traditionally, wood, and consequently wood-derived products, has been classified considering two fundamental criteria: resistance and appearance. In relation to resistance, the objective is to control the degree of mechanical resistance. There are standards that these products should satisfy; the tests are related to mechanical quality evaluation, such as thickness, density, weight, flexion, traction, elasticity, and humidity.

Copyright © 2006 by CCC Publications

In the case of appearance, the objective is visual appearance control, considering esthetic more than structural values.
These criteria are important in the furniture and paneling industry. The defects affecting surface quality, especially in MDF and fiber panels, are the stains on the surface. As an example, a summary of the characteristics considered by a Chilean company to grade the surface quality of MDF and fiber panels is presented in Table 1. It can be observed that panel grading is divided into three types:

• First quality (I).
• Second quality (II).
• Third quality (III).

Qual | MDF surface | Fiber surface
I | Both faces free of porosity, cracks, semi-inflated defects, and dark stains. Minor stains are permitted when there are no more than 2 per face and they are smaller than 2 cm. Strips that do not surpass 0.5 cm² in area are permitted; their quantity should be less than 10 per face, with a separation greater than 5 cm between them. | Both faces free of porosity, cracks, unsanded holes, semi-inflated defects, and stains falling from the surface. Good behavior during milling and coating application. Minor stains are permitted if they are not greater than 5 cm² in area; their quantity should be less than 5 per face, with a separation greater than 5 cm.
II | Cracks, thick particles, dust, adhesive or other agent stains, and semi-inflated defects. | Porosity, cracks, unsanded holes, thick particles on the surface, adhesive, dust, or other stains.
III | Without quality requirements. | Without quality requirements.

Table 1: Characteristics for panel grading according to surface defects

3 Industrial Vision System

Industrial vision systems are rapidly becoming a key factor in the development of total quality procedures in the context of industrial automation processes. They permit the inspection of production processes without fatigue or distraction, facilitating the quantification of quality variables and contributing to continuous improvement (Silvén and Niskanen, 2003). Figure 1 presents an outline of the stages of an industrial vision system.

Figure 1: Industrial vision system stages

3.1 Image Acquisition and Digitalization

This is the stage in which sensors, and the capacity to digitalize the signal produced by the sensor, are used. The sensor is either a color or a monochrome camera that produces a complete image. After the image is captured, this information is sent to the computer to be analyzed (Mery, 2003).

3.2 Pre-processing

In this process, the acquired image is modified in order to improve it according to the parameters to analyze, considering:

• Noise elimination.
• Accentuating or profiling image characteristics (borders, limits, etc.).
• Improving the quality of some parts of the image.

3.3 Segmentation

The segmentation process divides the digital image into unconnected regions in order to separate the regions of interest from the rest of the scene. In recent years, diverse segmentation techniques have been developed, which can be grouped into three families: pixel-oriented, border-oriented, and region-oriented techniques.

3.4 Description

This process labels the objects considering information supplied by inspection, which can be: quantitative, i.e., measurement of areas, lengths, perimeters, etc.; or qualitative, i.e., verification of correct task performance (assembling, bottling, labeling, etc.).

3.5 Classification

Classification orders the segmented regions into classes, assigning each region to one of many pre-established groups that represent all the possible types of regions expected to exist in the image. In this stage, a statistical study is performed on the characteristics extracted from objects whose defects are known a priori.

3.6 Decision Making

Frequently, computer vision systems control a mechanical apparatus that manipulates the products after classification.
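The stage sequence above can be sketched end to end. A minimal sketch, assuming a tiny integer image and illustrative thresholds and grade limits (none of these values or function names come from the paper):

```python
# Sketch of the processing stages of Sections 3.1-3.6; the 3x3 'image',
# the threshold, and the grade limits are illustrative assumptions.

def preprocess(image):
    # Noise elimination stand-in: clamp pixel values to the 0-255 range.
    return [[min(255, max(0, p)) for p in row] for row in image]

def segment(image, threshold=128):
    # Pixel-oriented segmentation: 1 marks a dark defect (stain), 0 background.
    return [[1 if p < threshold else 0 for p in row] for row in image]

def describe(mask):
    # Quantitative description: stained area in pixels.
    return sum(sum(row) for row in mask)

def classify(area, minor=5, major=20):
    # Map stained area to the three grades of Table 1 (limits are invented).
    if area <= minor:
        return "I"
    return "II" if area <= major else "III"

image = [[200, 200, 90], [200, 40, 90], [200, 200, 200]]
mask = segment(preprocess(image))
grade = classify(describe(mask))   # three dark pixels -> grade "I"
```

A real system would replace each stub with camera acquisition, filtering, and the segmentation and factor computations described in Sections 4 and 5.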
4 Proposed Techniques

To grade the panel surface defects, image analysis techniques need to be used; this stage differentiates one vision system from another. The analysis is performed on gray-scale images, where the intensity value for black is 0 and for white is 255. One of the techniques used is thresholding (Guindos, 2001); panel surface stains are treated as regions. Another segmentation technique is region growing implemented through morphological operators, specifically the watershed transform.

4.1 Thresholding

The thresholding technique (González and Woods, 1996) classifies each pixel into two groups, depending on whether or not its gray level exceeds a given threshold. If it does not exceed the threshold, the lower gray level (black) is assigned; otherwise, the upper level (white) is assigned. The objective is to obtain an image segmentation, creating black stains on a white background in a form similar to the objects seen from an upper plane, as can be observed in Figure 2.

Figure 2: (a) original image, (b) gray-scaled image, (c) thresholded image

4.2 Watershed Transform

This technique belongs to a family of algorithms that analyze an image as a topographic surface where height equals intensity: the highest pixel values correspond to the highest land areas, and vice versa. The watershed transform simulates the flooding of this surface with water starting from the minima of the image (Figure 3). These starting points can also be selected explicitly, in which case they are called markers. The line where the water coming from two regions meets is called the watershed, and it separates the basins corresponding to two minima.

Figure 3: (a) marker selection, (b) flooding process from the markers, (c) finalized segmentation, (d) selection of markers using thresholding

The key to segmenting with the watershed transform is marker selection.
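A minimal numerical sketch of thresholding and of threshold-based marker selection; the 4×3 array and both threshold values are illustrative assumptions, not the paper's data:

```python
import numpy as np

# Grayscale convention from Section 4: 0 = black, 255 = white.
# The array below is an illustrative stand-in for a panel image.
panel = np.array([[230, 228,  60, 231],
                  [229,  55,  50, 230],
                  [232, 229, 227, 233]])

# Thresholding (Section 4.1): pixels darker than the threshold become
# black stains (0) on a white background (255).
threshold = 128
binary = np.where(panel < threshold, 0, 255)

# Marker selection for the watershed (Section 4.2): a stricter (lower)
# threshold keeps only the darkest cores of the stains, which can then
# seed the flooding and help avoid over-segmentation.
markers = panel < 58
```

Lowering the marker threshold is one way to realize the "pre-determined filling level" mentioned in the text: only the deepest minima survive as seeds.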
If every local minimum is taken as a marker, the result can be over-segmented. In order to reduce over-segmentation, the original image is frequently smoothed before performing the segmentation. With the thresholding method, a set of markers can be obtained for which a pre-determined filling level is fixed. Figure 4 shows an example of the over-segmentation obtained when the watershed transform is applied to the original image without establishing local minima.

Figure 4: watershed results: (a) original image, (b) over-segmented image, (c) segmented image using local minima

5 Extraction of Panel Characteristics

Once the stains (defects) are found, numeric properties are extracted. These characteristics provide the information needed to grade the panel. The characteristics are quantity, area, intensity, and distribution.

5.1 Quantity

This characteristic indicates the number of stains on the panel surface; its factor is defined directly by the quantity of stains:

\[ F_c = C_t \qquad (1) \]

where F_c is the quantity factor and C_t is the number of stains present on the panel surface.

5.2 Area

This characteristic indicates the stained surface of the panel. Its value is defined by the ratio between the sum of the stain areas on the panel surface and the total panel surface area. This characteristic is called the "area factor" and is represented by:

\[ F_a = \frac{1}{A_t} \sum_{n=1}^{k} A_n \qquad (2) \]

where F_a is the area factor, A_t is the panel area, A_n is the area of each stain, and k is the number of stains (C_t). In Figure 5 it can be observed that the area factor increases proportionally with the stained area of the panel surface, indicating in this way a lower quality.

5.3 Intensity

This characteristic indicates the intensity level (gray scale, 0-255) of the stains. This property provides information about the type of stain on the panel surface; for example, a humidity stain will have a different intensity than an oil stain.
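The four factors of this section (Eqns. 1-4; the intensity and distribution factors are defined in the subsections that follow) can be computed directly from per-stain measurements. A sketch with invented stain data; reading d_t as the panel's diagonal length follows the definition in Section 5.4:

```python
from math import hypot

# Illustrative stain measurements (areas in cm^2, centres in cm); the
# numbers and the data layout are assumptions, not the paper's data.
stains = [{"area": 2.0, "mean": 80.0, "centre": (30.0, 40.0)},
          {"area": 1.0, "mean": 60.0, "centre": (90.0, 120.0)}]
panel_w, panel_h = 150.0, 230.0     # 1.5 m x 2.3 m panel, in cm
panel_area = panel_w * panel_h
clean_mean = 200.0                  # stain-free intensity average

# Quantity factor, Eqn. (1): Fc = Ct.
fc = len(stains)

# Area factor, Eqn. (2): Fa = (1/At) * sum of stain areas.
fa = sum(s["area"] for s in stains) / panel_area

# Intensity factor, Eqn. (3): Fi = (xi - xim) / xi, with xim the
# average intensity over the stains.
x_im = sum(s["mean"] for s in stains) / len(stains)
fi = (clean_mean - x_im) / clean_mean

# Distribution factor, Eqn. (4): mean Euclidean distance from each stain
# to the group's geometric centre, relative to the panel diagonal.
cx = sum(s["centre"][0] for s in stains) / len(stains)
cy = sum(s["centre"][1] for s in stains) / len(stains)
mean_d = sum(hypot(s["centre"][0] - cx, s["centre"][1] - cy)
             for s in stains) / len(stains)
fd = 100.0 * mean_d / hypot(panel_w, panel_h)
```

With these two stains, fc = 2, fi = 0.65, and fd comes out near 18%, i.e., the stains are spread rather than clustered.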
To represent this factor, the average intensity of the stains and the average intensity of the stain-free panel are considered. This characteristic is called the "intensity factor" and is represented by:

\[ F_i = \frac{\bar{x}_i - \bar{x}_{im}}{\bar{x}_i} \qquad (3) \]

where \bar{x}_i is the stain-free panel intensity average and \bar{x}_{im} is the stain intensity average.

Figure 5: area factor vs. stained area
Figure 6: intensity factor vs. average stain intensity

Consequently, as can be appreciated in Figure 6, the intensity factor increases when the average stain intensity decreases, indicating in this way a lower quality.

5.4 Distribution

This characteristic determines the spatial location of the stains on the surface: it shows whether the stains are grouped in only one sector of the panel or distributed throughout its surface. The position of the objects is established through Cartesian coordinates. The method uses the average of the Euclidean distances from the objects to their geometric center. The factor is defined by the ratio between this average distance and a reference value, the length of the sample's diagonal, expressed as a percentage:

\[ F_d(\%) = 100 \cdot \frac{\mu(D)}{d_t} \qquad (4) \]

where \mu(D) is the average of the Euclidean distances, D is the set of objects, and d_t is the length of the diagonal of the sample. The average distance \mu(D) depends on how far the objects are from the geometric center: the farther they are, the greater \mu(D) and, consequently, the greater F_d(\%), indicating a wider object distribution. Figure 7 shows three examples of distributions. As an example, Table 2 presents numerical values of the factors proposed in this work in relation to the grading criteria presented in Table 1; the panel format was 1.5 × 2.3 meters.
Figure 7: three example distributions: (a) F_d = 6.8%, (b) F_d = 9.05%, (c) F_d = 17.32%

Qual | F_c (MDF) | F_a % (MDF) | F_d % (MDF) | F_c (Fiber) | F_a % (Fiber) | F_d % (Fiber)
I    | < 2       | < 0.01      | NE          | < 5         | < 0.07        | 1.8
II   | < 10      | NE          | -           | < 10        | -             | 7
III  | SEC       | SEC         | SEC         | SEC         | SEC           | SEC

Table 2: Example of factors (NE: not specified; SEC: without quality requirements)

6 Comments and Conclusions

This work is focused on the image analysis, by segmentation, of stains present on wood panel surfaces, and on the definition of factors to quantify defect properties (quantity, area, intensity, and distribution). Techniques such as thresholding and the watershed transform showed good results for segmenting stains in gray scale. Thresholding is a useful and simple technique, but when it is used at only one threshold level it is not capable of separating (labeling) the distinct stain intensity values that lie below this level. The watershed transform offers the capacity to separate regions (stains) as a function of the intensity value presented by each defect. However, this technique presents over-segmentation, a problem that is solved through image pre-processing to increase the contrast of the objects of interest and through the assignment of local minima (markers). The factors proposed in this work provide the information necessary to define a stain quality index for the on-line control of wood-based panels.

Acknowledgements

This work was supported by project INCON-032008 3/R of the Research Secretary, University of Bío-Bío.

References

[1] Garrido R., Adquisición y tratamiento digital de señales para visión artificial, Reporte Final de Ingeniería Electrónica, Universidad del Bío-Bío, 2003.
[2] CORMA, Corporación Chilena de la Madera, www.corma.cl, 2004.
[3] Mery D., Inspección visual automática, Informe Interno, Departamento de Ingeniería en Computación, Universidad de Santiago de Chile, 2003.
[4] Guindos F., Fernández J., Peralta M., Visión artificial con IMTdi, Almería, 2001.
[5] González R., Woods R., Tratamiento digital de imágenes, Addison-Wesley, 1996.
[6] Silvén O., Niskanen M., Framework for industrial visual surface inspections, 6th International Conference on Quality Control by Artificial Vision, Machine Vision Group, University of Oulu, Finland, 2003.

Cristhian Aguilera, Mario Ramos, Gabriel Roa
Universidad del Bío-Bío, Concepción, Chile
Departamento de Ingeniería Eléctrica y Electrónica
Departamento de Ingeniería en Maderas
Laboratorio de Sistemas Automatizados
E-mail: cristhia@ubiobio.cl, mramos@pegasus.dci.ubiobio.cl

International Journal of Computers, Communications & Control, Vol. II (2007), No. 4, pp. 314-327

An Implementation of Reconfigurable Network Control Based upon Automata Proposal for Three Conveyor Belt Case Study

Benítez-Pérez H., Cárdenas-Flores F., García-Nocetti F.

Abstract: Online reconfiguration performed by a computer network system needs to be addressed from several perspectives due to the complexity of the system. This paper proposes different modeling approximations to obtain a holistic view of reconfiguration in complex systems. The first model is dynamic system modeling; the second is an automaton that bounds the possible scenarios; and the third is a real-time scheduling algorithm that matches possible configurations with the related control laws.

1 Introduction

One of the main issues in fault tolerance is to keep availability even in hazardous situations. A way to guarantee this is reconfiguration, from which several consequences are expected. In that respect, fault coverage, masking, and tolerance are all strategies that can be used during reconfiguration. Reconfiguration is needed in order to maintain safety during fault scenarios, and the results of this action modify several structures within complex systems, such as communication, dynamic behaviour, and predictable response.
In order to cover every aspect of these requirements, modeling becomes a crucial issue for obtaining enough information when performing reconfiguration. Here, three different modeling techniques are followed: dynamic system modeling, automaton modeling, and scheduling representation. On their own, each of these three strategies lacks a holistic view of the effects of a process such as reconfiguration. For instance, the automaton strategy allows a structural view of reconfiguration without determining the effects of this action on the dynamic behaviour of the system. Alternatively, dynamic system modeling provides a formal view of the effects of time delays and of the loss of certain dynamic elements; however, reconfiguration itself is not a predetermined action in that model, only its effects. The novelty of this approximation is the integration of various modeling strategies to accomplish reconfiguration and capture its effects. Firstly, the dynamic system is modeled in order to determine where time delays play a key role. Secondly, automaton modeling is pursued in order to bound the scenarios during fault and fault-free situations. From this automaton representation, and taking into account real-time requirements, real-time scheduling is implemented through a special representation. The combination of these three allows system modeling even in complex situations such as reconfiguration. The objective of this paper is to present a strategy for control reconfiguration based upon time-delay knowledge using a scheduling algorithm. Fault effects are local within a distributed system environment. A case study is used to accomplish this objective. The novelty of this work is the amalgamation of scheduling and control techniques into this strategy. In particular, for the case of dynamic system modeling, several strategies for managing time delays within control laws have been studied by different research groups.
For instance, Nilsson [8] proposes the use of a time-delay scheme integrated into a reconfigurable control strategy based upon a stochastic methodology. On the other hand, Wu [10] proposes a reconfiguration strategy based upon a performance measure from a parameter-estimation fault diagnosis procedure. Another strategy has been proposed by Jiang et al. [6], where time delays are treated as uncertainties that modify the pole placement of a robust control law. Izadi et al. [5] present an interesting view of the fault-tolerant control approach related to time-delay coupling. Reconfigurable control has also been studied from the point of view of structural modification upon fault appearance, as presented by Blanke et al. [2], where a logical relation between dynamic variables and faults is established. Alternatively, reconfigurable control may perform a combined modification of the system structure, as studied by Benítez-Pérez et al. [1] and Thompson [9]. Another technique such as gain scheduling (Khalil, [7]) may give an interesting approximation for several time-delay scenarios; however, the complexity related to system modeling during fault conditions is beyond the scope of this paper. Some considerations need to be stated in order to define this approach. Firstly, faults are strictly local to peripheral elements, and they are tackled by simply eliminating the faulty element; in fact, faults are catastrophic and local. Time delays are bounded and restricted by the scheduling algorithm. Global stability can be reached by using a classical control strategy for online time delays.

Copyright © 2006-2007 by CCC Publications

2 Case Study and Control Reconfiguration Approach

Due to the complexity of this approach, a particular implementation is pursued in order to present the potential of complementary modeling for reconfiguration.
The case study is based on three conveyor belts (Gudmundsson, [4]) integrated as follows: it comprises 3 conveyor belts, 4 actuators, and 12 sensors, and it has 16 computing elements, counting the controllers and the bus controller. Fig. 1 shows a diagram of this implementation. The procedure of the example is as follows: when conveyor belt 1 detects a box, it raises its speed to a certain level in order to transport the box faster.

Figure 1: Conveyor belt example (MC stands for micro-controller)

When this box arrives at conveyor belt 2, the speed of that belt is raised to another level to transport the box faster; similar behavior is presented at conveyor belt 3. The sensor vector is used to detect the current position of a box on any of these conveyor belts. Furthermore, actuator 4 has the task of pushing the current box when it arrives at its position. For the case of multiple boxes, the aim is that the boxes never crash into each other. From this explanation, Table 1 shows the modification of the speeds. This case study is peculiar in that each conveyor belt has two different speeds, as shown in Table 1. These speeds depend on the sensor situation, which is depicted as low and high and acts as a semaphore for determining the presence of an object. The second peculiarity is related to the difference between the high speeds (HS):

\[ HS_1 < HS_3 < HS_2, \qquad (1) \]

that is, the middle conveyor belt is the fastest, then the third conveyor belt, and so on. As the reader may realize, there are four motors: three for the conveyor belts, and the fourth to pull any object presented at its region. Based on this case study, the response of the three actuators is shown in Fig. 2, where different speed-ups are shown assuming that a box is present during a certain time. For instance, the first conveyor belt presents a faster speed-up during the first 3000 seconds, in comparison with the low speed-up from 3000 to 6000 seconds; this speed-up appears as a change of slope in the graphic. Similar behavior is presented as conveyor belts 2 and 3 are modified, because HS_2 is bigger than HS_3.

Sensor condition | Belt 1            | Belt 2            | Belt 3
S1* = low        | low speed         | low speed         | low speed
S1* = high       | high speed (HS_1) | low speed         | low speed
S2* = low        | low speed         | low speed         | low speed
S2* = high       | low speed         | high speed (HS_2) | low speed
S3* = low        | low speed         | low speed         | low speed
S3* = high       | low speed         | low speed         | high speed (HS_3)

Table 1: Speed selection

Figure 2: Related displacement when a box is present in each conveyor belt

2.1 First Modeling Approach

The first modeling approach is based on control law modification taking into account the appearance of time delays, in particular for the current case study. The schematic setup is based on Fig. 3, considering the system response and the control implementation.

Figure 3: Dynamic system implementation

In Fig. 3:
x is the linear displacement;
θ* is the angular displacement;
ω* is the angular velocity;
f is the linear force;
j* is the linear inertia;
k is the parameter;
τ* is the torque;
a is the radius;
v is the linear velocity.

In this case, the plant presents two cases per belt: with or without a box. As the second case is trivial, the first case is expressed per belt considering the mass of the box (referred to as m).
The first conveyor belt is expressed as

\[ \begin{bmatrix} \ddot{x}_1 \\ \dot{\theta}_1 \end{bmatrix} = \begin{bmatrix} j/m & 0 \\ j & 1 \end{bmatrix} \begin{bmatrix} \theta_1 \\ \dot{x}_1 \end{bmatrix} - \begin{bmatrix} 1/m \\ 0 \end{bmatrix} \tau_1, \qquad y = \dot{x}_1. \qquad (2) \]

The second and third conveyor belts follow similar dynamics. From these considerations, the discrete plants are defined next by considering the presence of the box:

\[ x(k+1) = A x(k) + \sum_{i=0}^{l} B_{k_i} u(k-i), \qquad B_{k_i} = \int_{t_{k_{i-1}}}^{t_{k_i}} e^{A(t-\tau)} B \, d\tau, \qquad (3) \]

where l = 1, because the maximum number of sensors with delays is just one. Therefore, the A matrix is expressed as

\[ A^{*} = \begin{bmatrix} e^{j/m} & 0 \\ 0 & e^{j^{*}} \end{bmatrix}, \qquad (4) \]

where T is the inherent sampling period and t_{k_0}, t_{k_1}, and t_{k_2} are the related delays of the plant. The local control laws are expressed as

\[ x_c(k+1) = A_c x_c(k) + B_c u_c(k), \qquad y_c(k) = C_c x_c(k-\tau_c) + D_c u_c(k-\tau_c), \qquad (5) \]

giving the delays resulting from the decomposition into sensor and actuator parts, expressed as τ_sc and τ_ca respectively. The interconnection is given next:

\[ u_c(k) = y_p(k-\tau_{sc}), \qquad u_p = y_c(k-\tau_{ca}), \qquad (6) \]

where the states are augmented as

\[ z = \begin{bmatrix} x_p(k) \\ x_c(k) \end{bmatrix} \qquad (7) \]

and expressed as

\[ z(k+1) = \begin{bmatrix} A_p & 0 \\ 0 & A_c \end{bmatrix} z(k) + \begin{bmatrix} 0 & 0 \\ B_c C_p & 0 \end{bmatrix} z(k-\tau_{sc}) + \begin{bmatrix} B_p D_c C_p & 0 \\ 0 & 0 \end{bmatrix} z(k-\tau_{sc}-\tau_{ca}-\tau_c) + \begin{bmatrix} 0 & B_p C_c \\ 0 & 0 \end{bmatrix} z(k-\tau_{ca}-\tau_c). \qquad (8) \]

By rewriting Eqn. (8) to study the stability of the networked control, the following configuration is proposed:

\[ F^{j} = \begin{bmatrix} A_p^{j} & 0 \\ 0 & A_c^{j} \end{bmatrix}, \quad F_1^{j} = \begin{bmatrix} 0 & 0 \\ B_c^{j} C_p^{j} & 0 \end{bmatrix}, \quad F_2^{j} = \begin{bmatrix} B_p^{j} D_c^{j} C_p^{j} & 0 \\ 0 & 0 \end{bmatrix}, \quad F_3^{j} = \begin{bmatrix} 0 & B_p^{j} C_c^{j} \\ 0 & 0 \end{bmatrix}. \]

Therefore, the state vector is modified for these time delays, and the system is asymptotically stable based on F^j + Σ_{i=1}^{3} F_i^j. Assuming a single control loop is stable (Eqn. 8), it is possible to define stability for every loop as shown in Eqn. (9):

\[ \tau < \frac{\sigma}{\delta \sum_{i=1}^{3} \left| F_i^{j} \left( F^{j} + \sum_{i=1}^{3} F_i^{j} \right) \right|}, \qquad (9) \]

where τ is the maximum value among all possible time delays over all loops. The absolute value is used in order to guarantee a positive response with respect to the current time delay, and

\[ \sigma = \frac{\lambda_{\min}(Q)}{2\,\lambda_{\max}(P)}, \qquad \delta = \left[ \frac{\lambda_{\max}(P)}{\lambda_{\min}(P)} \right]^{1/2}, \]

where λ_max(P) and λ_min(Q) are the maximum and minimum eigenvalues of the P and Q matrices, respectively. λ_min and λ_max cannot be complex values, since P and Q are real symmetric matrices. The proposed configuration is

\[ \left( F^{j} + \sum_{i=1}^{3} F_i^{j} \right)^{T} P + P \left( F^{j} + \sum_{i=1}^{3} F_i^{j} \right) = -Q, \qquad (10) \]

where P and Q are positive definite symmetric matrices, and the chosen Lyapunov function is

\[ V(x) = \tfrac{1}{2}\, x^{jT}(t) P x^{j}(t). \qquad (11) \]

The analysis is based upon the proposed Lyapunov function (Eqn. 11) and its derivative, shown in Eqn. (12), where ẋ is substituted by the enhanced representation ż, which contains the states of both the plant and the controller:

\[ \dot{V}(x) = \tfrac{1}{2}\, \dot{z}^{jT}(t) P z^{j}(t) + \tfrac{1}{2}\, z^{jT}(t) P \dot{z}^{j}(t) \le -\tfrac{1}{2}\, z^{jT}(t) Q z^{j}(t) + \left| z^{jT} P \sum_{i=1}^{3} F_i^{j} \int_{-\tau_i^{j}}^{0} \left[ F^{j} z^{j}(t+\theta) + \sum_{i=1}^{3} F_i^{j} z^{j}(t-\tau_i^{j}+\theta) \right] d\theta \right|. \qquad (12) \]

From the control law expression, the related time delays are defined as τ_sc, τ_ca, and τ_c; their respective values will be incorporated later in this section through splitting the time delay (Eqn. 8).

2.2 Second Modeling Approach

Having shown the structure of the local control laws, the second modeling approach is the global structure in terms of an automaton (Fig. 4), where reconfiguration is expressed by the formal event manager. In this case, two states are possible, with several events managed by the sensor vector of each belt (first, second, and third), expressed as S^1_{1≤i≤n}, S^2_{1≤i≤n}, and S^3_{1≤i≤n}, respectively, considering the fault-free scenario. It is important to mention that S^1_{1≤i≤n}, S^2_{1≤i≤n}, and S^3_{1≤i≤n} are independent conditions between conveyors; thus, S^1_{1≤i≤n} = 0 means there is no box on conveyor belt 1, while at the same time S^2_{1≤i≤n} ≠ 0 can hold, meaning there is a box on the second conveyor belt.
Other conditions can also be presented, such as S^1_{1≤i≤n} ≠ 0, S^2_{1≤i≤n} ≠ 0, and S^3_{1≤i≤n} ≠ 0, where three boxes are present on the system, one per conveyor belt. The same sensor conditions are presented in Fig. 5.

Figure 4: Fault-free scenario in terms of the global structure

The switching effect is neglected in this fault-free scenario. In this scenario two cases are defined: a box is present (Case II) or it is not (Case I). In the latter case, the chosen control maintains the conveyor belt at zero speed-up; in the former, the chosen controller applies a certain speed-up depending on the conveyor belt. For the case of a fault scenario, a new state appears in the global control (Fig. 5), related to the action pursued when a fault is present. The necessary event for reaching such a state is S^1_{1≤i≤n} ≠ 0, and the fault's last event is composed of local information given by each local sensor in relation to the health condition measures.

Figure 5: Local fault scenario for the global structure

Considering individual modeling, there is one type of local fault to be considered: one of the sensors is faulty, with no consideration of the type of fault. It is assumed that the fault is detectable and measurable (this is a condition in this paper).

2.3 Third Modeling Approach

The third modeling approach is related to the use of a scheduling algorithm in order to determine the possible time delays between processes. It is important to remember that the fault tolerance strategy is based on the use of consecutive sensors to mask the fault, using extra communication to perform lateral agreement. Therefore, this approach provides two different time graphs, one for each scenario (fault and fault-free), as shown in Figs. 6 and 7, respectively. The cases from Figs. 4 and 5 are related to bounded time delays from the decision maker (Benítez-Pérez et al., [1]) and have an effect on the control and plant modeling, as shown before. At the end of this section (Eqn. 15) it is shown how the time delays are modified based on the time diagram representation (Figs. 6 and 7). The reader should note that the time delays are bounded by the use of the scheduling algorithm through an ART2 network. Both scenarios are local with respect to one belt; it is considered that the other two belts do not present faulty conditions. As these two scenarios are bounded (fault and fault-free), the respective total consumption times (T_T and T_{Tf}) are given in Eqns. (13) and (14) (Figs. 6 and 7, respectively), where the variable information is presented.

Figure 6: Fault-free scenario
Figure 7: Fault scenario considering fault masking

\[ T_T = T_s \cdot 4 + T_{sc}^{cm} + T_c + T_{ca}^{cm} + T_a, \qquad (13) \]

where:
T_s is the time consumed by the sensors;
T_{sc}^{cm} is the time consumed by the communication between sensor and controller;
T_c is the time consumed by the control node;
T_{ca}^{cm} is the time consumed by the communication between controller and actuator;
T_a is the time consumed by the actuator.

\[ T_{Tf} = T_s \cdot 4 + T_{sc}^{cm} + T_{fsc}^{cm} + T_c + T_{ca}^{cm} + T_a, \qquad (14) \]

where T_{fsc}^{cm} is the time consumed by the faulty sensor to send messages to its neighbor and produce agreement within this time boundary. Based on these three modeling approaches, it is feasible to implement the control strategy as presented in Eqn. (10). As mentioned before, the time delays appear in three forms, τ_sc, τ_ca, and τ_c; therefore, by decomposing Eqn. (14), the time delays are expressed as:

\[ \tau_{sc} = T_s \cdot 4 + T_{sc}^{cm}, \qquad \tau_c = T_{cm} + T_c + T_{ca}^{cm}, \qquad \tau_{ca} = T_a. \qquad (15) \]

A remarkable issue is related to a particular sensor fault on any of the three belts.
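The timing quantities of Eqns. (13)-(15) combine by simple addition; a worked sketch with invented millisecond values (T_cm, which Eqn. 15 adds into τ_c without a stated value, is taken as zero here):

```python
# Worked example of the timing decomposition in Eqns. (13)-(15); all
# millisecond values below are illustrative assumptions, not measurements.
ts, t_sccm, tc, t_cacm, ta = 5.0, 10.0, 20.0, 10.0, 5.0   # ms
t_fsccm = 15.0   # extra neighbour-agreement traffic in the fault case
tcm = 0.0        # unstated T_cm term of Eqn. (15), assumed zero here

# Eqn. (13): fault-free total consumption time (four sensor readings).
tt = ts * 4 + t_sccm + tc + t_cacm + ta

# Eqn. (14): the fault scenario adds the masking agreement messages.
tt_f = ts * 4 + t_sccm + t_fsccm + tc + t_cacm + ta

# Eqn. (15): splitting the total into the three control-law delays.
tau_sc = ts * 4 + t_sccm
tau_c = tcm + tc + t_cacm
tau_ca = ta
```

With T_cm = 0, the three split delays add back up to the fault-free total T_T, which is the consistency one would expect from the decomposition.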
Considering this configuration, three cases are possible:

• one local fault;
• two local faults, one per belt;
• three local faults, one per belt.

Among these configurations, the worst-case scenario is the one with three local faults, which has an impact on the global control strategy. The other two configurations present a minor degradation of the global control strategy. Despite this performance degradation, the system keeps its normal functionality due to the inherent fault tolerance strategy (fault masking between sensors) and the local controllers. Taking into account these three possible configurations, the local and global time delays are described in Table 2.

Configuration                        | Local time delays | Global time delays
Configuration 1 (one local fault)    | 110 ms            | 110 ms
Configuration 2 (two local faults)   | 110 ms            | 220 ms
Configuration 3 (three local faults) | 110 ms            | 400 ms

Table 2: Time delays related to local and global effects

3 Results

From this implementation, several results are presented in terms of fault presence and the related action to overcome the system's lack of performance. How the system responds to these control strategies is presented in the following graphics, taking into account the fault-free, one-local-fault, and two-local-fault situations, respectively (Figs. 8, 9, and 10). The first scenario presents a fault-free situation where the local controller response is shown; it is important to highlight that the fourth actuator presents a normal response. When the first fault appears (one local sensor does not respond and the masking approach is followed), a local time delay takes place, whose effects are shown in the first local control response.
Figure 8: Fault-free scenario (local control 1, local control 2 and fourth actuator responses over time)
Figure 9: First local fault appearance and related global effects
Figure 10: Second local fault appearance and related global effects

A similar situation is presented when a second fault appears while the first fault is still active (Fig. 10). In this case no relevant modification is shown, because the time-switching controller takes over. This example presents three local control cases with a global automaton in which control reconfiguration is based on the scheduling algorithm, which is simple because it depends only on the fault presence and on the related time delays. This reconfiguration approach becomes feasible due to the knowledge of fault presence and the consequent time delays. It is obvious that fault presence must be measurable; if the local fault localization approach cannot detect faults, this strategy becomes useless.
Figure 11: Percentage of selected valid plans for structural reconfiguration (selected plans with adequate structural reconfiguration: 93%−95%; selected plans with no adequate structural reconfiguration: 5%−7%)

Moreover, local time delay management refers to the use of a quasi-dynamic scheduler to propose dynamic reconfiguration based on current system behaviour rather than on predefined scenarios. The scheduler performs task reorganization based on the tasks' consumption times and on fault presence. The number of accepted plans is presented, taking into account those selected with no adequate response from structural reconfiguration (Fig. 11). For instance, some tasks would not have enough time to be sampled and executed. This result is presented as the percentage of adequate use of structural reconfiguration during the on-line stage. In this case, the current control law is modified according to the time delay status. Having defined the percentage of adequate plans during structural reconfiguration, this set is taken as 100% and is evaluated in terms of control law performance. The results are presented in Fig. 12, where 97% of the valid plans have a valid response in terms of the mean square error of the dynamic response of the case study.

Figure 12: Percentage of selected valid plans for control law reconfiguration (selected plans with adequate dynamic reconfiguration: 97%−98%; selected plans with no adequate dynamic reconfiguration: 2%−3%)

Figure 13: Basic model of TrueTime

To assess the communication network performance, the TrueTime network simulator is used. This strategy achieves network simulation based on message transactions and is built on the Real-Time Toolbox for MATLAB. Extended information on this tool is available in (Cervin et al. [3]). The main TrueTime characteristics are shown next.
In the TrueTime model, the computer and network blocks are introduced in Fig. 13. These blocks are event driven, and the scheduling algorithm is managed by the user independently in each computer block. TrueTime basically provides two simulation blocks, developed by the Department of Automatic Control, Lund Institute of Technology, Sweden. Each kernel represents the interface between the actual dynamical model and the network simulation; here, continuous simulation and digital conversion take place in order to transmit information through the network. This tool provides the necessary interrupts to simulate delay propagation as well as synchronization within the network.

4 Concluding remarks

The present approach shows the integration of three modelling techniques in order to perform reconfiguration. Three approaches are followed: control law design, automaton modelling and a real-time scheduling strategy. Although there is no formal verification supporting this sequence, it has been adopted because structural reconfiguration provides suitable conditions for control reconfiguration. The use of a real-time scheduling algorithm to approve or disapprove modifications of the computer network behaviour allows time delays to be bounded during a specific time window. This local time delay bounding allows the design of a control law capable of coping with these new conditions. Preliminary results show that control reconfiguration is feasible as long as a switching technique predetermines when each controller is adequate. This goal is reached by a strategy composed of three algorithms: the first is responsible for structural reconfiguration and has been implemented in this paper as an ART2A network; the second algorithm is responsible for dynamic control design; and the third algorithm is based on an automaton technique to perform switching control.
What is important in this last approach is that the control conditions are strictly bounded to a certain response. Future work is focused on producing evaluation metrics that allow a feasible comparison between different approaches.

5 Acknowledgements

The authors would like to thank the financial support of DISCA-IIMAS-UNAM and UNAM-PAPIIT (IN101307 and IN105303), Mexico, in connection with this work, and the high performance computing project within the "Macroproyecto Tecnologías para la Universidad de la Información y la Computación" of the Universidad Nacional Autónoma de México (UNAM).

Bibliography

[1] Benítez-Pérez H. and García-Nocetti F., Reconfigurable Distributed Control, Springer Verlag, 2005.
[2] Blanke M., Kinnaert M., Lunze J. and Staroswiecki M., Diagnosis and Fault Tolerant Control, Springer, 2003.
[3] Cervin A., Henriksson D., Lincoln B., Eker J. and Årzén K., How does control timing affect performance, IEEE Control Systems Magazine, Vol. 23, pp. 16-30, 2003.
[4] Gudmundsson D. and Goldberg K., Tuning robotic part feeder parameters to maximize throughput, Assembly Automation, MCB University Press, Vol. 19, No. 3, pp. 216-221, 1999.
[5] Izadi-Zamanabadi R. and Blanke M., A ship propulsion system as a benchmark for fault-tolerant control, Control Engineering Practice, Vol. 7, pp. 227-239, 1999.
[6] Jiang J. and Zhao Q., Reconfigurable control based on imprecise fault identification, Proceedings of the American Control Conference, IEEE, pp. 114-118, San Diego, June, 1999.
[7] Khalil H., Nonlinear Systems, Prentice Hall, 2002.
[8] Nilsson J., Real-Time Control with Delays, PhD thesis, Department of Automatic Control, Lund Institute of Technology, Sweden, 1998.
[9] Thompson H., Wireless and internet communications technologies for monitoring and control, Control Engineering Practice, Vol. 12, pp. 781-791, 2004.
[10] Wu N., Reliability of reconfigurable control systems: a fuzzy set theoretic perspective, Proceedings of the 36th Conference on Decision & Control, IEEE, TP15 5:10, pp. 3352-3356, San Diego, USA, 1997.

Benítez-Pérez H.
Universidad Nacional Autónoma de México
Departamento de Ingeniería de Sistemas Computacionales y Automatización
Apdo. Postal 20-726, Admón. No. 20, Del. A. Obregón, México D.F., CP. 01000, México.
E-mail: hector@uxdea4.iimas.unam.mx

Cárdenas-Flores F., García-Nocetti F.
IIMAS, Universidad Nacional Autónoma de México
Apdo. Postal 20-726, Admón. No. 20, Del. A. Obregón, México D.F., CP. 01000, México.

Received: August 1, 2007

Hector Benítez-Pérez is a full time researcher at IIMAS-UNAM (México). He obtained his BSc in electronic engineering at the Engineering Faculty, UNAM, in 1994 and his PhD at Sheffield University, UK, in 1999. His areas of interest are real-time control and fault diagnosis.

F. Cárdenas-Flores obtained his BSc in biomedical engineering at UAM-Iztapalapa in 1990. He has worked in the Departamento de Ingeniería en Sistemas Computacionales, IIMAS-UNAM, since 1994. His areas of interest are real time, parallel computing and digital signal and image processing.

F. García-Nocetti obtained his BSc in electrical engineering at the Engineering Faculty, UNAM, in 1984, and his MSc and PhD at the University of Wales, UK, in 1988 and 1991. He has been a full time researcher at IIMAS-UNAM since 2000. His areas of interest are architectures and algorithms for high performance computing, including real-time applications. He is a member of IEEE, IEE and IFAC.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 4, pp.
343-352

Analysis and Design on Key Updating Policies for Satellite Networks

Yuxuan Ji, Hengtai Ma, Gang Zheng

Abstract: Satellite networks are becoming increasingly important because of the exciting global communication services they provide. Key management policies have been successfully deployed in terrestrial networks to guarantee information security. However, long propagation delays and storage and computation constraints bring new challenges in designing efficient and cost-effective key updating policies for satellite networks. Based on the structure and communication features of satellite networks, a dynamic key management model for satellite networks (DKM-SN) is presented, which includes certificates owned by each satellite, and primary keys and session keys, both of which are shared between two satellites. Furthermore, a protocol is designed for updating satellite certificates; different policies for updating primary and session keys are studied, and their efficiency and security are analyzed and compared. In addition, a simulation environment for satellite networks is built and the key updating processes are implemented in a Walker constellation. From the simulation results, further contrasts in key updating time and storage costs between the application of the IBM hybrid key management model (HKMM) and of DKM-SN in satellite networks are presented. Finally, important suggestions for designing key updating policies are given.

Keywords: key updating, satellite networks, model, protocol, simulation

1 Introduction

Satellite networks are composed of various kinds of communication satellites, vehicles and constellations. They contain both satellite-to-satellite and satellite-to-ground links. Satellite networks integrate terrestrial systems and all sorts of satellites which are deployed in different orbits with diverse tasks. Nowadays, satellite networks are increasingly used in long-distance information transmission services.
In order to ensure message confidentiality, integrity and non-repudiation, as well as efficiency of communication, a key management mechanism should be used to provide data encryption, authentication, and key distribution and updating services for satellite communication. A key management model defines the entities in the services, the categories and relationships of the keys, and the key updating protocols and algorithms. In contrast with terrestrial networks, satellite networks are subject to dynamic network topology, long propagation delay, and low computing and storage capabilities of satellites. Due to these constraints, efforts should be made to decrease the key updating time in order to reduce the communication cost. Besides, for those keys used to encrypt large amounts of information, the key updating protocols should be based on symmetric encryption techniques for the sake of computation cost. The storage cost on satellites should also be reduced by efficiently lowering the number of keys. Currently, key management policies for terrestrial networks are comparatively sophisticated. There are generally three kinds of key management policies: those using symmetric key encryption techniques, those using public key encryption techniques, and those combining the two. For example, Kerberos [1] uses symmetric key encryption techniques and a KDC (Key Distribution Center); both symmetric and public key encryption techniques are adopted in the IBM hybrid key management model (HKMM), in which three kinds of keys are used: session keys, primary keys and public keys. Based on different network features and environments, the number of entities and the encryption techniques involved in updating a certain type of key may be very different.
For instance, though both are used for updating the session key, the Neuman-Stubblebine protocol [3] includes three parties while the Janson-Tsudik protocol [4] involves only two. Besides, the encryption technique used to update a session key can be either symmetric or public. However, all the above key management policies are only applicable in terrestrial networks. Some problems concerning key management policies in satellite networks have been studied. After studying a series of possible security threats, CCSDS showed the urgency and necessity of implementing security policies in satellite networks, such as key management, authentication, access control, etc. [5]. Ayan Roy-Chowdhury analyzed some problems that occur during the encryption and key distribution processes when applying IPsec and SSL to satellite networks [6]; the discussion was based on a hybrid satellite network with a single satellite component and several ground terminals. Cruickshank designed an authentication and key establishment protocol for two satellite users who need to encrypt data and voice information [7]. Since the public key encryption technique was used, the method was applicable in situations where the satellites are only used to relay messages rather than to implement public key encryption operations, which entail a large computation cost. Tanya Vladimirova et al. introduced some security services required on satellites and proposed an on-board security architecture and an AES fault-tolerant mechanism [8]. However, the current literature seldom studies key management policies for satellite networks with communication links between satellites. We focus on this issue and mainly discuss the categories of keys, the protocols for updating keys and the key updating efficiencies of different policies. HKMM uses session keys, primary keys and public keys.
However, the session key updating in HKMM may cause unendurably long propagation if directly applied in satellite networks. In order to solve this problem, a dynamic key management model applicable in satellite networks (DKM-SN) is presented, which also includes session keys, primary keys and public keys. Based on DKM-SN, a protocol is designed for updating public keys in satellite networks. By further studying protocols such as the Neuman-Stubblebine, Janson-Tsudik and improved Beller-Yacobi protocols [9], the efficiency and security issues in designing policies for updating primary keys and session keys in satellite networks are analyzed. Finally, the differences in time and storage costs between HKMM and DKM-SN are shown through simulations under a satellite network environment.

The rest of this paper is organized as follows. In Section 2, we discuss DKM-SN, detailing the design principles of all three parts: public key, primary key and session key updates. Section 3 presents the simulation results under a satellite network environment. We conclude with a short summary and extensions for future work in Section 4.

2 Key Management Model

In satellite networks, the communication process should be kept secret in order to ensure message confidentiality. Besides, robust and secure protocols are indispensable so that the communication process can resist well-known attacks, such as arrogating, playback, modification and so on. Therefore, we design DKM-SN to ensure the secrecy, authenticity and integrity of messages, and at the same time to decrease the time, storage and computation costs as far as possible when providing key updating services. DKM-SN consists of public, primary and session keys, which are divided based on their functions. Firstly, every satellite has a pair of public and private keys and a certificate issued by a CA (Certificate Authority), all of which are updated through communication between the satellite and the CA.
Secondly, there is a primary key shared between a pair of satellites, and it is updated with their public key information (including public and private keys and certificates). Besides, two satellites also need to share a session key when they communicate with each other, and the session key is updated with the primary key shared between them. The differences between DKM-SN and HKMM are: (1) HKMM uses a KDC while DKM-SN does not; (2) each primary key in HKMM is shared between a satellite and the KDC, while it is shared between two satellites in DKM-SN; (3) in order to lower the storage cost, a dynamic primary key updating policy is adopted in DKM-SN, which means that for some pairs of satellites their primary keys exist only when they need to establish session keys to communicate. Figure 1 shows HKMM on the left and DKM-SN on the right. P denotes a primary key, S a session key, T a terminal in terrestrial networks, and V a satellite. We can see that in HKMM the primary keys are shared between each terminal and the KDC, while the session keys are shared between every pair of terminals. For instance, in HKMM, session key S1 can be established by terminal 3, terminal 4 and the KDC using primary keys P3 and P4. In DKM-SN, both the primary and session keys are shared between two satellites. For instance, session key S21 needs to be established and shared only by the two satellites V2 and V3 using their primary key P2.

Figure 1: Key management models: HKMM and DKM-SN

2.1 Certificate Updating

Generally, an entity's public key information is updated by the entity itself before it expires. Based on PKIX standards [10], we design a proper protocol for updating public key information.

1. B → M : B, PK_B, Cert(B)
2. M → B : M, PK_M, [Cert(M)], PK'_M, E(PK^-1_M, H(PK'_M, M, B))
3.
B → M : Cert(M)'

Notation:
B, M — the CA and satellite M;
PK_X, PK^-1_X, Cert(X) — the old public key, private key and certificate of satellite X;
PK'_X, (PK'_X)^-1, Cert(X)' — the new public key, private key and certificate of satellite X;
E(K, Y) — public key encryption with key K and plaintext Y;
H(Y) — hash function;
[Y] — Y is optional.

The execution of this protocol between the CA and a satellite proceeds as follows:

1. The CA sends its own identifier, public key and certificate to the satellite and informs the satellite to start updating its public key information.
2. Upon receiving the message from the CA, the satellite does the following: (1) checks the validity of the CA's certificate; (2) generates a new pair of public and private keys for itself; (3) computes a signature with its old private key and sends this signature, its identifier, its certificate and both the old and new public keys to the CA.
3. Upon receiving the messages from the satellite, the CA does the following: (1) checks the validity of the satellite's old certificate; (2) checks the satellite's signature; (3) generates a new certificate for the satellite's new public key.
4. Upon receiving the new certificate from the CA, the satellite can check its validity with the CA's public key.

Note that the execution can continue only when the checks in the last step are successful. Now we consider some possible attacks on the CA and the satellite when running the protocol.

1. Attacks on the CA. (1) In the second step of our protocol, it is possible that an adversary may arrogate satellite M by sending M's certificate and a public key the adversary generated. However, the adversary has to send a signature at the same time. Since we assume that the adversary does not know the satellite's old private key before it expires, the adversary cannot produce a correct signature and would be detected by the CA. (2) Also in the second step, the adversary may try to replay a signature signed by a private key of the satellite which has already expired.
Obviously, only if both the satellite and the CA use valid public key information that has not expired can our protocol resist this playback attack.

2. Attacks on the satellite. (1) Since satellite M's new public key is sent without encryption, the adversary may generate a certificate for M by signing M's identifier and the new public key with its own private key, and then send this false certificate to M in the third step. However, this attack would fail once the satellite has received the certificate and checks its validity using the CA's public key. (2) As an alternative, in the third step the adversary could also replay a certificate that has expired. However, in order to keep every run of the protocol fresh, we require that the new public and private keys be different from the previous ones. For this reason, the expired certificate would be proved invalid when the satellite checks it.

The above discussion applies to a satellite. When updating the public key information of the CA itself, we should first update it before it expires and then update the certificates of all the satellites using the CA's updated certificate.

2.2 Primary Key Updating

Although all satellites have their own public key information, we avoid using it to encrypt data due to the high complexity and low efficiency of public key encryption algorithms. In most systems, public key information is used to establish symmetric keys so that encryption of data can be faster. We consider two strategies for establishing symmetric keys with public key information: (1) using public key information, two satellites directly establish a shared session key to encrypt data; (2) they first establish a primary key between them with their public key information and then use this primary key to establish their session keys. Both the primary key and the session keys are symmetric keys shared between the two satellites.
In the first strategy, though the cost of data encryption is low, the cost of frequent session key updates remains high. As for the second strategy, since both the frequent data encryption and the session key updates adopt symmetric key encryption techniques, it is more efficient than the first one. Note that the primary keys are updated much less frequently than session keys. HKMM includes primary keys and a KDC in its system. There is a unique primary key shared between the KDC and each terminal (or entity); hence, the number of primary keys in HKMM is n. In DKM-SN, the second strategy is adopted, but no KDC is deployed in our network and each unique primary key is shared between two satellites. We build DKM-SN in this way for the following reasons: (1) the KDC may become a bottleneck, since all session key updates have to pass through the KDC; (2) satellite networks are typical of long propagation. Since the session key updating process involving three parties (the KDC and two satellites) has to be finished in at least 4 steps, while that involving two parties (two satellites) in DKM-SN can be achieved in 3 steps, the former would be much slower than the latter. In satellite networks, even one trip may lead to unendurably long propagation due to multi-hop routing and the long space distance between two satellites. As for the communication protocol for updating primary keys, the improved Beller-Yacobi protocol [9] is suggested. This sophisticated protocol is qualified for updating a symmetric key with public key information and engages only two entities. Besides, this protocol is suitable for satellites that have limited computing power.

2.3 Session Key Updating

Session keys are distributed in the network to encrypt large amounts of data, such as images, voice, commands and so on.
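The storage trade-off behind the dynamic primary key policy can be made concrete. HKMM keeps one primary key per terminal (all shared with the KDC), while DKM-SN needs at most one primary key per pair of satellites, the value a = n(n-1)/2 used later in Section 3.3:

```python
# Sketch of the primary-key storage counts discussed above.
def hkmm_primary_keys(n):
    # One key per terminal, each shared with the KDC.
    return n

def dkmsn_max_primary_keys(n):
    # At most one key per unordered pair of satellites: n*(n-1)/2.
    return n * (n - 1) // 2

n = 20  # the Walker constellation size used in the simulations
print(hkmm_primary_keys(n), dkmsn_max_primary_keys(n))  # 20 190
```

The quadratic growth of the pairwise count is exactly why DKM-SN creates some primary keys only on demand.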
According to the previous analysis, a session key in DKM-SN is shared between two satellites and updated with the primary key between the same pair of satellites. Session keys must be updated more frequently than certificates and primary keys because they are used more often. In this way, the possibility of ciphertext-only attacks can be decreased. Obtaining an old session key is not once and for all, since the attacker must intercept and analyze new ciphertexts in order to get new session keys. As for the protocols for updating session keys, security and efficiency are of most importance. First of all, the protocol should be able to resist well-known attacks, such as replay, modification, typing, reflection and so on. Besides, it should not cost too much time and storage. Based on satellite network features, such as long propagation and limited resources, Janson-Tsudik 2PKDP [4] is suggested for updating session keys. This protocol is illustrated as follows.

1. A → B : A, N_AB
2. B → A : AUTH_K_AB(N_AB, K_BA, B), E_K_AB(N'_BA) ⊕ K_BA
3. A → B : ACK_K_AB(N_AB, K_BA, A)

This key establishment protocol has the minimum cost of computation and the minimum numbers of messages and steps, as proved in [4]. A detailed comparison of the time and computation costs between a session key updating protocol with a KDC and one without a KDC is presented as follows. For session key updates with a KDC, we use the Neuman-Stubblebine protocol [3], which also has the minimum number of steps involved. According to Janson-Tsudik 2PKDP, we get the calculation time C_JT in a single entity and the overall session key establishment time T_JT:

T_JT = 3 * T_AB + 4 * T_MAC + 2 * T_ES + 2 * T⊕
C_JT = 2 * T_MAC + T_ES + T⊕    (1)

where:
T_MAC — calculation time of the MAC() function;
T_ES — calculation time of symmetric encryption;
T⊕ — calculation time of the XOR operation;
T_AB — propagation delay between satellites A and B.
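The three-message exchange can be sketched with symmetric primitives only. This is a simplified, schematic rendering, not the exact 2PKDP construction: in particular, the session-key mask here is an HMAC-derived keystream over the initiator's nonce, chosen so the sketch stays self-contained with the Python standard library:

```python
import hmac, hashlib, os

# Schematic two-party key distribution in the style of Janson-Tsudik 2PKDP:
# 3 messages, MAC-based authentication, masked session key. Simplified.
def mac(key, *parts):
    return hmac.new(key, b"|".join(parts), hashlib.sha256).digest()

def mask(key, nonce):
    return mac(key, b"mask", nonce)          # 32-byte keystream (assumption)

primary_key = os.urandom(32)                 # K_AB, pre-shared (Section 2.2)

# 1. A -> B : A, N_AB
n_ab = os.urandom(16)

# 2. B -> A : AUTH(N_AB, K_BA, B), masked session key
k_ba = os.urandom(32)                        # fresh session key chosen by B
auth = mac(primary_key, n_ab, k_ba, b"B")
masked = bytes(x ^ y for x, y in zip(k_ba, mask(primary_key, n_ab)))

# A unmasks the session key and verifies the authenticator.
recovered = bytes(x ^ y for x, y in zip(masked, mask(primary_key, n_ab)))
assert hmac.compare_digest(auth, mac(primary_key, n_ab, recovered, b"B"))

# 3. A -> B : ACK(N_AB, K_BA, A)
ack = mac(primary_key, n_ab, recovered, b"A")
assert recovered == k_ba
```

Only MACs and XORs appear on the satellites, which is the point of the cost expressions in Eqn. (1).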
Since T⊕ is no larger than any other term, we get:

T_JT ≈ 3 * T_AB + 4 * T_MAC + 2 * T_ES
C_JT ≈ 2 * T_MAC + T_ES    (2)

For the Neuman-Stubblebine protocol, we similarly get the calculation time C_NS for a single node and the overall session key establishment time T_NS:

T_NS = (2 * T_AB + T_AC + T_BC) + 8 * T_ES
C_NS = 8 * T_ES    (3)

where:
T_AC — propagation delay between the KDC and satellite A;
T_BC — propagation delay between the KDC and satellite B.

Because the propagation delay is much larger than the calculation time of either encryption or MAC() in satellite networks, we conclude from the above estimation that both the single-node calculation time cost and the overall session key updating time cost of Janson-Tsudik 2PKDP are much less than those of Neuman-Stubblebine. In the three-party session key establishment policy, the KDC may become a bottleneck in the system; fortunately, the two-party session key establishment policy avoids this drawback.

3 Simulations

In order to verify the effectiveness and suitability of DKM-SN in a satellite network environment, a proper simulation environment for satellite networks is very important and necessary. Based on the simulation system for distributed satellite networks [11], we design a simulation scenario and implement the experiments. We set up a Walker constellation consisting of 20 MEO satellites. There are 4 orbital planes, on each of which 5 satellites are equally deployed. Each plane has an inclination of 75° and an orbit height of 14163 km. In the certificate updating simulation, the protocol designed in Section 2.1 is applied and 160-bit ECDSA [12] is implemented as the signature algorithm. In primary key updating, the improved Beller-Yacobi protocol is applied; ECDSA and ECIES [13] are used as the signature and encryption algorithms respectively. In session key updating, Janson-Tsudik 2PKDP and 128-bit AES are implemented.
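The comparison of Eqns. (2) and (3) can be checked numerically. Propagation delays dominate in satellite links, so the 3-trip two-party exchange beats the 4-trip KDC-based one; the timing values below are illustrative assumptions, not measurements from the paper:

```python
# Numeric sketch of the overall establishment times in Eqns. (2)-(3).
def t_jt(t_ab, t_mac, t_es):          # Eqn. (2): 2PKDP, two parties
    return 3 * t_ab + 4 * t_mac + 2 * t_es

def t_ns(t_ab, t_ac, t_bc, t_es):     # Eqn. (3): Neuman-Stubblebine with KDC
    return (2 * t_ab + t_ac + t_bc) + 8 * t_es

# Assumed values (ms): one-way propagation ~100 ms, crypto operations ~1 ms.
print(t_jt(100, 1, 1))                # 306
print(t_ns(100, 100, 100, 1))         # 408
assert t_jt(100, 1, 1) < t_ns(100, 100, 100, 1)
```

With propagation at 100 ms per trip, the extra round through the KDC costs roughly one full one-way delay, which matches the qualitative conclusion above.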
3.1 Certificate Updating Simulation

Dynamic topology. In this part, we examine the influence of topological changes on the certificate updates of a certain satellite. Dynamic changes of the satellite network topology lead to variations of the routing tables in the satellites; therefore, the time cost of certificate updating for a certain satellite varies with time. Figure 2 shows both the overall and computation time costs of certificate updates for a certain satellite in different periods of time. We update the satellite's certificate every 10 seconds, and Figure 2 shows the time cost during 500 seconds. The longest updating time for the satellite is 834.200 ms. The routing information shows that in the longest update, data transmission from the satellite to the CA entails 5 hops, and thus the three-step interaction between them entails 15 hops altogether. The minimum time cost is 167.45 ms, when there is only 1 hop between the satellite and the CA, so a 3-hop effort suffices to finish the certificate updating process. Based on this result, we suggest updating certificates when communication between the satellite and the CA needs the minimum number of hops.

Static topology. In this part, we examine the certificate updates of different satellites in the Walker constellation during a fixed period of time. Figure 3 shows both the overall and computation time costs of certificate updates for different satellites during a fixed period of time in the Walker constellation. We can see that there are huge gaps in the time costs among the satellites, since the numbers of hops between the CA and the satellites are very different. In the simulation, for example, there are 5 hops between the CA and satellite No. 6 but only 1 hop between the CA and satellite No. 4.
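A quick consistency check on the two reported extremes: the three-step exchange traverses 3 × hops links in total, so the per-hop latency implied by the worst and best cases should roughly agree:

```python
# Back-of-the-envelope check of the certificate-updating measurements above.
def per_hop_ms(total_ms, hops_to_ca, steps=3):
    # Each of the 3 protocol steps crosses hops_to_ca links.
    return total_ms / (steps * hops_to_ca)

worst = per_hop_ms(834.200, 5)   # 5 hops to the CA -> 15 link traversals
best = per_hop_ms(167.45, 1)     # 1 hop to the CA  ->  3 link traversals

print(round(worst, 2), round(best, 2))   # 55.61 55.82
assert abs(worst - best) < 1.0           # both imply ~56 ms per hop
```

Both extremes imply roughly 56 ms per hop, supporting the explanation that hop count, not computation, dominates the updating time.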
Figure 2: Time cost of certificate updating of a certain satellite in different periods of time (overall time cost vs. computation time cost)

Figure 3: Time costs of certificate updates of all satellites in the same period of time (overall time cost vs. computation time cost)

Besides, from Figure 2 and Figure 3 we can see that the computation cost varies little across satellites. It is a small fraction of the overall time cost of certificate updates, ranging from 5% to 40%.

3.2 Comparison of Computation Time Costs

Figure 4 shows the computation costs of all three kinds of key updates in the Walker constellation; 20 samples are presented for each kind of update.

Figure 4: Computation costs of the three kinds of key updates (primary key updating, session key updating, certificate updating)

We see from Figure 4 that the session key update costs least, which is suitable for the most frequently updated session keys. The average time cost of certificate updates is less than that of primary key updates, mainly due to the extra public key encryption in the improved Beller-Yacobi protocol.

3.3 Comparison Between Key Management Models

Comparing DKM-SN with HKMM, we mainly have the following conclusions. Firstly, the time costs for session key updates in DKM-SN are much less than those in HKMM. Secondly, DKM-SN will contain more primary keys than HKMM if all primary keys are pre-distributed between all pairs of satellites. Since the storage capacity of a satellite is limited, we recommend that for some pairs of satellites their primary keys exist only when necessary.
therefore, if we want to establish a session key between two satellites that share no primary key, we must first establish a primary key using their public key information and then establish the session key with that primary key. a more detailed comparison between hkmm and dkm-sn is presented as follows. suppose the total number of satellites is n and the average time cost of primary key updates is z. in dkm-sn, let a denote the maximum number of primary keys, namely a = n × (n − 1)/2; let h denote the average time cost of session key updates when every pair of satellites shares a primary key; let x denote the practical number of primary keys; and let y denote the average time cost of session key updates when x pairs of satellites share primary keys. in hkmm, suppose the average time cost of session key updates is h + δ, with δ > 0. based on dkm-sn, we have

y = [x × h + (a − x) × (z + h)] / a = (z + h) − (z/a) × x (4)

obviously, given x = 0, we get y = z + h; this means no primary key is pre-shared in dkm-sn, so before establishing a session key every two satellites must first set up a primary key. similarly, given x = a, we get y = h; under this condition, each pair of satellites owns a primary key. if y > h + δ, we get x < a × (z − δ)/z. so we conclude that when n < x < a × (z − δ)/z, hkmm is superior to dkm-sn in both the average time cost of session key updates and the storage cost. specifically, based on the simulation data, we have z ≈ 400 ms, h ≈ 360 ms, δ ≈ 120 ms and n = 20. under this condition, y = (z + h) − (z/a) × x ≈ 760 − 2 × x. figure 5 shows the relationship between the average time cost of session key updates and the number of primary keys in dkm-sn.

figure 5: average time cost of session key updates in dkm-sn

given y > h + δ, namely 760 − 2 × x > 480, we get x < 140.
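the trade-off captured by eq. (4) is easy to check numerically. the sketch below uses the paper's symbols (n, z, h, δ, a, x) but the function names are ours; note that with n = 20 we have a = 190 and z/a ≈ 2.1, so y ≈ 760 − 2x and the bound x < 140 are rounded figures, while the exact break-even point is x = a(z − δ)/z = 133.

```python
def avg_session_cost(x, n, z, h):
    """eq. (4): average session key update cost (ms) in dkm-sn
    when x of the a = n(n-1)/2 possible primary keys are pre-shared."""
    a = n * (n - 1) // 2
    return (x * h + (a - x) * (z + h)) / a

def break_even_keys(n, z, delta):
    """hkmm is faster on session key updates while x stays below this bound."""
    a = n * (n - 1) // 2
    return a * (z - delta) / z

n, z, h, delta = 20, 400.0, 360.0, 120.0
print(avg_session_cost(0, n, z, h))    # x = 0: every session key needs a primary key first (z + h)
print(avg_session_cost(190, n, z, h))  # x = a: every pair already shares a primary key (h)
print(break_even_keys(n, z, delta))    # exact break-even number of primary keys
```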
we can see from the linear function shown in figure 5 that if the number of primary keys ranges from 20 to 140, hkmm is better than dkm-sn in both time and storage costs. when 140 < x ≤ a, the average time cost of session key updates in dkm-sn is less than that in hkmm, while the number of stored keys is larger than in hkmm. furthermore, if we deploy primary keys for those satellites that communicate most frequently, the average time cost as a function of the number of primary keys becomes concave rather than the linear function shown in figure 5. estimates and decisions can thus be made from real on-board data in satellite networks. for example, if the satellites can store the maximum number of primary keys, pre-allocating a primary key for every pair of satellites is the best choice, which is especially suitable for a network with a small number of satellites.

4 summary and conclusions

in this paper, a new key management model called dkm-sn is designed for satellite networks. a protocol for updating satellite certificates is designed, and the efficiency and security of different policies for updating primary and session keys are analyzed. the performance of updating the three kinds of keys is shown, and a further comparison between dkm-sn and hkmm is given in terms of both time cost and storage cost. the efficiency and applicability of different key updating policies are discussed under different on-board storage constraints and time requirements. in the future, we will further study the design of efficient, low-cost and secure key management models for satellite networks.

yuxuan ji
institute of software, chinese academy of sciences
national key laboratory of integrated information system technology
4# south fourth street, zhong guan cun, beijing 100190, p.r.
china
e-mail: jiyuxuan06@gmail.com

hengtai ma
institute of software, chinese academy of sciences
national key laboratory of integrated information system technology
4# south fourth street, zhong guan cun, beijing 100190, p.r. china
e-mail: htma@ios.cn

gang zheng
institute of software, chinese academy of sciences
national key laboratory of integrated information system technology
4# south fourth street, zhong guan cun, beijing 100190, p.r. china
e-mail: gangzhengcn@yahoo.com.cn

international journal of computers, communications & control vol. iii (2008), no. 1, pp. 21-32

electroglottographic measures based on gci and goi detection using multiscale product

aicha bouzid, noureddine ellouze

abstract: this paper deals with the estimation of glottal parameters, such as the local pitch and the open quotient, from the electroglottographic (egg) signal. the estimation is based on the glottal closing and opening instants determined by a multiscale product of this signal. the wavelet transform of the egg signal is computed with a quadratic spline function. the wavelet coefficients, calculated at different dyadic scales, show modulus maxima at localized discontinuities of the egg signal. the detected maxima and minima correspond to the glottal opening and closing instants, called gois and gcis. to improve the estimation precision, we form the multiscale product of the wavelet transform coefficients at three successive dyadic scales. this processing enhances edge detection: a multiscale product is a nonlinear combination of successive scales that reduces noise and spurious peaks. we apply a cubic root amplitude to the product to improve the representation of weak amplitudes. the method gives a good representation of gcis and a better detection of gois. it was tested on the keele university database and proves effective and robust in many cases, even for typical signals showing undetermined gois and multiple peaks at gcis.
finally, precise measurement of these instants allows accurate estimation of prosodic parameters such as the local pitch and the open quotient.

keywords: wavelet transform, multiscale product, electroglottographic signal, glottal closing instant, glottal opening instant

1 introduction

electroglottography is a non-invasive medical technique for exploring glottal activity. the resulting signal, called the electroglottogram (egg), is a common and efficient reference signal for pitch estimation. the egg signal can be used to determine the glottal closing instant (gci) and the glottal opening instant (goi). the gci is commonly used in speech processing tasks such as voiced/unvoiced classification, accurate source parameter estimation and robust instant detection; it is very useful in synchronous speech analysis and synthesis. the goi is useful for voice quality estimation and for other voice and speaker characterization. in the present work, the estimated gcis and gois are used to calculate the local fundamental frequency and the glottal open quotient. referring to childers [1], the egg signal presents an important amplitude variation at the gci, indicating a rapid behavioural change of the source (the glottis). the derivative of the egg signal, called the degg, shows strong peaks at closing instants and weak ones at opening instants. referring to mallat [2], important information lies in the sharp transitions of the signal or its derivative. these singularities are detected by following the wavelet transform modulus maxima at fine scales. as we have shown in previous work [3], the wavelet transform is efficient in most cases for detecting singularities of the egg signal at closing instants, but not for glottal opening instant detection. in fact, large amplitude variations of the egg signal can be observed at gcis, whereas at gois the discontinuities are less obvious and can hardly be detected on the signal [4]. referring to sadler [5], [6], the multiscale product method (mpm) takes the product of wavelet transform coefficients at different scales.
the scale multiplication gives better discontinuity detection results than any single scale, especially in localization performance [7], [8], [9]. this method is used for edge detection in image processing. the aim of this paper is to present a new method for measuring glottal parameters from the egg signal, namely the goi, the gci, the fundamental frequency f0 and the open quotient oq. different time-domain and spectral methods have been proposed for estimating the pitch period and the open quotient oq. in this work, we propose to apply the multiscale product method (mpm) to the egg signal in order to improve the accuracy of goi and gci measurement, and thereby obtain better estimates of the pitch and the open quotient. the present paper is organised as follows. after the introduction, section 2 reviews the well-known methods applied to the egg signal for gci and goi detection and their related problems: the first method uses various thresholds (or crossing levels) of the egg signal, while the second is based on the derivative of the egg signal. it is shown that in many cases these methods leave gois undetermined and, in fewer cases, gcis ambiguous. section 3 presents the multiscale product method and its application to the egg signal, in order to improve the detection of gois and gcis. section 4 presents a multiscale product enhancement for gci and goi detection based on the cubic root of the mp (crmp), and its comparison with the mp and degg methods. section 5 deals with the egg parameter measurements given by the crmp, the crossing level and the degg extrema methods. finally, section 6 concludes this work.

2 gci and goi detection methods and limitations

estimation of glottal opening instants (gois) and closing instants (gcis) can be carried out on the egg signal by different time- or frequency-based methods.
here, we present two basic methods: the first is based on egg amplitude crossing levels (with different levels), and the second is based on the detection of maxima and minima of the degg signal.

egg threshold method

rothenberg et al. [10] use the 50% crossing level of the signal amplitude from a base line. this method gives direct values of the goi and gci from the egg signal in the case of modal and tense voices. a threshold of 35% of the maximum egg amplitude has also been used; this ratio also gives direct values of the gci and goi. howard [11] uses a threshold of 3/7 of the maximum egg amplitude for goi detection. illustrations of these methods are depicted in figure 1. the major drawbacks of such methods are the lack of accuracy of gci detection and missed gois.

figure 1: egg signal (female voice, speaker f4, vowel [o]), 3 crossing levels, and the degg.

the top of figure 1 depicts the egg signal of a female voice pronouncing the vowel [o], with the crossing levels of 50%, 3/7 and 35% of the amplitude allowing the estimation of the goi and gci.

degg maximum and minimum detection method

a more precise method for gci detection uses the degg signal [12]. childers characterises the egg inflexion points by the degg [13], [14]. experimental investigation shows that the degg signal exhibits two opposite peaks in each period: the strong peak corresponds to the gci and the weak one to the goi [15], [16], [17]. an illustration of the derivative method is depicted at the bottom of figure 1.

goi and gci detection related problems

methods based on the egg and degg for goi and gci detection suffer from different problems, such as missing events or duplication. examples are taken from the keele university database.
it is a speech database containing acoustic and egg signals simultaneously recorded in a soundproof room. it comprises five adult female and five adult male speakers, each uttering the same phonetically balanced english text. the egg and speech signals are given at the same sampling frequency of 20 khz [18]. many cases of missing peaks appear on the degg at glottal closing instants [19]. some gcis and gois are indiscernible on the degg. this glottal behaviour was observed by anastaplo and karnell [16]. pérez et al. underline the difficulty of goi detection from the derivative of the egg signal [21].

figure 2: egg signal of a male voice (speaker m2, voiced sound) and degg depicting the case of gci and goi missing.

a missed gci has a considerable effect on the time period measure, and a missed goi affects the oq measure. figure 2 illustrates problematic cases where threshold methods on the egg or degg fail to detect gcis and gois. this example shows the egg signal of a voiced sound of the male speaker m2 and its derivative. in many cases, multiple peaks appear on the degg at glottal closing and opening instants [19]. a typical example is presented in figure 3. in this case, the degg signal shows an undetermined opening instant and a double closing instant. the egg signal used for this example is a vowel [i] of the female speaker f2; we can see that it presents noise due to the subject's movement during the recording. another typical example is shown in figure 4. here the degg signal shows glottal opening and closing instants with poor precision. the egg signal used in this example is a voiced fricative [z] of the male speaker m5; we can see that it presents the natural noise of the fricative.
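the two classical detectors reviewed in this section can be sketched in a few lines of python. this is an illustrative reimplementation (the function names and the polarity convention, egg rising at closure, are our assumptions), not the authors' code:

```python
import numpy as np

def crossing_instants(egg, level_frac):
    """threshold method: goi/gci as crossings of a fractional level.

    the level sits level_frac of the way from the minimum to the
    maximum amplitude (0.5, 3/7 or 0.35 in the methods above).
    upward crossings are taken as gcis, downward crossings as gois.
    """
    level = egg.min() + level_frac * (egg.max() - egg.min())
    above = egg >= level
    gci = np.where(~above[:-1] & above[1:])[0] + 1
    goi = np.where(above[:-1] & ~above[1:])[0] + 1
    return gci, goi

def degg_extrema(egg):
    """degg method: the strongest positive and negative derivative peaks."""
    degg = np.diff(egg)
    return int(np.argmax(degg)), int(np.argmin(degg))

# one synthetic glottal cycle: fast closure, sustained contact, slow opening
egg = np.concatenate([np.zeros(50), np.linspace(0, 1, 5), np.ones(50),
                      np.linspace(1, 0, 20), np.zeros(50)])
gci, goi = crossing_instants(egg, 0.5)   # crossings near samples 52 and 115
ci, oi = degg_extrema(egg)               # derivative peaks on the two slopes
```

on this clean cycle both detectors agree; the failure cases of figures 2-4 arise precisely when the egg is noisy or the opening slope is too gradual for either a level crossing or a derivative peak to be well defined.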
3 goi and gci detection by multiscale products

it is commonly known that the wavelet transform is an efficient tool for detecting and characterizing signal singularities [2]. singularities of the signal are detected by finding the abscissae where the wavelet modulus maxima converge at fine scales. the singularity type is characterised by the wavelet vanishing moments and the decay of the maxima across scales. this is explained by the fact that a wavelet transform with n vanishing moments can be interpreted as an nth-order differential operator of the signal smoothed by the primitive of the wavelet, called the smoothing function. so if the wavelet is chosen to have one vanishing moment, modulus maxima appear at discontinuities of the signal and represent the maxima of the first derivative of the smoothed signal.

figure 3: egg signal (speaker f2, vowel [i]) and the degg depicting the case of double and imprecise peaks at gci and goi.

figure 4: egg signal (speaker m5, voiced fricative [z]) and the degg depicting the case of double and imprecise peaks at gci and goi.

in previous work [3], we showed that the local regularity of the egg signal can be characterised by the wavelet transform modulus maxima. the modulus maxima of the egg signal present two types of localized singularities, indicating the glottal closing and opening instants. the greater peak corresponds to the gci and the weaker one to the goi. modulus maxima estimate the events with better precision at small scales.
but no single scale can give an accurate value of the gci and goi, since singularities are too smoothed at large scales and too weak at small scales to be well located. figure 5 shows the egg signal of a voiced sound of speaker m2 followed by its wavelet transforms at the scales 1/2, 1 and 2. here the wavelet transform (like the degg) cannot detect some singularities of the signal; missed events still exist. that is why multiscale analysis seems necessary to improve egg edge localisation. products of coefficients across scales are frequently used in image analysis. witkin [20] provided the foundation for scale-space theory by generalizing rosenfeld's work [22], in which smoothing filters at dyadic scales are used. based essentially on forming multiscale products of smoothed gradient estimates, this approach attempts to enhance the gradient peaks caused by true edges while suppressing false peaks due to noise. the wavelet transform acts as an edge detector, and the detail coefficients are equivalent to estimated gradients. to distinguish edge maxima from noise and inappropriate maxima, mallat and zhong [23] analyze the singularity properties of wavelet transform modulus maxima across various scales; the first derivative of a gaussian and the quadratic spline are used to play this role. xu et al. rely on the variations of the wavelet transform across scales: direct multiplication of wavelet transform data at adjacent scales is used to distinguish important edges from noise [9]. sadler and swami [6] studied the multiscale product method for a signal in the presence of noise. in the wavelet domain, it is well known that edge structures are present in each subband while noise decreases rapidly along the scales. it has been observed that multiplying adjacent scales can sharpen edges while diluting noise [6], [9].
the expression of the multiscale product is

p(n) = ∏_{j=1}^{3} ω_{s_j}(f(n)) (1)

where ω_{s_j}(f(n)) is the wavelet transform of the function f at the dyadic scale s_j. the product p(n) has the property of revealing peaks at signal edges while taking relatively small values elsewhere. singularities produce peaks along the scales of the wavelet transform, and these peaks are reinforced by the product p(n). the signal peaks will align only across the first few scales, not all of them, because increasing the amount of smoothing spreads the response and causes singularities separated in time to interact. thus, choosing scales that are too large results in misaligned peaks in p(n). an odd number of terms in the product preserves the sign of the edge. choosing three successive dyadic scales is an optimal solution in the multiscale product for detecting small peaks. as the multiscale product improves edge detection, we apply the mpm to the egg signal to improve the detection of the gci and goi and the measurement precision, particularly for the weak singularities at the goi. at the bottom of figure 5, we show the mp of the egg signal of a voiced sound of speaker m2. we can clearly see the effect of the product in cancelling the additional noise peaks present in its derivative depicted in figure 2, and consequently the better detection of the goi. we note the efficiency of the mpm in strengthening the gci and its ability to detect gois, which become clear enough. however, some missing events still exist; that is why the multiscale product needs to be enhanced for accurate gci and goi localisation.

figure 5: egg signal of a voiced sound of speaker m2, the 3 wavelet transforms and their product.
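equation (1) can be prototyped directly. in the sketch below (our own illustration, not the authors' implementation) the quadratic spline wavelet is replaced by a first-derivative-of-gaussian filter, which also has one vanishing moment, and the dyadic scales are taken as 1, 2 and 4 instead of the paper's 1/2, 1 and 2 for kernel-size convenience:

```python
import numpy as np

def gaussian_derivative_kernel(scale, width=4.0):
    """first derivative of a gaussian: a stand-in for the quadratic
    spline wavelet (one vanishing moment), so the filter response is
    the derivative of the signal smoothed at the given scale."""
    half = int(np.ceil(width * scale))
    t = np.arange(-half, half + 1, dtype=float)
    return -t * np.exp(-t**2 / (2.0 * scale**2)) / scale**3

def multiscale_product(signal, scales=(1.0, 2.0, 4.0)):
    """eq. (1): pointwise product of the smoothed-derivative responses
    at three successive dyadic scales. peaks survive only where the
    edge responses align across scales; isolated noise is diluted."""
    p = np.ones(len(signal))
    for s in scales:
        p *= np.convolve(signal, gaussian_derivative_kernel(s), mode="same")
    return p

# a rising step edge at sample 100: the product peaks there, with the
# sign preserved because an odd number (three) of scales is multiplied
x = np.concatenate([np.zeros(100), np.ones(100)])
p = multiscale_product(x)
```

the interior maximum of |p| falls at the step (the boundary samples also respond because convolution pads with zeros); with the quadratic spline wavelet of the paper, the same alignment argument applies to the egg discontinuities at gcis and gois.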
4 multiscale product enhancement for goi and gci detection

we have seen that the multiscale product gives better results than single-scale wavelet coefficients. we use the cubic root of the multiscale product to improve the results for weak peaks: the cubic root has a zooming effect that enhances the representation of peaks. this section deals with this enhancement and compares gci and goi detection by the degg, the mp and the cubic root of the mp. specific examples from the keele university database are presented below. figure 6 shows an example of an egg signal and its derivative with precise peaks at the goi and gci; the example is a frame of the vowel [o] uttered by speaker f4. the figure shows, respectively, the egg signal, its derivative, the multiscale product at the scales 1/2, 1 and 2, and the cubic root of the product. the multiscale product depicts the resulting cross-scale product p(n) for the three scales and shows clean peaks aligned with the degg signal edges. first, we note two types of peaks in the cross-scale product: those corresponding to the gci are more distinguishable than those related to the goi. the modulus cubic root shows maxima at the goi and gci; the weak peaks at gois are better represented and effectively reinforced compared with those obtained by the multiscale product alone. figure 7 illustrates the example of a female utterance of the vowel [i], where the gois are undetermined and the gci peaks present irregular structures. we note double glottal closing peaks that bring about inaccurate measurements. besides, we can clearly see the effect of the product in cancelling the additional peaks, which then gives a better detection of the goi. the modulus cubic root is used to reinforce the small peaks corresponding to the goi. this figure underlines the importance of the modulus cubic root of the multiscale product.
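the zooming effect of the cubic root is easy to quantify: for peak amplitudes below 1, |p|^(1/3) lifts weak values far more than strong ones, so the ratio of a weak goi peak to a strong gci peak improves by a power of 1/3. a minimal sketch (the function name is ours; the paper plots the modulus of the cubic root):

```python
import numpy as np

def modulus_cubic_root(p):
    """modulus cubic root of the multiscale product: compresses the
    dynamic range so weak goi peaks become visible beside the much
    stronger gci peaks."""
    return np.cbrt(np.abs(p))

# toy product trace: a strong gci-like peak and a weak goi-like peak
p = np.zeros(10)
p[2], p[7] = 1.0, 0.001
q = modulus_cubic_root(p)
# the weak/strong peak ratio improves from 0.001 to 0.1, a 100-fold
# gain in visibility, while the peak positions are left untouched
```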
figure 8 shows the egg signal of the fricative [z] of speaker m5, the degg signal, the multiscale product and its modulus cubic root. figure 9 shows the same representation for a voiced sound of speaker m2. these examples illustrate the modulus cubic root applied to the multiscale product of the egg signal. in figure 9, continuous lines indicate gcis and dotted lines indicate gois for the missing events. in fact, the cubic root brings out peaks at gois and gcis that appear neither in the degg signal nor in the product; it is a complex case. thus, we can see that the modulus cubic root of the multiscale product gives a better detection of the gci and goi from the egg signal than the threshold methods and the degg approach. the mp cubic root reinforces gci minima that are indiscernible not only in the degg but also in the mp. consequently, the mpm associated with the cubic root gives the most reliable reference measures of the goi and gci.

figure 6: egg signal of vowel [o] (speaker f4), degg, mp and its modulus cubic root.

figure 7: egg signal of vowel [i] (speaker f2), degg, mp and its modulus cubic root.

figure 8: egg signal of voiced fricative [z] (speaker m5), degg, mp and its modulus cubic root.
figure 9: egg signal of a voiced sound (speaker m2), degg, mp, and its modulus cubic root.

5 egg parameter measurements

this section deals with the measurement of the local pitch period and the local open quotient. these can be calculated using goi(k) and gci(k). the glottal closing instant represents the beginning of the pitch period, while the glottal opening instant goi(k) corresponds to the beginning of the open phase.

measurement of f0 and oq

the local pitch period is given by the following formula:

t0(k) = gci(k + 1) − gci(k) (2)

the local fundamental frequency f0(k) is given by

f0(k) = 1 / t0(k) (3)

the open quotient is defined as the ratio between the duration of the open phase of the glottis and the fundamental period. it is given by the following formula:

oq(k) = [gci(k + 1) − goi(k)] / t0(k) (4)

in section 4, it was shown that the cubic root of the mp outperforms the other methods in the typical cases presented in this work. this performance can be further confirmed by evaluating prosodic parameter measures using the proposed method. figure 10 depicts the local fundamental frequency f0 of a voiced sound of speaker m2, corresponding to a case where the degg method gives imprecise gois and the threshold method fails to determine some ambiguous gcis and gois. figure 11 depicts the local open quotient for the same utterance determined by the same methods. the threshold methods mentioned above fail to detect some gcis and gois, leading to aberrant and missing measures of the fundamental frequency and the open quotient. besides, imprecise detection of the goi from the degg signal leads to aberrant measures of the oq, as shown in periods 12 and 14 of figure 11.
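equations (2)-(4) translate directly into code. the sketch below (the function name is ours; instants are assumed to be given in seconds, with goi[k] the opening that follows gci[k]) computes the local period, fundamental frequency and open quotient from matched gci/goi sequences:

```python
import numpy as np

def glottal_parameters(gci, goi):
    """local pitch period, fundamental frequency and open quotient,
    eqs. (2)-(4), from arrays of closing and opening instants."""
    gci = np.asarray(gci, dtype=float)
    goi = np.asarray(goi, dtype=float)
    t0 = gci[1:] - gci[:-1]            # eq. (2): pitch period
    f0 = 1.0 / t0                      # eq. (3): fundamental frequency
    oq = (gci[1:] - goi[:-1]) / t0     # eq. (4): open quotient
    return t0, f0, oq

# synthetic 100 hz voice: a closure every 10 ms, opening 4 ms later,
# so the open phase covers 6 ms of each 10 ms period (oq = 0.6)
gci = np.arange(0.0, 0.05, 0.01)
t0, f0, oq = glottal_parameters(gci, gci + 0.004)
```

note how the error pattern of figures 10 and 11 follows from these formulas: a missed gci corrupts t0 (hence f0) for the surrounding periods, while a missed or imprecise goi corrupts only oq.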
figure 10: fundamental frequency f0 given by the degg (o), 3/7 threshold (*) and mp (+) methods for a voiced sound of speaker m2.

figure 11: open quotient oq given by the degg (o), 3/7 threshold (*) and mp (+) methods for a voiced sound of speaker m2.

6 conclusion

parameter characterisation of the egg signal by detection of the glottal closing instants (gci) and glottal opening instants (goi) is carried out by a new method, the multiscale product method (mpm). the proposed method consists of computing the product of the wavelet transform of the egg signal at three successive dyadic scales. the wavelet used is the quadratic spline. this wavelet has one vanishing moment, and the calculated coefficients show modulus maxima at discontinuities of the signal. the wavelet transform is calculated at the scales 1/2, 1 and 2. then the cubic root amplitude of the product is calculated to enhance the maxima of the resulting signal. this method gives better gci and goi localisation than the classical methods based on crossing levels or on the signal derivative, especially in typical cases of multiple peaks and undetermined gcis and gois on the degg signal. the non-linear product reinforces the cross-scale peaks produced at gcis and especially at gois, and reduces spurious noisy peaks. the efficiency of the proposed method is demonstrated through comparative results with the threshold and derivative methods for glottal closing instants, glottal opening instants, and glottal parameter measurements such as the pitch frequency and the open quotient.
locating the gci and goi efficiently using the cubic root of the mp allows us to constitute a robust pitch and open quotient reference.

bibliography

[1] d. g. childers, a. m. smith and g. p. moore, relationships between electroglottograph, speech, and vocal cord contact, folia phoniatr., vol. 36, pp. 105-118, 1984.

[2] s. mallat, a wavelet tour of signal processing, second edition, academic press, san diego, 1999.

[3] a. bouzid and n. ellouze, local regularity analysis at glottal opening and closing instants in electroglottogram signal using wavelet transform modulus maxima, in proc. eurospeech 2003, geneve, pp. 2837-2840, 2003.

[4] n. henrich, c. d'alessandro, m. castellongo, on the use of the derivative of electroglottographic signals for characterization of non-pathological phonation, journal of acoustical society of america, vol. 115, pp. 1321-1332, 2004.

[5] b. m. sadler, t. pham, and l. c. sadler, optimal and wavelet based shock wave detection and estimation, journal of acoustical society of america, vol. 104, no. 2, pp. 955-963, 1998.

[6] b. m. sadler and a. swami, analysis of multiscale products for step detection and estimation, ieee trans. inform. theory, vol. 45, no. 3, pp. 1043-1051, 1999.

[7] l. zhang and p. bao, edge detection by scale multiplication in wavelet domain, pattern recognition letters, vol. 23, no. 14, pp. 1771-1784, 2002.

[8] p. bao, l. zhang, and x. wu, canny edge detection enhancement by scale multiplication, ieee trans. on pattern analysis and machine intelligence, vol. 27, no. 9, pp. 1485-1490, 2005.

[9] y. xu, j. b. weaver, d. m. healy, and j. lu, wavelet transform domain filters: a spatially selective noise filtration technique, ieee trans. image processing, vol. 3, no. 6, pp. 747-758, 1994.

[10] m. rothenberg and j. j. mahshie, monitoring vocal fold abduction through vocal fold contact area, journal of speech and hearing research, vol. 31, pp. 338-351, 1988.

[11] d. m.
howard, variation of electrolaryngographically derived closed quotient for trained and untrained adult female singers, journal of voice, vol. 9, no. 2, pp. 1212-1223, 1995.

[12] d. m. howard, g. a. lindsey, and b. allen, toward the quantification of vocal efficiency, journal of voice, vol. 4, no. 3, pp. 205-212, 1990.

[13] d. g. childers and a. k. krishnamurthy, a critical review of electroglottography, crc critical reviews in biomedical engineering, vol. 12, pp. 131-161, 1985.

[14] d. g. childers, d. m. hooks, g. p. moore, l. eskenazi, and a. l. lalwani, electroglottography and vocal fold physiology, journal of speech hearing research, vol. 33, pp. 245-254, 1990.

[15] d. g. childers and j. n. lara, electroglottography for laryngeal function assessment and speech analysis, ieee trans. on biomedical engineering, vol. 31, no. 12, pp. 807-817, 1985.

[16] s. anastaplo and m. p. karnell, synchronized videoscopic and electroglottographic examination of glottal opening, journal of acoustical society of america, vol. 83, no. 5, pp. 1883-1890, 1988.

[17] m. h. hess and m. ludwigs, strobophotoglottographic transillumination as a method for the analysis of vocal fold vibration patterns, journal of voice, vol. 14, no. 2, pp. 255-271, 2000.

[18] f. plante, g. f. meyer, and w. a. ainsworth, a pitch extraction reference database, in proc. eurospeech 1995, pp. 837-840, 1995.

[19] a. bouzid, n. ellouze, contribution à la détection des instants d'ouverture et de fermeture de la glotte sur les signaux de parole voisés par transformée en ondelettes, thèse de doctorat, enit, juillet 2004.

[20] a. witkin, scale-space filtering, proc. int. joint conf. artif. intell., pp. 1019-1021, 1983.

[21] j. pérez and a. bonafonte, automatic voice-source parametrization of natural speech, in proc. icslp 2005, lisboa, portugal, 2005.

[22] a. rosenfeld, a non linear edge detection, proc. ieee, vol. 58, pp.
814-816, 1970. [23] s. mallat, and s. zhong, characterization of signals from multiscale edges, ieee trans. on pattern analysis and machine intelligence, vol. 14, no. 7, pp. 710-732, 1992. aicha bouzid enit signal processing lab enit b. p. 37, le belvédère 1002, tunis, tunisia e-mail: aicha.bouzid@enit.rnu.tn, bouzidacha@yahoo.fr noureddine ellouze enit signal processing lab enit b. p. 37, le belvédère 1002, tunis, tunisia e-mail: n.ellouze@enit.rnu.tn received: september 1, 2007 32 aicha bouzid, noureddine ellouze dr. aicha bouzid was born in 29 april, 1975. she has diploma in electrical engineering at ecole nationale d’ingénieurs de tunis (enit) in 1998, master degree on automatic and signal processing in 2000 and ph. d. thesis at signal processing laboratory (lsts), in july 2004. she has joined the institut supérieur des etudes technologiques de sfax as associate professor in 1999. currently she is working as assistant professor in the department of electronic at the institut supérieur d’electronique et de communication de sfax. she has more than 30 publications in international journals and conferences. her research areas of interest are signal processing, speech processing and applied mathematics. pr. noureddine ellouze was born in 19 december, 1945. he received a ph.d. degree in 1977 at inp (toulousefrance), and electronic engineering diploma from enseeiht in 1968 university p. sabatier. in 1978. pr. ellouze joined the electrical engineering department at enit (tunisia). in 1990, he became professor in signal processing, digital signal processing and stochastic process. he was the head of the electrical department from 1978 to 1983 and general manager and president of irsit from 1987-1994. he is now director of research laboratory lsts at enit, and is in charge of ats master degree at enit. pr. 
Ellouze has supervised numerous master's and Ph.D. theses and has published more than 300 scientific papers in journals and proceedings, in the domains of signal processing, speech processing, biomedical applications, pattern recognition, and man-machine communication.

International Journal of Computers, Communications & Control, Vol. II (2007), No. 1, pp. 84-93

A Software System for Online Learning Applied in the Field of Computer Science

Gabriela Moise

Abstract: Computer-assisted learning is a modern study area that can be applied throughout the learning process. The main objective of this paper is to present a software system for online learning based on intelligent software agent technologies. The main ideas on which this paper is built are: a learning profile is associated with every person (an idea based on the existence of multiple intelligences, as defined by Gardner [3]); pedagogical resources can be modelled through educational semantic networks or conceptual maps; and a flexible computer-assisted learning system must be based on intelligent agent technology. A system dedicated to computer-assisted learning must be adapted to the learning profile of each student. The author presents a flexible online teaching software system, which learns to teach according to the learning profile of each student (the author defined this system in her Ph.D. thesis; it includes intelligent agent structures, reward learning algorithms, and algorithms to generate plans for an agent). The application includes two agents: the supervisor agent and the pedagogical agent, which determines the optimal pedagogical resources for teaching the course. The application has been designed in Microsoft Visual Studio 6.0 and uses Microsoft Agent technology, which supports voice recognition. The Protégé 3.0 software has also been used to build an ontology for computer-assisted learning.
The system has been tested on the graph theory course taught in postgraduate computer science courses; the results prove the necessity of defining a strategy for selecting the pedagogical resources presented to the students according to their learning profile.

Keywords: intelligent agent, conceptual map, learning style

1 Introduction

A software system for online learning in the field of computer science, called iLearning, is proposed in this paper. The system is based on modelling the pedagogical resources through conceptual maps or semantic networks and on using intelligent agent technology [6]. The main feature of the system is its flexibility [7]. In order to use this system we have to take into account the learning style of each person [4]. Donald Clark notes that teaching out of conformity with the learning style of the students does not mean that the students do not learn, but students learn better if the teacher presents pedagogical resources matching their learning style [13]. Teachers have to follow six phases in order to use the system [7]:
1. establishing the learning style of the students by questioning them;
2. drawing the conceptual map of the course using software for building conceptual maps;
3. teaching based on a technique for traversing the conceptual map of the course;
4. building, for each node of the conceptual map, a set of pedagogical resources; the system will select the best resources according to the learning styles of the students;
5. evaluating the students in each node of the map, with the teacher entering the grade into the software system;
6. teaching by traversing all conceptual nodes and showing the selected resource to the students.
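The six phases above can be sketched as a short Python loop; all names and data structures below are illustrative assumptions, not part of the actual iLearning implementation:

```python
# Minimal sketch of the six-phase iLearning workflow (illustrative only).
# Each node of the conceptual map holds several pedagogical resources;
# the one matching the student's learning style is shown and the grade recorded.

LEARNING_STYLES = ("visual", "auditory", "kinesthetic")

def teach_course(conceptual_map, resources, best_for_style, student_style, evaluate):
    """Traverse the conceptual map in order, show the resource matching the
    student's learning style, and collect the grade obtained in each node."""
    grades = {}
    for node in conceptual_map:                      # phase 6: traverse all nodes
        options = resources[node]                    # phase 4: resource set per node
        choice = best_for_style.get((node, student_style), options[0])
        grades[node] = evaluate(node, choice)        # phase 5: grade entered per node
    return grades

# Tiny usage example with a two-node map and a fixed grading function.
cmap = ["shortest-path-defs", "dijkstra"]
res = {n: ["text", "multimedia", "exercises"] for n in cmap}
policy = {("dijkstra", "visual"): "multimedia"}
marks = teach_course(cmap, res, policy, "visual", lambda n, r: 9)
```

The selection table `best_for_style` plays the role of the strategy the system learns; here it is filled in by hand purely for illustration.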
The system was proposed and tested by the author of this paper in the doctoral thesis with the same title [7].

2 The Architecture of the iLearning System

The system is composed of six functional components: the instructional management system, the communication system, the informational system, the evaluation system, the tutoring system and the pedagogical system [7].

Figure 1: Architecture of the iLearning system

The instructional management system is implemented on an SQL server and manages databases about faculties, students, marks, credits, admissions, registration, teachers and fees; builds reports and statistics about the educational process; plans the instructional process (study periods, evaluation periods); records educational plans; starts the teaching process; and handles security. The communication agent is implemented as a software agent and carries out the communication among students, among teachers, between students and teachers, and with the administrative staff by e-mail, forum and chat; it also provides external communication tools: e-mail, browser, chat, messaging tools. The informational system is implemented on the SQL server and manages the courses and pedagogical resources: it records the pedagogical materials on the servers, offers tools to search courses and information in the course databases, and offers web searching tools such as a search engine. The evaluation agent is implemented as a software agent and builds personalized tests; it evaluates students according to the objectives of the instructional process and evaluates the educational process through reports and statistics. The tutor agent is implemented as a software agent and has the following roles: it guides the teacher in building pedagogical resources, guides the students in the instruction process, and guides the teacher in the teaching activities.
The pedagogical agent is implemented as a software agent and verifies the instructional objectives (according to Bloom's taxonomy); it checks the learning style of the student, defines pedagogical strategies, defines the curriculum, checks the pedagogical resources and checks the correctness of the evaluation activities. The interface is implemented as a software agent and customizes the user interface. The space developed by the iLearning system is a flexible instructional environment. The work scenario of the participants in the educational process is presented in Figure 2.

Figure 2: iLearning work scenario

3 The Agents of the iLearning System

The iLearning system contains three kinds of components [7]:
1. "Execution" agents, which perform the teaching-learning process. These agents do not have perception sensors; they receive their inputs from the supervisor agent. They are intelligent, reactive and goal-oriented.
2. The "supervisor agent", a software module with a certain degree of intelligence. This agent manages the whole system and is able to learn how the system should be managed.
3. The database system. This component was implemented with an SQL server and a Visual Basic client.

3.1 The Execution Agents

The execution agents are adapted from the InteRRaP model [8], to which a capacity component and a decisional module were added; the decisional module establishes the ability of the agent to solve a given situation. The architecture of this kind of agent is presented in Figure 3. The control unit receives messages about the state of the environment from the supervisor agent and establishes the ability of the agent to solve the problem. If the agent is not able to solve the situation, it returns an error message. The capacity is defined by the role of the agent in the system. The module analyzes the information received from the supervisor agent, recognizes the situation (goal-oriented job or reactive job) and decides which level will be activated.
Finally, the module receives the actions that have to be performed by the actors or returns a message to the supervisor agent. The activities are implemented through three levels: the communication level, the goal-oriented level and the reactive level. The control unit is responsible for maintaining the plans library, the behaviors library and the knowledge base of the system.

Figure 3: The architecture of the execution agent

3.2 The Supervisor Agent

The supervisor agent performs the following functions:
1. coordinates the whole activity of the system;
2. receives information from the environment (students, teachers and staff) and decides which agent will be called;
3. receives requests from the agents and transmits them to the destination agents or solves them using its own resources;
4. records the capacities of the agents;
5. manages the agents' library;
6. manages the database systems;
7. communicates with the iLearning developers.

The supervisor agent has the properties of facilitator, mediator and broker agents. The architecture of the agent is shown in Figure 4. Its structure is simpler than that of the execution agent, since it handles only two types of situations: social situations and administrative situations.

Figure 4: The architecture of the supervisor agent

3.3 The Model of the iLearning System

The iLearning system is a hybrid system that contains a multi-agent system and two database systems. The multi-agent system is composed of agents that cooperate with one another [2]. The agents have different goals and are managed by the supervisor agent [9]. The supervisor agent collaborates with the database systems and coordinates the whole educational process. The model is presented in Figure 5.
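The supervisor agent's routing behaviour (receiving requests, checking recorded capacities and forwarding to an execution agent, or reporting failure) can be sketched as follows; the class and message format are illustrative assumptions, not the iLearning code:

```python
# Minimal sketch of the supervisor agent's dispatch role; agent names and
# the request format are illustrative assumptions.

class SupervisorAgent:
    def __init__(self):
        self.agents = {}          # registered execution agents and their capacities

    def register(self, name, capacities, handler):
        self.agents[name] = (set(capacities), handler)   # record agent capacities

    def dispatch(self, request):
        """Receive a request from the environment and route it to the first
        registered agent whose capacities cover the requested task."""
        for name, (capacities, handler) in self.agents.items():
            if request["task"] in capacities:
                return handler(request)
        return {"error": "no agent able to solve the situation"}

sup = SupervisorAgent()
sup.register("evaluation", {"build-test"}, lambda r: {"agent": "evaluation", "ok": True})
reply = sup.dispatch({"task": "build-test"})
```

An unknown task falls through to the error message, mirroring the execution agent's behaviour of reporting situations it cannot solve.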
Figure 5: The model of the iLearning system

3.4 The iLearning Programming and Reward Algorithm

To test the system, a software application was built with Visual Studio 6.0 and Microsoft Agent technology. The algorithm used by the agents of the iLearning system to learn is a reward algorithm based on Q-learning. This technique starts with an initial estimate $Q(s,a)$ for each state-action pair. When action $a$ is selected in state $s$, the system receives a reward $r(s,a)$ and observes the next state $s'$. The Q-learning algorithm (Watkins, 1989) [11] updates the state-action value function as follows:

$$Q(s,a) \leftarrow Q(s,a) + \alpha \left( r(s,a) + \gamma \max_{a'} Q(s',a') - Q(s,a) \right)$$

where $\alpha \in (0,1)$ is the learning rate, $\gamma \in (0,1)$ is the discount factor and $s'$ is the state reached from state $s$ by executing action $a$ [1], [5]. The conceptual map of the course defines the state space of the system. A student with a certain learning style is, at a given point of his studies, in a node of the conceptual map, having been examined and graded during the instruction process. The objective of the system is to maximize the results of the students at the various evaluations. The contribution of the author is the adaptation of this algorithm for the pedagogical agents. The reward is established through the student's evaluation in a node of the conceptual map and is defined by:

$$r(s,a) = note \quad \text{or} \quad r(s,a) = note \times p_{note} + aPriorKnowledge \times p_{base}$$

where $note$ is the score received by the student at his/her evaluation in a node of the conceptual map; $aPriorKnowledge$ is the score received by the student at the initial evaluation (before the course starts, or the average of the scores received at the evaluations in the nodes preceding the current node in the conceptual map); and $p_{note}$ and $p_{base}$ are parameters. The algorithm used is:
1.
Start with an array Q covering all possible state-action pairs; each item of the array is initialized with zero or a small value.
2. The optimal policy is initialized with a supervised policy, and Qoptim is initialized with Q.
3. For each student, the conceptual map is traversed and the Q array is updated.
4. The Q array is analyzed.

4 The Experimental Results

To test the system, students with different learning styles (visual, auditory, kinesthetic) and different ages were selected, together with a module from the "Graph Theory" course. The conceptual map [10] of the course, with seven nodes, is presented in Figure 6; each node has three pedagogical resources attached. The software contains two agents: the supervisor agent and the pedagogical agent [12]. The pedagogical agent establishes the optimal pedagogical resources for teaching the course. The state of the system is defined by the combination (age, learning style, node number). Learning style no. 1 denotes the visual learning style, style no. 2 the auditory learning style and style no. 3 the kinesthetic learning style. Age category no. 1 denotes persons aged between 20 and 30, category no. 2 persons aged between 30 and 40, and category no. 3 persons aged between 40 and 50. The software teaches itself to teach better, selecting the best pedagogical resources for each node of the conceptual map.

Figure 6: The conceptual map of the course Graph Theory, module Shortest Path

The results obtained for age category no. 2 and learning style no. 1 are presented in Table 1. The last column contains the values of the parameter Q from the reward algorithm. The greater values of the parameter Q give the optimal policy (the best pedagogical resources for each node of the map according to the learning styles of the students). The optimal policy is described in Table 2. The results obtained for age category no. 3 and learning style no.
1 are presented in Table 3.

Age  Style  Node  Resource   Q
 2     1      0      1      8.75
 2     1      0      2      0
 2     1      0      3     10.27
 2     1      1      1      4
 2     1      1      2      6
 2     1      1      3      7
 2     1      2      1      6.37
 2     1      2      2      0
 2     1      2      3      7.68
 2     1      3      1      4.5
 2     1      3      2      6.12
 2     1      3      3      8.18
 2     1      4      1      3.5
 2     1      4      2      9.28
 2     1      4      3      0
 2     1      5      1      4
 2     1      5      2      3.5
 2     1      5      3      7
 2     1      6      1      0
 2     1      6      2      5.25
 2     1      6      3      5.75

Table 1: Experimental results for age category 2 and learning style 1

Age  Style  Node  Resource
 2     1      0      3
 2     1      1      3
 2     1      2      3
 2     1      3      3
 2     1      4      2
 2     1      5      3
 2     1      6      3

Table 2: The optimal policy for age category 2 and learning style 1

Age  Style  Node  Resource   Q
 3     1      0      1      6.25
 3     1      0      2      8.75
 3     1      0      3      0
 3     1      1      1      0
 3     1      1      2      6.5
 3     1      1      3      0
 3     1      2      1      5
 3     1      2      2      5.75
 3     1      2      3      0
 3     1      3      1      4
 3     1      3      2      0
 3     1      3      3      5
 3     1      4      1      0
 3     1      4      2      4.5
 3     1      4      3      5
 3     1      5      1      0
 3     1      5      2      4.5
 3     1      5      3      3.5
 3     1      6      1      3.5
 3     1      6      2      0
 3     1      6      3      4.5

Table 3: Experimental results for age category 3 and learning style 1

The greater values of the parameter Q give the optimal policy (the best pedagogical resources for each node of the map according to the learning styles of the students). The optimal policy is described in Table 4. It is true that the best results would be obtained if there were many pedagogical resources for each node of the conceptual map. These pedagogical resources have to be in different formats (text, multimedia, audio, video), containing details, explanations, exercises, references, case studies, and so forth. There are many software tools that can be used to produce files in different formats; we also need to improve computer performance in order to use such software.

Age  Style  Node  Resource
 3     1      0      2
 3     1      1      2
 3     1      2      2
 3     1      3      3
 3     1      4      3
 3     1      5      2
 3     1      6      3

Table 4: The optimal policy for age category 3 and learning style 1

The teachers validate the results; the pedagogical resources no. 2 and no.
3 are the best pedagogical resources built for this course. Pedagogical resources no. 2 and no. 3 contain many details, examples and explanations. Resource no. 3 (i.e., the selection of the third pedagogical resource) is the best choice for persons in age category no. 2 with learning style no. 1. Resources no. 2 and no. 3 (i.e., the selection of the second and third pedagogical resources) are the best choices for persons in age category no. 3 with learning style no. 1.

5 Summary and Conclusions

The online instruction model can be implemented with intelligent agent technology; the iLearning system proves this. The system models the teaching-learning process so that it can adapt itself to the learning profile of each person. Two kinds of agents are defined: the execution agent and the supervisor agent. In her doctoral study, the author tested the system and proved the importance of using intelligent agent technology in online instruction systems.

References

[1] Bowling, M., Veloso, M., Multiagent learning using a variable learning rate, Artificial Intelligence, Vol. 136, pp. 215-250, 2002.
[2] Buiu, C., Albu, M., Agenti software inteligenti [Intelligent Software Agents], Editura ICPE, 2000.
[3] Gardner, H., Intelligence Reframed: Multiple Intelligences for the 21st Century, Basic Books, 1999.
[4] Joyce, B., Weil, M., Calhoun, E., Models of Teaching, Basic Books, 1999.
[5] Leon, F., Sova, I., Gâlea, D., Reinforcement learning strategies for intelligent agents in knowledge-based information systems, Proceedings of the 8th International Symposium on Automatic Control and Computer Science, Iaşi, 2004.
[6] Moise, G., The role of intelligent agents in online learning environment, CBLIS Conference Proceedings, pp. 98-105, 2005.
[7] Moise, G., A software system for online learning applied to higher education in the field of computer science, Ph.D. thesis, Petroleum-Gas University of Ploiesti, 2006.
[8] Müller, Jörg P., The Design of Intelligent Agents: A Layered Approach, Lecture Notes in Computer Science, Vol. 1177: Lecture Notes in Artificial Intelligence, Springer-Verlag, 1996.
[9] Rao, A.S., Georgeff, M.P., BDI agents: from theory to practice, Proceedings of the First International Conference on Multi-Agent Systems (ICMAS-95), San Francisco, 1995.
[10] Sowa, J.F., Knowledge Representation: Logical, Philosophical, and Computational Foundations, Brooks/Cole, Thomson Learning.
[11] Watkins, C., Learning from Delayed Rewards, Ph.D. thesis, University of Cambridge, England.
[12] Wooldridge, M., Jennings, N.R., Intelligent agents: theory and practice, Knowledge Engineering Review, Vol. 10, No. 2, 1995.
[13] http://www.nwlink.com

Gabriela Moise
Petroleum-Gas University of Ploiesti, Computer Science Department
No. 39 Bd. Bucuresti, Ploiesti, Romania
E-mail: gmoise@upg-ploiesti.ro

Received: November 11, 2006

Editor's note about the author: Gabriela Moise (born on February 13, 1969) graduated from the Faculty of Mathematics, specialization Informatics, of the University of Bucharest. She worked as a software developer in the software industry. Since 2003 she has been a lecturer at the Petroleum-Gas University of Ploiesti. Her research fields are e-learning, graph theory, intelligent agents, knowledge representation and e-health. She has (co)authored seven books and more than twenty research papers, and has participated in many international conferences in the e-learning and e-business area.

International Journal of Computers, Communications & Control, Vol. III (2008), No. 1, pp. 103-109

State Analysis of Time-Varying Singular Bilinear Systems by RK-Butcher Algorithms

V. Murugesh, K. Batri

Abstract: The Runge-Kutta (RK)-Butcher algorithm is used to study time-varying singular bilinear systems with exact solutions.
The results (discrete solutions) obtained using the Haar wavelets, single-term Walsh series (STWS) and RK-Butcher algorithms are compared with the exact solutions of the time-varying singular bilinear systems. It is found that the solution obtained using the RK-Butcher algorithm is closer to the exact solutions of the time-varying singular bilinear systems. The RK-Butcher algorithm can easily be implemented on a digital computer and the solution can be obtained for any length of time, which is an added advantage of this algorithm.

Keywords: time-varying singular bilinear systems, Haar wavelets, Runge-Kutta Butcher algorithm, STWS algorithm.

1 Introduction

The development of singular bilinear systems has been studied by several researchers. Campbell [1] did some preliminary work, but no closed-form solution was available in that paper. In some analyses of neural networks, both singular systems [2] and bilinear systems [3] have been used. Multipliers and algebraic interconnections between singular systems and bilinear systems are allowed in dynamical systems. Singular bilinear systems have been discussed extensively in the literature by Lewis et al. [4]; however, their solution only applies to the time-invariant case. Hsiao and Wang [5] applied Haar wavelets to the solution of time-varying singular bilinear systems. Sepehrian and Razzaghi [6] applied the STWS approach to find the numerical solution of time-varying bilinear systems. Runge-Kutta (RK) methods have become very popular both as computational techniques and as a topic for research [7-12]. Butcher [8] derived the best RK pair together with an error estimate, and this approach is known as the RK-Butcher algorithm. The algorithm appears to be of sixth order because it requires six function evaluations, but in practice its "working order" is closer to five (fifth order).
However, the accuracy of the results obtained is better than that of all the other algorithms examined, including the RK-Fehlberg, RK-Merson, RK-centroidal mean (RKCeM) and RK-arithmetic mean (RKAM) algorithms. Murugesh and Murugesan [13-15] introduced the RK-Butcher algorithm in raster and time-multiplexing CNN simulations. Recently, the RK-Butcher algorithm was used to find the numerical solution of an industrial application problem [16, 18]. In this article, we present a new approach for solving time-varying singular bilinear systems using the RK-Butcher algorithm with higher accuracy.

2 The RK-Butcher Algorithm

The normal order of an RK algorithm is the approximate number of leading terms of the infinite Taylor series which calculates the trajectory of a moving point [17]. The remainder of the infinite sum, which is excluded, is referred to as the local truncation error (LTE). RK algorithms are forward-looking predictors, i.e. they do not use any information from preceding steps to predict the future position of a point. For this reason, they require a minimum of input data and consequently are very simple to program and use. The general $p$-stage Runge-Kutta method for solving the initial value problem

$$y' = f(x, y), \qquad y(x_0) = y_0 \qquad (1)$$

is defined by

$$y_{n+1} = y_n + h \sum_{i=1}^{p} b_i k_i$$

where

$$k_i = f\Big(x_n + c_i h,\; y_n + h \sum_{j=1}^{p} a_{ij} k_j\Big), \qquad c_i = \sum_{j=1}^{p} a_{ij}, \qquad i = 1, 2, \ldots, p.$$

In the preceding equations, $c$ and $b$ are $p$-dimensional vectors and $A = (a_{ij})$ is a $p \times p$ matrix.
The Butcher array then takes the form

$$\begin{array}{c|ccccc}
c_1 & a_{11} & & & & \\
c_2 & a_{21} & a_{22} & & & \\
c_3 & a_{31} & a_{32} & a_{33} & & \\
\vdots & \vdots & \vdots & & \ddots & \\
c_p & a_{p1} & a_{p2} & a_{p3} & \cdots & a_{pp} \\
\hline
 & b_1 & b_2 & \cdots & b_{p-1} & b_p
\end{array}$$

The RK-Butcher algorithm for equation (1) is of the form

$$\begin{aligned}
k_1 &= h f(x_n, y_n) \\
k_2 &= h f\left(x_n + \tfrac{h}{4},\; y_n + \tfrac{k_1}{4}\right) \\
k_3 &= h f\left(x_n + \tfrac{h}{4},\; y_n + \tfrac{k_1}{8} + \tfrac{k_2}{8}\right) \\
k_4 &= h f\left(x_n + \tfrac{h}{2},\; y_n - \tfrac{k_2}{2} + k_3\right) \\
k_5 &= h f\left(x_n + \tfrac{3h}{4},\; y_n + \tfrac{3k_1}{16} + \tfrac{9k_4}{16}\right) \\
k_6 &= h f\left(x_n + h,\; y_n - \tfrac{3k_1}{7} + \tfrac{2k_2}{7} + \tfrac{12k_3}{7} - \tfrac{12k_4}{7} + \tfrac{8k_5}{7}\right)
\end{aligned} \qquad (2)$$

with the fifth-order predictor

$$y_{n+1} = y_n + \tfrac{1}{90}\left(7k_1 + 32k_3 + 12k_4 + 32k_5 + 7k_6\right)$$

and the fourth-order predictor

$$y^*_{n+1} = y_n + \tfrac{1}{6}\left(k_1 + 4k_4 + k_6\right).$$

The local truncation error estimate (EE) is $EE = y_{n+1} - y^*_{n+1}$. The Butcher array of equation (2) then takes the form

$$\begin{array}{c|cccccc}
0 & & & & & & \\
\frac{1}{4} & \frac{1}{4} & & & & & \\
\frac{1}{4} & \frac{1}{8} & \frac{1}{8} & & & & \\
\frac{1}{2} & 0 & -\frac{1}{2} & 1 & & & \\
\frac{3}{4} & \frac{3}{16} & 0 & 0 & \frac{9}{16} & & \\
1 & -\frac{3}{7} & \frac{2}{7} & \frac{12}{7} & -\frac{12}{7} & \frac{8}{7} & \\
\hline
 & \frac{7}{90} & 0 & \frac{32}{90} & \frac{12}{90} & \frac{32}{90} & \frac{7}{90} \\
 & \frac{1}{6} & 0 & 0 & \frac{4}{6} & 0 & \frac{1}{6}
\end{array}$$

3 Analysis of Time-Varying Singular Bilinear Systems

Consider the linear first-order time-varying singular system

$$K \dot{x}(t) = A x(t) + B(t) u(t) \qquad (3)$$

with $x_0 = x(0)$, where $K$ is an $n \times n$ singular matrix, $A$ is an $n \times n$ matrix and $B$ is an $n \times r$ matrix; $x(t)$ is the $n$-dimensional state vector and $u(t)$ is an $r$-dimensional input vector. The time-varying singular bilinear system is of the form

$$E(t) \dot{x}(t) = A(t) x(t) + \sum_{i=1}^{q} N_i(t) x(t) u_i(t) + B(t) u(t) \qquad (4)$$

Equation (4) can be written in the form of (3) as

$$E(t) \dot{x}(t) = \Big( A(t) + \sum_{i=1}^{q} N_i(t) u_i(t) \Big) x(t) + B(t) u(t) \qquad (5)$$

where the singular matrix $E(t) \in \mathbb{R}^{n \times n}$, the state $x(t) \in \mathbb{R}^{n}$, the control $u(t) \in \mathbb{R}^{q}$, $A(t) \in \mathbb{R}^{n \times n}$, $B(t) \in \mathbb{R}^{n \times q}$, $N_i(t) \in \mathbb{R}^{n \times n}$, and the $u_i(t)$, $i = 1, 2, \ldots, q$, are the components of $u(t)$. The response $x(t)$, $0 \le t \le t_i$, is required to be found. Time-varying singular bilinear systems are much more difficult to solve than time-invariant singular bilinear systems; therefore, many authors have tried various transform methods to overcome these difficulties.
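The stage formulas of equation (2) translate directly into code. The following minimal Python sketch (names are illustrative) implements one RK-Butcher step, returning both predictors and the error estimate EE, and applies it to the scalar test problem y' = y:

```python
import math

def rk_butcher_step(f, x, y, h):
    """One step of the RK-Butcher pair from equation (2): returns the
    fifth-order estimate, the fourth-order estimate and the error estimate EE."""
    k1 = h * f(x, y)
    k2 = h * f(x + h/4, y + k1/4)
    k3 = h * f(x + h/4, y + k1/8 + k2/8)
    k4 = h * f(x + h/2, y - k2/2 + k3)
    k5 = h * f(x + 3*h/4, y + 3*k1/16 + 9*k4/16)
    k6 = h * f(x + h, y - 3*k1/7 + 2*k2/7 + 12*k3/7 - 12*k4/7 + 8*k5/7)
    y5 = y + (7*k1 + 32*k3 + 12*k4 + 32*k5 + 7*k6) / 90   # 5th-order predictor
    y4 = y + (k1 + 4*k4 + k6) / 6                          # 4th-order predictor
    return y5, y4, y5 - y4                                 # EE = y_{n+1} - y*_{n+1}

# Test problem y' = y, y(0) = 1, exact solution e^x; four steps of h = 0.25.
y = 1.0
for n in range(4):
    y, _, ee = rk_butcher_step(lambda x, y: y, n * 0.25, y, 0.25)
```

In an adaptive setting the error estimate EE would be compared against a tolerance to accept the step or shrink h; here it is simply returned.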
In this article, we introduce the RK-Butcher algorithm to solve these time-varying singular bilinear systems with higher accuracy.

4 Numerical Example

Consider the time-varying singular bilinear system of the following form (Hsiao and Wang [5], Sepehrian and Razzaghi [6]):

$$E(t) = \begin{bmatrix} 0 & -t & 0 \\ 1 & 0 & t \\ 0 & 1 & 0 \end{bmatrix}, \quad A(t) = \begin{bmatrix} -2 & t & 1 \\ 0 & -4 & 2 \\ -2t & 0 & 1 \end{bmatrix}, \quad N_1(t) = \begin{bmatrix} 1 & -t & 1 \\ 0 & 3 & -2 \\ 2t & 0 & -2 \end{bmatrix} \qquad (6)$$

$$B(t) = \begin{bmatrix} 2 & 1 & 3 \end{bmatrix}^T, \quad u(t) = 1, \quad \text{with initial condition } x(0) = \begin{bmatrix} 12 & 2 & 5 \end{bmatrix}^T.$$

Solving (5), the analytic solution for $x(t)$ can be shown to be

$$x(t) = \begin{bmatrix} (2-t)\left(e^{-t/2} + e^{t}\right) + 8 \\ 2e^{-t/2} - e^{t} + 1 \\ e^{-t/2} + e^{t} + 3 \end{bmatrix} \qquad (7)$$

The discrete solutions of equation (5) are evaluated using the RK-Butcher algorithm of equation (2) (with step size $t = 0.25$), and the results are compared with the solutions obtained by the Haar wavelets method of Hsiao and Wang [5] and the STWS method of Sepehrian and Razzaghi [6]. The results are shown in Tables 1-3 along with the exact solutions calculated using equation (7); the errors between the exact and discrete solutions are also given.

No.  Time  Exact x1    Haar      Haar err  STWS      STWS err  RK-Butcher  RKB err
1    0     12.000000   12.0000   0.000000  12.00000  0.000000  12.000000   0.000000
2    0.25  11.886053   11.8861   0.000047  11.88605  0.000003  11.886053   0.000000
3    0.5   11.791414   11.7914   0.000014  11.79142  0.000006  11.791414   0.000000
4    0.75  11.711533   11.7115   0.000033  11.71154  0.000007  11.711533   0.000000
5    1     11.641283   11.6413   0.000017  11.64128  0.000003  11.641283   0.000000
6    1.25  11.574810   11.5748   0.000010  11.57481  0.000000  11.574810   0.000000
7    1.5   11.505362   11.5054   0.000038  11.50537  0.000008  11.505362   0.000000
8    1.75  11.425089   11.4251   0.000011  11.42510  0.000011  11.425089   0.000000
9    2     11.324812   11.3248   0.000012  11.32481  0.000002  11.324812   0.000000

Table 1: Solutions for the problem at various values of x1.
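As a quick sanity check, the analytic solution (7) can be evaluated numerically; at t = 0 it must reproduce the initial condition x(0) = [12 2 5]^T. A minimal Python sketch (the function name is an illustrative assumption):

```python
import math

def exact_solution(t):
    """Analytic solution x(t) of the numerical example, equation (7)."""
    a = math.exp(-t / 2)   # e^(-t/2)
    b = math.exp(t)        # e^t
    x1 = (2 - t) * (a + b) + 8
    x2 = 2 * a - b + 1
    x3 = a + b + 3
    return [x1, x2, x3]

x0 = exact_solution(0.0)   # should equal the initial condition [12, 2, 5]
```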
No.  Time  Exact x2    Haar      Haar err  STWS      STWS err  RK-Butcher  RKB err
1    0     2.000000    2.0000    0.000000  2.00000   0.000000  2.000000    0.000000
2    0.25  1.745678    1.7457    0.000022  1.74568   0.000002  1.745678    0.000000
3    0.5   1.480968    1.4810    0.000032  1.48097   0.000002  1.480968    0.000000
4    0.75  1.203067    1.2031    0.000033  1.20307   0.000003  1.203067    0.000000
5    1     0.908880    0.9089    0.000020  0.90888   0.000000  0.908880    0.000000
6    1.25  0.594985    0.5950    0.000015  0.59498   0.000005  0.594985    0.000000
7    1.5   0.257579    0.2576    0.000021  0.25758   0.000001  0.257579    0.000000
8    1.75  -0.107578   -0.1076   0.000022  -0.10758  0.000002  -0.107578   0.000000
9    2     -0.505221   -0.5052   0.000021  -0.50522  0.000001  -0.505221   0.000000

Table 2: Solutions for the problem at various values of x2.

No.  Time  Exact x3    Haar      Haar err  STWS      STWS err  RK-Butcher  RKB err
1    0     5.000000    5.0000    0.000000  5.00000   0.000000  5.000000    0.000000
2    0.25  5.072562    5.0726    0.000038  5.07256   0.000002  5.072562    0.000000
3    0.5   5.166522    5.1665    0.000022  5.16652   0.000002  5.166522    0.000000
4    0.75  5.284021    5.2840    0.000021  5.28402   0.000001  5.284021    0.000000
5    1     5.427522    5.4275    0.000022  5.42752   0.000002  5.427522    0.000000
6    1.25  5.599862    5.5999    0.000038  5.59986   0.000002  5.599862    0.000000
7    1.5   5.804289    5.8043    0.000011  5.80429   0.000001  5.804289    0.000000
8    1.75  6.044524    6.0445    0.000024  6.04451   0.000014  6.044524    0.000000
9    2     6.324812    6.3248    0.000012  6.32480   0.000012  6.324812    0.000000

Table 3: Solutions for the problem at various values of x3.

5 Conclusions

The discrete solutions obtained using the RK-Butcher algorithm give more accurate values than the Haar wavelets method discussed by Hsiao and Wang [5] and the STWS method of Sepehrian and Razzaghi [6]. From Tables 1-3, one can observe that the solutions obtained by the RK-Butcher algorithm match the exact solutions of the time-varying singular bilinear systems well, while the Haar wavelets and STWS methods yield small errors. Hence the RK-Butcher algorithm is more suitable for studying time-varying bilinear systems.

Bibliography

[1] Campbell, S.L., 1987, Bilinear nonlinear descriptor control systems, CRSC Technical Report 102386-01, Department of Mathematics, N.C. State University, Raleigh, NC 27695.
[2] Declaris, N. and Rindos, A., 1984, Semistate analysis of neural networks in Aplysia californica, Proceedings of the 27th MSCS, 686-689.
[3] Wiener, N., 1948, Cybernetics (Cambridge: MIT Press).
[4] Lewis, F.L., Mertzios, B.C., and Marszalek, W., 1991, Analysis of singular bilinear systems using Walsh functions, IEE Proceedings Part-D, 138, 89-92.
[5] Hsiao, C.H., and Wang, W.J., 2000, State analysis of time-varying singular bilinear systems via Haar wavelets, Mathematics and Computers in Simulation, 52, 11-20.
[6] Sepehrian, B., and Razzaghi, M., 2003, State analysis of time-varying singular bilinear systems by single-term Walsh series, International Journal of Computer Mathematics, 80, 413-418.
[7] Alexander, R.K. and Coyle, J.J., 1990, Runge-Kutta methods for differential-algebraic systems, SIAM Journal of Numerical Analysis, 27, 736-752.
[8] Butcher, J.C., 1987, The Numerical Analysis of Ordinary Differential Equations: Runge-Kutta and General Linear Methods (Chichester: John Wiley).
[9] Butcher, J.C., 2003, Numerical Methods for Ordinary Differential Equations (Chichester: John Wiley).
[10] Shampine, L.F., 1994, Numerical Solution of Ordinary Differential Equations (New York: Chapman & Hall).
[11] Yaakub, A.R. and Evans, D.J., 1999, A fourth order Runge-Kutta RK(4,4) method with error control, International Journal of Computer Mathematics, 71, 383-411.
[12] Yaakub, A.R.
and evans, d.j., 1999, new runge-kutta starters of multi-step methods. international journal of computer mathematics, 71, 99-104. [13] murugesh, v., and murugesan, k., 2004, comparison of numerical integration algorithms in raster cnn simulation, lecture notes in computer science, 3285, 115-122. [14] murugesh, v. and murugesan, k., 2005, simulation of cellular neural networks using the rkbutcher algorithm, international journal of management and systems, 21, 65-78. [15] murugesh, v., and murugesan, k., 2006, simulation of time-multiplexing cellular neural networks with numerical integration algorithms, lecture notes in computer science, 3991, 115-122. [16] devarajan gopal, murugesh, v., and murugesan, k., 2006, numerical solution of second-order robot arm control problem using runge-kutta butcher algorithm, international journal of computer mathematics, 83, 345-356. [17] shampine, l.f. and gordon, m.k., 1975, computer solutions of ordinary differential equations: the initial value problem (san francisco, ca:w.h. freeman). [18] v. murugesh and k. batri, “an efficient numerical integration algorithm for cellular neural network based hole-filler template design”, international journal of computers, communications & control, vol. ii (2007), no. 4, pp. 367-374. v. murugesh department of information and communication engineering hannam university 133 ojung-dong daeduk-gu, daejeon 306-791, republic of korea e-mail: murugesh72@gmail.com k. batri department of computer science and engineering muthayammal engineering college rasipuram 637 408 india e-mail: krishnan.batri@gmail.com received: november 26, 2007 dr. v. murugesh obtained his bachelor of science in computer science and master of computer applications degree from bharathiar university, coimbatore, india during 1992 and 1995 respectively. completed his phd in computer science from bharathidasan university, tiruchirappalli, india during 2006. 
he has held various positions at the national institute of technology, tiruchirappalli, india and sona college of technology, salem, india. currently, he is working as an assistant professor in the department of information and communication engineering at hannam university, daejeon, republic of korea. his fields of interest are neural network based image processing and scientific computing. he has published more than 30 technical papers in international and national journals and conferences.
krishnan batri received the m.e. from madurai kamaraj university in 2003. he is a research scholar with the department of computer science and engineering at the national institute of technology, tiruchirappalli, tamil nadu, india. currently he is working as an assistant professor with the department of computer science and engineering at muthayammal engineering college, rasipuram, tamil nadu, india. his research interests include information retrieval, data fusion and genetic algorithms.
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. v (2010), no. 5, pp. 837-843
modeling gilliland correlation using genetic programming
m. olteanu, n. paraschiv, o. cangea
marius olteanu, nicolae paraschiv, otilia cangea
"petroleum-gas" university of ploiesti
romania, ploiesti, 100680, bucuresti blvd., no. 39
e-mail: {molteanu,nparaschiv,ocangea}@upg-ploiesti.ro
abstract: the distillation process is one of the most important processes in industry, especially petroleum refining. designing a distillation column poses numerous challenges to the engineer, being a complex process that is approached in various studies. an important component, directly affecting the efficient operation of the column, is the reflux ratio, which is correlated with the number of theoretical stages, a correlation developed and studied by gilliland.
the correlation is used in the case of simplified control models of distillation columns and it is a graphical method. however, in many situations there is a need for an analytical form that adequately approximates the experimental data. different analytical forms exist in the literature and are used according to the desired precision. the present article attempts to address this problem by using genetic programming, a branch of evolutionary algorithms belonging to artificial intelligence, a relatively recent technique that has recorded successful applications, especially in process modeling. using an evolutionary paradigm and by evolving a population of solutions or subprograms composed of carefully chosen functions and operators, genetic programming is capable of finding the program or relation that best fits the available data. keywords: gilliland correlation, artificial intelligence, genetic programming.
1 introduction
the early pioneers of computer science, like alan turing, john von neumann and norbert wiener, studied natural systems as guiding metaphors in their desire to understand nature and create intelligent computer programs capable of learning and adapting to their environment. in the 1950s and 1960s, several scientists from germany and the united states independently studied evolutionary systems with the aim of using evolution as an optimization tool for engineering problems. several techniques were created in this period by different research groups: evolution strategies, evolutionary programming and genetic algorithms (see [5]). the basic idea behind all these techniques was to start with a random population of candidate solutions to the specific problem and, by applying a set of genetic operators inspired from the field of genetics, to modify these candidate solutions iteratively so as to achieve a better fitness, or adequacy, of the solution for the engineering problem.
in this article we apply a technique of evolutionary algorithms, that of genetic programming, with the aim of finding an analytic expression for a well studied and widely used correlation, the gilliland correlation, applied to the design of control models for distillation columns. the algorithms for implementing genetic programming are characterized by many heuristic tuning parameters; this paper underlines the most important ones as a result of the simulations.
copyright c⃝ 2006-2010 by ccc publications
2 genetic programming based symbolic regression
genetic programming was introduced by john koza (see [4]) and it can be seen as an extension of genetic algorithms through the increased complexity of the structures used to represent the potential solution to the problem. in his 1992 book (see [4]), john koza suggested that these potential solutions should be represented as trees of functions and operators, dynamic structures of varying size and shape. the classes of problems that can be best approached using this technique are symbolic regression, in which an analytic expression for a function has to be discovered such that a set of experimental data is fitted, and machine learning, a domain that uses a set of possible computer programs that produce the desired behavior in the case of some particular input data. in genetic programming, the set of possible structures is determined by the set of nf functions from f = {f1, f2, ..., fnf}, where each function takes a specified number of arguments denoted a(fi), called its arity, and by the set of nt terminals from t = {t1, t2, ..., tnt}. some examples of functions are: arithmetic operations: plus (+), minus (-), multiply (*), divide (/); mathematical functions: logarithm (log), trigonometric functions sin, cos, etc.; logical functions: and, or, not.
the terminals are variable atoms that represent input variables, signals from sensors / detectors, or constant atoms, for example the number 11.25 or the boolean constant true. an example of a simple function represented by such a tree is given in figure 1, the corresponding analytic expression being: f(x, y) = x + √y − 2.
figure 1: example of a tree representing a possible solution
a particular aspect of the representation of functions and terminals is that of closure, which assumes that each function in the set must be capable of accepting as its input arguments any value and data type that could possibly be returned by any function in the set and any value accepted by any terminal. another property is that of sufficiency, which states that the set of functions and terminals must be capable of defining a solution for the actual problem; the designer of the algorithm is the one who decides which functions and constants are most likely to express a solution best. the adequacy or fitness of a particular member of the population has to be measured for a set of fitness cases; in the case of symbolic regression these are the experimental data. the algorithm for implementing genetic programming has a structure that resembles that of a genetic algorithm:
1. the designer establishes the set of functions / operators and the set of terminals for the specific problem
2. t=0, t being the generation counter
3. a random initial population of solutions p(0) is generated
4. fitness evaluation for all the population
5. a new population is created by applying the genetic operators of cloning, mutation and crossover
6.
if the stop condition is met, then the algorithm stops; else t=t+1 and go to step 4
3 genetic programming applied to the gilliland correlation
the gilliland correlation or gilliland plot (figure 2) correlates the reflux ratio and the number of theoretical stages for a distillation column (see [3]):
x = (r − rmin) / (r + 1), y = (n − nmin) / (n + 1) (3.1)
given that the minimum reflux ratio rmin is calculated from underwood's equation and the minimum number of stages nmin by fenske's method. the resulting curve stretches from the (0,1) coordinate at minimum reflux to (1,0) at total reflux. in the literature (see [3]) there is a series of numerical equations derived for this correlation, but because there is some scatter in the fit of data to the gilliland plot, the expressions that best fit the plot are not always the best reflux-stages correlations. one of the most used expressions is that of molokanov:
y = 1 − exp[ ((1 + 54.4x) / (11 + 117.2x)) · ((x − 1) / √x) ] (3.2)
used for higher precision, alternative relations being that of eduljee:
y = 0.75 (1 − x^0.5668) (3.3)
and rusche (see [2]):
y = 0.1256 · x − 0.8744 · x^0.291 (3.4)
also, other correlations have been obtained as a result of research in the domain of optimal control (see [6], [7] polynomial analytical expression). for implementing the genetic programming algorithm, a free matlab toolbox created by s. silva was used, a toolbox well documented and highly modular, having many configurable parameters (see [8]). a usual run of the algorithm is started by entering the following command at the matlab prompt:
>> [vars, b]=gplab(g,n);
where: g = maximum number of generations as stop condition; n = the number of individuals in the population; vars = a structure containing all the algorithm variables; b = best fitted individual.
figure 2: plot of the original gilliland correlation
the main modules that compose the toolbox have the following functions: variables initializing, initial population and generation creating.
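the tree encoding of candidate expressions described in section 2 (e.g. figure 1's f(x, y) = x + √y − 2) can be sketched as nested tuples with a small recursive evaluator. this is a hypothetical minimal illustration, not gplab's internal representation:

```python
import math

# hypothetical function set; the names are illustrative, not gplab's
FUNCS = {
    "plus": lambda a, b: a + b,
    "minus": lambda a, b: a - b,
    "times": lambda a, b: a * b,
    "sqrt": lambda a: math.sqrt(a),
}

def evaluate(node, env):
    # a tree node is either a terminal (variable name or constant)
    # or a tuple (function_name, child_1, ..., child_arity)
    if isinstance(node, tuple):
        func, *args = node
        return FUNCS[func](*(evaluate(a, env) for a in args))
    if isinstance(node, str):
        return env[node]  # variable terminal
    return node           # constant terminal

# the tree of figure 1: f(x, y) = x + (sqrt(y) - 2)
tree = ("plus", "x", ("minus", ("sqrt", "y"), 2))
print(evaluate(tree, {"x": 1.0, "y": 4.0}))  # 1.0
```

a fitness evaluation then just calls evaluate on each fitness case, which is exactly what makes crossover (swapping subtrees) and mutation (replacing a subtree) cheap to apply.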
among the many important features of the toolbox, the parameters that were especially important in the symbolic regression applied to the gilliland correlation are:
• population initialization has three possible methods: fullinit, growinit and rampedinit; the latter was used in the algorithm and produces an initial population having very diverse trees (a combination of the fullinit and growinit methods, see [4], [8]);
• with the purpose of avoiding function bloating, the toolbox uses a parameter called dynamiclevel that specifies whether the trees' depth or number of nodes has a fixed limit or not. another experimental property, called veryheavy, specifies whether this dynamic limit can be decreased under the initial value during the run in case the best individual has a smaller depth or number of nodes. by using this option, a much simpler expression for the function has been obtained, and the running time of the algorithm and the memory resources required were also substantially reduced;
• the methods available for selecting the most adequate individuals are the classical roulette and tournament methods; in addition, other methods are implemented, like lexictour or doubletour, which choose taking into account the shortest individual (having the smaller depth or number of nodes). the best results were achieved using the lexictour method.
the crossover operator (figure 3) randomly chooses nodes from both parents and swaps the respective branches, creating one or two offspring; in the case of mutation, a random node is chosen from the parent individual and substituted by a new random tree, taking into account the imposed restrictions on the depth and number of nodes (see [1]). a set of six functions has been chosen, together with two random constants with values between 0 and 1. the functions were: plus, minus, times, custom divide (having protection against divide-by-zero errors), custom square root and custom natural logarithm.
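the "custom" (protected) operators mentioned above guard against invalid arguments. the exact conventions of gplab's mydivide, mysqrt and mylog are not specified in this excerpt; the sketch below uses one common genetic-programming convention (koza-style protected division returning 1 on a zero denominator, with square root and logarithm applied to absolute values), stated here as an assumption:

```python
import math

def pdivide(a, b):
    # protected division: one common gp convention returns 1 when the
    # denominator is zero (the actual mydivide behavior may differ)
    return a / b if b != 0 else 1.0

def psqrt(a):
    # protected square root: applied to the absolute value of the argument
    return math.sqrt(abs(a))

def plog(a):
    # protected natural logarithm: |a| for negatives, 0 for a zero argument
    return math.log(abs(a)) if a != 0 else 0.0

print(pdivide(6.0, 3.0))  # 2.0
print(pdivide(5.0, 0.0))  # 1.0
print(psqrt(-9.0))        # 3.0
```

protection matters because crossover and mutation freely recombine subtrees, so any operator can receive any value produced by any other; this is precisely the closure property discussed in section 2.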
figure 3: example of applying the crossover operator
the algorithm ran on a computer with an intel core 2 duo processor, having 2gb of memory and matlab 7.4 for windows xp. for a population with 1000 members and a number of 20 generations, the running time was in the range of 4 minutes to 4.5 minutes. if the number of generations is increased too much, it always results in a very large expression for the final function, making it very hard to implement and study, with only a very slow increase in performance (fitness) being obtained. adding the two final points (0,1) and (1,0) to the data set leads to a function that does not approximate the middle data points well, having poor general performance because of the relatively big scatter of the terminal points of the plot, so for most of the simulations the two ending points were not included. the fitness function used calculates the sum of the absolute differences between the desired output values and the values computed by the individual on all fitness cases:
fitness = ∑_{i=1}^{n} |y_i − f(x_i)| (3.5)
where n is the number of fitness cases, y_i the desired output and f(x_i) is the value returned by the individual. with a generation number of 40 and a population of 500 individuals, for a running time of 169 seconds, the expression obtained is presented in the following tree plot (figure 4):
figure 4: the tree representation of the final solution
another common representation is the string representation, used in matlab to represent the function:
f=plus(times(minus(mysqrt(mydivide(0.96486,0.56835)),times(plus(0.96486,0.33765),mysqrt(x1))),mylog(mysqrt(mydivide(0.8073,0.22837)))),mysqrt(plus(times(0.33765,times(x1,times(plus(mysqrt(0.8073),0.56835),mysqrt(x1)))),mylog(minus(mysqrt(mydivide(0.9393,0.56835)),times(plus(mysqrt(0.8073),0.56835),mysqrt(x1)))))))
from which we can write the following simplified relation:
f = 0.631 (1.30294 − 1.30251 √x) + √( |0.495 · x √x + ln(|1.285 − 1.467 √x|)| ) (3.6)
figure 5: plot of the final solution and other correlations (eduljee, rusche, genetic programming) together with the original data
from the plot of the classic correlations (figure 5) and that of the genetic programming function, we can conclude that a very good approximation is obtained, challenging the methods used in the domain of identification.
4 conclusions and future works
the study presented in this article aims at applying a relatively recent and less explored technique of artificial intelligence, that of genetic programming, to the problem of finding the function that best fits some data. the well known gilliland correlation, applied in the domain of process control of the distillation process, has been studied. using the classic representation of the potential solutions as trees, many interesting results have been obtained. after running the algorithm with varying parameters for the initial population, the method of selecting the best individuals, the method of creating the new offspring, the genetic operators of crossover and mutation, and finally the number of generations and of individuals in the population, it can be stated that the technique of genetic programming has a promising future potential, proved by the good estimation of the experimental data.
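the classical correlations (3.3)-(3.4) and the simplified evolved relation (3.6) can be evaluated side by side with a short sketch; note that the operator grouping in the evolved expression is transcribed from the printed formula (3.6) and the extraction may not preserve it perfectly, so treat that function as illustrative:

```python
import math

def eduljee(x):
    # eq. (3.3): y = 0.75 * (1 - x^0.5668)
    return 0.75 * (1.0 - x ** 0.5668)

def rusche(x):
    # eq. (3.4) as printed: y = 0.1256*x - 0.8744*x^0.291
    return 0.1256 * x - 0.8744 * x ** 0.291

def gp_evolved(x):
    # eq. (3.6) as printed; the grouping is transcribed from the text
    s = math.sqrt(x)
    inner = 0.495 * x * s + math.log(abs(1.285 - 1.467 * s))
    return 0.631 * (1.30294 - 1.30251 * s) + math.sqrt(abs(inner))

# tabulate the three correlations over the interior of the gilliland plot
for x in (0.1, 0.3, 0.5, 0.7):
    print(x, eduljee(x), rusche(x), gp_evolved(x))
```

eduljee's relation respects the two limiting points of the plot, y(0) = 0.75(1 − 0) = 0.75 and y(1) = 0, which is one reason the terminal points were handled separately in the fitting experiments described above.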
one aspect that proved to be important is the careful choice of the running parameters, which directly influence the quality of the solution.
bibliography
[1] m. affenzeller, s. winkler, s. wagner, a. beham, genetic algorithms and genetic programming: modern concepts and practical applications, crc press, 2009.
[2] j.r. couper, w.r. penney, j.r. fair, s.m. wallas, chemical process equipment, second edition, elsevier gulf professional publishing, 2005.
[3] h.z. kister, distillation design, mcgraw-hill, 1992.
[4] j.r. koza, genetic programming: on the programming of computers by means of natural selection, mit press, 1992.
[5] m. mitchell, an introduction to genetic algorithms, mit press, 1996.
[6] n. paraschiv, equipment and programs for optimal control of fractionation processes, phd thesis, "petroleum-gas" university of ploiesti, ploiesti, 1987.
[7] n. paraschiv, an analytical form of the gilliland graphical correlation for the advanced control of fractionation processes, chemistry magazine no. 7-8, 1990.
[8] s. silva, gplab - a genetic programming toolbox for matlab, user manual, ecos evolutionary and complex systems group, university of coimbra, portugal, 2007, accessible from http://gplab.sourceforge.net/index.html.
international journal of computers, communications & control vol. i (2006), no. 1, pp. 5-6
editorial
florin gheorghe filip, ioan dziţac, mişu-jan manolescu
welcome to the international journal of computers, communications & control (ijccc), a new scientific journal in computer science.
1 why ijccc?
nowadays there are many prestigious journals in the field of computer science, so it is only natural to ask the question: "is there any reason for a new journal in this area?" there is a saying: "in the world of computers the only thing that stays the same is change". we live in the most dynamic era of creation and circulation of information. the transition from the information society to the knowledge society is accompanied by the globalization of information.
more and more specialists and researchers from institutes, universities and industry all over the world write scientific papers in this field. these papers must be put into the scientific circuit as fast as possible, and the existing journals cannot publish the increased number of papers. the increased interest of participants from many countries in the international conference on computers and communications, initiated and organized by ioan dziţac in 2004 (see note 1 below; with constantin popescu and horea oros, http://iccc.rdsor.ro), under the guidance of acad. florin-gheorghe filip, and continued in 2006 (see note 2 below) under the name of international conference on computers, communications & control (www.iccc.univagora.ro) at agora university of oradea (president mişu-jan manolescu), gave us the courage and confidence to start this new journal, entitled international journal of computers, communications & control (ijccc). the following professors and specialists in computer science have subscribed to this new editorial project as members of the editorial board:
• pierre borne, ecole centrale de lille, france;
• antonio di nola, university of salerno, italy;
• ioan dziţac, agora university, oradea, romania (executive editor);
• ömer egecioglu, university of california, santa barbara, usa;
• florin gheorghe filip, romanian academy, romania (editor in chief);
• constantin gaindric, institute of mathematics and computer science of the academy of sciences of the republic of moldova;
• kaoru hirota, tokyo institute of technology, japan;
• mişu-jan manolescu, agora university, oradea, romania (managing editor);
• george metakides, university of patras, greece;
• shimon y nof, purdue university, usa;
• gheorghe păun, imar, romanian academy, romania;
• mario j. perez-jimenez, university of seville, spain;
• imre j. rudas, budapest tech, hungary;
• athanasios d. styliadis, alexander institute of technology, thessaloniki, greece;
• gheorghe tecuci, george mason university, usa;
• horia-nicolai teodorescu, institute of computer science, romanian academy, iasi, romania;
• dan tufiş, racai, romanian academy, romania.
note 1: in 2004 we received 112 papers for review (only 67 accepted) from authors from australia, austria, china, egypt, finland, france, germany, greece, india, ireland, italy, japan, republic of moldova, romania, spain, united kingdom and usa.
note 2: in 2006 we received 142 papers for review (only 91 accepted) from authors from the previous countries and from other countries: algeria, hungary, iran, serbia & montenegro, tunisia and thailand.
copyright c© 2006 by ccc publications
2 about ijccc
the international journal of computers, communications & control (ijccc) has been published since 2006 and has 4 issues per year, edited by ccc publications, powered by agora university editing house, oradea, romania (www.journal.univagora.ro). every issue is published in online format (issn 1841-9844) and print format (issn 1841-9836). we offer free online access to the full content of the journal. the printed version of the journal should be ordered by subscription and will be delivered by regular mail. ijccc is directed to the international communities of scientific researchers from universities, research units and industry. ijccc publishes original and recent scientific contributions in the following fields:
• computing & computational mathematics;
• information technology & communications;
• computer-based control.
to differentiate itself from other similar journals, the editorial policy of ijccc especially encourages the publishing of scientific papers that focus on the convergence of the 3 "c" (computing, communication, control). the articles submitted to ijccc must be original and previously unpublished in other journals. the submissions will be reviewed independently by two reviewers.
ijccc also publishes:
• papers dedicated to the works and life of some remarkable personalities;
• reviews of some recent important published books.
also, ijccc will publish as supplementary issues the proceedings of some international conferences or symposia on computers, communications and control, scientific events that have reviewers and a program committee. the authors are kindly asked to observe the rules for typesetting and submitting described in the instructions for authors, which are to be found at the end of the journal and on the journal's website www.journal.univagora.ro. there are no fees for processing and publishing articles. the authors of the published articles will receive a hard copy of the journal. topics of interest include, but are not limited to, the following: applications of information systems; artificial intelligence; automata and formal languages; collaborative working environments; computational mathematics; cryptography and security; e-activities; fuzzy systems; informatics in control; information society - knowledge society; natural computing; network design & internet services; multimedia & communications; parallel and distributed computing.
florin-gheorghe filip
editor in chief of ijccc
romanian academy, 125, calea victoriei, 71102 bucharest-1, romania
e-mail: ffilip@acad.ro
ioan dziţac
executive editor of ijccc
agora university, 8, p-ta tineretului, 410526 oradea, romania
e-mail: idzitac@univagora.ro
mişu-jan manolescu
managing editor of ijccc
agora university, 8, p-ta tineretului, 410526 oradea, romania
e-mail: rectorat@univagora.ro
int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844, vol. iii (2008), no. 2, pp. 183-195
comdevalco framework - the modeling language for the procedural paradigm
bazil pârv, ioan lazăr, simona motogna
abstract: this work is part of a series referring to comdevalco, a framework for software component definition, validation, and composition.
its constituents are: a modeling language, a component repository and a set of tools. this is the first paper describing the object-oriented modeling language, which contains fine-grained constructs aimed at giving a precise description of software components. the current status of the language reflects the constructs of the procedural paradigm. keywords: software/program verification, validation, modeling methodologies, computer-aided software engineering, flow charts, object-oriented design methods, testing tools.
1 introduction
two main forces drive software development today: complexity and change. the software development community looks for methods and practices to deal with these challenges. complexity in software development [6] is the same as in mathematics: it concerns problem-solving, and the usual way of handling it is reduction by reorganization. brooks [2] makes a distinction between essential and accidental complexity. essential complexity belongs to the problem to be solved and cannot be reduced or eliminated. accidental complexity belongs to its solution, being created when fine-grained, general-purpose abstractions are used to directly implement coarse-grained, problem-specific concepts. it can be reduced or even eliminated by increasing the level of abstraction, i.e. by using more coarse-grained, problem-specific concepts (classes). the other challenge in software development is change management: how to build software systems able to change. software needs to change in response to changes in its operational environment and in its requirements. sources of change lie both in the problem domain (invalidating existing requirements, adding new ones) and in the solution domain, due to technological evolution. both challenges can be addressed in a disciplined manner using models, which increase the level of abstraction and allow for the automation of the software development process.
model-driven development (mdd) processes propose the creation of extensive models before the source code is written. an example of the mdd approach is the object management group's model driven architecture [12] standard. agile versions of mdd approaches have been defined in order to deal better with change management. agile mdd processes [1] create agile models (models "just barely good enough") that drive the overall development efforts, instead of creating extensive models before writing source code. another agile approach, agile mda [8], emphasizes complete executable models [9]. our work follows the idea of executable models, proposing an object-based modeling language that fits the procedural paradigm and allows the construction and execution of models. the software components considered in our approach to the procedural paradigm are program (the only executable one), procedure, and function, and the software development process is component-based and model-driven. the modeling language constructs allow both the precise definition of these components (called program units) and their verification and validation (v & v), by simulating their execution. once these components pass the v & v process, they can be stored in a component repository, ready for later use in the development process. the structure of this paper is as follows. after this introductory section, the next one discusses the current status, problems, and ideal properties of a modeling language. the third section presents the proposed modeling language, starting with low-level constructs, followed by statements, and finally program units. the current status of our work is described in the fourth section, while the last one contains some conclusions and plans for further work.
copyright © 2006-2008 by ccc publications
2 modeling process: current status, problems, and desired features
2.1 modeling languages
it is generally recognized that the use of models raises the level of abstraction and favors the automation of the software development process. unfortunately, as greenfield and short stated in [6], the largest impediment to achieving these tasks was "the promulgation of imprecise general-purpose modeling languages as both de facto and de jure standards", namely uml. martin fowler [5] identifies three different uml goals: informal design (sketching), model-driven development (blueprinting), and direct execution (model interpretation), noticing a similar and independent opinion expressed by steve mellor [8]. the conclusion is that uml succeeded in the first goal and failed in the others; the reasons for this failure, as discussed in [6], are: (1) lack of precision, (2) poor support for component-based development, and (3) weak extensibility mechanisms. uml 2 and its action semantics [15] provide the foundation to construct executable models, but the standardization efforts for defining a subset of actions sufficient for computational completeness are still in progress [14]. in order to make uml a computationally complete specification language, some tools [3, 17, 18, 11] have defined non-standard subsets of actions. other issues related to uml 2 refer to the graphical and textual notations. the current version of uml does not define graphical notations for easy manipulation of uml elements. moreover, there are uml elements (e.g. uml structured activity nodes) without a proposed graphical notation, and textual notations for behavioral elements are still in the process of standardization [14].
2.2 component-based development and the modeling process
the process of component-based software development (or cbd for short) has two more or less independent sub-processes: the component development process and the system development process.
naturally, the requirements concerning the components are derived from the system requirements; the absence of a relationship, such as a causal one, may produce severe difficulties in both sub-processes mentioned above. system construction by assembling software components [4] has several steps: component specification, component evaluation, component testing, and component integration. the system development sub-process focuses on identifying reusable entities and selecting the components fulfilling the requirements, while in the component development sub-process the emphasis is on component reuse: from the beginning, components are designed as reusable entities. a component's degree of reuse depends on its generality, while the ease of identification, understanding, and use is affected by the component specification. the sole communication channel with the environment is the component's interface(s). in other words, the client components of a component can only rely on the contracts specified in the interfaces implemented by the component. thus, it is obvious that component development must be interface-driven. in our opinion, the main cbd challenge is to provide a general, flexible and extensible model for both components and software systems. this model should be language-independent, as well as programming-paradigm independent, allowing reuse at the design level. the design process of a component-based system [7] follows the same steps as in the classical methods: the design of the architecture, which depicts the structure of the system (which are its parts), and the design of behavior (how these parts interact in order to fulfill the requirements). the structural description establishes component interconnections, while the behavioral description states the ways in which each component uses the services provided by interconnected components in order to fulfill its tasks.
ComDeValCo Framework - The Modeling Language for Procedural Paradigm 185

2.3 Ideal properties of a modeling language

Our discussion here follows the general uses of a modeling language identified by Martin Fowler [5] and Steve Mellor [8]: informal design, model-driven development, and direct execution. In order to fulfill these goals, a modeling language should have: (1) a good degree of precision, (2) good support for CBD, (3) good support for agile MDA processes, and (4) good extensibility mechanisms. In order to be precise, a modeling language needs fine-grained constructs, which allow both the complete definition of computing processes and the simulation of their execution. The language elements should cover low-level constructs referring to data types, expressions, program state, and behavior (body of statements). As stated above, in order to offer good support for CBD, a modeling language needs to build general, flexible, and extensible models for both components and software systems. The resulting models should be both language-independent and programming-paradigm-independent. In order to offer good support for agile MDA processes, a modeling language should provide a metamodel, together with graphical and textual notations for easy manipulation of language constructs. The metamodel should satisfy two important properties: computability and completeness. The extensibility of a modeling language means the extensibility of its set of constructs, such as data types, expressions, statements, and program units.

3 A Modeling Language Proposal: the ComDeValCo Solution

This section discusses in more detail our proposal of a modeling language, part of a framework for component definition, validation, and composition. The proposed solution is ComDeValCo, a conceptual framework for software component definition, validation, and composition.
Its constituents are meant to cover both CBD-related sub-processes described in Section 2.2: component development and component-based system development. The sub-process of component development starts with the component's definition, using an object-oriented modeling language and graphical tools. The modeling language provides the necessary precision and consistency, and the use of graphical tools simplifies the developer's work; the developer does not need to know the notations of the modeling language. Once defined, component models are passed to a V&V (verification and validation) process, which is intended to check their correctness and to evaluate their performance. When a component passes the V&V step, it is stored in a component repository for later (re)use. The sub-process of component-based system development takes the components already stored in the repository and uses graphical tools intended to: select components fulfilling a specific requirement, perform consistency checks regarding component assembly, and include a component in the already existing architecture of the target system. When the assembly process is completed and the target system is built, other tools will perform V&V, as well as performance evaluation operations, on it. The constituents of the conceptual framework are: the modeling language, the component repository, and the toolset. Any model of a software component is described as a compound object, using the elements of the object-based modeling language. The component repository represents the persistent part of the framework; its goal is to store and retrieve valid component models. The toolset is aimed at helping developers to define, check, and validate software components and systems, as well as to provide maintenance operations for the component repository. Starting in a top-down manner, the program units considered are Program (the only executable unit) and the proper software components specific to the procedural paradigm: Procedure and Function (see Figure 1).
Each of these software components has a name, a state, and a body; the state is given by all variables local to the component, and the body is, generally speaking, a statement. According to the imperative paradigm, program execution is seen as a sequence of state changes, i.e. the execution of a statement changes the value of a variable, usually by evaluating an expression.

186 Bazil Pârv, Ioan Lazăr, Simona Motogna

Figure 1: ComDeValCo modeling language - main constructs in the procedural paradigm

We describe below, in a bottom-up manner, the language elements as they are considered so far.

3.1 Low-level constructs

The basic language constructs are type and declaration. The Type class abstracts the concept of data type, while a Declaration object is used to associate a specific Type object with a name (identifier). This corresponds to explicit variable declaration in imperative programming languages. The next important concept is value. Each Value object encapsulates a value of a specific type. Values are fundamental in our model, because a variable represents an alternate name for a value stored in memory, a function returns a value, and, more generally, the process of evaluating an expression returns a value. With these facts in mind, we designed the Expression class hierarchy shown in Figure 2. The root of the hierarchy, Expression, is abstract and has a single abstract operation, getValue(), overridden by subclasses and returning a Value result. The concrete specializations of Expression are: Value, Variable, BinaryExpression, UnaryExpression, and DefinedFunction.

Figure 2: Expression class hierarchy

Value is the simplest Expression descendant, its instances corresponding to typed literals (constants). Its proper operations are: getDataType(), which returns the data type of the value stored in the object, getValueString(), returning the value as a string, and setValue().
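The shape of this hierarchy can be sketched in Java as follows. This is a minimal illustration only: getValue() is simplified to return a double instead of a typed Value object, and UnaryExpression, DefinedFunction, and the Operator class are omitted; none of this is the actual ComDeValCo implementation.

```java
// Minimal sketch of the Expression hierarchy (illustrative only).
abstract class Expression {
    abstract double getValue(); // simplified: a double instead of a Value object
}

// A typed literal (constant).
class Value extends Expression {
    private final double v;
    Value(double v) { this.v = v; }
    double getValue() { return v; }
}

// A named, mutable value; also usable wherever an Expression is expected.
class Variable extends Expression {
    private final String name;
    private double v;
    Variable(String name, double v) { this.name = name; this.v = v; }
    String getName() { return name; }
    void setValue(double v) { this.v = v; }
    double getValue() { return v; }
}

// An operator applied to two sub-expressions; evaluation is recursive.
class BinaryExpression extends Expression {
    private final Expression left, right;
    private final char op; // '+', '-', '*', '/'
    BinaryExpression(Expression l, Expression r, char op) {
        left = l; right = r; this.op = op;
    }
    double getValue() {
        double a = left.getValue(), b = right.getValue();
        switch (op) {
            case '+': return a + b;
            case '-': return a - b;
            case '*': return a * b;
            default:  return a / b;
        }
    }
}
```

For example, the expression -b/a used later in the EcGr1 component would be built as a BinaryExpression dividing (b * -1) by a, and evaluated with a single getValue() call on the root.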
Variable is probably one of the most important subclasses of Expression, having multiple uses in our model: (1) a variable is a simple expression; (2) all Statement objects deal with Variable instances; and (3) the state of a program unit is seen as a collection of Variable objects. According to the general definition, each Variable object has a name (identifier) and a value (a Value object). Its own operations are: getName(), getDataType(), and setValue(). The specific expression classes considered so far are BinaryExpression and UnaryExpression, having an operator and two (respectively one) Expression operands. The extensible Operator class implements evaluate() operations for all operand types, called by the getValue() code in BinaryExpression and UnaryExpression. A DefinedFunction object corresponds to a one-argument function call. Its instance variables are the name of the function and its actual parameter, an Expression.

3.2 Statements

The Statement class hierarchy employs the Composite design pattern, with subclasses SimpleStatement and CompoundStatement. A CompoundStatement object contains a list of Statement objects; both concrete simple and compound statement objects are treated uniformly. As Figure 3 shows, the Statement class is abstract and represents the root of all simple and compound statement classes. Its single abstract operation is execute(), which usually produces a state change. Statement's concrete subclasses implement this operation accordingly.

Figure 3: Statement class hierarchy

SimpleStatement subclasses cover all important control statements in any imperative programming language: AssignmentStatement, CallStatement, InputStatement, OutputStatement, LoopStatement, and BranchStatement. An AssignmentStatement object takes an expression and a variable; after its execution, the variable takes the value of that expression.
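The Composite pattern described above can be sketched as follows; here the program state is reduced to a plain map and the assignment's right-hand side to a constant, purely for illustration (the real model holds an Expression there).

```java
// Sketch of the Composite pattern used by the Statement hierarchy
// (illustrative; state is simplified to a Map<String, Integer>).
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

abstract class Statement {
    abstract void execute(Map<String, Integer> state); // usually changes the state
}

class AssignmentStatement extends Statement {
    private final String var;
    private final int value; // the real model evaluates an Expression here
    AssignmentStatement(String var, int value) { this.var = var; this.value = value; }
    void execute(Map<String, Integer> state) { state.put(var, value); }
}

class CompoundStatement extends Statement {
    private final List<Statement> body = new ArrayList<>();
    void add(Statement s) { body.add(s); }
    void execute(Map<String, Integer> state) {
        // simple and compound statements are treated uniformly
        for (Statement s : body) s.execute(state);
    }
}
```

The payoff of the pattern is visible in the loop: a CompoundStatement never needs to know whether an element of its body is simple or itself compound.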
InputStatement takes only a variable, being considered a special kind of assignment: its execution means reading a value from standard input, evaluating it, and assigning it to the considered variable. The execution of an OutputStatement writes the value of its variable to standard output. A CallStatement object corresponds to a procedure call. Its instance variables are the procedure being called (the callee object) and the actual (call) parameters, whose values belong to the caller object. According to the definition of a Procedure object (see the next subsection), three different parameter lists are needed, corresponding to in-, in-out, and out parameters. The execution of this statement has five steps: (1) extracting the values of in- and in-out parameters from the caller state; (2) building the callee state; (3) running the callee object; (4) extracting the values of in-out and out parameters from the callee state; and (5) updating the state of the caller object with these values. All loop statements repeatedly execute their body, a Statement object. Three different loop statements were considered in our design, as subclasses of the abstract LoopStatement: ForStatement, RepeatStatement, and WhileStatement. In the case of a ForStatement object, the number of iterations is known a priori, and its execution uses a counter variable, with values ranging from a lower to an upper bound. WhileStatement and RepeatStatement objects use a test expression to continue the iterative process. Their execution differs by the test strategy, i.e. evaluate the test then execute the body (while), respectively execute the body then evaluate the test (repeat). BranchStatement objects correspond to if-then-else constructs. The condition to be checked is an Expression object, and both branches are Statement objects. Its execution evaluates the test expression and then, based on its value, executes the corresponding statement.
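The difference between the two test strategies can be made concrete with a small sketch; the method names are made up for illustration and simply count how many times each loop style runs its body.

```java
// Sketch of the two loop test strategies: while evaluates the test before
// the body, repeat after it, so a repeat body always runs at least once.
import java.util.function.IntPredicate;

final class LoopDemo {
    // while-style: test, then body
    static int whileLoop(int start, IntPredicate continueTest) {
        int i = start, iterations = 0;
        while (continueTest.test(i)) { i++; iterations++; }
        return iterations;
    }

    // repeat-style: body, then test (exit when the test becomes true)
    static int repeatLoop(int start, IntPredicate exitTest) {
        int i = start, iterations = 0;
        do { i++; iterations++; } while (!exitTest.test(i));
        return iterations;
    }
}
```

Started at a point where the condition already fails, whileLoop runs its body zero times, while repeatLoop still runs it once; this is exactly the semantic distinction a WhileStatement/RepeatStatement model must preserve.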
3.3 Program units

As already stated, the program units considered so far are Program, Procedure, and Function (see Figure 4), specific to the procedural paradigm. Program is the only executable software component, having a name, a state, and a body; the state is made up of all variables local to the component, and the body is a Statement object. The only operation of a Program object is its run() method, implemented by a call to the execute() method of its body.

Figure 4: Program units class hierarchy

The proper software components are Procedure and Function. As in the imperative programming world, a procedure declaration must define its name, formal parameters, local state, and procedure body. Consequently, it is natural to consider Procedure as a Program subclass; the proper instance variables of the Procedure class are the lists of in-, in-out, and out parameters, needed for a complete implementation of the execute() method of the CallStatement class, discussed in Section 3.2. The concept of a user-defined function in imperative programming languages considers it free of side effects, i.e. the only result of its execution is the value it produces, without affecting the caller's state. With this in mind, we designed the Function class accordingly, i.e. it has only input (in) formal parameters and produces a Value object as its result.

4 Experimental results

From a methodological viewpoint, our main concern was to model all theoretical concepts as concrete objects, constructs of the modeling language. The idea was to apply an iterative and incremental process: start with simple objects and perform checks after each modeling step, in order to be sure that things work. Each modeling step covers both theoretical/analytical activities (the abstract model of the concept) and practical/applicative ones (coding, testing, and integrating it into the already existing language).
The initial step of our work was to prove that things are working well. So we first conducted a proof-of-concept study, and then we started the development of graphical tools.

4.1 Modeling language: proof-of-concept stage

The modeling language described in Section 3 was implemented in Java. The classes containing the current implementation are included in three packages: syntax (the Expression classes, Declaration, DeclarationList, Operator, State, and Type), statements (all classes in Figure 3), and programunits (Program, Procedure, and Function). The implementations were tested by building some simple components, such as solving polynomial equations of the first and second degree, or computing the integer square root (isqrt()) of a positive integer. As discussed below, in order to test a proper component P, two program units need to be designed: the component P itself and the corresponding test driver (a Program component). In each situation, the building process has two main steps: (1) build the state of the component and (2) build its body. As the state is a set of Variable objects, its creation is a sequence of allocate() messages. Additionally, in the case of Procedure and Function components, the parameter lists also need to be defined, in the same way. Next, the body of a component is a CompoundStatement, so the building process needs to create all the Statement objects which describe the computing process and to include them in the body, preserving the sequence of computing steps. Consider the simplest example: designing a component EcGr1, which solves polynomial equations of the first degree, and its corresponding test program.
The building process of EcGr1 is defined in the following static method:

    public static Procedure buildEcGr1() {
        DeclarationList inP = new DeclarationList();     // in params
        DeclarationList outP = new DeclarationList();    // out params
        DeclarationList inOutP = new DeclarationList();  // in-out params
        DeclarationList locale = new DeclarationList();  // local state
        // input parameters
        inP.allocate("a", Value.tDouble);
        inP.allocate("b", Value.tDouble);
        // output parameters
        outP.allocate("cod", Value.tInt);
        outP.allocate("x", Value.tDouble);
        CompoundStatement body = new CompoundStatement("solves the equation a x + b = 0");
        Procedure proc = new Procedure("proc EcGr1", locale, body, inP, outP, inOutP);
        // body: BranchStatement(s1, s2) (a == 0)
        Statement s11 = new AssignmentStatement("cod = 2", // infinite solution set
            proc.getLocalState().get("cod"), new Value(Value.tInt, "2"));
        Statement s12 = new AssignmentStatement("cod = 1", // no solution
            proc.getLocalState().get("cod"), new Value(Value.tInt, "1"));
        Expression e1 = proc.getLocalState().get("b");
        Expression e2 = new Value(Value.tDouble, "0");
        Expression e = new BinaryExpression(e1, e2, new Operator(Operator.EQ));
        // s1: BranchStatement(s11, s12) (b == 0)
        BranchStatement s1 = new BranchStatement("b == 0", e, s11, s12);
        CompoundStatement s2 = new CompoundStatement("unique solution");
        s2.add(new AssignmentStatement("cod = 0",
            proc.getLocalState().get("cod"), new Value(Value.tInt, "0")));
        e1 = proc.getLocalState().get("b");
        e2 = new Value(Value.tInt, "-1");
        Expression e3 = new BinaryExpression(e1, e2, new Operator(Operator.TIMES)); // -b
        e2 = proc.getLocalState().get("a");
        e = new BinaryExpression(e3, e2, new Operator(Operator.DIV)); // -b/a
        s2.add(new AssignmentStatement("x = -b/a", proc.getLocalState().get("x"), e));
        e1 = proc.getLocalState().get("a");
        e2 = new Value(Value.tInt, "0");
        e = new BinaryExpression(e1, e2, new Operator(Operator.EQ));
        body.add(new BranchStatement("a == 0", e, s1, s2));
        return proc;
    }

The method builds a Procedure object which implements the well-known algorithm for solving first-degree polynomial equations (its body being a BranchStatement object) and returns it when the building process is done. This object has two in parameters (a and b) and two out parameters (x and cod). The test driver Program object is built as follows:

    public static Program buildProgEcGr1() {
        DeclarationList state = new DeclarationList();
        CompoundStatement body = new CompoundStatement("test driver for EcGr1");
        Program prog = new Program("DemoEcGr1", state, body);
        state.allocate("ca", Value.tDouble);   // coefficient a
        state.allocate("cb", Value.tDouble);   // coefficient b
        state.allocate("rez", Value.tInt);     // return code
        state.allocate("sol", Value.tDouble);  // the solution
        // resets state
        prog.setState(state);
        // start program
        body.add(new OutputStatement("*** Program " + prog.getName() + " started ***", null));
        // read coeffs
        body.add(new InputStatement("read coeff ca", prog.getState().get("ca")));
        body.add(new InputStatement("read coeff cb", prog.getState().get("cb")));
        // calls EcGr1
        DeclarationList pIn = new DeclarationList();
        pIn.allocate(prog.getState().get("ca"));
        pIn.allocate(prog.getState().get("cb"));
        DeclarationList pOut = new DeclarationList();
        pOut.allocate(prog.getState().get("rez"));
        pOut.allocate(prog.getState().get("sol"));
        DeclarationList pInOut = new DeclarationList();
        Procedure ecGr1 = buildEcGr1(); // create the EcGr1 procedure object
        SimpleStatement s = new CallStatement("call EcGr1", ecGr1, pIn, pInOut, pOut);
        body.add(s);
        // print results
        Statement s11 = new OutputStatement("unique solution", null);
        Statement s12 = new OutputStatement("print solution", prog.getState().get("sol"));
        CompoundStatement s1 = new CompoundStatement("print unique solution");
        s1.add(s11);
        s1.add(s12);
        Expression e1 = prog.getState().get("rez");
        Expression e2 = new Value(Value.tInt, "1");
        Expression e = new BinaryExpression(e1, e2, new Operator(Operator.EQ));
        s11 = new OutputStatement("empty solution set", null);    // cod = 1 (a=0, b<>0)
        s12 = new OutputStatement("infinite solution set", null); // cod = 2 (a=0, b=0)
        BranchStatement s2 = new BranchStatement("rez == 1", e, s11, s12);
        e2 = new Value(Value.tInt, "0");
        e = new BinaryExpression(e1, e2, new Operator(Operator.EQ));
        BranchStatement st = new BranchStatement("rez == 0", e, s1, s2);
        body.add(st);
        body.add(new OutputStatement("*** Program " + prog.getName() + " terminated ***", null));
        return prog;
    }

This time, the body of the constructed Program object is a sequence of statements which: (1) read the coefficients (using a sequence of InputStatement objects), (2) call the EcGr1 component (using a CallStatement object), and (3) print the result (using BranchStatement objects in which the condition tests the value of the out parameter rez, while the branches are OutputStatement objects). The above code also contains statements which create and build the state of the Program object and prepare the call process, i.e. create and populate the actual in, out, and in-out parameter lists and then create the callee Procedure object ecGr1 (by invoking the buildEcGr1() method). The main() method of the demo class first creates the driver Program object and then calls its run() method:

    public static void main(String[] args) {
        TextIO.putln("Demo program units");
        Program pEcGr1 = buildProgEcGr1();
        pEcGr1.run();
    }

The Program object runs its body by executing the statements in it sequentially: (1) it asks the user to enter the values of the coefficients, (2) calls the EcGr1 procedure, and (3) prints an explanatory message and the unique solution (if this is the case). The above-described component definition approach is tedious and error-prone.
For example, in order to build a BranchStatement object, the process is bottom-up: (1) create the Statement objects corresponding to the branches and (2) create the BranchStatement object containing them. In a real-world situation, the building process is assisted by graphical tools, as we discuss below. These tools will perform at least the following: (1) graphical or textual building of components, (2) saving and restoring component definitions to/from a component repository, and (3) component testing and debugging.

4.2 The toolset

The ComDeValCo toolset proposal includes graphical tools for component definition, validation, and composition. This subsection describes the current status of these tools. As part of the ComDeValCo framework, a procedural action language (PAL) was defined; it is described in [10]. PAL contains all statements included in Figure 3, has a concrete textual syntax for UML structured activities, and graphical notations for some UML structured activity actions. The main idea for simplifying the construction of UML structured activities is to use the pull data flow for expression trees. We also propose new graphical notations for conditionals and loops, following the classical flowchart style. In order to allow the exchange of executable models with other tools, a UML profile is also defined, specifying the mapping between PAL and UML constructs. A component definition and validation tool is under development, using both graphical and textual PAL notations for building Program, Procedure, and Function program units. This tool is used within an agile MDA process which includes test-first component design steps: (1) add a test (in the form of a Program component calling a non-existing procedure or function), (2) run the tests (in order to report the missing components), (3) add the production code (i.e. design the missing components), and (4) go to step (2).
The process ends during step (2), when all the tests pass. In steps (1) and (3), developers are allowed to use either the graphical notation or the concrete syntax of PAL; the tool automatically maintains the consistency of the two views. The debugging and testing techniques employed in step (2) are defined according to the model-level testing and debugging specification [14, 13]. A detailed description of this tool will be given in a separate paper.

4.3 Original elements of the proposed solution

The proposed solution brings original elements in at least the following directions:
• The object model is precise and fine-grained, because all objects are rigorously defined and the component behavior is described at statement level. The UML metamodel has no correspondent for modeling constructs more fine-grained than program and procedure;
• The models are executable and verifiable, because each component can be executed; moreover, one can use tools for validation and for the evaluation of complexity;
• The models are independent of any specific object-oriented language;
• The modeling language is flexible and extensible in the following dimensions: the statement set, the component (program units) family, the component definition, the data type definition, and the set of components;
• The modeling language allows the use of graphical tools in all the phases: building, validating, and using software component models;
• The modeling language allows automatic code generation for components in a concrete programming language, according to Model Driven Architecture (MDA) specifications. One can define mappings from the modeling elements to specific constructs in a concrete programming language in a declarative way.

5 Conclusions and further work

This paper describes the current status of the modeling language, part of the ComDeValCo framework.
As discussed above, this version implements a minimal set of elements, corresponding to the procedural programming paradigm. The approach was aimed at controlling the complexity of the problem and of the development process. We started with the simplest programming paradigm, using simple data types and expressions and a small but complete set of statement objects. The development process consisted of small steps, meaning either the implementation of a new concept (transforming the concept into an object) or the extension of a model element. As the experiments were successful, we believe that our approach is feasible. Future developments of the modeling language will include: extending the Type, Expression, and Operator classes in order to define and manage structured and object types, and extending the program units with constructs specific to the modular, object-oriented, and component-based paradigms. These steps are considered within the planned evolution of the ComDeValCo framework, which includes steps for defining the structure of the component repository and developing the tools aimed to operate in component definition, validation, evaluation, simulation, and composition.

Acknowledgements

This work was supported by grant ID_546, sponsored by NURC, the Romanian National University Research Council (CNCSIS).

Bibliography

[1] S.W. Ambler, Agile Model Driven Development (AMDD): The Key to Scaling Agile Software Development, http://www.agilemodeling.com/essays/amdd.htm
[2] F.P. Brooks, No Silver Bullet: Essence and Accidents in Software Engineering, IEEE Computer, April 1987, pp. 10-19.
[3] K. Carter, The Action Specification Language Reference Manual, 2002, http://www.kc.com/
[4] I. Crnkovic, M. Larsson, Building Reliable Component-Based Software Systems, Prentice Hall International / Artech House Publishers, ISBN 1-58053-327-2, July 2002, http://www.idt.mdh.se/cbse-book/
[5] M. Fowler, UmlMode, May 2003, http://martinfowler.com/bliki/umlmode.html
[6] J. Greenfield, K. Short, Software Factories: Assembling Applications with Patterns, Models, Frameworks, and Tools, Wiley, 2004.
[7] P. Henderson, R.J. Walters, Behavioural Analysis of Component-Based Systems, Declarative Systems and Software Engineering Research Group, Department of Electronics and Computer Science, University of Southampton, Southampton, UK, 06 June 2000.
[8] S.J. Mellor, Agile MDA, 2005, http://www.omg.org/mda/mda_files/agilemda.pdf
[9] S.J. Mellor, M.J. Balcer, Executable UML: A Foundation for Model-Driven Architecture, Addison-Wesley, 2002.
[10] I. Lazăr, B. Pârv, S. Motogna, I.G. Czibula, C.L. Lazăr, An Agile MDA Approach for Executable UML Activities, Studia UBB, Informatica, LII, No. 2, 2007, pp. 101-114.
[11] P.A. Muller et al., On Executable Meta-Languages Applied to Model Transformations, Model Transformations in Practice Workshop, Montego Bay, Jamaica, 2005.
[12] Object Management Group, MDA Guide Version 1.0.1, 2003, http://www.omg.org/docs/omg/03-06-01.pdf
[13] Object Management Group, UML 2.0 Testing Profile Specification, 2005, http://www.omg.org/cgi-bin/apps/doc?formal/05-07-07.pdf
[14] Object Management Group, Model-Level Testing and Debugging, 2007, http://www.omg.org/cgi-bin/doc?ptc/2007-05-14/
[15] Object Management Group, UML 2.1.1 Superstructure Specification, 2007, http://www.omg.org/cgi-bin/doc?ptc/07-02-03/
[16] B. Pârv, S. Motogna, I. Lazăr, I.G. Czibula, C.L. Lazăr, ComDeValCo: A Framework for Software Component Definition, Validation, and Composition, Studia UBB, Informatica, LII, No. 2, 2007, pp. 59-68.
[17] Project Technology, Inc., Object Action Language, 2002.
[18] Telelogic AB, UML 2.0 Action Semantics and Telelogic TAU/Architect and TAU/Developer Action Language, Version 1.0, 2004.

Bazil Pârv, Ioan Lazăr, and Simona Motogna
Babeş-Bolyai University
Faculty of Mathematics and Computer Science
Department of Computer Science
1, M. Kogălniceanu, Cluj-Napoca 400084, Romania
E-mail: {bparv,ilazar,motogna}@cs.ubbcluj.ro

Received: November 20, 2007

Bazil Pârv is professor at the Department of Computer Science, Faculty of Mathematics and Computer Science, Babeş-Bolyai University, Cluj-Napoca. He published 7 books and university courses, and more than 75 papers. His research topics cover: programming paradigms, component-based software development, mathematical modeling in experimental sciences, and computer algebra.

Dr. Ioan Lazăr is lecturer at the Department of Computer Science, Faculty of Mathematics and Computer Science, Babeş-Bolyai University, Cluj-Napoca. He published 7 books and university courses, and more than 15 papers. His current research topics include: object-oriented analysis and design, modeling languages, and programming methodologies.

Simona Motogna is associate professor at the Department of Computer Science, Faculty of Mathematics and Computer Science, Babeş-Bolyai University, Cluj-Napoca. She obtained her Ph.D. in 2001, with the thesis "Formal Specification of Object-Oriented Languages". Her topics of interest are: compilers, semantics, and formal specification related to object-oriented languages and component-based programming.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 2, pp. 118-126

Integrated System for Stereoscopic Cognitive Vision, Localization, Mapping, and Communication with a Mobile Service Robot

Cătălin Buiu
Politehnica University of Bucharest
Department of Automatic Control and Systems Engineering
Spl. Independentei 313, 060042 Bucharest, Romania
E-mail: cbuiu@ics.pub.ro

Abstract: This paper describes a stereo-vision-based mobile robot that can navigate and explore its environment autonomously and safely while simultaneously building a tridimensional virtual map of the environment.
The control strategy is rule-based, and the interaction with the robot is done via Bluetooth. The stereoscopic vision allows the robot to recognize objects and to determine the distance to the analyzed objects. The robot is able to generate and simultaneously update a full-colour 3D map of the environment being explored. The position and type of each detected and recognized object is marked in this 3D map. Furthermore, the robot will be able to use a gripper in order to collect detected objects and carry them to dedicated collecting bins, and so will be able to work in commercial waste cleanup applications. This application represents a successful integration of computer, control, and communication techniques in mobile service robot control.

Keywords: control, communication, localization, mapping, mobile robot, stereoscopic vision, virtual reality

1 Introduction

More than 7 million robots will be sold from 2005 to 2008, according to estimations of the International Federation of Robotics and of the Economic Commission for Europe of the United Nations. Until 2010, a robust increase of 4% per year in the number of robots is estimated. Many of these are service robots, which are used to assist or even to replace humans in tedious, dull, dangerous, or repetitive tasks. The same sources estimate that by 2010, service robots will be able to fully assist elderly people and people with disabilities, will extinguish fires, will explore industrial pipes, and more [1]. In ecological applications, service robots are used to collect waste and dangerous items in indoor and outdoor environments.
To do that, the robots must be able not only to perceive and act upon the environment by using a wide range of sensors and actuators, but also to manifest human-like cognitive abilities, such as localizing themselves, recognizing and classifying objects, generating maps of the environment, learning from experience, interacting in a natural way with humans and other robots, or developing physical and cognitive abilities in a kind of developmental process similar to that of humans. It is often the case that a team of robots is asked to fulfill such a task. There are already many interesting results in collective robotics; see for example [2], where coordinated control based on artificial vision is investigated, and [3], where decentralized formation control of mobile robots with limited sensing is addressed. The problem of simultaneous localization and mapping (SLAM) consists of concurrently estimating the robot's position and generating a map of its surrounding environment. This is an essential skill for a mobile robot, but to this day it has eluded complete and robust solutions, because noisy robot dynamics and sensors make solving SLAM a difficult task.

Copyright © 2006-2009 by CCC Publications

SLAM has been widely used for navigation and typically makes use of laser range-finders or sonars. An advantage of using stereo vision over laser range-finders is the ability to detect obstacles at different heights. The solution presented in [4] is based on learning maps of 3D point landmarks whose locations are estimated using correlation-based stereo, and whose identification is performed using their appearance in images via the scale-invariant feature transform (SIFT) [5].
The authors derive an estimate of the robot's motion from sparse visual measurements using stereo vision and multiple view geometry techniques [6], known in robotics as visual odometry [7], [8], [9]. A number of professional stereo vision systems and related software packages have been developed and have found interesting applications in various domains, from the control of industrial manipulators for assembly and pick-and-place operations, material handling, collision warning and obstacle detection in robotics, people tracking, environment modeling, autonomous guidance of corn harvesters, and digitizing books, to medical applications such as ophthalmic diagnostics, IR mammography, and robotic laparoscopy. Stereo vision for navigation has a long history and is frequently exploited for autonomous navigation, but it has limitations in terms of density and accuracy in the far field [10]. If landmarks can be placed in the field of view of the camera, the location of a vehicle can be determined by means of stereo vision [11], and if a solid model of the target object is available, a robotic manipulator will have at its disposal a modeled environment for automatic tasks [12]. A stereoscopic vision system for a Khepera miniature robot is presented in [13]; it performs object detection by using stereo disparity and stereo correspondence. An adaptive panoramic stereo vision approach for localizing 3D moving objects has been developed in the Department of Computer Science at the University of Massachusetts at Amherst. In the adaptive stereo model, the sensor geometry can be controlled to manage the precision of the resulting virtual stereo system [14]. Other indoor and outdoor stereo vision systems have been developed and tested with satisfactory results and some drawbacks; see [15], [16] and [17].
A novel optical system allows the capture of a pair of short, wide stereo images from a single camera, which are then processed to detect vertical edges and infer obstacle positions and locations within the planar field of view, providing real-time obstacle detection [18]. Very few applications concern the problem of waste-collecting service robots acting indoors. The application reported in this paper is part of the larger ReMaster research project currently under development at the Autonomous Robotics Lab of the Politehnica University of Bucharest, Romania. This project concerns the development of a commercial cognitive service robot to be used in waste cleanup in office buildings. A first prototype (ReMaster One) has been built. Details on the structure of the prototype and of its cognitive vision system, which used a monocular camera, are given in [19] and [20]. The acquired expertise has been used to propose the structure of, and to design, a stereoscopic cognitive vision system, which is detailed in [21]. The aim of this paper is to present the current phase of the ReMaster project, which consists of the design and implementation of an integrated system for stereoscopic cognitive vision, localization, mapping, and communication with the robot. The goal of this system is to allow the robot to recognize and classify various objects and to determine the distance to them. Combined with the self-localization ability of the robot, this allows the absolute position of detected objects to be determined and marked on a tridimensional map of the working space that is continuously updated. Stereo vision and SLAM are thus integrated in order to create a map of the environment without using any landmarks. The realization of this vision-based mapping is the main contribution of this paper. The paper is structured as follows.
Section 2 gives an overview of the system architecture and of the robot control and communication system, while Section 3 presents the realization of the stereoscopic cognitive vision system. Section 4 describes the way in which the robot is able to generate, maintain and update a tridimensional virtual map of the environment being explored. The last section of the paper presents conclusions and some directions for further research and development.

Cătălin Buiu

2 Robot control and communication system

The integrated system was implemented on a Koala robot (Fig. 1), a mid-size robot designed for real-world applications and capable of carrying larger accessories. It was chosen for this application because it has the functionality necessary for practical use (such as sophisticated battery management) and rides on 6 wheels for indoor and all-terrain operation. It has 16 distance sensors and can be controlled via Bluetooth.

Figure 1: Koala mobile robot (www.k-team.com)

Two commercial webcams have been mounted on top of the Koala at the same level (Fig. 2), at a distance of 95 mm from each other and at a height of 170 mm. The cameras are inclined at 10 degrees and have CMOS 1.3-megapixel sensors (1280×960-pixel images and 640×480 video), manual focus, and a focal distance of 1/4.8 mm.

Figure 2: Stereoscopic vision system on board the Koala

On the robot side, a Bluetooth 333S module is installed and directly connected to the serial interface of the robot. The control program runs on a separate laptop, which communicates with the robot over a Bluetooth connection. The robot can localize itself by using a dedicated and redundant system consisting of a beacon and two transponders ([19], [21]) together with an odometry algorithm. The robot is able to navigate in indoor environments consisting of walls and various objects, such as empty cans and bottles.
The control program is implemented in MATLAB and is based on simple control rules which produce an obstacle-avoidance and waste-finding behavior (Fig. 3). The robot moves forward and is capable of safe navigation and detection of objects in the workspace (Fig. 4). After detecting an obstacle, the robot stops and acts according to the type of the obstacle. If it is a wall, the obstacle is avoided and the robot resumes moving. If it is not a wall, the system uses the distance sensors on board the robot to compute the distance to the object and the corresponding angle. Using these two measurements, the system computes the absolute position of the detected object and compares it with the stored coordinates of previously detected objects. If the object is new (its absolute position is not in the database), it gets more attention from the robot, which turns so that it is facing the object (Fig. 5). The system is then ready to acquire stereoscopic images of the object. The images are processed and analyzed as explained in the next section. The distance to the object is determined and, based on the absolute position of the robot, the absolute position of the object is also determined. Then the robot resumes its movement in the workspace, after storing the type and coordinates of the object in the database and after having marked its position in the tridimensional map, which will be detailed later.

Figure 3: Control algorithm

Figure 4: Robot navigating in a test environment

Figure 5: Robot turning to a detected object

3 Stereoscopic cognitive vision system

Stereoscopy is a technique for inferring the 3D position of objects from two or more simultaneous views of the scene. Reconstruction of the world seen through stereo cameras can be divided into two steps.
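The decision step of the control algorithm of Fig. 3 (wall vs. non-wall, known vs. new object) can be sketched roughly as follows. This is a minimal illustration, not the actual MATLAB controller: the action labels, the position tolerance, and the database layout are hypothetical.

```python
# A sketch of one cycle of the rule-based control strategy described above.
# Action names, the tolerance value, and the object-database format are
# assumptions for illustration; the real controller runs in MATLAB.

def control_step(obstacle_detected, obstacle_type, obj_position,
                 known_objects, tol=0.05):
    """Return the action the robot should take in one control cycle."""
    if not obstacle_detected:
        return "move_forward"
    if obstacle_type == "wall":
        return "avoid_and_resume"
    # Non-wall obstacle: is its absolute position already in the database?
    for known in known_objects:
        if (abs(known[0] - obj_position[0]) < tol and
                abs(known[1] - obj_position[1]) < tol):
            return "resume"            # already mapped, ignore it
    return "turn_and_acquire_images"   # new object: face it, take stereo images
```

A new object thus triggers the stereo acquisition described in the next section, while known objects and walls are simply handled and passed by.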
First, there is the correspondence problem: for every point in one image, find the corresponding point in the other image and compute the disparity between them. This disparity correlates with distance: the higher the disparity of an object's pixels, the closer the object is to the cameras. Second, there is the triangulation step: given the disparity map, the focal distance of the two cameras and the geometry of the stereo setup (relative position and orientation of the cameras), compute the (x, y, z) coordinates of all points in the images. The system presented in this paper solves both steps, as described below. Key advantages of camera-based systems include: they offer minimally complex solutions, have very low costs, are entirely solid-state, and colour information can easily be acquired at the same time as range data, helping to build realistic full-colour 3D models of the environment. All these advantages are exploited in the application reported in this paper. Stereo vision provides real-time, full-field distance information and is useful in many applications in a wide variety of fields, including robotics. There are a number of dedicated software packages, such as the Small Vision System for real-time stereo analysis from SRI's Artificial Intelligence Center. Sentience is a volumetric perception system for mobile robots that uses webcam-based stereoscopic vision to generate depth maps, and from these creates colour 3D voxel models of the environment for obstacle avoidance, navigation and object recognition purposes. A "cognitive vision system" is defined in [22] as a system that uses visual information to achieve: recognition and categorization of objects, structures and events; learning and adaptation; memory and representation of knowledge; control and attention. For example, a cognitive monocular vision system for a mobile robot using a CMUcam2+ camera is presented in [20]. The visual system's architecture is presented in Fig. 6.
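For a rectified, parallel stereo pair, the triangulation step above reduces to the classic relation Z = f·B/d, where f is the focal length in pixels, B the baseline and d the disparity. A minimal sketch (the focal length value used in the test is a hypothetical calibration figure; the 95 mm baseline matches the rig of Section 2):

```python
def depth_from_disparity(disparity_px, focal_px, baseline_mm):
    """Triangulate depth for a rectified parallel stereo pair: Z = f * B / d.

    disparity_px -- horizontal pixel offset of the same point in both images
    focal_px     -- focal length expressed in pixels (from calibration)
    baseline_mm  -- distance between the two camera centres (95 mm here)
    """
    if disparity_px <= 0:
        raise ValueError("zero/negative disparity: point at infinity or bad match")
    return focal_px * baseline_mm / disparity_px
```

Note that depth is inversely proportional to disparity, which is exactly the observation made above: a higher disparity means a closer object.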
All the visual information processing is done on the same separate laptop. Given the disparity map, the focal distance of the two cameras and the geometry of the stereo setup (relative position and orientation of the cameras), the system is able to compute the coordinates of all points in the images. The distance to the object is used to determine the absolute position of the object, which is marked on the 3D map.

Figure 6: Stereovision system's architecture

Screenshots from our application presenting two stereoscopic images of a detected object are given in Fig. 7. These images will be further processed.

Figure 7: Stereoscopic images of a detected object (left- and right-hand camera)

The images contain relevant data that must be brought into a form from which contours can be extracted. The images are binarized by extracting the colour channels corresponding to the colour of the detected objects (yellow, in our case). Then, dilation and erosion filters are applied to the images (see Fig. 8), after which the images are segmented and objects are detected (Fig. 9).

Figure 8: Extraction of the yellow colour channel and application of dilation and erosion filters (left- and right-hand image)

Figure 9: Detected contour of the object in the image

Using simple scalar descriptors, such as area and perimeter, the detected object is recognized and classified, in our case as a can.

4 Generation and update of a tridimensional map of the environment

Simultaneous localization and mapping (SLAM) is an essential capability for mobile robots exploring unknown environments. The robot presented in this paper uses a dedicated self-localization system based on a beacon unit and two transponders (master and slave) [21]. The two transponder units are fixed, while the beacon unit is installed on the robot.
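The binarization and scalar-descriptor classification described above for Figs. 8-9 can be sketched in a few lines of NumPy. All thresholds, the perimeter approximation, and the compactness-based decision rule are illustrative assumptions, not the paper's actual MATLAB implementation:

```python
import numpy as np

def binarize_yellow(img, thresh=128):
    # Yellow pixels have high red and green but low blue components.
    # The channel threshold is a hypothetical value.
    r, g, b = img[..., 0], img[..., 1], img[..., 2]
    return ((r > thresh) & (g > thresh) & (b < thresh)).astype(np.uint8)

def shape_descriptors(mask):
    """Area and an approximate perimeter of the binary blob."""
    area = int(mask.sum())
    # Perimeter approximated as the number of object pixels with at
    # least one 4-connected background neighbour.
    padded = np.pad(mask, 1)
    neigh = (padded[:-2, 1:-1] + padded[2:, 1:-1] +
             padded[1:-1, :-2] + padded[1:-1, 2:])
    perimeter = int(((mask == 1) & (neigh < 4)).sum())
    return area, perimeter

def classify(area, perimeter):
    # Toy rule on compactness (perimeter^2 / (4*pi*area)); a filled,
    # roughly round blob (a can seen from the side) scores near 1.
    compactness = perimeter ** 2 / (4 * np.pi * area)
    return "can" if compactness < 2.0 else "unknown"
```

In the real pipeline, dilation and erosion would be applied between binarization and descriptor extraction to remove noise, as shown in Fig. 8.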
Half-duplex bidirectional communication between the beacon and the transponders is realized using infrared light and ultrasound. The system uses an ATmega8 microcontroller running at 16 MIPS at 16 MHz. The localization of the robot is realized by triangulation of the distances to the two transponders. Moreover, odometry algorithms contribute to a more precise localization of the robot in the working space. The system is able to generate a virtual map of the explored environment, in which the space, the robot and the objects are modeled as VRML (Virtual Reality Modeling Language) objects. VRML is a standard file format for representing tridimensional interactive vector graphics; it also enables the integration of interactive 3D graphics into the web. By using the Virtual Reality Toolbox from MATLAB, the system generates realistic 3D views of the working space and the robot (Fig. 10), and of objects (Fig. 11, in which a question mark denotes an unknown object).

Figure 10: VRML models of the working space and robot

Figure 11: VRML models of objects

The robot explores the working space according to the control strategy presented above and simultaneously updates the tridimensional map. After an object is detected and the robot turns towards it, both cameras take images of the scene. The images are then analyzed and the object recognized. The absolute position of the object is also determined and the object is marked on the map (Fig. 12). Then the robot resumes its movement in the workspace and the associated activities: navigation, search, classification and localization of objects. The test results show good and robust functioning of the stereovision system, and although the processing times are not low, this can be improved by the use of an embedded PC with more computing power.
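The triangulation from the two measured transponder distances is a standard two-circle intersection. A sketch, assuming a map frame in which the transponders sit at (0, 0) and (baseline, 0) and the robot operates in the y > 0 half-plane (the real system additionally fuses odometry):

```python
import math

def localize(d1, d2, baseline):
    """Position of the beacon (robot) from its distances d1, d2 to two
    fixed transponders placed at (0, 0) and (baseline, 0).

    Classic two-circle intersection; the y > 0 solution is returned.
    Frame placement and the half-plane choice are illustrative assumptions.
    """
    x = (d1 ** 2 - d2 ** 2 + baseline ** 2) / (2 * baseline)
    y_sq = d1 ** 2 - x ** 2
    if y_sq < 0:
        raise ValueError("inconsistent distance measurements")
    return x, math.sqrt(y_sq)
```

The ambiguity between the two mirror solutions (y > 0 vs. y < 0) is why a redundant scheme, here odometry plus the master/slave transponder pair, is useful in practice.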
5 Summary and conclusions

The main research thrust of this paper has been to demonstrate that an integrated system for communication, control, localization and mapping using stereoscopic vision and 3D maps can be designed and implemented for a mobile service robot which collects waste in indoor environments. This integrated system will be transferred to a more powerful version of the first prototype (ReMaster One) of the commercial waste cleanup robot that is the final aim of the ReMaster project. The new robot will have a gripper, such that detected objects can be grasped and carried to dedicated bins.

Figure 12: Robot taking pictures of an object, recognizing it as a can, and marking it on the map

Future efforts will address the design and implementation of new navigation strategies based on fuzzy logic. Image processing algorithms based on cellular neural networks are currently under investigation and implementation. More research will be done regarding the interactive aspects of the robotic system, so that the robot will be able to interact with humans and other robots in a natural way.

Acknowledgements

We acknowledge the support of the Romanian Government through the Excellence Research Program (contract 83-CEEX-II-03/31.07.2006) and the contribution of Cristian Ionita and Laura Antochi to the development of the stereoscopic vision system and virtual map.

Bibliography

[1] C. Buiu (Ed.), Cognitive Robots (in Romanian), Editura Universitara, 2008.

[2] C. M. Soria, R. Carelli, J. M. Ibarra Zannatha, Coordinated control of mobile robots based on artificial vision, International Journal of Computers, Communications, and Control, Vol. I (2006), No. 2, pp. 85-94.

[3] K. D. Do, Bounded controllers for decentralized formation control of mobile robots with limited sensing, International Journal of Computers, Communications, and Control, Vol. II (2007), No. 4, pp. 340-354.

[4] P. Elinas, R. Sim, J. J. Little, σSLAM: stereo vision SLAM using the Rao-Blackwellised particle filter and a novel mixture proposal distribution, in Proc. of the IEEE Int. Conf. on Robotics and Automation (ICRA), Florida, USA, 2006.

[5] D. G. Lowe, Object recognition from local scale-invariant features, in Int. Conf. on Computer Vision, Corfu, Greece, September 1999, pp. 1150-1157.

[6] R. Hartley, A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge, UK: Cambridge Univ. Press, 2000.

[7] D. Nister, O. Naroditsky, J. Bergen, Visual odometry, in Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), 2004, pp. 652-659.

[8] M. Agrawal, K. Konolige, Rough terrain visual odometry, in Proceedings of the International Conference on Advanced Robotics (ICAR), August 2007.

[9] K. Konolige, M. Agrawal, Frame-frame matching for realtime consistent visual mapping, in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), April 2007.

[10] M. J. Procopio, T. Strohmann, A. R. Bates, G. Grudic, J. Mulligan, Using binary classifiers to augment stereo vision for enhanced autonomous robot navigation, University of Colorado at Boulder Technical Report CU-CS-1027-07, April 2007.

[11] L. K. Wang, S. Hsieh, E. C. Hsueh, F. Hsaio, K. Huang, Complete pose determination for low altitude unmanned aerial vehicle using stereo vision, in Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2005), pp. 108-113.

[12] S. Lee, D. Jang, E. Kim, S. Hong, J. Han, A real-time 3D workspace modeling with stereo camera, in Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2005), pp. 2140-2147.

[13] T. Chinapirom, U. Witkowski, R. Ulrich, Stereoscopic camera for autonomous mini-robots applied in KheperaSot league, Research Report, University of Paderborn, Germany, 2007.

[14] D. R. Karuppiah, Z. Zhu, P. Shenoy, E. M. Riseman, A fault-tolerant distributed vision system architecture for object tracking in a smart room, in B. Schiele and G. Sagerer (Eds.), Springer Lecture Notes in Computer Science 2095, pp. 201-219, 2007.

[15] S. Florczyk, Robot Vision: Video-Based Indoor Exploration with Autonomous and Mobile Robots, Weinheim: Wiley-VCH, 2005.

[16] M. F. Ahmed, Development of a Stereo Vision System for Outdoor Mobile Robots, M.S. thesis, University of Florida, 2006.

[17] F. Rovira-Más, S. Han, J. Wei, J. F. Reid, Autonomous guidance of a corn harvester using stereo vision, Agricultural Engineering International: the CIGR Ejournal, Manuscript ATOE 07 013, Vol. IX, July 2007.

[18] W. Lovegrove, B. Brame, Single-camera stereo vision for obstacle detection in mobile robots, in Intelligent Robots and Computer Vision XXV: Algorithms, Techniques, and Active Vision, Proceedings of the SPIE, Volume 6764, pp. 67640T, 2007.

[19] C. Buiu, F. Cazan, R. Ciurlea, Developing of a service robot to recognize and sort waste, in 16th International Conference on Control Systems and Computer Science, pp. 298-303, Politehnica Press, Bucharest, 2007.

[20] A. Pavel, C. Vasile, C. Buiu, Cognitive vision system for an ecological mobile robot, in Proceedings of SINTES 13, the International Symposium on System Theory, Automation, Robotics, Computers, Informatics, Electronics and Instrumentation, pp. 267-272, Universitaria Press, Craiova, 2007.

[21] C. Buiu, Design and development of a waste cleanup service robot, in Proceedings of the First International EUROBOT Conference, Heidelberg, pp. 194-202, 2008.

[22] A. G. Cohn, D. Magee, A. Galata, D. Hogg, S. Hazarika, Towards an architecture for cognitive vision using qualitative spatio-temporal representations and abduction, in C. Freksa, W. Brauer, C. Habel, K. F. Wender (Eds.), Spatial Cognition III: Routes and Navigation, Human Memory and Learning, Spatial Representation and Spatial Learning, pp. 232-248, Springer-Verlag, 2003.

Int. J. Comput. Commun., ISSN 1841-9836, Vol. 7 (2012), No. 3 (September), pp. 403-416

A Joint Routing and Time-Slot Assignment Algorithm for Multi-Hop Cognitive Radio Networks with Primary-User Protection

H. Chen, Q. Du, P. Ren

Hao Chen
1. Department of Information and Communication Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
2. State Key Laboratory of Integrated Services Networks, Xidian University, Xi'an, Shaanxi, 710071, China
E-mail: js.sq.chenhao@stu.xjtu.edu.cn

Qinghe Du, Pinyi Ren
Department of Information and Communication Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
E-mail: {duqinghe, pyren}@mail.xjtu.edu.cn

Abstract: Cognitive radio has recently emerged as a promising technology to improve the utilization efficiency of the radio spectrum. In cognitive radio networks, secondary users (SUs) must avoid causing any harmful interference to primary users (PUs) and transparently utilize the licensed spectrum bands. In this paper, we study the PU-protection issue in multi-hop cognitive radio networks. In such networks, secondary users carefully select paths and time slots to reduce the interference to PUs. We formulate the routing and time-slot assignment problem as a mixed integer linear program (MILP). To solve the MILP, which is NP-hard in general, we propose an algorithm named RSAA (Routing and Slot Assignment Algorithm). By relaxing the integral constraints of the MILP, RSAA first solves the max flow from the source to the destination. Based on the max flow, RSAA constructs a new network topology, on which it uses the branch and bound method to obtain a near-optimal assignment of time slots and paths. Theoretical analysis shows that the complexity of our proposed algorithm is O(N^4).
Also, simulation results demonstrate that our proposed algorithm can obtain near-optimal throughputs for SUs.

Keywords: cognitive radio networks; primary-user protection; joint routing and time-slot assignment.

1 Introduction

The rapid growth in the number of wireless applications such as WiFi, WiMAX and 3G has led to a serious radio spectrum shortage. Recent studies by the Federal Communications Commission (FCC) highlight that the average utilization of some licensed spectrum bands, allocated through the current static frequency spectrum assignment policies, varies between 15% and 85% [1]. To make sufficient use of the spectrum resources in the environment, the notion of cognitive radio (CR) was proposed by Dr. Joseph Mitola in 1999 [2]. In cognitive radio networks, nodes are allowed to sense and explore a wide range of the frequency spectrum and identify currently underutilized spectrum blocks for data transmission. CR can transparently exploit the licensed spectrum bands and is widely considered a key technique for the next generation of wireless communication [3]. To maximize the advantages of cognitive radio networks (CRNs), it is necessary to update the physical layer, the media access control (MAC) layer and the network layer of the traditional wireless communication system. After the concept of CR was proposed, many studies on spectrum sensing and MAC protocol design were conducted and much progress has been made [4] [5] [6]. The aim of spectrum sensing is to find the spectrum holes in the CRNs, while the MAC protocol selects the one-hop optimal spectrum bands for SUs' transmissions. However, Khalife et al. indicated that a MAC protocol which gives optimal solutions in a single-hop configuration may become largely inefficient in a multi-hop scenario, and that it is of great importance to design cross-layer protocols capable of scheduling, spectrum selection and routing [7].

Copyright © 2006-2012 by CCC Publications

Cesana et al.
in [8] pointed out the challenges of routing in cognitive radio networks: any routing solution designed for multi-hop CRNs must be tightly coupled to the entire cognitive cycle of spectrum management, and the routing module should be able to perform fast route maintenance at the sudden appearance of PUs. The authors of [9] extended the routing solutions of multi-channel multi-radio networks (MCMRNs) to CRNs and proposed a layered graph framework to address channel assignment and routing jointly. Hou et al. in [10] [11] illustrated the difference between routing in MCMRNs and in CRNs: in CRNs the radios can send packets over non-contiguous frequency bands, and the authors proposed a mixed integer non-linear programming (MINLP) model to minimize the required network-wide radio spectrum resources. Filippini et al. in [12] proposed minimum-maintenance-cost routing for cognitive radio networks; they formulated the maintenance cost problem as an integer optimization model and, by carefully selecting routing metrics, designed a heuristic distributed algorithm. In all of the above routing solutions, accurate information about spectrum availability sensed by the physical layer is crucial for the routing module, so these solutions place severe demands on the spectrum sensing module of CR nodes. To ease this demand, Chowdhury et al. in [13] proposed a routing solution which avoids harmful interference to PU receivers by designing proper routing metrics: each time, the SUs choose the path that passes through regions having minimum overlap with the PUs' transmission coverage areas. In [14] Ding et al. proposed a distributed algorithm to maximize the capacity of links without generating harmful interference to other users by performing joint routing, dynamic spectrum allocation and scheduling. In [15] Xie et al. proposed a geometric approach for relay selection which avoids causing harmful interference to PUs in CRNs.
Each time, the approach greedily selects the best available channels to transmit data to the nearest or the farthest neighbors. In [16] Zhou et al. gave a mathematical model aimed at minimizing the interference to PUs; by relaxing the model's constraint conditions, they transformed the original optimization problem into a linear programming model and gave a joint channel assignment and path selection algorithm. In spectrum-sharing multi-hop CRNs, to guarantee the PUs' priority on the licensed spectrum bands, SUs must not generate harmful interference to PUs. In this paper, to make sufficient use of the spectrum, we design the routing module of the SUs through joint routing and time-slot assignment. First, we exploit the protocol model [17], which describes the conditions for successful transmission between two nodes, and abstract the routing and time-slot assignment problem as a mixed integer linear programming (MILP) model. In the MILP model, our objective is to maximize the throughput of the SUs, and our constraints are avoiding harmful interference to PUs and eliminating SUs' conflicts caused by concurrent transmissions. Then, to get an approximate solution of the MILP, which is an NP-hard problem, we propose a near-optimal joint routing and time-slot assignment algorithm named RSAA. Theoretical analysis shows that the complexity of RSAA is O(N^4), and simulation results demonstrate that RSAA can obtain near-optimal throughput. Furthermore, through the simulation results we analyze the effect of node density and slot periods on the throughput of multi-hop CRNs.

2 Network model and problem formulation

2.1 Network model

In 2004, the FCC proposed to allow unlicensed wireless devices to utilize television channel frequencies under the precondition of causing no harmful interference to PUs.
Under this proposal, SUs could use the underutilized broadcast TV spectrum bands for multi-hop communications. In multi-hop CRNs, TDMA is necessary for the avoidance of conflicts among SUs and the reduction of interference to PUs. In current WLANs, nodes use CSMA/CA to avoid access conflicts. However, CSMA/CA cannot ensure that SUs cause no harmful interference to PUs, so it may not be suitable for spectrum-sharing CRNs. In this paper, we consider a cognitive ad hoc network consisting of P PUs, N SUs and a cognitive scheduling center (CSC). The CSC is able to access the database of PUs [8] and gathers information about the PUs' locations and interference thresholds. The interference threshold is defined as the highest interference power which a PU can tolerate. The CSC is also designed to collect information about the SUs' locations and transmitting power via existing communication networks such as GSM. After gathering all this information, the CSC computes the optimal routing and time-slot assignments and schedules the SUs' access to the licensed spectrum by delivering these messages to the corresponding nodes. The network architecture is shown in Fig. 1. As shown in Fig. 1, the TV receivers act as PUs and have priority to use the spectrum band F. The CSC coordinates and schedules the SUs to utilize band F transparently. For example, the CSC assigns the path and slot solution S →(1) A →(2) B →(3) C →(1) E →(2) D for source node S and destination node D, where "A →(1) B" denotes that node A transmits to node B in time slot 1. Although the path S → A → G → E → D has fewer hops and much more throughput, the CSC does not choose it, because node G would cause harmful interference to PUs. This paper focuses on finding the optimal routing and time-slot assignment which maximizes the SUs' throughput while avoiding harmful interference to PUs.
Figure 1: System model

2.2 Problem formulation

To describe the network mathematically, we first model the conditions for successful transmission among nodes and introduce binary integral variables $x_{ijt}$, where $x_{ijt}=1$ indicates that SU node $i$ sends packets to SU node $j$ in slot $t$, and $x_{ijt}=0$ otherwise. That is,

$$x_{ijt} = \begin{cases} 1, & \text{SU } i \text{ sends packets to SU } j \text{ in slot } t \\ 0, & \text{otherwise} \end{cases} \qquad (1)$$

We assume that all the SUs' radios use the same transmitting power $Q$ and that the successful transmission range among SUs is $R_T$. Let $A_i$ denote the neighbor set of node $i$, so that

$$A_i = \{ j \mid d_{ij} < R_T \} \qquad (2)$$

where $d_{ij}$ denotes the Euclidean distance between node $i$ and node $j$. Let $I_i$ denote the interfered node set of node $i$, and $R_I$ the interference range among SUs, so that

$$I_i = \{ j \mid d_{ij} < R_I \} \qquad (3)$$

Note that node $i$ cannot transmit to multiple nodes at the same time, so we have

$$\sum_{q \in A_i} x_{iqt} \le 1 \qquad (4)$$

Due to potential interference among nodes in the network, if node $i$ uses slot $t$ for transmitting data to node $j \in A_i$, then any other node which conflicts with node $j$ must not use this slot. So we have

$$x_{ijt} + x_{pqt} \le 1, \quad p \in I_j,\; p \ne i,\; q \in A_p \qquad (5)$$

This is the difference between our model and the model in [9][10], where the authors state the conflict constraint as

$$x_{ijt} + \sum_{p \in I_j,\, p \ne i,\, q \ne j} x_{pqt} \le 1 \qquad (6)$$

It is important to note that under the conflict constraint (5), two links which interfere with $x_{ijt}$ but do not interfere with each other can access the channel in the same time slot. Constraint (4) is a node's successful sending condition and constraint (5) is a node's successful receiving condition. Under constraints (4) and (5), the nodes in the network can transmit data without conflicting with each other. To guarantee that two nodes transmit data without bit errors, the flow rate on each link must not exceed the link's capacity.
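Constraints (4) and (5) above can be checked directly for any candidate slot assignment. A sketch of such a feasibility check, where a schedule is a set of (i, j, t) triples and the interference sets $I_j$ are given as a plain dictionary (the network data in the test are made up for illustration):

```python
# Feasibility check of a slot assignment against constraints (4) and (5).
# `schedule` is a set of (i, j, t) triples: SU i transmits to SU j in slot t.
# `interferers[j]` plays the role of I_j, the set of nodes whose
# transmissions interfere with receiver j.  Illustrative sketch only.

def is_conflict_free(schedule, interferers):
    for (i, j, t) in schedule:
        # Constraint (4): node i sends to at most one node per slot.
        if sum(1 for (a, b, s) in schedule if a == i and s == t) > 1:
            return False
        # Constraint (5): no other link active in slot t may originate
        # from a node that interferes with receiver j.
        for (p, q, s) in schedule:
            if s == t and p != i and (p, q) != (i, j) and p in interferers[j]:
                return False
    return True
```

Note that, exactly as the text observes, two links that both interfere with the link (i, j) but not with each other still pass this check in the same slot, which is what distinguishes constraint (5) from the stricter constraint (6).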
Let $f_{ijt}$ denote the flow rate between node $i$ and node $j$ in time slot $t$, and $g_{ij}$ the channel gain between node $i$ and node $j$. We have

$$f_{ijt} \le x_{ijt}\, B \log_2\!\left(1 + \frac{g_{ij} Q}{\eta}\right) \qquad (7)$$

where $B$ denotes the bandwidth of spectrum band $F$ and $\eta$ denotes the noise power in the environment. Note that the denominator inside the log function contains only $\eta$. This is because the interference constraint (5) ensures that the interference power received at an SU's receiver is negligible, which helps to keep our model linear. Moreover, to make sure that no packets are dropped at the intermediate nodes, the aggregate data rates over the slot period should meet the flow conservation constraint, i.e.

$$\sum_{t=1}^{T} \sum_{j \in A_i} f_{ijt} = \sum_{t=1}^{T} \sum_{i \in A_p} f_{pit} \qquad (8)$$

where $T$ denotes the period of the scheduling slots of the network. To make sure that the total transmitting power of the SUs detected at the PUs does not exceed the PUs' threshold $INT$, we have the following constraints:

$$\sum_{i=1}^{N} x_{ijt}\, g_{ik}\, Q \le INT, \quad j \in A_i,\; k \in P \qquad (9)$$

where $P$ denotes the set of PUs. In practice the value of $INT$ depends on the sensitivity of the PU receivers as well as on the noise power in the environment. Note that the average throughput $S$ of the SUs is the ratio of the aggregate amount of data to the number of time slots, so we have

$$S = \frac{1}{T} \sum_{t=1}^{T} \sum_{i \in A_s} f_{sit} \qquad (10)$$

where $s$ denotes the source node. When there are a number of source-destination pairs in the network, we can introduce one more virtual source-destination pair and simplify the network to a single source-destination pair network. In such networks, our aim is to find the optimal routing and time-slot assignment which maximizes the throughput of the SUs. Mathematically, we have the following optimization problem:

$$\max\; \frac{1}{T} \sum_{t=1}^{T} \sum_{i \in A_s} f_{sit} \qquad (11)$$

s.t.
$$\begin{cases}\sum_{q\in A_i} x_{iqt}\le 1\\[2pt] x_{ijt}+x_{pqt}\le 1,\quad p\in I_j,\ p\ne i,\ q\in A_p\\[2pt] f_{ijt}\le x_{ijt}B\log_2(1+g_{ij}Q/\eta)\\[2pt] \sum_{t=1}^{T}\sum_{j\in A_i} f_{ijt}=\sum_{t=1}^{T}\sum_{p:\,i\in A_p} f_{pit}\\[2pt] \sum_{i=1}^{N} x_{ijt}g_{ik}Q\le I_{NT},\quad j\in A_i,\ k\in P,\ 0\le t\le T\\[2pt] x_{ijt}\in\{0,1\}\\[2pt] f_{ijt}\ge 0\end{cases} \qquad (12)$$

Note that the above optimization problem has both integer and continuous variables and that all constraints are linear, so it is a mixed-integer linear programming (MILP) problem. Its optimization variables are the continuous variables $f_{ijt}$ and the binary variables $x_{ijt}$, while $B$, $T$, $Q$, $\eta$ and $I_{NT}$ are all constants.

3 Routing and time-slot assignment algorithm

The above optimization problem is a MILP problem, which is NP-hard in general [18]. To solve the MILP, Yuan et al. [16] proposed a greedy algorithm that relaxes the integrality constraints and thus simplifies the MILP to a linear programming (LP) problem. After solving the LP, the algorithm fixes the binary variables one by one in descending order of their relaxed values. Although the complexity of this algorithm equals that of solving just one LP, it cannot guarantee that the solution is a feasible flow, nor that the result always satisfies the interference constraints (4)–(5). Hou et al. [10] proposed the sequential fixing (SF) algorithm, which solves O(N) LP problems iteratively to fix the binary variables. Another way to handle the MILP is to replace the integrality constraint with the constraint

$$x_{ijt}(x_{ijt}-1)=0. \qquad (13)$$

Based on constraint (13) we can transform the MILP into a quadratic program (QP), but we still cannot obtain the optimal solution with polynomial complexity because constraint (13) is non-convex. Sherali et al. [19] pointed out that, to solve a MILP, one should exploit the problem's inherent special structure both in the model formulation and in the algorithmic development.

Taking a closer look at our MILP, we find that if we relax the integrality constraint (1) to the continuous constraint

$$0\le x_{ijt}\le 1, \qquad (14)$$

and neglect the interference constraints (4)–(5), the MILP reduces to a kind of max-flow problem. The optimal solutions of the MILP are therefore likely to be subsets of the max flow. Based on this idea, we develop our algorithm RSAA, which solves the MILP efficiently. Let $E$ denote the number of edges and $N$ the number of SU nodes in the CRN. Since the original MILP contains $ET$ binary variables, obtaining its optimal solution by exhaustive search would require enumerating $2^{ET}$ combinations of $x_{ijt}$. Once the binary variables are fixed, the original MILP reduces to the following LP problem:

$$\max\ \frac{1}{T}\sum_{t=1}^{T}\sum_{i\in A_s} f_{sit} \qquad (15)$$

subject to

$$\begin{cases}f_{ijt}\le x_{ijt}B\log_2(1+g_{ij}Q/\eta)\\[2pt] \sum_{t=1}^{T}\sum_{j\in A_i} f_{ijt}=\sum_{t=1}^{T}\sum_{p:\,i\in A_p} f_{pit}\\[2pt] \sum_{i=1}^{N} x_{ijt}g_{ik}Q\le I_{NT},\quad j\in A_i,\ k\in P,\ 0\le t\le T\\[2pt] f_{ijt}\ge 0\end{cases} \qquad (16)$$

We denote this new LP problem as LP1; it contains $ET$ continuous variables. In the RSAA algorithm, we first neglect the constraints (4), (5) and (7) and solve the max-flow problem from the source to the destination. The max-flow problem can be formulated as the following LP:

$$\max\ \frac{1}{T}\sum_{t=1}^{T}\sum_{i\in A_s} f_{sit} \qquad (17)$$

subject to

$$\begin{cases}f_{ijt}\le B\log_2(1+g_{ij}Q/\eta)\\[2pt] \sum_{t=1}^{T}\sum_{j\in A_i} f_{ijt}=\sum_{t=1}^{T}\sum_{p:\,i\in A_p} f_{pit}\\[2pt] f_{ijt}\ge 0\end{cases} \qquad (18)$$

We denote this LP problem as LP2 and use the push–pull flow algorithm [20] to obtain the max flow $\Phi=\{f_{ijt}\}$. After obtaining $\Phi$, we construct a new set of binary variables

$$X_T=\{\,x_{ijt}\mid f_{ijt}>0,\ f_{ijt}\in\Phi\,\} \qquad (19)$$

and $X_T$ is the new enumeration set of RSAA. We denote by $K$ the number of variables in $X_T$, i.e., $K=|X_T|$. When we enumerate the new variables in $X_T$, we use constraints (4)–(5) as the branch-and-bound conditions. The complete RSAA algorithm is given in Table 1.
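The restricted search set (19) keeps only the link–slot pairs that actually carry flow in the LP2 solution. A minimal sketch (the flow dictionary below is a made-up example, not a real LP2 output):

```python
def build_search_set(flow):
    """Eq. (19): keep the (i, j, t) variables whose LP2 flow is positive.
    `flow` maps (i, j, t) -> f_ijt from the max-flow solution Phi."""
    return {key for key, f in flow.items() if f > 0}

# Hypothetical LP2 solution: two link-slot pairs carry flow, one does not.
phi = {(0, 1, 1): 2.5, (1, 2, 2): 2.5, (0, 2, 1): 0.0}
X_T = build_search_set(phi)   # K = |X_T| = 2
```

Shrinking the enumeration from all $ET$ binary variables to the $K$ flow-carrying ones is what makes the subsequent branch-and-bound tractable.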
In the RSAA procedure, we first construct the new search variables (edges) from the solution of the max-flow problem and then cut off all nodes and edges that play no part in the max flow. To solve the MILP on the newly constructed network, we take conditions (4)–(5) as bound conditions and reduce the MILP to a number of LP1 problems. When solving LP1, we treat it as a special kind of max-flow problem and use Dinic's algorithm. As the procedure shows, the solution of RSAA always satisfies all the constraints of the original MILP, so we can conclude that the solution of RSAA is a feasible solution to the original MILP problem.

Table 1: RSAA algorithm

Step 1: Let the CSC update the PUs' and SUs' locations and compute the SUs' link capacities and their interference power at the PUs. Introduce a virtual source–destination pair and reduce the network to a single source–destination pair network.

Step 2: Set up and solve LP2 and obtain its solution $\Phi$. Use equation (19) and $\Phi$ to construct the new binary variable set $X_T$. Sort the new binary variables in ascending order of slot index.

Step 3: Initialize the SUs' throughput $S=0$; set the current optimal flow solution $\Phi^*=\emptyset$ and temp_i = 0.

Step 4: If temp_i $\ge 2^{K}$, the algorithm ends; output the optimal throughput $S$ and the optimal flow solution set $\Phi^*$. Otherwise, transform temp_i into a $|X_T|$-digit binary number, each digit representing the assignment of the corresponding link and slot. If every digit of temp_i satisfies the interference conditions (4)–(5), go to Step 5; otherwise set temp_i = temp_i + $2^{b\_last-1}$, where b_last is the smallest digit index among all of temp_i's digits that violate the interference conditions (4)–(5), and go back to Step 4.
Step 5: After obtaining one combination of $\{x_{ijt}\}$, use Dinic's algorithm to solve LP1 and obtain the max-flow value temp_fval together with the corresponding flow rates $F_T=\{f_{ijt}\}$. In the augmenting step of Dinic's algorithm, when searching for augmenting flows in the layered networks, select the augmenting flows in ascending order of their interference to the PUs. If adding the augmenting flow with the minimum interference would exceed $I_{NT}$, the augmenting step of Dinic's algorithm ends; otherwise continue to find further augmenting flows. When Dinic's algorithm ends, if $S<$ temp_fval, set $S=$ temp_fval and $\Phi^*=F_T$; in either case go back to Step 4.

4 Performance analysis of RSAA

4.1 Complexity of RSAA

We now analyze the complexity of RSAA using random network theory. For the original MILP problem, the complexity of obtaining the optimal solution is exponential: we would have to solve $O(2^{|E|})$ linear programs LP1. By reducing the search variables, RSAA brings the complexity down to polynomial. In fact, we have the following theorem.

Theorem 1. In a network where the average degree of each node is constant, the complexity of RSAA is $O(N^4)$.

Proof: Let the constant $d$ denote the average degree of each node in the network; in practice the value of $d$ is determined by the node density and the nodes' transmission range. By the ER model of random network theory [21], the average number of edges in the network is

$$E=\frac{dN}{2} \qquad (20)$$

Let $L$ denote the average path length from the source node to the destination node; then the number of nodes in the network satisfies [21]

$$N\propto d^{L} \qquad (21)$$

so the average route length can be written as

$$L=\alpha\,\frac{\log_2 N}{\log_2 d}, \qquad (22)$$

where $\alpha$ is a constant. Note that the capacities of the links in a random network are i.i.d., so the maximum throughput we obtain from LP2 is $dc$, where $c$ denotes the average capacity of the links.
We can then conclude that the maximum number of paths in the max flow from the source to the destination is $d$. Hence the number of newly constructed binary variables is

$$K=|X_T|=dTL=\alpha T d\,\frac{\log_2 N}{\log_2 d}. \qquad (23)$$

So RSAA needs to solve $O(2^{K})=O(N)$ linear programs LP1. With Dinic's algorithm, the complexity of solving LP1 is $O(N^3)$; hence the total complexity of the RSAA algorithm is $O(N\cdot N^3)=O(N^4)$. □

From Theorem 1 we see that both RSAA and SF need to solve $O(N)$ linear programs LP1; the difference between them is that RSAA exploits the flow structure of the network to obtain better performance.

4.2 Optimal approximation

Although RSAA restricts the search space, and this restriction may reduce the SUs' throughput, we find that the reduction is almost negligible. In fact, we have the following theorem.

Theorem 2. If $\Phi^*$ is the optimal solution of the MILP problem and $P_T$ is the optimal routing and time-slot assignment, then the intersection of $P_T$ and RSAA's newly constructed search set $X_T$ is not empty.

Proof: We prove this theorem by contradiction. Suppose the intersection of $P_T$ and the newly constructed search set $X_T$ is empty, i.e., $P_T\cap X_T=\emptyset$. Denote the optimal flow rate of the MILP by $\Phi^*$ and the max flow of LP2 by $\Phi$. Because $\Phi^*$ is a solution of the MILP, it satisfies all the flow constraints of the MILP, and therefore all the constraints of LP2; so $\Phi^*$ is a feasible augmenting flow for LP2. Because $P_T\cap X_T=\emptyset$, $P_T$ is an independent augmenting path along which we can augment the original max flow by $\Phi^*$. Then the optimal value of LP2 would be $\Phi^*+\Phi$, which contradicts the fact that $\Phi$ is the optimal solution of LP2.
Hence the supposition that the intersection of $P_T$ and RSAA's newly constructed search set $X_T$ is empty is false, and the theorem is proven. □

However, Theorem 2 cannot guarantee that the solution of RSAA is optimal; only when $P_T$ is a subset of $X_T$ can we say that the solutions of RSAA are optimal. In fact, from Theorem 2 we can conclude that $P_T$ is a subset of $X_T$ with high probability. In particular, in small networks where the max flow consists of only one path, we can conclude from Theorem 2 and the flow conservation condition (8) that the solution of RSAA is optimal.

4.3 The effect of time slots and conflicts

Our MILP model contains a constant $T$ representing the slot-scheduling period of the CRN. The existence of $T$ increases the complexity of our algorithm, because $T$ determines the number of variables. At the network layer, the routing module should fix the best slot period according to network parameters such as the node density and the PU's threshold. Note that the minimum slot period should be long enough to avoid both conflicts among SUs and interference to PUs, so the average minimum value of $T$ is

$$T_{\min}=\max\{\,LQg^{*}/I_{NT},\ L/c\,\} \qquad (24)$$

where $c$ is the average number of mutually interfering edges, $L$ is the average route length from the source to the destination, and $g^{*}$ denotes the average channel gain between two neighboring nodes. In equation (24), the first term means that to connect the source and the destination at least $LQg^{*}/I_{NT}$ slot periods are needed to guarantee no harmful interference to PUs; the second term means that to avoid conflicts among SUs at least $L/c$ slot periods are needed. Equation (24) thus states that only when the slot period is long enough to avoid harmful interference to PUs and to eliminate conflicts among SUs can the source–destination pair set up a successful route.
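Equation (24) can be evaluated directly once the network parameters are known. A toy sketch (all parameter values below are illustrative, not taken from the paper's simulations):

```python
def min_slot_period(route_len, power_w, avg_gain, pu_threshold_w, conflict_edges):
    """Eq. (24): T_min = max(L*Q*g* / I_NT, L / c), where the first term
    guards the PU threshold and the second avoids SU conflicts."""
    return max(route_len * power_w * avg_gain / pu_threshold_w,
               route_len / conflict_edges)

# Example: 4-hop route, 2 W transmit power, average gain 1e-12 toward the PU,
# PU threshold 1e-12 W, and 2 mutually interfering edges on average.
t_min = min_slot_period(4, 2.0, 1e-12, 1e-12, 2)   # max(8.0, 2.0) = 8.0
```

Here the PU-protection term dominates, so the PU threshold, not SU-to-SU conflict, dictates the minimum scheduling period.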
In fact, it is necessary to ensure that the slot period is greater than this minimum, or the slot period becomes the bottleneck constraint on the SUs' throughput. When the slot period is smaller than the minimum, increasing it dramatically increases the SUs' throughput; but once the slot period exceeds the minimum, increasing it further does not necessarily increase the SUs' throughput. In this regime the SUs' throughput in the CRN reflects the tradeoff between the amount of data and the delay, as shown in equation (10). However, if we divide one fixed length of time into a number of slots, we find that the more slots we divide the fixed time into, the more throughput the SUs can obtain. In the limiting case, if we divide the fixed time into infinitely many slots, the solution of the MILP approximates the solution of the relaxed LP in which the integrality constraint is replaced by constraint (14).

To describe the effect of conflicts on the SUs' throughput, we introduce $r$, the ratio of a node's interference distance to its transmission distance:

$$r=R_I/R_T \qquad (25)$$

Note that as the ratio $r$ increases, the number of conflicting edges in the network increases; this means the number of conflict-constraint inequalities (5) increases, so the optimal value of the MILP decreases. Intuitively, increasing the ratio $r$ reduces concurrent transmissions and cuts down the SUs' throughput. However, as $r$ increases, the number of paths from the source to the destination also decreases because of the concurrent conflicts; so, by Theorem 2, we can conclude that the solution of RSAA gets closer to the optimal result as the ratio $r$ increases. From the analysis above, RSAA obtains near-optimal SU throughput at a complexity of $O(N^4)$. In the next section we verify this performance analysis through simulations.
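The Step 4 skip rule of Table 1 prunes the enumeration by jumping over the whole block of assignments that share a violating low-order digit. A simplified sketch, where `checker` is a stand-in for the interference tests (4)–(5) (the interference rule in the example is invented for illustration):

```python
def enumerate_assignments(k, checker):
    """Sketch of Table 1, Step 4: enumerate k-digit binary assignments,
    skipping 2^(b_last - 1) values when the smallest violating digit is
    b_last.  `checker(bits)` stands in for the interference checks (4)-(5)
    and returns the smallest violating 1-based digit index, or 0 if none."""
    feasible = []
    temp_i = 0
    while temp_i < 2 ** k:
        bits = [(temp_i >> b) & 1 for b in range(k)]   # digit 1 = lowest bit
        b_last = checker(bits)
        if b_last == 0:
            feasible.append(tuple(bits))
            temp_i += 1
        else:
            temp_i += 2 ** (b_last - 1)   # carry past the violating block
    return feasible

# Toy rule: digits 1 and 2 may never both be set (their links "interfere").
def checker(bits):
    return 1 if bits[0] and bits[1] else 0

sols = enumerate_assignments(3, checker)   # 6 of the 8 assignments survive
```

Adding $2^{b\_last-1}$ to a value whose digit $b\_last$ is set triggers a binary carry, so the scan lands directly on the next candidate that clears the offending digit instead of testing every intermediate value.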
5 Simulation results

In this section we present simulation results for the proposed RSAA algorithm and compare it with the SF algorithm and the enumeration algorithm. Since the enumeration algorithm obtains optimal solutions, we label its results as the optimal solutions in the following figures. Our simulation scenario is set in rural and mountainous areas where the TV broadcast spectrum is underutilized and the SUs can transparently use this spectrum without generating harmful interference to PUs. The simulation parameters are shown in Table 2.

Table 2: Simulation parameters

Parameter | Value
Topology area | 1000 × 1000 m²
Distribution of SUs' locations | uniform
Channel propagation model | two-ray ground reflection
Transmission range of SUs | 250 m
Interference range of SUs | 300 m
Bandwidth of PUs | 1 MHz
Noise power | −140 dBW
Transmit power of SUs | 2 W
Number of PUs | 1
Location of the PU | (0, 0)
Length of each slot period | 1 s
Simulation runs | 200

5.1 SUs' throughput vs. the PU's outage probability

Fig. 2 shows the outage probability of the PU when the SUs do not take the PU's threshold into consideration. We count one PU outage whenever the PU detects that the SUs' interference power exceeds $I_{NT}$. In Fig. 2 the slot period is set to 3 and the ratio of interference range to transmission range is 6/5. We use the CPLEX software to obtain the optimal solution of the MILP. From Fig. 2 we find that the PU's outage probability decreases as the PU's threshold increases. The outage probability also decreases as the node density $N_m$ in the network decreases: the interference power received by the PU is the sum of all SUs' transmit powers, and more nodes in the network mean more interference. As Fig. 2 shows, the outage probability is very high, and unacceptable, if the SUs do not carefully select paths and slots.
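The outage statistic plotted in Fig. 2 is simply the fraction of simulation runs in which the aggregate SU interference at the PU exceeds the threshold. A toy estimator (the interference samples below are synthetic, not the paper's data):

```python
def outage_probability(interference_samples_w, threshold_w):
    """Fraction (in %) of simulation runs whose total SU interference power
    at the PU exceeds the PU threshold I_NT (the statistic of Fig. 2)."""
    outages = sum(1 for p in interference_samples_w if p > threshold_w)
    return 100.0 * outages / len(interference_samples_w)

# Synthetic per-run interference powers (W) against a 1e-12 W threshold.
samples = [5e-13, 2e-12, 8e-13, 3e-12]
p_out = outage_probability(samples, 1e-12)   # 2 of 4 runs exceed -> 50.0 %
```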
In networks with high node density it is therefore essential to protect the PU from the SUs' interference at the network layer.

[Figure 2: The outage probability of the PU versus the PU's threshold (dBW), for node densities Nm = 8, 16, 24.]

Fig. 3 compares the SUs' throughput in two different scenarios: in one scenario the SUs avoid harmful interference to the PUs, and in the other the SUs neglect the PU's threshold. From Fig. 3 we see that the SUs' throughput is very sensitive to the PU's threshold when the SUs take the interference to the PU into consideration, and that the throughput obtained by neglecting the PU's threshold is an upper bound on the throughput obtained while respecting it. Moreover, Fig. 3 shows that when the PU's threshold is low enough, it becomes the bottleneck for the SUs' throughput. Figures 2 and 3 demonstrate the performance tradeoff between PUs and SUs in spectrum-sharing CRNs, so our model can offer a valuable reference for the design of multi-hop CRNs.

[Figure 3: SUs' throughput versus the PU's threshold (dBW), with and without PU protection, for Nm = 8, 16, 24.]

5.2 The effect of node density

Fig. 4 compares the SUs' throughput under the RSAA algorithm, the SF algorithm and the enumeration algorithm. In this simulation the slot period is 3, the PU's threshold is −90 dBW and the ratio of interference range to transmission range is 6/5. From Fig. 4 we find that RSAA outperforms SF and obtains on average 97% of the optimal SU throughput. In particular, when the node density is low, RSAA obtains 99% of the optimal throughput, while SF achieves only 55%.
This is because when the node density is low, the optimal path almost surely lies in the search set $X_T$, as shown by Theorem 2. Moreover, Fig. 4 shows that the SUs' throughput decreases as the node density increases, which agrees with the results in [16]: when the node density increases, the SUs need more slots, and hence more delay, to avoid conflicts among SUs and interference to the PU.

[Figure 4: SUs' throughput at different node densities (nodes/km²), for the optimal solution, RSAA and SF.]

5.3 The effect of time slots and conflicts

Fig. 5 compares the SU throughput obtained by the three algorithms as the number of slot periods in the network increases. In this simulation the node density is set to 13 nodes per square kilometer; the PU's threshold $I_{NT}$ and the ratio $r$ are the same as in Fig. 4. From Fig. 5 we see that RSAA's approximation to the optimal solution is not affected by the number of slots: for any number of slots, RSAA obtains 98% of the optimal SU throughput. We also find that, when we fix the length of each slot at 1 second and increase the number of slots in the network, the SUs' throughput increases dramatically while the number of slot periods is small. If we increase the slot periods from 3 to 5, the SUs' throughput decreases, because the increase in data cannot offset the increase in delay; but at a slot period of 6 the throughput of the SUs increases again, as the increase in data outweighs the increase in delay. The fluctuation in Fig. 5 shows the complex relationship between delay and throughput in multi-hop wireless networks.

[Figure 5: SUs' throughput for different slot periods (s), for the optimal solution, RSAA and SF.]

Fig.
6 shows how the SUs' throughput varies under the three algorithms as the ratio of interference distance to transmission distance increases. In this simulation the node density is set to 16 nodes per square kilometer and the PU's threshold is the same as in Fig. 4; the slot period is 4. From Fig. 6 we find that RSAA obtains on average 99% of the optimal throughput, compared with SF's 34%. Moreover, Fig. 6 shows that as the ratio increases, the solutions of RSAA come much closer to the optimal results; once the ratio exceeds 2, the two solutions nearly overlap, which matches our performance analysis above.

[Figure 6: SUs' throughput for different ratios of interference range to transmission range, for the optimal solutions, RSAA and SF.]

6 Conclusions

In spectrum-sharing multi-hop CRNs, SUs can utilize the licensed spectrum band transparently by carefully selecting paths and slots. In this paper we first formulated a MILP model to describe the joint routing and time-slot assignment problem, and then developed the RSAA algorithm to solve this NP-hard problem. Theoretical analysis and simulation results demonstrate that the RSAA algorithm obtains near-optimal throughput with polynomial complexity, so it can be widely used in CRNs.

Acknowledgment

This work was supported in part by the National Natural Science Foundation of China (60832007), the National Hi-Tech Research and Development Plan of China (2009AA011801), and the National Science and Technology Major Project (2010ZX03005-003).

Bibliography

[1] FCC, ET Docket No. 03-222, Notice of Proposed Rule Making and Order, December 2003.

[2] J. Mitola III and G. Q. Maguire Jr., "Cognitive radio: making software radios more personal," IEEE Personal Commun., pp. 13-18, Aug. 1999.

[3] I.
Akyildiz, W. Lee, M. Vuran, and S. Mohanty, "NeXt generation/dynamic spectrum access/cognitive radio wireless networks: a survey," Comput. Netw. J. (Elsevier), vol. 50, no. 13, pp. 2127-2159, Sept. 2006.

[4] T. Yucek, H. Arslan, "A survey of spectrum sensing algorithms for cognitive radio applications," IEEE Communications Surveys & Tutorials, vol. 11, no. 1, pp. 116-130, First Quarter 2009.

[5] H. Wang, H. Qin, L. Zhu, "A survey on MAC protocols for opportunistic spectrum access in cognitive radio networks," 2008 International Conference on Computer Science and Software Engineering, vol. 1, pp. 214-218, 12-14 Dec. 2008.

[6] Y. Wang, P. Ren, and G. Wu, "A throughput-aimed MAC protocol with QoS provision for cognitive ad hoc networks," IEICE Trans. Commun., vol. E93-B, no. 6, pp. 1426-1429, Jun. 2010.

[7] H. Khalife, N. Malouch, S. Fdida, "Multihop cognitive radio networks: to route or not to route," IEEE Network, vol. 23, no. 4, pp. 20-25, July-August 2009.

[8] M. Cesana, F. Cuomo, E. Ekici, "Routing in cognitive radio networks: challenges and solutions," Ad Hoc Netw., vol. 9, no. 3, pp. 228-248, May 2011.

[9] X. Zhou, L. Lin, J. Wang and X. Zhang, "Cross-layer routing design in cognitive radio networks by colored multigraph model," Wireless Personal Communications, vol. 49, no. 1, pp. 123-131, April 2009.

[10] Y. T. Hou, Y. Shi, H. D. Sherali, "Optimal spectrum sharing for multi-hop software defined radio networks," INFOCOM 2007, 26th IEEE International Conference on Computer Communications, pp. 1-9, 6-12 May 2007.

[11] Y. T. Hou, Y. Shi, H. D. Sherali, "Spectrum sharing for multi-hop networking with cognitive radios," IEEE Journal on Selected Areas in Communications, vol. 26, no. 1, pp. 146-155, Jan. 2008.

[12] I. Filippini, E. Ekici, M. Cesana, "Minimum maintenance cost routing in cognitive radio networks," IEEE 6th International Conference on Mobile Adhoc and Sensor Systems (MASS '09), pp. 284-293, 12-15 Oct.
2009.

[13] K. R. Chowdhury, I. F. Akyildiz, "CRP: a routing protocol for cognitive radio ad hoc networks," IEEE Journal on Selected Areas in Communications, vol. 29, no. 4, pp. 794-804, April 2011.

[14] L. Ding, T. Melodia, S. N. Batalama, J. D. Matyjas, M. J. Medley, "Cross-layer routing and dynamic spectrum allocation in cognitive radio ad hoc networks," IEEE Transactions on Vehicular Technology, vol. 59, no. 4, pp. 1969-1979, May 2010.

[15] M. Xie, W. Zhang, K. K. Wong, "A geometric approach to improve spectrum efficiency for cognitive relay networks," IEEE Transactions on Wireless Communications, vol. 9, no. 1, pp. 268-281, January 2010.

[16] Z. Yuan, J. B. Song, Z. Han, "Interference minimization routing and scheduling in cognitive radio wireless mesh networks," 2010 IEEE Wireless Communications and Networking Conference (WCNC), pp. 1-6, 18-21 April 2010.

[17] P. Gupta, P. R. Kumar, "The capacity of wireless networks," IEEE Transactions on Information Theory, vol. 46, no. 2, pp. 388-404, Mar. 2000.

[18] K. Jain, J. Padhye, V. N. Padmanabhan, L. Qiu, "Impact of interference on multi-hop wireless network performance," Wireless Networks, vol. 11, no. 4, pp. 471-487, July 2005.

[19] H. D. Sherali, W. P. Adams, P. J. Driscoll, "Exploiting special structures in constructing a hierarchy of relaxations for 0-1 mixed integer problems," Operations Research, vol. 46, no. 3, pp. 396-405, May 1998.

[20] S. Gao, Graph Theory and Network Flow Theory, 1st ed., Beijing: Higher Education Press, 2009, pp. 307-314.

[21] P. Erdős and A. Rényi, "On the evolution of random graphs," Publ. Math. Inst. Hung. Acad. Sci., 5, 1960, pp. 17-61.

International Journal of Computers Communications & Control
ISSN 1841-9836, 11(2):282-291, April 2016.

The Maximum Flows in Planar Dynamic Networks

C. Schiopu, E. Ciurea

Camelia Schiopu*
1.
Transilvania University of Brasov
Romania, 500091 Braşov, Iuliu Maniu, 50
*Corresponding author: camelia.s@unitbv.ro

Eleonor Ciurea
Transilvania University of Brasov
Romania, 500091 Braşov, Iuliu Maniu, 50
e.ciurea@unitbv.ro

Abstract: A nontrivial extension of the maximal static flow problem is the maximal dynamic flow model, in which the transit time to traverse an arc is taken into consideration. If the network parameters (capacities, arc traversal times, and so on) are constant over time, the dynamic flow problem is said to be stationary. Research on flows in planar static networks is motivated by the fact that more efficient algorithms can be developed by exploiting the planar structure of the graph. This article states and solves the maximum flow problem in directed (1,n) planar dynamic networks in the stationary case.

Keywords: network flow, planar network, dynamic network, maximum flow.

1 Introduction

The static network flow model bridges several diverse and seemingly unrelated areas of combinatorial optimization. In scientific writing, flow in a network often refers to the flow of electricity, phone calls, email messages, commodities being transported across truck routes, or other such kinds of flow. Many efficient algorithms have been developed to solve the maximum flow problem in static networks [1]. Planar static networks also arise in practical contexts such as VLSI design and communication networks, and hence it is of interest to find fast flow algorithms for this class of graphs. The computation of a maximum flow in a planar static network has been investigated by many researchers, starting with the work of Ford and Fulkerson [5], who developed an O(n²) time algorithm for (1,n) networks when the source node 1 and the sink node n are on the same face. This algorithm was later improved to O(n log n) time by Itai and Shiloach [8].
By introducing the concept of potentials, Hassin [6] gave an algorithm that runs in O(n log^{0.5} n) time using Frederickson's shortest path algorithm [4]. Itai and Shiloach [8] also developed an algorithm to find a maximum flow in an undirected planar network when the source node and the sink node are not on the same face. For faster maximum flow algorithms in planar (but not necessarily (1,n) planar) undirected and directed static networks, see Hassin and Johnson [7] and Johnson and Venkatesan [9]. Khuller and Naor [10] present flows in planar static networks with node capacities. In some other applications, however, time is an essential ingredient [1]: to account properly for the evolution of the underlying system over time, we need to use dynamic network flow models. For the dynamic network flow problem see [1], [2], [3]. In this paper we present the maximum flow problem in directed (1,n) planar dynamic networks, for the case when the planar dynamic network is stationary. Further on, in Section 2 the maximum flow in directed (1,n) planar static networks is presented. In Section 3 some basic dynamic network notations and results are presented, while Section 4 presents the method for solving the maximum flow in directed (1,n) planar dynamic networks. The conclusions are presented in Section 5 and an example is given in Section 6.

2 The maximum flow in directed (1,n) planar static networks

Research on flows in planar static networks is motivated by the fact that more efficient algorithms can be developed by exploiting the planar structure of the digraph.

Definition 1. A digraph G = (N, A) is said to be planar if we can draw it in a two-dimensional plane so that no two arcs intersect each other.

Researchers have developed very efficient (in fact, linear time) algorithms for testing the planarity of a digraph.

Definition 2.
Let G = (N, A) be a planar digraph. A face of G is a region of the plane bounded by arcs that satisfies the condition that any two points in the region can be connected by a continuous curve that meets no nodes and no arcs. The boundary of a face x is the set of all arcs that enclose it. Faces x and y are said to be adjacent if their boundaries contain a common arc.

The planar digraph G has one unbounded face. Recall two well-known properties of planar digraphs:

- if a connected planar digraph has n nodes, m arcs and q faces, then q = m − n + 2;
- if a planar digraph has n nodes and m arcs, then m < 3n.

Our discussion in this paper applies to a special class of planar digraphs known as (1,n) planar digraphs (the source node 1 and the sink node n lie on the boundary of the unbounded face). Let G = (N, A, u) be a static network with the node set N = {1, ..., i, ..., j, ..., n}, the arc set A = {a₁, ..., a_k, ..., a_m}, a_k = (i,j), and the upper bound (capacity) function u: A → ℝ₊, where ℝ is the set of real numbers. To define the maximal static flow problem, we distinguish two special nodes in the static network G = (N, A, u): a source node 1 and a sink node n. A static flow is a function f: A → ℝ₊ satisfying the conditions

$$\sum_{j} f(i,j)-\sum_{k} f(k,i)=\begin{cases}v, & \text{if } i=1 & (1a)\\ 0, & \text{if } i\ne 1,n & (1b)\\ -v, & \text{if } i=n & (1c)\end{cases}$$

$$0\le f(i,j)\le u(i,j),\quad (i,j)\in A \qquad (2)$$

for some v ≥ 0. We refer to v as the value of the static flow f. The maximum flow problem is to determine a flow f that maximizes v. A cut is a partition of the node set N into two subsets S and S̄ = N − S; we denote this cut by [S, S̄]. We refer to an arc (i,j) with i ∈ S and j ∈ S̄ as a forward arc of the cut, and to an arc (j,i) with j ∈ S̄ and i ∈ S as a backward arc of the cut. Let (S, S̄) denote the set of forward arcs of the cut and (S̄, S) the set of backward arcs; the arc set of the cut is [S, S̄] = (S, S̄) ∪ (S̄, S). We refer to a cut [S, S̄] as a 1−n cut if 1 ∈ S and n ∈ S̄.
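Conditions (1a)–(1c) can be checked mechanically by accumulating each node's net outflow. A small sketch (the example network is invented for illustration):

```python
def flow_value(arcs_flow, source, sink, nodes):
    """Check conservation (1a)-(1c) and return the flow value v.
    `arcs_flow` maps each arc (i, j) to its flow f(i, j)."""
    excess = {i: 0.0 for i in nodes}
    for (i, j), f in arcs_flow.items():
        excess[i] += f   # outgoing flow of i
        excess[j] -= f   # incoming flow of j
    for i in nodes:
        if i not in (source, sink) and excess[i] != 0.0:
            raise ValueError(f"conservation (1b) violated at node {i}")
    assert excess[source] == -excess[sink]   # (1a) and (1c) agree on v
    return excess[source]

# Toy network 1 -> 2 -> 3 plus a direct arc 1 -> 3; the value is v = 5.
v = flow_value({(1, 2): 2.0, (2, 3): 2.0, (1, 3): 3.0}, 1, 3, [1, 2, 3])
```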
For the maximum flow problem, we define the capacity of a 1−n cut [S, S̄] as

$$c[S,\bar S]=\sum_{(i,j)\in(S,\bar S)} u(i,j) \qquad (3)$$

We refer to a 1−n cut whose capacity is minimum among all 1−n cuts as a minimum cut. Recall the max-flow min-cut theorem.

Theorem 3. The maximum value of a flow from the source node 1 to the sink node n in the network G equals the capacity of a minimum 1−n cut.

Many efficient algorithms have been developed to solve the maximum flow problem in static networks [1]. Next we present the maximum flow problem in directed (1,n) planar static networks. First, we define the dual directed static network, denoted G′ = (N′, A′, c′). We add the arc (n,1) with u(n,1) = 0, which divides the unbounded face into two faces: a new bounded face and a new unbounded face. In this case we have n′ = q + 1 faces, with q = m − n + 2. We then place a node x′ inside each face x of the network G, obtaining N′ = {1′, ..., x′, ..., y′, ..., n′}. Let 1′ and n′, respectively, denote the nodes of the dual network G′ corresponding to the new bounded face and the new unbounded face. Each arc (i,j) ∈ A lies on the boundary of two faces x and y; corresponding to this arc, the network G′ contains two oppositely directed arcs (x′, y′) and (y′, x′). If arc (i,j) is a clockwise arc of face x, we define the costs c′(x′, y′) = u(i,j) and c′(y′, x′) = 0; we define the arc costs in the opposite manner if arc (i,j) is a counterclockwise arc of face x. The network G′ contains the arcs (1′, n′) and (n′, 1′), which we delete from the network. We have A′ = {(x′, y′), (y′, x′) | x′, y′ ∈ N′, (x′, y′) and (y′, x′) correspond to (i,j) ∈ A}. There is a one-to-one correspondence between 1−n cuts in the network G and paths from node 1′ to node n′ in the network G′; moreover, the capacity of a cut equals the cost of the corresponding path.
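Equation (3) counts forward arcs only; backward arcs of the cut contribute nothing. A minimal sketch (the network below is a toy example):

```python
def cut_capacity(arcs_u, s_side):
    """Eq. (3): capacity of the 1-n cut [S, S-bar] is the sum of u(i, j)
    over forward arcs only (i in S, j outside S)."""
    return sum(u for (i, j), u in arcs_u.items()
               if i in s_side and j not in s_side)

# Toy network with capacities; take the cut S = {1, 2}.
u = {(1, 2): 4, (1, 3): 2, (2, 3): 3, (3, 2): 5}
cap = cut_capacity(u, {1, 2})   # forward arcs (1,3) and (2,3): 2 + 3 = 5
```

Note that the backward arc (3, 2) with capacity 5 is correctly ignored, and the internal arc (1, 2) does not cross the cut at all.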
Consequently, we can obtain a minimum 1−n cut [S, S̄] and its capacity c[S, S̄] in the network G by determining a shortest path P′, and its cost c′(P′), from node 1′ to node n′ in the network G′. The shortest path problem in G′ can be solved with Dijkstra's algorithm [1]. We now present an algorithm for finding a maximum flow in a directed (1, n) planar static network G = (N, A, u). Let d′(x′) denote the shortest path distance from node 1′ to node x′ in the dual directed static network G′ = (N′, A′, c′). The algorithm for maximum flow in a directed (1, n) planar static network (AMFDPSN) is presented in Figure 1 [1].

1: AMFDPSN;
2: begin
3:   compute the network G′;
4:   DIJKSTRA(G′, d′);
5:   for (i, j) ∈ A do
6:     f(i, j) := d′(y′) − d′(x′);
7:   end for
8: end.

Figure 1: Algorithm for maximum flow in a directed (1, n) planar static network

Theorem 4. The AMFDPSN determines a maximum flow in the network G.

Theorem 5. The AMFDPSN determines a maximum flow in O(n²) time.

Using Frederickson's algorithm (see [4]), the AMFDPSN determines a maximum flow in O(n^1.5) time.

The maximum flows in planar dynamic networks

3 The maximum flows in dynamic networks

Let G = (N, A, u) be a static network with node set N, arc set A, upper bound function u, source node 1 and sink node n. Let N denote the set of natural numbers and let H = {0, 1, ..., T} be the set of time periods, where T ∈ N is a finite time horizon. We introduce the transit time function h: A × H → N and the time capacity function u_h: A × H → R₊, where h(i, j; t) represents the transit time of arc (i, j) at time t ∈ H and u_h(i, j; t) represents the capacity (upper bound) of arc (i, j) at time t ∈ H.
The maximum dynamic flow problem for T time periods is to determine a flow function f_h: A × H → N satisfying the following conditions in the dynamic network G_h = (N, A, h, u_h):

∑_{t=0}^{T} ( ∑_j f_h(i, j; t) − ∑_k ∑_τ f_h(k, i; τ) ) = v_h, if i = 1   (4a)
∑_j f_h(i, j; t) − ∑_k ∑_τ f_h(k, i; τ) = 0, if i ≠ 1, n, t ∈ H   (4b)
∑_{t=0}^{T} ( ∑_j f_h(i, j; t) − ∑_k ∑_τ f_h(k, i; τ) ) = −v_h, if i = n   (4c)
0 ≤ f_h(i, j; t) ≤ u_h(i, j; t), for all (i, j) ∈ A and all t ∈ H   (5)
max v_h   (6)

where τ = t − h(k, i; τ), v_h = ∑_{t=0}^{T} v(t), v(t) is the flow value at time t, and f_h(i, j; t) = 0 for (i, j) ∈ A, t ∈ {T − h(i, j; t) + 1, ..., T}. In other words, a dynamic flow f_h from 1 to n is any flow in which no more than u_h(i, j; t) flow units start from node i at time t and arrive at node j at time t + h(i, j; t), for all arcs (i, j) and all t. Note that in a dynamic flow, units may depart from the source at times 0, 1, ..., T′, T′ < T. A maximum dynamic flow for T time periods from 1 to n is any dynamic flow from 1 to n in which the maximum possible number of flow units arrives at the sink node n during the first T time periods. We will show how to transform the maximum dynamic flow problem in the dynamic network G_h = (N, A, h, u_h) into a static flow problem on a static network G′_h = (N′_h, A′_h, u′_h), called the reduced time-expanded network. For a given dynamic network G_h = (N, A, h, u_h), we form the time-expanded network G_h = (N_h, A_h, u_h) as follows. We make T + 1 copies i_t, t = 0, 1, ..., T, of each node i of G_h; node i_t represents node i at time t. For each arc (i, j) of G_h, the time-expanded network contains the arcs (i_t, j_θ), θ = t + h(i, j; t), t = 0, 1, ..., T − h(i, j; t), with capacity u_h(i_t, j_θ) = u_h(i, j; t). The arc (i_t, j_θ) represents the potential movement of a commodity from node i to node j in time h(i, j; t). The number of nodes of the time-expanded network is n(T + 1), and the number of arcs is bounded by m(T + 1) − ∑_A h̄(i, j), where h̄(i, j) = min{h(i, j; 0), ..., h(i, j; T)}.
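The construction of the time-expanded arcs can be sketched as follows; this is an illustrative fragment (function and parameter names are ours, not the paper's), with transit times and capacities supplied as callables over (i, j, t).

```python
def time_expanded_arcs(arcs, h, u, T):
    """Build the arcs of the time-expanded network.

    arcs: list of (i, j) pairs of the dynamic network;
    h(i, j, t): transit time; u(i, j, t): capacity; T: time horizon.
    Returns {((i, t), (j, theta)): capacity} with theta = t + h(i, j, t).
    """
    expanded = {}
    for (i, j) in arcs:
        for t in range(T + 1):
            theta = t + h(i, j, t)
            if theta <= T:  # the arc copy must arrive within the horizon
                expanded[((i, t), (j, theta))] = u(i, j, t)
    return expanded
```

On the stationary example of Section 6 (five arcs, T = 4), this yields m(T + 1) − ∑ h̄(i, j) = 25 − 8 = 17 arc copies, matching the bound stated above with equality, since in the stationary case no copies are lost to time-varying transit times.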
It is easy to see that any dynamic flow in the dynamic network G_h is equivalent to a static flow in the time-expanded static network G_h from the source nodes 1_0, 1_1, ..., 1_T to the sink nodes n_0, n_1, ..., n_T, and vice versa. We can further reduce this multiple-source, multiple-sink problem to a single-source, single-sink problem by introducing a supersource node 1* and a supersink node n*, constructing the time superexpanded network G*_h = (N*_h, A*_h, u*_h), where N*_h = N_h ∪ {1*, n*}, A*_h = A_h ∪ {(1*, 1_t) | t = 0, 1, ..., T} ∪ {(n_t, n*) | t = 0, 1, ..., T}, u*_h(i_t, j_θ) = u_h(i_t, j_θ) for all (i_t, j_θ) ∈ A_h, and u*_h(1*, 1_t) = u*_h(n_t, n*) = ∞, t = 0, 1, ..., T. Now we construct the reduced time-expanded network G′_h = (N′_h, A′_h, u′_h) as follows. We define the function h*: A*_h → N by h*(1*, 1_t) = h*(n_t, n*) = 0, t = 0, 1, ..., T, and h*(i_t, j_θ) = h(i, j; t), t = 0, 1, ..., T − h(i, j; t). Let d*(1*, i_t) be the length of the shortest path from the source node 1* to the node i_t in the network G*_h, and d*(i_t, n*) the length of the shortest path from node i_t to the sink node n*, both with respect to h*. The values d*(1*, i_t) and d*(i_t, n*) for all i_t ∈ N*_h are computed with the usual shortest path algorithms. We set N′_h = {1*, n*} ∪ {i_t | i_t ∈ N_h, d*(1*, i_t) + d*(i_t, n*) ≤ T}, A′_h = {(1*, 1_t) | d*(1_t, n*) ≤ T} ∪ {(n_t, n*) | d*(1*, n_t) ≤ T} ∪ {(i_t, j_θ) | (i_t, j_θ) ∈ A_h, d*(1*, i_t) + h*(i_t, j_θ) + d*(j_θ, n*) ≤ T}, and u′_h is the restriction of u*_h to A′_h. In the network G′_h we rename the nodes 1* and n* as 1′ and n′, respectively. It is easy to see that the network G′_h is always a partial subnetwork of G*_h. Since an item released from a node at a specific time cannot return to that node at the same or an earlier time, the networks G_h, G*_h, G′_h contain no circuits and are therefore always acyclic.
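The pruning step that produces the reduced network can be sketched as a pair of filters; a minimal illustration, assuming the shortest-path distances with respect to h* have already been computed (the function and parameter names are ours).

```python
def reduce_time_expanded(nodes, arcs, h_star, d_src, d_snk, T):
    """Filter the superexpanded network down to the reduced network G'_h.

    Keep node i_t iff d*(1*, i_t) + d*(i_t, n*) <= T, and keep arc
    (i_t, j_th) iff d*(1*, i_t) + h*(i_t, j_th) + d*(j_th, n*) <= T.
    d_src, d_snk: dicts of precomputed shortest-path distances w.r.t. h*.
    """
    keep_nodes = [v for v in nodes if d_src[v] + d_snk[v] <= T]
    keep_arcs = [(v, w) for (v, w) in arcs
                 if d_src[v] + h_star[(v, w)] + d_snk[w] <= T]
    return keep_nodes, keep_arcs
```

Nodes and arcs that cannot lie on any 1*–n* path of length at most T carry no flow within the horizon, which is why discarding them leaves the maximum dynamic flow unchanged.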
In the most general dynamic model there is also a waiting time h(i) = 1 at each node i, and a parameter u_h(i; t) defined as the capacity of node i, representing the maximum amount of flow that can wait at node i from time t to t + 1. This most general dynamic model is not discussed in this paper. The maximum dynamic flow problem for T time periods in the dynamic network G_h, formulated by conditions (4), (5), (6), is equivalent to the following maximum static flow problem in the static network G′_h:

∑_{j_θ} f′_h(i_t, j_θ) − ∑_{k_τ′} f′_h(k_τ′, i_t) = v′_h if i_t = 1′; 0 for all i_t ≠ 1′, n′; −v′_h if i_t = n′   (7)
0 ≤ f′_h(i_t, j_θ) ≤ u′_h(i_t, j_θ), for all (i_t, j_θ) ∈ A′_h   (8)
max v′_h   (9)

where, by convention, i_t = 1′ for t = −1 and i_t = n′ for t = T + 1. It is easy to see that the network G′_h is, in general, not planar. A dynamic flow problem is said to be stationary if the network parameters (capacities, arc traversal times, and so on) are constant over time (c: A → R₊, h: A → N, and so on). In the stationary case, the construction of the reduced time-expanded static network G′_h = (N′_h, A′_h, u′_h) is not required for solving the maximum dynamic flow problem, for any T. A maximum dynamic flow in the stationary case can be generated from a maximum value and minimum time flow f in the static network G = (N, A, c, u), where c(i, j) = h(i, j) is the cost of arc (i, j) ∈ A. The algorithm for the stationary maximum dynamic flow (ASMDF) problem is presented in Figure 2 [5].

1: ASMDF;
2: begin
3:   AMVMCF(G, f);
4:   ADFEF(f, r(P_1), ..., r(P_k));
5:   ARF(r(P_1), ..., r(P_k));
6: end.

Figure 2: Algorithm for the stationary maximum dynamic flow.

The procedure AMVMCF performs the algorithm for a maximum value and minimum cost flow f in the network G. For the statements below we suppose that the variant of Klein's algorithm is used (the minimum mean cycle canceling algorithm, see [1]). This algorithm has complexity O(n²m³ log n).
The procedure ADFEF performs the algorithm for the decomposition of the flow f into elementary path flows r(P_1), ..., r(P_k); it is necessary that c(P_i) ≤ T. This algorithm has complexity O(m²). The procedure ARF performs the algorithm that sends r(P_i) flow units, i = 1, ..., k, from the source node 1 starting at time period 0, and repeats this after each time period as long as there is enough time left in the horizon for the flow along the path to arrive at the sink node n. This algorithm has complexity O(kT). Hence, the algorithm for the stationary maximum dynamic flow has complexity O(n²m³ log n) (we consider that kT ≤ n²m³ log n). The flow obtained with ASMDF is called a temporally repeated flow, for the obvious reason that it consists of repeated shipments along the same flow paths from 1 to n. The maximum value of a temporally repeated flow obtained with ASMDF is

v_h = (T + 1)v − ∑_A h(i, j)f(i, j)   (10)

where v is the maximum value of the flow f obtained with AMVMCF.

4 The maximum flows in planar dynamic networks

In this section we consider maximum flows in planar dynamic networks in the stationary case. Hence, we use the ASMDF presented in Section 3, and the network G = (N, A, c, u) is planar. The minimum mean cycle canceling algorithm is a special case of Klein's algorithm (the cycle canceling algorithm, see [1]). Recall that the mean cost of a directed cycle (circuit) P̊ is (∑(c(i, j) | (i, j) ∈ P̊)) / |P̊|, and that a minimum mean cycle is a cycle with the smallest mean cost in the network G. It is known that a dynamic programming algorithm finds the minimum mean cycle in O(nm) time, see [1]. In this case, the minimum mean cycle canceling algorithm starts with a maximum flow f in the network G; this flow is computed with the algorithm presented in Section 2. At every iteration, the minimum mean cycle canceling algorithm identifies a minimum mean cycle P̊ in the residual network G̃.
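Formula (10) can be evaluated directly from the path decomposition produced by ADFEF, since by flow decomposition the sum of h(i, j)f(i, j) over arcs equals the sum of h(P)r(P) over paths. A minimal sketch (the function name is ours):

```python
def temporally_repeated_value(T, path_flows):
    """Value of a temporally repeated flow over horizon T, formula (10).

    path_flows: list of (h_P, r_P) pairs, where h_P is the total transit
    time of path P and r_P the flow sent along P, from the decomposition
    of a maximum value, minimum cost static flow.
    """
    v = sum(r for _, r in path_flows)  # static flow value
    return (T + 1) * v - sum(h * r for h, r in path_flows)
```

On the example of Section 6 (T = 4, paths with transit times 3, 3, 4 and flows 2, 1, 1), this gives (4 + 1) · 4 − (6 + 3 + 4) = 7, in agreement with the value computed there.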
If the mean cost of the cycle P̊ is negative, the algorithm augments the maximum possible flow along P̊, updates G̃, and repeats this process. If the mean cost of P̊ is nonnegative, G̃ contains no negative cycle, f is a maximum value and minimum cost flow, and the algorithm terminates. This algorithm is surprisingly simple to state.

Theorem 6. The ASMDF correctly computes the maximum flow in a planar stationary dynamic network.

Proof: The ASMDF correctly computes the maximum flow in a general stationary dynamic network; obviously the algorithm is then also correct for a planar network. □

Theorem 7. The ASMDF applied to a planar network has complexity O(n⁵ log n).

Proof: The ASMDF applied to a general network has complexity O(n²m³ log n). In a planar network we have m = O(n); hence the ASMDF applied to a planar network has complexity O(n⁵ log n). □

5 Conclusions

The computation of a maximum flow in a general network is an important and well-studied problem, both in computer science and in operations research. Many efficient algorithms have been developed to solve this problem, see, e.g., [1]. Research on maximum flow in planar networks is motivated by the fact that more efficient algorithms can be developed by exploiting the planar structure of the graph. The planar flow algorithms are not only extremely efficient, but also very elegant. Planar networks also arise in practical contexts such as VLSI design and communication networks, and hence it is of interest to find fast flow algorithms for this class of networks. In this paper, we have studied a generalization of the maximum flow in directed (1, n) planar networks that includes transit time features encountered in many practical situations. Our model assumes that all attributes of the problem, including arc capacities and transit times, do not change over time.
In this case we have used an efficient procedure to find the maximum value and minimum cost flow in directed (1, n) planar static networks G = (N, A, c = h, u), and then developed a set of temporally repeated flows, with the optimal flow decomposed into a set of path flows. We remark that the problem of maximum flow in (1, n) planar dynamic networks has not been studied before. We have also introduced the notion of the reduced time-expanded network G′_h = (N′_h, A′_h, u′_h) and shown how to construct this network. Future research directions include the following problems: (1) the maximum flow in directed (1, n) planar dynamic networks where the transit times and the capacities of arcs are all time-varying; (2) the maximum flow in directed (1, n) planar dynamic networks with lower bounds, in the stationary case and in the nonstationary case. These are more practical features in many real-world problems where we wish to control the speed of flows on different arcs.

6 Example

The planar dynamic network is presented in Figure 3(a), with the time horizon set to T = 4; therefore H = {0, 1, 2, 3, 4}. The transit times h(i, j) and the upper bounds (capacities) u(i, j) of all arcs are indicated in Figure 3(b):

(i, j):   (1, 2)  (1, 3)  (2, 3)  (2, 4)  (3, 4)
h(i, j):    1       3       1       2       1
u(i, j):    3       2       1       2       2

Figure 3: The planar dynamic network

Figure 4 shows the (1′, 4′) dual network G′ = (N′, A′, c′) corresponding to the network G.

Figure 4: Dual network G′ corresponding to the network G

The flow obtained with AMFDPSN is presented in Figure 5(a). With the minimum mean cycle canceling algorithm we obtain the maximum flow of minimum cost, presented in Figure 5(b).

Figure 5: (a) maximum flow; (b) maximum flow of minimum cost

Applying the procedure ADFEF we obtain the following path flows: P_1 = (1, 2, 4), h(P_1) = 3, r(P_1) = 2; P_2 = (1, 2, 3, 4), h(P_2) = 3, r(P_2) = 1; P_3 = (1, 3, 4), h(P_3) = 4, r(P_3) = 1.
With the procedure ARF we obtain the maximum dynamic flow, shown in the network G′_h = (N′_h, A′_h, u′_h) in Figure 6.

Figure 6: The maximum dynamic flow

Applying formula (10) we have v_h = (4 + 1) · 4 − (3 + 3 + 1 + 4 + 2) = 7. For S′_h = {1′, 1_0, 1_1, 2_2, 3_3} and S̄′_h = {2_1, 3_2, 4_3, 4_4, 4′} we have [S′_h, S̄′_h] = (S′_h, S̄′_h) = {(1_0, 2_1), (2_2, 4_4), (3_3, 4_4)} and v′_h = v_h = f′_h(S′_h, S̄′_h) = u′_h(S′_h, S̄′_h) = 3 + 2 + 2 = 7.

Bibliography

[1] Ahuja, R.; Magnanti, T.; Orlin, J. (1993); Network Flows. Theory, Algorithms and Applications, Prentice Hall, Inc., Englewood Cliffs, New Jersey.
[2] Cai, X.; Sha, D.; Wong, C. (2007); Time-Varying Network Optimization, Springer.
[3] Ciurea, E. (1984); Les problèmes des flots dynamiques, Cahiers du CERO, 26(1-2): 3-9.
[4] Frederickson, G. (1987); Fast algorithms for shortest paths in planar graphs, with applications, SIAM Journal on Computing, 16: 1004-1022.
[5] Ford, L.; Fulkerson, D. (1962); Flows in Networks, Princeton University Press, Princeton, N.J.
[6] Hassin, R. (1981); Maximum flows in (s,t) planar networks, Information Processing Letters, 13: 107.
[7] Hassin, R.; Johnson, D. (1985); An O(n log² n) algorithm for maximum flow in undirected planar networks, SIAM Journal on Computing, 14: 612-624.
[8] Itai, A.; Shiloach, Y. (1979); Maximum flow in planar networks, SIAM Journal on Computing, 8: 135-150.
[9] Johnson, D.; Venkatesan, S. (1982); Using divide and conquer to find flows in directed planar networks in O(n^1.5 log n) time, Proceedings of the 20th Annual Allerton Conference on Communication, Control and Computing, University of Illinois, Urbana-Champaign, IL, 898-905.
[10] Khuller, S.; Naor, J. (1994); Flows in planar graphs with vertex capacities, Algorithmica, 11: 200-225.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 3, pp.
263-272

An Algorithm for Initial Fluxes of Metabolic P Systems

Roberto Pagliarini, Giuditta Franco, Vincenzo Manca
University of Verona, Italy
Computer Science Department
Strada le Grazie 15, 37134 Verona, Italy
E-mail: {roberto.pagliarini, giuditta.franco, vincenzo.manca}@univr.it

Received: April 5, 2009    Accepted: May 30, 2009

Abstract: A central issue in systems biology is the study of efficient methods for inferring the fluxes of biological reactions starting from experimental data. Among the techniques proposed in recent years, the theory of metabolic P systems, which is based on the log-gain principle, has proved helpful for deducing biological fluxes from time series of observed dynamics. In this approach, the algebraic systems provided by the log-gain principle determine the reaction fluxes underlying a system dynamics when the initial fluxes are known. Here we propose a heuristic algorithm for estimating the initial fluxes, which is tested in two case studies.

Keywords: biological modeling, P systems, MP systems, metabolic flux estimation, heuristic algorithms.

1 Introduction

In recent years, the problem of reverse-engineering biological phenomena from experimental data has spurred increasing interest in the scientific community. For this reason, many computational models inspired by biology have been proposed. Among these models, metabolic P systems [11, 12], shortly MP systems, have proved relevant in the analysis of the dynamics of biochemical processes, that is, structures where matter of different types is transformed by reactions. By means of MP systems, models of several interesting phenomena have been provided, among which we mention: the Lotka-Volterra dynamics [2, 3, 15], a susceptible-infected-recovered epidemic [2], the leukocyte selective recruitment in the immune response [2], the protein kinase C activation [3], the mitotic cycle [14], the Pseudomonas quorum sensing [4] and the non-photochemical quenching phenomenon [16].
The importance of MP systems lies in their potential applicability to the reverse-engineering problem of biological phenomena. In fact, in the framework of MP systems, a theory called log-gain [10, 11, 12] has been introduced, specifically devoted to the deduction of reaction fluxes, that is, the amounts of reactants transformed by the reactions at each step of the system. As we will show, a key point in achieving this task consists in discovering the fluxes associated with the passage of a metabolic system from the state at the initial observation instant to the next one. In this paper a heuristic algorithm is proposed for estimating the initial flux vector from a few observation steps. In brief, the algorithm first roughly computes the initial fluxes by assuming they have a form recalling the mass action principle, and then solves a system of equations to deduce the corresponding fluxes at the next step. From these values, the algorithm evaluates how much of each substance is necessary to activate the first evolution step, and finally the actual initial fluxes are computed by solving a minimization problem. The paper is organized as follows. Section 2 is devoted to the definition of metabolic P systems, while in Section 3 the log-gain theory is briefly recalled. In Section 4 we describe the algorithm which solves the initial fluxes problem. Section 5 reports the simulations of a couple of systems obtained by starting with the initial fluxes computed by our algorithm. Further remarks and some directions for future research are discussed in the last section.

Copyright © 2006-2009 by CCC Publications

2 Metabolic P systems

MP systems are a special class of dynamical systems (the reader can find some details concerning the dynamical aspects of MP systems in [13]), based on P systems [5, 18, 19], which are related to metabolic processes.
MP systems are essentially constituted by multiset grammars where rules are regulated by specific functions depending on the state of the system. From a membrane computing point of view, MP systems can be seen as deterministic mono-membrane P systems where the transitions between states are computed by a suitable recurrent equation. In an MP system the variation of the whole system is considered over a macroscopic time interval. In this manner, the evolution law of the system includes the knowledge of the contribution of each reaction to the evolution from one state to the next. Therefore, the dynamics is given at discrete steps, and at each step it is ruled by a partition of matter among the reactions transforming it. The principle underlying the partitioning is called the mass partition principle, and it defines the transformations of object populations, rather than single objects, according to a suitable generalization of chemical laws [11]. The following definition introduces MP systems formally (N, Z and R denote the sets of natural, integer and real numbers, respectively).

Definition 1 (MP system). An MP system M is specified by the following construct:

M = (X, R, V, H, Φ, ν, µ, τ)

where X, R and V are finite disjoint sets, and moreover the following conditions hold, with n, m, k ∈ N:

• X = {x_1, x_2, ..., x_n} is a finite set of substances. This set represents the types of molecules;
• R = {r_1, r_2, ..., r_m} is a finite set of reactions. A reaction r is a pair of type α_r → β_r, where α_r identifies the multiset of the reactants (substrates) of r and β_r identifies the multiset of the products of r (λ represents the empty multiset). The stoichiometric matrix A of a set R of reactions over a set X of substances is A = (A_{x,r} | x ∈ X, r ∈ R) with A_{x,r} = |β_r|_x − |α_r|_x, where |α_r|_x and |β_r|_x respectively denote the number of occurrences of x in α_r and β_r. Of course, a reaction r can be seen as the vector r = (A_{x,r} | x ∈ X) of Rⁿ.
We also set R_α(x) = {r ∈ R | x ∈ α_r}, R_β(x) = {r ∈ R | x ∈ β_r}, and R(x) = R_α(x) ∪ R_β(x);

• V = {v_1, v_2, ..., v_k} is a finite set of parameters. This set represents entities which affect the dynamics but are not transformed by reactions;
• H = {h_v | v ∈ V} is a set of parameter evolution functions. The function h_v: N → R states the value of parameter v, and H[i] = (h_v(i) | v ∈ V);
• Φ = {φ_r | r ∈ R} is the set of flux regulation maps, where, for each r ∈ R, φ_r: R^{n+k} → R. Let q ∈ Rⁿ be the vector of substance values and s ∈ R^k the vector of parameter values. Then (q, s) ∈ R^{n+k} is the state of the system. We set U(q, s) = (φ_r(q, s) | r ∈ R), the flux vector in the state (q, s), constituted by the substance state q and by the parameter state s;
• ν is a natural number which specifies the number of molecules of a (conventional) mole of M;
• µ is a function which assigns to each x ∈ X the mass µ(x) of a mole of x (with respect to some measure unit);
• τ is the temporal interval between two consecutive observation steps.

Let X[i] = (x_1[i], x_2[i], ..., x_n[i]), for each i ∈ N, be the vector of substance values at step i, and let X[0] be the initial values of the substances. The dynamics of an MP system is completely identified by the following recurrent equation, called the Equational Metabolic Algorithm, shortly EMA:

X[i + 1] = A × U(X[i], H[i]) + X[i]   (1)

where A is the stoichiometric matrix of the reactions, having dimension n × m, while × and + are the usual matrix product and vector sum. We denote by EMA[i] the system (1), which allows us to obtain the vector X[i + 1] from the vectors X[i] and U(X[i], H[i]). If in an MP system the elements ν, µ and τ are omitted, then the resulting structure is called an MP grammar. It is a multiset rewriting grammar where rules are regulated by specific functions.
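The stoichiometric matrix and one EMA step can be sketched as follows; this is an illustrative fragment (names are ours), with multisets represented as {substance: multiplicity} dictionaries.

```python
def stoichiometric_matrix(substances, reactions):
    """A[x][r] = |beta_r|_x - |alpha_r|_x for each substance x and
    reaction r = (alpha, beta) given as a pair of multiplicity dicts."""
    return [[beta.get(x, 0) - alpha.get(x, 0) for (alpha, beta) in reactions]
            for x in substances]

def ema_step(A, X, U):
    """One step of EMA (1): X[i+1] = A x U + X[i], with plain lists."""
    return [x + sum(a * u for a, u in zip(row, U)) for row, x in zip(A, X)]
```

For example, with substances a, b, c and the two reactions a → bc and b → a, the rows of A for a, b, c are (−1, 1), (1, −1) and (1, 0); applying fluxes (3, 1) to the state (10, 2, 0) moves the system to (8, 4, 3).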
Such a grammar is completely specified by: i) the reactions; ii) the flux regulation functions; iii) the parameter evolution functions; iv) the substances, which are the elements occurring in the reactions, together with their initial values; and v) the parameters, which are the arguments of the flux regulation functions different from substances. The parameter evolution maps and/or the initial values of the substances may be omitted when only the MP grammar structure is specified.

3 Log-gain theory: a brief recall

The starting point of the log-gain theory [10, 11, 12] for MP systems is the allometry law [1, 7], which has many possible formulations [10]; in the case discussed here it can be expressed in a simple way. Namely, a proportion can be assumed, at each step, between the relative variation of the flux of a reaction and the sum of the relative variations of its reactants, with a possible gap, called the offset. Given the dynamics of an MP system, we will use the following simplified notations, for i ∈ N and r ∈ R:

u_r[i] = φ_r(X[i], H[i]) and U[i] = (u_r[i] | r ∈ R).   (2)

Assuming the vectors X[i] and X[i + 1] to be known, equation (1) can be rewritten in the following form, which we call ADA[i] (Avogadro and Dalton Action [12]):

X[i + 1] − X[i] = A × U[i].   (3)

Formula (3) expresses a system of n equations in m variables (n is the number of substances and m the number of reactions) which is assumed to have maximal rank. This assumption is not restrictive: if it does not hold, the rows which are linearly dependent on other rows can be removed, keeping the notations A, X[i + 1] and X[i] for the stoichiometric matrix and the vectors of substance concentrations, respectively. We thus assume that A has maximum rank, which we again call n. Then there exist n linearly independent reactions of R, and we call R₁ such a subset of reactions. From a metabolic point of view, this means that the flux of each reaction of R can be obtained as a linear combination of the fluxes of the reactions of R₁.
Formally, ADA[i] is essentially the system EMA[i] introduced in Section 2. However, the two systems have dual interpretations. In EMA[i], the vectors U[i] and X[i] are known and the vector X[i + 1] is computed from them, while in ADA[i] the vector X[i + 1] − X[i] is known and U[i] is computed by solving a system comprising both the equations of ADA[i] and further equations, dictated by the following log-gain principle, which state the reaction regulation level, as we will see in formula (6). Indeed, since the number of reactions is realistically assumed to be greater than the number of substances, system (3) has more than one solution; therefore the fluxes cannot be univocally deduced by means of ADA[i] alone. The log-gain principle allows us to add further equations in order to obtain a univocally solvable system providing the flux vector. The two following definitions state the log-gain principle. For the detailed motivations of this principle we refer to the papers on MP system theory [10, 11, 12]. Further developments providing theoretical and experimental evidence for this principle will be the matter of forthcoming papers.

Definition 2 (discrete log-gain). Let (z[i] | i ∈ N) be a real valued sequence. Then the discrete log-gain of z, at each step i, is given by the following equation:

lg(z[i]) = (z[i + 1] − z[i]) / z[i].   (4)

Principle 1 (log-gain regulation). Let U[i] be the vector of fluxes at step i, for i ≥ 0, and let R₁ ⊂ R be a set of n linearly independent vectors of Rⁿ.
Then the log-gain regulation can be expressed in terms of matrix and vector operations:

(U[i + 1] − U[i]) / U[i] = B × Lg(X[i]) + C ⊗ P[i + 1]   (5)

where:
• B = (p_{r,x} | r ∈ R, x ∈ X), where p_{r,x} ∈ {0, 1} with p_{r,x} = 1 if x is a reactant of r and p_{r,x} = 0 otherwise;
• Lg(X[i]) = (lg(x[i]) | x ∈ X) is the column vector of the log-gains of the substances;
• C = (c_r | r ∈ R), where c_r = 1 if r ∈ R₁, while c_r = 0 otherwise;
• P[i + 1] is a column vector of values associated with the reactions, called (log-gain) offsets at step i + 1;
• × denotes the usual matrix product;
• +, −, /, ⊗ denote the component-wise sum, subtraction, division and product of vectors.

If we assume the flux vector at step i to be known and put together equations (5) and (3), at steps i and i + 1 respectively, we get the following linear system, called the Offset Log-Gain Adjustment module at step i, shortly OLGA[i], in which the number of variables (the components of U[i + 1] and P[i + 1]) is equal to the number of equations:

A × U[i + 1] = X[i + 2] − X[i + 1]   (6)
(U[i + 1] − U[i]) / U[i] = B × Lg(X[i]) + C ⊗ P[i + 1].

Given the vectors Lg(X[i]), for i = 0, 1, ..., l, where l ∈ N, it is possible to prove that OLGA[i], for i = 0, 1, ..., l − 1, univocally provides the fluxes U[i + 1], once U[0] is known.

4 An algorithm to estimate initial metabolic fluxes

The iteration of the OLGA module, introduced in the previous section in order to deduce the fluxes of the reactions, assumes the knowledge of the initial values of the fluxes. This leads to the formulation of the following problem.

Problem 1 (initial fluxes problem). Given X[0] and X[1], find a flux vector U[0] which satisfies the initial dynamics, that is:

X[1] ≅ A × U[0] + X[0]

where ≅ means that we search for the vector U[0] providing the minimum value of the stoichiometric error, defined as (‖·‖ denotes the Euclidean norm)

‖A × U[0] − (X[1] − X[0])‖.
The algorithm given below solves the initial fluxes problem by using knowledge of the dynamics in the first evolution steps in order to evaluate the amount of each substance which is necessary to activate the first evolution step.

4.1 The proposed algorithm

Our algorithm consists of three phases, some of which include different computational steps. The first phase consists in the approximation of the initial fluxes, by assuming that the fluxes are proportional to the product of the reactant quantities. In the second phase an OLGA module is employed to approximate the amount of substances needed as fuel for the first evolution step. In the third phase an optimization problem is solved, which is based on the ADA system (3). The details of the algorithm work-flow are described in the following.

Phase 1. The goal here is to roughly evaluate the initial reaction fluxes by assuming that they are proportional to the reactants for certain initial evolution steps i. This could appear restrictive, but at this stage we require only an initial approximation. Therefore, at a given step i, for all r ∈ R, we set:

û_r[i] = k_r y_r[i]   (7)

where k_r ∈ R, and y_r[i] is the product of all the substance quantities, at step i, which are reactants of r. We suppose that if α_r = λ then y_r[i] = 1, and we set

Û[i] = (û_r[i] | r ∈ R).   (8)

For example, in a metabolic system having three kinds of substances, a, b, c, and the set of reactions given in the first column of Table 1, the relationships between the fluxes of these reactions and their reactants are those reported in the second column of Table 1. For any x ∈ X, let us consider the following system, called the local-stoichiometric module at step i, where A is the stoichiometric matrix:

x[i + 1] − x[i] = ∑_{r ∈ R(x)} A_{x,r} û_r[i].
(9)

If we assume that the constants k_r, r ∈ R, do not change sensibly in a few steps, then by applying system (9), in at most m − n steps we either obtain a square linear system of dimension m having maximum rank, or the algorithm ends without an output. In fact, under the assumption that the rank of the local-stoichiometric module is n (that is, the number of equations) and that the number of variables is m, with n < m, the system is completely determined if we add another m − n equations. Assuming that at least one new significant equation is gained at each step i, in at most m − n steps we obtain a system of (m − n)n + n equations with m variables and rank equal to m. In this way, we can obtain a square linear system having a unique solution. In the example reported in Table 1, we have a local-stoichiometric module of 3 equations having rank 3, which initially has 4 variables.

reactions       maps
r_1: a → bc     k_1 a
r_2: b → a      k_2 b
r_3: c → ab     k_3 c
r_4: c → cc     k_4 c

Table 1: Reactions and corresponding flux regulation maps of the local-stoichiometric module.

At the second iteration of this module, starting from step 0, we get three more equations, finally giving the following system:

a[1] − a[0] = −k_1 a[0] + k_2 b[0] + k_3 c[0]
b[1] − b[0] = k_1 a[0] − k_2 b[0] + k_3 c[0]
c[1] − c[0] = k_1 a[0] − k_3 c[0] + k_4 c[0]
a[2] − a[1] = −k_1 a[1] + k_2 b[1] + k_3 c[1]
b[2] − b[1] = k_1 a[1] − k_2 b[1] + k_3 c[1]
c[2] − c[1] = k_1 a[1] − k_3 c[1] + k_4 c[1]

where the values a[0], b[0], c[0] and a[1], b[1], c[1] are given by the observed dynamics. This system has rank 4, and 4 of its equations (the first, the third, the fifth and the sixth) are linearly independent. Thus, we can obtain a system of equations having a unique solution.
in general, if we start with the local-stoichiometric module at the step 0, then we can compute the vector û[0] = (ûr[0] | r ∈ r) by applying the local-stoichiometric module for a suitable number of steps. the algorithm stops with no output if, after m − n iterations of the above technique, the number of linearly independent equations is less than m.

phase 2. the aim of this step is to estimate the amount of substance necessary to start the first system evolution step. we describe this step along with two sub-phases. in the first sub-phase we solve the olga[0] module, with u[0] = û[0], where û[0] is the vector of fluxes computed in the previous step. let us call u* = (u*r | r ∈ r) the solution of this system. however, if some elements of this vector have a negative value, then we choose a different set of n linearly independent reactions in olga and apply the above procedure again. the algorithm stops with no answer if a positive solution is not found after a number of attempts equal to the number of such different sets. meanwhile, general methods are under investigation which systematically and efficiently search for a unique positive solution u*. in the second sub-phase we compute, for each x ∈ x, the amount of substance x̄ required for the application of the reactions in the first evolution step. if a⁻ is the activation matrix defined by a⁻_{x,r} = |αr|_x, for x ∈ x, r ∈ r, then the searched values are obtained by computing the vector x̄ = a⁻ × u*.

phase 3. in the last step we obtain the actual vector of fluxes u° by solving a norm minimization problem [9], such that u° provides the minimum of the following (euclidean) norm

‖a × ξ − (x[1] − x[0])‖   (10)

over all the positive vectors ξ = (ξr | r ∈ r) of ℝ^m such that

a⁻ × ξ = x̄,   (11)

where x̄ is the vector computed at the previous step.
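phases 2 and 3 amount to a matrix product followed by a constrained least-squares problem. the sketch below uses invented data, and scipy's general-purpose slsqp solver stands in for the norm-minimization method of [9]: it computes x̄ = a⁻ × u* and then minimizes ‖a × ξ − (x[1] − x[0])‖ subject to a⁻ × ξ = x̄ and ξ ≥ 0.

```python
import numpy as np
from scipy.optimize import minimize

# same toy reaction set: r1: a -> bc, r2: b -> a, r3: c -> ab, r4: c -> cc
A = np.array([[-1.0,  1.0,  1.0, 0.0],
              [ 1.0, -1.0,  1.0, 0.0],
              [ 1.0,  0.0, -1.0, 1.0]])
# activation matrix a-: a-[x, r] = multiplicity of x as a reactant of r
A_minus = np.array([[1.0, 0.0, 0.0, 0.0],
                    [0.0, 1.0, 0.0, 0.0],
                    [0.0, 0.0, 1.0, 1.0]])

u_star = np.array([2.0, 4.0, 1.8, 0.6])  # invented solution of the olga[0] module
x_bar = A_minus @ u_star                 # phase 2: substance needed to fire step 0
dx = A @ u_star                          # toy observed variation x[1] - x[0]

# phase 3: minimize ||A xi - (x[1] - x[0])|| over xi >= 0 with A- xi = x_bar
res = minimize(lambda xi: np.sum((A @ xi - dx) ** 2),
               x0=np.ones(4),
               method="SLSQP",
               bounds=[(0.0, None)] * 4,
               constraints=[{"type": "eq",
                             "fun": lambda xi: A_minus @ xi - x_bar}])
u_circ = res.x
assert res.success
assert np.allclose(A_minus @ u_circ, x_bar, atol=1e-5)
```

since the toy data were generated from u*, the minimizer recovers u° ≈ u*; with noisy observations, only the equality constraint (11) would hold exactly.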
5 experiments

in this section, in order to evaluate the performance of our algorithm, we apply it to two case studies: i) a synthetic oscillatory metabolic system and ii) the belousov-zhabotinsky reaction [8, 20, 21].

5.1 a synthetic metabolic system

let us consider the synthetic non-cooperative metabolic system without parameters called sirius [11] and given by table 2. firstly, we compute u[0] = (ϕ1(x[0]), ϕ2(x[0]), . . . , ϕ5(x[0])). then, we use our algorithm to approximate the vector of fluxes u°. the two vectors are essentially the same.

reactions | flux regulation maps
r1 : a → aa | ϕ1 = k a/(k + k c + k b + k)
r2 : a → b | ϕ2 = k a c/(k + k c + k b + k)
r3 : b → λ | ϕ3 = k b/(k + k)
r4 : a → c | ϕ4 = k a b/(k + k c + k b + k)
r5 : c → λ | ϕ5 = k c/(k + k)

table 2: the sirius mp grammar; the numerical values of the initial state x[0] and of the constants k are those given in [11].

5.2 a biochemical case study

in this subsection the application of the algorithm to approximate the initial fluxes of the belousov-zhabotinsky reaction, also known as the bz reaction, is discussed. this system is a well-known example of a biochemical oscillatory phenomenon; in fact, it was the first evidence of a chemical clock. although the stoichiometry of the bz reaction is quite complicated, several simplified mathematical models of this phenomenon have been proposed. in particular, prigogine and nicolis [17] proposed a simplified formulation of the dynamics of the bz reaction, called the brusselator, whose oscillating behaviour is represented by only two substances, x and y, and is governed by the following system of differential equations:

dx/dt = k1 − k2 x + k3 x² y − k4 x   (12)
dy/dt = k2 x − k3 x² y

where k1, k2, k3 and k4 are constant rates. we use the oscillatory dynamics obtained by solving the system (12), with given initial conditions for x and y, as experimental data on which to test our algorithm.
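the numerical rate values used in the original experiments are not reproduced here; the sketch below integrates system (12) with the euler method, taking the classic oscillatory choice k1 = 1, k2 = 3, k3 = 1, k4 = 1 and initial state (1, 1) as assumptions, just to show the limit-cycle behaviour that serves as experimental data.

```python
import numpy as np

# hypothetical rate constants: a classic oscillatory regime of the
# brusselator; the paper's actual numerical values are not reproduced here
k1, k2, k3, k4 = 1.0, 3.0, 1.0, 1.0
dt, steps = 0.001, 60000          # euler integration over t in [0, 60]

x, y = 1.0, 1.0                   # assumed initial conditions
xs = []
for _ in range(steps):
    dx = k1 - k2 * x + k3 * x * x * y - k4 * x   # system (12)
    dy = k2 * x - k3 * x * x * y
    x, y = x + dt * dx, y + dt * dy
    xs.append(x)

xs = np.array(xs)
tail = xs[steps // 2:]            # discard the initial transient
# sustained oscillation: x keeps swinging around the fixed point x* = k1/k4 = 1
assert tail.max() > 1.5 and tail.min() < 0.7
```

with this choice of constants the fixed point (x*, y*) = (k1/k4, k2 k4/(k1 k3)) is unstable, so the trajectory settles onto a limit cycle rather than converging.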
the mp formulation of the brusselator is expressed by the set of rewriting rules reported in table 3, where, according to the literature, the flux of each rule r depends on the concentrations of the reactants of r. in fact, species x has two positive and two negative contributions, while one positive and one negative contribution characterize y. thus, the equations can be mapped into a suitable stoichiometry by following the strategy described in [6].

rules
r1 : λ → x
r2 : x → y
r3 : xxy → xxx
r4 : x → λ

table 3: a set of rewriting rules that describes the brusselator stoichiometry.

in the case of bz we adopt a different strategy for validating our algorithm. in fact, there is a complete correspondence between the dynamics computed by the differential model and the one computed by the equational metabolic algorithm using the fluxes deduced by the olga module (figure 1), starting from the initial fluxes inferred by means of our algorithm.

6 conclusions

the study of efficient methods for defining mp systems from experimental data is of crucial importance for systematic applications of mp systems to complex dynamics.

figure 1: the bz reaction fluxes calculated by solving the system (6) with the initial vector of fluxes inferred by our algorithm.

an essential component of the regulation level of an mp system can be deduced by applying the log-gain theory to data that can be collected from observations of the system. a crucial task to perform in this context is the reliable determination of the initial vector of fluxes. in this paper we have devised an algorithm to infer the initial reaction fluxes of a biological network. the proposed algorithm has been validated on the test cases of a synthetic metabolic oscillator and the brusselator phenomenon.
future investigations will be developed with the aim i) to develop the computational features of this algorithm and ii) to show the applicability of our method to complex biological cases.

bibliography

[1] l. von bertalanffy. general systems theory: foundations, developments, applications. george braziller inc, new york, ny, 1967.
[2] l. bianco, f. fontana, g. franco, and v. manca. p systems for biological dynamics. in [5], 81–126, 2006.
[3] l. bianco, f. fontana, and v. manca. p systems with reaction maps. international journal of foundations of computer science, 17(1):27–48, february 2006.
[4] l. bianco, d. pescini, p. siepmann, n. krasnogor, f. j. romero-campero, and m. gheorghe. towards a p systems pseudomonas quorum sensing model. lecture notes in computer science, 4361:197–214, 2007.
[5] g. ciobanu, m. j. pérez-jiménez, and g. păun (eds.). applications of membrane computing (natural computing series). springer-verlag new york, inc., secaucus, nj, usa, 2006.
[6] f. fontana and v. manca. discrete solution to differential equations by metabolic p systems. theoretical computer science, 372:165–182, 2007.
[7] j. s. huxley. problems of relative growth. 2nd ed., dover, new york, 1972.
[8] d. s. jones and b. d. sleeman. differential equations and mathematical biology. chapman & hall/crc, february 2003.
[9] d. g. luenberger. optimization by vector space methods. john wiley & sons, inc., 1969.
[10] v. manca. log-gain principles for metabolic p systems. in a. condon et al. (eds.), algorithmic bioprocesses, chapter 28, natural computing series, springer-verlag, berlin heidelberg, 2009.
[11] v. manca. the metabolic algorithm for p systems: principles and applications. theoretical computer science, 404:142–157, 2008.
[12] v. manca. fundamentals of metabolic p systems. in g. păun, g. rozenberg, and a. salomaa, editors, handbook of membrane computing, chapter 16. oxford university press, 2009. to appear.
[13] v. manca. metabolic p dynamics. in g. păun, g. rozenberg, and a. salomaa, editors, handbook of membrane computing, chapter 17. oxford university press, 2009. to appear.
[14] v. manca and l. bianco. biological networks in metabolic p systems. biosystems, 91(3):489–498, 2008.
[15] v. manca, l. bianco, and f. fontana. evolution and oscillation in p systems: applications to biological phenomena. lecture notes in computer science, 3365:63–84, 2005.
[16] v. manca, r. pagliarini, and s. zorzan. a photosynthetic process modelled by a metabolic p system. natural computing, 2009. doi 10.1007/s11047-008-9104-x.
[17] g. nicolis and i. prigogine. exploring complexity. an introduction. freeman and company, san francisco, ca, 1989.
[18] g. păun. membrane computing: an introduction. springer, 2002.
[19] g. păun and g. rozenberg. a guide to membrane computing. theoretical computer science, 287(1):73–100, september 2002.
[20] k. s. scott. chemical chaos. cambridge university press, cambridge, uk, 1991.
[21] a. m. zhabotinsky. proc. acad. sci. ussr, 157:392, 1964.

roberto pagliarini (july 23, 1981) earned his m.s. degree in computer science at the university of verona, where he is currently a phd student in computer science. his research interests focus on systems biology and on molecular and membrane computing. he has collaborated with the biochemistry and vegetal physiology group at the biotechnological department of verona university, in order to investigate computational models for crucial events related to photosynthetic organisms. he is co-author of scientific papers on this subject.

giuditta franco (august 5, 1974) graduated in mathematics at the university of pisa and earned her phd in computer science, with a dissertation titled “biomolecular computing — combinatorial algorithms and laboratory experiments”, at the university of verona, where she is currently an assistant professor.
her research interests focus on discrete mathematics and computational models of biological systems, namely on dna and membrane computing. she has given talks at several international workshops and is co-author of scientific papers published in prestigious specialized journals. she is an effective member of both the european molecular computing consortium (emcc) and the international society for nanoscale science, computing and engineering (isnsce).

vincenzo manca (march 9, 1949) is a full professor at the computer science department of the university of verona, where he is also chair of the bioinformatics programme. he obtained his degrees from the university of pisa. his research interests cover a wide range of topics, from mathematical logic, discrete mathematics, and theoretical computer science to informational analysis and computational models of biological systems. at present, his investigation is focused on “natural computing” (in particular dna computing, membrane computing, and synthetic computational biology). he is the author of more than 100 scientific publications, appearing in international journals and scientific series.

international journal of computers, communications & control vol. i (2006), no. 3, pp. 85-92

coalition formation for cooperative information agent-based systems

nacer eddine zarour, sabrina bouzidi

abstract: the evolution of communication technology has led to an increase in the number of services and tasks carried out. the aim of current research on cooperation, and particularly on negotiation between agents, is to reach a coherent global state of the multiagent system by favouring agents’ synergy. in this paper, we propose a coalition formation-based negotiation model for task allocation in cooperative information agent-based systems. in this model, the agent that activates a negotiation seeks partners for achieving a complex task.
partners join the coalition one by one, according to the choice of all the coalition members. this choice is based on a multicriterion analysis. some experimental results show the performance of the suggested model.

keywords: negotiation, coalition formation, cooperative information systems, multiagent systems, task allocation.

1 introduction

in an open environment such as cooperative information systems (ciss), which gather distributed, heterogeneous, and autonomous information systems (iss), cooperation requires the intervention of a negotiation process in order to identify and structure the various problems, to propose and defend solutions, to re-examine the intentions in a conflict, and finally to confirm the commitments [1]. in our context, each is is modelled by an autonomous, rational, and egoistic agent. several negotiation models have been developed, such as market mechanisms [2], the contract net protocol [3], social laws [4], multiagent planning [5], and organizational structures (coalitions, teams, congregations, etc.) [6]. coalition formation is a negotiation technique which is particularly preferred for solving conflicts between agents whose behaviour is economic and thus rational. a coalition is a short-term organization based on specific and contextual commitments of the involved agents. this allows agents to profit from their respective competencies. to solve the task allocation problem, several coalition formation-based models have been suggested [7] [8] [3]. however, these models are not adapted to the context of ciss because they do not define relations of partnership between agents, but rather relations of subcontracting. generally in these models, the agent that activates a negotiation process alone decides how to structure the coalition. it selects its partners separately, according to its own desires. consequently, the agents’ autonomy is not respected.
the objective of this work is to propose a coalition formation-based model for task allocation in ciss. the proposed model, named cfotacis1, is inspired by the reality of cooperation between enterprises. it respects the agents’ autonomy by allowing them to take part in the coalition formation process. it is recursive and includes four phases: initialization, negotiation, evaluation, and finalization. the rest of the paper is organized as follows. in the following section, we present some related work on coalition formation. in section 3, we develop the suggested negotiation model. section 4 enumerates the properties of the proposed model. some experimental results are presented in section 5. section 6 summarizes the contributions of this paper and the research perspectives.

1 coalition formation for task allocation in cooperative information systems.

copyright c© 2006 by ccc publications. selected paper from icccc 2006

86 nacer eddine zarour, sabrina bouzidi

2 related work

according to the literature, we consider two approaches for studying the coalition formation process: the macroscopic approach and the microscopic one. the macroscopic approach, which is based on game theory, considers the coalition as the basic unit [9]. the microscopic approach considers agents as the basic unit; several models following it have been proposed and applied in many fields, especially e-business [10] and task allocation [7] [8] [3]. we focus our discussion on the models based on the second approach which deal with the task allocation problem. in [7], the authors proposed a model where agents exchange their preferences w.r.t. the possible solutions. although forcing agents to form alliances ensures the termination of the negotiation process, the model does not respect agents’ freedom, since it imposes a solution which may not satisfy all agents.
in [8], the authors propose a model where the agent that activates the negotiation process first filters the agents of its environment. it uses a multicriterion analysis to find the agents that best satisfy its requirements. then, it starts an argumentative negotiation with the candidates, which are classified according to its preferences. this approach does not ensure that a consensus is reached, but limiting the negotiation time seems to be a reasonable strategy. two models are proposed in [3]. the first one is dedicated to competitive agents. the agent selection is based on a preference model, which is built using several criteria. the major drawback of this model is that the representing agent of the coalition has a kind of authority over the coalition members. in the second model, agents cooperate in an altruistic way. the model ensures that a consensus is reached and has some advantages, like the integration of new partners one by one into the coalition. however, the altruistic strategy of the agents does not require the gradual integration of agents into the coalition; this approach seems more interesting in the case of egoistic agents. in [10], the authors proposed a formalization of the trust criteria which an agent grants to the other agents of the environment. this trust modelling ensures stability in the coalition formation due to its reliability. however, the authors do not specify how to find the value of the current evaluation of trust. to answer this question, in our proposal we deduce the trust from several criteria.

3 cfotacis: a coalition formation-based negotiation model in ciss

before describing the suggested model, let us admit the following assumption: an agent which wants to integrate the cis must update the portal of this cis [4]. it must specify its localization, its know-how, the information about its branch of industry, its competencies, its capital, etc.
cfotacis is recursive and includes four phases: initialization, negotiation, evaluation, and finalization. at the beginning of the negotiation process, the manager, which is the first coalition member, follows the model phases alone. after that, when new partners join the coalition and become members, all the members try to find the next partner. the negotiation process iterates until the coalition size is reached. let us give the essential description of the different phases.

3.1 the initialization phase

the manager consults the portal of the cis (first assumption) to obtain useful information on agents. then, it decomposes the global task into subtasks w.r.t. the agents’ industry branches. for each subtask, the manager makes a set of candidates among the agents of the environment.

coalition formation for cooperative information agent-based systems 87

3.2 the negotiation phase

the manager simultaneously sends the proposals to the agents of the same set. the message which the manager sends to the candidates must contain all the information about the cooperation project, as well as the waiting period accorded to them. then, the manager receives the responses from the candidates. figure 1 represents the state transitions for a negotiation between a manager (m) and a candidate (c). after receiving a proposal from a manager for cooperation, a candidate can:
• refuse to cooperate (refuse(c, m));
• accept to achieve the subtask (accept(c, m));
• accept to achieve the subtask and also propose to achieve another subtask(s) (ok(c, m));
• refuse to achieve the subtask but propose to achieve another subtask(s) (no(c, m));
• not answer. in this case, after the expiry of a waiting period, the manager considers the non-reply as a rejection (refuse(c, m)).
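the candidate replies enumerated above can be sketched as a small message-handling routine (illustrative python; the names are hypothetical and this is not cfotacis' actual implementation). a missing answer is mapped to a refusal once the waiting period expires:

```python
from enum import Enum, auto

class Reply(Enum):
    REFUSE = auto()  # refuse(c, m): refuse to cooperate
    ACCEPT = auto()  # accept(c, m): accept the proposed subtask
    OK = auto()      # ok(c, m): accept it and offer other subtask(s) too
    NO = auto()      # no(c, m): decline it but offer other subtask(s)

def collect_replies(received, candidates):
    """after the waiting period has expired, a candidate that did not
    answer is treated exactly like one that refused."""
    return {c: received.get(c, Reply.REFUSE) for c in candidates}

replies = collect_replies({"ag1": Reply.ACCEPT, "ag3": Reply.NO},
                          ["ag1", "ag2", "ag3"])
assert replies["ag2"] is Reply.REFUSE  # no answer -> rejection
```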
if the candidate agrees to cooperate, it must send the manager a message that contains its estimate of the cost and the time for achieving the subtask, as well as the maximum reply time that it grants the manager to evaluate the proposal.

figure 1: state transition graph of the negotiation between the manager (m) and a candidate (c)

3.3 the evaluation phase

after receiving the candidates’ responses, the manager evaluates and classifies them according to their importance. the evaluation is made using a multicriterion analysis. this phase includes two stages: the first one deals with the evaluation of the criteria and the second one with the aggregation of the evaluations.

criterion evaluation

in this stage, the manager evaluates the agents which sent their proposals according to various criteria. in [11], the author defines a set of interesting criteria for the selection of the most adequate partner. we select those which seem most significant in our context. we have classified the criteria into three categories:

(a) criteria related to the partner
• the cooperation degree of the candidate agent with the manager (d1)
• the cooperation degree of the manager with the candidate agent (d2)
• the quality of the relation (d3), deduced from three other subcriteria: (i) respect of the allowed time for achieving the subtask (s1); (ii) the quality of achieving the subtask (s2); (iii) honesty with regard to the profit distribution after the last experience (s3)
• the experience in cooperation: (i) the number of cooperations carried out by the partner (a1); (ii) the number of cooperations carried out by the partner without interruption (a2)

(b) criteria related to the cooperation
• the capital (c1)
• the technical capacity (c2)
• the technological competencies (c3)
• the existence of a single capability (c4)
(c) criteria related to the candidate agent proposal
the candidate must specify in its acceptance message the time t and the cost c for achieving the subtask.

the aggregation stage

after the evaluation of the different criteria, the manager aggregates the evaluations associated with all the criteria in order to have an overall estimate of each agent. in cfotacis, the aggregation of the criteria is used in two cases:

(i) for the trust quantification: the trust e, which results from the current relation with the partner agent, is computed using the equation e = p1*s1 + p2*s2 + p3*s3, where p1, p2, and p3 are weights that the agent must specify, with p1 + p2 + p3 = 1.

(ii) for the evaluation of each candidate: the aggregation operator is defined using a deviation from an aspiration point which gathers the preferred values of the criteria [12]. the evaluations are represented using a vector named b. the deviation of b from the aspiration point a is defined by the relation

deviation(a, b) = max_{j=1..p} (λj (aj − bj)), with λj = 1/(idealj − antiidealj)

where p is the number of criteria, ideal is a vector which gathers the maximal criteria values of the evaluation vectors, and antiideal is a vector which gathers the minimal criteria values of the evaluation vectors. the best evaluation d* is the one which minimizes the deviation from the aspiration point: d* = min{deviation(a, b)}.

to better explain the evaluation mechanism, let us consider the following example. agents ag1, ag2, and ag3 are evaluated according to only three criteria, criterion1, criterion2, and criterion3, in the vectors eval1, eval2, and eval3 (table 1).
agents: ag1 (eval1), ag2 (eval2), ag3 (eval3)

criterion | eval1 | λ(ideal − eval1) | eval2 | λ(ideal − eval2) | eval3 | λ(ideal − eval3) | ideal | antiideal | λ
criterion1 | 0.18 | 0 | 0.10 | 1 | 0.14 | 0.50 | 0.18 | 0.10 | 12.5
criterion2 | 0.16 | 0 | 0.12 | 1 | 0.15 | 0.25 | 0.16 | 0.12 | 25
criterion3 | 0.10 | 1 | 0.18 | 0 | 0.15 | 0.375 | 0.18 | 0.10 | 12.5
deviation | | 1 | | 1 | | 0.50 | | |
d* = 0.50

table 1: an illustrative example of the adopted aggregation operator

3.4 the finalization phase

after evaluating the agents’ proposals, the manager ranks the agents in a list according to their deviations. the manager sends the agent at the head of the list a message inviting it to join the coalition; if this fails, it contacts the next agent in the list, and so on. the coalition members merge their deviation vectors to have a unified view of the candidates. this merger allows the coalition members to have more complete information, since agents have only a partial vision of their environment. the merger is carried out by the minimum operator. let us consider an example showing how to build the unified evaluation vector. agent ag3 is evaluated by the members ag1 (eval1) and ag2 (eval2) w.r.t. the eleven (11) criteria (see section 3.3.1):

eval1 = (0.32, 0.56, 0.88, 20, 10, 2000, 1500, 5, 1, 0.53, 0.23)
eval2 = (0.56, 0.12, 0.68, 20, 10, 2000, 1500, 5, 1, 0.53, 0.23)

the unified deviation vector is eval = (0.32, 0.12, 0.68, 20, 10, 2000, 1500, 5, 1, 0.53, 0.23).

thus, for each subtask, the coalition members together try to choose the adequate partner. the negotiation process finishes when the coalition size is reached, and then the coalition members start the execution of the tasks. hence, with cfotacis, the decision on the coalition structure is collective.

4 the cfotacis properties

the proposed model presents several advantages. the choice of the partners is carried out by the coalition members, thanks to the unification process of the agents’ estimates carried out by each member.
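the aspiration-point aggregation and the minimum-operator merger above can be reproduced in a few lines (python with numpy); the figures are exactly those of table 1 and of the merger example.

```python
import numpy as np

# evaluations of the three candidates on the three criteria (table 1)
evals = {"ag1": np.array([0.18, 0.16, 0.10]),
         "ag2": np.array([0.10, 0.12, 0.18]),
         "ag3": np.array([0.14, 0.15, 0.15])}

E = np.array(list(evals.values()))
ideal, anti_ideal = E.max(axis=0), E.min(axis=0)
lam = 1.0 / (ideal - anti_ideal)          # 12.5, 25, 12.5

def deviation(b):
    # deviation(a, b) = max_j lambda_j * (ideal_j - b_j),
    # taking the ideal vector as the aspiration point a
    return float(np.max(lam * (ideal - b)))

devs = {name: deviation(v) for name, v in evals.items()}
assert np.isclose(devs["ag1"], 1.0) and np.isclose(devs["ag2"], 1.0)
assert np.isclose(devs["ag3"], 0.5)       # d* = 0.50, so ag3 is chosen

# two members' evaluations of the same candidate are unified
# with the minimum operator
eval1 = np.array([0.32, 0.56, 0.88, 20, 10, 2000, 1500, 5, 1, 0.53, 0.23])
eval2 = np.array([0.56, 0.12, 0.68, 20, 10, 2000, 1500, 5, 1, 0.53, 0.23])
unified = np.minimum(eval1, eval2)
assert np.allclose(unified[:3], [0.32, 0.12, 0.68])
```

note how the max over criteria prevents compensation: an agent that is excellent on two criteria but poor on the third still gets a large deviation, which is exactly the property claimed for the operator in the next section.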
the adopted aggregation operator does not allow compensation, which makes it possible to choose a partner that has acceptable values for all the criteria. cfotacis ensures the equity property because it offers the same chances to the candidates by limiting the time of the binary negotiations. when the candidate specifies its waiting time, this avoids the coalition members sending messages to an agent that is no longer waiting; consequently, considerable time is saved. when a partner disengages, it is obvious that it will be sanctioned. this sanction is expressed by the a2 parameter (see section 3.3.1.a). in this case, the coalition members have two alternatives for replacing the disengaged agent. they first propose the subtask to the coalition members to check whether there exists a member interested in achieving it; in this way, a new negotiation process is avoided. the second alternative is applied when no members are interested: the coalition is obliged to start a new negotiation process to find another partner. finally, if the manager cannot find partners or the required size of the coalition is not reached, the negotiation process is restarted.

5 experimental results

we have implemented cfotacis using jade 3.0 (java agent development framework). we have compared its performance with that of a similar model [3]. we recall that in [3], the author proposed two negotiation models. we are interested in the one dedicated to egoistic agents, considering the rationality of our agents. we have made several series of experiments. in this paper, we show the effect of varying the number of agents and subtasks on the global time of the negotiation process (figures 2 and 3).
figure 2: the negotiation time versus the number of agents for different numbers of subtasks in cfotacis

figure 3: the negotiation time versus the number of agents for different numbers of subtasks in the model proposed in [3]

in each negotiation process, we fix the number of subtasks and increase the number of agents, to finally deduce the consumed time. the graphs presented in this section were drawn using the origin 6.0 software. on the graph of figure 2 (cfotacis), we observe that we could vary the number of agents from 2 to 100 and the number of subtasks from 2 to 10, whereas in figure 3 (the comparative model) we could vary the number of agents only from 2 to 8 and the number of subtasks only from 2 to 7. therefore, we deduce that the scalability of cfotacis is stronger than that of the comparative model. one reason for this result is that in the comparative model all agents are in competition for the coalition formation; so, if we have "n" agents in the system, we also have "n" parallel negotiations, which causes a scheduling problem in the system. on the other hand, we observe that the negotiation time in cfotacis increases as the numbers of agents and subtasks increase. but this increase remains reasonable and very small in comparison with that of the comparative model. this is due to the weak scalability of the latter. also, in the comparative model, there is no limitation of the negotiation time between agents.

6 summary and conclusions

in this paper, we have proposed a coalition formation-based negotiation model for cooperative information agent-based systems, named cfotacis. the proposed model is inspired by the reality of cooperation between enterprises. it includes four phases: initialization, negotiation, evaluation, and finalization. an essential property of cfotacis is its consideration of the preferences of the coalition members for choosing the future partners.
it does not apply any authority on agents and ensures equity by limiting the time of the binary negotiations, due to the absence of cycles in the proposed negotiation protocol. however, it does not ensure that a consensus is reached, but tries to encourage agents to cooperate. the simulation results show that in cfotacis the time spent on negotiation is reasonable and the number of messages is deterministic. these results have been confirmed by comparing the performance of cfotacis with that of the negotiation model proposed in [3]. in future work, we will improve cfotacis so that it ensures that a consensus is reached. also, we will adapt it to other application fields like e-commerce.

references

[1] l. putnam, m. s. pool, “conflict and negotiation,” in handbook of organizational communication: an interdisciplinary perspective, f. m. jablin et al., eds., sage, newbury park, pp. 549-599, california, 1987.
[2] t. sandholm, “an algorithm for optimal winner determination in combinatorial auctions,” in proc. of the 16th int. joint conf. on ai, stockholm, sweden, 1999.
[3] s. aknine, “modèles et méthodes de coordination des systèmes multi-agents” (models and methods for the coordination of multi-agent systems), doctoral thesis, university paris ix dauphine, ufr science of the organizations, december 2000.
[4] n. zarour, m. boufaida, and l. seinturier, “a negotiation framework for organizational information systems,” int. journal of information technology and decision making (ijitdm), world scientific publishing co, vol. 3, no. 2, pp. 213-238, june 2004.
[5] a. e. f. saghrouchni and s. haddad, “recursive model for distributed planning,” in proc. of the 2nd int. conf. on multi-agent systems (icmas’96), aaai press, kyoto, japan, 1996.
[6] c. brooks and e. durfee, “congregating and market formation,” in proc. of the 1st int. joint conf. on autonomous agents and multiagent systems, acm press, pp. 96-103, 2002.
[7] a. e. f. saghrouchni and g. vauvert, “formation de coalitions pour agents rationnels” (coalition formation for rational agents), in proc. of ilipn’2000, viiith days of lipn, multi-agent systems and formal specification and software technologies, villetaneuse, france, 11-12 september 2000.
[8] l. k. soh, c. tsatsouli, and h. servey, “a satisficing negotiated and learning coalition formation architecture,” in v. lesser, c. ortiz, and m. tambe, editors, distributed sensor networks: a multi-agent perspective, kluwer academic publishers, 2001.
[9] l. larson and t. sandholm, “anytime coalition structure generation: an average case study,” journal of experimental and theoretical ai, 11, 1-20, 2000.
[10] j. vassileva, s. breban and m. horsch, “agent reasoning mechanism for long-term coalitions based on decision making and trust,” computational intelligence, 18, 4, special issue on agent mediated electronic commerce, pp. 583-595, nov. 2002.
[11] c. li and k. sycara, “a stable and efficient scheme for task allocation via agent coalition formation,” www.2.cs.cmu.edu/ softagents/papers/cco3.pdf, 2003.
[12] n. r. jennings, p. faratin, p. johnson, m. j. o’brien, and m. wiegand, “using intelligent agents to manage business processes,” practical application of intelligent agents and multi-agents technology, paam 96, 1996.

nacer eddine zarour, sabrina bouzidi
university mentouri of constantine
lire laboratory, 25000 algeria
e-mail: nasro_zarour@yahoo.fr, sab_bouzidi@yahoo.fr

international journal of computers, communications & control vol. i (2006), no. 3, pp. 5-12

from algorithms to (sub-)symbolic inferences in multi-agent systems

boldur e. bărbat, sorin c. negulescu

abstract: extending metaphorically the moisilean idea of “nuanced-reasoning logic” and adapting it to the e-world age of information technology (it), the paper aims at showing that new logics, already useful in modern software engineering, become necessary mainly for multi-agent systems (mas), despite obvious adversities.
The first sections are typical of a position paper, defending such logics from an anthropocentric perspective. Through this sieve, Section 4 outlines the features, asked for by the paradigm of computing as intelligent interaction and based on "nuances of nuanced reasoning", that should be reflected by agent logics. To keep the approach credible, Section 5 illustrates how quantifiable synergy can be reached even in advanced, challenging domains, such as stigmergic coordination, by injecting symbolic reasoning into systems based on sub-symbolic "emergent synthesis". Since for future work too the preferred logics are doxastic, the conclusions are structured in line with the well-known agent architecture: beliefs, desires, intentions.
Keywords: nuanced-reasoning logic, multi-agent systems, sub-symbolic inferences, stigmergic coordination, synergy

1 Introduction. From Chrysippus, via Moisil, to Agent Logics
For over 40 years, the determinism and bivalence of Chrysippean logic were the pillars of computer science; likewise, algorithms were the backbone of computer programs, complying with their etymon: pro-gramma = what is written in advance. They sufficed for both Fortran-like number crunching and Cobol-like business data processing. When early real-time applications (firstly, operating systems) required less autistic programs, algorithms tried to adapt, and bizarre terms such as "unsolicited input" were coined to fit the incipient non-determinism due to user free will. Bivalence not only survived but also grew in importance, strongly backed by hardware. Indeed, in the early '70s, the role of bivalent logic transcended the borders of narrow data processing, penetrating "computer-aided X", where X stands for almost any intellectual activity. Thus, "algorithmic reasoning", instead of being perceived as a side effect of "analogue humans losing the battle with digital computers", became a paradigm in the very sense of Kuhn.
Emerging within this "digital Zeitgeist", nuanced reasoning [12] was too anti-paradigmatic to redress the balance, at least in IT (besides, it was technologically useless, as most fascinating heresies are). Only after the "PC-Windows-WWW" revolution was this "nuanced" kind of fuzzy logic, developed by Zadeh as "computing with words", acknowledged as an alternative approach to software development (albeit seldom a necessary one). On the other hand, after a decade of success stories within artificial intelligence (AI), the perpetual stronghold of applied logics and symbolic processing, expert systems (based on the Newell-Simon hypothesis) began to disappoint because of their brittleness (in all nuances of the word), showing the actual limits of the symbolic paradigm. The reaction was prompt, overwhelming, and exaggerated: "GOFAI" (good old-fashioned AI) has to be replaced by "BIC" (biologically inspired computing), based on sub-symbolic paradigms. The most nihilist and powerful one, i.e., the ethological paradigm (based on the physical-grounding hypothesis), is, for good reasons, still in vogue. However, paradoxically, new, "much nuanced" logics are already used in modern software engineering, tending to become necessary mainly for non-trivial MAS, despite many, major, and obvious adversities. The paper aims to: a) defend not just those logics but also the inexorable need for symbolic processing, even in systems where intelligent behaviour emerges sub-symbolically (because of its synergistic potential); b) after explaining why synergy, show how it can be reached. (That is why the title contains the unusual term "(sub-)symbolic".) Thus, after a short history (Section 2), the approach is rendered from an anthropocentric perspective: the agent shall behave naturally (i.e., closer to human behaviour), not the opposite (Section 3).
Copyright © 2006 by CCC Publications. Selected paper from ICCCC 2006.
Through this sieve, Section 4 outlines the features and symbolic mechanisms asked for by the paradigm of "computing as intelligent interaction", based on "nuances of nuanced reasoning". To keep the approach credible, Section 5 sums up recent research showing how quantifiable inter-paradigmatic synergy can be reached even in advanced, challenging domains, such as stigmergic coordination, by injecting symbolic reasoning into systems based on sub-symbolic "emergent synthesis". Since for future work the preferred logics are doxastic, the conclusions (Section 6), far from being apodictic, can be structured in line with the well-known agent architecture: beliefs, desires, intentions.

2 History. In Search of Synergy
The research roots are in over 20 papers/articles published in 1997-2002 and synthesised in [3]. After 2002 there are two history strands having the common denominator "looking for synergy in the world of humans and agents":
• Stigmergic coordination. After minor improvements in 2003, in [13] some (less quantifiable) synergy was achieved by deviating from the biological model applied in the elitist ant systems, adding symbolic processing components (firstly adapting the environment and secondly instituting limited central coordination). In [7] a refined experimental model attested that in operational research, through "stig-synergy", the same solution quality could be reached with fewer ants than used in common benchmarks, thus saving at least one order of magnitude of processing time.
• Human-agent communication. User-avatar interaction was illustrated in medical captology, employing pathematic agents as virtual therapists [4]. The framework was widened (in the context of broadband communication) to any anthropocentric interface in [5], focusing on the languages enabled by modern multimodal interfaces. On a more abstract level, [6] showed how trans-disciplinary metaphors, applied in communication procedures, can help humanists and technologists get closer.

3 Approach.
Towards Natural Behaviour of Artificial Entities
Two perspectives guide the approach: anthropocentric systems, as a non-negotiable goal, and agent-oriented software engineering (AOSE), as an amendable means depending on long-range effectiveness. (Anthropocentrism means focusing on the human being as user, beneficiary, and, ultimately, raison d'être of any application or, more generally, technology [5]. Here, "anthropocentric" is synonymous with "human-centred".) The premises are:
1. Regarding the goal:
• Despite their fast-rising technological level, most IT applications involving intense human-computer interaction (HCI) have a low degree of user acceptance, ignoring the very slogan "computing as interaction" [1].
• That drawback holds mainly for AI systems, widening the gap between humanists and technologists.
• The main cause: system development is technocentric rather than anthropocentric.
• The main neglected human features are: 1f) invariants: humans are intrinsically analogue in information processing and multimodal in perception; 2f) prevalent in HCI: humans prefer symbolic communication but sub-symbolic response.
2. Regarding the means:
• The IT infrastructure is sufficiently advanced (in both facts and trends: nanoelectronics, broadband communication, the Semantic Web, multimodal interfaces, etc.) to allow anthropocentrism for most IT applications.
• Intelligent system behaviour, whatever that could mean, becomes a crucial user expectation. Regrettably, in AI neither technology nor design philosophy has yet been able to offer it in a user-relevant manner.
• Nevertheless, agent technology, as the AI flagship, proved to be a significant step towards user acceptance.
• AOSE is not bound to AI, but tends to become the dominant IT development paradigm [11], [15].
While the first premises in each category are generally accepted, the last ones are debatable (e.g., a4b is rather an "author thesis" and b4 is strongly contested by object-oriented designers). The corollaries relevant for the paper are:
• C1. The geometrically increasing computing power (due to Moore's law) promotes at least five factors tending to reduce radically the role of any species of logic in IT, at least for applications affordable on usual configurations:
1. Since deterministic applications are vanishing, the conventional algorithm is no longer the program backbone.
2. Even when still useful, the conventional algorithm is no longer the main programming instrument (being hidden in procedures easily reached in a host of libraries, or being generated by 4GLs).
3. In AI the symbolic paradigm is steadily being replaced by several sub-symbolic ones, based on fine-grain parallelism.
4. Even when symbols are used, they are stored in and retrieved from huge and cheap memory, rather than processed through sophisticated reasoning schemes (case-based reasoning is just a blatant example).
5. The cognitive complexity of new, sophisticated logics is too high for a designer when "cut and try" is affordable.
• C2. The rules for human-agent interaction can and should be set by users (at least while we have the demiurgic privilege of shaping agents as we like!):
1. Since interaction is carried out through the interface, anything behind it is user-irrelevant.
2. Since natural and artificial intelligence encounter each other at interface level, they shall join, not collide.
3. To join closer to human demeanour, users should engage interface agents as naturally as possible.
4. Hence, let agents behave more and more naturally (e.g., it is not difficult to go beyond gestures to show emotivity, since it is not emotion that has to be replicated, but its appearance: firstly forged, later more genuine [5]).
5.
Since interaction involves communication, the communication procedures (the term "procedures" is here a prudent, albeit partial, placeholder for "language" or even "empathy") must be those humans are familiar with (e.g., body language can and shall be added to verbal messages).
6. Since, besides how to communicate (the vehicle), what is communicated (the message) is vital, beyond the procedures there must be a representational compatibility between humans and agents (expressed through common ontologies, a primitive surrogate of a yet impossible common "Weltanschauung").
If regarding C2.1-C2.4 the blend "symbolic/sub-symbolic" is unclear, C2.5 implies symbols, whereas C2.6 is stronger, implying symbolic inferences. At least some of them shall be based on logic(s). In short, while acknowledging the decline of logic (because of C1), its necessity is asserted (in line with C2). Anyway, the role of desirable features of new logics could be credibly defended only outside large-scale systems, where proof is futile: it would require comparing diverse implemented MAS, designed with or without employing such logics. Because of C1.5, this is impractical. To weaken this main drawback of the approach, the argument is split to render two complementary paths, both based on the idea that the blend "symbolic/sub-symbolic" yields synergy: a) axiological perspective: why and what symbolic processing (Section 4, closer to a position paper); b) praxiological perspective: how symbolic processing can be added in experimental sub-symbolic models (Section 5, closer to a technical report).

4 Nuances of Nuanced Reasoning in Human-Agent Dialogue
It would be both arrogant and absurd if authors lacking an educational background in both mathematics and logic were to utter value judgments in these fields.
Hence:
• Without claiming that Moisil actually attached to "nuanced" any connotation other than "fuzzy", and bearing in mind his gifted, baroque way of catalysing brainstorming, it is legitimate to use the undertones of three (partial) synonyms, "degree", "gradation (sequence, development)", and "fine distinction", as metaphor sources.
• All assertions about existing or desirable logics mirror the angle of potential users of such logics, mainly in interface agents and MAS based on stigmergy. They convey "calls for help", not requests, and are uttered as desires.
• Since, as regards logics dealing with agent-related aspects, Fisher's logic [9] seems to a non-specialist by far the most responsive and appropriate for many basic AOSE requests, all desiderata below refer to it.
• Diversified inferences. Smith's propositional-representation theory should be: a) revisited and thoroughly extended; it shall include all main mechanisms (symbolic or not) employed by humans to infer and to make decisions (even "right-hemisphere-based" processes, such as educated guess, intuition, or gambling); b) applied, depending on the sub-field; such mechanisms should be replicated as "homomorphically" as adequate (not as possible!) in agent decision-making schemata. If all of them were to reach the elegance and dependability of logic, it would be nice, but let that remain a kind of "princesse lointaine", because in real-world systems most concepts involved tend to become blurred. For instance, even metalogic is now nuanced: soundness remains crucial (still, apart from time-critical applications, it can be circumvented through revisable reasoning); completeness is more negotiable (the oversimplified solution: "otherwise, nothing happens").
• "More time for agents". Nowadays, any piece of software unable to interact efficiently with unpredictable environments (humans included) and with its peers is hardly useful outside toy problems.
That means: parallelism, temporal dimension, non-determinism, reactivity. Corollary: any such program entity has to be implemented as an execution thread (atomic, sequential, asynchronous, and dynamic) [3]. To develop into an agent, the thread also needs non-trivial informational and motivational components. (However, the "dynamic component" is a trebly confusing term: a) it is not a component but the very nature of the agent; b) the "sense of time" refers to much more than activity, e.g., "waiting" is rather inactivity; c) "dynamic activity" sounds pleonastic from any stance.)
• No "start" and no "synchronous agents". If for e-commerce it is conceivable to consider that the entire world restarts with each transaction, for process control (even for discrete manufacturing) such eternal rebirth is practically excluded. Moreover, it is against the very spirit of: a) the (still dominant) "client-server" paradigm (the tailor is not spawned every time a client needs new clothes); b) real-time software engineering (to react timely to environment stimuli, the thread must exist to handle the interrupt); c) agency itself: the basic feature of autonomy (implying asynchronous behaviour) is endangered. Luckily, current timers permit a "fine-grain universal metronome", avoiding the costly implication "asynchronously executing agents → temporal logic of the reals". Thus, "asynchronously executing agents" should be perceived as a pleonasm, even though their logic is still based upon a discrete model of time, with both infinite past and infinite future. (In real-world MAS, there is no "big bang".)
• No "negative introspection". Unable to comment upon the advantages of ideal doxastic logics outside large-scale MAS, the authors feel that positive introspection is highly desirable but that assuming the negative one is ineffective for both agents and humans. Thus, if it makes sense and simplifies the features, maybe KD4, not KD45.
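For readers outside modal logic, the KD4/KD45 distinction can be recalled in standard textbook notation (a reminder only, not a claim about any particular agent logic):

```latex
% Reading $B\varphi$ as "the agent believes $\varphi$":
\begin{align*}
\mathrm{K}:\;& B(\varphi \rightarrow \psi) \rightarrow (B\varphi \rightarrow B\psi)
  && \text{(beliefs closed under implication)}\\
\mathrm{D}:\;& B\varphi \rightarrow \lnot B\lnot\varphi
  && \text{(beliefs are consistent)}\\
\mathrm{4}:\;& B\varphi \rightarrow BB\varphi
  && \text{(positive introspection)}\\
\mathrm{5}:\;& \lnot B\varphi \rightarrow B\lnot B\varphi
  && \text{(negative introspection)}
\end{align*}
```

Dropping axiom 5 yields KD4: the agent still knows what it believes, but need not be aware of everything it does not believe.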
• No more certitude, less checking. Until agent logics offer mechanisms to deal with uncertainty, at least in simple expressions, the "ugly chasm" separating formal theory from practical system development [9] cannot be avoided. Just a plain example of such a badly needed mechanism: exception handling. Even primeval animals move "algorithmically" ("if gap then get round, else go on") for only a few steps, in very hostile environments. Moreover, reaction to stimuli cannot mean perpetually looking for the stimulus. (Instead, the stimulus causes an interrupt that can be treated as an exception.) The cardinal hindrance stems not from logic but from the mechanisms employed: neither nature nor technology can afford, in the long run, mechanisms involving large amounts of testing, because tools like "if temperature > N °C then alarm" are too time-consuming. Thus, the main problem is not the semantics of "unless" but the repeated checking of "if". From this angle, the semantics of "unless" in Reiter's default logic would be more tempting if it were diachronic rather than synchronic (a bird is or is not a penguin, but will never become one). However, what is needed is a kind of M operator meaning roughly "while no alarm is heard, it is consistent to believe that nothing happened". Indeed, the agent is condemned to be a risk-taker, hearing (reactively) the environment, not listening (proactively) to it: the agent stops performing a task only if it hears the alarm bell. The point is that this "if" belongs to the metalanguage and does not involve thermometer reading! Perhaps a non-monotonic logic with "Reiter-unless" inserted into a temporal logic with "Fisher-unless" is what designers dream of. (Since dreams are forward-thinking, maybe more: a graphical, "flowchart-like" symbol of this M shall be understood by an interpreter of an "AOSE-ML" without "object legacy" that can create code for defining, raising, propagating, and handling exceptions.)
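The contrast between perpetual checking and exception-driven reaction can be sketched in a few lines of code (a hypothetical illustration of ours; the sensor, threshold, and handler names are not from the paper):

```python
import threading

# Polling: the agent is sentenced to "perpetual testing" of the condition.
def polling_agent(read_temperature, threshold, max_steps):
    for _ in range(max_steps):
        if read_temperature() > threshold:   # the repeatedly checked "if"
            return "alarm"
    return "no alarm"

# Exception-driven: the environment rings the alarm bell once;
# the agent merely reacts, without ever reading the thermometer itself.
class TemperatureAlarm(Exception):
    """Raised on behalf of the environment, never polled for by the agent."""

def sensor(alarm_bell, read_temperature, threshold):
    if read_temperature() > threshold:
        alarm_bell.set()                     # one interrupt-like notification

def reactive_agent(alarm_bell):
    if alarm_bell.is_set():                  # metalanguage "if": no sensor reading
        raise TemperatureAlarm
    return "task step done"

alarm_bell = threading.Event()
sensor(alarm_bell, read_temperature=lambda: 80, threshold=70)
try:
    outcome = reactive_agent(alarm_bell)
except TemperatureAlarm:
    outcome = "alarm handled as exception"
```

The design point mirrors the text: in the second variant the thermometer reading happens once, at the environment's side, and the agent's "if" tests only whether the bell has rung.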
5 Down to Ants: Synergy, Stigmergy, AND Symbols
Since, as regards stigmergic coordination, the research was recently summarised in [7], [8], [13] and the current results are presented in [7], here only the approach and some relevant aspects of achieving synergy through grafting symbolic processing onto sub-symbolic systems are emphasised. The AND written in capitals emphasises the similarity with the synonymous Boolean operator, i.e., synergy is searched for in all possible combinations. The MAS that relies on sub-symbolic processing more than any other is the biologically inspired ant system (AS), where the sub-symbolic echelon is represented by the pheromones, in such a way that global information is available locally. Moreover, this system is not only sub-symbolic in itself but also manifests autopoiesis (it emerges sub-symbolically), and the trouble of understanding what is in fact going on at system level is less upsetting than in the case of more familiar sub-symbolic paradigms (such as artificial neural networks or evolutionary algorithms), since ant behaviour is easier to follow due to its simplicity. Stigmergy, related to MAS, "describes a form of asynchronous interaction and information exchange between agents mediated by an 'active' environment", or "the production of certain behaviour in agents as a consequence of the effects produced in the local environment by previous behaviour". In this context: "the agents are simple, reactive, and unaware of other agents or of the emerging complex activities of the agent society; the environment is an important mechanism to guide activities of these agents and to accumulate information about ongoing activities of the whole agent society" [13].
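A deterministic toy model of the mechanism just described (our own illustration; the path names, deposit rule, and parameter values are arbitrary and are not those of the elitist ant systems discussed in [7], [13]):

```python
# Two paths between nest and food; "global information" (which path is shorter)
# ends up encoded locally, in pheromone deposited on the environment.
def run_colony(n_steps=200, evaporation=0.05):
    pheromone = {"short": 1.0, "long": 1.0}
    deposit = {"short": 1.0, "long": 0.5}    # inversely proportional to path length
    for _ in range(n_steps):
        total = sum(pheromone.values())
        # Fraction of the colony choosing each path, proportional to pheromone.
        share = {p: pheromone[p] / total for p in pheromone}
        for p in pheromone:
            pheromone[p] = pheromone[p] * (1.0 - evaporation) + share[p] * deposit[p]
    return pheromone

trails = run_colony()
# Positive feedback concentrates the trail on the short path, although no
# single (simple, reactive) agent ever compares the two paths.
```

This is exactly the sub-symbolic loop onto which the paper proposes grafting symbolic components: the environment (the pheromone map), not any agent, accumulates the society-level information.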
Whereas in [2], [10], [14] the approach was mainly based on self-organization, the present approach is an alternative one, obtaining synergy through adding symbolic processing (firstly adapting the environment and secondly instituting limited central coordination). As shown in [7], the AS manifests a threshold, which depends on problem type and complexity; the same solution quality can be obtained with fewer ants than used in common benchmarks, thus saving at least one order of magnitude of processing time. Details can be found in [13] (improvements to conventional EAS), [8] (motivation, approach, and new perspective), and [7] (experimental results about moving the threshold, in fact modifying the sigmoid function to improve efficiency). Possible scientific openings, e.g., whether in real-life problems there are instances when "many starts from four", can also be found in [7].

6 Conclusions: Beliefs, Desires, Intentions
The conclusions are presented within the BDI frame not just to keep up the atmosphere, but because: a) the conclusions are far from being apodictic and the logics preferred for MAS are doxastic; b) using the meanings given by Smets, the belief functions have rather dispersed values, and the plausibility functions have quite low values for Section 4 and some assertions of Section 3; c) the largest part of Section 3 is actually a gathering of desires; d) "intentions" is more humble than "future work"; e) if we intend to interact more keenly with agents, we have to make steps towards common ontologies, preferably based on success stories.
Beliefs:
• Despite the fall of conventional algorithms and the fast rise of sub-symbolic paradigms, symbolic processing is unavoidable in AOSE, and agent logics become necessary even outside large-scale systems.
• An essential problem in designing agents is implementing their reactivity; the main cause: current development environments admit only very poor exception handling.
• Even MAS based on the most radical sub-symbolic paradigm (stigmergy being "a-symbolic" par excellence) become more effective by grafting symbolic processing upon them.
• Taking into account the increasing weight of MAS acting as man-machine systems, the anthropocentric perspective requires that human-agent communication should be the model for agent-agent communication.
• Although the brain surrogate of current agents is still primitive, it shall have two hemispheres, as humans do. The left hemisphere, where logic is king, is designed predominantly to implement pro-activeness, whereas the right one, as the realm of its instincts, emerges sub-symbolically and is the main source of reactivity (again, similar to humans).
Desires: they are addressed to future agent logics, from an outsider (but outspoken AOSE) perspective:
• Tackle neglected problems common to all kinds of agent-based systems (dwarfs and trolls welcomed).
• Give us sectorial solutions. They are just fine to begin with. Completeness, in its polysemy, can follow. (If the MAS is sound, nobody minds if agents manifest a bit of schizophrenia.)
• Don't give us sectorial approaches. They are less applicable (e.g., time without uncertainty or vice versa).
• Let MAS be lasting, even if some agents are mortal.
• Don't condemn MAS to act synchronously. Both environment and users are too capricious to accept it. (Instead, we promise to be happy with discrete time.)
• Don't sentence us to perpetual testing. To rephrase Dijkstra: (the condition in) if is harmful. (Allow us to handle exceptions, and we promise not to exaggerate in eliminating all "ifs".)
• Help us pass the mental Rubicon separating objects from agents. (No agent is fond of being considered "intelligent and responsive like an object".)
Intentions:
• As regards stigmergic coordination, the intentions are those stated in [7]: in short, increasing "stig-synergy".
• Showing how agent reactivity can be significantly improved through exception-driven multimodal interfaces.
• Trying dialectics as an inference mechanism for the negotiation strategies used by e-commerce agents.
Acknowledgment. This paper is related to the EU-sponsored action COST-298.

References
[1] "AgentLink Roadmap: Overview and Consultation Report", AgentLink III. Agent-Based Computing, University of Southampton, 2004.
[2] Banzhaf, W., "Self-Organizing Systems", Encyclopedia of Physical Science and Technology, Academic Press, New York, 14, pp. 589-598, 2002.
[3] Bărbat, B.E., "Sisteme inteligente orientate spre agent" (Agent-Oriented Intelligent Systems), Ed. Academiei Române, Bucureşti, 467 pages, 2002.
[4] Bărbat, B.E., "The Avatar: A Pseudo-Human Extending the Genuine One", The Good, the Bad and the Irrelevant: The User and the Future of Information and Communication Technologies (L. Haddon et al., Eds.), pp. 38-42, Media Lab/University of Art and Design, Helsinki, 2003.
[5] Bărbat, B.E., "The Impact of Broad-Band Communication upon HMI Language(s) (Chapter 7). Communicating in the World of Humans and ICTs (Chapter 8)", COST Action 269. e-Citizens in the Arena of Social and Political Communication (L. Fortunati, Ed.), pp. 113-142, EUR21803, Office for Official Publications of the European Communities, Luxembourg, 2005.
[6] Bărbat, B.E., Bulz, N., "E-World and Real World. From Analysing Concepts towards Integrating Approaches", Proc. of the WOSC 13th International Congress of Cybernetics and Systems, University of Maribor in cooperation with Encyclopaedia of Life Support Systems, 1, pp. 47-57, 2005.
[7] Bărbat, B.E., Negulescu, S.C., Zamfirescu, C.B., "Human-Driven Stigmergic Control. Moving the Threshold", Proc. of the 17th IMACS World Congress (Scientific Computation, Applied Mathematics and Simulation), Paris, July 11-15, 2005.
[8] Bărbat, B.E., C.B. Zamfirescu, S.C.
Negulescu, "The Best from Ants and Humans: Synergy in Agent-Based Systems", Studies in Informatics and Control Journal, 13, 1, 47-59, 2004.
[9] Fisher, M., "Temporal Development Methods for Agent-Based Systems", Autonomous Agents and Multi-Agent Systems, 10, pp. 41-66, Springer Science + Business Media Inc., 2005.
[10] Knyazeva, H., Haken, H., "Synergetics of Human Creativity", Dynamics, Synergetics, Autonomous Agents: Nonlinear Systems Approaches to Cognitive Psychology and Cognitive Science, World Scientific, Singapore, pp. 64-79, 1999.
[11] Luck, M., McBurney, P., Preist, C., "A Manifesto for Agent Technology: Towards Next Generation Computing", Autonomous Agents and Multi-Agent Systems, 9, 203-252, Kluwer Academic Publishers, 2004.
[12] Moisil, Gr.C., "Lecţii despre logica raţionamentului nuanţat" (Lectures on the Logic of Nuanced Reasoning), Ed. Ştiinţifică şi Enciclopedică, Bucureşti, 1975.
[13] Negulescu, S.C., Bărbat, B.E., "Enhancing the Effectiveness of Simple Multi-Agent Systems through Stigmergic Coordination", Fourth International ICSC Symp. on Engineering of Intelligent Systems (EIS 2004), ICSC-NAISO Academic Press, Canada, 149 (abstract; full paper on CD-ROM enclosed), 2004.
[14] Parunak, H.V.D., Brueckner, S., Sauter, J.A., Matthews, R., "Global Convergence of Local Agent Behavior", submitted to the Fourth International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'05), 2005, available at http://www.erim.org/~vparunak/aamas05converge.pdf.
[15] Zambonelli, F., Omicini, A., "Challenges and Research Directions in Agent-Oriented Software Engineering", Autonomous Agents and Multi-Agent Systems, 9, 253-283, Kluwer Academic Publishers, 2004.

Boldur E. Bărbat
"Lucian Blaga" University of Sibiu, Faculty of Science
Address: 5-7, Ion Raţiu Street, Sibiu, 550012, România
E-mail: bbarbat@gmail.com

Sorin C.
Negulescu
"Lucian Blaga" University of Sibiu, Faculty of Engineering
Address: 4, Emil Cioran Street, Sibiu, 550025, România
E-mail: sorin_negulescu@yahoo.com

International Journal of Computers, Communications & Control
Vol. I (2006), No. 1, pp. 59-67

Personalized E-Learning Implementation: The GIS Case
Athanasios D. Styliadis, Ioannis D. Karamitsos, Dimitrios I. Zachariou

Abstract: Personalized e-learning implementation is recognized as one of the most interesting research areas in distance-learning, web-based education. In particular, GIS e-learning initiatives, which incorporate by default a number of sequencing spatial techniques (i.e., spatial-object selection and sequencing), will benefit from a well-defined personalized e-learning implementation with embedded spatial functionality. This is the case addressed in this paper. The GIS e-learning implementation introduced in the current paper is based on a set of teaching (lecturing) rules according to the cognitive style or learning preferences of both the learners and the lecturers. It is important to note that, although most of these teaching rules are generic (i.e., domain-, view-, and user-independent), there are so far no well-defined and commonly accepted rules on how the learning spatial GIS objects and techniques should be selected and how they should be sequenced to make "instructional sense" in a web-based GIS course.
Keywords: e-learning, distance learning, GIS, LIS, AM/FM, spatial sciences.

1 Introduction
GIS learning object selection is the first step towards adaptive space navigation and adaptive course sequencing with GIS (space) functionality.
Adaptive navigation seeks to present the GIS learning objects associated with an online course in an optimized order, where the optimisation criteria take into consideration the learner's background and performance in related learning objects [2], whereas adaptive course sequencing is defined as the process that selects learning objects from a digital repository and sequences them in a way appropriate for the targeted GIS learning community or individuals [16, 11, 17]. Selection and sequencing are recognized as among the most interesting research questions in intelligent web-based GIS education [13, 6]. Although many types of intelligent learning systems with no GIS functionality are available, in the proposed GIS case five key components can be identified, which are common to most GIS systems, namely: data acquisition, retrieval and analysis, preliminary data processing, database construction, and communication and visualization. Figure 1 provides a view of the interactions between these five GIS modules [18].
Figure 1: The main components of a GIS intelligent learning system
In most intelligent learning systems that incorporate course sequencing techniques, the pedagogical module is responsible for setting the principles of content selection and instructional planning. The selection of content (in our case, the GIS learning objects) is based on a set of teaching rules according to the topology-based cognitive style or learning preferences of the learners [3, 15]. Although most of these rules are generic (i.e., domain-independent), there are no well-defined and commonly accepted rules on how the GIS learning objects should be selected and how they should be sequenced to make "instructional sense" with GIS functionality [14].
Moreover, in order to design highly adaptive learning systems, a huge set of topology-based rules is required, since the dependencies between the educational characteristics of GIS learning objects and learners are rather complex [8, 12, 9, 10].
Copyright © 2006 by CCC Publications.
In this paper, the learning object selection problem in GIS intelligent learning systems is addressed by proposing a methodology that, instead of "forcing" an instructional designer to manually define the set of selection rules, produces a decision model that mimics the way the designer decides, based on the observation of the designer's reaction to a small-scale learning object selection problem. In the next (second) section the GIS learning object selection process is discussed as a part of spatial-course sequencing. The third section discusses a topology-based filtering process of GIS learning objects, used for the reduction of the learning-object searching space, and proposes GIS metadata elements that can be used for learning object filtering according to the Open GIS Consortium guidelines (standards) for GIS functionality.

2 GIS Learning Object Selection in Spatial-Course Sequencing
In automatic and semi-automatic course sequencing, the main idea is to generate a course suited to the needs of the learners. As described in the literature, two main approaches for automatic and semi-automatic course sequencing have been identified: adaptive courseware generation (ACG) and dynamic courseware generation (DCG) [3]. In ACG the goal is to generate an individualized course taking into account specific learning goals, as well as the initial level of the student's knowledge. The entire course is adaptively generated before being presented to the learner, instead of being generated incrementally, as in a traditional sequencing context.
in dcg, on the other hand, the system observes the student’s progress during his interaction with the course and dynamically adapts the course according to the specific student’s needs and requirements. if the student’s performance does not meet the expectations, the course is dynamically re-planned. the benefit of this approach is that it applies as much adaptivity to an individual student as possible. in the case of gis lecturing, both of the above-mentioned techniques employ a pre-filtering topology-based mechanism to generate a group (“pool”) of gis learning objects that match the general content requirements. this pool can be generated from both distributed and local gis learning object repositories, provided that the appropriate access controls have been granted. the filtering process is based on general requirements such as topology, gis functionality, characteristics of the teaching language, the media of the targeted gis learning objects, as well as the use of ontologies and topology for the domain in question (known as the spatial knowledge module). the result of the filtering process is a virtual pool of gis learning objects that will act as an input space for the content selector (gis functionality). after the creation of the initial pool of gis learning objects, the content selection process and the underlying topology are applied based on learner characteristics, such as accessibility and competency characteristics or even historical information about related learning activities, included in the student model module. in the next section some filtering elements based on the open gis consortium (o-gis.c) learning object metadata (lom) standard are presented and the methodology is analyzed for the proposed content selection phase of a semi-automatic gis course sequencing with spatial functionality.

3 gis learning object filtering

generally, the main goal of a filtering process is the reduction of the search space.
gis learning object repositories (g.lor) often contain hundreds of thousands of gis learning objects, thus the selection process may require significant computational time and effort. in most intelligent learning systems, learning object filtering is based either on the knowledge domain they cover, or on the media type characteristics they contain [11]. in the open gis consortium’s (o-gis.c) spatial metadata model, there exist a number of elements covering requirements such as gis functionality, topology, geometry, subject, teaching language, media type of the targeted gis learning object, etc. figure 2 presents the o-gis.c elements identified for each one of the above-mentioned topology-based filtering categories and the conditions required. alternatively, filtering can be based on the integration of the o-gis.c metadata model elements and ontologies [19, 16]. those approaches assume that both the domain model and the learning objects themselves use the same ontology [14] and limit the filtering only to knowledge domain filtering with gis functionality.

figure 2: elements for gis learning object filtering

the gis learning object selection procedure

typically, the design of highly adaptive learning systems requires a huge set of rules, since the dependencies between educational characteristics of learning objects and learners are rather complex. this complexity introduces several problems in the definition of the required rules [20, 4], namely:
• inconsistency, when two or more rules are conflicting;
• confluence, when two or more rules are equivalent;
• insufficiency, when one or more required rules have not been defined.
the proposed methodology is based on an intelligent mechanism that tries to mimic an instructional designer’s decision model on the selection of the gis learning objects.
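to make the filtering step concrete, a minimal sketch (not taken from the paper; the metadata field names are hypothetical) treats filtering as a conjunction of hard conditions over learning object metadata, reducing the search space before any selection model is applied:

```python
def filter_learning_objects(repository, requirements):
    """keep only learning objects whose metadata satisfies every hard
    requirement (e.g. topology, subject, language, media type)."""
    def matches(lo):
        return all(lo.get(field) == value for field, value in requirements.items())
    return [lo for lo in repository if matches(lo)]

# hypothetical repository of metadata records
repository = [
    {"subject": "gis", "language": "en", "media": "interactive-map"},
    {"subject": "gis", "language": "el", "media": "text"},
    {"subject": "math", "language": "en", "media": "text"},
]
pool = filter_learning_objects(repository, {"subject": "gis", "language": "en"})
```

only the surviving pool (here a single object) is passed on to the content selector, so the costly suitability evaluation runs on a fraction of the repository.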
for this purpose, a framework is designed that attempts to construct a suitability function mapping gis learning object characteristics onto learner characteristics and vice versa. the main advantage of this method is that it requires less effort from the instructional designer, since instead of identifying a huge set of rules related to space and topology, only the designer’s selection from a small set of gis learning objects over a reference set of learners is needed. the machine learning technique will then try to discover the dependence between gis learning object and learner characteristics that produces the same selection of gis learning objects per learner as the instructional designer did [1, 13, 20]. the proposed methodology does not depend on the characteristics used for learning object and learner modelling, and thus can be used to extract even complex pedagogy-related dependencies. it is obvious that since characteristics/requirements like the domain are used for filtering, the dependencies produced are quite generic, depending only on the educational characteristics of the content and the cognitive characteristics of the learner (gis student) [5, 2, 16]. figure 3 presents a graphical representation of the proposed selection model extraction framework, which consists of the following three main steps:

figure 3: selection model extraction framework

1) the 3-d modelling and selection of gis criteria

the selection methodology is generic, independent of the learning object and learner characteristics used for the selection. in the proposed method experiment, the gis learning object characteristics used were derived from the o-gis.c standard, and the learner characteristics were derived from the ims global learning consortium inc. learner information package (lip) specification. there exist many criteria affecting the decision of gis learning object selection.
those criteria that lead to a straightforward exclusion of learning objects, such as the topology, the subject, the language, the media type and the gis functionality, are used for filtering. the remaining criteria, such as the educational characteristics of gis learning objects, are used for selection model extraction, since the dependencies of those criteria can model the pedagogy applied by the instructional designer when selecting learning objects. those criteria, due to the complexity of the interdependencies between them, are the ones that cannot be directly mapped to rules by the instructional designer. thus an automatic or semi-automatic extraction method, like the proposed one, is needed.

2) the selection model extraction

after identifying the set of characteristics and criteria (see step 1) that will be used as the input space of the i/o selector, the extraction procedure for each gis learning object characteristic and the expert’s suitability evaluation model over a reference set of lip-based characterized learners is presented. the input to this phase is the o-gis.c characteristics of a reference set of learning objects, the ims lip characteristics of a reference set of learners and the suitability preference of an expert for each of the gis learning objects over the whole reference set of learners. the model extraction methodology has the following formulation: let us consider a set of learning objects, called a, which is valued by a set of criteria g = (g1, . . . , gn). the assessment model of the suitability of each gis learning object for a specific learner leads to the aggregation of all criteria into a unique criterion that we call a suitability function s(g) = s(g1, . . . , gn).
we define the suitability function as an additive function of the form

s(g) = s1(g1) + s2(g2) + . . . + sn(gn) = ∑_{i=1}^{n} si(gi)

with the following additional notation:
• si(gi): marginal suitability of the i-th selection criterion valued gi,
• s(g): global suitability of a learning object.
the marginal suitability evaluation for the criterion gi is calculated using the formula

si(x) = ai + bi · x · exp(−ci · x²),

where x is the corresponding value of the gi gis learning object selection criterion. depending on the parameters ai, bi and ci, as well as the value space of each criterion, this formula produces the two main criteria forms we have identified:
• monotonic form: when the marginal suitability of a criterion is a monotonic function.
• non-monotonic form: when the marginal suitability of a criterion is a non-monotonic function.
the calculation of the optimal values of the parameters ai, bi and ci for each selection criterion is the subject of the knowledge model extraction step. let us call p the strict preference relation and i the indifference relation. if so1 is the global suitability of a learning object o1 and so2 is the global suitability of a learning object o2, then the following properties generally hold for the suitability function s:

so1 > so2 ⇐⇒ (o1) p (o2),
so1 = so2 ⇐⇒ (o1) i (o2),

and the relation r = p ∪ i is a weak order relation. the expert’s requested information then consists of the weak order r defined on a for several learner instances. using the provided weak order relation r and based on the form definition of each learning object characteristic, we can define the suitability differences ∆ = (∆1, . . . , ∆m−1), where m is the number of learning objects in the reference set a and

∆k = sok − sok+1 ≥ 0,

depending on the suitability relation between the k-th and (k + 1)-th preferred learning objects for a specific learner of the reference set. we can introduce an error term ek for each suitability difference:

∆k = sok − sok+1 + ek ≥ 0.
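the marginal and global suitability computations above can be sketched directly (a toy illustration, not the authors’ implementation; the parameter values are made up):

```python
import math

def marginal_suitability(x, a, b, c):
    # s_i(x) = a_i + b_i * x * exp(-c_i * x^2)
    return a + b * x * math.exp(-c * x * x)

def global_suitability(criterion_values, params):
    # additive aggregation over the n criteria: s(g) = sum_i s_i(g_i)
    return sum(marginal_suitability(x, a, b, c)
               for x, (a, b, c) in zip(criterion_values, params))

# with c = 0 the form is monotonic (a + b*x); with c > 0 it rises and
# then decays, i.e. the non-monotonic form, peaking at x = 1/sqrt(2c)
monotonic = [marginal_suitability(x, 0.0, 1.0, 0.0) for x in (1, 2, 3)]
nonmono = [marginal_suitability(x, 0.0, 1.0, 1.0) for x in (0.7, 3.0)]
```

the single parametric family thus covers both criterion forms the paper distinguishes, which is why the extraction step only needs to fit (ai, bi, ci) per criterion.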
using constrained optimization techniques, we can then solve the non-linear problem:

minimize ∑_{j=1}^{m−1} (ej)²

subject to the constraints

∆j > 0 if oj p oj+1,
∆j = 0 if oj i oj+1,

for each one of the learners of the reference set. this optimisation problem leads to the calculation of the optimal values of the parameters a, b and c for each gis learning object selection criterion over the reference set of learners.

3) the extrapolation

the purpose of this third phase is to generalize the resulting marginal suitability model from the reference set of learners to all learners, by calculating the corresponding marginal suitability values for every combination of learner characteristics. this calculation is based on the interpolation of the marginal suitability values between the two closest instances of the reference set of learners. suppose that we have calculated the marginal suitabilities si(gi^l1) and si(gi^l2) of a criterion gi matching the characteristics of learners l1 and l2 respectively. we can then calculate the corresponding marginal suitability value for another learner l using interpolation, if the characteristics of learner l are mapped inside the polyhedron that the characteristics of learners l1 and l2 define, using the formula:

si(gi^l) = si(gi^l1) + [(gi^l − gi^l1) / (gi^l2 − gi^l1)] · [si(gi^l2) − si(gi^l1)], if si(gi^l2) > si(gi^l1).

let ci, i = 1, 2, . . . , n, be the intervals in which the values of each criterion (for both learning objects and learners) are found; then we call the space c = c1 × c2 × . . . × cn the global suitability surface. the calculation of the global suitability over the above-mentioned space is the addition of the marginal suitability surfaces for each of the learning object characteristics over the whole combination set of learner characteristics.
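the extrapolation step amounts to ordinary linear interpolation between the two closest reference learners; a minimal sketch (illustrative, not the authors’ code):

```python
def interpolate_marginal(g, g_l1, s_l1, g_l2, s_l2):
    """linear interpolation of a criterion's marginal suitability for a
    learner whose characteristic value g lies between the values g_l1
    and g_l2 of the two closest reference learners l1 and l2."""
    return s_l1 + (g - g_l1) / (g_l2 - g_l1) * (s_l2 - s_l1)

# a learner midway between the reference learners gets a suitability
# value midway between theirs
s_mid = interpolate_marginal(5.0, 0.0, 0.2, 10.0, 0.8)
```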
some experimental results and discussion

in order to evaluate the total efficiency of the proposed methodology, both in calculating the suitability on the training set of gis learning objects and in estimating the suitability of gis learning objects external to the reference set, we have designed an evaluation criterion, defined by:

success(%) = 100 · (correct learning objects selected) / n,

where n is the number of the desired gis learning objects from the virtual pool that will act as input to the instructional planner. it is assumed that the number of desired gis learning objects is less than the total number of gis learning objects in the input space (the learning objects pool) and that both the gis learning object metadata and the learner information metadata have a normal distribution over the value space of each criterion. additionally, the gis learning objects are classified, for both the testing and estimation sets, in two classes according to their functionality and aggregation level, since granularity is a parameter affecting the capability of an instructional designer to select learning content for a specific learner. the classification is based on the value space of the “general/aggregation level” element of the o-gis.c standard. in the rest of the paper, experimental results of the proposed methodology are presented by comparing the resulting selected gis learning objects with those selected by gis experts. also, the success of the proposed method has been evaluated both on the training set of learning objects (the training success) and on the suitability estimation of learning objects external to the reference set (the estimation success). figure 4 presents average experimental results for gis learning objects with aggregation levels 1 and 2 respectively.
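the evaluation criterion amounts to the overlap between the model’s selection and the expert’s; a short sketch with hypothetical object ids:

```python
def selection_success(model_selected, expert_selected, n):
    # success(%) = 100 * (correctly selected learning objects) / n
    correct = len(set(model_selected) & set(expert_selected))
    return 100.0 * correct / n

# the model recovers 3 of the 4 objects the expert wanted
score = selection_success(["lo1", "lo2", "lo3", "lo7"],
                          ["lo1", "lo2", "lo3", "lo4"], n=4)
```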
if it is considered that, for one learner instance, the different combinations of gis learning objects, calculated as the multiplication of the value instances of characteristics with gis functionality, lead to more than 1,000 gis learning objects, it is evident that it is almost unrealistic to assume that an instructional designer can manually define the full set of selection rules corresponding to the dependencies extracted by the proposed method, while at the same time avoiding the inconsistency, confluence and insufficiency of the produced selection rules. the proposed topology-based methodology is capable of effectively extracting dependencies between gis learning object and learner characteristics that affect the decision of an instructional designer on the gis learning object selection problem. the analysis of the results, presented in figure 4, shows that when the desired number of gis learning objects (i.e. n) is relatively small (less than 100), the learning objects selected by the extracted decision model are almost identical to those the instructional designer would select. on the other hand, when the desired number of gis learning objects is relatively large (say, about 500) the success of the selection is affected, but remains at an acceptable level (i.e. about 90%). another parameter proved to affect the selection success is the granularity of gis learning objects. granularity mainly affects the capability of an instructional designer to express selection preferences over learning objects.

figure 4: average experimental results for gis learning objects

learning objects with a small aggregation level have a higher probability of producing “gray” decision areas, where the instructional designer cannot decide which gis learning object best matches the cognitive style or learning preferences of a learner [1, 5].
in the paper’s experiments, gis learning objects with aggregation level 2, which can be small or even bigger collections of gis learning objects with aggregation level 1, appear to have a lower probability of producing indifference relations, enabling secure decisions to be made even for a bigger desired number of learning objects (n = 1,000).

4 summary and conclusions

in this paper the gis learning object selection problem is addressed for intelligent topology-based gis learning systems, by proposing a methodology that, instead of “forcing” an instructional designer to manually define the set of selection rules, produces a decision model that mimics the way the designer decides, based on the observation of the designer’s reaction to a small-scale gis learning object selection problem. hence, the proposed personalized e-learning method describes a new methodology for selecting and sequencing learning objects with gis functionality in an instructionally sensible and user-friendly way.

acknowledgements

the current paper is supported by the epeaek ii archimedes i research project (action 2.2.17), “personalized e-learning in a reusable way”, of the alexander technological educational institute of thessaloniki (atei), greece. this project is co-funded by the greek ministry of education & religious affairs and the european union.

references

[1] l. b. baruque & r. n. melo, “learning theory and instructional design using learning object”, in learning objects 2003 symposium: lessons learned, questions asked, honolulu, hawaii, usa. retrieved in september 2005 from the world wide web: http://www.cs.kuleuven.ac.be/ erikd/pres/2003/lo2003/baruque.pdf
[2] p. brusilovsky, “adaptive and intelligent technologies for web-based education”, kunstliche intelligenz journal, vol. 4, pp. 19-25, 1999.
[3] p. brusilovsky and j.
vassileva, “course sequencing techniques for large-scale web-based education”, international journal of continuing engineering education and life-long learning, vol. 13 (1/2), pp. 75-94, 2003.
[4] l. calvi & a. cristea, “towards generic adaptive systems: analysis of a case study”, in proc. of the 2nd international conference on adaptive hypermedia and adaptive web based systems, malaga, spain, 2002.
[5] k. chitwood, et al., “battle stories from the field: wisconsin online resource centre learning objects project”, in the instructional use of learning objects: online version, d. a. wiley, ed., 2000. retrieved in september 2005 from the world wide web: http://reusability.org/read/chapters/chitwood.doc
[6] p. dolog, w. nejdl, “challenges and benefits of the semantic web for user modelling”, workshop on adaptive hypermedia and adaptive web-based systems, in proc. of the 12th international world wide web conference, budapest, hungary, 2003.
[7] p. dolog, n. henze, w. nejdl & m. sintek, “personalization in distributed elearning environments”, in proc. of the 13th international world wide web conference, new york, usa, 2004.
[8] epeaek ii common european union and greek government project (2001-2006). world wide web page: http://www.epeaek.gr/epeaek/en/home.html
[9] ieee, “draft standard for learning object metadata”, ieee 1484.12.1-2002, 2002. retrieved in september 2005 from the world wide web: http://ieeeltsc.org/wg12lom/1484.12.1
[10] ims – global learning consortium, qti lite specification, ims question & test interoperability, 2002. retrieved in september 2005 from the world wide web: http://www.imsglobal.org/question/qtilite03.html
[11] kinshuk, r. oppermann, a. patel & a. kashihara, “multiple representation approach in multimedia based intelligent educational systems”, artificial intelligence in education journal, amsterdam: ios press, pp. 259-266, 1999.
[12] lsal, scorm best practices guide for content developers, 2003. carnegie mellon learning systems architecture lab. retrieved in september 2005 from the world wide web: http://www.lsal.cmu.edu/lsal/expertise/projects/developersguide
[13] g. mccalla, “the fragmentation of culture, learning, teaching and technology: implications for the artificial intelligence in education research agenda in 2010”, international journal of artificial intelligence in education, vol. 11, pp. 177-196, 2000.
[14] p. mohan, j. greer & g. mccalla, “instructional planning with learning objects”, workshop on knowledge representation and automated reasoning for e-learning systems, in proc. of the 18th international joint conference on artificial intelligence, acapulco, mexico, 2003.
[15] p. r. polsani, “use and abuse of reusable learning objects”, journal of digital information, 2003. retrieved in september 2005 from the world wide web: http://jodi.ecs.soton.ac.uk/articles/v03/i04/polsani.
[16] a. styliadis, k. pehlivanis & d. zahariou, “personalized e-learning for gis lecturing”, hawaii international conference on education, honolulu, hawaii, usa, january 6th-9th, 2006.
[17] a. styliadis, k. pehlivanis & d. zahariou, “e-learning with re-usable gis functionality”, global universities in distance education (guide) 2006 – developing a common platform for global co-operation, rome, italy, february 13th-14th, 2006.
[18] a. styliadis & k. pehlivanis, “personalized e-learning in a re-usable way: a proposed gis system design – the architecture”, international journal on engineering and applied sciences (jeas), 2005.
[19] m. s. urban & e. g. barriocanal, “on the integration of ieee-lom metadata instances and ontologies”, learning technology newsletter, vol. 5 (1), 2003.
[20] h. wu & p. de bra, “sufficient conditions for well-behaved adaptive hypermedia systems”, in proc. of the first asia-pacific conference on web intelligence: research and development, maebashi city, japan, 2001.

athanasios d.
styliadis, ioannis d. karamitsos, dimitrios i. zachariou
the atei of thessaloniki, greece
epeaek ii, the archimedes 2-2-17 project
e-mail: styl@it.teithe.gr

editor’s note about the author: athanasios d. styliadis is a professor at the department of information technology at the alexander institute of technology (atei) of thessaloniki, greece. he was born in 1956 in florina, greece, and he received a diploma in surveying engineering (aristotle university of thessaloniki, greece, 1980), an m.sc. in computer science (dundee university, scotland, 1987) and a ph.d. in cad/gis/computer modeling (aristotle university of thessaloniki, greece, 1997). he was a fellow research scholar at the department of geomatics, university of melbourne and at the center for gis and modeling (cgism), australia. prof. styliadis is the author of three books on computer graphics, human-computer interaction (hci) programming and gis-geomatics engineering, and he has over 70 journal and conference proceedings publications. he also worked at the hellenic army geographical service (hags, athens) for three years as a gis system analyst and programmer. his current research interests include: e-learning & geographic information, computer modeling, web-based caad, temporal gis systems, concurrent engineering & 3-d geometry, and digital documentation of monuments and sites.

international journal of computers, communications & control vol. ii (2007), no. 2, pp. 159-173

an enhancement of the random sequence 3-level obfuscated algorithm for protecting agents against malicious hosts

kamalrulnizam abu bakar, bernard s. doherty

abstract: with the advent of agent technology, the need for security of applications becomes a concern that is beyond classical data security capability and considerations. without proper security protection, especially against a malicious host attack, the widespread use of agent technology can be severely impeded.
the random sequence 3-level obfuscated algorithm has been proposed by the authors to improve agent security; in this paper, an enhancement to the protection level of this algorithm is proposed. the effectiveness of the obfuscation algorithm is enhanced by the addition of noise, which surrounds the true value carried by the agent with false values. a malicious host can thus at best guess the true value carried by the agent.

keywords: agent security, malicious host, spying attack, noise code.

1 introduction

security is considered an important factor for an agent used in real-world applications. the problems in agent security arise when an agent is used in open and unsecured environments [8, 3]. one kind of attack by an execution host (malicious host) is spying on the agent’s code, data and state [6, 11]. a spying attack by the malicious host may invade the agent’s privacy, especially the agent’s critical data, for example a user’s maximum budget carried by the agent. knowledge of an agent’s critical data gives a malicious host an advantage in any competition over other hosts, because the malicious host knows what is expected by the agent. for example, a customised agent is sent out (in an open and unsecured environment) to find a suitable flight with a fare price under or equal to 500 pounds. a malicious host attack based on a spying attack is to raise the offered price until it meets the maximum price that has been set by the agent’s owner, even though the normal price is much lower. a spying attack by the malicious host on an agent’s data or state is difficult to detect, because the attack doesn’t leave any detectable trace [6, 11, 5]. the executing host has to read the agent’s code, must have access to the agent’s data, and must be able to manipulate the agent’s variable data in order to execute the agent [6, 5, 8].
therefore, the executing host can see and access all of the agent’s code, including data and state, and is thus able to hide any traces of an attack, which makes any attempt to address a spying attack difficult. to reduce the likelihood of this attack succeeding, the authors have proposed the random sequence 3-level (rs3) obfuscated algorithm [1], which obfuscates the actual value of an agent’s critical data to prevent a malicious host spying attack. the rs3 obfuscated algorithm consists of multiple polynomial functions which obfuscate the actual value of the agent’s critical data to produce an obfuscated value that is meaningless to the attacker. only selected polynomial functions are used in the conversion process, and in each selected function multiple random inputs are applied. the main objective of using the rs3 obfuscated algorithm is to enable the comparison of confidential values within an unsecured remote host environment without exposing the actual confidential value to an unauthorised party, using comparison of obfuscated values rather than actual values. unfortunately, implementing the rs3 obfuscated algorithm alone may expose the algorithm to an attack, which could execute multiple copies of the algorithm many times in order to analyse it, and propose actual values that will obfuscate to give the comparison outcome sought by the malicious host. this paper proposes an enhancement of the protection level of the rs3 obfuscated algorithm by adding noise code to the agent application that executes in the remote host environment, to hide the actual obfuscated value among a number of false values. the enhancement makes it more difficult for the attacker to analyse the actual obfuscated value in order to discover the actual value of an agent’s critical data: the attacker can deobfuscate the set of values carried by the agent, but can at best guess which is the correct value.

copyright © 2006-2007 by ccc publications

this paper is organized as follows: section 2 presents an overview of the random sequence 3-level obfuscated algorithm. section 3 describes the random sequence 3-level obfuscated algorithm with noise code, the implementation of noise code to enhance the obfuscated algorithm protection level and the deobfuscation time of the rs3 obfuscated algorithm. section 4 presents the experimental results on the overhead of implementing noise code with the rs3 obfuscated algorithm. section 5 analyses the strength of the rs3 obfuscated algorithm with noise code. section 6 presents a discussion and the conclusion is presented in section 7.

2 an overview of the random sequence 3-level obfuscated algorithm

the random sequence 3-level obfuscated algorithm [1] is an algorithm designed to protect the confidentiality of an agent against a malicious host spying attack. this algorithm uses three polynomial functions for obfuscating the actual value of the agent’s critical data to an obfuscated value that is meaningless to the malicious host, in order to prevent the malicious host from spying on the agent’s critical data (figure 1 shows the obfuscation process of the rs3 obfuscation algorithm). the obfuscation method used in the random sequence 3-level obfuscated algorithm enables the execution host to execute the process of comparing its offer with the agent’s budget with both values in an obfuscated format, without the execution host having any knowledge of the actual value of the agent’s critical data. this comparing process can be done without needing deobfuscation of the data, unlike cryptographic methods, which require decryption of the data, thus revealing its value, before a comparison can be made.
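the key property (comparison without deobfuscation) can be illustrated with a deliberately simplified stand-in for the rs3 polynomials: any strictly increasing map preserves order, so the host can test its offer against the budget while seeing only obfuscated numbers. the constants below are arbitrary; the real algorithm uses three polynomial levels with multiple random inputs:

```python
def toy_obfuscate(value, a=7919, b=104729):
    # strictly increasing for a > 0, so the ordering of the original
    # values is preserved in the obfuscated domain
    return a * value + b

budget_obf = toy_obfuscate(500)       # carried by the agent
offer_obf = toy_obfuscate(450)        # computed by the host from its offer
acceptable = offer_obf <= budget_obf  # decided purely on obfuscated values
```

this is only a sketch of why obfuscated-domain comparison is possible; the rs3 algorithm’s strength comes from the randomized polynomial construction, not from this linear toy.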
although the rs3 obfuscated algorithm is able to obfuscate an agent’s critical data to make it more difficult for the malicious host to spy, the malicious host can execute multiple copies of the obfuscated algorithm in parallel in order to analyse the algorithm quickly, making this obfuscated algorithm vulnerable. this attack can be addressed by limiting the processing time available to the host before the agent is discarded [6]. however, the problem of determining an effective protection interval that can prevent the malicious host having enough time to execute multiple copies of the obfuscated algorithm also makes it difficult for this obfuscated algorithm to be implemented in real applications, where sufficient time must be allowed for legitimate processing. in order to overcome the problem of a malicious host executing multiple copies of the rs3 obfuscated algorithm and of determining an effective protection interval to protect the algorithm, noise code [9, 10] is introduced for an agent that executes in the remote host environment. the rs3 obfuscated algorithm with noise code is described in the next section.

3 the random sequence 3-level obfuscated algorithm with noise code

the objective of implementing the noise code in the agent application is to hide the actual obfuscated value among a number of fake obfuscated values, so the malicious host can at best guess at the true value of the agent’s critical data [9, 10]. the difference between the ordinary rs3 algorithm and the rs3 with noise code algorithm is in the number of obfuscated values generated and added by the master agent into the slave agent application before the slave agent is dispatched to the remote host execution environment to execute its tasks. in order to discover the true value of the agent’s critical data, the malicious host must first guess the actual obfuscated value among a number of fake obfuscated values.
any wrong decision in choosing the obfuscated value will result in using a wrong true value of the agent’s critical data. for instance, if the agent is equipped only with the actual obfuscated value x, without adding any noise code, the probability that the malicious host could discover the actual obfuscated value by searching a range of values is one, i.e. p(x) = 1.

figure 1: the rs3 obfuscation algorithm obfuscation process

however, if noise codes ei (fake obfuscated values) are added to the agent, where i = 1, 2, . . . , 100 − 1, the probability of discovering the actual obfuscated value is 1/100. the probability of guessing the actual obfuscated value becomes smaller as more noise codes are added. figure 2 illustrates the effect of introducing noise codes into the agent application. in addition, the time needed to guess the actual obfuscated value will delay the malicious host in analysing the obfuscated algorithm. therefore, the use of an effective protection interval to enhance the obfuscated algorithm protection will be less important.
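the effect of noise on the attacker’s chances can be sketched as follows (illustrative only; the fake values here are arbitrary offsets, whereas the paper derives them from fake budget values):

```python
import random

def guess_probability(num_noise_codes):
    # one true obfuscated value hidden among num_noise_codes fakes
    return 1.0 / (num_noise_codes + 1)

def build_value_pool(true_obfuscated, num_noise_codes, rng):
    """mix the true obfuscated value with fake ones, then shuffle so
    the position in the pool reveals nothing."""
    pool = [true_obfuscated] + [true_obfuscated + rng.randint(1, 10**6)
                                for _ in range(num_noise_codes)]
    rng.shuffle(pool)
    return pool

# with 99 noise codes the attacker's single-guess chance drops to 1/100
pool = build_value_pool(4064229, 99, random.Random(42))
```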
3.1 implementing the rs3 obfuscated algorithm with noise code

the operation of the rs3 obfuscated algorithm with noise code is almost the same as the operation of the rs3 obfuscated algorithm without noise code (refer to [1]). the only difference between the two obfuscated algorithms is in the number of obfuscated values generated and added by the master agent into the slave agent application before the slave agent is dispatched to the remote host execution environment. in the operation of the rs3 obfuscated algorithm without noise code, the master agent only has to obfuscate the value of the user's budget and add the obfuscated value into the slave agent application before dispatching the slave agent to execute its tasks in the remote host execution environment. however, in the operation of the rs3 obfuscated algorithm with noise code, the master agent has to generate more than one obfuscated value (the extra obfuscated values serve as noise codes) and add these obfuscated values into the slave agent application before dispatching the slave agent to execute in the remote host execution environment (see figure 3).

kamalrulnizam abu bakar, bernard s. doherty

figure 2: the effect of adding noise codes into the agent application
    Vector hostAddress = new Vector();
    double newOffer = 0;
    double bestOffer1, bestOffer2, bestOffer3;
    URL bestShop1, bestShop2, bestShop3;

    if (newObfuscationValue <= obfuscationValue1)       // fake obfuscated value
    {
        bestOffer1 = newOffer;
        bestShop1 = hostAddress;
    }
    else if (newObfuscationValue <= obfuscationValue2)  // true obfuscated value
    {
        bestOffer2 = newOffer;
        bestShop2 = hostAddress;
    }
    else if (newObfuscationValue <= obfuscationValue3)  // fake obfuscated value
    {
        bestOffer3 = newOffer;
        bestShop3 = hostAddress;
    }

figure 3: a slave agent program with noise code added (data block)

to illustrate, a noise code is a fake obfuscated value that is generated by the master agent from a fake user budget value. the fake user budget value is created by the master agent by adding a random number to, or subtracting it from, the actual user budget value. for example, say the actual user budget value is 500. to create a fake user budget value, the master agent generates a random number, e.g. 176. if the master agent chooses to add the random number to the actual user budget value, the fake user budget value becomes 676. this value is then converted to an obfuscated value to represent the fake obfuscated value. on the other hand, if the master agent chooses to subtract the random number from the actual user budget value, the fake user budget value becomes 324. the same conversion process is applied to convert this fake user budget value to a fake obfuscated value.
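the fake-budget construction just described can be sketched in java as follows. this is our own illustration, not the authors' implementation: the class and method names are hypothetical, and the rs3 obfuscation of each value (defined in [1]) is deliberately left out.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Illustrative sketch only: how a master agent might derive fake budget
// values (noise code) from the actual user budget, following the paper's
// worked example (budget 500, random number 176 -> fakes 676 and 324).
// In the real scheme every value would then be RS3-obfuscated.
public class NoiseCodeGenerator {

    private final Random random = new Random();

    // Derive one fake budget by adding or subtracting a bounded random
    // number; bounding the range keeps fakes plausible (see section 5.2).
    public long fakeBudget(long actualBudget, int maxOffset) {
        int offset = 1 + random.nextInt(maxOffset);
        return random.nextBoolean() ? actualBudget + offset
                                    : actualBudget - offset;
    }

    // Build the set of values the slave agent will carry: the true budget
    // plus (count - 1) fakes, shuffled so position reveals nothing.
    public List<Long> valuesToCarry(long actualBudget, int count, int maxOffset) {
        List<Long> values = new ArrayList<>();
        values.add(actualBudget);
        while (values.size() < count) {
            values.add(fakeBudget(actualBudget, maxOffset));
        }
        Collections.shuffle(values, random);
        return values;
    }
}
```

with a budget of 500 and a maximum offset of 200, every carried value lies in [300, 700], so no fake is trivially implausible.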
to generate more fake obfuscated values, the master agent has to generate more random numbers. once the obfuscation process in the home host is completed, the master agent dispatches the slave agent, together with all the obfuscated values generated (including the fake obfuscated values), to the remote host to execute its given tasks. in the remote host execution environment, the slave agent starts its execution process by converting any offer gathered from the remote host into an obfuscated value to be used in the comparing process. for example, if the slave agent carries 4 obfuscated values (one actual and three fake values), the slave agent has to execute 4 comparing processes. if any of the obfuscated user budget values matches the obfuscated offer value, the corresponding obfuscated user budget value is excluded from the following obfuscation stages (see figure 4). the next obfuscation stage starts by obfuscating a new remote host offer. in the authors' work, a maximum of three obfuscation stages are used to search for a flight offer (the obfuscated user budget value is compared with the first class obfuscated fare value in obfuscation stage 1, the business class obfuscated fare value in obfuscation stage 2 and the economy class obfuscated fare value in obfuscation stage 3), or fewer than three stages if all the obfuscated user budget values have been excluded.
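the staged comparing process can be sketched as follows. this is a simplified illustration of our own (hypothetical names): the rs3 obfuscation is abstracted behind a function parameter, and a simple equality match stands in for the paper's fare comparison.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.LongUnaryOperator;

// Illustrative sketch of the staged comparing process (figure-4 style):
// each stage obfuscates a fresh offer and compares it with every remaining
// obfuscated budget value; matched values are excluded from later stages.
public class ComparingProcess {

    private final List<Long> obfuscatedBudgets; // one true value + fakes
    private final LongUnaryOperator obfuscate;  // stand-in for RS3

    public ComparingProcess(List<Long> obfuscatedBudgets, LongUnaryOperator obfuscate) {
        this.obfuscatedBudgets = new ArrayList<>(obfuscatedBudgets);
        this.obfuscate = obfuscate;
    }

    // One obfuscation stage: returns true if some carried value matched
    // the obfuscated offer; any match is excluded from later stages.
    public boolean stage(long offer) {
        long obfuscatedOffer = obfuscate.applyAsLong(offer);
        return obfuscatedBudgets.removeIf(v -> v == obfuscatedOffer);
    }

    public int remaining() {
        return obfuscatedBudgets.size();
    }
}
```

with an identity stand-in for the obfuscation, `new ComparingProcess(List.of(324L, 500L, 676L), x -> x)` excludes 500 after a matching offer and leaves the two fakes for later stages.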
figure 4: illustration of the obfuscation stages and the comparing process

after completing the execution process in the remote host, the slave agent returns to its home host together with the remote host offer. the home host then extracts only the offer that fulfils the requirement of the true obfuscated value for further action.

3.2 the deobfuscation time of the rs3 obfuscated algorithm with noise code

the experiment on the deobfuscation time of the rs3 obfuscated algorithm with noise code was conducted to examine the time taken by the execution host (assumed to be the malicious host) to deobfuscate a full set of obfuscated values. the experiment was conducted using one 700 mhz personal computer with 128 mb of main memory, running the windows 98 operating system.
in this experiment, the deobfuscation time is measured from the start of the deobfuscation process for the first obfuscated value until the last obfuscated value in a set has been deobfuscated. the experiment is performed by the execution host executing the rs3 obfuscated algorithm with different input values, many times, until the matching obfuscated value is produced, and continuing until the full set of obfuscated values has been deobfuscated (see figure 5). twenty sets of obfuscated values were examined, with the number of values in each set ranging from five to one thousand. each set of obfuscated values contained one correct value, the remainder being false values.

figure 5: the deobfuscation process

the deobfuscation time results (gathered in milliseconds and then converted into seconds) of the rs3 obfuscated algorithm with noise code are shown in tables 1 and 2, and illustrated in figures 6 and 7 respectively.
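the brute-force procedure of figure 5 might be sketched as follows (our own illustration, with hypothetical names; the obfuscation function is a placeholder parameter rather than the real rs3 pipeline):

```java
import java.util.HashSet;
import java.util.Set;
import java.util.function.LongUnaryOperator;

// Illustrative sketch of the deobfuscation experiment of section 3.2:
// the (assumed malicious) execution host tries input values one by one,
// re-running the obfuscation, until every value in the supplied set of
// obfuscated values has been matched.
public class DeobfuscationExperiment {

    // Brute-force search: returns the elapsed time in milliseconds to
    // recover pre-images for the whole set of obfuscated values.
    public static long timeFullSet(Set<Long> obfuscatedValues,
                                   LongUnaryOperator obfuscate,
                                   long maxInput) {
        Set<Long> remaining = new HashSet<>(obfuscatedValues);
        long start = System.currentTimeMillis();
        for (long input = 0; input <= maxInput && !remaining.isEmpty(); input++) {
            remaining.remove(obfuscate.applyAsLong(input));
        }
        long elapsed = System.currentTimeMillis() - start;
        if (!remaining.isEmpty()) {
            throw new IllegalStateException("search range exhausted");
        }
        return elapsed;
    }
}
```

since every value in the set must be matched before the search ends, the total time grows with the number of carried values, which is consistent with the linear trend reported in tables 1 and 2.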
obfuscated values     5     10    15    20    25    30    35    40    45    50
deobfuscation time    8.95  21.57 29.42 37.69 49.96 60.71 62.53 72.46 81.66 85.58

table 1: summary statistics of the random sequence 3-level obfuscated algorithm deobfuscation time (in seconds) for small numbers of obfuscated values

obfuscated values     100    200    300    400    500    600     700     800     900     1000
deobfuscation time    180.15 376.33 581.28 779.12 982.06 1142.27 1376.64 1568.75 1765.94 1887.83

table 2: summary statistics of the random sequence 3-level obfuscated algorithm deobfuscation time (in seconds) for large numbers of obfuscated values

figure 6: the deobfuscation time of the random sequence 3-level obfuscated algorithm for small numbers of obfuscated values

from the results given in tables 1 and 2, and illustrated in figures 6 and 7 respectively, it can be seen that the deobfuscation time of the rs3 obfuscated algorithm with noise code increases linearly with the number of obfuscated values in the test set. note that even after deobfuscation, the malicious host is left with a set of values of which only one is the correct value, and has no way of knowing which one it is.

figure 7: the deobfuscation time of the random sequence 3-level obfuscated algorithm for large numbers of obfuscated values

4 the overhead of implementing noise code with the rs3 obfuscated algorithm

the experiments to measure the overhead of implementing the rs3 obfuscated algorithm with noise code were conducted using six 400 mhz sun ultra sparc 5 workstations with 128 mb of main memory. each workstation runs the solaris 8 operating system and is connected to the others by 100 mbit/s utp1 cable. all of the workstations involved in this experiment were situated in the same room. in this configuration, one of the six workstations was chosen to be the home host for the agent, and only this host had permission to manage and dispatch the agent. the remaining workstations were assumed to be remote hosts, having only the capability to receive the agent and to dispatch it back to its home host. to examine the security overhead of implementing noise code with the random sequence 3-level obfuscated algorithm in an agent-based application, times are measured from the sending of the agents to the remote hosts until the home host receives the agents back from the remote hosts. four different experiments were conducted, with one, two, three and five remote hosts, using three different kinds of agent: a plain agent2, an agent with the random sequence 3-level obfuscated algorithm (rs3) and an agent with the rs3 obfuscated algorithm with noise code (rs3n). the times were measured using the "System.currentTimeMillis()" method of the java language.
this method produces a specific instant in time with millisecond precision [7]. the experiment is performed for 20 runs3, and the time for each run is gathered in milliseconds. the average of the 20 runs is taken and converted into seconds. the result is then rounded to two decimal places, as shown in tables 4 to 7 and illustrated in figures 8 to 11 respectively. the security overheads of the random sequence 3-level obfuscated algorithm without noise (rs3) and with noise (rs3n) are compared to the plain agent in table 3. from these results, it can be seen that there is not much difference between the security overhead of rs3 without noise and that of rs3 with noise for any number of remote hosts: the highest difference is for one remote host, where the security overhead of rs3 with noise is just 12.67 % higher than that of rs3 without noise. therefore, it is concluded that rs3 with noise adds a small and acceptable overhead.

1 unshielded twisted pair category 5e
2 agents without security mechanisms
3 the results were observed to be consistent after twenty runs

number of remote hosts   plain   rs3    rs3n
1                        1.49    1.5    1.69
2                        2.45    2.52   2.67
3                        3.36    3.56   3.63
5                        5.42    5.43   5.66

table 3: summary statistics (mean times in seconds) of the random sequence 3-level obfuscated algorithm (1 cycle; 1 obfuscation value experiment (without noise code) and 1000 obfuscation value experiment (with noise code))

more detail of the results now follows: firstly the plain and rs3, then the plain and rs3n experimental results.
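the measurement procedure described above, twenty runs timed with System.currentTimeMillis(), averaged, converted to seconds and rounded to two decimal places, might be sketched like this (illustrative only; names are ours and the measured task is a placeholder):

```java
// Illustrative sketch of the timing procedure described in the text:
// each run is measured with System.currentTimeMillis(), twenty runs are
// averaged, and the result is converted to seconds and rounded to two
// decimal places. The measured task here is a placeholder Runnable.
public class OverheadTimer {

    public static final int RUNS = 20;

    // Average elapsed time of `task` over RUNS runs, in seconds,
    // rounded to two decimal places as in tables 3 to 7.
    public static double averageSeconds(Runnable task) {
        long totalMillis = 0;
        for (int run = 0; run < RUNS; run++) {
            long start = System.currentTimeMillis();
            task.run();
            totalMillis += System.currentTimeMillis() - start;
        }
        double meanSeconds = (totalMillis / (double) RUNS) / 1000.0;
        return Math.round(meanSeconds * 100.0) / 100.0;
    }
}
```

in the paper's setting the task would be one complete dispatch-and-return cycle of an agent, so the reported figures include network transfer as well as agent execution.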
number of      mean            standard error    standard deviation
remote hosts   plain   rs3     plain    rs3      plain    rs3
1              1.49    1.5     0.001    0.001    0.003    0.004
2              2.45    2.52    0.003    0.005    0.011    0.022
3              3.36    3.56    0.003    0.004    0.011    0.02
5              5.42    5.43    0.007    0.006    0.031    0.025

table 4: summary statistics of the random sequence 3-level obfuscated algorithm overhead (1 cycle and 1 obfuscation value experiment (without noise code))

number of      mean            standard error    standard deviation
remote hosts   plain   rs3n    plain    rs3n     plain    rs3n
1              1.49    1.54    0.001    0.001    0.003    0.003
2              2.45    2.52    0.002    0.007    0.011    0.031
3              3.36    3.51    0.003    0.005    0.011    0.022
5              5.42    5.44    0.007    0.007    0.031    0.031

table 5: summary statistics of the random sequence 3-level obfuscated algorithm (1 cycle and 100 obfuscation value experiment (with noise code))

figure 8: security overhead of the random sequence 3-level obfuscated algorithm (1 cycle and 1 obfuscation value experiment (without noise code))

figure 9: security overhead of the random sequence 3-level obfuscated algorithm (1 cycle and 100 obfuscation value experiment (with noise code))

from the results given in tables 4 and 5 and illustrated in figures 8 and 9 respectively, it can be seen that the mean security overhead for a plain agent is almost the same as the security overhead for rs3 and rs3n: the highest difference is just 5.95 % when comparing with rs3, and 4.46 % when comparing with rs3n. however, from table 6, illustrated in figure 10, it can be seen that as the number of noise codes is increased the difference becomes larger; the highest difference is 13.42 %, which is still considered acceptable. from table 7 and figure 11, the security overhead for both agents is almost the same as the security overhead given in tables 4 to 6, even though the number of cycles4 has now been increased to 100. these results show that the implementation of the random sequence 3-level obfuscated algorithm with noise code does increase the overhead by up to 13.42 % compared to the plain agent, but the noise code itself adds little to the overhead.
5 analysis of the strength of the rs3 obfuscated algorithm with noise code implementation

to analyse the strength of the rs3 obfuscated algorithm after the implementation of noise code, the authors have listed its vulnerabilities and the ways to address them, as follows:

4 loops that simulate the agent's tasks

number of      mean            standard error    standard deviation
remote hosts   plain   rs3n    plain    rs3n     plain    rs3n
1              1.49    1.69    0.001    0.004    0.003    0.016
2              2.45    2.67    0.003    0.006    0.011    0.028
3              3.36    3.63    0.003    0.003    0.011    0.015
5              5.42    5.66    0.007    0.006    0.031    0.028

table 6: summary statistics of the random sequence 3-level obfuscated algorithm (1 cycle and 1000 obfuscation value experiment (with noise code))

number of      mean            standard error    standard deviation
remote hosts   plain   rs3n    plain    rs3n     plain    rs3n
1              1.48    1.65    0.0003   0.002    0.001    0.008
2              2.46    2.65    0.002    0.003    0.01     0.014
3              3.45    3.67    0.007    0.005    0.03     0.021
5              5.46    5.67    0.013    0.004    0.06     0.02

table 7: summary statistics of the random sequence 3-level obfuscated algorithm (100 cycles and 1000 obfuscation value experiment (with noise code))

figure 10: security overhead of the random sequence 3-level obfuscated algorithm (1 cycle and 1000 obfuscation value experiment (with noise code))

5.1 the vulnerabilities of the rs3 obfuscated algorithm with noise code

two main weaknesses of the rs3 obfuscated algorithm with noise code have been identified.
• if the attacker (malicious host) guesses the correct obfuscated value from among the many obfuscated values (including noise code) carried by the visiting agent, and the attacker is given enough time to execute, the attacker can execute the rs3 obfuscated algorithm many times using different result values and watch the pattern of the outcomes (which result values the agent accepts and which it rejects) in order to analyse the chosen obfuscated value and discover the actual value of the agent's critical data.

• if the noise codes generated from the actual value lie far outside the range of reasonable values, the malicious host can easily omit those values and guess from among fewer values.

figure 11: security overhead of the random sequence 3-level obfuscated algorithm (100 cycles and 1000 obfuscation value experiment (with noise code))

5.2 addressing the vulnerabilities of the rs3 obfuscated algorithm with noise code

in order to overcome the weaknesses of the rs3 obfuscated algorithm with noise code, two main measures are suggested:

• to prevent the attacker (malicious host) from guessing a correct obfuscated value among many obfuscated values in a short time, and from running many tests on the rs3 obfuscated algorithm by analysing which result values the agent accepts and which it rejects, the number of noise codes added to the agent application can be increased. this is because the probability of guessing the correct obfuscated value becomes smaller as the number of noise codes added becomes larger, i.e. p(x) → 0 as n → ∞. the malicious host can then at best guess the actual value from among the noise code.
• to prevent the malicious host from being able to omit any of the noise code values, the values of the noise codes must be within a reasonable range of the actual value; this can be achieved by limiting the range of the random numbers used to generate them.

6 discussion

in this paper, the problem of a malicious host spying on the actual value of an agent's critical data, such as the user's maximum budget, has been discussed. the random sequence 3-level obfuscated algorithm, which obfuscates the actual value of an agent's critical data in order to make it more difficult for the malicious host to spy on it, had previously been proposed by the authors. to address weaknesses in the rs3 obfuscated algorithm, an enhancement has been proposed in this paper. the implementation of the rs3 obfuscated algorithm alone exposes the obfuscated algorithm to an attacker that can execute multiple copies of the obfuscated algorithm many times in order to analyse the algorithm. the agent owner also faces the problem of determining an effective protection interval for the obfuscated value carried by the agent, in order to prevent the value from being analysed and discovered by the malicious host. these problems can be overcome by introducing noise codes carried by the agent application, forcing a malicious host to guess the actual obfuscated value and thus leaving the malicious host with, at best, a guess at the actual value of the user's maximum budget. the noise code implementation also delays the malicious host in analysing the obfuscated algorithm, as shown in section 3.2. therefore, the use of an effective protection interval to enhance the level of protection of the obfuscated algorithm is less important.
on the other hand, the experimental results on the overhead of implementing the rs3 obfuscated algorithm with noise code show that the implementation does increase the overhead by up to 13.42 % compared to the plain agent, but this is considered acceptable. this suggests that the random sequence 3-level obfuscated algorithm can be implemented in "real world" agent applications to protect the agent application from spying attacks by a malicious host. the experiments showed that adding noise codes to the rs3 obfuscated algorithm gives only a small (12.67 %) increase in security overhead.

7 conclusion

the random sequence 3-level obfuscated algorithm is an algorithm that improves the level of protection of an agent against malicious host spying attacks. the obfuscated algorithm does not protect against all spying attacks by the malicious host, only attacks on the agent's critical data. however, the implementation of noise code in the agent application prevents the malicious host from discovering the actual value of the critical data carried by the agent; the malicious host can at best guess the actual value from among a number of noise values. the addition of noise code has strengthened the protection of the obfuscated algorithm and has reduced the likelihood of a successful attack on the rs3 obfuscated algorithm, with a very small increase in execution time.

references

[1] abu bakar, k. and doherty, b. s.: a random sequence 3-level obfuscated algorithm for protecting mobile agents against malicious hosts. proceedings of the 2003 international conference on informatics, cybernetics and systems. i-shou university (2003) 525–530
[2] divincenzo, d. p., leung, d. w. and terhal, b. m.: quantum data hiding. ieee transactions on information theory, vol. 48, no. 3. ieee (2002) 580–598
[3] farmer, w. m., guttman, j. d. and swarup, v.: security for mobile agents: issues and requirements. proceedings of the 19th national information system security conference.
baltimore (1996) 591–597
[4] harmsen, j. j. and pearlman, w. a.: steganalysis of additive noise modelable information hiding. proceedings of spie electronic imaging 5022. spie (2003) 21–24
[5] hohl, f.: a framework to protect mobile agents by using reference states. in: proceedings of the 20th international conference on distributed computing systems (icdcs 2000). ieee computer society (2000) 410–417
[6] hohl, f.: time limited blackbox security: protecting mobile agents from malicious hosts. in: g. vigna (ed.), mobile agents and security. lecture notes in computer science, vol. 1419. springer-verlag, berlin (1998) 92–113
[7] sun microsystems, inc.: java 2 platform std. ed. v1.3.1. http://java.sun.com/j2se/1.3/docs/api/index.html (2004)
[8] mandry, t., pernul, g. and rohm, a.: mobile agents in electronic markets: opportunities, risks, agent protection. international journal of electronic commerce. m.e. sharpe (2001) 47–60
[9] ng, s. k. and cheung, k. w.: protecting mobile agents against malicious hosts by intention spreading. in: h. arabnia (ed.), proc. international conference on parallel and distributed processing techniques and applications (pdpta'99). csrea (1999) 725–729
[10] ng, s. k. and cheung, k. w.: intention spreading: an extensible theme to protect mobile agents from read attack hoisted by malicious hosts. in: jiming liu, ning zhong (eds.), intelligent agent technology: systems, methodologies and tools. world scientific (1999) 406–415
[11] sander, t. and tschudin, c.: protecting mobile agents against malicious hosts. in: g. vigna (ed.), mobile agents and security. lecture notes in computer science, vol. 1419. springer-verlag, berlin (1998) 44–60
[12] yeh, w. h. and hwang, j. j.: hiding digital information using a novel system scheme. computers and security, vol. 20, no. 6.
elsevier science (2001) 533–538

kamalrulnizam abu bakar
faculty of computer science and information system
universiti teknologi malaysia
81310 utm skudai, johor d. t., malaysia
e-mail: kamarul@fsksm.utm.my

bernard s. doherty
school of engineering and applied science
aston university
aston triangle, birmingham b4 7et, united kingdom
e-mail: b.s.doherty@aston.ac.uk

received: july 22, 2006

kamalrulnizam abu bakar is a lecturer at universiti teknologi malaysia, malaysia. he received the diploma and degree of computer science in 1994 and 1996 respectively from universiti teknologi malaysia, malaysia. he then received a masters in computer communication and networks degree from leeds metropolitan university, united kingdom, in 1998 and a phd in network security from aston university, united kingdom, in 2004. his current research interests include computer and network security, distributed systems and parallel processing, grid computing, and wireless and cellular networks.

bernard s. doherty (born october 2nd, 1945) obtained the degrees of bachelor of engineering (electrical), bachelor of arts and master of engineering science from the university of melbourne in 1967, 1971 and 1981 respectively. he has held positions with the state electricity commission of victoria, lm ericsson pty ltd and swinburne college of technology (all in melbourne) and, since 1980, at aston university (birmingham, uk), where he is presently lecturer in computer science. his main fields of teaching and research are distributed and networked applications and information security. in addition to supervising a number of doctoral students, he has developed computer-based administration and teaching software, written a number of papers and presented papers at international conferences.

international journal of computers, communications & control vol. ii (2007), no. 3, pp.
241-251

hybrid control accommodation for water-asset management of hydraulic systems subjected to large operating conditions

eric duviella, pascale chiron, philippe charbonnaud

abstract: the hybrid control accommodation (hca) strategy was proposed to improve the water-asset management of hydraulic systems by resource allocation and setpoint assignment. hydraulic system dynamics are taken into account during the setpoint assignment step, which consists in controlling gates over large operating conditions. for hydraulic systems subjected to strong disturbances, transfer delays are variable, and thus different operating modes must be considered. a multimodelling method, associated with a transfer delay selection technique and allowing the determination of the number of models, is proposed. simulation results on the first reach of the neste canal show the effectiveness of the hca strategy.

keywords: hybrid control accommodation, resource allocation, setpoint assignment, multimodelling, water management.

1 introduction

hydrographic networks naturally convey water quantities from upstream to downstream. they are equipped with dams, catchment areas, channels, etc., and are instrumented for water resource management and the satisfaction of human activity needs. several control methods have been designed [9], and are still being designed [8, 2], to supply hydraulic systems with water quantities corresponding to the management objectives while rejecting disturbances. other techniques, such as lpv regulation [10] or a supervised internal multimodel controller [3], were recently proposed in order to handle these systems over different operating points. these control methods are accurate for local control. however, they are not designed to allocate water quantities in excess towards the catchment areas, or water quantities in shortage amongst the users.
an original proposition for the management of such situations consists in recalculating the setpoints according to the resource value (continuous dynamics) and to the resource state (discrete events) of the hydrographic system. the supervision and hybrid control accommodation strategy by resource allocation and setpoint assignment proposed in [4] allows the water-asset management to take the management constraints into account. the efficiency of the strategy has been shown by simulation in the case of an open-channel hydraulic system under large operating conditions. the problem addressed in this paper deals with the water-asset management of a hydraulic system subjected to scenarios with large disturbances. in the second section, the supervision and hybrid control accommodation strategy, well adapted to large operating conditions, is presented. in the third section, a multimodelling method is proposed for accurately determining the number of operating modes (om) necessary to represent open-surface hydraulic system dynamics. for each om, the transfer delay identification is carried out in the fourth section. finally, the effectiveness of the proposed strategy is shown by simulation on the first reach of the neste canal.

2 hybrid control accommodation

the hybrid control accommodation (hca) strategy (see figure 1) allows the computation of new setpoints for each gate g_j of a hydraulic system according to data measured at the measurement points m_i (see figure 2). the setpoints are computed according to the resource state e_i, taking into account the management objectives q_j^obj, λ_j and µ_j fixed weekly by the management objective generation module.

copyright © 2006-2007 by ccc publications

figure 1: hca strategy scheme.

figure 2: open-surface hydraulic system equipped with measurement points m_i and gates g_j.
the determination of the resource state e_i is carried out at the detection period t_e by concurrent hybrid automata (see figure 3) designed for each measurement point m_i [4]. the five pertinent states retained correspond respectively to the no-discrepancy state e_0, two states where the discharge discrepancy is either positive (e+) or negative (e−) and constant c, and two states where the discharge discrepancy is either positive (e+) or negative (e−) and not constant ¬c. transitions between states are defined as conditions on the measured discharge value and its variation:

    d_i : [ |Δq_mi| > th_i ] ,   ψ_i : [ Δq_mi < 0 ] ,   ω_i : [ |q̇_mi| < dth_i ] ,   (1)

with Δq_mi = q_mi − Σ_{j=n_i}^{n} q_j^obj, where q_mi is the measured discharge, q_j^obj is the management objective of gate j, n_i the index of the first gate downstream of the i-th measurement point m_i, n the number of gates, q̇_mi the estimated derivative of q_mi, and th_i and dth_i respectively the detection and diagnosis thresholds. the discharge discrepancy allocation is carried out according to the resource state. when the state is e_0, the gate setpoints correspond to their discharge objectives q_j^obj. when the state is e± ∧ c, the discharge discrepancy Δq_mi is allocated amongst the gates, according to their weights λ_j and µ_j, by optimizing a cost function for each measurement point. finally, when the state is e± ∧ ¬c, the discharge discrepancy is allocated to each gate in turn [4]. at each detection date k t_e, the discharge discrepancy allocation leads to the allocation vector q^k_mi:

    q^k_mi = [ 0 0 ... 0 q^k_{n_i} q^k_{n_i+1} ... q^k_n ]^T ,   (2)

figure 3: hybrid automata for the measurement point m_i.
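the state determination of equation (1) can be sketched directly in code. the following fragment is our own illustration (hypothetical names), mapping the discrepancy Δq_mi, its estimated derivative and the thresholds th_i and dth_i to the five states:

```java
// Illustrative sketch of the resource-state determination of equation (1):
// the state is e0 when the discharge discrepancy stays within the detection
// threshold; otherwise its sign (psi_i) and the magnitude of its derivative
// (omega_i) select one of the four disturbed states.
public class ResourceState {

    public enum State { E0, E_PLUS_C, E_PLUS_NOT_C, E_MINUS_C, E_MINUS_NOT_C }

    // deltaQ = measured discharge minus the sum of downstream gate
    // objectives; qDot = its estimated derivative; th, dth = detection
    // and diagnosis thresholds.
    public static State classify(double deltaQ, double qDot, double th, double dth) {
        if (Math.abs(deltaQ) <= th) {            // d_i not satisfied: no discrepancy
            return State.E0;
        }
        boolean negative = deltaQ < 0;           // psi_i
        boolean constant = Math.abs(qDot) < dth; // omega_i
        if (negative) {
            return constant ? State.E_MINUS_C : State.E_MINUS_NOT_C;
        }
        return constant ? State.E_PLUS_C : State.E_PLUS_NOT_C;
    }
}
```

running this classification at every detection period t_e for each measurement point reproduces the concurrent-automata behaviour described in the text.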
Then, to synchronize the control of the gates with the appearance of lacks or excesses of water due to the disturbances, the setpoints must be assigned at a date taking into account the transfer delays $T_{M_i,j}$ between the measurement point $M_i$ and the gate $G_j$. They are computed according to relation (3):

$$T_{M_i,j} = T_{M_i,n_i} + \sum_{r=n_i+1}^{j} t_r, \qquad (3)$$

where $t_r$ is the time necessary for the water quantity to go from gate $G_{r-1}$ to gate $G_r$ (see Figure 4). The delay $t_r$ depends on the physical characteristics of the hydraulic system and on the discharge value $q_r$. Thereafter, the allocation vector $q_{M_i}^k$ is associated with an allocation-dates vector $T$ arising from the transfer delay values $T_{M_i,j}$. Finally, taking the allocation dates into account, the setpoints are sent to the gates periodically. The control period $T_c$ is chosen as an integer multiple of $T_e$. In the next section, a multimodelling method is proposed to identify the system dynamics for several OM.

Figure 4: Time delays between $M_i$ and $G_j$.

3 Multimodelling Steps

The modelling of free-surface hydraulic system dynamics is generally carried out starting from the diffusive wave equation (4), which is obtained by simplifying the Saint-Venant equations [1]:

$$\frac{\partial q}{\partial t} + c\,\frac{\partial q}{\partial x} - d\,\frac{\partial^2 q}{\partial x^2} = 0, \qquad (4)$$

where $c$ and $d$ are respectively the celerity and diffusion coefficients. The diffusive wave equation can be linearized around an operating discharge $q_e$ [7], and the identified celerity and diffusion parameters, denoted $c_e$ and $d_e$, are expressed as:

$$c_e = \frac{1}{L^2\,\frac{\partial J}{\partial q_e}}\left[\frac{\partial L}{\partial x} - \frac{\partial (JL)}{\partial z}\right], \qquad d_e = \frac{1}{L\,\frac{\partial J}{\partial q_e}}, \qquad (5)$$

where $L$ is the surface width, $z$ is the discharge depth, and $J$ is the friction slope expressed with the Manning-Strickler relation as $J = \frac{q^2 P^{4/3}}{K^2 S^{10/3}}$, where $K$ is the Strickler coefficient, $P$ is the wetted perimeter and $S$ the wetted surface.
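Relation (3) is a cumulative sum of the inter-gate delays; paired with the detection date $kT_e$, it yields the allocation-dates vector. A small sketch (function and argument names are assumptions):

```python
# Sketch of relation (3): the delay from M_i to gate G_j is the delay to the
# first downstream gate plus the delays between successive gates.
def transfer_delays(t_first, t_between):
    """t_first  : T_{M_i,n_i}, delay from M_i to the first downstream gate.
    t_between: [t_{n_i+1}, ..., t_n], delay from each gate to the next one.
    Returns [T_{M_i,n_i}, T_{M_i,n_i+1}, ..., T_{M_i,n}]."""
    delays = [t_first]
    for t_r in t_between:
        delays.append(delays[-1] + t_r)   # cumulative sum of relation (3)
    return delays

def allocation_dates(k, Te, t_first, t_between):
    """Assignment date of each setpoint of the allocation vector at step k:
    the detection date k*Te shifted by the transfer delay to each gate."""
    return [k * Te + d for d in transfer_delays(t_first, t_between)]
```

With, say, 100 s to the first gate and 355 s and 320 s between the following gates, the delays to the three gates are 100 s, 455 s and 775 s.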
The open-channel system dynamics can be modelled by transfer functions of the form:

$$F(s) = \frac{e^{-\tau s}}{1 + w_1 s + w_2 s^2}, \qquad (6)$$

where the coefficients $w_1$, $w_2$ and the pure delay $\tau$ are computed according to the coefficients $c_e$ and $d_e$, to the open-channel system length $x$, and to the adimensional coefficient $c_L = \frac{2 c_e x}{9 d_e}$; if $c_L \le \frac{4}{9}$, then $w_2 = 0$ and $\tau = 0$; if $\frac{4}{9} < c_L \le 1$, then $w_2 = 0$.

This modelling method allows the identification of the free-surface hydraulic system dynamics with good accuracy only around an operating point. However, setpoint assignment must be done for hydraulic systems subjected to large operating conditions. Thus, the hydraulic system dynamics must be identified for different OM. Based on the previous modelling method, the multimodelling approach consists in determining the number of OM and their corresponding operating points. The celerity coefficient $c$ is a very relevant parameter of the hydraulic system dynamics. The model identified for each OM is valid as long as the error on the celerity coefficient is less than a predefined percentage $\pi_c$. Thus, a validity domain is defined for each OM, and the number of OM necessary to identify the dynamics with good accuracy is determined. To limit the switching between two OM, an interval $\Delta_c$ is shared by two successive OM validity domains. The selection of the parameters $\pi_c$ and $\Delta_c$ is carried out taking into account the system dynamics. The multimodelling steps are described by an algorithm (see Table 1), where the OM are determined starting from $c_{med}$, computed from the parameters $c_{min}$ and $c_{max}$ corresponding respectively to the minimum and maximum discharges of the system. The algorithm requires the definition of $\lfloor x \rfloor$, which corresponds to the integer part of $x$, and of $\mathrm{sign}(x) = \frac{|x|}{x}$.

Table 1: Multimodelling algorithm.

Input: $c_{min}$, $c_{max}$, $\pi_c$, $\Delta_c$. Output: $c_{id_r}$, $c_{inf_r}$, $c_{sup_r}$.
$c_{med} = \frac{c_{min} + c_{max}}{2}$, $r = 1$
for $i = \left\lfloor \frac{\ln(c_{min}/c_{med})}{\ln\frac{1+\pi_c}{1-\pi_c}} \right\rfloor + 1$ to $\left\lfloor \frac{\ln(c_{max}/c_{med})}{\ln\frac{1+\pi_c}{1-\pi_c}} \right\rfloor$:
  $c_{id_r} = \left(\frac{1+\pi_c}{1-\pi_c}\right)^i c_{med} - \mathrm{sign}(i)\,\frac{\Delta_c}{1-\mathrm{sign}(i)\pi_c} \sum_{j=1}^{|i|} \left(\frac{1+\pi_c}{1-\pi_c}\right)^{i-\mathrm{sign}(i)\,j}$
  $c_{sup_r} = (1+\pi_c)\left(\frac{1+\pi_c}{1-\pi_c}\right)^i c_{med} - \mathrm{sign}(i)\,\frac{(1+\pi_c)\Delta_c}{1-\mathrm{sign}(i)\pi_c} \sum_{j=1}^{|i|} \left(\frac{1+\pi_c}{1-\pi_c}\right)^{i-\mathrm{sign}(i)\,j}$
  $c_{inf_r} = (1-\pi_c)\left(\frac{1+\pi_c}{1-\pi_c}\right)^i c_{med} - \mathrm{sign}(i)\,\frac{(1-\pi_c)\Delta_c}{1-\mathrm{sign}(i)\pi_c} \sum_{j=1}^{|i|} \left(\frac{1+\pi_c}{1-\pi_c}\right)^{i-\mathrm{sign}(i)\,j}$
  $r \leftarrow r + 1$
endfor

Then, according to $c_{id_r}$, the water elevation $z_{id_r}$ of each $r$th OM is determined, with one-millimeter accuracy, using the numerical resolution of relation (7) with the Newton method:

$$c_{id} = \frac{\sqrt{J}\,K\,S^{5/3}}{P^{2/3}\,L^2}\left[-\frac{1}{2}\frac{\partial L}{\partial z} - \frac{L}{3P}\left(2\frac{\partial P}{\partial z} - \frac{5P}{S}\frac{\partial S}{\partial z}\right)\right]. \qquad (7)$$

Then, the water elevation value $z_{id_r}$ is used to compute the discharge $q_{id_r}$ with relation (8). The same steps are used for the parameters $c_{inf_r}$ and $c_{sup_r}$ to obtain the validity domain boundaries of the OM, $[q_{inf_r};\, q_{sup_r}[$:

$$q_{id_r} = \frac{\sqrt{J}\,K\,S^{5/3}}{P^{2/3}}. \qquad (8)$$

Finally, the celerity and diffusion coefficients $c_{id_r}$ and $d_{id_r}$ (5), and the transfer function parameters $w_1$, $w_2$ and $\tau$ (6), are computed according to the discharge $q_{id_r}$. These parameters allow the computation and selection of the transfer delays of each OM, as described in the next section.

4 Transfer Delay Identification

Hydraulic systems consist of several reaches, i.e. parts between two measurement points (see Figure 5.a), each reach being composed of open-channel reach sections (OCRS), i.e. parts between two gates, between a measurement point and a gate, or between a gate and a measurement point. Thus, the reach dynamics are modelled with a concurrent hybrid automaton CHA (see Figure 5.b) composed of several HA defined for each OCRS. This representation is directly inspired from the concurrent hybrid automata proposed in [5, 6]. In the study case, the CHA is deterministic. The input of the first automaton $HA_{o_i}$ of the first OCRS following $M_i$ is the discharge measured on $M_i$.
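The Table 1 algorithm can be sketched in Python. This is an interpretation of the printed formulas, not the authors' code; variable names are assumptions, $\mathrm{sign}(0)$ is taken as 0 so that the central mode reduces to $c_{med}$, and $c_{sup}$ and $c_{inf}$ are obtained as $(1\pm\pi_c)$ times the mode celerity:

```python
import math

def operating_modes(c_min, c_max, pi_c, delta_c):
    """Sketch of the Table 1 multimodelling algorithm: returns a list of
    (c_inf, c_id, c_sup) triplets, one per operating mode, built outwards
    from the central celerity c_med = (c_min + c_max) / 2."""
    c_med = (c_min + c_max) / 2.0
    rho = (1.0 + pi_c) / (1.0 - pi_c)          # ratio (1+pi_c)/(1-pi_c)
    i_lo = math.floor(math.log(c_min / c_med) / math.log(rho)) + 1
    i_hi = math.floor(math.log(c_max / c_med) / math.log(rho))
    modes = []
    for i in range(i_lo, i_hi + 1):
        s = (i > 0) - (i < 0)                  # sign(i), with sign(0) = 0
        series = sum(rho ** (i - s * j) for j in range(1, abs(i) + 1))
        shift = s * delta_c / (1.0 - s * pi_c) * series
        c_id = rho ** i * c_med - shift        # mode celerity
        modes.append(((1.0 - pi_c) * c_id, c_id, (1.0 + pi_c) * c_id))
    return modes
```

For example, `operating_modes(1.0, 3.0, 0.15, 0.1)` yields four modes whose central mode celerity is $c_{med} = 2$, with mode celerities increasing monotonically.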
The input of the second automaton, $HA_{o_i+1}$, is the discharge downstream of $HA_{o_i}$ minus $q_{n_i}$, the discharge setpoint assigned to the gate between these two OCRS.

Figure 5: (a) A canal reach and (b) its modelling by a concurrent hybrid automaton (CHA).

The OCRS dynamics (see Figure 6.a) are modelled using the multimodelling method described in the previous section, and are represented by an HA (see Figure 6.b). Each state corresponds to one OM identified as a transfer function $F_l$ given by relation (6). The transition conditions are defined according to the upstream discharge $q_o$ and the $l$th OM boundaries $q_{inf_l}$ and $q_{sup_l}$.

Figure 6: (a) OCRS with index $o$ and (b) its modelling by a hybrid automaton (HA).

For each OM, the transfer delay $t_r$ is obtained from the step response of the corresponding transfer function. It is chosen as the time value at which $\pi_q$ percent of the step is reached. The percentage $\pi_q$ can be tuned from simulation. In the next section, the HCA strategy is used to valorize the water quantities of a hydraulic system subjected to large operating conditions.

Table 2: Geometrical characteristics of each OCRS.

OCRS | B [m] | f | X [m] | J [rad] | q_min [m³/s] | q_max [m³/s]
1 | 5.73 | 0.79 | 204 | 7·10⁻⁴ | 0.8 | 10
2 | 5.09 | 0.96 | 702 | 7·10⁻⁴ | 0.8 | 9
3 | 5.21 | 0.95 | 562 | 6·10⁻⁴ | 0.7 | 7
4 | 3.72 | 0.94 | 1360 | 5·10⁻⁴ | 0.6 | 5

5 Simulation Results

The effectiveness of the proposed strategy is shown by simulation on the first reach of the Neste canal located in Gascogne, a region of southwestern France. It is composed of one measurement point $M_1$, three gates $G_1$ to $G_3$ and one output considered as a non-controlled gate $G_4$ (see Figure 7). This canal reach is composed of four OCRS with trapezoidal profiles characterized by the bottom width $B$, the average fruit of the banks $f$, the profile length $X$ and the reach slope $J$ (see Figure 8).
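The transfer delay $t_r$ of Section 4 is read off the step response of $F(s)$: the time at which the response reaches the fraction $\pi_q$ of its final value, shifted by the pure delay $\tau$. When $w_2 = 0$ the lag is first order and the delay has a closed form; when $w_2 \ne 0$ and the lag is overdamped ($w_1^2 \ge 4 w_2$), the step response also has a closed form and $t_r$ can be found by bisection. A sketch (function names are assumptions, and the overdamped case is the only one handled):

```python
import math

def transfer_delay(w1, w2, tau, pi_q=0.63):
    """Time at which the unit-step response of
    F(s) = exp(-tau*s) / (1 + w1*s + w2*s^2) reaches the fraction pi_q
    of its final value. Assumes w2 = 0 or an overdamped lag (w1^2 >= 4*w2)."""
    if w2 == 0:
        # first-order lag: y(t) = 1 - exp(-t/w1)
        return tau - w1 * math.log(1.0 - pi_q)
    # overdamped second order: two real time constants T1 > T2
    disc = math.sqrt(w1 * w1 - 4.0 * w2)
    T1, T2 = (w1 + disc) / 2.0, (w1 - disc) / 2.0
    def y(t):  # closed-form unit-step response (monotonically increasing)
        return 1.0 - (T1 * math.exp(-t / T1) - T2 * math.exp(-t / T2)) / (T1 - T2)
    lo, hi = 0.0, 50.0 * T1
    while hi - lo > 1e-6:          # bisection on the crossing time
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if y(mid) < pi_q else (lo, mid)
    return tau + lo
```

With $\pi_q = 63\,\%$ this reproduces the order of magnitude of the delays identified later (Table 3), e.g. roughly 160 s for $w_1 = 161$, $w_2 = 0$, $\tau = 0$.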
The geometrical characteristics and operating conditions of each OCRS are detailed in Table 2. The fourth OCRS, between $G_3$ and $G_4$, is not modelled, because gate $G_4$ is not locally controlled. In the case of a trapezoidal profile, the celerity and diffusion coefficients $C$ and $D$ are expressed by the following relations:

$$C = \frac{q}{L^2}\left[-f + \frac{L}{3}\left(\frac{2B}{Pz} + \frac{5L}{S} - \frac{2}{z}\right)\right], \qquad D = \frac{q}{2LJ}, \qquad (9)$$

where $L = B + 2fz$, $S = zB + fz^2$, $P = B + 2z\sqrt{1+f^2}$.

Figure 7: First reach of the Neste canal.
Figure 8: Geometrical characteristics of a trapezoidal profile.

The multimodelling steps, carried out with $\pi_c = 15\,\%$ and $\Delta_c = 0.1$, lead to the determination of three OM for each OCRS. Their corresponding transfer function parameters are given in Table 3. The transfer delays $t_r$ are computed with a value $\pi_q = 63\,\%$. This value, chosen by simulation, leads to the best water quantity valorization.

Table 3: Multiple models of each OCRS.

OCRS | q [m³/s] | q_min [m³/s] | q_max [m³/s] | w_1 | w_2 | τ | t_r [s]
1 | 1.6 | 0.8 | 2.4 | 161 | 0 | 0 | 160
1 | 3.2 | 1.9 | 5.2 | 128 | 0 | 0 | 130
1 | 7.7 | 4.3 | 10 | 100 | 0 | 0 | 100
2 | 1.4 | 0.8 | 2.2 | 424 | 24331 | 145 | 575
2 | 3 | 1.7 | 4.9 | 364 | 0 | 89 | 450
2 | 7.4 | 4.1 | 9 | 355 | 0 | 0 | 355
3 | 1.2 | 0.7 | 1.9 | 377 | 0 | 129 | 505
3 | 2.4 | 1.5 | 4 | 357 | 0 | 29 | 385
3 | 5.8 | 3.2 | 7 | 320 | 0 | 0 | 320

The Neste canal is supplied with water corresponding to an objective discharge equal to 7 m³/s. The objective discharges and the weights of each gate are displayed in Table 4. The canal is subjected to strong withdrawals of 3 m³/s and 4.5 m³/s upstream of $M_1$ (see Figure 9.a). This scenario was selected among several simulated scenarios because it accurately reveals the inversion phenomenon of the discharge tendency. The goals of the HCA strategy consist in the allocation of the water resource amongst the gates according to their weights, and in the minimization of the discharge discrepancy at gate $G_4$. The detection and control periods are selected as $T_e = 12$ s and $T_c = 120$ s.
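Relation (9) is directly computable once the water depth $z$ is known. The sketch below is a direct transcription of (9); the depth used in the test is an arbitrary illustrative value, not one taken from the paper:

```python
import math

def trapezoidal_celerity_diffusion(q, z, B, f, J):
    """Celerity C and diffusion D of relation (9) for a trapezoidal profile
    of bottom width B, bank fruit f and slope J, at discharge q and depth z."""
    L = B + 2.0 * f * z                        # surface width
    S = z * B + f * z * z                      # wetted surface
    P = B + 2.0 * z * math.sqrt(1.0 + f * f)   # wetted perimeter
    C = q / L**2 * (-f + L / 3.0 * (2.0 * B / (P * z) + 5.0 * L / S - 2.0 / z))
    D = q / (2.0 * L * J)
    return C, D
```

For the first OCRS geometry (B = 5.73 m, f = 0.79, J = 7·10⁻⁴) at q = 1.6 m³/s and an assumed depth of 1 m, this gives a positive celerity of a few tenths of m/s and D = q/(2LJ) ≈ 156 m²/s.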
In Figure 9, the simulation results are shown as a continuous line when the transfer delay selection method is used, and as a dashed line when the transfer delays are fixed to 100 s for the first OCRS, 355 s for the second and 320 s for the third. These fixed transfer delay values are selected according to the operating point of each OCRS, i.e. 7 m³/s, 5 m³/s and 4 m³/s; they correspond to the third OM of each OCRS.

Figure 9: Discharge $q_{M_1}$ (continuous line) and objective discharge $q_{M_1 obj}$ (dotted line) (a), resource states (b), setpoints $q_j$ assigned (continuous line when the delay selection method is used, dashed line otherwise) and objective discharges $q_{obj}^j$ (dotted line) to $G_1$ (c), $G_2$ (d), $G_3$ (e), and discharge at $G_4$ (f).

Table 4: Gate objective discharges $q_{obj}^j$, gate weights $\lambda_j$ and $\mu_j$, and minimum and maximum discharges $q_{min}^j$ and $q_{max}^j$.

Gate | $G_1$ | $G_2$ | $G_3$ | $G_4$
$q_{obj}^j$ [m³/s] | 2 | 1 | 2 | 2
$\lambda_j$ | 10 | 10 | 4 | −
$\mu_j$ | 10 | 4 | 10 | −
$q_{min}^j$ [m³/s] | 0.02 | 0.1 | 0.02 | 0.15
$q_{max}^j$ [m³/s] | 3.6 | 4.5 | 3.5 | 3

Table 5: Criteria computed when the transfer delay selection method is or is not used.

Criterion | min($q_4$) [m³/s] | max($q_4$) [m³/s] | V [m³]
With selection method | 1.9778 | 2.0451 | 541
Without selection method | 1.9763 | 2.0455 | 602

Figure 9 displays the discharge $q_{M_1}$ in (a), the corresponding resource states in (b), the setpoints assigned at gates $G_1$ in (c), $G_2$ in (d), $G_3$ in (e), and the resulting discharge at $G_4$ in (f). The resource lack measured on $M_1$ is allocated to gates $G_1$ and $G_3$ as long as their setpoints remain above their respective minimum discharge characteristics $q_{min}^j$; otherwise it is allocated to gate $G_2$ (see Figures 9.c, 9.d and 9.e).
Thus, the gates $G_1$ and $G_3$ are controlled between the 1st and 4th hours, and the gate $G_2$ between the 4th and 7th hours. During the whole simulation time, the discharge at $G_4$ is close to the objective value of 2 m³/s (see Figure 9.f). However, when the transfer delay selection method is not used, the setpoints are always assigned too early. Consequently, at $G_4$ the discharge is in excess when the water resource decreases on $M_1$, and in lack when the water resource increases on $M_1$: the tendency of the discharge $q_{M_1}$ is inverted at $G_4$, the end of the canal reach. The use of the transfer delay selection method improves the performance of the HCA strategy and maintains the tendency. The maximum and minimum discharges reached at $G_4$ and the water volume $V$ which was not allocated are displayed in Table 5. The maximum discharge discrepancy at $G_4$ corresponds to 2.26 % of the objective discharge $q_{4obj}$ when the transfer delay selection method is used, and to 2.28 % otherwise.

6 Conclusion

The HCA strategy is adapted to valorize the water resource by allocation and setpoint assignment for open-surface hydraulic systems submitted to strong disturbances. A multimodelling method is proposed to identify the open-surface hydraulic systems by a determined number of linear models. Then, a transfer delay selection method is proposed to take variable transfer delays into account by selecting the right setpoint assignment date according to the system dynamics and to the measured discharge. Finally, the performance of the HCA strategy is shown by simulation on the first reach of the Neste canal subjected to large operating conditions and strong disturbances. The proof of control stability of the HCA strategy is a future goal.

Bibliography

[1] Chow, V. T., D. R. Maidment and L. W. Mays. Applied Hydrology. McGraw-Hill, New York, Paris, 1988.

[2] Choy, S. and E. Weyer.
Reconfiguration schemes to mitigate faults in automated irrigation channels. CDC-ECC'05, 44th IEEE Conference on Decision and Control and European Control Conference, Seville, Spain, 12-15 December, pp. 1875-1880, 2005.

[3] Duviella, E., P. Charbonnaud, P. Chiron and F. Carrillo. Supervised internal multi-model control of a dam-gallery open-channel system. CDC-ECC'05, 44th IEEE Conference on Decision and Control and European Control Conference, Seville, Spain, 12-15 December, pp. 1887-1892, 2005.

[4] Duviella, E., P. Chiron, P. Charbonnaud and P. Hurand. Supervision and hybrid control accommodation for water-asset management. Control Engineering Practice, Vol. 15, Issue 1, pp. 17-27, 2007.

[5] Funiak, S., L. J. Blackmore and B. C. Williams. Gaussian particle filtering for concurrent hybrid models with autonomous transitions. CSAIL Memo, 2006.

[6] Hofbaur, M. W. and B. C. Williams. Hybrid diagnosis with unknown behavioural modes. International Workshop on Principles of Diagnosis (DX-02), Semmering, Austria, May 2-4, pp. 97-105, 2002.

[7] Litrico, X. and D. Georges. Robust continuous-time and discrete-time flow control of a dam-river system. (I) Modelling. Applied Mathematical Modelling, Vol. 23, pp. 809-827, 1999.

[8] Litrico, X. and V. Fromion. Design of structured multivariable controllers for irrigation canals. CDC-ECC'05, 44th IEEE Conference on Decision and Control and European Control Conference, Seville, Spain, 12-15 December, pp. 1881-1886, 2005.

[9] Malaterre, P. O., D. C. Rogers and J. Schuurmans. Classification of canal control algorithms. Journal of Irrigation and Drainage Engineering, Vol. 124, Issue 1, pp. 3-10, 1998.

[10] Puig, V., J. Quevedo, T. Escobet, P. Charbonnaud and E. Duviella. Identification and control of an open-flow canal using LPV models. CDC-ECC'05, 44th IEEE Conference on Decision and Control and European Control Conference, Seville, Spain, 12-15 December, pp. 1893-1898, 2005.
Eric Duviella
École des Mines de Douai, Département Informatique et Automatique
941, rue Charles Bourseul, BP 10838, 59508 Douai Cedex, France
Phone: 33.3.27.71.21.02; Fax: 33.3.27.71.29.80
E-mail: duviella@ensm-douai.fr

Pascale Chiron
Laboratoire Génie de Production, École Nationale d'Ingénieurs de Tarbes
47, avenue d'Azereix, BP 1629, 65016 Tarbes Cedex, France
Phone: 33.5.62.44.27.69; Fax: 33.5.62.44.27.08
E-mail: pascale.chiron@enit.fr

Philippe Charbonnaud
Laboratoire Génie de Production, École Nationale d'Ingénieurs de Tarbes
47, avenue d'Azereix, BP 1629, 65016 Tarbes Cedex, France
Phone: 33.5.62.44.27.34; Fax: 33.5.62.44.27.08
E-mail: philippe.charbonnaud@enit.fr

Received: January 29, 2007. Revised: May 22, 2007.

Eric Duviella was born in France in 1978. In 2001, he received the diploma in engineering from the national engineering school of Tarbes, France. He received the Ph.D. degree from the Institut Polytechnique, Toulouse, France, in 2005. Since 2007, he has been an assistant professor in the Département Informatique et Automatique of the École des Mines de Douai, France. He has published around 15 papers in journals or international conferences. His research interests include modelling, hybrid dynamical systems, supervision, and reactive control strategies.

Pascale Chiron was born in France in 1961. She has been an assistant professor at the École Nationale d'Ingénieurs de Tarbes, France, since 2000. She obtained her PhD in 1989 at the École Centrale de Nantes on "Matching and similarity criterion in medical imaging". She is now involved in the theme "Planning, control, supervision and distributed simulation" of the "Automated Production" group of the Laboratoire Génie de Production. Her domains of interest are modelling and simulation for system control.

Philippe Charbonnaud was born in France in 1962.
Since 2003, he has been a full professor at the École Nationale d'Ingénieurs de Tarbes. He is coordinator of the Spanish-French project GEPREDO, dealing with the management of hydrographical networks and the predictive management of the water resource. The main contribution lies in the development of a hybrid control method applied to distributed systems. His main topics of interest concern real-time decision support systems, and more particularly the supervision and control accommodation of distributed systems.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 684-692

Advanced Information Technology Support of Improved Personalized Therapy of Speech Disorders

M. Danubianu, S. G. Pentiuc, I. Tobolcea, O. A. Schipor

Mirela Danubianu, Stefan Gheorghe Pentiuc, Ovidiu Andrei Schipor
"Ştefan cel Mare" University of Suceava
Romania, 720229 Suceava, 13 Universităţii
E-mail: {mdanub, pentiuc, schipor}@eed.usv.ro

Iolanda Tobolcea
"Alexandru Ioan Cuza" University of Iaşi
Romania, 700506 Iaşi, 11 Bulevardul Carol I
E-mail: itobolcea@yahoo.com

Abstract: One of the key challenges of the sustainable development strategy adopted by the European Council in 2006 is related to public health, whose general objective envisages a good level of public health. One of its specific targets includes better treatment of diseases. It is true that there are affections which by their nature do not endanger the life of a person; however, they may have a negative impact on her/his standard of living. Various language or speech disorders are part of this category, but if they are discovered and treated in due time, they can often be corrected. The difficulty for researchers and therapists is to identify those children whose disorders show a wide range of issues that cannot be solved spontaneously or which may lead to further significant deficiencies.
In recent years, information technology has been used by specialists to assist and supervise speech disorder therapy. Consequently, they have collected a considerable volume of data about the personal or familial anamnesis, regarding various disorders, or regarding the process of personalized therapies. These data can be used in data mining processes that aim to discover interesting patterns which can help the design and adaptation of different therapies in order to obtain the best results with maximum efficiency. The aim of this paper is to present the LOGO-DM system. This is a data mining system that can be associated with the TERAPERS system in order to use the data from its database as a source for analysis and to provide new information supporting an improved system of therapy. Through the use of appropriate data mining techniques, LOGO-DM makes predictions on the evolution and final status of patients undergoing therapy, and enriches the knowledge base of the expert system embedded in TERAPERS.

Keywords: personalized therapy, data mining, classification, clustering, association rules.

1 Introduction

Various forms of speech disorders affect an important percentage of people. These are affections which, by their nature, do not endanger the life of a person, but may have a negative impact on her/his standard of living. Discovered and treated in due time, they can be corrected, most often during childhood. The use of information technology to assist and supervise speech disorder therapy allows specialists to collect a considerable volume of data about the personal or familial anamnesis, regarding various disorders, or regarding the process of personalized therapy.

Copyright © 2006-2010 by CCC Publications

Even if these data can provide plenty of statistical information, little useful knowledge can be obtained from them directly.
In order to get such useful knowledge, it is necessary to discover patterns in the data regarding the common characteristics of children with different types of diagnosis, the connection between antecedents, personal and family behaviour and the evolution of the child, or the connection between the anamnesis and the response to different types of treatments or to different phases of the therapeutic process. These patterns are used to establish a future strategy that maximizes the benefits of the therapy and minimizes its costs.

What are speech disorders? A speech disorder is a problem with fluency, voice, and/or the way a person utters speech sounds. Classifying speech into normal and disordered is complex, because statistics point out that only 5% to 10% of the population has a completely normal manner of speaking; all others suffer from one disorder or another. The most common speech disorders are: stuttering, cluttering, voice disorders, dysarthria and speech sound disorders. Speech disorder therapy should begin as soon as possible: children enrolled in therapy early in their development (younger than 5 years) tend to have better outcomes than those who begin therapy later. During the therapy, speech therapists use a variety of strategies, including oral motor or feeding therapy, articulation therapy and language intervention activities [2]. During the language intervention activities, the therapist interacts with a child by playing and talking. He may use pictures, books, objects, or ongoing events to stimulate language development. The therapist may also model correct pronunciation and use repetition exercises to build speech and language skills.
In the area of speech disorders, several European projects have been developed as part of the EU Quality of Life and Management of Living Resources program, such as the OLP (Ortho-Logo-Paedia) project [8], STAR (Speech Training, Assessment, and Remediation) [12] [19], SpeechViewer III developed by IBM [11], and ARTUR (ARticulation TUtoR) [17] [18]. Currently, the priorities at the international level focus on the development of information systems that can provide a personalised therapy. At the national level, little research has been conducted on the therapy of speech impairments [13]. The TERAPERS project [1] [2], developed with financial support granted by the National Agency for Scientific Research (contract ref. no. 56-CEEX-II03/27.07.2006) by the Research Center for Computer Science of the University "Stefan cel Mare" of Suceava, aims to assist and support speech disorder therapists in their efforts to develop personalized programs for the therapy of dyslalia.

2 Data Mining and Its Application in the Logopaedic Area

Data mining is defined as the process of discovering non-obvious and potentially useful patterns in large data volumes. As an exploration and analysis technique for large amounts of data meant to detect patterns or rules with a specific meaning, data mining may facilitate the discovery, from apparently unrelated data, of relationships that can anticipate future problems or might solve the studied problems. Data mining represents one phase in the complex process of knowledge discovery in databases (KDD) [5]. According to CRISP-DM [15], the reference model for this process, KDD consists of a sequence of steps, presented in Figure 1. Using appropriate methods, data mining can solve two broad categories of problems: prediction and description [10] [14]. The most used methods for prediction are classification and regression, and for description, clustering, deviation detection and association rules.
The specific logopaedic tasks performed by data mining fall into the following categories [3]:

• Classification, which places people with different speech impairments in predefined classes. Thus it is possible to track the size and structure of various groups. Classification, based on the information contained in many predictor variables, such as personal or familial anamnesis data or lifestyle data, can be used to assign patients to different segments.

• Clustering, which groups people with speech disorders on the basis of the similarity of different features. It is an important task because it helps therapists understand their patients. Clustering aims at finding subsets of a predetermined segment with homogeneous behaviour towards various methods of therapy, which can then be effectively targeted by a specific therapy; it is not based on a previous definition of groups.

• Association rules, which aim to find associations between different data which seem to have no semantic dependence. They may be a way to determine why a specific therapy program has been successful on one segment of patients with speech disorders and ineffective on another.

Figure 1: CRISP-DM process of knowledge discovery in databases.

To conclude, data mining can be a useful tool. Still, there is a limitation we have to consider. Data mining applications generate information by analyzing patterns in data obtained from the systems which assist and supervise speech therapy. Such patterns can help predict the evolution of the individuals currently in the process of therapy, or design a scheme of an appropriate therapy for them. However, data mining technology cannot provide information about impairments, people or behaviors that are not found in the databases that provide the data for analysis.
3 LOGO-DM System

3.1 Objectives

The idea of trying to improve the quality of logopaedic therapy by applying data mining techniques started from the TERAPERS project developed within the Research Center for Computer Science of the University "Stefan cel Mare" of Suceava. This project set out to develop a system able to assist speech therapists in their therapy of dyslalia and to assess how the patients respond to various personalized therapy programs. Since March 2008, the system has been used by the therapists of the Regional Speech Therapy Center of Suceava.

At present, because of the limited time and the economical aspects involved, information regarding the therapy of each particular case is of interest [4]: what is the predicted final state for a child, what will be his/her state at the end of the various stages of therapy, which exercises are best for each case and how the children can focus their effort to effectively solve these exercises, or how the family receptivity, an important factor in the success of the therapy, is associated with other aspects of the family and personal anamnesis. All this may be the subject of predictions obtained by applying data mining techniques to data collected by a computer-based therapy system. It is also interesting for part of the knowledge discovered by the data mining algorithms to be used to enrich the knowledge base of the embedded expert system. To achieve this goal we propose the development of the LOGO-DM system.
Consequently, its objectives are:

• analysis of the collected data and their preprocessing in order to assure a proper quality for the data mining algorithms;
• feature selection, for the elimination of irrelevant or redundant features;
• the use of suitable data mining methods and algorithms that can be applied to find models which answer the problems raised in speech disorder therapy;
• model evaluation and validation on new cases;
• finding new rules which can enrich the knowledge base of the expert system embedded in TERAPERS.

3.2 System Architecture

Data mining aims at deriving knowledge from data. The architecture of a data mining system plays an important role in the efficiency with which data is mined. Considering the characteristics of the domain, we have proposed a two-tier client-server architecture for the system, presented in Figure 2.

Figure 2: LOGO-DM architecture.

On the client side there is the user interface (GUI), which allows the user to communicate with the system in order to select the task to perform and to select and submit the datasets on which data mining needs to be applied. Pattern evaluation and the post-processing step, consisting of pattern visualization, are also performed on the client. The knowledge base is the module where the background knowledge is stored. The more difficult computational tasks of the data mining operations are carried out on the server. Here, the data mining kernel contains modules able to perform classification and association rule detection. Additionally, the data pre-processing module allows data to become suitable for applying data mining algorithms.

3.3 Some Aspects Regarding the System Implementation

It is well known that the best results of data mining algorithms are obtained by applying them to data in data warehouses.
In this case, however, the development of a data warehouse is not appropriate, so a database that contains data collected from the different speech therapists' offices is used as the primary source of data. In order to choose the right solution for the implementation of the system, we have analyzed the available data, both in structure and in content. We started from a scheme with over 60 tables and, after deleting the tables with content irrelevant to the intended purpose, we obtained 27 tables as the underlying tables for the final data set, as presented in Figure 3. Content analysis can reveal interesting issues related to data quality or to the need for transformation. We made a first assessment of data quality through the following measures: completeness, conformity, accuracy, consistency and redundancy. The mechanisms provided by the database management system used have imposed a minimum, controlled redundancy and have assured data consistency. The values stored in the fields correspond to reality, but unfortunately some records miss data useful for the analysis. Therefore it is necessary to fill the data gaps and, where this is not possible, the removal of the record is suggested for accurate results.

Figure 3: The useful part of the database schema.

The data retained for the analysis are subjected to the following types of transformation: transformations of the structure, and changes aimed at values. Structural transformations are dictated by the fact that there are fields in the database containing data related to a complex of features to be addressed individually in the analysis. Value transformations refer to the replacement of data coded by rules with descriptive values of the characteristics, allowing effective storage and rapid interpretation of the results. An example of these transformations is the following. An issue addressed in the anamnesis form is related to the skills of the child.
In Figure 4 we can see that there is a complex of skills of interest (verbal, perceptual, numeric, psycho-motor or special skills). In the database, all these skills are stored in two distinct fields: one for general skills, which groups data regarding verbal, perceptual, numeric, psycho-motor and intelligence skills, and one for special skills (Figure 4: sample of anamnesis data; Figure 5). The field called 'aptitudini' is numeric and is represented in the table by a string of five bits, as shown in Figure 5. These bits, positioned from left to right, have the following meaning:
• the first bit: verbal skills (1 present, 0 absent);
• the second bit: perceptual skills (1 present, 0 absent);
• the third bit: numeric skills (1 present, 0 absent);
• the fourth bit: psycho-motor skills (1 present, 0 absent);
• the fifth bit: intelligence (1 normal intelligence, 0 mental deficiency).
(Figure 5: data to be transformed.) Since all these attributes may affect the analysis, it is desirable that they be addressed individually and explicitly in the final data set. For this purpose the original table structure is changed and the values are converted to descriptive values, as in Figure 6. These changes have led to a modified form of the relational database used by TERAPERS. In the first phase, the target data sets for each of the methods to be applied in the system are constructed through the application of relational expressions like the one presented in (1):

\Pi_{I_i}(T_1 \bowtie T_2 \bowtie \ldots \bowtie T_k)   (1)

where:
• I_i is a superset of the attributes regarding the characteristics useful for each method;
• T_1, ..., T_k is the set of tables containing the attributes in the projection list.
(Figure 6: transformed data.) Each of these expressions was implemented in SQL and has generated intermediate tables.
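The structural transformation of the 'aptitudini' field described above can be sketched in Python as follows (the attribute and value names are illustrative; only the five-bit layout is taken from the text):

```python
# Split the five-bit 'aptitudini' string into five explicit descriptive
# attributes, left to right, as described in the paper. The column names
# used here are hypothetical.
SKILL_NAMES = ("verbal", "perceptual", "numeric", "psycho_motor", "intelligence")

def decode_aptitudini(bits):
    """Map a 5-bit string onto individual, descriptively valued attributes."""
    assert len(bits) == 5 and set(bits) <= {"0", "1"}
    values = {}
    for name, b in zip(SKILL_NAMES, bits):
        if name == "intelligence":
            values[name] = "normal" if b == "1" else "mental deficiency"
        else:
            values[name] = "present" if b == "1" else "absent"
    return values

print(decode_aptitudini("10101"))
```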
For example, the target data set necessary to establish the profile of children with speech disorders can be obtained by joining the tables which contain general data about the children, family and personal anamnesis, data on the complex evaluation, and the associated diagnosis. The statement that performs this is presented in (2); the result is a table that contains 129 features:

    CREATE TABLE caract_copii AS
    SELECT f.*, l.diagn_final
    FROM fise f, logopat l
    WHERE f.idc = l.idc;   (2)

Data mining techniques were not designed to process large numbers of irrelevant features; consequently, before their application, a selection of the relevant features is required [6], [7]. The most important objectives of feature selection are to avoid overfitting and to improve model performance. A variant of the mRMR method [9] for categorical values has been used for feature selection. It is based on a mutual information criterion, formally defined, for two discrete random variables X and Y, as:

I(X;Y) = \sum_{y \in Y} \sum_{x \in X} p(x,y) \log \frac{p(x,y)}{p_1(x) \, p_2(y)}   (3)

where p(x,y) is the joint probability distribution function of X and Y, and p_1(x) and p_2(y) are the marginal probability distribution functions of X and Y, respectively. For discrete random variables, the joint probability mass function is:

p(x,y) = P(X=x, Y=y) = P(Y=y \mid X=x) \, P(X=x) = P(X=x \mid Y=y) \, P(Y=y)   (4)

Since these are probabilities, we have:

\sum_{x} \sum_{y} P(X=x, Y=y) = 1   (5)

The marginal probability function P(X=x) is:

P(X=x) = \sum_{y} P(X=x, Y=y) = \sum_{y} P(X=x \mid Y=y) \, P(Y=y)   (6)

The criterion used is related to minimizing the redundancy among, and maximizing the relevance of, the chosen characteristics. The tests performed on data prepared as described in the example above revealed that, for classification, the minimum error is obtained with between 20 and 22 selected features.
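The mutual information criterion of Eq. (3) can be estimated from empirical frequencies as in the following Python sketch (a base-2 logarithm is assumed here; the paper does not specify the base):

```python
from collections import Counter
from math import log2

def mutual_information(xs, ys):
    """I(X;Y) = sum_{x,y} p(x,y) * log2( p(x,y) / (p(x) p(y)) ),
    with probabilities estimated as relative frequencies."""
    n = len(xs)
    pxy = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    return sum((c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
               for (x, y), c in pxy.items())

# Perfectly dependent binary variables carry 1 bit of mutual information:
print(mutual_information([0, 1, 0, 1], [0, 1, 0, 1]))  # 1.0
# Independent variables carry none:
print(mutual_information([0, 0, 1, 1], [0, 1, 0, 1]))  # 0.0
```

In an mRMR-style selection, such pairwise scores would be combined to pick features with maximal relevance to the class and minimal redundancy among themselves.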
The target data set obtained after these steps is subjected to the data mining algorithms. For an effective implementation of the algorithms we took into account, and tested, two possibilities: to use the Oracle Data Mining kernel (ODM), which offers the possibility of applying algorithms for classification, clustering and association rules, or to use some open-source implementations of the relevant algorithms, adapted and integrated into our own system. Taking into account the types of data included in the set, we used the Oracle implementations of Adaptive Bayes Network, the Seeker model and decision trees built with CART [16] and ID3/C4.5 for classification, the Oracle implementation of its clustering algorithm for clustering, and the Apriori algorithm for association rules. It should be noted that, for the moment, the volume of data we work on is relatively low, because the system which is the main source of these data has been operational for only a few months.

4 Conclusions and future work

Considering the opportunity of applying data mining techniques to the data collected in the process of speech therapy, we have concluded that methods such as classification, clustering or association rules can provide useful information for a more efficient therapy. Consequently, we have designed, and are currently implementing, a data mining system that aims to use the data provided by the TERAPERS system, developed by the Research Center for Computer Science of the "Stefan cel Mare" University of Suceava, in order to achieve an optimized personalized therapy of dyslalia. We have tested the modules for data pre-processing, and on the target data sets obtained from these modules we have applied several algorithms in order to detect the most appropriate solutions for the data mining kernel.
At present, efforts are directed towards the implementation of the pattern evaluation and visualization modules and towards building a user-friendly interface.

Bibliography

[1] M. Danubianu, S.G. Pentiuc, O. Schipor, I. Ungureanu, M. Nestor, Distributed intelligent system for personalized therapy of speech disorders, in Proc. of the Third International Multi-Conference on Computing in the Global Information Technology ICCGI, July 27 - August 1, Athens, Greece, 2008.
[2] M. Danubianu, S.G. Pentiuc, O. Schipor, M. Nestor, I. Ungurean, D.M. Schipor, TERAPERS - intelligent solution for personalized therapy of speech disorders, International Journal on Advances in Life Sciences, pp. 26-35, 2009.
[3] M. Danubianu, T. Socaciu, Does data mining techniques optimize the personalized therapy of speech disorders?, Journal of Applied Computer Science and Mathematics, pp. 15-19, 2009.
[4] M. Danubianu, S.G. Pentiuc, T. Socaciu, Towards the optimized personalized therapy of speech disorders by data mining techniques, The Fourth International Multi-Conference on Computing in the Global Information Technology ICCGI 2009, vol. CD, 23-29 August, Cannes - La Bocca, France, 2009.
[5] F.G. Filip, Decizie asistata de calculator, Ed. Tehnica, Bucuresti, 2005.
[6] I. Guyon, A. Elisseeff, An introduction to variable and feature selection, J. Mach. Learn. Res., 3, pp. 1157-1182, 2003.
[7] H. Liu, H. Motoda, Feature Selection for Knowledge Discovery and Data Mining, Kluwer Academic Publishers, Norwell, MA, 1998.
[8] OLP (Ortho-Logo-Paedia) project for speech therapy (http://www.xanthi.ilsp.gr/olp).
[9] H. Peng, F. Long, C. Ding, Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 8, pp. 1226-1238, 2005.
[10] B. Reiz, L. Csató, Bayesian network classifier for medical data analysis, International Journal of Computers Communications & Control, vol. 4, pp. 65-72, 2009.
[11] SpeechViewer III (http://www.synapseadaptive.com/edmark/prod/sv3).
[12] STAR - Speech Training, Assessment, and Remediation (http://www.asel.udel.edu/speech).
[13] I. Tobolcea, Interventii logo-terapeutice pentru corectarea formelor dislalice la copilul normal, Editura Spanda, Iasi, 2002.
[14] P. Wessa, Quality control of statistical learning environments and prediction of learning outcomes through reproducible computing, International Journal of Computers Communications & Control, vol. 4, pp. 185-197, 2009.
[15] R. Wirth, J. Hipp, CRISP-DM: towards a standard process model for data mining, in Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, pp. 29-39, Manchester, UK, 2000.
[16] www.salford-systems.com/ - last visited October 2009.
[17] www.speech.kth.se/multimodal/artur/index.html - last visited August 2009.
[18] O. Balter, O. Engwall, A.M. Oster, H. Kjellstrom, Wizard-of-Oz test of ARTUR - a computer-based speech training system with articulation correction, Proceedings of the Seventh International ACM SIGACCESS Conference on Computers and Accessibility, Baltimore, October 2005, pp. 36-43.
[19] H.T. Bunnel, M.D. Yarrington, B.J. Polikoff, Articulation training for young children, Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China, October 16-20, 2000, vol. 4, pp. 85-88.

International Journal of Computers, Communications & Control, Vol. I (2006), No. 1, pp.
7-8

Introduction to the First Issue

Ioan Dziţac

In this inaugural issue we present:
• an editorial article, with a general presentation of IJCCC, and this introduction, with short descriptions of the papers published in this first issue;
• one anniversary article dedicated to the centenary of Kurt Gödel (1906-1978) and two anniversary articles dedicated to the centenary of Grigore C. Moisil (1906-1973);
• some selected papers from the International Conference on Computers, Communications & Control, June 1-3, 2006, Băile Felix - Oradea, Romania (ICCCC 2006);
• two regular papers.

1 Introducing the papers

Daniel C. Doolan and Sabin Tabirca's paper1 discusses how a Java 2 Micro Edition (J2ME) application may be developed to visualize a wide variety of differing fractal types on a mobile phone. The authors have developed an integrated tool to generate a variety of two-dimensional fractals (the Mandelbrot set, the Julia set, the prime number fractal and the plasma fractal). Banshider Majhi, Y. Santhosh Reddy and D. Prasanna Babu propose in their paper a new feature extraction scheme for offline signature verification. The method uses geometric centers for feature extraction and a Euclidean distance model for classification. The classifier is well suited to the extracted features and fast in computation, and the method leads to better results than existing offline signature verification methods. Gheorghe Păun initiated membrane computing, best known as P systems (with "P" from "Păun"; for more details see http://psystems.disco.unimib.it), with the paper "Computing with membranes", published in the Journal of Computer and System Sciences, 61, 1 (2000), 108-143 (first circulated as TUCS Research Report No. 208, November 1998, http://www.tucs.fi). In the paper published in this issue he continues the attempt to bridge brane calculi with membrane computing, following the investigation started in the paper L. Cardelli, Gh.
Păun, A universality result for a (mem)brane calculus based on mate/drip operations, Intern. J. Foundations of Computer Science, 17, 1 (2006), 49-68. Dana Petcu, Cosmin Bonchiş and Cornel Izbaşa present in their paper2 a case study of a web-service-based grid application for symbolic computations (performed by computer algebra systems, CAS), using Maple2g (Maple-to-grid). Imre Rudas and Janos Fodor summarize in their paper3 the research results the authors have obtained in recent years on the generalization of conventional operators. Aggregating uncertain information is an important issue in the study of intelligent systems; in their contribution, the authors present an overview of generalized operators from the point of view of information aggregation. In this spirit, a comparative study of triangular norms and conorms, uninorms and nullnorms, generalized conjunctions and disjunctions, and distance-based operations is presented. Milan Stanojevic and Mirco Vujosevic propose in their paper4 a new and more efficient algorithm formulation and implementation for solving the Steiner tree problem on graphs of relatively low dimension. Athanasios D. Styliadis, Ioannis D. Karamitsos and Dimitrios I. Zachariou's paper5 introduces a GIS e-learning implementation based on a set of teaching (lecturing) rules matched to the cognitive learning-style preferences of both the learners and the lecturers.

1 Selected from ICCCC 2006. 2 Selected from ICCCC 2006. 3 Plenary invited paper of ICCCC 2006. 4 Invited paper of ICCCC 2006. 5 Invited paper of ICCCC 2006.
Copyright © 2006 by CCC Publications

2 Introducing the anniversary articles

Gabriel Ciobanu's article presents some impressions about the life, work and foundational views of Kurt Gödel, following his participation in the international symposium celebrating the 100th birthday of Kurt Gödel, organized between 27-29 April 2006 by the Kurt Gödel Society and the University of Vienna.
Gabriel Ciobanu was accepted to this symposium with a contribution regarding a new characterization of computable real numbers. Solomon Marcus presents in this first issue of IJCCC a very well documented article6, devoted to the 100 years since the birth of the great Romanian mathematician and computer pioneer (IEEE, 1996), Grigore C. Moisil (1906-1973). In 1976, 1980 and 1992, Acad. Solomon Marcus edited and prefaced the following volumes:
• Gr. C. Moisil, Opera matematica, vol. I, Editura Academiei, Bucuresti, 1976 (preface, edition and introductory study);
• Gr. C. Moisil, Opera matematica, vol. II, Editura Academiei, Bucuresti, 1980 (preface, edition and introductory study);
• Gr. C. Moisil, Opera matematica, vol. III, Editura Academiei, Bucuresti, 1992 (edition and introductory study).
George Georgescu, Afrodita Iorgulescu and Sergiu Rudeanu, three remarkable disciples of Moisil, present in their technical paper7 a very concise and up-to-date survey emphasizing the research done by Grigore C. Moisil and his school in algebraic logic (n-valued Lukasiewicz-Moisil algebras, θ-valued Lukasiewicz-Moisil algebras, Post algebras, etc.).

Ioan Dziţac
Executive Editor of IJCCC
Agora University
8, P-ta Tineretului, 410526 Oradea, Romania
E-mail: idzitac@univagora.ro

6 Plenary invited talk of ICCCC 2006. 7 Invited paper of ICCCC 2006.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 2, pp. 179-192

Evolved Fuzzy Control System for a Steam Generator

D. Hossu, I. Făgărăşan, A. Hossu, S.S. Iliescu

Daniela Hossu, Ioana Făgărăşan, Andrei Hossu, Sergiu St. Iliescu
University Politehnica of Bucharest
Automatic Control and Industrial Informatics
No. 313 Splaiul Independentei, District 6, code 060042, Bucharest, Romania
E-mail: andone_dana@yahoo.com, dana@aii.pub.ro, ioana@shiva.pub.ro

Abstract: Poor control of the steam generator water level is the main cause of unexpected shutdowns in nuclear power plants.
Particularly at low powers, it is a difficult task due to the shrink and swell phenomena and to flow measurement errors. In addition, the steam generator is a highly complex, nonlinear and time-varying system whose parameters vary with the operating conditions. Therefore, there is a need to systematically investigate the problem of controlling the water level in the steam generator in order to prevent such costly reactor shutdowns. The objective of this paper is to design, evaluate and implement a water level controller for steam generators based on a fuzzy model predictive control approach. An original concept of a modular evolved control system, with seamless and gradual integration into the existing control system, is proposed as the basis of the implementation of the presented system.
Keywords: evolved control, nonlinear system, fuzzy model, predictive control.

1 Introduction

Poor control of the steam generator water level in the secondary circuit of a nuclear power plant can lead to frequent reactor shutdowns. Such shutdowns are caused by violations of the safety limits on the water level and are common at low operating power, where the plant exhibits strong non-minimum phase characteristics. The difficulties in designing a steam generator (SG) level controller arise from the following factors: reverse dynamics, i.e. non-minimum phase behavior due to the swell and shrink effects, particularly at low power; changes in the plant dynamics with operating power; dynamics uncertainties; and a feed-water flow measurement signal corrupted with biased noise. It is particularly difficult to control the steam generator water level automatically during transient periods or at low power (less than 15% of full power) because of its dynamic characteristics.
Various approaches have been reported in the literature: an adaptive PID level controller using a linear parameter-varying model to describe the process dynamics over the entire operating power range [12]; LQG controllers with gain scheduling to cover the entire operating range [15]; a hybrid fuzzy-PI adaptive control of the drum level; and a model predictive controller that identifies the operating point at each sampling time and uses the plant model corresponding to this operating point as the prediction model [14]. A nonlinear physical model with a complexity suitable for model-based control has been presented by Åström and Bell [5]; the model describes the behavior of the system over a wide operating range. A model of the steam generator water level process in the form of a transfer function, determined by first-principles analysis and expert experience, has been presented in [30]. A detailed nonlinear model based on the lumped-parameter approach for system modules, developed for a typical PWR power plant, has been presented in [1]; in that paper, trained neural networks are used to predict certain system parameters of the plant for a number of different power demand histories. Paper [17] presents a self-organizing fuzzy logic controller for the water level control of a steam generator. With the advent of the current generation of high-speed computers, more advanced control strategies, not limited to PI/PID, can be applied [11], [18], [20], [23]. Model predictive control (MPC) is one such controller design technique, and it has gained wide acceptance in process control applications. Model predictive control has three basic steps: output prediction, control calculation and closing the feedback loop [6], [7], [16]. In this paper, we apply MPC techniques to develop a framework for systematically addressing the various issues in the SG level control problem.
Copyright © 2006-2010 by CCC Publications
The Takagi-Sugeno fuzzy model representation often provides efficient and computationally attractive solutions to a wide range of modeling problems, being capable of approximating nonlinear dynamics, multiple operating modes and significant parameter and structure variations [22], [24]. This paper deals with Takagi-Sugeno (T-S) fuzzy models because this type of model has a good capability for prediction and can easily be used to design model-based predictive controllers for nonlinear systems [9]. The paper includes simulations of typical operating transients in the SG. A new concept of a modular advanced control system, designed for seamless and gradual integration into the target systems, is presented. The system is designed to improve the quality of monitoring and control of the whole plant; the project targets large-scale distributed advanced control systems with an optimum-granularity architecture.

2 Fuzzy model

Fuzzy models have become one of the most well established approaches to nonlinear system modeling, since they are universal approximators which can deal with both quantitative and qualitative (linguistic) forms of information [8], [9], [21], [26], [27], [28], [29]. These models can broadly be divided into three classes: linguistic models, the so-called Mamdani-type models; fuzzy relational models; and Takagi-Sugeno (TS) models [14]. Both linguistic and fuzzy relational models are linguistically interpretable and can incorporate prior qualitative knowledge provided by experts [25]. However, when linguistic knowledge extraction is not the main purpose of the modeling, as in many dynamic system identification and control problems, the use of TS models becomes particularly adequate, since they are able to accurately represent a wide class of nonlinear systems using a relatively small number of parameters. In a nutshell, TS models perform an interpolation of local models, usually linear or affine in their arguments, by means of a fuzzy inference mechanism.
Their functional rule-base structure is well known to be intrinsically favorable for control applications. The TS model representation often provides efficient and computationally attractive solutions to a wide range of modeling problems, being capable of approximating nonlinear dynamics, multiple operating modes and significant parameter and structure variations. The ability of such models to capture the dynamics of a large class of nonlinear plants has been investigated extensively in the literature [13]. In this paper, we apply MPC techniques to develop a framework for systematically addressing the various issues in the SG level control problem, and we use Takagi-Sugeno (T-S) fuzzy models because of their capability to approximate a large class of static and dynamic nonlinear systems. In the T-S modeling methodology, a nonlinear system is divided into a number of nearly linear subsystems, and a quasi-linear empirical model is developed by means of fuzzy logic for each subsystem. The whole process behavior is then characterized by a weighted sum of the outputs of all the quasi-linear fuzzy implications. The methodology facilitates the development of a nonlinear model that is essentially a collection of quasi-linear models regulated by fuzzy logic; it also provides an opportunity to simplify the design of the model predictive controller. In such a model, the cause-effect relationship between the control u and the output y at sampling time n is established in a discrete-time representation. Each fuzzy implication is generated based on a system step response [3, 4, 10].
R^i: IF y(n) is A^i_0, y(n-1) is A^i_1, ..., y(n-m+1) is A^i_{m-1}, and u(n) is B^i_0, u(n-1) is B^i_1, ..., u(n-l+1) is B^i_{l-1}
THEN y^i(n+1) = y(n) + \sum_{j=1}^{T} h^i_j \, \Delta u(n+1-j)   (1)

where:
• A^i_j is the fuzzy set corresponding to the output y(n-j) in the i-th fuzzy implication;
• B^i_j is the fuzzy set corresponding to the input u(n-j) in the i-th fuzzy implication;
• h^i_j is an impulse response coefficient in the i-th fuzzy implication;
• T is the model horizon;
• \Delta u(n) is the difference between u(n) and u(n-1).

A complete fuzzy model of the system consists of p fuzzy implications. The system output y(n+1) is inferred as a weighted average of the outputs estimated by all the fuzzy implications:

y(n+1) = \frac{\sum_{j=1}^{p} \mu^j y^j(n+1)}{\sum_{j=1}^{p} \mu^j}   (2)

where

\mu^j = \wedge_i A^j_i \wedge_k B^j_k   (3)

Considering

\omega^j = \frac{\mu^j}{\sum_{j=1}^{p} \mu^j}   (4)

we obtain

y(n+1) = \sum_{j=1}^{p} \omega^j y^j(n+1)   (5)

3 Fuzzy model predictive control

3.1 Problem formulation

The design goal of fuzzy model predictive control is to minimize the prediction error between the output and a given reference trajectory over the next N_y steps through the selection of N_u optimal control moves. The optimization problem can be formulated as:

\min_{\Delta u(n), \Delta u(n+1), \ldots, \Delta u(n+N_u)} J(n)   (6)

J(n) = \sum_{i=1}^{N_y} \mu_i (\hat{y}(n+i) - y_r(n+i))^2 + \sum_{i=1}^{N_u} v_i \, \Delta u(n+i)^2   (7)

where:
• \mu_i and v_i are the weighting factors for the prediction error and the control energy;
• \hat{y}(n+i) is the i-th step output prediction;
• y_r(n+i) is the i-th step reference trajectory;
• \Delta u(n+i) is the i-th step control action.

The weighted sum of the local control policies gives the overall control policy:

\Delta u(n+i) = \sum_{j=1}^{p} \omega^j \Delta u^j(n+i)   (8)

Substituting (2) and (8) into (7) yields:

J(n) = \sum_{i=1}^{N_y} \mu_i \Big( \sum_{j=1}^{p} \omega^j \big( \hat{y}^j(n+i) - y_r(n+i) \big) \Big)^2 + \sum_{i=0}^{N_u-1} v_i \Big( \sum_{j=1}^{p} \omega^j \Delta u^j(n+i) \Big)^2   (9)

To simplify the computation, an alternative objective function has been proposed as a satisfactory approximation of (9) [10].
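The weighted-average inference of Eqs. (2)-(5) can be sketched as follows (a minimal Python illustration; in practice the firing strengths mu^j would come from evaluating the membership functions A^j and B^j on the current data):

```python
# Global output of a TS fuzzy model: normalize the firing strengths mu^j
# to omega^j (Eq. (4)) and take the weighted sum of the local model
# predictions y^j(n+1) (Eq. (5)).
def ts_inference(local_predictions, memberships):
    """y(n+1) = sum_j omega^j * y^j(n+1), omega^j = mu^j / sum_k mu^k."""
    total = sum(memberships)
    weights = [m / total for m in memberships]
    return sum(w * y for w, y in zip(weights, local_predictions))

# Two local models predicting 1.0 and 3.0 with equal firing strengths:
print(ts_inference([1.0, 3.0], [0.5, 0.5]))  # 2.0
```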
\tilde{J}(n) = \sum_{j=1}^{p} (\omega^j)^2 \Big[ \sum_{i=1}^{N_y} \mu_i \big( \hat{y}^j(n+i) - y_r(n+i) \big)^2 + \sum_{i=0}^{N_u-1} v_i \, \Delta u^j(n+i)^2 \Big]   (10)

The optimization problem can then be defined as:

\min_{\Delta u(n), \ldots, \Delta u(n+N_u-1)} \tilde{J}(n) = \min_{\Delta u(n), \ldots, \Delta u(n+N_u-1)} \sum_{j=1}^{p} (\omega^j)^2 \tilde{J}^j(n)   (11)

where

\tilde{J}^j(n) = \sum_{i=1}^{N_y} \mu_i \big( \hat{y}^j(n+i) - y_r(n+i) \big)^2 + \sum_{i=0}^{N_u-1} v_i \big( \Delta u^j(n+i) \big)^2   (12)

Using the alternative objective function (12), we can derive a controller by a hierarchical control design approach.

3.2 Controller design

1. Lower layer design: for the j-th subsystem, the optimization problem is defined as:

\min_{\Delta u(n), \ldots, \Delta u(n+N_u-1)} \tilde{J}^j(n)   (13)

subject to:

R^j: IF y(n+k-1) is A^j_0, ..., y(n+k-m) is A^j_{m-1}
THEN y^j(n+k) = y^j(n+k-1) + \sum_{i=1}^{T} h^j_i \Delta u(n+k-i) + \varepsilon^j(n+k-1)   (14)

where \varepsilon^j(n+k-1) serves for system coordination and is determined at the upper layer.

2. Upper layer design: the upper-layer coordination targets the identification of globally optimal control policies through the coordination of \varepsilon^j(n+k-1) for each local subsystem.

3. System coordination: from the lower layer, the local output and control information is transmitted to the upper layer. At the upper layer, the error variables are evaluated as \varepsilon^j(n+k-1) = y(n+k-1) - y^j(n+k-1). These values are compared with the values of the same error variables calculated at the previous iteration. If

\sum_{j=1}^{p} \sum_{k=1}^{N_y} \big| e^j(n+k-1) - \varepsilon^j(n+k-1) \big| > \beta

then the control policies are not yet optimal and need to be modified at the lower layer; otherwise an optimal control action has been found. The whole design is thus decomposed into the derivation of p local controllers, and the subsystems regulated by those local controllers are coordinated to derive a globally optimal control policy.
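The upper-layer convergence test of step 3 can be sketched as follows (illustrative Python; the two arguments hold the coordination errors of two successive iterations, per subsystem j and prediction step k, as they would be reported by the local controllers):

```python
# Stop the coordination loop when sum_j sum_k |e^j - eps^j| <= beta,
# i.e. when the coordination errors no longer change significantly
# between iterations and the control policies are taken as optimal.
def coordination_converged(errors_prev, errors_new, beta):
    total = sum(abs(e - eps)
                for ej, epsj in zip(errors_prev, errors_new)
                for e, eps in zip(ej, epsj))
    return total <= beta

# One subsystem, two prediction steps, nearly unchanged errors:
print(coordination_converged([[0.1, 0.2]], [[0.1, 0.21]], beta=0.05))  # True
```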
The objective function defined in (12) can be rewritten in matrix form:

\tilde{J}^j(n) = \big( \hat{Y}^{j+}(n) - Y_r(n) \big)^T W_1^j \big( \hat{Y}^{j+}(n) - Y_r(n) \big) + \big( \Delta U^{j+}(n) \big)^T W_2^j \big( \Delta U^{j+}(n) \big)   (15)

where:

\hat{Y}^{j+}(n) = \big( \hat{y}^j(n+1), \hat{y}^j(n+2), \ldots, \hat{y}^j(n+N_y) \big)^T   (16)
Y_r(n) = \big( y_r(n+1), y_r(n+2), \ldots, y_r(n+N_y) \big)^T   (17)
\Delta U^{j+}(n) = \big( \Delta u^j(n), \Delta u^j(n+1), \ldots, \Delta u^j(n+N_u-1) \big)^T   (18)
W_1^j = \mathrm{diag}\{ \mu^j_1, \mu^j_2, \ldots, \mu^j_{N_y} \}   (19)
W_2^j = \mathrm{diag}\{ v^j_1, v^j_2, \ldots, v^j_{N_u} \}   (20)

The N_y-step prediction of the output by the j-th fuzzy implication can be rewritten as:

\hat{Y}^{j+}(n) = A^j \Delta U^{j+}(n) + Y(n) + P^j(n) + E^{j+}(n)   (21)

where A^j is the lower-triangular dynamic matrix

A^j = \begin{pmatrix} a^j_1 & 0 & 0 & \cdots & 0 \\ a^j_2 & a^j_1 & 0 & \cdots & 0 \\ a^j_3 & a^j_2 & a^j_1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ a^j_{N_y} & a^j_{N_y-1} & a^j_{N_y-2} & \cdots & a^j_{N_y-N_u+1} \end{pmatrix}   (22)

a^j_i = \sum_{k=1}^{i} h^j_k   (23)

Y(n) = \big( y(n), y(n), \ldots, y(n) \big)^T   (24)
P^j(n) = \big( p^j_1(n), p^j_2(n), \ldots, p^j_{N_y}(n) \big)^T   (25)
E^{j+}(n) = \Big( 0, \sum_{k=1}^{2} \varepsilon^j(n+k-1), \ldots, \sum_{k=1}^{N_y} \varepsilon^j(n+k-1) \Big)^T   (26)
p^j_i(n) = \sum_{k=1}^{i} \sum_{l=k+1}^{T} h^j_l \, \Delta u(n+k-l)   (27)

The objective for the j-th subsystem can then be expanded as:

\tilde{J}^j(n) = \big( \Delta U^{j+}(n) \big)^T \big( A^{jT} W_1^j A^j + W_2^j \big) \Delta U^{j+}(n) + \big( \Delta U^{j+}(n) \big)^T A^{jT} W_1^j Z^j(n) + \big( Z^j(n) \big)^T W_1^j A^j \Delta U^{j+}(n) + \big( Z^j(n) \big)^T W_1^j Z^j(n)   (28)

where:

Z^j(n) = Y(n) - Y_r(n) + P^j(n) + E^{j+}(n)   (29)

Minimizing (28) yields:

\frac{\partial \tilde{J}^j(n)}{\partial \Delta U^{j+}(n)} = 2 \big( A^{jT} W_1^j A^j + W_2^j \big) \Delta U^{j+}(n) + 2 A^{jT} W_1^j Z^j(n) = 0   (30)

so the control law of the j-th fuzzy implication can be identified as:

\big( \Delta U^{j+}(n) \big)^* = -K^j Z^j(n)   (31)

where

K^j = \big( A^{jT} W_1^j A^j + W_2^j \big)^{-1} A^{jT} W_1^j   (32)

Once the optimal local control policies have been identified at the lower layer through optimization, the optimal global control policy can accordingly be derived at the upper layer:

\Delta U^{+}(n) = \big( \Delta u(n), \Delta u(n+1), \ldots, \Delta u(n+N_u-1) \big)^T   (33)

3.3 Parameter tuning

In controller design, the difficulty encountered is how to quickly minimize the upper bound of the objective function, so that the control actions force the process to track a specified trajectory as closely as possible.
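The unconstrained control law of Eqs. (31)-(32) reduces to a weighted least-squares gain. A pure-Python sketch with a toy dynamic matrix follows (all numbers are illustrative; the Gauss-Jordan inverse omits pivoting and is adequate only for this small, well-conditioned example):

```python
# Sketch of K^j = (A^T W1 A + W2)^{-1} A^T W1 and DeltaU* = -K^j Z^j(n).
# A is the lower-triangular dynamic matrix of step-response coefficients
# a_i = sum_{k<=i} h_k (Eqs. (22)-(23)); W1, W2 are diagonal weights.
def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(r) for r in zip(*A)]

def inverse(A):
    """Gauss-Jordan inverse (no pivoting safeguards; fine for this sketch)."""
    n = len(A)
    M = [row[:] + [float(i == j) for j in range(n)] for i, row in enumerate(A)]
    for i in range(n):
        p = M[i][i]
        M[i] = [v / p for v in M[i]]
        for r in range(n):
            if r != i:
                f = M[r][i]
                M[r] = [v - f * w for v, w in zip(M[r], M[i])]
    return [row[n:] for row in M]

def control_gain(A, W1, W2):
    AtW1 = matmul(transpose(A), W1)
    H = [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(matmul(AtW1, A), W2)]
    return matmul(inverse(H), AtW1)

# Toy sizes: Ny = 2 predictions, Nu = 1 move, step coefficients a1=1, a2=2.
A = [[1.0], [2.0]]
W1 = [[1.0, 0.0], [0.0, 1.0]]
W2 = [[0.1]]
K = control_gain(A, W1, W2)                 # 1 x 2 gain matrix
Z = [[0.5], [1.0]]                          # Z^j(n) = Y - Yr + P^j + E^{j+}
dU = [[-v for v in row] for row in matmul(K, Z)]
print(dU)
```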
There is no rigorous solution to the selection of the optimal control horizon (N_u) and prediction horizon (N_y). The model horizon is selected so that T \Delta t \geq the open-loop settling time. The ranges of the weighting factors W_1^j and W_2^j can be very wide; what matters is their relative magnitude. The following three-step procedure for tuning the weighting factors is proposed:
(a) select a value for W_1 and assign it to all local controllers, then determine W_2^j independently for each local controller so as to minimize the objective function for that subsystem;
(b) identify the largest W_2 and assign it to all subsystems;
(c) examine the closed-loop dynamic performance of the system; if it is not satisfactory, reduce the value of W_2 gradually until the desired dynamic performance is obtained.

3.4 Simulations

Process modeling. The main problem in setting up a signal flow diagram for a level-controlled system in a SG lies in the inhomogeneous contents of the SG. The filling consists of water at boiling temperature, pervaded by steam bubbles. Since the volume fraction of the steam bubbles is quite considerable, the mean specific weight of the contents depends very strongly on the proportion of steam; this, of course, means that the steam content also strongly influences the level in the SG. The steam content itself depends, in turn, on the load factor, on the changes in feed-water flow, and on the feed-water temperature. (Figure 1: responses of the water level at different operating powers (indicated in %) to (a) a step in steam flow-rate; (b) a step in feed-water flow-rate.) The presence of steam below the liquid level in the SG causes the shrink-and-swell phenomenon: in spite of an increased supply of water, the water level initially falls. Fig. 1 shows the responses of the water level to steps in feed-water and steam flow-rates at different operating powers.
To generate the responses, the power-dependent linear parameter-varying model identified by Irving [12] was used. At low loads the non-minimum phase behavior is much more pronounced. The changing process dynamics and the inverse response behavior significantly complicate the design of an effective water level control system. A solution to this problem is to design local linear controllers at different points in the operating regime and then apply gain-scheduling techniques to schedule these controllers into a globally applicable controller. Consider a step in feed-water flow-rate at 5% operating power, where the inverse response is strong. For this system, a fuzzy convolution model consisting of four fuzzy implications is developed as follows, for j = 1 to 4:

R^j: IF y_{dal}(n) is A^j THEN y^j_{dal}(n+1) = y^j_{dal}(n) + \sum_{i=1}^{200} h^j_{dal,i} \, u(n+1-i)

In order to define the fuzzy sets we propose the following strategy:

dal_1 = \min(y^1_{dal}) \cdot k_{dal_1}, \quad dal_2 = \min(y^2_{dal}) \cdot k_{dal_2}, \quad dal_3 = \min(y^3_{dal}) \cdot k_{dal_3}, \quad dal_4 = \min(y^4_{dal}) \cdot k_{dal_4}

where k_{dal_1} = 0.2, k_{dal_2} = 0.9, k_{dal_3} = 0.9, k_{dal_4} = 0.2 are selected in order to obtain a characteristic as close as possible to the open-loop response of the water level at 5% operating power to a step in feed-water flow-rate. Fig. 2 shows the impulse response coefficients for the y^1_{dal}, y^2_{dal}, y^3_{dal}, y^4_{dal} subsystems, and Fig. 3 shows the definition of the fuzzy sets A^1, A^2, A^3 and A^4. Consider now a step in steam flow-rate at 5% operating power. For this system, a fuzzy convolution model consisting of four fuzzy implications is developed as follows, for j = 1 to 4:

R^j: IF y_{d0}(n) is A^j THEN y^j_{d0}(n+1) = y^j_{d0}(n) + \sum_{i=1}^{200} h^j_{d0,i} \, u(n+1-i)

(Figure 2: the impulse response coefficients for the y^1_{dal}, y^2_{dal}, y^3_{dal}, y^4_{dal} subsystems. Figure 3: definition of the fuzzy sets A^1, A^2, A^3 and A^4 for the fuzzy implications R^1, R^2, R^3 and R^4, respectively.)
In order to define the fuzzy sets we propose the following strategy: $d0_1 = \max(y^1_{d0}) \cdot k_{d0_1}$, $d0_2 = \max(y^2_{d0}) \cdot k_{d0_2}$, $d0_3 = \max(y^3_{d0}) \cdot k_{d0_3}$, $d0_4 = \max(y^4_{d0}) \cdot k_{d0_4}$, where $k_{d0_1} = 0.4$, $k_{d0_2} = 0.9$, $k_{d0_3} = 0.9$, $k_{d0_4} = 0.6$ are selected in order to obtain a characteristic as close as possible to the open-loop response of the water level at 5% operating power to a step in steam flow-rate. Fig. 4 shows the impulse response coefficients for the $y^1_{d0}$, $y^2_{d0}$, $y^3_{d0}$, $y^4_{d0}$ subsystems, and Fig. 5 shows the definition of the fuzzy sets A1, A2, A3 and A4.

Figure 4: The impulse response coefficients for the $y^1_{d0}$, $y^2_{d0}$, $y^3_{d0}$, $y^4_{d0}$ subsystems
Figure 5: Definition of fuzzy sets A1, A2, A3 and A4 for FI R1, R2, R3 and R4, respectively
Figure 6: Normalized water level setpoint

Controller design

The goal in this paper is to study the use of the feed-water flow-rate as a manipulated variable to maintain the SG water level within allowable limits, in the face of the changing steam demand resulting from a change in the electrical power demand. Frequent reactor shutdowns are caused by violations of the safety limits on the water level and are common at low operating power, where the plant exhibits strong non-minimum-phase characteristics. Fig. 6 shows the normalized water level setpoint and safety limits.

The goal of fuzzy model predictive control is to minimize the prediction error between the output and a given reference trajectory over the next Ny (prediction horizon) steps through the selection of Nu (control horizon) optimal control moves. Using the alternative objective function (12), we can design a controller by a hierarchical control design approach. The whole design is decomposed into the derivation of 4 local controllers. The subsystems regulated by those local controllers are then coordinated to derive a globally optimal control policy.
The objective function defined in (12) can be rewritten in matrix form as follows:

$\tilde{J}_j(n) = \left( \hat{y}_j^+(n) - y_r(n) \right)^T W_{j1} \left( \hat{y}_j^+(n) - y_r(n) \right) + \left( \Delta u_j^+(n) \right)^T W_{j2} \left( \Delta u_j^+(n) \right)$   (34)

where:

$\hat{y}_j^+(n) = \left( \hat{y}^j_{dal}(n+1), \hat{y}^j_{dal}(n+2), \ldots, \hat{y}^j_{dal}(n+N_y) \right)^T$   (35)
$y_r(n) = \left( y_r(n+1), y_r(n+2), \ldots, y_r(n+N_y) \right)^T$   (36)
$y_r(n) = y_{ref}(n) - y_{d0}(n)$   (37)
$\Delta u_j^+(n) = \left( \Delta u_j(n), \Delta u_j(n+1), \ldots, \Delta u_j(n+N_u-1) \right)^T$   (38)
$W_{j1} = \mathrm{diag}\{ \mu^j_1, \mu^j_2, \ldots, \mu^j_{N_y} \}$   (39)
$W_{j2} = \mathrm{diag}\{ \nu^j_1, \nu^j_2, \ldots, \nu^j_{N_u} \}$   (40)

The simulations are organized around two different power transients:
• a step-up in power from 5% to 10% (Fig. 7(a));
• a ramp-up in power from 5% to 10% (Fig. 7(b)).

Figure 7: Water level response to (a) a step power increase from 5% to 10% (Nu=2, Ny=3, W1=1); (b) a power ramp-up from 5% to 10% (W2=0.1, W1=1)

The model horizon is T = 200. Increasing Ny results in a more conservative control action that has a stabilizing effect but also increases the computational effort. The computational effort likewise increases as Nu is increased, while a small value of Nu leads to a more robust controller. The performance is not strongly affected by the presence of the feed-water inverse response; only a slight oscillation is visible in the water level response.

All local controllers are used all the time; there is no switching from one local controller to another in operation. The system output is inferred as a weighted average of the outputs of all subsystems, and the overall control policy applied to the process is the weighted sum of all local control policies. This kind of design not only eliminates the controller-switching problem, and thus possible system instability, but also provides a much smoother control performance in process operation.
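Equation (34) is a standard weighted quadratic cost and is easy to evaluate directly. A minimal NumPy sketch (our own helper, with the diagonals of $W_{j1}$ and $W_{j2}$ passed as vectors):

```python
import numpy as np

def local_objective(y_pred, y_ref, du, mu, nu):
    """Evaluate J~_j(n) of eq. (34): weighted tracking error plus
    weighted control effort for one local controller.

    y_pred : predicted outputs over the prediction horizon Ny, shape (Ny,)
    y_ref  : reference trajectory over Ny, shape (Ny,)
    du     : control increments over the control horizon Nu, shape (Nu,)
    mu, nu : diagonals of W_j1 (length Ny) and W_j2 (length Nu)
    """
    e = np.asarray(y_pred, dtype=float) - np.asarray(y_ref, dtype=float)
    W1, W2 = np.diag(mu), np.diag(nu)
    du = np.asarray(du, dtype=float)
    return float(e @ W1 @ e + du @ W2 @ du)
```

A globally optimal policy then coordinates the four local minimizers of these costs, as described above.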
The FMPC responses are very satisfactory and not very sensitive to changes in the tuning parameters.

3.5 Evolved controller client/server architecture

An original concept of a modular evolved control system, with seamless and gradual integration into the primary control system, is proposed. The target systems are large-scale distributed control systems with an optimum-granularity architecture. The aim of the application is to integrate the concepts of evolved control algorithms, portability of software modules, and the real-time characteristics of the application. We propose an approach for the seamless integration of the evolved control modules into an existing control system. During the first part of its life cycle, from conception to the validation stage, the new control system lives in the shadow of the control system it will replace; after validation, the old system is replaced by the new one. The identification, modeling, control and validation stages of the system's life cycle are carried out on-line (the new system uses a real image of the process I/O data), without affecting the existing control system.

Because of the high level of interconnectivity between system components, it is necessary to provide the highest possible independence between the communication modules on the one hand and the control modules on the other. In order to obtain a high ability of integration, the communication modules have to cover the widest possible range of industrial communication interfaces and protocols. One aim of the application is to offer a unified API of extended generality and extendibility, in order to unify access and information retrieval across various wireless and wired technologies and communication interfaces (RS-232, RS-485, fieldbus: Profibus / Interbus, Ethernet/IP, TCP/IP, etc.). Applications can thus properly adapt to changes in the network connections.
The design and implementation of a solution to hide the embedded communication network problems from the application system programmers is included. The software package for evolved control includes a method based on fuzzy model predictive control. By using the basic concept of decomposition-coordination from large-scale system theory, the fuzzy model predictive controller design can be accomplished through a two-layer iterative design process. The design is decomposed into the derivation of local controllers, and the subsystems regulated by those local controllers are coordinated to derive a globally optimal control policy.

One of the main objectives of the application is to supply an integrated solution that supports all the phases of the life cycle: modeling, simulation, development and implementation. For parameter tuning, for validation, and also for embedding a large number of industrial communication protocols, multi-disciplinary simulation environments are developed which provide instruments for control, I/O data consistency checking, and defect detection. In the end, real-time advanced control applications are developed, with seamless and gradual integration into the existing distributed control system. In order to provide the real-time characteristic, we chose a multitasking environment for the application (the Windows operating system). From a structural point of view, we propose a client/server architecture for the fuzzy controller (FC) [2]:

The client is a Windows application implementing the graphical user interface (GUI). It enables the operator to control the system in two modes (manual/automatic), to monitor the system response, etc. The client also has the ability to connect and communicate with the server application. The server is an ActiveX EXE application containing the implementation of the FC kernel.
The server includes a collection of objects; these objects cover the tasks of both data processing and communication between the dedicated applications for input and output data. The client application has a thread-pool architecture, while the server application has a true multithreading architecture (each active object having its own execution thread). The server also has a multi-layer structure: at the higher level, the upper FC and the communication classes are implemented (using different transmission mechanisms: DDE, OPC, HLI, ActiveX, WinSocket, pipes); at the lower level, the controllers for the subsystems corresponding to the low-level FC are implemented. The server's true multithreading architecture provides the FC kernel with the real-time response characteristic required for industrial process control.

4 Conclusions

Control of the SG water level strongly affects nuclear power plant availability. The control task is difficult for a number of reasons, the most important among them being the nonlinear plant dynamics and the non-minimum-phase plant characteristics. There has been special interest in this problem during low-power transients because of the dominant reverse thermal dynamic effects known as shrink and swell. The SG level control problem was viewed as a single-input/single-output control problem with the feed-water flow-rate as the manipulated variable, the level as the controlled variable, and the turbine steam demand as a disturbance. The process nonlinearity was addressed by scheduling the model (and the controller) with the power level. The SG system is modeled by Takagi-Sugeno's fuzzy modeling methodology, where the system output is estimated based on the gradient. The complex shrink-and-swell phenomena associated with the SG water level are well captured by the model. The predictive controller based on the fuzzy model is designed through a hierarchical control design.

An original concept of a modular evolved control system, with seamless and gradual integration into the existing distributed control system, is proposed in the paper. A unified API of extended generality and extendibility, unifying access and information retrieval across various wireless and wired technologies and communication interfaces, is developed in order to ensure independence between the communication and control modules of the designed systems. A client/server architecture for the evolved controller that runs in the Windows environment, with real-time characteristics, is proposed.

Bibliography

[1] Akkurt, Colak U., PWR System Simulation and Parameter Estimation with Neural Networks, Annals of Nuclear Energy, Vol. 29, pp. 2087-2103, 2002.
[2] Andone D., Dobrescu R., Hossu A., Dobrescu M., Application of Fuzzy Model Predictive Control to a Drum Boiler, Integrated Computer-Aided Engineering, IOS Press, 13(4):347-361, 2006.
[3] Andone D., Hossu A., Predictive Control Based on Fuzzy Model for Steam Generator, Proc. 2004 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2004), Vol. 3, IEEE Catalog Number 04CH37542, ISBN 0-7803-8353-2, ISSN 1098-7584, Budapest, Hungary, pp. 1245-1250, July 25-29, 2004.
[4] Andone D., Fagarasan I., Dobrescu M., Advanced Control of a Steam Generator, Proc. 3rd International IEEE Conference on Intelligent Systems, Vols. 1-2, London, United Kingdom, pp. 338-343, September 4-6, 2006.
[5] Aström K., Bell R., Drum-Boiler Dynamics, Automatica, Vol. 36, pp. 363-378, 2000.
[6] Camacho E., Bordons C., Model Predictive Control, Springer-Verlag, London, 2004.
[7] Demircioglu H., Karasu E., Generalized Predictive Control: A Practical Application and Comparison of Discrete- and Continuous-Time Versions, IEEE Control Systems Magazine, 20(5):36-44, 2000.
[8] Dubois D., Prade H., Fuzzy Sets and Systems: Theory and Applications, Academic Press, Orlando, FL, 1997.
[9] Espinosa J., Hadjili M.L., Wertz V., Vandewalle J., Predictive Control Using Fuzzy Models: A Comparative Study, European Control Conference, Karlsruhe, Germany, September 1999.
[10] Huang Y., Lou H.H., Gong J.P., Edgar T.F., Fuzzy Model Predictive Control, IEEE Transactions on Fuzzy Systems, 8(6):665-668, 2000.
[11] Hirota K., Industrial Applications of Fuzzy Technology, Springer-Verlag, New York, 1993.
[12] Irving E., Miossec C., Tassart J., Toward Efficient Full Automatic Operation of the PWR Steam Generator with Water Level Adaptive Control, Proc. Int. Conf. on Boiler Dynamics and Control in Nuclear Power Stations, London, U.K., pp. 309-329, 1980.
[13] Kiriakidis K., Non-linear Control System Design via Fuzzy Modelling and LMIs, International Journal of Control, 72(7):676-685, 1999.
[14] Kothare M., Mettler B., Morari M., Bendotti P., Falinower C., Level Control in the Steam Generator of a Nuclear Power Plant, IEEE Transactions on Control Systems Technology, 8(1):55-69, 2000.
[15] Menon S.K., Parlos A.G., Gain-Scheduled Nonlinear Control of U-Tube Steam Generator Water Level, Nuclear Science and Engineering, Vol. 111, pp. 294-308, 1992.
[16] Morari M., Lee J.H., Model Predictive Control: Past, Present, and Future, Computers & Chemical Engineering, pp. 667-682, 1999.
[17] Park G.Y., Seong P.H., Application of a Self-Organizing Fuzzy Logic Controller to Nuclear Steam Generator Level Control, Nuclear Engineering and Design, Vol. 167, pp. 345-356, 1997.
[18] Pedrycz W., Gomide F., Fuzzy Systems Engineering: Toward Human-Centric Computing, Wiley-IEEE Press, 2007.
[19] Precup R.E., Tomescu M., Preitl S., Lorenz System Stabilization Using Fuzzy Controllers, International Journal of Computers Communications & Control, 2(3):279-287, 2007.
[20] Ross T.J., Fuzzy Logic with Engineering Applications, 2nd ed., Wiley & Sons, 2004.
[21] Vesselenyi T., Dzitac S., Dzitac I., Manolescu M.J., Fuzzy and Neural Controllers for a Pneumatic Actuator, International Journal of Computers Communications & Control, 2(4):375-387, 2007.
[22] Yager R.R., Zadeh L.A., An Introduction to Fuzzy Logic Applications in Intelligent Systems, Kluwer Academic Publishers, Norwell, MA, 1992.
[23] Yen J., Langari R., Zadeh L.A., Industrial Applications of Fuzzy Logic and Intelligent Systems, IEEE Press, Piscataway, NJ, 1995.
[24] Ying H., Fuzzy Control and Modeling: Analytical Foundations and Applications, Wiley-IEEE Press, 2000.
[25] Zadeh L.A., A New Frontier in Computation: Computation with Information Described in Natural Language, International Journal of Computers Communications & Control, 3(S):26-27, 2008.
[26] Zadeh L.A., Toward a Generalized Theory of Uncertainty (GTU): An Outline, Information Sciences, 172(1-2):1-40, 2005.
[27] Zadeh L.A., Knowledge Representation in Fuzzy Logic, IEEE Transactions on Knowledge and Data Engineering, 1(1):89-100, 1989.
[28] Zadeh L.A., Is There a Need for Fuzzy Logic?, Information Sciences, 178(13):2751-2779, 2008.
[29] Zadeh L.A., Tufis D., Filip F.G., Dzitac I. (Eds.), From Natural Language to Soft Computing: New Paradigms in Artificial Intelligence, Editing House of the Romanian Academy, Bucharest, ISBN 978-973-27-1678-6, 2008.
[30] Zhao F., Ou J., Du W., Simulation Modeling of Nuclear Steam Generator Water Level Process: A Case Study, ISA Transactions, Vol. 39, pp. 143-151, 2000.

Daniela Hossu is associate professor at University "Politehnica" of Bucharest. In 1996 she obtained the M.S. degree in Electrical Engineering, specialty Control Systems and Informatics in Power Systems, at University "Politehnica" of Bucharest. In 2002 she obtained a Ph.D. in Automatic Control from the University "Politehnica" of Bucharest.
She was involved in several projects for retrofitting the instrumentation & control systems of thermal power plants. Since 1998 she has been involved in lecturing and teaching assistantships at University "Politehnica" of Bucharest in the following fields: automatic control, data transmission and data compression. Her research interests include power plant control systems, artificial intelligence, industrial communication, data compression, and software architectures for industrial control.

Ioana Făgărăşan received the M.S. (1996) degree in Electrical Engineering, specialty Control Systems and Informatics in Power Systems, and the Ph.D. (2002) degree in Automatic Control from the University Politehnica of Bucharest (UPB). Since 1996 she has been assistant professor, then lecturer, and is currently associate professor in the Department of Control Engineering and Industrial Informatics at the UPB. She has participated in research cooperation programs between the university and the Institute for Automatic Control, Technical University of Darmstadt, in the fault detection and identification (FDI) field. Between 2002 and 2004 she worked as associate professor at the National Polytechnic Institute of Grenoble (Automatic Control Laboratory). Her research interests include control systems for technical processes, fault detection and isolation methods, diagnostic algorithms and fault-tolerant systems.

Andrei Hossu is professor at University "Politehnica" of Bucharest. In 1982 he obtained the M.S. in Electrical and Computer Engineering from the Department of Automatic Control and Computers, the Polytechnic Institute of Bucharest, Romania. In 2002 he obtained a Ph.D. in Engineering Sciences, in the electrical control field, from the Faculty of Automatic Control and Computers, the University "Politehnica" of Bucharest, Romania. His teaching interests include human-machine interfaces, artificial vision, and artificial intelligence methods.
His research interests include multiprocessor systems for real-time control, industrial robot control systems, algorithms and methods for artificial vision systems for industrial robots, structured programming for industrial robots, and software architectures for industrial control.

Sergiu Stelian Iliescu received the M.S. (1965) degree from the Power Systems Faculty, Automatic Control Department, and the Ph.D. (1979) degree in Computers-Electronics from University "Politehnica" of Bucharest (UPB). He started his professional activity in the execution and implementation of automatic control equipment and systems within the Automatic Control System Execution Company (IMIA). The next step was to work as a design engineer within the Power Systems Studies and Design Institute (ISPE). Since 1968 he has been part of the teaching and research team of the Automatic Control and Computers Faculty, UPB, where he has been a full professor since 1995. He has experience in process computers (TU Dresden, 1971-1972) and has coordinated collaborative research activities with TU Darmstadt (1992-2004) in two research fields: system theory, modeling, identification and control of technical processes (Institute of Automatic Technique, TUD), and power system control and monitoring (Institute of Electrical Power Systems, TUD). His research and teaching domains include automatic control theory, informatics systems for the supervision and control of industrial processes with focus on the power system field, and systems engineering analysis.

International Journal of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. IV (2009), No. 4, pp. 386-400

High Performance Computing Systems with Various Checkpointing Schemes

N. Naksinehaboon, M. Păun, R. Nassar, B. Leangsuksun, S. Scott

Nichamon Naksinehaboon, Louisiana Tech University, Computer Science Department, Louisiana, 71272, USA. E-mail: n.nichamon@gmail.com
Mihaela Păun, Louisiana Tech University, Mathematics and Statistics Department, Louisiana, 71272, USA, and Finance and Banks Faculty, Spiru Haret University, Romania. E-mail: mpaun@latech.edu
Raja Nassar, Louisiana Tech University, Mathematics and Statistics Department, Louisiana, 71272, USA. E-mail: ran1@suddenlink.net
Chokchai (Box) Leangsuksun, Louisiana Tech University, Computer Science Department, Louisiana, 71272, USA. E-mail: box@latech.edu
Stephen Scott, Oak Ridge National Laboratory, Computer Science and Mathematics Division, TN 37831-6173. E-mail: scottsl@ornl.gov

Abstract: Finding the failure rate of a system is a crucial step in high performance computing systems analysis. To deal with this problem, a fault-tolerant mechanism, called the checkpoint/restart technique, was introduced. However, there are additional costs to performing this mechanism. Thus, we propose two models for different schemes (full and incremental checkpoint schemes). The models, which are based on the reliability of the system, are used to determine the checkpoint placements. Both proposed models consider a balance between checkpoint overhead and re-computing time. Due to the extra costs from each incremental checkpoint during the recovery period, a method to find the number of incremental checkpoints between two consecutive full checkpoints is given. Our simulation suggests that in most cases our incremental checkpoint model can reduce the waste time more than the full checkpoint model does. The waste times produced by both models are in the range of % to % of the application completion time, depending on the checkpoint overheads.

Keywords: large-scale distributed system, reliability, fault-tolerance, checkpoint/restart model, HPC

1 Introduction

High performance computing (HPC) systems are used to address many challenging computational problems.
However, an HPC system is required not only to solve these problems, but also to solve them in a short time.

Copyright © 2006-2009 by CCC Publications

This requirement drives the physical size of an HPC system ever larger, and the reliability of a large HPC system is inversely proportional to its size. For example, the Blue Gene/L system hosted at Lawrence Livermore National Laboratory (LLNL), the world's second-fastest HPC system, has a failure rate of roughly one failure a week [1]. Unfortunately, handling the reliability of such large HPC systems is still problematic. A checkpoint/restart mechanism is widely used to deal with the high failure rate problem. If a failure occurs while an application is running, the portion of the application that has already been computed has to be re-computed. The checkpoint mechanism is able to reduce this re-computing time of an application after a failure occurrence; however, there is an additional time that is spent to perform the mechanism. The details of the checkpoint mechanism are provided in Section 2.1.

Since the Message Passing Interface (MPI) is one of the most popular parallel programming paradigms, we target our reliability study at systems using the MPI standard [20]. Normally, an MPI application is decomposed and executed among a group of nodes, where individual subtasks communicate through MPI. Because of the static view of an MPI environment [17], a single node failure causes the whole application to fail and requires an application restart. The checkpoint/restart scheme has been widely used in [2], [4], [7], [20] to address application outages. However, applications are in practice checkpointed without considering the probability of failures. As such, some checkpoints are useless because, at the checkpoint time, the chance of a failure occurrence is small; this causes unnecessary checkpoint overhead.
On the other hand, the re-computing time is large when applications are not checkpointed while the probability of failure is high. Ideally, a checkpoint is needed when a failure is going to occur within a certain period of time, so that there are no useless checkpoints and no expensive re-computing time. We therefore define the waste time of the checkpoint mechanism as the sum of the checkpoint overhead, the re-computing time and the recovery time. Consequently, we aim to minimize the waste time by balancing the checkpoint overhead and the re-computing time.

In existing studies, the optimal checkpoint placement strategy is based either on cost function models [2], [3], [20] or on Markov availability models [5], [14], [19], [22]. In addition, it is typically assumed that system failure is a Poisson process (with a fixed failure rate). In practice, however, system failure may not always follow the Poisson model [16], and the overall system reliability is more complex than that of its individual components. In this paper, we propose two stochastic models for improving the checkpoint/restart scheme in an HPC environment by reducing the waste time. The reliability function of the system is obtained by analyzing historical failure data from the system event log files.

2 Full Checkpoint/Restart Model

2.1 Behavior of the full checkpoint/restart model

A full checkpoint/restart mechanism is the traditional checkpoint/restart mechanism, which periodically saves the running application's state to local storage. After a failure occurs, the application can be recovered from the last saved state rather than from the starting point, which decreases the time spent re-computing the application. Conversely, there is an additional time to save the application state, called the checkpoint overhead. To improve the checkpoint mechanism, checkpoints should not be performed too frequently, so as to balance the checkpoint overhead and the application re-computing time.
Thus, we focus on how to determine the checkpoint placements, or intervals, that minimize the waste time. In this section, we present a checkpoint/restart model with a re-computing time coefficient for fault-tolerant parallel applications. We assume that the application model is MPI and supports a coordinated checkpoint mechanism. The characteristics of the model are directly related to the system reliability function, where any single node outage results in an application outage because of the MPI standard. For a parallel application, the coordinated checkpoint protocol guarantees that the checkpoint of an HPC system, obtained by synchronizing the local checkpoints of the individual processes, is consistent. As a result, each process is checkpointed at almost the same time, so we assume that there is no time difference between the individual process checkpoints and treat them as a single checkpoint.

We consider a failure model that allows more than one failure during the lifetime of a given application. Moreover, after each failure, the application is restarted from the last checkpoint. Our checkpoint/restart model is shown in Figure 1. It follows a renewal reward process in which $\omega_i$ denotes the i-th time between failures in each repeated cycle. We assume that the waste time (checkpoint overhead, recovery time, and re-computing time) of each cycle is a random variable, $W_1, W_2, W_3, \ldots$, since it depends on when a failure occurs. Hence, the total waste time may be expressed as

$C_t = \sum_{i=1}^{M} W_i,$   (1)

where $M = \max\left\{ n \in \{0, 1, 2, \ldots\} \mid \sum_{i=1}^{n} \omega_i \le t \right\}$, and $C_t$ is called a renewal reward process.

Figure 1: Checkpoint/restart as a stochastic renewal reward process
From [15], the theorem of a renewal reward process is given as

$\lim_{t \to \infty} \frac{E\left[\sum_{i=1}^{M} W_i\right]}{t} = \frac{E[W]}{E[\omega]}.$   (2)

In the checkpoint/restart model, Eq. (2) shows that the mean of the overall waste time (left-hand side of the equation) can be expressed as a function of the mean waste time of the 1st cycle. This means that minimizing the overall time lost is equivalent to minimizing the waste time in the 1st cycle.

Figure 2: Checkpoint/restart as a stochastic renewal reward process

Table 1: Parameters of the checkpoint/restart model

$O_F$ : checkpoint overhead
$R_F$ : recovery time
$T_{Re}$ : re-computing time
$n_F(t)$ : checkpoint frequency function of the full checkpoint model
$f(t)$ : probability density function of the TBF (time between failures)
$\omega_i$ : the cycle between failure i and failure i+1

Consider a checkpoint/restart scheme in the first cycle $\omega_1$ (time between two failures). Figure 2 illustrates the model with parameters such as the checkpoint overhead $O_F$ (time spent to save an application state), the recovery time $R_F$ (time spent to load the application's saved state), and the re-computing time $T_{Re}$. Throughout the rest of the paper, our failure model is based on the following assumptions:

1. A running application may be interrupted by a series of random transient failures, where the time between failures has a certain probability density function (pdf), f(t).
2. The system failure can be detected by a monitoring mechanism, and we assume that there is no failure during the re-computing and recovery time.
3. Each checkpoint overhead $O_F$ is a constant. In practice, we can take this constant to be the average value of multiple checkpoint overheads.
4. The application can be recovered from the last checkpoint. This implies that the re-computing time $T_{Re}$ is the period between the last checkpoint and the present failure.
5. The recovery time $R_F$ is a constant.
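The renewal reward identity of Eq. (2) can be checked numerically with a toy renewal process. The sketch below is ours, not the paper's simulator: cycles are exponential and the per-cycle "waste" is an arbitrary bounded function of the cycle length; the long-run waste per unit time then converges to $E[W]/E[\omega]$.

```python
import math
import random

def renewal_reward_rate(sample_cycle, reward, horizon, rng):
    """Accumulate per-cycle rewards over [0, horizon] and return the
    long-run reward per unit time (left-hand side of eq. (2))."""
    t, total = 0.0, 0.0
    while True:
        cycle = sample_cycle(rng)
        if t + cycle > horizon:
            break
        t += cycle
        total += reward(cycle)
    return total / horizon

rng = random.Random(1)
lam = 0.5
rate = renewal_reward_rate(lambda r: r.expovariate(lam),
                           lambda w: min(w, 1.0),   # toy waste per cycle
                           horizon=2e5, rng=rng)
# For omega ~ Exp(lam): E[W]/E[omega] = lam * E[min(omega, 1)]
#                                     = 1 - exp(-lam)
```

With this choice of reward, the analytical rate is $1 - e^{-\lambda} \approx 0.393$ for $\lambda = 0.5$, and the simulated `rate` approaches it as the horizon grows.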
Remark: Assumption 2 is satisfied since a well-managed system can be engineered with an efficient mechanism to immediately detect a failure. Assumption 5 (constant $R_F$) is satisfied if there is a mechanism in place to replace the failed node with a spare node.

The objective of this model is to give the best checkpoint placement sequence, that which minimizes the total waste time.

Definition 1. Let the sequence of discrete checkpoint placements be $0 = t_0 < t_1 < \ldots < t_N$, and let $n_F(t)$ be the checkpoint frequency function for the full checkpoint model, defined by

$\int_a^b n_F(t)\,dt$ = the number of checkpoints from time a to time b.

It follows that $\int_{t_i}^{t_{i+1}} n_F(t)\,dt = 1$.

In Figure 2, the waste time $W_i$ (checkpoint overhead, re-computing time, and recovery time) in a given cycle $\omega_i$ can be expressed as

$W_i = O_F \int_0^{\omega_i} n_F(\tau)\,d\tau + T_{Re} + R_F.$   (3)

From Assumption 5, $R_F$ is a constant, and, from Assumption 4, we suppose that the system can be successfully recovered from the last checkpoint. The relationship between the re-computing time $T_{Re}$ and the checkpoint interval is illustrated in Figure 3.

Figure 3: The relationship between the rollback $T_{Re}$ and the checkpoint interval

Ling et al. [9] and Ozaki et al. [11] considered a recovery cost including both re-computing time and recovery time, similar to those in our model. They represented their recovery cost by a function, called the recovery cost function, and illustrated their model by assuming the recovery function to be linear. Assuming linearity may be restrictive and may not lead to optimality. In our model, we consider a re-computing time coefficient k (0 < k < 1) instead of a recovery cost function and propose an algorithm (end of Section 2.2) to estimate this coefficient. The re-computing time coefficient is general and can be determined for any system failure distribution, which makes our approach useful for application purposes.
Since $\omega_i$ is a value between two checkpoint placements, by the mean value theorem we can estimate the frequency over this interval by $n_F(\omega_i)$. Therefore, $T_{Re}$ can be approximated by Eq. (4), where k is the re-computing time coefficient, a variable in (0,1), as seen in Figure 3:

$T_{Re} \approx k / n_F(\omega_i), \quad k \in (0, 1).$   (4)

Replacing $T_{Re}$ in Eq. (3) by its value from Eq. (4) gives:

$W_i = O_F \int_0^{\omega_i} n_F(t)\,dt + \frac{k}{n_F(\omega_i)} + R_F.$   (5)

According to the renewal reward theorem in Eq. (2), the total waste time in the checkpoint and recovery process can be minimized by minimizing E[W]. Let f(t) be the probability density function of the time between failures. Then, the probability that the system fails in the interval $[t, t + \Delta t]$ is $f(t) \cdot \Delta t$. The expected waste time during a cycle in the checkpoint/restart process is

$E[W] = \int_0^{\infty} \left( O_F \int_0^{t} n_F(\tau)\,d\tau + \frac{k}{n_F(t)} + R_F \right) f(t)\,dt.$   (6)

Our interest is in determining the optimal checkpoint frequency $n_F(t)$ that minimizes the expected waste time defined by Eq. (6).

Solution: letting $x(t) = \int_0^t n_F(\tau)\,d\tau$, so that $x'(t) = n_F(t)$, Eq. (6) becomes

$E[W] = \int_0^{\infty} \left( O_F\,x(t) + \frac{k}{x'(t)} + R_F \right) f(t)\,dt.$   (7)

Let $\phi(x, x', t) = \left( O_F\,x(t) + \frac{k}{x'(t)} + R_F \right) f(t)$. For Eq. (7) to have an extremum, the variation of E[W] (its first derivative with respect to the function x) must vanish. Consequently, the function x must satisfy Euler's equation [6]:

$\frac{\partial \phi}{\partial x} - \frac{d}{dt} \cdot \frac{\partial \phi}{\partial x'} = 0.$   (8)

Since $\frac{\partial \phi}{\partial x} = O_F\,f(t)$ and $\frac{\partial \phi}{\partial x'} = -\frac{k\,f(t)}{(x'(t))^2}$, Eq. (8) becomes

$O_F\,f(t) + \frac{d}{dt}\left[ \frac{k\,f(t)}{(x'(t))^2} \right] = 0.$   (9)

Integrating both sides of Eq. (9) from 0 to t gives

$O_F\,F(t) + \frac{k\,f(t)}{(x'(t))^2} = C.$   (10)

Since $\lim_{t\to\infty} F(t) = 1$ and $\lim_{t\to\infty} f(t) = 0$, we obtain $C = O_F$.
moreover, since x′(t) = n_f(t), we obtain the optimal checkpoint frequency function in eq.(11):

n_f(t) = √(k/o_f) · √( f(t) / (1 − F(t)) ), k ∈ (0, 1). (11)

it is worth mentioning that the probability density function (pdf) and the cumulative distribution function (cdf) can be the joint pdf and joint cdf whose marginals are the failure distributions of the individual nodes of an hpc system.

2.2 estimation of the re-computing-time coefficient k

in figure 4, t_re is the re-computing time of the application recovered after the failure. it is the time interval between the last checkpoint and the failure, a random variable depending on when the failure occurs relative to the checkpoint placement.

figure 4: t_re (a) without checkpoint, and (b) with checkpoint

in an application without checkpoints (figure 4(a)), if a failure occurs at time t_f, then t_re = t_f − t_0. with checkpoints (figure 4(b)), t_re is clearly a random variable which depends on the time the failure occurs. therefore, if we know the distribution of the time between failures, t_re can be estimated.

high performance computing systems with various checkpointing schemes 391

definition 2. the re-computing time coefficient k is the ratio between the re-computing time and the checkpoint interval in which a failure occurs:

k = t_re / (t_{i+1} − t_i) = (t_f − t_i) / (t_{i+1} − t_i).

to estimate k, we first obtain the expected re-computing time for each checkpoint interval.

definition 3. the excess life is a random variable s ≥ 0 which denotes system survival until time t + s given that the system survives until time t. we denote the cdf, the pdf, and the expected value of the excess life s as

F(t + s|t) = p(t_f < t + s | t_f > t), f(t + s|t) = dF(t + s|t)/ds, e[s] = ∫_0^∞ s·f(t + s|t) ds. (12)

in our checkpoint model, each checkpoint time t_i is a time at which we expect a failure to occur.
the re-computing time during the interval (t_i, t_{i+1}), denoted t_re_i, is a random variable taking values in (0, t_{i+1} − t_i). according to the excess life definition, its expected value can be calculated as

e[t_re_i] = ∫_0^{t_{i+1}−t_i} s·f(t_i + s|t_i) ds / ∫_0^{t_{i+1}−t_i} f(t_i + s|t_i) ds. (13)

therefore, for the expected k of the i-th checkpoint interval, k̄_i, we obtain

k̄_i = e[t_re_i] / (t_{i+1} − t_i). (14)

hence, the expected k, k̄, can be expressed as

k̄ = ∑_{i=0}^n p_i·k̄_i / ∑_{i=0}^n p_i, (15)

where p_i = p(t_i < t_f < t_{i+1} | t_f > t_i) and n is the number of checkpoints. to estimate k iteratively, we assume an initial value k̂ between 0 and 1. we then calculate the corresponding checkpoint sequence t_1, t_2, ..., t_n from eq.(11). next, we calculate the k̄ corresponding to this checkpoint sequence using eqs. (13)-(15). we repeat the procedure, varying k̂, until we obtain a k̄ value equal to k̂.

algorithm to estimate k
step 1: assume k̂ = a, a ∈ (0, 1), and set t_0 = 0.
step 2: calculate the checkpoint sequence t_1, t_2, ..., t_n corresponding to k̂ from step 1.
step 3: calculate k̄ from eqs. (13)-(15) using the sequence in step 2.
step 4: if k̂ = k̄, then set k = k̂ = k̄; done. else set k̂ = k̄ and repeat from step 2.

2.3 full checkpoint model evaluation

full checkpoint model for the exponential distribution. substituting f(t) = λe^{−λt} and 1 − F(t) = e^{−λt} in eq.(11), the checkpoint frequency function for the exponential distribution with failure rate λ is

n_f(t) = √(k/o_f) · √λ. (16)

by the definition of the checkpoint frequency function, for i = 0, 1, 2, ... we have

∫_{t_i}^{t_{i+1}} n_f(t) dt = 1, that is, ∫_{t_i}^{t_{i+1}} √(k/o_f) · √λ dt = 1, (17)

so that

t_{i+1} = √(o_f/k) · √(1/λ) + t_i.
(18) using induction with t_0 = 0, the sequence of the optimal checkpoint placements for the exponential distribution with failure rate λ is given by

t_i = i · √(o_f/k) · √(1/λ), k ∈ (0, 1), (19)

where t_i is the i-th checkpoint placement. the checkpoint interval is t_i − t_{i−1}, where i ∈ {1, 2, ...} and t_0 = 0; for the exponential distribution it equals √(o_f/k) · √(1/λ), k ∈ (0, 1). therefore, according to the proposed model, for the exponential distribution the checkpoint interval is constant. this reflects the fact that if failures follow the exponential distribution, the failure rate is constant.

estimation of the re-computing time coefficient k for the exponential distribution. following the estimation procedure of section 2.2, let k̂ = a, 0 < a < 1, and let its corresponding checkpoint time sequence be {t_i}, i = 1, ..., n. the cdf, pdf, and expected value of the excess life for the exponential distribution follow from eq.(12):

F(t_i + s|t_i) = 1 − e^{−λs}, (20)

f(t_i + s|t_i) = λ·e^{−λs}, (21)

e[t_re_i] = 1/λ − τ_c·e^{−λτ_c} / (1 − e^{−λτ_c}), where τ_c = √(o_f/k̂) · √(1/λ) is the checkpoint interval. (22)

substituting eq.(22) into eq.(14) gives the expected k of the i-th checkpoint interval (k̄_i), and for the expected k̄ we use eq.(15).

3 incremental checkpoint/restart model

3.1 behavior of the incremental checkpoint/restart model

although the full checkpoint/restart mechanism helps reduce the overall waste time of an application when a failure takes place, we have to spend time saving the application state, the so-called checkpoint overhead. if the checkpoint overhead can be reduced, the waste time may be cheaper. the incremental checkpoint mechanism was introduced to reduce the checkpoint overhead by saving only the pages that have changed instead of the whole process [12], [8], [13], [18].
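for the exponential case, eqs.(19)-(22) and the fixed-point iteration of section 2.2 can be sketched in a few lines of code. this is our illustrative implementation under hypothetical parameter values, not the authors' software; by memorylessness every interval yields the same k̄_i, so the weighted sum of eq.(15) collapses to a single value:

```python
import math

def interval(k, o_f, lam):
    """constant checkpoint interval under exponential failures, eq.(19):
    sqrt(o_f / k) / sqrt(lam)."""
    return math.sqrt(o_f / k) / math.sqrt(lam)

def k_bar(k_hat, o_f, lam):
    """expected re-computing coefficient for the interval implied by k_hat,
    using e[t_re] = 1/lam - T*exp(-lam*T)/(1 - exp(-lam*T)) (eq.(22))
    divided by the interval T (eq.(14))."""
    T = interval(k_hat, o_f, lam)
    e_tre = 1.0 / lam - T * math.exp(-lam * T) / (1.0 - math.exp(-lam * T))
    return e_tre / T

def estimate_k(o_f, lam, tol=1e-10):
    """fixed-point iteration of section 2.2: repeat k_hat <- k_bar(k_hat)
    until the estimate stabilizes."""
    k_hat = 0.5
    while True:
        k_new = k_bar(k_hat, o_f, lam)
        if abs(k_new - k_hat) < tol:
            return k_new
        k_hat = k_new

# hypothetical values, in hours: o_f = 2 minutes, mtbf = 23 hours
k = estimate_k(o_f=2.0 / 60.0, lam=1.0 / 23.0)
tau = interval(k, 2.0 / 60.0, 1.0 / 23.0)
print(k, tau)
```

the map k̂ ↦ k̄(k̂) is a strong contraction here (k̄ stays close to 1/2 when λ·T is small), so the iteration converges in a handful of steps.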
in the incremental checkpoint scheme in figure 5, the first checkpoint is typically a full checkpoint. after that, the mechanism determines which pages have changed since the last checkpoint, saves only those pages, and repeats this process until another full checkpoint is performed. to recover the application, we load the saved state from the last full checkpoint and then the changed pages from each incremental checkpoint following it. this results in a more expensive recovery cost than that of the full checkpoint mechanism. thus, finding the number of incremental checkpoints between two consecutive full checkpoints that balances the recovery cost and the total checkpoint overhead is crucial, because too many incremental checkpoints lead to unnecessary recovery cost. a challenge in achieving minimum overhead with incremental checkpointing is to find the maximum number of incremental checkpoints that still maintains a lower waste time than the traditional checkpoint mechanism. the behavior of the incremental checkpoint/restart model is illustrated in figure 5. the model consists of two types of checkpoints (full checkpoints and incremental checkpoints). the meaning of each parameter is listed in table 2.

figure 5: behavior of the incremental checkpoint/restart model

table 2: parameters in the incremental checkpoint/restart model.
o_f    full checkpoint overhead
o_i    incremental checkpoint overhead
t_re   re-computing time
r_i    recovery time per incremental checkpoint
r_f    recovery time per full checkpoint
m      number of incremental checkpoints between two consecutive full checkpoints
ω_i    the cycle between failure (i−1) and failure i, where i = 1, 2, 3, ...
n_i(t)  checkpoint frequency function for the incremental checkpoint model

in our incremental checkpoint model, the recovery cost is determined by the number of incremental checkpoints. after m incremental checkpoints are performed, either another incremental checkpoint or a full checkpoint can be performed. a full checkpoint is chosen if its cost is cheaper than the recovery cost of an additional incremental checkpoint; this is what we call a breakeven point. the main idea is to balance a cost-saving function with the full and incremental checkpoint overheads and the recovery complexity introduced by the incremental model. the incremental checkpoint model is an extension of the full checkpoint model above, so the assumptions of the full checkpoint model apply here as well. there are, however, additional assumptions regarding the incremental checkpoint scheme:

1. the first checkpoint in an application is a full checkpoint. after an application is recovered from failure, the first checkpoint is a full checkpoint as well. after m consecutive incremental checkpoints, a full checkpoint may be performed if the overall cost reaches the breakeven point between incremental and full checkpoints. we determine the value of m in the next section.

2. the incremental checkpoint overhead (o_i) may be viewed as an average over all incremental checkpoints performed. although an application has both small and large incremental checkpoint overheads, this assumption is reasonable because we aim to minimize the overall waste time caused by the incremental checkpoint mechanism.

3. the recovery cost per incremental checkpoint (r_i) and the number of incremental checkpoints between two consecutive full checkpoints, m, are constants. the evaluation of m is given in section 3.2.
in future work, we will extend the model to consider the incremental checkpoint overhead and the number of incremental checkpoints m as functions of time, in order to better represent realistic scenarios.

definition 4. let the sequence of discrete checkpoint placements be 0 = t_0 < t_1 < ... < t_n, and let the checkpoint frequency function for the incremental checkpoint model, denoted n_i(t), be defined by ∫_a^b n_i(t) dt = the number of full and incremental checkpoints from time a to time b.

we still have ∫_{t_i}^{t_{i+1}} n_i(t) dt = 1. in figure 5, the total number of checkpoints in cycle ω_1 is

∫_0^{ω_1} n_i(t) dt = N_F + N_I, (23)

where N_F is the number of full checkpoints in cycle ω_1 and N_I is the number of incremental checkpoints in the same cycle. we note that N_I ≈ m·N_F, and hence N_F ≈ (1/(m+1)) · ∫_0^{ω_1} n_i(t) dt, where m is the number of incremental checkpoints between two consecutive full checkpoints. we recall that the checkpoint procedure is a renewal process: whenever a failure occurs, a new cycle starts. we follow renewal reward theory to derive the optimal incremental checkpoint/restart model similarly to the full checkpoint model. therefore, to minimize the overall waste time of the incremental checkpoint model, it is sufficient to find the checkpoint frequency function n_i(t) that minimizes the waste time of the first cycle, which consists of the full checkpoint overhead, incremental checkpoint overhead, recovery time, and re-computing time in that cycle. the waste time of the first cycle can be expressed as

w = o_f · (1/(m+1)) · ∫_0^{ω_1} n_i(τ) dτ + o_i · (m/(m+1)) · ∫_0^{ω_1} n_i(τ) dτ + (r_f + m·r_i) + t_re.
(24) we suppose that the system can be successfully recovered from the last checkpoint, and the rollback cost t_re can be estimated by k / n_i(ω_1), 0 < k < 1, as in the full checkpoint model, where n_i(ω_1) is the checkpoint frequency at time ω_1 and k can be evaluated by the same method as in the full checkpoint scheme. substituting t_re into eq.(24), we obtain:

w = ( (o_f + m·o_i) / (m+1) ) · ∫_0^{ω_1} n_i(τ) dτ + (r_f + m·r_i) + k / n_i(ω_1). (25)

by the stochastic renewal reward theory, minimizing the overall waste time is equivalent to minimizing the expected waste time in a cycle. the expected waste time during a cycle of the checkpoint process is

e[w] = ∫_0^∞ [ ( (o_f + m·o_i) / (m+1) ) · ∫_0^t n_i(τ) dτ + (r_f + m·r_i) + k / n_i(t) ] · f(t) dt. (26)

we now look for the checkpoint frequency n_i(t) that minimizes eq.(26).

solution: let x(t) = ∫_0^t n_i(τ) dτ, so that x′(t) = n_i(t). from eq.(26) we obtain:

e[w] = ∫_0^∞ [ ( (o_f + m·o_i) / (m+1) ) · x(t) + (r_f + m·r_i) + k / x′(t) ] · f(t) dt. (27)

let the function under the integral on the right side of eq.(27) be φ(x, x′, t). then

φ(x, x′, t) = [ ( (o_f + m·o_i) / (m+1) ) · x(t) + (r_f + m·r_i) + k / x′(t) ] · f(t). (28)

by the same argument as in the full checkpoint model, the checkpoint frequency function of the incremental checkpoint model is

n_i(t) = √( (m+1)·k / (o_f + m·o_i) ) · √( f(t) / (1 − F(t)) ), k ∈ (0, 1). (29)

practically, the incremental checkpoint mechanism extends the full (regular) checkpoint mechanism in the sense that incremental checkpoints are performed additionally in order to reduce the total checkpoint overhead. conversely, the full checkpoint mechanism is the incremental checkpoint mechanism without any incremental checkpoints. according to eq.(11) and eq.(29), the derived models satisfy this connection between the full and incremental checkpoint mechanisms.
that is, when m equals 0, the incremental checkpoint frequency function, eq.(29), becomes the full checkpoint frequency function, eq.(11).

3.2 estimation of the consecutive incremental checkpoint number, m

we denote the number of incremental checkpoints between two consecutive full checkpoints by m. the value of m decides the next checkpoint type, either incremental or full. as discussed earlier, the incremental checkpoint aims to reduce the checkpoint overhead; on the other hand, the recovery cost increases as the number of subsequent incremental checkpoints (m) increases, because the application reconstruction phase requires information from each and every incremental checkpoint since the last full checkpoint. from the model description of the incremental checkpoint, we assume that the first checkpoint is a full checkpoint, followed by a sequence of incremental checkpoints. moreover, we perform m incremental checkpoints after a full checkpoint if the expected waste time of having m + 1 incremental checkpoints is more expensive than that of having m incremental checkpoints. we follow this idea to find m by comparing the expected waste time in two possible cases. in the first case, shown in figure 6(a), m consecutive incremental checkpoints are followed by a full checkpoint. alternatively, as shown in figure 6(b), after placing m consecutive incremental checkpoints we continue with the (m+1)-th incremental checkpoint. in each case we consider the probability of failure. details are discussed in what follows.

figure 6: sequential incremental checkpoint scenario

case (a): after placing m consecutive incremental checkpoints, a full checkpoint is performed next, as shown in figure 6(a). let p_i be the probability that a failure occurs after the second full checkpoint and before the next incremental checkpoint.
hence, 1 − p_i is the probability that no failure occurs in that period. if no failure occurs during this period, the overall cost is c_a = (o_f + m·o_i) + o_f. alternatively, if a failure occurs, the cost is c_a = (o_f + m·o_i) + o_f + r_f. therefore, the expected cost is

c_a = (1 − p_i)·(o_f + m·o_i + o_f) + p_i·(o_f + m·o_i + o_f + r_f). (30)

case (b): after reaching m consecutive incremental checkpoints, another incremental checkpoint is performed, as shown in figure 6(b). we consider the probability of the failure events to be approximately the same as in case (a), p_i. when no failure occurs, the cost is c_b = (o_f + m·o_i) + o_i. alternatively, if a failure happens, the cost is c_b = (o_f + m·o_i) + o_i + (r_f + (m+1)·r_i). therefore, the expected cost in case (b) is

c_b = (1 − p_i)·[o_f + (m+1)·o_i] + p_i·[o_f + (m+1)·o_i + (r_f + (m+1)·r_i)]. (31)

we would like the number of incremental checkpoints to be as large as possible, subject to the criterion that adding another incremental checkpoint makes the expected waste time larger than performing a full checkpoint instead; the solution m must therefore satisfy c_b ≥ c_a, in which case we choose case (a) and perform a full checkpoint after m sequential incremental checkpoints. this gives

m ≥ (o_f − o_i) / (p_i·r_i) − 1. (32)

inequality (32) states that if m ≥ (o_f − o_i)/(p_i·r_i) − 1, the cost in case (b) will be greater than the cost in case (a). thus we take m as

m = ⌈ (o_f − o_i) / (p_i·r_i) − 1 ⌉, (33)

where ⌈·⌉ is the ceiling function. according to eq.(33), m is proportional to the difference between the full and the incremental checkpoint overheads and inversely proportional to the incremental recovery cost and the probability of a failure occurrence p_i. the following points should be noted.
firstly, if the incremental checkpoint overhead o_i is nearly as large as the full checkpoint overhead o_f, performing incremental checkpoints instead of full checkpoints may not reduce the total overhead. in eq.(33), when o_i approaches o_f, the numerator approaches 0, so the number of incremental checkpoints m is small. secondly, all incremental checkpoints following the last full checkpoint must be loaded after a failure occurs, causing extra cost in the application recovery period; thus, if the incremental checkpoint recovery cost is expensive, the number of incremental checkpoints should be small in order to keep the recovery cost low. thirdly, a full checkpoint should be performed when the chance of a failure occurrence is high because, when a failure happens, there are then few incremental checkpoints to be loaded; therefore, if the failure probability p_i is high, the number of incremental checkpoints m is small, as in eq.(33). lastly, the number of incremental checkpoints m does not depend on the full checkpoint recovery cost because only one full checkpoint is loaded during the application recovery.

3.3 incremental checkpoint model evaluation

we obtained the general solution for our incremental checkpoint model: eq.(29) gives a checkpoint frequency function derived from the probability distribution of the system time between failures (tbf). for the purpose of the incremental checkpoint/restart study and evaluation, we validate our model results only when the system failure follows the exponential distribution. this assumption simplifies the validation; however, we plan to use these results as guidance for further study with other distributions, such as time-varying ones.

incremental checkpoint model for the exponential distribution. for a time between failures (tbf) that follows an exponential distribution, we substitute f(t) = λ·e^{−λt} and 1 − F(t) = e^{−λt}, t ≥ 0, λ > 0, in eq.(29).
the optimal frequency for the exponential distribution can then be written as

n_i(t) = √( (m+1)·k / (o_f + m·o_i) ) · √λ. (34)

we can find the i-th checkpoint placement: for i = 0, 1, 2, ... we have

∫_{t_i}^{t_{i+1}} √( (m+1)·k / (o_f + m·o_i) ) · √λ dt = 1. (35)

solving eq.(35) gives

t_{i+1} = √( (o_f + m·o_i) / ((m+1)·k) ) · √(1/λ) + t_i. (36)

using induction with t_0 = 0, the sequence of optimal checkpoint placements for the exponential distribution with failure rate λ is given by

t_i = i · √( (o_f + m·o_i) / ((m+1)·k) ) · √(1/λ), k ∈ (0, 1), (37)

where t_i is the i-th checkpoint placement, either a full or an incremental checkpoint. in eq.(37), the number of incremental checkpoints can be obtained from eq.(33), and the failure probability p_i can be obtained analytically. note that the checkpoint interval (t_{i+1} − t_i) is constant and equal to √((o_f + m·o_i)/((m+1)·k)) · √(1/λ). from eq.(33), the probability p_i of failure during the (t_i, t_{i+1}) interval is

p_i = p(t_f < t_{i+1} | t_f > t_i) = (F(t_{i+1}) − F(t_i)) / (1 − F(t_i)). (38)

for the exponential distribution, the cdf is F(t) = 1 − e^{−λt}, and we have

p_i = ( (1 − e^{−λ·t_{i+1}}) − (1 − e^{−λ·t_i}) ) / ( 1 − (1 − e^{−λ·t_i}) ). (39)

since the checkpoint interval is constant and t_{i+1} − t_i = t_1 − t_0 = t_1, we have

p_i = 1 − e^{−λ·t_1}. (40)

in practice, we have to find k (the re-computing time coefficient) and m at the same time.

algorithm to find k and m:
step 1: initialize o_f, o_i, r_i, and λ. assume an initial k̂ ∈ (0, 1) and set t_0 = 0.
// find m corresponding to k̂
step 2: let m̂ = 0.
step 3: calculate t_1 from eq.(36).
step 4: calculate p_i from eq.(38).
step 5: if m̂ < (o_f − o_i)/(p_i·r_i) − 1, then set m̂ = m̂ + 1 and go to step 3; else go to step 6.
step 6: set m = m̂ − 1. // finished finding m
step 7: calculate the checkpoint sequence t_1, t_2, ..., t_n corresponding to k̂ and m from eq.(36).
step 8: calculate k̄ from eqs. (13)-(15) using the sequence in step 7.
step 9: if k̂ = k̄, then set k = k̂; done. else set k̂ = k̄ and go to step 2.
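the joint search for k and m can be sketched as follows. this is our illustrative reading of the algorithm under hypothetical parameter values, not the authors' implementation; it iterates m until inequality (32) holds with p_i from eq.(40), then updates k via the exponential excess-life formula of eq.(22):

```python
import math

def t1(m, k, o_f, o_i, lam):
    """first placement for the incremental model, eq.(36) with t_0 = 0."""
    return math.sqrt((o_f + m * o_i) / ((m + 1) * k)) / math.sqrt(lam)

def find_m(k, o_f, o_i, r_i, lam, m_max=10_000):
    """smallest m satisfying inequality (32), with p_i = 1 - exp(-lam*t1)
    from eq.(40); capped at m_max as a safety bound."""
    m = 0
    while m < m_max:
        p = 1.0 - math.exp(-lam * t1(m, k, o_f, o_i, lam))
        if m >= (o_f - o_i) / (p * r_i) - 1.0:
            return m
        m += 1
    return m_max

def kbar_from_interval(T, lam):
    """eq.(14) for a constant interval T under exponential failures,
    using e[t_re] from eq.(22)."""
    e_tre = 1.0 / lam - T * math.exp(-lam * T) / (1.0 - math.exp(-lam * T))
    return e_tre / T

def find_k_and_m(o_f, o_i, r_i, lam, tol=1e-6, max_iter=1000):
    """alternate between finding m for the current k and refreshing k."""
    k = 0.5
    for _ in range(max_iter):
        m = find_m(k, o_f, o_i, r_i, lam)
        k_new = kbar_from_interval(t1(m, k, o_f, o_i, lam), lam)
        if abs(k_new - k) < tol:
            return k_new, m
        k = k_new
    return k, find_m(k, o_f, o_i, r_i, lam)

# hypothetical values, in hours: o_f = 2 min, o_i = 0.2 min, r_i = 1 min, mtbf = 23 h
k, m = find_k_and_m(o_f=2 / 60, o_i=0.2 / 60, r_i=1 / 60, lam=1 / 23)
print(k, m)
```

since both sides of inequality (32) move as m grows, the inner loop simply walks m upward until the inequality first holds, which terminates because the right side is bounded.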
4 comparisons between the full checkpoint model and the incremental checkpoint model

in this section, we expect the waste time of the incremental checkpoint model to be less than that of the full checkpoint model. as discussed before, the incremental checkpoint scheme aims to reduce the total checkpoint overhead; in contrast, it introduces a more expensive recovery cost than the full checkpoint mechanism. therefore, in order to provide evidence for this expectation, we compare the waste times of both models based on actual failure data of the white and frost hpc systems. white and frost were ibm supercomputers at lawrence livermore national laboratory. white consisted of 512 nodes with 16 processors per node (8192 processors in total), and frost consisted of 68 nodes with 16 processors per node (1088 processors in total). although white and frost were retired on july 26, 2006 and june 30, 2005, respectively, we chose both systems to analyze because we have their failure timestamps over a period of four years, from 2000 to 2004. before running the simulations, we process both failure data sets (white and frost) as follows. first, we break the failure timestamps into time windows of one month, obtaining 52 sets of failure times for the white system and 50 sets for the frost system. second, for each data set we calculate the times between failures (tbfs) and the mean time between failures (mtbf). next, the kolmogorov-smirnov test is used to test the null hypothesis that the data sets follow the exponential distribution with the corresponding mtbf, at a significance level of 0.1. after the test, there are 33 sets of tbfs of white and 42 sets of tbfs of frost that follow the exponential distribution. these are the samples we use in the simulations for both systems.
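the data-preparation step can be sketched as below. the timestamps here are synthetic (the llnl failure logs are not reproduced in this paper), the function names are ours, and the test uses the common large-sample critical value 1.224/√n for significance level 0.1; note that fitting the mtbf from the same sample makes this standard critical value only approximate (a lilliefors-style correction would be stricter):

```python
import math
import random

def ks_exponential(tbfs, crit_coeff=1.224):
    """one-sample kolmogorov-smirnov test of tbf data against an exponential
    distribution whose mean is the sample mtbf; returns (D, accepted) using
    the large-sample critical value crit_coeff / sqrt(n)."""
    n = len(tbfs)
    mtbf = sum(tbfs) / n
    d = 0.0
    for i, x in enumerate(sorted(tbfs)):
        cdf = 1.0 - math.exp(-x / mtbf)          # fitted exponential cdf
        # empirical cdf jumps from i/n to (i+1)/n at x; check both sides
        d = max(d, abs(cdf - i / n), abs(cdf - (i + 1) / n))
    return d, d <= crit_coeff / math.sqrt(n)

# synthetic one-month window of failure timestamps (hours), hypothetical
random.seed(1)
timestamps, t = [], 0.0
while t < 30 * 24:
    t += random.expovariate(1.0 / 23.0)          # mtbf of 23 hours
    timestamps.append(t)
tbfs = [b - a for a, b in zip(timestamps, timestamps[1:])]
d_stat, follows_exp = ks_exponential(tbfs)
print(len(tbfs), d_stat, follows_exp)
```

in practice a library routine such as scipy's one-sample ks test could replace the hand-rolled statistic; the sketch keeps to the standard library so the windowing and acceptance logic stay visible.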
furthermore, the mtbf of the white samples is around 23 hours, and the mtbf of the frost samples is around 70 hours. an objective of the simulations is to study the ratio between the waste time and the application completion time, denoted rwc. the application completion time is the time from the beginning of the application computation to its completion, including the checkpoint overhead, recovery time, and re-computing time. rwc can be used as a metric to evaluate the efficiency of the models: if rwc approaches 0, most of the application execution time is used to compute the application, which indicates that the performance of the proposed models is likely good; otherwise, most of the application execution time is waste time. however, there is as yet no standard or threshold to indicate the goodness of a checkpoint model. to simulate the waste time of both models, the following procedure is applied to each tbf data set. we determine the checkpoint sequences for the full and incremental checkpoint models, and then, for each sequence of tbfs, we compute the waste times for different completion time values over the range of 0 to 30 days with an increment of 1 day. next, we calculate the rwc for each completion time value and average the rwc over the completion times. four experiments are conducted with different values of the full checkpoint overhead (0.5, 2, 10, and 30 minutes). for each experiment, we calculate the average rwc for the full checkpoint model and for the incremental checkpoint model with 3 different incremental checkpoint overheads: 10%, 50%, and 90% of the full checkpoint overhead. therefore, we have 16 average rwc values in total for each of the white and frost systems, as shown in figure 7 and figure 8, respectively.
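the core of such a simulation can be sketched for the full checkpoint model as follows. this is our simplified monte-carlo illustration with hypothetical parameters, not the authors' simulator: it uses the constant exponential-optimal interval of eq.(19), draws a fresh exponential failure time per segment (valid by memorylessness), and ignores failures during checkpoint writes for simplicity:

```python
import math
import random

def simulate_rwc(lam, o_f, k, r_f, total_work, seed=0):
    """monte-carlo estimate of rwc = waste / completion_time for the full
    checkpoint model; on each failure the run rolls back to the last
    checkpoint, losing the partial segment and paying the recovery time."""
    rng = random.Random(seed)
    tau = math.sqrt(o_f / k) / math.sqrt(lam)    # interval from eq.(19)
    done, waste = 0.0, 0.0
    while done < total_work:
        segment = min(tau, total_work - done)
        fail_at = rng.expovariate(lam)           # memoryless: fresh draw
        if fail_at < segment:
            waste += fail_at + r_f               # lost partial work + recovery
        else:
            done += segment
            if done < total_work:
                waste += o_f                     # overhead of the checkpoint
    return waste / (total_work + waste)

# hypothetical white-like parameters, in hours: mtbf 23 h, o_f = 2 min, r_f = 10 min
rwc = simulate_rwc(lam=1 / 23, o_f=2 / 60, k=0.5, r_f=10 / 60, total_work=30 * 24)
print(rwc)
```

sweeping `total_work` from 1 to 30 days and averaging the resulting rwc values reproduces the shape of the experiment described above for one overhead setting.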
figure 7: white: average rwcs of the full checkpoint model and of the incremental checkpoint model for full checkpoint overheads of 0.5, 2, 10, and 30 minutes and incremental checkpoint overheads of 10%, 50%, and 90% of the full checkpoint overhead.

figure 8: frost: average rwcs of the full checkpoint model and of the incremental checkpoint model for full checkpoint overheads of 0.5, 2, 10, and 30 minutes and incremental checkpoint overheads of 10%, 50%, and 90% of the full checkpoint overhead.

according to figure 7 and figure 8, for every value of the full checkpoint overhead we notice that the proportions of the rwcs of the incremental checkpoint model do not follow the ratio between the incremental checkpoint overhead and the full checkpoint overhead. for example, when the incremental checkpoint overhead is 10% of the full checkpoint overhead, the average rwc of the incremental checkpoint model is approximately half the average rwc of the full checkpoint model. in practice, we expect the waste time of the incremental checkpoint scheme to be less than that of the full checkpoint scheme, but the graphs show that, when the incremental checkpoint overhead is 90% of the full checkpoint overhead, the average rwc of the incremental checkpoint scheme is slightly greater than that of the full checkpoint scheme. moreover, the behavior of the rwc in the white and frost systems is very similar although the mtbfs of the two systems are notably different; this suggests that our model may give similar results on other systems. additionally, the full checkpoint model for the weibull distribution has been compared with the risk-based model [10] in [21].
in the risk-based model, a checkpoint is performed if the waste time when the checkpoint is performed is less than the waste time when it is not. in [10] there are three other models, but the risk-based model was the only one chosen for comparison because it is the best among the models in [10]. according to [21], for different values of checkpoint overhead, the full checkpoint model produced less waste time than the risk-based model, except when the full checkpoint overhead equals 1 hour; in that case, the waste times of both models were very similar.

5 conclusions

in this paper, we have presented near-optimal checkpoint/restart models for the full and incremental checkpoint schemes in a large-scale hpc environment. in these models, the time sequence of checkpoint placements was derived using the theory of a stochastic renewal reward process. the models are general and can be applied to any distribution of time between failures; the given example is for the exponential distribution. the re-computing time, which relates directly to the failure time, is an important factor in a checkpoint/restart model. instead of using the re-computing time itself, we introduced the re-computing time coefficient (k), an estimation approach for it, and an algorithm to estimate k. for the incremental checkpoint model, there is another significant factor, the number of incremental checkpoints between two consecutive full checkpoints (m). the derived m ensures that the expected waste time of performing a full checkpoint after m incremental checkpoints is less than that of performing the (m+1)-th incremental checkpoint. the proposed algorithm provides not only the evaluation of m but also that of the re-computing time coefficient. for the model analysis, the failure data of white and frost, supercomputing systems owned by llnl, were used to simulate the waste times of both proposed models.
the comparisons between the two models were provided. in most cases, the waste times of the incremental checkpoint model are less than those of the full checkpoint model, especially when the incremental checkpoint overhead is much less than the full checkpoint overhead. furthermore, the proportion of the waste times of the incremental and full checkpoint models does not seem to relate to the ratio of the incremental and full checkpoint overheads. lastly, the results for the white and frost systems are very similar, which may indicate that the proposed models give similar results for other systems.

6 future work

in the near future, we will extend the incremental checkpoint model to the case where the number of incremental checkpoints between two consecutive full checkpoints is not constant. we expect the extended model to give less waste time than the proposed one. moreover, we will improve the method of evaluating the number of incremental checkpoints between two consecutive full checkpoints to yield optimality. in addition, we will work on improving the models. for example, in some applications there is much communication between nodes; if one performs a checkpoint while a large amount of communication is going on, the checkpoint overhead will be expensive. therefore, the communication or i/o transfer rate may be another factor to consider when performing a checkpoint.

acknowledgements. work supported in part by nsf grant no. 0834483, doe grant no. de-fg02-08er25836 and the national plan ii of romania rp5 grant.

400 n. naksinehaboon, m. păun, r. nassar, b. leangsuksun, s. scott

bibliography

[1] a.r. adiga, g. almasi, et al., an overview of the bluegene/l supercomputer, proceedings of supercomputing, ieee/acm conference, pp. 60-60, 2002.
[2] j.t. daly, a model for predicting the optimum checkpoint interval for restart dumps, iccs 2003, lncs 2660, volume 4, pp. 3-12, 2003.
[3] j.t.
daly, a higher order estimate of the optimum checkpoint interval for restart dumps, future generation computer systems, elsevier, amsterdam, 2004.
[4] e. elnozahy, j. plank, checkpointing for peta-scale systems: a look into the future of practical rollback-recovery, ieee transactions on dependable and secure computing, vol. 1, no. 2, pp. 97-108, 2004.
[5] r. geist, r. reynolds, and j. westall, selection of a checkpoint interval in a critical-task environment, ieee transactions on reliability, vol. 37, no. 4, pp. 395-400, 1988.
[6] i. m. gelfand and s. v. fomin, calculus of variations, dover publications, isbn-10: 0486414485, 2000.
[7] l. hancu, data-mining techniques for supporting merging decisions, int. j. of computers, communications and control, vol. iii (2008), pp. 322-326.
[8] d. i. hunyadi, m. a. musan, modelling of the distributed databases. a viewpoint mechanism of the mvdb model's methodology, int. j. of computers, communications and control, vol. iii (2008), pp. 327-332.
[9] y. ling, j. mi, and x. lin, a variational calculus approach to optimal checkpoint placement, ieee transactions on computers, vol. 50, no. 7, pp. 699-707, 2001.
[10] a.j. oliner, l. rudolph, and r.k. sahoo, cooperative checkpointing: a robust approach to large-scale systems reliability, proceedings of the 20th annual international conference on supercomputing (ics), australia, pp. 14-23, 2006.
[11] t. ozaki, t. dohi, and h. okamura, distribution-free checkpoint placement algorithms based on min-max principle, ieee transactions on dependable and secure computing, volume 3, issue 2, pp. 130-140, 2006.
[12] a. c. palaniswamy and p. a. wilsey, an analytical comparison of periodic checkpointing and incremental state saving, proc. of the seventh workshop on parallel and distributed simulation, california, pp. 127-134, 1993.
[13] j.s. plank, j. xu, and r.h. netzer,
compressed differences: an algorithm for fast incremental checkpointing, technical report cs-95-302, university of tennessee at knoxville, 1995.
[14] j.s. plank, m.a. thomason, the average availability of parallel checkpointing systems and its importance in selecting runtime parameters, the 29th international symposium on fault-tolerant computing, madison, wi, pp. 250-259, 1999.
[15] s.m. ross, stochastic processes, wiley, 2nd edition, isbn-10: 0471120626, 1995.
[16] b. schroeder, g.a. gibson, a large-scale study of failures in high-performance computing systems, proceedings of international symposium on dependable systems and networks (dsn), ieee computer society, pp. 249-258, 2006.
[17] a. tikotekar, c. leangsuksun, s. scott, on the survivability of standard mpi applications, in proceedings of 7th lci international conference on linux clusters: the hpc revolution, 2006.
[18] n.h. vaidya, a case for two-level distributed recovery schemes, in proceedings of the 1995 acm sigmetrics joint international conference on measurement and modeling of computer systems, pp. 64-73, 1995.
[19] k.f. wong, m.a. franklin, distributed computing systems and checkpointing, hpdc, pp. 224-233, 1993.
[20] j.w. young, a first-order approximation to the optimum checkpoint interval, communications of acm, volume 17, issue 9, pp. 530-531, 1974.
[21] y. liu, r. nassar, c.b. leangsuksun, n. naksinehaboon, m. paun, and s.l. scott, an optimal checkpoint/restart model for a large scale high performance computing system, in proc. international parallel and distributed processing symposium, (miami, florida, 2008), pp. 1-9, 2008.
[22] d. zmaranda, g. gabor, issues on optimality criteria applied in real-time scheduling, int. j. of computers, communications and control, issue 3, pp. 536-540, 2008.

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. iv (2009), no. 3, pp.
206-213

dictionary search and update by p systems with string-objects and active membranes

artiom alhazov, svetlana cojocaru, ludmila malahova, yurii rogozhin

artiom alhazov, svetlana cojocaru, ludmila malahova, yurii rogozhin: institute of mathematics and computer science, academy of sciences of moldova, academiei 5, chişinău md-2028 moldova, e-mail: {artiom,sveta,mal,rogozhin}@math.md
artiom alhazov: iec, department of information engineering, graduate school of engineering, hiroshima university, higashi-hiroshima 739-8527 japan
yurii rogozhin: rovira i virgili university, research group on mathematical linguistics, av. catalunya, 35, tarragona 43002 spain, e-mail: yrogozhin@yahoo.com

received: april 5, 2009 accepted: may 30, 2009

abstract: membrane computing is a formal framework of distributed parallel computing. in this paper we implement work with a prefix tree by p systems with strings and active membranes. we present algorithms for searching in a dictionary and updating it, implemented as membrane systems. the systems are constructed as reusable modules, so they are suitable for use as sub-algorithms in solving more complicated problems.
keywords: membrane computing, p systems, active membranes, dictionary, prefix tree

1 introduction

solving most problems of natural language processing is based on using certain linguistic resources, represented by corpora, lexicons, etc. usually, these collections of data constitute an enormous volume of information, so processing them requires considerable computational resources. a reasonable approach for obtaining efficient solutions is one based on applying parallelism; this idea has been promoted since the 1970s. for instance, the possibilities of applying massive parallelism in machine translation are considered in [5, 2]. many of the stages of text processing (from tokenization, segmentation and lemmatizing to those dealing with natural language understanding) can be carried out by parallel methods.
this justifies the interest in the methods offered by biologically inspired models, and by membrane computing in particular. however, there are some issues that by their nature do not allow complete parallelization, yet it is exactly these that are often the "computational primitives" inevitably used in solving major problems, just as the elementary arithmetic operations are always present in solving difficult computational problems. among such "primitives" in computational linguistics we mention the handling of dictionaries, e.g., dictionary lookup and dictionary update. exactly these problems constitute the subject of the present paper. in our approach, the dictionary is represented by a prefix tree. p (membrane) systems are a convenient framework for describing computations on trees: since membrane systems are an abstraction of living cells, the membranes are arranged hierarchically, yielding a tree structure.

copyright c© 2006-2009 by ccc publications

2 definitions

membrane computing is a recent domain of natural computing initiated by gh. păun in [12]. the components of a membrane system are a cell-like membrane structure, in the regions of which one places multisets of objects which evolve in a synchronous maximally parallel manner according to given evolution rules associated with the membranes. the necessary definitions are given in the following subsection; see also [4] for an overview of the domain and [6] for a comprehensive bibliography.

2.1 computing by p systems

let o be a finite set of elements called symbols; the set of words over o is denoted by o∗, and the empty word is denoted by λ.

definition 1.
a p system with string-objects and input is a tuple

π = (o, σ, h, e, µ, m1, . . . , mp, r, i0), where:

• o is the working alphabet of the system (the objects are strings over o),
• σ is an input alphabet,
• h is an alphabet whose elements are called labels; i0 identifies the input region,
• e is the set of polarizations,
• µ is a membrane structure (a rooted tree) consisting of p membranes injectively labeled by elements of h,
• mi is an initial multiset of strings over o associated with membrane i, 1 ≤ i ≤ p,
• r is a finite set of rules defining the behavior of objects from o∗ and of membranes labeled by elements of h.

a configuration of a p system is its "snapshot", i.e., the current membrane structure and the multisets of string-objects present in the regions of the system. the initial configuration is c0 = (µ, m1, . . . , mp). each subsequent configuration c′ is obtained from the previous configuration c by maximally parallel application of rules to objects and membranes; this is denoted by c ⇒ c′ (no further rules are applicable together with the rules that transform c into c′). a computation is thus a sequence of configurations starting from c0, respecting the relation ⇒ and ending in a halting configuration (i.e., one such that no rules are applicable). if m is a multiset of strings over the input alphabet σ ⊆ o, then the initial configuration of a p system π with input m over alphabet σ and input region i0 is (µ, m1, . . . , mi0−1, mi0 ∪ m, mi0+1, . . . , mp).

2.2 p systems with active membranes

to speak about p systems with active membranes, we need to specify the rules, i.e., the elements of the set r in the description of a p system.
due to the nature of the problem of this paper, the standard model was generalized in the following ways:

• cooperative rules: a rule operates on a substring of an object (otherwise, the system cannot even distinguish different permutations of a string); this feature is represented by a superscript * in the rule types;
• string replication (to return the result without removing it from the dictionary);
• membrane creation (to add words to the dictionary).

hence, the rules can be of the following forms:

(a∗) [ a → b ]_h^e for h ∈ h, e ∈ e, a, b ∈ o∗: evolution rules (associated with membranes and depending on the label and the polarization of the membranes, but not directly involving the membranes: the membranes are neither taking part in the application of these rules nor are they modified by them);
(a∗r) [ a → b || c ]_h^e for h ∈ h, e ∈ e, a, b, c ∈ o∗ (as above, but with string replication);
(b∗) a [ ]_h^e1 → [ b ]_h^e2 for h ∈ h, e1, e2 ∈ e, a, b ∈ o∗: communication rules (an object is introduced into the membrane, possibly modified; the polarization of the membrane can be modified, but not its label);
(c∗) [ a ]_h^e1 → [ ]_h^e2 b for h ∈ h, e1, e2 ∈ e, a, b ∈ o∗: communication rules (an object is sent out of the membrane, possibly modified; also the polarization of the membrane can be modified, but not its label);
(d∗) [ a ]_h^e → b for h ∈ h, e ∈ e, a, b ∈ o∗: dissolving rules (in reaction with an object, a membrane can be dissolved, while the object specified in the rule can be modified);
(g∗) [ a → [ b ]_g^e2 ]_h^e1 for g, h ∈ h, e1, e2 ∈ e, a, b ∈ o∗: membrane creation rules (an object is moved into a newly created membrane, possibly modified).

additionally, we will write ∅ in place of some strings on the right-hand side of the rules, meaning that the entire string is deleted.
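as an informal illustration of what the cooperative rule types (a∗) and (a∗r) do to a string-object, the substring-rewriting step can be sketched in ordinary code. the function names and the restriction to the leftmost occurrence are illustrative assumptions for this sketch, not part of the paper's formalism.

```python
# a minimal sketch (illustrative, not from the paper) of a cooperative
# evolution rule acting on a string-object: rewrite one occurrence of a
# substring. an empty right-hand side models deletion of the matched part.

def apply_evolution_rule(s, lhs, rhs):
    """type (a*): rewrite the leftmost occurrence of lhs in s by rhs;
    return None if the rule is not applicable to this string."""
    i = s.find(lhs)
    if i < 0:
        return None
    return s[:i] + rhs + s[i + len(lhs):]

def apply_replication_rule(s, lhs, rhs1, rhs2):
    """type (a*_r): rewrite lhs and replicate the string into two copies,
    one carrying rhs1 and the other carrying rhs2."""
    i = s.find(lhs)
    if i < 0:
        return None
    return (s[:i] + rhs1 + s[i + len(lhs):],
            s[:i] + rhs2 + s[i + len(lhs):])
```

for example, a replication rule applied to a stored translation string splits it into two marked copies, which is the mechanism a rule such as s3 in section 3.1 relies on (the marker labels used there are then rewritten separately for the outgoing and the kept copy).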
the rules of types (a∗), (a∗r) and (g∗) are considered to involve only objects, while all other rules are assumed to involve the objects and membranes mentioned in their left-hand side. an application of a rule consists in replacing a substring described in the left-hand side of the rule in a string in the corresponding region (i.e., associated to a membrane with label h and polarization e for rules of types (a∗), (a∗r) and (d∗), associated to a membrane with label h and polarization e1 for rules of type (c∗), or immediately outside such a membrane for rules of type (b∗)) by a string described in the right-hand side of the rule, moving the string to the corresponding region (which can be the source region itself, the immediately inner one or the immediately outer one, depending on the rule type), and updating the membrane structure accordingly if needed (changing the membrane polarization, creating or dissolving a membrane). only rules involving different objects and membranes can be applied in parallel; such parallelism is maximal if no further rules are applicable in parallel.

3 dictionary

dictionary search amounts to computing a string-valued function {ui −→ vi | 1 ≤ i ≤ d} defined on a finite set of strings. we represent such a dictionary by the skin membrane containing the membrane structure corresponding to the prefix tree of {ui | 1 ≤ i ≤ d}, with strings $vi$′ in the regions corresponding to the nodes associated to ui. let a, a′ be the source and target alphabets: ui ∈ a∗, vi ∈ a′∗, 1 ≤ i ≤ d. for technical reasons, we assume that for every l ∈ a, the skin contains a membrane with label l. we also suppose that the source words are non-empty.
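before the membrane construction, it may help to see the same prefix-tree dictionary in conventional form. the sketch below is an assumption-level illustration, not the paper's p system: trie nodes play the role of nested membranes, and the values stored in a node play the role of the $v$′ strings in the corresponding region.

```python
# a conventional (non-membrane) sketch of the prefix-tree dictionary:
# one child per letter, like one inner membrane per label; values kept
# in a node correspond to the $v$' strings of that region. all names
# here are illustrative assumptions.

class TrieNode:
    def __init__(self):
        self.children = {}   # letter -> TrieNode (membrane with that label)
        self.values = []     # translations stored in this node's region

def update(root, u, v):
    """add the entry u -> v, creating missing nodes (membranes) on the way."""
    node = root
    for letter in u:
        node = node.children.setdefault(letter, TrieNode())
    node.values.append(v)    # multi-valued dictionaries are allowed

def search(root, u):
    """return all translations of u, or None on failure."""
    node = root
    for letter in u:
        if letter not in node.children:
            return None      # target location unreachable
        node = node.children[letter]
    return node.values or None   # reachable location but no stored value

root = TrieNode()
update(root, "bat", "flying")
update(root, "bit", "stored")
```

the two failure branches in `search` correspond to the cases b) and c) distinguished by the "search with fail" construction of section 3.2.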
for instance, the dictionary {bat −→ flying, bit −→ stored} is represented by

[ [ ]_a^0 [ [ [ $flying$′ ]_t^0 ]_a^0 [ [ $stored$′ ]_t^0 ]_i^0 ]_b^0 [ ]_c^0 ··· [ ]_z^0 ]_1^0

consider a p system corresponding to the given dictionary: π = (o, σ, h, e, µ, m1, ··· , mp, r, i0), with
o = a ∪ a′ ∪ {?, ?′, $, $′, $1, $2, fail} ∪ {?i |  ≤ i ≤ } ∪ {!i |  ≤ i ≤ },
σ = a ∪ a′ ∪ {?, ?′, !, $, $′},
h = a ∪ {1}, e = {0, +, −}, i0 = 1,
and µ and the sets mi, 1 ≤ i ≤ p, are defined as described above. so only the rules and the input semantics still have to be defined.

3.1 dictionary search

to translate a word u, input the string ?u?′ in region 1. consider the following rules.

s1: ?l [ ]_l^0 → [ ? ]_l^0, l ∈ a. propagation of the input into the membrane structure, reaching the location corresponding to the input word.
s2: [ ??′ ]_l^0 → [ ]_l^− ∅, l ∈ a. marking the region corresponding to the source word.
s3: [ $ → $1 || $2 ]_l^−, l ∈ a. replicating the translation.
s4: [ $1 ]_l^e → [ ]_l^0 $, l ∈ h, e ∈ {−, 0}. sending one copy of the translation to the environment.
s5: [ $2 → $ ]_l^0, l ∈ a. keeping the other copy in the dictionary.

the system will send the translation of u into the environment. this is a simple example illustrating search. if the source word is not in the dictionary, the system will block without giving an answer. the following subsection shows a solution to this problem.

3.2 search with fail

the set of rules below is considerably more involved than the previous one. however, it handles 3 cases: a) the target word is found, b) the target word is missing in the target location, c) the target location is unreachable.

f1: [ ? → ? || ? ]_1^0. replicate the input.
f2: [ ? → ? ]_1^0. delay the second copy of the input for one step.
f3: ?l [ ]_l^0 → [ ? ]_l^+, l ∈ a. propagation of the first copy towards the target location, changing the polarization of the entered membrane to +.
f4: ?l [ ]_l^+ → [ ? ]_l^0, l ∈ a. propagation of the second copy towards the target location, restoring the polarization of the entered membrane.
f5: [ ?l → [ ? ]_l^− ]_k^0, l, k ∈ a. if a membrane corresponding to some symbol of the source word is missing, then the first copy of the input remains in the same membrane, while the second copy of the input restores its polarization; a membrane is created to handle the failure.
f6: [ ??′ → ? ]_l^0, l ∈ a. target location found, marking the first input copy.
f7: [ ? ]_l^0 → [ ]_l^− ∅, l ∈ a. marking the target location.

in either case, some membrane has polarization −. it remains to send the answer out, or fail if it is absent. the membrane should be deleted in the fail case.

f8: [ $ → $1 || $2 ]_l^−, l ∈ a. replicating the translation.
f9: [ $1 ]_l^e → [ ]_l^0 $, l ∈ h, e ∈ {0, −}. sending one copy of the translation out.
f10: [ $2 → $ ]_l^0, l ∈ a. keeping the other copy in the dictionary.
f11: [ ? → ? ]_l^−, l ∈ a. the second copy of the input will check whether the translation is available in the current region.
f12: ?l [ ]_l^− → [ ? ]_l^−, l ∈ a. the second copy of the input enters the auxiliary membrane with polarization −.

by now the second copy of the input is in the region corresponding either to the search word, or to its maximal prefix plus one (auxiliary) letter.

f13: [ ? → ? ]_l^−, l ∈ a. it waits for one step.
f14: [ ? → ∅ ]_l^0, l ∈ a. if the target word has been found, the second copy of the input is erased.
f15: [ ? ]_l^− → [ ]_l^0 ?, l ∈ a. if not, the search fails.
f16: [ ? ]_l^0 → [ ]_l^0 ?, l ∈ a. sending the fail notification to the skin.
f17: [ ?l → ? ]_1^0. erasing the remaining part of the source word.
f18: [ ??′ ]_1^0 → [ ]_1^0 fail. answering fail.
f19 [ ? → ? ] − l , l ∈ a f20 [ ? → ? ] − l , l ∈ a f21 [ ? → ? ] − l , l ∈ a if the target location was not found, the first input copy waits for 3 steps while the membrane with polarization − handles the second input copy. f22 [ ? ]  l → /0, l ∈ a erasing the auxiliary membrane. 3.3 dictionary update to add an entry u −→ v to the dictionary, input the string !u$v$′ in region 1. consider the following rules. u1 [ ! → !||! ]   replicate the input. u2 [ ! → ! ]   delay the second copy of the input for one step. u3 !l[ ]  l → [ ! ] + l , l ∈ a propagation of the first copy towards the target location, changing the polarization of the entered membrane to +. u4 !l[ ] + l → [ ! ]  l , l ∈ a propagation of the second copy towards the target location, restoring the polarization of the entered membrane. u5 [ ! → ! ]  l , l ∈ a if a membrane corresponding to some symbol of the source word is missing, then the first copy of the input remains in the same membrane, while the second copy of the input restores its polarization. marking the fist copy of the input for creation of missing membranes. u6 [ !l → [ ! ] + l ]  k , l, k ∈ a creating missing membranes. u7 [ !$ → $ ]  l , l ∈ a releasing the target word in the corresponding location. 212 artiom alhazov, svetlana cojocaru, ludmila malahova, yurii rogozhin u8 [ !$ → /0 ]  l , l ∈ a erasing the second copy of the input. we underline that the constructions presented above also hold in a more general case, i.e., when the dictionary is a multi-valued function. indeed, multiple translations can be added to the dictionary as multiple strings in the region associated to the input word. the search for a word with multiple translations will lead to all translations sent to the environment. the price to pay is that the construction is no longer deterministic, since the order of application of rules s4 or f9 to different translations is arbitrary. 
nevertheless, the constructions remain "deterministic modulo the order in which the translations are sent out". all constructions work in linear time with respect to the length of the input. the parallelism is vital for checking the absence of a needed submembrane, or for checking the absence of a translation of a given word; sending multiple translation results out is also parallel.

4 discussion

in this paper we presented linear-time algorithms for searching in a dictionary and updating it, implemented as membrane systems. we underline that the systems are constructed as reusable modules, so they are suitable for use as sub-algorithms in solving more complicated problems. the scope of handling dictionaries is not limited to dictionaries in the classical sense. understanding a dictionary as introduced in section 3, i.e., a string-valued function defined on a finite set of strings, leads to direct applicability of the proposed methods to handling alphabets, lexicons, thesauri, dictionaries of exceptions, and even databases. finally, it is natural to consider these algorithms together with a morphological analyzer and a morphological synthesizer.

acknowledgments

all authors gratefully acknowledge the support of the science and technology center in ukraine, project 4032. artiom alhazov gratefully acknowledges the support of the japan society for the promotion of science and the grant-in-aid for scientific research, project 20·08364. yurii rogozhin gratefully acknowledges the support of the european commission, project molcip, mif1-ct-2006-021666.

bibliography
[1] g. ciobanu, g. păun, m.j. pérez-jiménez eds., applications of membrane computing, springer-verlag, 2006.
[2] h. kitano, challenges of massive parallelism, proceedings of the 13th international joint conference on artificial intelligence, chambery, france, 1993, vol. 1, 813-834.
[3] gh. păun, computing with membranes, journal of computer and system sciences 61(1), 2000, 108-143.
[4] gh.
păun, membrane computing. an introduction, springer-verlag, 2002.
[5] e. sumita, k. oi, o. furuse, h. iida, t. higuchi, n. takahashi, h. kitano, example-based machine translation on massively parallel processors, proceedings of the 13th international joint conference on artificial intelligence, chambery, france, 1993, vol. 2, 1283-1289.
[6] p systems webpage. http://ppage.psystems.eu/.

artiom alhazov (born on october 11, 1979) graduated in mathematics and computer science (the state university of moldova, chişinău, moldova) and received a ph.d. in languages and information systems (rovira i virgili university, tarragona, spain). he is a researcher at the institute of mathematics and computer science of the academy of sciences of moldova. he completed a postdoc at åbo akademi university, turku, finland, and currently holds a postdoc position at hiroshima university, higashi-hiroshima, japan. his main research interests are theoretical computer science, formal language theory, parallel distributed computational models, and in particular the descriptional complexity of p systems with weak forms of interaction. he has published over 90 research papers (collaborating with more than 30 researchers from many countries in europe and asia). he won numerous prizes for programming during his school and university years, and the national youth prize in science, technics, literature and arts in 2006 for a collection of research works.

svetlana cojocaru (born on july 26, 1952) graduated in mathematics (the state university of moldova, chişinău, moldova, 1974), received a ph.d. in computer science (institute of cybernetics, ukrainian academy of sciences, kiev, 1982) and the degree of doctor habilitat in computer science (institute of mathematics and computer science, academy of sciences of moldova, 2007). she is a deputy director at the institute of mathematics and computer science of the academy of sciences of moldova.
her main research interests are formal languages and grammars, natural language processing, computer algebra, and molecular computing. she has published over 120 research papers.

ludmila malahova (born on july 22, 1947) graduated in computer science (the state university of moldova, chişinău, moldova, 1970). she is a researcher at the institute of mathematics and computer science of the academy of sciences of moldova. she has significant experience in computer science, including computer graphics, formal languages, computer algebra, natural language processing, and molecular computing, with more than 80 papers published in international journals, books and conference proceedings.

yurii rogozhin (born on november 13, 1949) graduated in mathematics (the kuban state university, krasnodar, russia, 1971), received a ph.d. in mathematical cybernetics (computer center of the russian academy of sciences, moscow, 1981) and the degree of doctor habilitat in computer science (moscow state university, russia, department of computational mathematics and cybernetics, 1999). he is a principal researcher at the institute of mathematics and computer science of the academy of sciences of the republic of moldova and a marie curie iif researcher at rovira i virgili university, research group on mathematical linguistics, tarragona, spain. his main research interests are mathematics, theoretical computer science, formal language theory and its applications, natural (biomolecular) computing and nanotechnology. he has published over 110 research papers (collaborating with more than 45 researchers from many countries in europe).

int. j. of computers, communications & control, issn 1841-9836, e-issn 1841-9844 vol. v (2010), no. 3, pp. 351-361

improving a svm meta-classifier for text documents by using naive bayes

d. morariu, r. creţulescu, l. vinţan

daniel morariu, radu creţulescu, lucian vinţan: "lucian blaga" university of sibiu, engineering faculty, computer science department, e. cioran street, no.
4, 550025 sibiu, romania, e-mail: {daniel.morariu,radu.kretzulescu,lucian.vintan}@ulbsibiu.ro

abstract: text categorization is the problem of classifying text documents into a set of predefined classes. in this paper, we investigated two approaches: a) developing a classifier for text documents based on naive bayes theory, and b) integrating this classifier into a meta-classifier in order to increase the classification accuracy. the basic idea is to learn a meta-classifier that optimally selects the best component classifier for each data point. the experimental results show that combining classifiers can significantly improve the classification accuracy and that our improved meta-classification strategy gives better results than each individual classifier. for reuters2000 text documents we obtained classification accuracies up to 93.87%.
keywords: meta-classification, support vector machine, naive bayes, text documents, performance evaluation

1 introduction

as more and more textual information becomes available online, effective retrieval is difficult without good indexing and summarization of document content. document categorization is one solution to this problem. the task of document categorization is to assign a user-defined categorical label to a given document. in recent years a growing number of categorization methods and machine learning techniques have been developed and applied in different contexts. documents are typically represented as vectors in a feature space. each word in the vocabulary is represented as a separate dimension. the number of occurrences of a word in a document represents the value of the corresponding component in the document's vector. in this paper we investigate some strategies for combining classifiers in order to improve the classification accuracy. we used classifiers based on support vector machine (svm) techniques and on naive bayes theory, respectively.
both are less vulnerable to degradation as the dimensionality of the feature space increases, and have been shown effective in many classification tasks. the svm classifiers are actually based on learning with kernels and support vectors. we combine multiple classifiers hoping that the classification accuracy can be improved without a significant increase in response time. instead of building only one highly accurate specialized classifier with much time and effort, we build and combine several simpler classifiers. several combination schemes have been described in the papers [2] and [6]. a usual approach is to build individual classifiers and later combine their judgments to make the final decision. another approach, which is not so commonly used because it suffers from the "curse of dimensionality" [5], is to concatenate features from each classifier to make a longer feature vector and use it for the final decision. in any case, meta-classification is effective only if the classifiers' synergy can be exploited. in previous studies, combination strategies were usually ad hoc, implementing strategies like majority vote, linear combination, winner-take-all [2], or bagging and adaboost [16]. some rather complex strategies have also been suggested; for example, in [4] a meta-classification strategy using svm [15] is presented and compared with probability-based strategies.

copyright c© 2006-2010 by ccc publications

sections 2 and 3 contain prerequisites for the work that we present in this paper. in section 4 we present the methodology used for our experiments. section 5 presents the experimental framework and section 6 presents the main results of our experiments. the last section discusses and concludes on the most important results obtained and proposes some further work.
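among the ad hoc combination strategies mentioned above, majority vote is the simplest to illustrate. the sketch below is an illustrative assumption, not the authors' implementation: the component classifiers are toy stand-ins for the svm and naive bayes classifiers discussed in this paper.

```python
# a minimal sketch of a majority-vote combiner over component classifiers;
# the classifier functions and class labels below are illustrative toys,
# not the authors' svm / naive bayes components.
from collections import Counter

def majority_vote(classifiers, x):
    """each component classifier predicts a class label for sample x;
    the meta-level returns the most frequent label."""
    votes = Counter(clf(x) for clf in classifiers)
    return votes.most_common(1)[0][0]

# three toy component classifiers over a "document vector" x
svm_like   = lambda x: "sport" if sum(x) > 2 else "politics"
bayes_like = lambda x: "sport" if x[0] > 0 else "politics"
rule_based = lambda x: "politics"
```

the meta-classifier studied in the paper goes further than this fixed scheme: instead of counting votes, it learns which component classifier to trust for each data point.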
2 support vector machine

the support vector machine (svm) is a classification technique based on statistical learning theory [13], [15] that has been applied with great success to many challenging non-linear classification problems and to large data sets. the svm algorithm finds a hyperplane that optimally splits the training set. the optimal hyperplane can be distinguished by the maximum margin of separation between all training points and the hyperplane. looking at a two-dimensional problem, we actually want to find a line that "best" separates points in the positive class from points in the negative class. the hyperplane is characterized by a decision function of the form:

f(x) = sign(〈~w, φ(x)〉 + b)    (1)

where ~w is the weight vector, orthogonal to the hyperplane, "b" is a scalar that represents the hyperplane's margin, "x" is the current sample tested, "φ(x)" is a function that transforms the input data into a higher-dimensional feature space and 〈·,·〉 represents the dot product. sign is the sign function. if ~w has unit length, then 〈~w, φ(x)〉 is the length of φ(x) along the direction of ~w. generally ~w will be scaled by ‖~w‖. in the training part the algorithm needs to find the normal vector "~w" that leads to the largest "b" of the hyperplane.

3 naive bayes

the bayes classifier uses bayes' theorem, which computes the probability of a given class based on the probability of a given term belonging to the specified class. the classifier computes the probability of a document belonging to a given class. bayesian theory works as a framework for making decisions under uncertainty, a probabilistic approach to inference [4], and is particularly suited when the dimensionality of the input data is high. bayes theorized that the probability of future events could be calculated by determining their earlier frequency.
bayes' theorem states that:

p(y = yi | x = xk) = p(y = yi) p(x = xk | y = yi) / p(x = xk)    (2)

where: p(y = yi) is the prior probability of hypothesis y (prior); p(x = xk) is the prior probability of the training data x (evidence); p(x = xk | y = yi) is the probability of x given y (likelihood); p(y = yi | x = xk) is the probability of y given x (posterior probability). the naive bayes classifier is based on the simplifying assumption that the attribute values are conditionally independent given the target value. in other words, the assumption is that, given the target value of the instance, the probability of observing the conjunction y1, y2, . . . , yn is just the product of the probabilities for the individual attributes:

cmap = argmax over yi of p(y = yi) ∏k p(xk | y = yi)

a > b means that a − b is positive definite. aᵀ denotes the transpose of a. ⋆ stands for symmetric blocks; • stands for an element that has no influence on the development.

2 system description

the structure of a variable speed, pitch regulated wind energy conversion system is presented in fig.1. the system is formed by the rotor, the mechanical structure, and by a generator unit, composed of the generator and the static converter connected to the electrical grid. the control system acts on the generator in order to apply the reference electromagnetic torque tg,ref, and on the pitch actuator in order to control the pitch angle of the blades β, calculated from the measurements of the rotational speed of the shaft at the generator side and of the flexion speed of the tower, measured by an accelerometer located at the tower top. the effective wind speed v(t) passing through the rotor is considered as a first-order dynamic process disturbed by an exogenous signal mv(t):

v̇ = −(1/tv) v + mv(t)    (1)

with the time constant tv calculated from the stochastic properties of the wind speed [8].
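the first-order wind model (1) can be simulated directly with an explicit euler step. the time constant, step size and gaussian disturbance below are illustrative assumptions, not values from the paper.

```python
# a simulation sketch of the first-order wind model (1),
# v_dot = -(1/t_v) v + m_v(t), with an illustrative gaussian disturbance
# m_v; t_v, dt and sigma are assumed values, not from the paper.
import random

def simulate_wind(v0, t_v, dt, steps, sigma=1.0, seed=0):
    """integrate the wind model with an explicit euler scheme;
    returns the trajectory [v(0), v(dt), ..., v(steps*dt)]."""
    rng = random.Random(seed)
    v = v0
    out = [v]
    for _ in range(steps):
        m_v = rng.gauss(0.0, sigma)      # exogenous disturbance m_v(t)
        v += dt * (-v / t_v + m_v)       # euler step of equation (1)
        out.append(v)
    return out
```

with sigma set to zero the disturbance vanishes and the model reduces to a pure first-order decay, which makes the role of the time constant t_v easy to check.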
the mechanical model describing the structure of the plant has three degrees of freedom: the flexion of the blades (blade flap motion) and of the tower (tower fore-aft motion) in the direction of the wind, and the torsion of the drive train shaft (fig.2). a spring-damper representation is used to describe the flexibility of each component. moreover, the three blades are supposed to move conjointly and to be affected by the same forces at the same time. a linear model of the dynamic behavior of this structure is established using lagrange's equations. the electrical subsystem, corresponding to the generation unit composed of the generator and the power electronic components, has very fast dynamics compared with the dynamics of the other subsystems. consequently, and considering the study objectives, the electrical dynamics are neglected. hence, the electromagnetic torque tg is assumed equal to its reference tg,ref.

switching lpv controllers for a variable speed pitch regulated wind turbine

figure 1: wind energy conversion system structure.
figure 2: wind turbine mechanical structure.

the pitch actuator subsystem represents the hydraulic or electric system which makes the blades revolve around their lengthwise axis. this system is described by a first-order transfer function with a time constant tβ. the aerodynamic conversion process of the turbine rotor is characterized by the extracted torque taero and by the out-of-plane thrust force faero, which are functions of the air mass density ρ, the wind velocity v, the rotational speed of the turbine ωt, the horizontal blade speed ẋp and the power and thrust coefficients cp and ct:

taero = (1/2) ρ π r² (v + ẋp)³ cp(λ, β) / ωt,
faero = (1/2) ρ π r² (v + ẋp)² ct(λ, β),    (2)

with r the length of the rotor blades. the aerodynamic coefficients cp and ct are non-linear functions depending on the blade pitch angle β and the tip speed ratio λ = ωt r / v.
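the aerodynamic relations (2) can be evaluated numerically once cp and ct are known. in the sketch below the coefficient values passed in are illustrative placeholders, since cp(λ, β) and ct(λ, β) are nonlinear maps specific to the actual turbine.

```python
# a numerical sketch of the aerodynamic relations (2); the cp and ct
# values supplied by the caller are illustrative placeholders for the
# turbine's nonlinear coefficient maps cp(lambda, beta), ct(lambda, beta).
import math

def aero_torque_thrust(rho, r, v, x_p_dot, omega_t, cp, ct):
    """extracted torque t_aero and out-of-plane thrust f_aero, eq. (2)."""
    v_rel = v + x_p_dot                          # wind speed seen by the rotor
    area = math.pi * r**2                        # rotor disc area, pi * r^2
    t_aero = 0.5 * rho * area * v_rel**3 * cp / omega_t
    f_aero = 0.5 * rho * area * v_rel**2 * ct
    return t_aero, f_aero

def tip_speed_ratio(omega_t, r, v):
    """lambda = omega_t * r / v."""
    return omega_t * r / v
```

note that the thrust scales linearly with ct, so uncertainty in the thrust coefficient maps directly onto the tower loading discussed in the control objectives.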
fabien lescher, jing-yun zhao, pierre borne

3 control task

the controller objectives are to ensure:
• stability along the reference trajectory for the whole operation,
• good performance of the selected outputs, i.e. energy conversion and alleviation of the mechanical loads affecting the plant structure.

the wind turbine operation area can be divided into three zones, depending on the wind speed acting on the blades. the energy conversion objectives, and thus the control objectives, are different for each zone. for low wind speed, i.e. for v < v1, the main objective is to maximize the energy conversion yield of the system. in this partial load 1 zone, the system has to operate at cp(λ, β) = cp,max. the pitch angle β is then maintained constant at βopt and the rotational speed ωt is controlled to minimize the criterion δλ = λ − λopt, by acting only on the generator electromagnetic torque tg. for higher wind speed, corresponding to v1 < v < v2, the turbine rotational speed ωt is maintained at the nominal generator speed by acting on the electromagnetic torque tg. the pitch angle β is also maintained at βopt to maximize the energy conversion efficiency (partial load 2). for high wind speed, i.e. v > v2, the wind turbine operates in full load and the electric power produced pelec has to be regulated at the nominal generator power. the turbine rotational speed is maintained around the nominal generator speed and the pitch angle β is controlled in order to reduce the power coefficient cp(λ, β). the control system is then multivariable in this zone, because it acts on both the generator torque and the pitch angle. the evolution of the main variables as functions of wind speed is presented in fig.3.

figure 3: evolution of the main variables as functions of wind speed.

the control system has to operate over the full envelope of wind speeds. hence, transitions between these different operating zones have to be handled by the control system in a smooth manner, which avoids the generation of large transients.
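the three operating zones described above amount to a simple partition of the wind speed axis. the sketch below is schematic: the threshold speeds v1 and v2 are illustrative assumptions (the paper leaves them turbine-specific), and the returned labels only summarize each zone's objective.

```python
# a sketch of the three operating zones described above; v1, v2 and the
# setpoint comments are schematic assumptions, not the paper's values.

def operating_zone(v, v1=8.0, v2=13.0):
    """return the operating zone for effective wind speed v (m/s)."""
    if v < v1:
        # partial load 1: track lambda_opt via generator torque, beta = beta_opt
        return "partial load 1"
    elif v < v2:
        # partial load 2: hold nominal rotor speed via generator torque
        return "partial load 2"
    else:
        # full load: regulate electric power, acting on torque AND pitch
        return "full load"
```

in the paper this partition reappears in section 4 as the split of the scheduling parameter set p into the subsets p1, p2, p3, with the zone boundaries corresponding to v1 and v2.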
The other main control objective is to reduce the mechanical fatigue of the most expensive plant components, especially the blades, the drive train and the tower. Hence, the control system will be designed in order to reduce the variations of the drive train torsion torque $T_d$ and of the blade and tower flexion forces $F_b$ and $F_t$, expressed by:

$$T_d = d_d(\omega_t - \omega_g) + k_d(\theta_t - \theta_g)$$
$$F_b = d_b\,\dot{x}_b + k_b\,x_b$$
$$F_t = d_t\,\dot{x}_t + k_t\,x_t$$

with $d$ and $k$ the damper and spring coefficients of the corresponding components. The control system will be designed in order to optimize a trade-off between these objectives by minimizing a criterion expressed in the frequency domain. $H_2$-norm optimization reduces the average variations of the energy conversion parameter, i.e. $\Delta\lambda$, $\Delta\omega_t$ or $\Delta P_{elec}$, over the whole frequency range. $H_\infty$ cost minimization alleviates the maximum frequency response of the mechanical loads to a variation of the wind speed, which is generally the response at the resonance frequency of the component, and which is the most damaging.

4 LPV modeling

The LPV model can be considered as a group of local linear descriptions of a nonlinear system. Given the nonlinear system describing the wind turbine behavior

$$\dot{x} = f(x,u,w)$$
$$y = g(x,u,w) \quad (3)$$
$$z_2 = h_2(x,u,w)$$
$$z_\infty = h_\infty(x,u,w)$$

with $x$ the state of the system, $u$ the control input, $w$ the external disturbance, $y$ the measured output and $z_2$ and $z_\infty$ the performance outputs, the Jacobian linearization approach can be used to create an LPV system based on the first-order Taylor series expansion of the nonlinear model. A family of linear plants is obtained by linearizing the nonlinear system with respect to a set of equilibrium points located on the reference trajectory, which are parametrized by the scheduling parameter $\rho$ and satisfy $f(x_e(\rho), u_e(\rho), 0) = 0$.
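The Jacobian linearization step can be mimicked numerically. A sketch using central finite differences on a toy nonlinear system (a damped pendulum, not the wind turbine model; the helper name `jacobian` is ours):

```python
import numpy as np

def jacobian(f, x0, eps=1e-6):
    """Central-difference Jacobian of f: R^n -> R^m evaluated at x0."""
    x0 = np.asarray(x0, dtype=float)
    m = len(np.atleast_1d(f(x0)))
    J = np.zeros((m, len(x0)))
    for j in range(len(x0)):
        dx = np.zeros_like(x0)
        dx[j] = eps
        J[:, j] = (np.atleast_1d(f(x0 + dx)) - np.atleast_1d(f(x0 - dx))) / (2 * eps)
    return J

# toy nonlinear dynamics x_dot = f(x) with an equilibrium at the origin
f = lambda x: np.array([x[1], -np.sin(x[0]) - 0.5 * x[1]])
A = jacobian(f, [0.0, 0.0])  # linearization A = df/dx at the equilibrium
```

Repeating this at equilibrium points gridded along the reference trajectory yields the samples from which the parameter-dependent matrices of the LPV model are built.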
Corresponding to a specified family of equilibrium points, the family of linearized plants can be written in the following form:

$$\begin{pmatrix} \delta\dot{x} \\ \delta y \\ \delta z_2 \\ \delta z_\infty \end{pmatrix} = \begin{pmatrix} \frac{\partial f}{\partial x}\big|_e & \frac{\partial f}{\partial u}\big|_e & \frac{\partial f}{\partial w}\big|_e \\ \frac{\partial g}{\partial x}\big|_e & \frac{\partial g}{\partial u}\big|_e & \frac{\partial g}{\partial w}\big|_e \\ \frac{\partial h_2}{\partial x}\big|_e & \frac{\partial h_2}{\partial u}\big|_e & \frac{\partial h_2}{\partial w}\big|_e \\ \frac{\partial h_\infty}{\partial x}\big|_e & \frac{\partial h_\infty}{\partial u}\big|_e & \frac{\partial h_\infty}{\partial w}\big|_e \end{pmatrix} \begin{pmatrix} \delta x \\ \delta u \\ \delta w \end{pmatrix} \quad (4)$$

where the deviation variables are defined by $\delta x = x - x_e(\rho)$, $\delta u = u - u_e(\rho)$, $\delta y = y - y_e(\rho)$, $\delta w = w - w_e(\rho)$, and where $J|_e$ represents the value of the Jacobian coefficient $J$ at the equilibrium point $(x_e(\rho), u_e(\rho))$.

Before deriving the LPV model of the wind turbine, the scheduling parameters must be selected so that the appropriate equilibrium point can be located on the reference trajectory. Because the reference trajectory is decomposed into three parts, corresponding to the three operating areas, the parameter set $P = [\rho_{min}, \rho_{max}]$ is partitioned into three subsets $P_1 = [\rho_{1,min}, \rho_{1,max}]$, $P_2 = [\rho_{2,min}, \rho_{2,max}]$, $P_3 = [\rho_{3,min}, \rho_{3,max}]$, with $\rho_{1,max} = \rho_{2,min}$ and $\rho_{2,max} = \rho_{3,min}$ corresponding to $v_1$ and $v_2$, i.e. to the transitions between the different operating regions.

Hence, by calculating the Jacobian coefficients as functions of the scheduling parameter $\rho$ along the reference trajectory for each parameter subset $P_i$, and considering the wind turbine system configuration, the dynamic behavior of the model (4) for the wind turbine system is governed by the equation:

$$\begin{pmatrix} \delta\dot{x} \\ \delta y \\ \delta z_2 \\ \delta z_\infty \end{pmatrix} = \begin{pmatrix} A_i(\rho) & B_i & G_i \\ C_{y,i}(\rho) & 0 & 0 \\ C_{z_2,i}(\rho) & D_{z_2,i}(\rho) & 0 \\ C_{z_\infty,i}(\rho) & D_{z_\infty,i}(\rho) & 0 \end{pmatrix} \begin{pmatrix} \delta x \\ \delta u \\ \delta w \end{pmatrix}, \qquad i = \{1,2,3\}. \quad (5)$$

The Jacobian coefficients of the expressions of the aerodynamic torque and of the thrust force (2) are calculated along the reference trajectory and are approximated by affine functions of the scheduling parameter $\rho$. Hence the matrices $A_i(\rho)$ have the following form:

$$A_i(\rho) = A_{i,0} + \rho\,A_{i,1} \quad (6)$$

and are continuous over the whole parameter set $P$.
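The affine approximation (6) of the Jacobians along the reference trajectory amounts to an entrywise least-squares fit. A sketch, under the assumption that the Jacobians have been sampled at a few values of ρ (`affine_fit` is an illustrative helper, not from the paper):

```python
import numpy as np

def affine_fit(rhos, A_samples):
    """Fit A(rho) ~= A0 + rho * A1 (eq. (6)) to Jacobian matrices sampled
    along the reference trajectory, entrywise by least squares."""
    rhos = np.asarray(rhos, dtype=float)
    X = np.column_stack([np.ones_like(rhos), rhos])  # regressors [1, rho]
    S = np.stack(A_samples)                          # shape (N, n, n)
    coef, *_ = np.linalg.lstsq(X, S.reshape(len(rhos), -1), rcond=None)
    A0 = coef[0].reshape(S.shape[1:])
    A1 = coef[1].reshape(S.shape[1:])
    return A0, A1
```

The fit is done separately on each parameter subset $P_i$, so that the resulting family is affine per subset while remaining continuous over $P$.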
5 LPV control design

From the developed family of LPV models of the wind turbine, we aim at designing a family of LPV dynamic output-feedback controllers of the form:

$$\begin{pmatrix} \dot{x}_k \\ u \end{pmatrix} = \begin{pmatrix} A_{k,i}(\rho) & B_{k,i}(\rho) \\ C_{k,i}(\rho) & D_{k,i}(\rho) \end{pmatrix} \begin{pmatrix} x_k \\ y \end{pmatrix}, \qquad i = \{1,2,3\}. \quad (7)$$

The order of each controller is the same as the plant order, and the controller dynamics are allowed to be discontinuous at the boundaries of the subsets $P_i$. Hence one LPV controller is designed in each subset $P_i$ in order to optimize a set of performances, expressed as a multichannel $H_2/H_\infty$ criterion, corresponding to the control objectives of the operating area. The control synthesis is based on LMI optimization and on the construction of a continuous piecewise-affine Lyapunov function depending on the scheduling parameter $\rho$, which mimics the parameter dependence of the plant model (6). Hence, the parameter-dependent Lyapunov function is defined as:

$$V(x_{cl}, \rho) = x_{cl}^T\,P(\rho)\,x_{cl} \quad (8)$$

where $x_{cl}^T = \left(x^T \; x_k^T\right)$ and $P(\rho) = \sum_i \phi_i(\rho)\,P_i(\rho)$, with $\phi_i(\rho) = 1$ if $\rho \in P_i$ and $\phi_i(\rho) = 0$ otherwise. In order to apply the linearizing change of variables described in [9], the matrices $P_i(\rho)$ and $P_i^{-1}(\rho)$ are partitioned as:

$$P_i(\rho) = \begin{pmatrix} Y_i(\rho) & S_i(\rho) \\ S_i^T(\rho) & \bullet \end{pmatrix}, \qquad P_i^{-1}(\rho) = \begin{pmatrix} X & R \\ R^T & \bullet \end{pmatrix} \quad (9)$$

with $I - Y_i(\rho)X = S_i(\rho)R^T$. As explained in [4], the matrices $X$ and $R$ are constrained to be constant over the whole parameter set $P$ in order to obtain a family of controllers independent of the gain-scheduling parameter rate $\dot\rho$, which is not available in real time.
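The switched controller family (7) can be sketched as follows. The subset-selection convention and the forward-Euler discretization of the controller state are our own illustrative choices, not prescribed by the paper:

```python
import numpy as np

class SwitchingLPVController:
    """Evaluate a switched LPV controller of the form (7): one family
    (Ak, Bk, Ck, Dk) per parameter subset Pi, selected from the current
    scheduling value rho. Matrices are stored as callables of rho."""

    def __init__(self, boundaries, controllers):
        # boundaries: subset upper bounds [rho1_max, rho2_max];
        # controllers: one dict {"Ak","Bk","Ck","Dk"} of callables per subset
        self.boundaries = boundaries
        self.controllers = controllers

    def subset(self, rho):
        """Index i of the subset Pi containing rho."""
        for i, b in enumerate(self.boundaries):
            if rho < b:
                return i
        return len(self.boundaries)

    def step(self, xk, y, rho, dt):
        """One forward-Euler update of the controller state; returns (xk+, u)."""
        c = self.controllers[self.subset(rho)]
        xk_dot = c["Ak"](rho) @ xk + c["Bk"](rho) @ y
        u = c["Ck"](rho) @ xk + c["Dk"](rho) @ y
        return xk + dt * xk_dot, u
```

Allowing the dictionaries to differ per subset is exactly what makes the controller dynamics discontinuous at the boundaries of the $P_i$.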
Moreover, the matrices $Y_i(\rho)$ are affine in the parameter $\rho$ in each subset $P_i$:

$$Y_i(\rho) = Y_{i,0} + (\rho - \rho_{i,min})\,Y_{i,1} \quad (10)$$

and the function $Y(\rho) = \sum_i \phi_i(\rho)\,Y_i(\rho)$ is constrained to be continuous over the whole parameter set $P$:

$$Y_i(\rho) = Y_j(\rho) \qquad \forall \rho \in P_i \cap P_j. \quad (11)$$

By verifying that for the wind turbine model the triple $(A_i(\rho), B_i(\rho), C_{y,i}(\rho))$ is stabilizable and detectable for all $\rho \in P$, and by assuming that the scheduling parameter rate $\dot\rho$ is bounded in a set $[-\dot\rho_{max}, \dot\rho_{max}]$, the family of controllers can be calculated by applying the following theorem¹:

Theorem [9], [6]: Suppose there exist symmetric positive definite matrices $X$ and $Y_i(\rho)$ as in (10) and (11), matrices

$$\hat A_i(\rho) = \hat A_{i,0} + (\rho-\rho_{i,min})\hat A_{i,1} + (\rho-\rho_{i,min})^2 \hat A_{i,2}$$
$$\hat B_i(\rho) = \hat B_{i,0} + (\rho-\rho_{i,min})\hat B_{i,1} + (\rho-\rho_{i,min})^2 \hat B_{i,2}$$
$$\hat C_i(\rho) = \hat C_{i,0} + (\rho-\rho_{i,min})\hat C_{i,1}$$
$$\hat D_i(\rho) = \hat D_{i,0} + (\rho-\rho_{i,min})\hat D_{i,1}$$

symmetric positive semi-definite matrices $M_i = \begin{pmatrix} M_{1,i} & \star \\ M_{2,i} & M_{3,i} \end{pmatrix}$ and a symmetric matrix $Q$ satisfying the following matrix inequalities:

$$\begin{pmatrix} U + U^T & \star & \star & \star \\ \hat A_i + (A_i + B_i \hat D_i C_{y,i})^T & \dot\rho\,Y_{i,1} + V + V^T & \star & \star \\ G^T & G^T Y_i & -\gamma I & \star \\ C_{z_\infty} X + D_{z_\infty} \hat C_i & C_{z_\infty} + D_{z_\infty} \hat D_i C_{y,i} & 0 & -\gamma I \end{pmatrix} + (\rho - \rho_{i,min})^2 \begin{pmatrix} M_i & 0 \\ 0 & 0 \end{pmatrix} < 0$$

$$\begin{pmatrix} X & \star & \star \\ I & Y_i & \star \\ C_{z_2} X + D_{z_2}\hat C_i & C_{z_2} + D_{z_2}\hat D_i C_{y,i} & Q \end{pmatrix} > 0$$

$$\mathrm{tr}(Q) < \nu$$

$$\begin{pmatrix} M_{1,i} & \star \\ M_{2,i} + \hat A_{i,2} & M_{3,i} + Y_{i,1} A_{i,1} + \hat B_{i,2} C_{y,i} \end{pmatrix} > 0 \quad (12)$$

for all pairs $(\rho, \dot\rho) \in \{\rho_{i,min}, \rho_{i,max}\} \times \{\dot\rho_{i,min}, \dot\rho_{i,max}\}$, with $U = A_i X + B_i \hat C_i$ and $V = Y_i A_i + \hat B_i C_{y,i}$.

¹The dependence on $\rho$ is dropped for notational convenience.

Then the family of dynamic controllers (7) with, $\forall \rho \in P$:

$$D_{k,i} = \hat D_i$$
$$C_{k,i} = (\hat C_i - D_{k,i} C_{y,i} X)\,R^{-T}$$
$$B_{k,i} = S_i^{-1} (\hat B_i - Y_i B_i D_{k,i})$$
$$A_{k,i} = S_i^{-1} \left(\hat A_i - S_i B_{k,i} C_{y,i} X - Y_i B_i C_{k,i} R^T - Y_i (A_i + B_i D_{k,i} C_{y,i}) X\right) R^{-T}$$

where the matrices $R$ and $S_i$ verify for all $\rho \in P$ the relation $R S_i^T = I - X Y_i$, guarantees that:
• the elements of the family of closed-loop systems composed of (5) and (7) are asymptotically stable;
• the $H_\infty$-norm of the transfer $w \to z_\infty$ is less than $\sqrt{\gamma}$;
• the $H_2$-norm of the transfer $w \to z_2$ is less than $\sqrt{\nu}$.
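The controller reconstruction formulas of the theorem translate directly into matrix code. A sketch, assuming a single frozen value of ρ with all matrices already produced by the LMI solver (the function name and calling convention are ours):

```python
import numpy as np

def reconstruct_controller(Ahat, Bhat, Chat, Dhat, A, B, Cy, X, Y, R, S):
    """Recover (Ak, Bk, Ck, Dk) from the LMI decision variables via the
    linearizing change of variables of the theorem, at one frozen rho.
    R and S are assumed to satisfy R @ S.T == I - X @ Y."""
    Dk = Dhat
    Ck = (Chat - Dk @ Cy @ X) @ np.linalg.inv(R.T)
    Bk = np.linalg.inv(S) @ (Bhat - Y @ B @ Dk)
    Ak = np.linalg.inv(S) @ (
        Ahat - S @ Bk @ Cy @ X - Y @ B @ Ck @ R.T
        - Y @ (A + B @ Dk @ Cy) @ X
    ) @ np.linalg.inv(R.T)
    return Ak, Bk, Ck, Dk
```

A quick sanity check is that the reconstruction inverts the corresponding forward change of variables exactly, for any invertible $R$ and $S$.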
Note that the LMI problem formulated in this theorem has a finite number of inequalities, thanks to the introduction of the matrices $M_i$, and avoids the gridding phase which is commonly used for LPV synthesis and which is computationally very intensive. Moreover, in order to facilitate the implementation of the control system and to prevent it from having too fast dynamics, additional LMI constraints are formulated to place the poles of the closed-loop systems in a determined region of the complex plane. Placing the poles in a circle centered at the origin prevents the closed loop from exhibiting excessively fast or poorly damped fast poles. Hence, if the following LMIs are satisfied $\forall \rho \in \{\rho_{i,min}, \rho_{i,max}\}$ and for positive semi-definite matrices $N_i = \begin{pmatrix} N_{1,i} & \star \\ N_{2,i} & N_{3,i} \end{pmatrix}$:

$$\begin{pmatrix} -rX & \star & \star & \star \\ -rI & -rY_i & \star & \star \\ A_i X + B_i \hat C_i & \hat A_i & -rX & \star \\ A_i + B_i \hat D_i C_{y,i} & Y_i A_i + \hat B_i C_{y,i} & -rI & -rY_i \end{pmatrix} + (\rho - \rho_{i,min})^2 \begin{pmatrix} N_i & 0 \\ 0 & N_i \end{pmatrix} < 0$$

$$\begin{pmatrix} N_{1,i} & \star & \star & \star \\ N_{2,i} & N_{3,i} & \star & \star \\ 0 & \hat A_{i,2} & N_{1,i} & \star \\ 0 & \hat B_{i,2} C_{y,i} + Y_{i,1} A_{i,1} & N_{2,i} & N_{3,i} \end{pmatrix} > 0 \quad (13)$$

then the poles of the closed-loop systems composed of (5) and (7) are located in a circle of radius $r$ centered at the origin of the state space. Hence, an optimal family of dynamic controllers that minimize the $H_2$-norm of the transfer $w \to z_2$ for a given $H_\infty$-norm $\sqrt{\gamma}$ of the transfer $w \to z_\infty$ can be designed by solving the following convex LMI problem:

$$\min \nu \quad \text{subject to the LMIs (12) and (13)} \quad (14)$$

6 Wind turbine control system design and simulation

The wind turbine control system is designed from the family of developed LPV systems by calculating LPV controllers which optimize a trade-off between the energy conversion, i.e. the energy yield maximization in partial load and the output energy regulation in full load, and the reduction of mechanical loads.
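The effect of the pole-placement constraint (13) can be checked a posteriori on a closed-loop state matrix. A trivial sketch of that check (an eigenvalue test only, not the LMI itself):

```python
import numpy as np

def poles_in_disc(A_cl, r):
    """True if all eigenvalues of the closed-loop state matrix A_cl lie in
    the disc of radius r centred at the origin of the complex plane."""
    return bool(np.all(np.abs(np.linalg.eigvals(A_cl)) < r))
```

In the synthesis itself this condition is enforced for all admissible ρ through the LMIs (13); the eigenvalue test is only useful for validating a frozen-parameter closed loop after the fact.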
Hence the performance output $z_2$ represents the energy optimization variables $\Delta\lambda$, $\Delta\omega_t$ or $\Delta P_{elec}$, depending on the operating area and thus on the subset of the scheduling parameter, augmented by the corresponding weighting functions. In full load operation, and in order to reduce the pitch actuator fatigue, the pitch angle deviation $\Delta\beta$ is added to this vector. The performance output vector $z_\infty$ is composed of the mechanical forces affecting the plant structure, $F_t$, $F_b$ and $T_d$, augmented by the corresponding weighting functions. In order to reduce the vibrations of the mechanical components at frequencies above their resonance frequencies, the outputs $F_t$, $F_b$ and $T_d$ are multiplied by a high-pass filter, so as to penalize the high-frequency variations of these outputs during the controller synthesis. Conversely, the energy optimization variables are multiplied by a low-pass filter, in order to ensure good tracking of the energy conversion output at low frequency. Indeed, good tracking of these components at high frequency is not sought, because it would induce an increase of the dynamic loads on the actuators and on the mechanical structure.

The scheduling parameter is chosen to be $\rho = P_{elec} + k\beta^2$, $k > 0$, because it locates the operating point on the reference trajectory and is directly measured, contrary to the wind speed crossing the turbine rotor. Moreover, the bounds on the scheduling parameter rate $\dot\rho$ are derived from the actuator rate limitations. By solving the problem (14) numerically, three LPV controllers are designed, one for each operating area. The control applied to the plant actuators can therefore be discontinuous during controller switching. Because the switchings between partial load 2 and full load occur for $\beta = 0$, the pitch control is ensured to be continuous, whereas no continuity guarantee is provided for the generator torque.
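The scheduling parameter choice above, and the derivation of its rate bound from the actuator limits, can be sketched as follows. The value of `k` and the rate limits are illustrative, and the bound formula is our own straightforward differentiation of ρ = Pelec + kβ², not spelled out in the paper:

```python
def scheduling_parameter(p_elec, beta, k=1.0):
    """rho = Pelec + k*beta^2, k > 0: a directly measurable quantity that
    locates the operating point on the reference trajectory."""
    return p_elec + k * beta**2

def rho_rate_bound(dp_max, beta, dbeta_max, k=1.0):
    """Bound on |rho_dot| from actuator rate limits: differentiating rho
    gives rho_dot = Pelec_dot + 2*k*beta*beta_dot, hence
    |rho_dot| <= |Pelec_dot|_max + 2*k*|beta|*|beta_dot|_max."""
    return dp_max + 2.0 * k * abs(beta) * dbeta_max
```

The rate bound is what feeds the interval $[-\dot\rho_{max}, \dot\rho_{max}]$ assumed in the synthesis theorem.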
To ensure this continuity, a structural constraint on the controller matrices is added during the synthesis of the LPV controllers: the rows of the controller matrices $C_{k,i}$ and $D_{k,i}$ corresponding to the generator torque are constrained to be equal at the boundaries of the subsets $P_i$:

$$C_{k,i}^{(1)}(\rho) = C_{k,j}^{(1)}(\rho), \qquad D_{k,i}^{(1)}(\rho) = D_{k,j}^{(1)}(\rho) \qquad \forall \rho \in P_i \cap P_j.$$

The efficiency of the proposed controller is compared, on the basis of simulation results, with that of two other existing controllers for full load operation: a gain-scheduled PI-based controller and a multivariable gain-scheduled LQG controller. Indeed, this operating area is the most challenging from the viewpoint of control design, because the controller is multivariable, because the mechanical loads affecting the plant are the highest, and because the pitch action is really efficient in alleviating these loads with a carefully designed controller [2].

As mentioned in [2] and [3], the PI-based controller is designed by calculating controllers for two separate loops: first, a PI controller is tuned to guarantee power regulation from the generator speed measurement; then a tower speed feedback is designed to increase the tower fore-aft damping, by calculating a controller which appropriately filters the blade flap excitation and which does not interact with the bandwidth of the first loop. Unlike the proposed controller, this controller acts only on the pitch angle.

The gain-scheduled LQG controller is designed from the same augmented model and with the same performance outputs as the proposed controller. The LQG methodology permits the design of a multivariable controller which optimizes a time-domain quadratic criterion representing a trade-off between the different control objectives. Unlike the proposed LPV design methodology, the LQG design does not provide guarantees of stability and performance along the reference trajectory, but only at several operating points on this trajectory.
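The structural continuity constraint on the generator-torque rows stated at the beginning of this section can be verified numerically at a subset boundary. A sketch (the helper name and the matrix-valued-callable convention are ours):

```python
import numpy as np

def torque_row_continuous(Ck_i, Dk_i, Ck_j, Dk_j, rho_b, tol=1e-9):
    """Check that the first rows (generator-torque channel) of Ck and Dk of
    two adjacent controllers agree at the boundary value rho_b of their
    parameter subsets, as the structural constraint requires."""
    return (np.allclose(Ck_i(rho_b)[0], Ck_j(rho_b)[0], atol=tol)
            and np.allclose(Dk_i(rho_b)[0], Dk_j(rho_b)[0], atol=tol))
```

In the actual synthesis this equality is imposed as a constraint on the decision variables, so the check only serves as a post-design sanity test.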
Actually, one LQG controller has to be designed at each linearization point of the trajectory, and a gain-scheduling process has to be used to interpolate between the different LQG controllers. Moreover, and contrary to the proposed method, the LQG design does not provide any specification of performance in the frequency domain, such as $H_\infty$ or $H_2$ norms.

Figure 4: Temporal series of wind speed, generated electric power and pitch angle. LPV controller; PI controller; LQG controller.

Table 1: Equivalent load: ratio between the different controllers and the PI controller.

Controller      | Shaft | Tower | Blades
LPV controller  | 57.3% | 63.8% | 82.9%
LQG controller  | 59.0% | 73.4% | 87.8%
PI controller   | 100%  | 100%  | 100%

The simulations are based on a dynamic model of a three-blade 400 kW wind turbine implemented in MATLAB-Simulink, containing:
• nonlinear and stationary aerodynamics, with rotational effect disturbances due to wind shear and tower shadow;
• a flexible drive train model, a flexible tower model including the first fore-aft mode, and flexible blades in the flapwise direction;
• pitch actuator limitations on the pitch rate (±10°/s) and on the pitch amplitude (0°–30°).

The simulated wind speed respects the stochastic properties of the Van der Hoven spectrum, with high fluctuations. The controllers' performances are compared for both power regulation and alleviation of mechanical fatigue. These controllers have equivalent bandwidths and are tuned in order to guarantee a similar level of performance for power regulation.
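The fatigue comparison below rests on damage-equivalent loads computed from rainflow cycle counts. A minimal sketch of the standard equivalent-load formula; the Wöhler exponent value is illustrative, and in a real evaluation `cycle_loads` would come from a rainflow count of the simulated load time series:

```python
def equivalent_load(cycle_loads, m=10, n_eq=None):
    """Fatigue damage equivalent load from rainflow cycle amplitudes:
    L_eq = (sum(L_i^m) / N_eq)^(1/m), with Woehler exponent m
    (m around 10 is typical for glass-fibre blades, lower for steel)."""
    n_eq = n_eq or len(cycle_loads)
    return (sum(L**m for L in cycle_loads) / n_eq) ** (1.0 / m)
```

Comparing $L_{eq}$ ratios between controllers, as Table 1 does, removes the dependence on the absolute number of reference cycles $N_{eq}$.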
The evaluation of mechanical fatigue is provided by using the rainflow counting algorithm, which calculates the number of load cycles from time-domain simulation results, and the fatigue equivalent load for each component [10]. Temporal series of the wind speed, the produced power $P_{elec}$ and the pitch angle $\beta$ are presented in Fig. 4. The fatigue equivalent loads for the shaft, blades and tower are calculated from the simulations, and Table 1 presents the ratio between the equivalent loads obtained with the different controllers and those obtained with the PI controller.

The proposed LPV controller is seen to be more effective for the two selected control objectives, especially for the mechanical fatigue reduction of each component, and despite a pitch activity inferior to that caused by the LQG controller. A significant gain in the lifetime of the mechanical components of the plant, or a significant reduction of the mass, and consequently of the cost, of these components can then be expected.

Figure 5: Transitions between partial load 2 and full load.

The behavior of the plant with the proposed control system is also shown during the transition between partial load 2 and full load in Fig. 5, i.e. for wind speeds around the rated wind speed, which is about 12.2 m/s. The control system is seen to handle this transition efficiently.

7 Conclusion

In this paper, a hybrid controller composed of several LPV controllers has been proposed for the whole operating area of a variable speed, pitch regulated wind turbine.
This control system achieves the optimization of a trade-off between the energy conversion and the reduction of the mechanical loads over the whole envelope of wind speeds acting on the plant, taking into account the different energy conversion objectives of the different operating areas. Moreover, the employed LPV modeling, as well as the LMI formulation of the problem, provides a good framework for including additional constraints of the control problem during the synthesis, like actuator saturation, or for ensuring robustness to system parameter uncertainties.

References

[1] T. Burton, D. Sharpe, N. Jenkins, and E. Bossanyi, Wind Energy Handbook, John Wiley and Sons, 2001.
[2] E.A. Bossanyi, The design of closed loop controllers for wind turbines, Wind Energy, Vol. 3, 2001, pp. 149-163.
[3] W.E. Leithead, S. Dominguez and C.J. Spruce, Analysis of tower/blade interaction in the cancellation of the tower fore-aft mode via control, Proc. European Wind Energy Conference, London, UK, 2004.
[4] P. Apkarian and R.J. Adams, Advanced gain-scheduling techniques for uncertain systems, IEEE Transactions on Control Systems Technology, Vol. 6, pp. 21-32, 1997.
[5] F. Lescher, J. Zhao, and P. Borne, Robust gain scheduling controller for pitch regulated variable speed wind turbine, Studies in Informatics and Control, Vol. 14, pp. 299-315, 2005.
[6] S. Lim, Analysis and Control of Linear Parameter-Varying Systems, Ph.D. dissertation, Stanford University, 1999.
[7] B. Lu and F. Wu, Switching LPV control designs using multiple parameter-dependent Lyapunov functions, Automatica, Vol. 40, 2004, pp. 1973-1980.
[8] C. Nichita, D. Luca, B. Dakyo, and E. Ceanga, Large band simulation of the wind speed for real time wind turbine simulators, IEEE Transactions on Energy Conversion, Vol. 17, pp. 523-529, 2002.
[9] C. Scherer, P. Gahinet, and M. Chilali, Multiobjective output-feedback control via LMI optimization, IEEE Transactions on Automatic Control, Vol. 42, pp. 896-910, 1997.
[10] M. Matsuishi and T. Endo, Fatigue of metals subjected to varying stress, Japan Soc. Mech. Engrg., 1969.

F. Lescher and P. Borne
Ecole Centrale de Lille, LAGIS Laboratory
BP 48, Cité Scientifique, 59651 Villeneuve d'Ascq Cedex, France
E-mail: fabien.lescher@eigsi.fr

J.Y. Zhao
EIGSI La Rochelle, ERPA Laboratory
26 rue Vaux le Foletier, 17000 La Rochelle, France
E-mail: jing-yun.zhao@eigsi.fr

Received: November 2, 2006

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 768-774

Towards the Implementation of Computer-Aided Semiosis

A.E. Lascu, S.C. Negulescu, C. Butaci, V. Cret

Alina E. Lascu, Sorin C. Negulescu
Lucian Blaga University of Sibiu, Romania
E-mail: alina.lascu@ulbsibiu.ro, sorin.negulescu@ulbsibiu.ro

Casian Butaci, Vasile Creţ
Agora University, Oradea, and R&D Agora Ltd. Cercetare Dezvoltare Agora, Oradea, Romania
E-mail: casian12@yahoo.com, vcret@univagora.ro

Motto: "In the beginning was the Word" (Bible, John 1:1)

Abstract: Computer-aided semiosis (CAS) is a concept coined by a team of researchers a couple of years ago. Since it is a promising domain, due to the fact that it responds to the actual trans-cultural communication needs of the broad-band society, where the message behind the words often does not come through clearly, the subject ought to be inquired into in more detail, as promised in other papers by the same authors. This interesting idea was inspired by Eco's theory of communication, which states that the receiver "fills the message with significance"; hence it is vital for any communication and is strongly dependent on the cultures involved. In line with Eco's theory, the research in this area must be trans-disciplinary and anthropocentric.
With the intention of narrowing the existing gap between technological offers and user expectations, the macro-architectural feature is that translation will progress from textual, semantically correct, to multimodal, culturally adequate, based on common concepts and a "grammar" (rules to combine them into meaningful sentences); thus, this paper will present possible approaches towards the implementation of CAS. Given the fact that ontologies are considered to be the pillars of the Semantic Web, but also a key tool in implementing CAS, both will be a subject of this paper, in the light of finding an implementation solution. The paper is structured in five sections: the first presents the defining aspects of the concept, relating it to previous research; the second section deals with the CAS approach and architecture, followed by the state of the art regarding ontologies and their relation with the Semantic Web. Among the conclusions, one is already noticeable: CAS could not be possible without a trans-cultural ontology.

Keywords: computer-aided semiosis (CAS), human-computer interaction (HCI), ontologies, Semantic Web, interfaces.

1 Introduction

The key to effective online cross-cultural communication is a well-designed transcultural ontology that helps disambiguate between concepts that seem alike but whose meaning differs from culture to culture; this tool is intended to be developed in the following years by a team of young researchers, in order to substantiate and implement a new and innovative concept in the field of HCI: computer-aided semiosis (CAS).

Copyright © 2006-2010 by CCC Publications

The University of Colorado, USA, developed a study regarding cross-cultural communication strategy and came to the conclusion that intermediaries who are familiar with both cultures can often be helpful in cross-cultural communication situations.
They can translate both the substance and the manner of what is said. For instance, they can tone down strong statements that would be considered appropriate in one culture but not in another, before they are conveyed to people from a culture that does not talk together in such a strong way. They can also adjust the timing of what is said and done. Some cultures move quickly to the point; others talk about other things long enough to establish rapport or a relationship with the other person. If discussion on the primary topic begins too soon, the group that needs a "warm-up" first will feel uncomfortable. A mediator or intermediary who understands this can explain the problem and make appropriate procedural adjustments [4].

The results of this study can be applied in ICT as well, due to the fact that online cross-cultural communication could also use virtual intermediaries that have access to a transcultural ontology, thus assisting the user in grasping the right meaning of a certain message, i.e. in written (chat), spoken (voice) and/or visual form. In the light of the above, this paper will present possible approaches towards the implementation of CAS. Since ontologies are thought to be the pillars of the Semantic Web, but also an important tool in implementing CAS, both will be key subjects of this paper in the quest of finding an implementation solution. The paper is structured in five sections: the second presents the defining aspects of the concept, relating it to previous research; the third deals with the CAS rationale and approach, followed by the state of the art regarding ontologies and their relation with the Semantic Web.
Among the conclusions, one can already be easily remarked: at the online level, communication can be impaired not only by cultural differences but also by a wide range of other differences, such as race, age, sex, profession, religion or disabilities; in this regard, if the transcultural ontology proves its efficiency in disambiguating cultural concepts, other ontologies could consequently also be implemented for aiding the online communication process between different users.

2 Defining the concepts in line with history

According to modern paradigms, the goal of using ICTs is "obtaining a service from a huge palette of available ones" and the means is "interacting with an entity". The "entity" is either a human (e.g. when speaking via mobile phones) or a device (e.g. when buying travel documents via computers) [1]. Therefore there are three possible ways of communication, as described in [1]: a) "face to face"; b) "face to interface"; c) "interface to interface", in the near-future context of the Semantic Web, domain ontologies and, above all, the very agent metaphor. Thus, when users employ agents (in whatever domain of activity) they expect: a) personalization (agents act considering the specific momentary needs of their clients); b) authorization (agents act on behalf of their clients, within the limits stipulated by the hiring agreement); and c) competence. In short, the agent metaphor suggests that "I hire an agent when I do not have enough time or lack the competence to handle the problem myself" [1]. As regards interface agents, the emphasis is on the interface, entailing that the agent remains hidden (i.e., the users perceive just a "smarter functionality", no "pseudo-avatars" [10] intervening in a human-to-human dialogue) [1]. Considering these features, one can say that an agent, or more specifically an interface agent, can act like a smart mediator which has access to knowledge (i.e.
an ontology) and, by using it efficiently, "translates" to the user only the meaningful messages, saving a lot of time that would otherwise be wasted. In the 21st century time is money, so this is vital mostly, but not only, in the business sector. Online communication nowadays means more than a showy website and a newsletter. If new web-based technologies are joined with a society that is rapidly getting to think of online interactions as just as authentic as face-to-face ones, one has the possibility of radically easing communication even in difficult (but frequent) circumstances, like those involving cross-cultural interaction ([2], [6], [7], [8], [9], [10], [11], [12], [13]), as debated and probed in previous papers of the authors.

In our previous research we approached the subject of transcultural interfaces, which could be explained in human-to-human communication as a progression from textual (semantically correct) conversions to multimodal (culturally adequate) ones [2], based on the concept of CAS. Since at this research stage the experimental models presented in the related papers are not agent-oriented, the reference to "agents" is found only in [1] and regards conceptual aspects as well as future development. Thus far, the emphasis in our research was put on the transcultural, which represents the ability of people belonging to different cultures to communicate efficiently while preserving at the same time their cultural identity [2]. This concept will further on be related to ontology; together, i.e. as a transcultural ontology, they will embody the tool by which CAS could be validated as a possible new and challenging domain of HCI.

3 Rationale and approach

As suggested by the title, this paper intends to sketch what the onward study on the implementation/validation of CAS should be.
Since the prime motives for which we started this endeavour were already stated in the previous papers of this team, it is redundant to re-state them; instead, this paper is set to focus on the approach and methods. CAS was designed from an anthropocentric perspective, meaning to provide assistance (i.e. an interface agent which accesses a transcultural ontology, transferring to the user only meaningful messages), lessening the linguistic hurdles (such as the traduttore-traditore effect), the logocratic pressure of (spoken or written) text, response time criticality, as well as the danger of distortions and noise, via a major upgrade in communication granularity: (one) idea instead of (many) words [9].

The ongoing study should be approached from a trans-disciplinary perspective, with respect to both humanists (i.e. linguists, psychologists) and technologists (i.e. ontology and interface designers and so on). When creating the ontology, the designer should bear in mind the way users and agents may "think": how a user creates meaning from a piece of an image or from some signs (i.e. words), and how an interface agent, on the contrary, does the same job effortlessly. Though, does an agent depict the meaning as accurately as a human does? Or maybe detecting the true meaning of a message is even more endangered by the human's scrambled mind, which is very much contextual, in opposition to an interface agent, which will act and respond based only on the given ontology and some very specific rules, so that the probability of failing to give the expected meaning will be smaller. These are all questions which cannot be answered at this point; some answers will be empirically uncovered, some may prove to be exactly contradictory to those thought in the first place. After all, this is what we academically call exploratory research. The approach will be adapted step by step, based on further findings.

4 Ontology. State of the art

People are able to use the Web in order to complete tasks such as finding the Icelandic word for "alphabet", reserving a plane ticket, or searching for a low price for an e-book. Still, computers cannot perform the same tasks without human direction, because web pages are designed to be read by people, not machines. The Semantic Web is a vision of information that is understandable by computers, so that they can perform more of the dull work involved in finding, sharing, and combining information on the Web [3]. The vision of Tim Berners-Lee regarding the Semantic Web [3]: "I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people and computers. A 'Semantic Web', which should make this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The 'intelligent agents' people have touted for ages will finally materialize." This vision is related in this study with Eco's semiosis theory (which states that the receiver "fills the message with significance" [5]), meaning that CAS could be implemented on the same features as the Semantic Web, but using instead a dedicated/personalized ontology on which to base the "translation process".

An ontology is a formal representation of a set of concepts within a domain and of the relationships between those concepts [17]. The ontology envisioned by this research team is based on the idea of the Maya script (a logographic type of script which used both logograms and syllabic characters [16]), replacing, where possible, the words with images, as in the catchphrase "a picture is worth a thousand words".
Figure 1 illustrates the way the correspondence between a word-based ontology and an image-based ontology can be created; of course, the example given in this paper is very much simplified. The demarcation line was traced in order to separate the abstract and the concrete synonyms of the word apple. On one side of the line one can easily remark the every-day interpretations of the word apple, which can without any doubt be recognized by anyone. On the other side, the other interpretations depict more abstract representations of the word apple, which require a higher level of knowledge in order to be grasped, i.e. the apple polisher, the temptation (religiously), the Wilhelm Tell representation (historically), Newton's gravitation theory (physical sciences), "an apple a day keeps the doctor away" (health idiom); the point is that the ontology must be created trans-disciplinarily and trans-culturally, in order to gather all the possible meanings and definitions of a word/concept. Disambiguation can continue further on, and the figure presented above can be expanded with other concepts and relations between them. An example at hand for expanding the ontology would be adding other characteristics of the fruit apple, such as variety name (e.g. Granny Smith, Pink Lady, Red Delicious, Golden Delicious and so on), colour (e.g. red, green, pink, yellow) or taste (e.g. sour, sweet, etc.). Also, in order to disambiguate Apple's logo, a separation can be made between the old and the new logos. An important disambiguation which must be made in the first place is between the English word apple and the Swedish word äpple, which happen to mean the same thing but belong to different cultures, which can later on generate other misunderstanding problems.
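The apple example above can be sketched as a toy ontology fragment. All entries, and the two-level concrete/abstract split, are illustrative only, not the authors' actual data model:

```python
# A toy fragment of a transcultural ontology around the concept "apple",
# following the word-image correspondence idea of Figure 1.
ONTOLOGY = {
    "apple": {
        "concrete": {  # every-day senses, recognizable by anyone
            "fruit": {"variety": ["Granny Smith", "Pink Lady", "Red Delicious"],
                      "colour": ["red", "green", "yellow"],
                      "taste": ["sour", "sweet"]},
            "company-logo": ["old logo", "new logo"],
        },
        "abstract": {  # senses requiring cultural or domain knowledge
            "religion": "temptation",
            "history": "Wilhelm Tell",
            "physics": "Newton's gravitation",
            "health-idiom": "an apple a day keeps the doctor away",
        },
        "lexical": {"en": "apple", "sv": "äpple"},  # cross-language forms
    }
}

def senses(concept, level):
    """List the senses of a concept recorded at a given abstraction level."""
    return sorted(ONTOLOGY[concept][level])
```

A real transcultural ontology would of course use a proper ontology language (e.g. OWL) and attach images or animations to each sense, but even this toy structure shows how disambiguation queries reduce to lookups over levels and cultures.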
Briefly, the implementation of CAS will follow this ontology framework, where the icon position has a syntactic role (in line with the ontology rules) and a semantic role for CAS (to reduce the differences between "intentio auctoris" and "intentio lectoris") [2]. This kind of ontology based on visual rules can be further enhanced by using animations instead of images. Considering this idea, the role of the interface agent will be to "translate". In German, the verb "to translate" is "übersetzen"; in its most basic visualization, the German word means "to carry something from one side of the river to the other side of the river". By contrast, the English word "to translate" does not immediately evoke the same image in the mind. In German, one can say "mit der Fähre den Fluss übersetzen", which would literally mean "to translate the river on a ferry boat". In any case, the translator, so to name it, must take into consideration both the culture from which and the culture into which the translation is being made; one must know that word-for-word translation does not apply, especially when a more subtle, culturally dependent message must be transmitted. Therefore the visualisation of the word "übersetzen" (to move from one side of the river to the other) leads to several important insights into the nature of translation. In this context, the role of the agent is to carry something across the river, whether it is from here to there or from there to here.

Figure 1: Example of architecture for CAS
The cargo can have a multiplicity of shapes: the description of a technological object, a cultural or historical phenomenon, a poetic image, a metaphorical expression, or a human emotion, to name only a few. The parameters of each word are quite fragile; no two people will take the exact same meaning from a word [14], so the interface agent will have the hardest job ever: to extract the meaning of a message taking into consideration the transcultural ontology, in line with the cultural background of both users involved in the communication process.
5 Conclusions and future work

In order to validate the CAS concept as a possible new subdomain of HCI, research has to focus from now on on the implementation, by creating a transcultural ontology on the framework presented in the previous section, involving in this process preferably a trans-disciplinary team of researchers (linguists, psychologists, anthropologists, designers, programmers and so on), and keeping in mind the fact that an ontology deals with the nature of existence, so it is too vast a domain to be approached only by a team of enthusiastic researchers willing to take on the burden of exploratory research. Future work will consider refining the ontology framework, with the objectives hopefully fulfilled within a three-year time span (during the PhD studies of the first author), involving European teams of researchers interested in this kind of project.

Acknowledgements: This research is fully supported by the POSDRU/88/1.5/S/60370 project, which is co-financed by the European Social Fund through the Sectoral Operational Programme for Human Resources Development 2007-2013. We thank Mr. Cătălin Boaru from Apple IMC Romania for kindly granting us the permission to use the Apple logo.

Copyright notice: Apple and the Apple logo are trademarks of Apple Computer, Inc., registered in the U.S. and other countries. Disclaimer: this article is an independent publication and has not been authorized, sponsored, or otherwise approved by Apple Computer, Inc. Images were taken from public-domain graphic sites with their own disclaimers, and a very few were taken from sites that contained no copyright information or terms of use.

Bibliography

[1] Bărbat, B.E. (2009). Interface agents for transcultural communication: a framework. In Sapio, B. (Ed.), The Good, the Bad and the Challenging. The User and the Future of Information and Communication Technologies (pp. 666-675). Copenhagen: COST Action 298.
[2] Bărbat, B.E., Negulescu, S.C., Lascu, A.E., Popa, E.M. (2007). Computer-aided semiosis. Threads, trends, threats. In Mastorakis, N.E. (Ed.), Proc. of the 11th WSEAS International Conference on Computers (pp. 269-274). Agios Nikolaos, Crete: ICCOMP '07.

[3] Berners-Lee, T. (1999). Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by Its Inventor. New York: HarperOne.

[4] Conflict Research Consortium (1998). Cross-cultural communication strategies. Retrieved 2010, from University of Colorado, USA. Web site: http://www.colorado.edu/conflict/peace/treatment/xcolcomm.htm.

[5] Eco, U. (2005). The Limits of Interpretation. Bloomington, USA: Indiana University Press.

[6] Georgescu, A.V., Lascu, A.E., Bărbat, B.E. (2008). Protensity in agent-oriented systems. Role, paths, and examples. Int. J. of Computers, Communications & Control, III, 304-309.

[7] Lascu, A.E., Fabian, R. (2007). E-semiotics for Romanian-German trans-cultural interfaces. In Sapio, B. et al (Eds.), The Good, the Bad and the Unexpected: The User and the Future of Information and Communication Technologies (pp. on CD). Moscow, Russian Federation: COST Action 298 Participation in the Broadband Society.

[8] Lascu, A.E., Georgescu, A.V. (2009). From extensity to protensity in CAS: adding sounds to icons. In Esposito, A. et al (Eds.), Multimodal Signals: Cognitive and Algorithmic Issues (pp. 130-137). Vietri sul Mare, Italy: Springer.

[9] Lascu, A.E., Moisil, I., Negulescu, S.C. (2009). Computer-aided semiosis mirrored in creolization. Rationale and approach. International Journal of Advanced Statistics and IT&C for Economics and Life Sciences, 1(1), 38-43.

[10] Lascu, A.E., Negulescu, S.C., Cioca, M., Zerbes, M.V. (2009). Interface agents as virtual tutors. Conference Proceedings of Balkan Region Conference on Engineering and Business Education & International Conference on Engineering and Business Education, 2, 626-629.
[11] Lascu, A.E., Negulescu, S.C., Kifor, C.V. (2009). Different time perception in creolization mirrored in transcultural interface for "immediate". In Sapio, B. et al (Eds.), The Good, the Bad and the Challenging. The User and the Future of Information and Communication Technologies (pp. 661-666). Copenhagen, Denmark: COST Action 298 Participation in the Broadband Society.

[12] Negulescu, S.C., Lascu, A.E., Oprean, C. (2009). Cultural differences in decision-making. A transcultural interface for gambler's fallacy. In Sapio, B. et al (Eds.), Future of Information and Communication Technologies (pp. 656-661). Copenhagen, Denmark: COST Action 298 Participation in the Broadband Society.

[13] Prundurel, A., Negulescu, S.C., Lascu, A.E. (2007). Mini-ontology for trans-cultural interfaces. In Sapio, B. et al (Eds.), The Good, the Bad and the Unexpected: The User and the Future of Information and Communication Technologies (pp. on CD). Moscow, Russian Federation: COST Action 298 Participation in the Broadband Society.

[14] Rainer Schulte (1999). The translator as mediator between cultures. Retrieved 11.2009, from Translation Studies. Web site: http://translation.utdallas.edu/translationstudies/mediator_essay1.html.

[15] Russell, S., Norvig, P. (2003). Artificial Intelligence: A Modern Approach. NJ: Prentice Hall.

[16] Simon Ager (2009). Mayan script. Retrieved 2010, from Omniglot, Writing Systems and Languages of the World. Web site: http://www.omniglot.com/writing/mayan.htm.

[17] Wikipedia (2009). Ontology (information science). Retrieved 11.2009, from Wikipedia. Web site: http://en.wikipedia.org/wiki/Ontology_(information_science).

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 3, pp.
270-280

Disassembly Line Scheduling with Genetic Algorithms

Luminiţa Duţă, Florin Gheorghe Filip, Jean-Michel Henrioud, Ciprian Popescu

Abstract: Disassembly is part of demanufacturing and is meant to obtain components and materials from end-of-life products. An essential performance objective of a disassembly process is the benefit it brings, that is, the revenue brought by the retrieved parts and materials, diminished by the cost of their retrieval operations. A decision must be taken to balance an automatic disassembly line: a well-balanced line will decrease the cost of disassembly operations. An evolutionary (genetic) algorithm is used to deal with the multi-criteria optimization problem of disassembly scheduling.

Keywords: control, scheduling algorithms, evolutionary programming, genetic algorithms

1 Introduction

In the second half of the 20th century, people and governments started to become aware that the "take-make-waste" system which resulted from the industrial revolution is not sustainable. The assumptions of infinite "sources" of raw materials and "sinks" to absorb industrial and domestic wastes could not hold any longer, because of the exponential growth of the world population and the accelerated spread of industry and consumption. In their inspiring book on system engineering, Blanchard and Fabrycky [1, p. 555] state that "green manufacturing should be an objective adopted by producers to reduce the environmental impact of their products and production operation continuously. Real environmental improvement requires a system life-cycle approach to guide design decisions and operational policies". Several driving factors, such as: a) competitive differentiation, b) customer consciousness, c) legal regulations, and d) the fight to improve profitability, have led to this new way of thinking and even to new standards, such as the ISO 14000 series.
A new science of sustainability, called "industrial ecology", was born [2, Frasch], [3, Graedel, Allenby]. Within it, research directions such as "design for environment" (with a proactive character) and "environmental management" (a reactive, remedial approach) compose a new design trend called "environmentally conscious design and manufacturing" [1, Blanchard and Fabrycky, p. 558]. At present, both designers and manufacturers show an increasing tendency to consider the entire product and system life-cycle, starting with perceiving customer needs, and continuing with design and development, manufacturing, product utilization, maintenance, phase-out, and disposal. In this context, the term "remanufacturing" includes the set of planning and processing activities (such as checking in, disassembling, cleaning, inspection and sorting, reconditioning, reassembling, testing and packaging) for recycling obsolete products. The first steps of the remanufacturing process can be grouped under the name "demanufacturing". A new demanufacturing industry has shown up to provide new uses for expensive modules, components and materials and, at the same time, to prevent excessive waste. Industrial producers also tend to add demanufacturing departments to existing manufacturing facilities. Sometimes chains of firms show up to implement new paradigms such as the "extended", "networked", and "virtual" enterprise [3, Filip, Bărbat]. This paper aims at proposing the usage of genetic algorithms for the optimal scheduling of disassembly lines. The remaining part of this paper is organized as follows. First, the main concepts and research directions in modelling, optimization and control of disassembly processes are reviewed. Then, the optimal scheduling problem of a disassembly line is formulated. A short description of genetic algorithms is given next.
Experimental results on solving the optimal disassembly problem by a genetic algorithm are given before presenting the paper conclusions.

Copyright © 2006-2008 by CCC Publications

2 Disassembly: a main stage in recycling

Several types of recycling of end-of-life manufactured goods are possible, such as: a) reuse of components and modules of high value, b) refurbishing and partial reconstruction of a good returned from the market, c) recovery of the raw material or energy by incineration and melting. Disposal, which is the last solution to resort to, is chosen only if the other alternatives are not possible. The choice between these different types determines the recycling process and allows the end-of-life destinations for the components of manufactured goods to be defined. Different recycling loops are different approaches to the process. The simplest approach is that of dismantling a product. By applying dismantling operations, a discarded good can be broken down faster and at low cost; in this case more pure fractions can be obtained with less effort. This simple approach exploits only the value of the raw material. It does not take into account the functional value of the product or of its components. Regaining the functional value needs a recycling process that minimizes the destructive effects on the product. This means reusing, refurbishing and capitalizing the components of the used product in order to remanufacture a new one. Remanufacturing is a superior form of reuse, since its objective is to maximize the value of repaired parts and to minimize the disposal quantity. Central to remanufacturing is the disassembly process, which decomposes a product into parts and/or subassemblies. Disassembly is a non-destructive technique and implies the extraction of the desired components and/or materials.
If the parts are not reusable after reconditioning, partial or total destructive operations are to be applied: drilling, cutting, wrenching, and shearing. These techniques are used in view of material or energy recovery. There are several research directions related to disassembly processes, such as modeling, planning, and control. Chapter 16, "Design for produceability and disposability", of the book on system engineering by Blanchard and Fabrycky [1] contains an excellent introduction to product and system design to facilitate remanufacturing. Since disassembly processes can be viewed as discrete, event-driven systems [5, Cassandras, Lafortune], Petri nets can be a natural solution, and there are many reported results. For example, Moore, Gungor and Gupta [6] propose disassembly Petri nets to take into account operation precedence constraints in planning applications. Penev and Ron [7] proposed disassembly graphs. Kuo, Zhang and Huang [8] describe disassembly trees, which associate with each branch the direction of the disassembly operation, by adapting the assembly trees proposed by Henrioud [9]. The usage of the object-oriented Petri nets proposed by Lakos [10] to model disassembly processes is analysed by Duta, Filip and Henrioud [11]. For balancing the operation of the disassembly line, Duta, Filip and Henrioud [12] utilize the equal-piles method proposed by Rekiek [13]. To model products associated with incomplete, imprecise and sometimes wrong information, fuzzy reasoning Petri nets are proposed by Gao and Zhou [14], to make real-time disassembly scheduling decisions possible. A review of state-of-the-art implementations of control structures for disassembly lines is given by Duta and Filip [15].

3 Problem formulation

Disassembly of manufactured products induces both disassembly costs and revenues from the parts saved by the process.
Thus, at the planning stage a good trade-off has to be found, which depends both on the "depth" of the disassembly and on the sequence of operations. The optimization of the ratio between gain and cost can be accomplished by using an appropriate distribution of the disassembly tasks on workstations, i.e. an assignment that provides a maximal value of the total profit. The optimization problem depends upon the structure of the disassembly system: if it is made up of a single workstation, the costs depend mainly upon the process duration; if the system is a line, the costs depend mainly upon the line balancing, all the more if it is highly manual. Another problem that occurs during a disassembly process is how deep the disassembly sequence must go so as to maximize the outcome of this process. In [11, Duta, Filip and Henrioud] and [16, Duta, Henrioud and Caciula] it was shown that an incomplete disassembly sequence can be more profitable than a complete one. Destructive and dismantling operations have to be taken into consideration as well. Hence, we have to deal with a multi-criteria optimization problem of a disassembly process: maximizing the benefit it brings, deciding how deep the disassembly sequence can go, and minimizing the costs using an optimal scheduling along the line. A scheduling decision upon many criteria is an NP-hard problem [17, Filip]. Stochastic algorithms have already been used for such a multi-criteria optimization problem in [18, Minzu and Henrioud]. In this paper we consider that the line structure is given and propose an algorithm which allows finding a disassembly sequence, and its assignment on workstations, that optimizes a very simple function integrating the income from the parts and the cycle time of the disassembly line.
In this paper we address the case of disassembly lines where the cycle time is not merely the sum of all operative and logistic times, but also depends strongly upon the line balancing. The objective is to find the most profitable disassembly sequence, taking into account, on the one hand, the end-of-life options for each part or subassembly of a given product and, on the other hand, the operational times for a given assignment of the tasks on the disassembly workstations. A cost function which combines both disassembly costs and revenues was proposed in [19, Duta, Filip and Henrioud]:

F = R / Tcy    (1)

where R is the revenue associated with the disassembled parts and Tcy is the cycle time. The global revenue is the sum of the partial revenues obtained according to the end-of-life destinations of the disassembled parts. These partial revenues are established by experts after repeated disassembly processes:

R = Σk Rk , k = 1..NC    (2)

where NC is the number of final components or the number of subassemblies obtained after the disassembly process. The cycle time can be defined as the operational time of the slowest workstation on the line:

Tcy = max_Wi Σ_{j ∈ tasks on Wi} tj    (3)

where Wi is workstation i, j is one disassembly operation and tj its operational time. We make the following assumptions:
• the disassembly line is linear (flow-shop type);
• the end-of-life revenues of the subassemblies are known;
• operational costs are included in the final incomes;
• the criterion of maximizing the outcome depending on the success rate of the disassembly operations has been taken into consideration;
• the failure of the disassembly process is an event that can also occur, since certain parts of the product could be deformed and impossible to separate without destruction;
• the disassembly line is not starving; it works in a continuous flow.
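Equations (1)-(3) can be evaluated numerically as in the following sketch; the function names and the toy data are illustrative assumptions, not the paper's implementation. Each workstation's load is summed, the slowest load gives the cycle time, and the fitness is the revenue-to-cycle-time ratio.

```python
# Sketch of the objective F = R / Tcy (equations (1)-(3)): the cycle time
# is the total operational time of the most loaded workstation, and the
# fitness divides the summed end-of-life revenues by that cycle time.

def cycle_time(assignment, times):
    """assignment: one list of task indices per workstation (eq. (3))."""
    return max(sum(times[j] for j in station) for station in assignment)

def fitness(assignment, times, revenues):
    """F = R / Tcy, with R the sum of partial revenues (eqs. (1)-(2))."""
    return sum(revenues) / cycle_time(assignment, times)
```

For instance, with times `[0.92, 0.07, 0.07, 0.12]` and tasks 0-1 on one station and 2-3 on another, the cycle time is 0.99, the load of the slower station.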
Evaluating the function in equation (1) gives the profit per time unit, which is an important indicator of the productivity of the disassembly system. This function also takes into account the value of the cycle time obtained for a well-balanced line. The optimization can be made both for manual and for automatic disassembly lines.

4 Genetic algorithms

Genetic algorithms are optimization solvers used in many areas due to their capacity to reduce the combinatorial complexity of NP-complete problems. They do not give the global optimal solution, but a locally optimal one, by exploiting a defined search space. A genetic algorithm starts with a set of randomly generated possible solutions called the initial population. Each member of a population is encoded as a chromosome. Chromosomes are represented by a combination of numbers or characters which contain information about the solution. A score named the fitness coefficient is assigned to each chromosome, based on the viability of the solution. Chromosomes with high scores are chosen as parents to create a new population, the objective being to obtain children with better scores. To avoid uniformity of the population and to enlarge the search space, at each step of creation two processes may occur, namely crossover and mutation. Crossover combines the features of two or more parents into one child chromosome. Mutation generates a child similar to its parent, with one or more genes altered. These operations ensure the diversity of the newly generated population [20, Goldenberg]. Once the initial population is established and the three types of operations (reproduction, crossover, mutation) are defined, a genetic algorithm produces new members of the population until a stop condition is fulfilled. Usually this criterion is given by a maximal/minimal value of the objective function obtained after a number of iterations of the genetic algorithm.
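The selection/crossover/mutation loop described above can be sketched generically as follows; the bit-string encoding, the parameter values and the toy "count the 1-bits" fitness are illustrative stand-ins, not the task-assignment chromosome used in the paper.

```python
import random

# Generic genetic-algorithm sketch: elitist selection keeps the fittest
# half, one-point crossover mixes two parents, and mutation flips genes
# with a small probability, preserving population diversity.

def genetic_algorithm(fitness, length=16, pop_size=20, generations=60,
                      p_mut=0.05, rng=None):
    rng = rng or random.Random(0)
    pop = [[rng.randint(0, 1) for _ in range(length)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[:pop_size // 2]            # reproduction: fittest half
        children = []
        while len(children) < pop_size - len(parents):
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, length)       # one-point crossover
            child = a[:cut] + b[cut:]
            for i in range(length):              # mutation
                if rng.random() < p_mut:
                    child[i] = 1 - child[i]
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)

best = genetic_algorithm(sum)   # toy objective: maximize the number of 1-bits
```

The stop condition here is simply a fixed number of generations, matching the iteration-count criterion mentioned above.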
5 Example

To test the genetic algorithm, consider the example of the disassembly of a Motorola radio set described by Salomonski and Zussman [21]. The corresponding disassembly Petri net is given in Figure 1. The disassembly process is represented by four distinct possible sequences of transitions: {t1 → t3 → t5 → t7}, {t1 → t3 → t6 → t8}, {t2 → t4 → t5 → t7} and {t2 → t4 → t6 → t8}. To take into account additional destructive disassembly methods, the corresponding Petri net contains three alternative tasks, represented by the transitions {t1,1, t4,1, t6,1}. For the present study we considered that there are three workstations on the line. An alternative destructive task is done only on a station that can perform both destructive and non-destructive operations; we say that this kind of workstation is a "mixed" one. Thus, workstations 2 and 3 are considered mixed. In other words, when a task is moved from one station to another, the type of the operation is changed together with its operational time. We supposed that the tasks t1 and t4 can be performed in a non-destructive way on the first workstation and in a destructive way on the second one. For the task t6, a destructive disassembly is done on the third station and a non-destructive disassembly is performed on the second workstation. According to the Petri net given in Figure 1, the operational times for the initial partition of the tasks on the workstations are:

Ti = {0.92, 0.07, 0.07, 0.12, 0.95, 0.75, 0.75, 0.95}

Taking into consideration the times for the alternative destructive operations, the set above becomes:

Tf = {0.95, 0.07, 0.07, 0.15, 0.95, 0.90, 0.75, 0.95}

Figure 1: Disassembly Petri net of a radio set

The final revenues are calculated with the method proposed in [11, Duta, Filip and Henrioud], using the data from the Petri net given in Figure 1.
The corresponding sets of revenue values are:

Ri = {−1.36, 0.27, 0.54, 0.54, 0.62, 0.67, 2.75, 2.75}
Rf = {−1.32, 0.27, 0.54, 0.50, 0.62, 0.60, 2.75, 2.75}

The objective is to maximize the value of the function in equation (1), by finding the sequence that maximizes the final revenue while at the same time ensuring a well-balanced disassembly line (i.e. minimizing the cycle time). In our problem, a chromosome is represented by the possible task-assignment matrix S, whose elements are:

sij = 1 if task j can be assigned to workstation i, and sij = 0 otherwise, for i = 1..n and j = 1..m

(n is the number of workstations and m the number of disassembly operations). A matrix S can be a solution of our optimization problem only if it satisfies the following constraints:

1. the non-divisibility constraint, which does not allow a task to be assigned to more than one station:

sij ∈ {0, 1}    (4)

2. the assignment constraint, which requires that each task be assigned to exactly one station:

Σi sij = 1    (5)

3. the precedence constraint, which invokes the technological order, so that if task i is to be done before task j, task i cannot be assigned to a later workstation than task j.

A New Linear Classifier Based on Combining Supervised and Unsupervised Techniques

yi g(xi) > 0, ∀ (xi, yi) ∈ S. Denoting by w = (w1, . . . , wd)T the vector whose entries are the coefficients of g, we say that S is separated without errors by the hyperplane

Hw,b : wT u + b = 0 ,    (3)

and Hw,b is called a solution of the separating problem, because all examples coming from the class of label 1 belong to the positive semi-space, and all examples coming from the class of label −1 belong to the negative semi-space defined by Hw,b. Obviously, a hyperplane is a solution of the separating problem if the functional margin min { yi (wT xi + b) , 1 ≤ i ≤ n } > 0.
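The functional-margin criterion above can be computed directly, as in the following sketch; the helper name and the toy two-dimensional sample are illustrative assumptions.

```python
# Sketch of the separation test: a hyperplane (w, b) separates the labeled
# sample without errors iff the functional margin
# min_i y_i (w^T x_i + b) is strictly positive.

def functional_margin(w, b, X, y):
    return min(yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
               for xi, yi in zip(X, y))

# Toy linearly separable sample: positives above the line x1 + x2 = 0.
X = [(2.0, 2.0), (3.0, 3.0), (-1.0, -1.0), (-2.0, -2.0)]
y = [1, 1, -1, -1]
```

With w = (1, 1) and b = 0 the margin is positive, so this hyperplane is a solution of the separating problem for the toy sample.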
In an SVM-based approach, by imposing that the functional margin is 1, the search for a solution leads to a constrained quadratic programming problem on the objective function φ(w) = (1/2)∥w∥²:

min φ(w) subject to yi (wT xi + b) ≥ 1 , i = 1..n .    (4)

If w∗ is a solution of (4), then Hw∗,b∗ is called an optimal separating hyperplane. The computation of w∗ and b∗ is carried out using the SVM1 algorithm.

Algorithm SVM1 ([9])
Input: the learning sequence S.
Step 1. Compute the matrix D = (dik) of entries dik = yi yk (xi)T xk , i, k = 1..n.
Step 2. Solve the constrained optimization problem

α∗ = arg max_{α ∈ Rn} ( αT 1 − (1/2) αT D α ) , αi ≥ 0 , ∀ 1 ≤ i ≤ n , Σ_{i=1..n} αi yi = 0 .    (5)

If α∗i > 0 then xi is called a support vector.
Step 3. Select two support vectors xr, xs such that α∗r > 0 , α∗s > 0 , yr = −1 , ys = 1.
Step 4. Compute the parameters w∗, b∗ of the optimal separating hyperplane, and the width of the separating area ρ(w∗, b∗):

w∗ = Σ_{i=1..n} α∗i yi xi , b∗ = −(1/2) (w∗)T (xr + xs) , ρ(w∗, b∗) = 2 / ∥w∗∥    (6)

Output: w∗, b∗, ρ(w∗, b∗).

A linearly separable sample is represented in Figure 1a. The straight lines d1, d2, d3 and d4 are solutions of the separating problem for S; d4 corresponds to the optimal separating hyperplane. The examples placed at the minimum distance from the optimal separating hyperplane are the support vectors. In the case of non-linearly separable samples, the idea is to determine a separating hyperplane that minimizes the number of misclassified examples. The problem of finding an optimal hyperplane for non-linearly separable samples has been approached in several ways. The approach introduced by Cortes and Vapnik ([3]) uses the error function

Φσ(ξ) = Σ_{i=1..n} ξi^σ ,    (7)

where the slack variables ξi , 1 ≤ i ≤ n, are taken as indicators of the classification errors (see Figure 1b), and σ is a positive real number.
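For the common choice of a linear f and σ = 1, the error term (7) combines with the margin term into the objective (1/2)∥w∥² + C Σ max(0, 1 − yi(wT xi + b)). As an illustrative alternative to solving the quadratic program, this primal objective can be minimized by sub-gradient descent; the sketch below is a Pegasos-style approximation, not the SVM2 algorithm of the paper, and all names and data are assumptions.

```python
# Sub-gradient sketch of the soft-margin objective
# (1/2)||w||^2 + C * sum_i max(0, 1 - y_i (w.x_i + b)).

def train_soft_margin(X, y, C=10.0, epochs=200, lr=0.01):
    d = len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:   # hinge loss active: pull towards the example
                w = [wj - lr * (wj - C * yi * xj) for wj, xj in zip(w, xi)]
                b += lr * C * yi
            else:            # only the regularizer acts: shrink w
                w = [wj - lr * wj for wj in w]
    return w, b

def predict(w, b, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

# Toy separable data for illustration.
X = [[2.0, 2.0], [3.0, 3.0], [-2.0, -2.0], [-3.0, -3.0]]
y = [1, 1, -1, -1]
w, b = train_soft_margin(X, y)
```

On non-linearly separable data the same loop simply tolerates a few active slack terms, which is exactly the soft-margin trade-off controlled by C.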
The optimality is expressed in terms of the objective function φ : Rd × Rn → [0, +∞),

φ(w, ξ) = (1/2)∥w∥² + C f ( Σ_{i=1..n} ξi^σ ) ,    (8)

where C > 0 is a given constant, ξ = (ξ1, . . . , ξn), and f is a monotone convex function with f(0) = 0. The idea is to compute a subset of S, say {(xi1, yi1), . . . , (xik, yik)}, by minimizing Φσ(ξ), such that there exists an optimal hyperplane for S \ {(xi1, yi1), . . . , (xik, yik)}. Such an optimal hyperplane is referred to as a soft margin hyperplane ([3]).

Figure 1: a) optimal separating hyperplane; b) classification errors.

A soft margin hyperplane is a solution of the constrained optimization problem

arg min_{w ∈ Rd, b ∈ R, ξ ∈ Rn} φ(w, ξ) subject to yi (wT xi + b) ≥ 1 − ξi , ∀ 1 ≤ i ≤ n , ξi ≥ 0 , ∀ 1 ≤ i ≤ n .    (9)

The samples represented in Figure 1b correspond to the non-linearly separable case; a soft margin hyperplane, the separating area, and the slack variables are indicated in the figure. The computation of a soft margin hyperplane is carried out by the algorithm SVM2.

Algorithm SVM2 ([9])
Input: the learning sequence S; C ∈ (0, ∞).
Step 1. Compute the matrix D = (dik) of entries dik = yi yk (xi)T xk , i, k = 1..n.
Step 2. Solve the constrained optimization problem

α∗ = arg max_{α ∈ Rn} ( αT 1 − (1/2) αT D α − (αmax)² / (4C) ) , αi ≥ 0 , ∀ 1 ≤ i ≤ n , Σ_{i=1..n} αi yi = 0 ,    (10)

where αmax = max{α1, . . . , αn}.
Step 3. Select two support vectors xr, xs such that α∗r > 0 , α∗s > 0 , yr = −1 , ys = 1.
Step 4. Compute the parameters w∗, b∗ of the soft margin hyperplane, and the width of the separating area ρ(w∗, b∗), according to (6).
Output: w∗, b∗, ρ(w∗, b∗).

3 Unsupervised learning using the k-means method

Center-based clustering algorithms are very efficient for clustering large and high-dimensional databases.
They use objective functions to express the quality of any clustering solution, the optimal solution corresponding to a solution of a constrained or unconstrained optimization problem imposed on the particular objective function. Usually the clusters found have convex shapes, and one or more centers are computed for each cluster. The k-means algorithm was introduced by MacQueen ([8]) for clustering numerical data, each of the produced clusters having a center referred to as the cluster mean. Let D = {x1, . . . , xn} ⊂ Rd be the data and k a given positive integer. The classes of any partition {C1, . . . , Ck} of D are called clusters. For any {µ(C1), . . . , µ(Ck)} ⊂ Rd, where each µ(Ci) is taken as the center of Ci, the inertia momentum is

ε = Σ_{i=1..k} Σ_{x ∈ Ci} d²(x, µ(Ci)) ,    (11)

where d is a suitable distance function on Rd. In the following we take d to be the Euclidean distance, d(x, y) = ∥x − y∥. The k-means method proceeds by iteratively allocating the individuals to the nearest clusters and re-computing the centers so as to minimize the inertia momentum; the computation ends when only non-significant changes of the centers, of the value of the inertia momentum, or of the membership of individuals to clusters are obtained. The k-means algorithm can be treated as an optimization problem where the goal is to minimize a given objective function under certain constraints. Let C be the set of all subsets of Rd of cardinality k; any particular Q = {q1, . . . , qk} ∈ C is a possible set of cluster centers. Any partition of D into k classes can obviously be represented by an n × k matrix W = (wil) where

(i) wil ∈ {0, 1} , i = 1..n , l = 1..k;
(ii) Σ_{l=1..k} wil = 1 , i = 1..n.
(12)

The k-means algorithm can be formulated as the constrained optimization problem on the objective function P(W, Q) = Σ_{i=1..n} Σ_{l=1..k} wil ∥xi − ql∥² as follows:

min_{W ∈ Mn×k(R), Q ∈ C} P(W, Q)
subject to wil ∈ {0, 1} , i = 1..n , l = 1..k ; Σ_{l=1..k} wil = 1 , i = 1..n ; Q = {q1, . . . , qk} .    (13)

The "hard" problem (13) can be solved by decomposing it into two simpler problems, P1 and P2, and then iteratively solving them, where:

P1. Fix Q = Q̂ ∈ C and solve the reduced constrained optimization problem for P(W, Q̂).
P2. Fix W = Ŵ ∈ Mn×k(R) and solve the reduced unconstrained optimization problem for P(Ŵ, Q).

The solutions of these problems can be derived by straightforward computations, and they are given by the following theorems.

Theorem 1. For any fixed set of centers Q̂ = {q̂1, . . . , q̂k}, the function P(W, Q̂) is minimized at W(0) = (w(0)ij) if and only if W(0) satisfies the conditions

w(0)il = 0 ⇐⇒ ∥xi − q̂l∥ > min_{1≤t≤k} ∥xi − q̂t∥ ,
w(0)il = 1 =⇒ ∥xi − q̂l∥ = min_{1≤t≤k} ∥xi − q̂t∥ ,
Σ_{j=1..k} w(0)ij = 1 ,

for any i = 1..n, l = 1..k. Note that, in general, for a given Q̂ there are several solutions, because in general there exist individuals xi at minimum distance to more than one center of Q̂.

Theorem 2. For any fixed Ŵ satisfying the constraints of (13), the function P(Ŵ, Q) is minimized at the unique point Q(0) = {q(0)1, . . . , q(0)k} if and only if

q(0)l = ( Σ_{i=1..n} ŵil xi ) / ( Σ_{i=1..n} ŵil ) , l = 1..k.

The scheme of the k-means algorithm, viewed as a search method for solving the optimization problem (13), is given below.

Algorithm k-MOP
Input: D the data set, k the pre-specified number of clusters, d the data dimensionality, T a threshold on the maximum number of iterations.
Initializations: $Q^{(0)}$, $t \leftarrow 0$; solve $P(W, Q^{(0)})$ and get $W^{(0)}$; $sw \leftarrow$ false
repeat
  $\hat{W} \leftarrow W^{(t)}$; solve $P(\hat{W}, Q)$ and get $Q^{(t+1)}$
  if $P(\hat{W}, Q^{(t)}) = P(\hat{W}, Q^{(t+1)})$ then
    $sw \leftarrow$ true; output $(\hat{W}, Q^{(t+1)})$
  else
    $\hat{Q} \leftarrow Q^{(t+1)}$; solve $P(W^{(t)}, \hat{Q})$ and get $W^{(t+1)}$
    if $P(W^{(t)}, \hat{Q}) = P(W^{(t+1)}, \hat{Q})$ then
      $sw \leftarrow$ true; output $(W^{(t+1)}, \hat{Q})$
    endif
  endif
  $t \leftarrow t + 1$
until $sw$ or $t > T$
Output: $\hat{W}$, $\hat{Q}$.

Note that the computational complexity of the algorithm K-MOP is $O(NKd)$ per iteration. The sequence of values $P(W^{(t)}, Q^{(t)})$, where $W^{(t)}, Q^{(t)}$ are computed by K-MOP, is strictly decreasing, therefore the algorithm converges to a local minimum of the objective function.

4 The combined separating technique based on SVM and the k-means algorithm

At first sight it seems unreasonable to compare a supervised technique to an unsupervised one, mainly because they refer to totally different situations: on one hand, supervised techniques are applied when the data set consists of correctly labeled objects, while on the other hand unsupervised methods deal with unlabeled objects. However, our point is to combine the SVM and the k-means algorithm in order to obtain a new design of a linear classifier. The aim of the experimental analysis is to evaluate the performance of the linear classifier resulting from the combination of the supervised SVM method and the 2-means algorithm. The method can be applied to data that are either linearly separable or non-linearly separable. Obviously, in the case of non-linearly separable data the classification cannot be performed without errors, and in this case the number of misclassified examples is the most reasonable criterion for performance evaluation.
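The K-MOP scheme above can be sketched in a few lines. This is a minimal illustration (function and variable names are ours, not from the paper): problem P1 is solved by nearest-center assignment (Theorem 1), problem P2 by the centroid formula (Theorem 2), and the loop stops when the objective $P(W, Q)$ no longer decreases.

```python
import numpy as np

def k_mop(X, k, T=100, rng=None):
    """Sketch of K-MOP: alternately solve P1 (assignments for fixed
    centers) and P2 (centers for fixed assignments) until the objective
    of problem (13) stops decreasing or T iterations elapse."""
    rng = np.random.default_rng(rng)
    Q = X[rng.choice(len(X), size=k, replace=False)]  # initial centers
    prev = np.inf
    for _ in range(T):
        # P1: assign each point to its nearest center (Theorem 1)
        dist = np.linalg.norm(X[:, None, :] - Q[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        # P2: centroid update for fixed assignments (Theorem 2)
        Q = np.array([X[labels == l].mean(axis=0) if np.any(labels == l)
                      else Q[l] for l in range(k)])
        # objective P(W, Q): sum of squared distances to nearest center
        obj = np.sum(np.min(np.linalg.norm(X[:, None, :] - Q[None, :, :],
                                           axis=2) ** 2, axis=1))
        if obj >= prev:  # no further decrease: local minimum reached
            break
        prev = obj
    return labels, Q, obj

# Hypothetical toy data: two well-separated groups in the plane.
X = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
labels, Q, obj = k_mop(X, 2, rng=0)
```

On this toy set the algorithm converges to the centers $(0, 0.5)$ and $(10, 10.5)$ with objective value $1.0$, regardless of the initial center choice.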
Of particular importance is the case of linearly separable data, the performance in this case being evaluated in terms of both the misclassified examples and the generalization capacity expressed by the width of the separating area. In real-life situations it is usually very difficult, or even impossible, to establish whether the data represent a linearly or non-linearly separable set. Using the SVM1 approach we can identify which case a given data set belongs to. For linearly separable data, SVM1 computes a separating hyperplane that is optimal from the point of view of the generalization capacity; in the case of non-linearly separable data, SVM2 computes a linear classifier that minimizes the number of misclassified examples. A series of developments are based on non-linear transforms represented by kernel functions whose ranges are high-dimensional spaces; the increase of dimensionality and a suitable choice of the kernel aim to transform a non-linearly separable problem into a linearly separable one. The computational complexity of kernel-based approaches is significantly larger, therefore in case the performance of the algorithm SVM1 proves reasonably good it could be taken as an alternative to a kernel-based SVM. We perform a comparative analysis on data consisting of examples generated from two-dimensional Gaussian distributions. In the case of a non-linearly separable data set, using the k-means algorithm we get a system of pairwise disjoint clusters together with the set of their centers representing a local minimum point of the criterion (13), the clusters being linearly separable when $K = 2$. Consequently, the SVM1 algorithm computes a linear classifier that separates the resulting clusters without errors. Our procedure is described as follows.

Input: $S$ = the learning sequence.
Step 1. Compute the matrix $D = (d_{ik})$ of entries $d_{ik} = y_i y_k\, x_i^T x_k$, $i, k = \overline{1,N}$; $sh \leftarrow$ true.
Step 2.
If the constrained optimization problem (5) has no solution then $sh \leftarrow$ false; input $C \in (0, \infty)$ for the soft margin hyperplane and solve the constrained optimization problem (10).
Step 3. Select $x_r, x_s$ such that $\alpha^*_r > 0$, $\alpha^*_s > 0$, $y_r = -1$, $y_s = 1$; compute the parameters $w^*, b^*$ of the separating hyperplane, and the width of the separating area $\rho(w^*, b^*)$, according to (6).
Step 4. If not $sh$, then compute $nr\_err_1$ = the number of examples incorrectly classified and $err_1$ = the classification error.
Step 5. Split the set $\mathcal{D} = \{x_i \mid x_i \in \mathbb{R}^d, i = \overline{1,N}\}$ into two clusters $C_1$ and $C_2$ using the 2-means algorithm; label the data belonging to $C_1$ by $y'_i = 1$ and the data belonging to $C_2$ by $y'_i = -1$.
Step 6. Apply the algorithm SVM1 to $S' = \{(x_i, y'_i) \mid x_i \in \mathbb{R}^d,\ y'_i \in \{-1, 1\},\ i = \overline{1,N}\}$ and obtain the parameters of the optimal separating hyperplane $w^*_1, b^*_1, \rho(w^*_1, b^*_1)$; compute $nr\_err_2$ = the number of data incorrectly classified by the 2-means algorithm and $err_2$ = the classification error resulting after the 2-means splitting.
Output: $w^*, b^*, \rho(w^*, b^*), nr\_err_1, err_1$; $w^*_1, b^*_1, \rho(w^*_1, b^*_1), nr\_err_2, err_2$.

5 Comparative analysis and experimental results

The experimental analysis is based on a long series of tests performed on linearly/non-linearly separable simulated data of different volumes. The analysis aims to derive conclusions concerning:
1. The statistical properties (empirical means, covariance matrices, eigenvalues, eigenvectors) of the clusters computed by the 2-means algorithm, as compared to their counterparts corresponding to the true distributions they come from.
2. The comparison of the performance of the linear classifier resulting from the combination of the SVM and the 2-means algorithm described in Section 4 with that of SVM2, in terms of the empirical error.
3. The influence of the sample sizes on the performance of the procedure described in Section 4.
4.
The quality of the cluster characterization in terms of the principal directions given by a system of orthogonal unit eigenvectors of the sample covariance and empirical covariance matrices of the computed clusters. The analysis aimed to derive conclusions concerning the contribution of each principal direction; for this reason, some tests were performed on data whose first principal component is strongly dominant, and others on data whose principal directions are of comparable importance.

The tests were performed on data generated from two-dimensional normal distributions $N(\mu_i, \Sigma_i)$, $i = 1, 2$, of volumes $n_1$ and $n_2$ respectively. The sample means and covariance matrices are denoted by $\hat{\mu}_i, \hat{\Sigma}_i$, $i = 1, 2$. The centers and the empirical covariance matrices corresponding to the clusters computed by the 2-means algorithm are denoted by $\overline{\mu}_i, \overline{\Sigma}_i$, $i = 1, 2$. We denote by $Z_i, \hat{Z}_i, \overline{Z}_i$, $i = 1, 2$, orthogonal matrices having as columns unit eigenvectors of $\Sigma_i, \hat{\Sigma}_i, \overline{\Sigma}_i$, respectively.

Test 1: $n_1 = n_2 = 50$,
$$\mu_1 = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \ \Sigma_1 = \begin{pmatrix} 1 & 0 \\ 0 & 0.25 \end{pmatrix}, \qquad \mu_2 = \begin{pmatrix} 2 \\ 3 \end{pmatrix}, \ \Sigma_2 = \begin{pmatrix} 0.5 & 0 \\ 0 & 0.5 \end{pmatrix}.$$
The matrices $Z_1, Z_2$ and their eigenvalues are
$$\lambda^{(1)}_1 = 0.25, \ \lambda^{(1)}_2 = 1, \ Z_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}; \qquad \lambda^{(2)}_1 = 0.5, \ \lambda^{(2)}_2 = 0.5, \ Z_2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}.$$
The data set is non-linearly separable and is represented in Figure 2i)a. In this case we get
$$\hat{\mu}_1 = \begin{pmatrix} 0.92 \\ 1.0004 \end{pmatrix}, \ \hat{\Sigma}_1 = \begin{pmatrix} 0.85 & 0.08 \\ 0.08 & 0.25 \end{pmatrix}, \qquad \hat{\mu}_2 = \begin{pmatrix} 1.98 \\ 2.87 \end{pmatrix}, \ \hat{\Sigma}_2 = \begin{pmatrix} 0.44 & 0.09 \\ 0.09 & 0.63 \end{pmatrix},$$
the matrices $\hat{Z}_1, \hat{Z}_2$ and their eigenvalues being
$$\hat{\lambda}^{(1)}_1 = 0.24, \ \hat{\lambda}^{(1)}_2 = 0.86, \ \hat{Z}_1 = \begin{pmatrix} 0.14 & -0.98 \\ -0.98 & -0.14 \end{pmatrix}; \qquad \hat{\lambda}^{(2)}_1 = 0.40, \ \hat{\lambda}^{(2)}_2 = 0.67, \ \hat{Z}_2 = \begin{pmatrix} -0.92 & 0.38 \\ 0.38 & 0.92 \end{pmatrix}.$$
Using SVM2 with $C = 70$ we get the classification error $class\_error = 14.70$, the number of misclassified samples $n\_errors = 13$, and the width of the separating area $\rho = 0.61$.
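The combined procedure of Section 4 (Steps 5 and 6) can be sketched compactly. This is a minimal illustration under our own assumptions: the 2-means split is a plain Lloyd iteration, and a perceptron is used as a simple stand-in for the SVM1 step — it finds *a* separating hyperplane, not the maximum-margin one the paper computes.

```python
import numpy as np

def two_means_labels(X, iters=50, seed=0):
    """Step 5: split the unlabeled data into two clusters with the
    2-means algorithm and label one cluster +1, the other -1."""
    rng = np.random.default_rng(seed)
    Q = X[rng.choice(len(X), size=2, replace=False)]  # initial centers
    lab = np.zeros(len(X), dtype=int)
    for _ in range(iters):
        d = np.linalg.norm(X[:, None, :] - Q[None, :, :], axis=2)
        lab = d.argmin(axis=1)
        Q = np.array([X[lab == j].mean(axis=0) for j in (0, 1)])
    return np.where(lab == 0, -1, 1)

def linear_separator(X, y, epochs=200, lr=0.1):
    """Step 6 stand-in: a perceptron replaces SVM1 here; on linearly
    separable labels it returns some separating hyperplane (w, b)."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            if yi * (xi @ w + b) <= 0:       # misclassified: update
                w, b = w + lr * yi * xi, b + lr * yi
    return w, b

# Hypothetical linearly separable sample (unlabeled, as in Step 5).
X = np.array([[0.0, 0.0], [0.5, 1.0], [3.0, 4.0], [3.5, 4.5]])
y = two_means_labels(X)
w, b = linear_separator(X, y)
```

The resulting line separates the two computed clusters without errors, mirroring the role of $D_2$ in the tests below, though its margin is not the optimal $\rho$.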
The value of the error coefficient, defined as the ratio of the number of misclassified samples to the total volume of the data, is $c\_error = 0.13$. The soft margin line $D_1$ is represented in Figure 2i)b.

Figure 2: i) The classification of the data set in Test 1; ii) the classification of the data set in Test 2.

By applying the 2-means algorithm we get clusters whose sample means and covariance matrices are
$$\overline{\mu}_1 = \begin{pmatrix} 0.88 \\ 1.06 \end{pmatrix}, \ \overline{\Sigma}_1 = \begin{pmatrix} 0.64 & 0.05 \\ 0.05 & 0.30 \end{pmatrix}, \qquad \overline{\mu}_2 = \begin{pmatrix} 2.13 \\ 2.96 \end{pmatrix}, \ \overline{\Sigma}_2 = \begin{pmatrix} 0.41 & -0.06 \\ -0.06 & 0.56 \end{pmatrix}.$$
The matrices $\overline{Z}_1, \overline{Z}_2$ and their eigenvalues are
$$\overline{\lambda}^{(1)}_1 = 0.29, \ \overline{\lambda}^{(1)}_2 = 0.65, \ \overline{Z}_1 = \begin{pmatrix} 0.14 & -0.98 \\ -0.98 & -0.14 \end{pmatrix}; \qquad \overline{\lambda}^{(2)}_1 = 0.39, \ \overline{\lambda}^{(2)}_2 = 0.58, \ \overline{Z}_2 = \begin{pmatrix} -0.92 & -0.37 \\ -0.37 & 0.92 \end{pmatrix}.$$
The number of misclassified samples is 10, and the clusters are represented in Figure 2i)c. Note that the computed centers and clusters are not influenced by the choice of the initial centers: the clusters computed by the 2-means algorithm for randomly selected initial centers are also represented in Figure 2i)c. The separating line $D_2$, resulting from applying the SVM1 algorithm to the data represented by the clusters computed by the 2-means algorithm, is represented in Figure 2i)d.

Test 2: $n_1 = 100$, $n_2 = 200$,
$$\mu_1 = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \ \Sigma_1 = \begin{pmatrix} 1 & 0 \\ 0 & 0.25 \end{pmatrix}, \qquad \mu_2 = \begin{pmatrix} 2 \\ 3 \end{pmatrix}, \ \Sigma_2 = \begin{pmatrix} 1 & 0 \\ 0 & 0.25 \end{pmatrix},$$
$$\lambda^{(1)}_1 = 0.25, \ \lambda^{(1)}_2 = 1, \ Z_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}; \qquad \lambda^{(2)}_1 = 0.25, \ \lambda^{(2)}_2 = 1, \ Z_2 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix},$$
$$\hat{\mu}_1 = \begin{pmatrix} 1.12 \\ 0.92 \end{pmatrix}, \ \hat{\Sigma}_1 = \begin{pmatrix} 1.35 & 0.04 \\ 0.04 & 0.26 \end{pmatrix}, \qquad \hat{\mu}_2 = \begin{pmatrix} 2.01 \\ 3.00 \end{pmatrix}, \ \hat{\Sigma}_2 = \begin{pmatrix} 0.86 & 0.05 \\ 0.05 & 0.25 \end{pmatrix},$$
$$\hat{\lambda}^{(1)}_1 = 0.26, \ \hat{\lambda}^{(1)}_2 = 1.35, \ \hat{Z}_1 = \begin{pmatrix} 0.03 & -0.99 \\ -0.99 & -0.03 \end{pmatrix}; \qquad \hat{\lambda}^{(2)}_1 = 0.25, \ \hat{\lambda}^{(2)}_2 = 0.87, \ \hat{Z}_2 = \begin{pmatrix} 0.09 & -0.99 \\ -0.99 & -0.09 \end{pmatrix}.$$
The data set is non-linearly separable and is represented in Figure 2ii)a.
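The empirical quantities reported in these tests (sample means, sample covariances, and principal directions) reduce to standard estimators. A minimal sketch for one class of Test 1; the seed, and therefore the concrete numbers it produces, are ours and will not match the paper's values exactly.

```python
import numpy as np

# Draw n1 = 50 points from N(mu1, Sigma1) as in Test 1 (seed is ours).
rng = np.random.default_rng(0)
mu1 = np.array([1.0, 1.0])
sigma1 = np.array([[1.0, 0.0], [0.0, 0.25]])
X1 = rng.multivariate_normal(mu1, sigma1, size=50)

mu_hat = X1.mean(axis=0)                    # empirical mean
sigma_hat = np.cov(X1, rowvar=False)        # sample covariance (2 x 2)
eigvals, Z_hat = np.linalg.eigh(sigma_hat)  # ascending eigenvalues;
                                            # columns of Z_hat are the
                                            # unit principal directions
```

`numpy.linalg.eigh` is the natural choice here since the sample covariance is symmetric; its columns play the role of the matrices $\hat{Z}_i$ above.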
Applying SVM2 with $C = 5$ we obtain the soft margin line $D_1$ represented in Figure 2ii)b and $class\_error = 19.12$, $n\_errors = 13$, $\rho = 0.25$, $c\_error = 0.043$. The clusters computed by the 2-means algorithm are represented in Figure 2ii)c, and their statistical characteristics are
$$\overline{\mu}_1 = \begin{pmatrix} 0.96 \\ 1.004 \end{pmatrix}, \ \overline{\Sigma}_1 = \begin{pmatrix} 1.19 & -0.10 \\ -0.10 & 0.38 \end{pmatrix}, \qquad \overline{\mu}_2 = \begin{pmatrix} 2.10 \\ 3.007 \end{pmatrix}, \ \overline{\Sigma}_2 = \begin{pmatrix} 0.76 & -0.02 \\ -0.02 & 0.28 \end{pmatrix},$$
$$\overline{\lambda}^{(1)}_1 = 0.37, \ \overline{\lambda}^{(1)}_2 = 1.20, \ \overline{Z}_1 = \begin{pmatrix} -0.12 & -0.99 \\ -0.99 & 0.12 \end{pmatrix}; \qquad \overline{\lambda}^{(2)}_1 = 0.27, \ \overline{\lambda}^{(2)}_2 = 0.76, \ \overline{Z}_2 = \begin{pmatrix} -0.05 & -0.99 \\ -0.99 & 0.05 \end{pmatrix}.$$
In this case the number of misclassified samples is 18. Note that the initial choice of the centers does not significantly influence the computed centers and clusters; for instance, Figure 2ii)c represents the resulting clusters in the case of randomly selected initial centers. The separating line $D_2$, computed by the algorithm SVM1 applied to the data represented by these clusters, is represented in Figure 2ii)d.

Test 3: $n_1 = n_2 = 50$,
$$\mu_1 = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \ \Sigma_1 = \begin{pmatrix} 1 & 0 \\ 0 & 0.25 \end{pmatrix}, \qquad \mu_2 = \begin{pmatrix} 3 \\ 4 \end{pmatrix}, \ \Sigma_2 = \begin{pmatrix} 0.5 & 0 \\ 0 & 0.5 \end{pmatrix},$$
$$\lambda^{(1)}_1 = 0.25, \ \lambda^{(1)}_2 = 1, \ Z_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}; \qquad \lambda^{(2)}_1 = 0.5, \ \lambda^{(2)}_2 = 0.5, \ Z_2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix},$$
$$\hat{\mu}_1 = \begin{pmatrix} 0.76 \\ 1.008 \end{pmatrix}, \ \hat{\Sigma}_1 = \begin{pmatrix} 1.17 & -0.06 \\ -0.06 & 0.21 \end{pmatrix}, \qquad \hat{\mu}_2 = \begin{pmatrix} 2.87 \\ 4.03 \end{pmatrix}, \ \hat{\Sigma}_2 = \begin{pmatrix} 0.56 & 0.009 \\ 0.009 & 0.31 \end{pmatrix},$$
$$\hat{\lambda}^{(1)}_1 = 0.214, \ \hat{\lambda}^{(1)}_2 = 1.180, \ \hat{Z}_1 = \begin{pmatrix} -0.07 & -0.99 \\ -0.99 & 0.07 \end{pmatrix}; \qquad \hat{\lambda}^{(2)}_1 = 0.31, \ \hat{\lambda}^{(2)}_2 = 0.56, \ \hat{Z}_2 = \begin{pmatrix} 0.03 & -0.99 \\ -0.99 & -0.03 \end{pmatrix}.$$
The data set is linearly separable and is represented in Figure 3i)a. The soft margin line $D_1$ computed by the SVM1 algorithm is represented in Figure 3i)b, the value of the resulting margin being $\rho = 1.196429$.

Figure 3: i) The classification of the data set in Test 3; ii) the classification of the data set in Test 4.
The clusters computed by the 2-means algorithm are represented in Figure 3i)c; they coincide with the classes of the initial data set, whatever the initial choice of the centers is, so the statistical characteristics are
$$\overline{\mu}_i = \hat{\mu}_i, \quad \overline{\Sigma}_i = \hat{\Sigma}_i, \quad \overline{\lambda}^{(i)}_j = \hat{\lambda}^{(i)}_j, \quad \overline{Z}_i = \hat{Z}_i, \qquad i, j = 1, 2,$$
and the separating line $D_2$, computed by the algorithm SVM1 and represented in Figure 3i)d, coincides with $D_1$.

Test 4: $n_1 = 100$, $n_2 = 150$,
$$\mu_1 = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \ \Sigma_1 = \begin{pmatrix} 1 & 0 \\ 0 & 0.25 \end{pmatrix}, \qquad \mu_2 = \begin{pmatrix} 3 \\ 4 \end{pmatrix}, \ \Sigma_2 = \begin{pmatrix} 0.5 & 0 \\ 0 & 0.5 \end{pmatrix},$$
$$\lambda^{(1)}_1 = 0.25, \ \lambda^{(1)}_2 = 1, \ Z_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}; \qquad \lambda^{(2)}_1 = 0.5, \ \lambda^{(2)}_2 = 0.5, \ Z_2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix},$$
$$\hat{\mu}_1 = \begin{pmatrix} 1.22 \\ 1.03 \end{pmatrix}, \ \hat{\Sigma}_1 = \begin{pmatrix} 1.04 & -0.03 \\ -0.03 & 0.24 \end{pmatrix}, \qquad \hat{\mu}_2 = \begin{pmatrix} 2.98 \\ 3.99 \end{pmatrix}, \ \hat{\Sigma}_2 = \begin{pmatrix} 0.48 & -0.01 \\ -0.01 & 0.43 \end{pmatrix},$$
$$\hat{\lambda}^{(1)}_1 = 0.24, \ \hat{\lambda}^{(1)}_2 = 1.04, \ \hat{Z}_1 = \begin{pmatrix} -0.04 & -0.99 \\ -0.99 & 0.04 \end{pmatrix}; \qquad \hat{\lambda}^{(2)}_1 = 0.42, \ \hat{\lambda}^{(2)}_2 = 0.49, \ \hat{Z}_2 = \begin{pmatrix} -0.27 & -0.96 \\ -0.96 & 0.27 \end{pmatrix}.$$
The data set is linearly separable and is represented in Figure 3ii)a. Applying SVM1 we obtain the soft margin line $D_1$ represented in Figure 3ii)b and $\rho = 0.552508$. The clusters computed by the 2-means algorithm are represented in Figure 3ii)c, and their statistical characteristics are
$$\overline{\mu}_1 = \begin{pmatrix} 1.20 \\ 1.04 \end{pmatrix}, \ \overline{\Sigma}_1 = \begin{pmatrix} 0.98 & -0.04 \\ -0.04 & 0.26 \end{pmatrix}, \qquad \overline{\mu}_2 = \begin{pmatrix} 3.00 \\ 3.98 \end{pmatrix}, \ \overline{\Sigma}_2 = \begin{pmatrix} 0.48 & -0.04 \\ -0.04 & 0.45 \end{pmatrix},$$
$$\overline{\lambda}^{(1)}_1 = 0.26, \ \overline{\lambda}^{(1)}_2 = 0.98, \ \overline{Z}_1 = \begin{pmatrix} -0.05 & -0.99 \\ -0.99 & 0.05 \end{pmatrix}; \qquad \overline{\lambda}^{(2)}_1 = 0.42, \ \overline{\lambda}^{(2)}_2 = 0.51, \ \overline{Z}_2 = \begin{pmatrix} -0.60 & -0.79 \\ -0.79 & 0.60 \end{pmatrix}.$$
In this case the number of misclassified samples is 2, the initial centers being randomly selected. The separating line $D_2$, computed by the algorithm SVM1 applied to the data represented by these clusters, is represented in Figure 3ii)d.
6 Conclusions and future work

Although combining a supervised technique with an unsupervised one may seem meaningless, mainly because they refer to totally different situations, the combined methodology obtained by putting together the k-means and SVM methods yielded an improved classifier. The experimental results point out the good performance of the proposed method from both points of view, accuracy and computational complexity. We are optimistic that research aiming to obtain refined methods by combining supervised, unsupervised and semi-supervised techniques has good chances of providing a class of new, powerful classification schemes. A series of tests has already been performed on different types of classifiers obtained by combining PCA (principal component analysis), ICA (independent component analysis) and kernel-based SVMs, the results being quite encouraging.

Bibliography

[1] S. Abe, Support Vector Machines for Pattern Classification, Springer-Verlag, 2005.
[2] C.J.C. Burges, A tutorial on support vector machines for pattern recognition, Data Mining and Knowledge Discovery, 2, pp. 121-167, 1998.
[3] C. Cortes, V. Vapnik, Support-vector networks, Machine Learning, 20(3):273-297, 1995.
[4] N. Cristianini, J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods, Cambridge University Press, 2000.
[5] G. Gan, C. Ma, J. Wu, Data Clustering: Theory, Algorithms and Applications, SIAM, 2007.
[6] S.R. Gunn, Support vector machines for classification and regression, University of Southampton, technical report, 1998.
[7] L. State, I. Paraschiv-Munteanu, Introducere în teoria statistică a recunoaşterii formelor, Editura Universităţii din Piteşti, 2009.
[8] J.B. MacQueen, Some methods for classification and analysis of multivariate observations, Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, University of California Press, pp. 281-297, 1967.
[9] I. Paraschiv-Munteanu, Support vector machine in solving pattern recognition tasks, Proceedings of the First Doctoral Student Workshop in Computer Science, University of Piteşti, May 2009.
[10] I. Paraschiv-Munteanu, Theoretical approach in performance evaluation of a classification system, Scientific Bulletin of the University of Piteşti, Series Mathematics and Informatics, no. 14, pp. 203-218, 2008.
[11] N. Popescu-Bodorin, Fast k-means image quantization algorithm and its application to iris segmentation, Scientific Bulletin of the University of Piteşti, Series Mathematics and Informatics, no. 14, 2008.
[12] R. Stoean, C. Stoean, M. Preuss, D. Dumitrescu, Evolutionary multi-class support vector machines for classification, International Journal of Computers Communications & Control, vol. 1, suppl. S, pp. 423-428, 2006.
[13] V.N. Vapnik, The Nature of Statistical Learning Theory, New York, Springer-Verlag, 1995.
[14] V.N. Vapnik, Statistical Learning Theory, New York, Wiley-Interscience, 1998.
[15] R. Xu, D.C. Wunsch II, Clustering, Wiley & Sons, 2009.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 5, pp. 929-938

Network Coded Transmission in a Wireless Grid Network with an Energy Constraint

R. Stoian, A.V. Raileanu, L.A. Perisoara

Rodica Stoian, Adrian Victor Raileanu, Lucian Andrei Perisoara
Politehnica University of Bucharest
Romania, 061071 Bucharest, 1-3 Iuliu Maniu
E-mail: {rodicastoian2004, adrianrai, lperisoara}@yahoo.com

Abstract: In wireless networks, routing based on packet forwarding hardly yields optimum transmission performance in terms of network utilization and throughput. As an alternative to routing, network coding has been introduced in recent years, where nodes mix the data instead of forwarding it. In applications, random linear network coding is the most used method, due to its decentralized mode of operation and because it preserves the achievability of multicast capacity bounds.
In this paper we study the performance of network coding used for multicast transmission of messages in a wireless grid network with an energy constraint. Several energy saving schemes have been proposed in the literature, but in this study we focus on the duty cycling scheme, in which nodes are not always in the on state. The performance is measured as the end-to-end delay, i.e. the duration until each node can decode the message sent by the source, and the CDF of the observations is used for the analysis.
Keywords: network coding, energy efficiency, end-to-end delay, duty cycling.

1 Introduction

Energy saving is an important factor in wireless transmissions, especially for autonomous devices, i.e. battery operated nodes. In applications like battlefield surveillance or environment and habitat monitoring, it is sometimes hostile, hazardous or impractical to replace or recharge the batteries. The performance of wireless network applications highly depends on the lifetime of the network. For practical applications we expect the lifetime to range from several months to several years, so energy saving is crucial in designing the network. Energy consumption in a network node can be due to useful sources (transmitting, receiving or processing data) or wasteful sources (channel idle listening, retransmissions due to packet collisions, overhearing, control packets used for error control). The critical issue is to minimize the energy consumption of network nodes while meeting the application requirements. This paper is organized as follows. In Section II we explain how network coding is applied to wireless networks, marking some advantages of using it. In Section III we describe the scenario of a general multicast transmission in a wireless network. In Section IV we present the problem of energy consumption optimization using duty cycling. Finally, in Section V we present simulation results.
2 Network coding and wireless networks

Network coding is a recent field of information theory that breaks the classical assumption about routing in networks: instead of simply forwarding the packets, the intermediate nodes recombine several input packets into one or several output packets. In [1], Ahlswede et al. showed that network coding achieves the multicast capacity, which is defined as the maximum data rate achievable for a multicast transmission. In [2] it is shown that the maximum multicast capacity can be achieved by using linear encoding functions at each node, which implies solving linear equations at the receiver. Figure 1 shows a simple example of using network coding to reduce the number of transmissions used to exchange two bits $b_1$ and $b_2$, the operation applied being XOR. With network coding, the first node can recover the bit $b_2$ from the received bit $b_1 \oplus b_2$ and the known bit $b_1$; similarly, $b_1$ can be recovered at the second node. Network coding can reduce the traffic without increasing delay, and so it can save energy by reducing the amount of transmitted data.

Figure 1: An example of decreasing the transmission time using network coding

Suppose that for a network the source node $s$ emits $K$ information packets $x_1, x_2, \ldots, x_K$, each of length $L$ symbols from a finite field $GF(q)$, to $N$ receivers $t_1, t_2, \ldots, t_N$. For linear network coding, each node combines a number of received packets into one or several output packets:

$$y = \sum_{i=1}^{K} \alpha_i x_i \qquad (2.1)$$

where the summation is applied at every symbol position. For random linear network coding, the coefficients $\alpha_i$ of the linear combination are generated in a random manner, which assures with high probability the linear independence of the output packets of a node, for a sufficiently large size $q = 2^m$ of the finite field $GF(q)$, as proved in [3].
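The mechanism of equation (2.1) can be illustrated end to end in the smallest field, $GF(2)$, where each coefficient is a bit and both the coefficient multiplication and the summation reduce to XOR. This is a minimal sketch under our own assumptions (function names, packet contents and the seed are ours), with decoding done by Gaussian elimination over $GF(2)$:

```python
import numpy as np

def encode(packets, rng):
    """One random linear combination over GF(2) (eq. (2.1) with q = 2):
    each coefficient is a random bit, the summation is bytewise XOR."""
    alpha = rng.integers(0, 2, size=len(packets))
    y = np.bitwise_xor.reduce([int(a) * p for a, p in zip(alpha, packets)])
    return alpha, y

def decode(coded, k):
    """Gaussian elimination over GF(2) on the coefficient vectors, with
    the same XOR row operations applied to the payloads. Returns the k
    original packets, or None while fewer than k linearly independent
    (innovative) combinations have been received."""
    A = np.array([a for a, _ in coded])
    Y = np.array([y for _, y in coded])
    M = np.hstack([A, Y]).astype(np.int64)   # augmented system [A | Y]
    row = 0
    for col in range(k):
        piv = next((r for r in range(row, len(M)) if M[r, col] == 1), None)
        if piv is None:
            return None                      # rank deficient so far
        M[[row, piv]] = M[[piv, row]]        # bring pivot into place
        for r in range(len(M)):
            if r != row and M[r, col] == 1:
                M[r] ^= M[row]               # XOR = addition over GF(2)
        row += 1
    return M[:k, k:]                         # payload part = decoded data

# Hypothetical 3-packet message; keep receiving coded packets until
# k = 3 independent combinations allow full decoding.
rng = np.random.default_rng(1)
packets = [np.array([1, 2, 3]), np.array([4, 5, 6]), np.array([7, 8, 9])]
coded, decoded = [], None
while decoded is None:
    coded.append(encode(packets, rng))
    decoded = decode(coded, 3)
```

The papers' practical choice of $GF(256)$ follows the same structure but needs finite-field byte multiplication; $GF(2)$ keeps the sketch self-contained while showing why dependent (non-innovative) packets contribute nothing to the rank.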
The encoding coefficients form the encoding vector $\alpha = (\alpha_1, \alpha_2, \ldots, \alpha_K)$, which belongs to a $K$-dimensional vector space over $GF(q)$. All encoding vectors associated with the output edges of all intermediate nodes from $s$ to a specific node $t$ form the encoding matrix. When we refer to a network code we must specify all the encoding vectors to be used in the encoding process, for all edges of the network. The encoding coefficients are sent to the destination in the packet header, so the destination nodes can decode the packet without knowing the network topology or the encoding rules, even if nodes are added or removed in an ad-hoc manner. Assume a node $t$ has received $m$ coded packets $y_1, y_2, \ldots, y_m$ and the encoding vectors $\alpha_1, \alpha_2, \ldots, \alpha_m$. To decode the packets, it needs to solve a linear system with $m$ equations and $K$ unknowns $x_i$, derived from (2.1). To solve this system, the node waits until it has received at least $m \ge K$ linearly independent packets, equivalently $m$ linearly independent encoding vectors. This condition is assured using buffers for each input of the node. The buffer stores only the innovative packets (packets which are not a linear combination of the packets already stored in the buffer); non-innovative packets are discarded by Gaussian elimination, because they do not provide any new information to the node. Some advantages of using network coding in wireless networks are throughput and capacity improvements [1], bandwidth and energy savings [4], robustness to noise [5], and reduced traffic.

3 A scenario of general multicast

We introduce a grid wireless mesh network, which is a generalization of a local network (e.g. an office), or a special-purpose network deployed over a rectangular area for monitoring or as a point of presence (e.g. information panels). The type of communication is one-to-many, as we consider one original source (e.g.
a gateway to another network, a controller, etc.) that sends messages to all the other points, called nodes. This is called general multicast, as one message is sent to a number of participants (e.g. all participants), but because not all nodes are in the radio range of the source, the message is relayed node by node. Transmission over radio is simplified, considering only one channel and no collisions. The radio signal is affected by distance attenuation and a simple exponentially distributed noise floor. The network stack is reduced to simple MAC/IP layers, with the purpose of taking into consideration only MAC latency and of identifying nodes by addresses. The position of the source is at one corner of the rectangle (see Figure 2), as it simulates a gateway or a controller. At the same time, this position was chosen because it provides the worst case scenario, in which the source has the lowest possible number of neighbors in the given situation.

Figure 2: General multicast relay in a wireless grid network, with the source in corner position. The radio range of the source is not large enough for broadcast.

The message unit is considered to be 1 byte. The entire transmission is an M byte message generated by the source. In normal networks this would take M consecutive transmissions from the source, plus some additional ones when no acknowledgement is received. Our model uses random network coding to disseminate information, so the source emits a linear combination of all the message bytes at each transmission. A number of K = 32 random coefficients from GF(256) is used for messages with M < K. The original message is padded with zeros, and each byte is multiplied by the corresponding coefficient $k_i$ and summed (i.e. XOR-ed). The result is one mixed byte and a list of coefficients that are used for decoding at the receiving nodes. For bandwidth saving, coefficients can be chosen from lower Galois fields, e.g.
GF(2), reducing the size of the coefficient list. However, using lower Galois fields increases the probability that two packets will be linearly dependent and will not contribute to the decoding process. Each receiving node accumulates linearly independent packets (novel packets), based on an analysis of the coefficient list included in each packet. When it has received at least K such packets, it decodes the message using matrix inversion. A node that receives a novel packet will re-encode and send it, using a linear combination of all its received packets. If a node receives a packet that is linearly dependent on its received packets, it will discard it without forwarding it. When a node decodes the entire message, it becomes a source, generating packets with its own randomly generated coefficients. Generating acknowledgement messages is out of the scope of this paper. We evaluate transmission performance using the end-to-end delay (ETED) metric, calculated as the time from when the source emitted the first packet until a specific node has received and decoded the entire message. The overall end-to-end delay (OETED) is the maximum end-to-end delay, i.e. the one measured at the last node that received and decoded the message. The minimum end-to-end delay is the time until the first successful transmission (i.e. the first node that decodes the full message). The average end-to-end delay is the arithmetic average of all the successful transmission times.

4 Energy consumption optimization

The two main operations that require energy optimization are transmission and reception. Transmission energy is optimized through radio power adjustment, so that a tradeoff is made between range and battery life. In our scenario all the nodes use the same transmission power. In non-centralized networks, however, reduction of transmission energy can also be achieved by reducing the number of redundant sending operations in nodes.
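The delay metrics defined in Section 3 reduce to simple aggregations over the per-node decode completion times. A minimal sketch; the variable names and the sample values are ours, purely for illustration.

```python
# Per-node decode completion times, in seconds since the source's first
# packet (hypothetical values for four nodes).
completion_times = [0.12, 0.30, 0.45, 0.27]

min_eted = min(completion_times)      # first node fully decoded
overall_eted = max(completion_times)  # OETED: last node fully decoded
avg_eted = sum(completion_times) / len(completion_times)

print(min_eted, overall_eted, avg_eted)  # 0.12 0.45 0.285
```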
In flooding based routing, nodes tend to forward each packet received, creating a lot of duplicated information inside the network. Widmer et al. [12] proposed network coding algorithms that reduce the number of transmissions per node compared to flooding. In [13], Fragouli et al. studied distributed algorithms to achieve an optimum number of transmissions in grid and random topologies; the results show that network coding can achieve up to 30% energy reduction compared to probabilistic routing. The current study focuses on the optimization of reception and decoding, analyzing how classical energy saving schemes affect the performance of network coded transmissions. Idle listening and overhearing are major sources of energy consumption in a wireless network. While the first can easily be optimized, the second is of great importance to the overall network coding architecture: reducing overhearing in a network coded system may reduce its performance. Our article studies the degradation of performance when implementing different levels of energy saving. The method is a simplified version of the S-MAC protocol, which uses a listen/sleep cycle. The energy consumption is reduced by switching each node on and off independently, so that reception is performed only in some discrete time windows. During power-on states a node is able to receive, decode and retransmit data. During power-off states a node is not able to receive any packet, so the packet is lost. This should not be an issue in our wireless network coding scenario, as neighbor nodes can be in different power states and take advantage of all the packets. Denote by T a full power-on/power-off time interval. We define the energy duty cycle as the percentage of time spent in power-on states. For example, a duty cycle of 25% means that the node is on for
the power states have random time offsets, so that to avoid situations when all the nodes are in power off state, see figure 3. figure 3: transmission from node a to neighbors b and c. packets that were missed by b in off state (dotted arrow), are recovered later from neighbor c 5 simulations and results this article studies the effects of energy optimizations on the end-to-end delay in the wireless grid network, using random network coding. based on [11], we developed a wireless network simulator, which uses network coding for transmission. we chose a fixed number of nodes n = 100, that are arranged on a 10 × 10 square area. the source message is made of packets of length m = 32 bytes, equal to the number of coefficients, k = 32. the main simulator functional modules are: source, node and scheduler. the source module is responsible to encode the original message and continuously send differently encoded packets to the other nodes. the other nodes contain logic for packet decoding and rank calculation, but also include a packet re-encoder and sender, for relay. the scheduler is responsible for sequencing the packet transfer in the network. transmission hops are determined based on a physical model (as defined in [14], [15]), where instant noise level and attenuation decide whether a node is in the transmission range or not. for studying energy efficiency algorithms, the scheduler has been enhanced to support on-off node states. the simulator engine is event based, so that each transmission and reception has a different timestamp. the link rate is constant for all transmissions, and is simulated as an inter-sending time interval. the time difference is made more realistic by introducing mac latency, propagation time and a random jitter for all other factors that are not explicitly simulated. the first series of simulations were used to analyze the impact of the distance between nodes on the end-to-end delay (eted). 
We ran simulations for distances from d = 40 m to d = 179 m. For d < 40 m the network is close to a broadcast network (i.e. many more than 50% of the nodes are in the source radio range), so it is out of the scope of the current paper; at d = 180 m all the nodes lose connectivity. Each simulation was repeated three times. Figure 4 shows that for large and medium range coverage, given by d = 80 m and d = 120 m, the node completion time increases almost linearly. At very short range, where connectivity is available only with the closest neighbors, the last set of nodes completes in almost exponential time. Due to the geometry of the network grid, Figure 5 shows two flat regions, for both average and overall ETED.

Figure 4: End-to-end delay measured for each node in the network, for different values of d

Figure 5: End-to-end delay dependency on different values of d

These are explained by the number of nodes that are available in one range, for a given d. To choose an optimum d that maximizes the area covered by the network, one would take the largest value in any of the flat regions, e.g. 80 m, 120 m, 170 m. Now, observing the ratio between the average and the overall ETED between different flat regions, we note that only the third region (120-170 m) has a linear node completeness behavior, while the other ones tend to be logarithmic. In a network where not too many nodes should complete later than the average, choosing d < 130 m would be the right choice. The ratio between the areas covered by the network for the three values of d (80, 120, 170 m) is 0.22:0.5:1. The ratio observed for the average ETED is 0.67:0.82:1, and for the overall ETED it is 0.54:0.67:1. If the average ETED is of interest,
If the overall ETED is important, then the value of d = 120 m offers half of the maximum area but with a 30% reduction of delivery time. In the next stage, for a fixed d = 120 m, we added to the simulation scenario a power cycle with a duty of 50%. We performed simulations for different values of the power on-off time interval, with t ranging from 10 ms to 500 ms. Values lower than 10 ms may not be efficient due to electrical/logical reasons in the device, i.e., the network processing chip switching on and off too fast. Values above 500 ms are of no interest to the current study, as they reduce the number of switches per experiment to fewer than 10. In Figure 6, the results when there is no power cycle are drawn at t = 0, for reference. As expected, the results show better behavior at lower t values, as the nodes have a quicker opportunity to recover lost packets from their neighbors.

Figure 6: End-to-end delay dependency on different values of t

In the next experiment, we studied the effect of different duty cycles, ranging from 10% to 90%. We observed the behavior of the network for three values of t = [0.010, 0.100, 0.500] s. Sometimes, the energy advantage comes from using a duty cycle lower than 50% (i.e., less than half of the energy used). Figures 7 and 8 show that it is possible to preserve low values of ETED even at a duty cycle of 30%. However, for t > 0.100 s, the optimal duty cycle is close to 60%. We performed another set of measurements with fixed t = 0.100 s, for different duty cycles, and for different values of distance d = [80, 120, 170] m. Figures 9 and 10 show that the 80 m and 120 m networks are more sensitive to values lower than 30%, while the 170 m network has a higher threshold of 70%. With network coding, the fact that there are only a few nodes connected to each other does not allow improvement of the energy savings. As soon as the number of connections per node doubles, the energy consumption can be optimized by up to 30%.
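The mechanism behind the recovery in Figure 3 is that neighbors have random phase offsets within the on-off period, so a packet missed by a sleeping node can usually be picked up by another neighbor that is awake at that instant. A minimal sketch of that timing logic (illustrative names, not the simulator's code):

```python
def is_awake(t, period, duty, offset):
    """True if a node with the given random phase offset is powered on at time t."""
    return ((t - offset) % period) < duty * period

def covering_neighbor(t, period, duty, offsets):
    """Index of the first neighbor awake at time t, or None (packet lost for now)."""
    for idx, off in enumerate(offsets):
        if is_awake(t, period, duty, off):
            return idx
    return None
```

For example, with a 50% duty cycle and two neighbors offset by half a period, every instant is covered by at least one of them, so no transmission interval is completely unobserved.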
Figure 7: Average end-to-end delay dependency on different values of the duty cycle, for t = [0.010, 0.100, 0.500] s

Figure 8: Overall end-to-end delay dependency on different values of the duty cycle, for t = [0.010, 0.100, 0.500] s

6 Conclusions

This article presents methods for saving energy in a wireless network, in the case of general multicast transmission. The routing of packets inside the network is not store-and-forward but mix-and-forward, namely network coding. We observed the end-to-end delay inside a wireless grid network that uses network coding, for different area sizes (i.e., different node densities) and for various reception windows for energy saving.

Figure 9: Average end-to-end delay dependency on different values of the duty cycle, for d = [80, 120, 170] m

Figure 10: Overall end-to-end delay dependency on different values of the duty cycle, for d = [80, 120, 170] m

A good tradeoff between area and end-to-end delay is to choose half of the maximal area, while obtaining at least a 30% improvement in speed. Simulations show that energy consumption can be reduced to between 30% and 50%, depending on the duration of the full power-on/power-off cycle. For sparse networks, the energy consumption can be reduced by up to 70%. Further study may reveal important results for other network topologies, other types of transmissions (e.g. unicast), other energy saving schemes, or other realistic features (variable range, transmission power, interference, multiple radio channels, etc.).

Bibliography

[1] R. Ahlswede, N. Cai, S. Y. R. Li, and R. W. Yeung, Network information flow, IEEE Trans. Inform. Theory, vol. 46, no. 4, pp. 1204-1216, July 2000.
[2] S. Y. R. Li, R. W. Yeung, and N. Cai, Linear network coding, IEEE Trans. Inform. Theory, vol. IT-49, no. 2, pp. 371-381, Feb. 2003.
[3] Y. Wu, P. A. Chou, and K.
Jain, A comparison of network coding and tree packing, in Proc. ISIT 2004, Chicago, June 2004.
[4] Y. Wu, P. A. Chou, and S. Y. Kung, Minimum-energy multicast in mobile ad-hoc networks using network coding, IEEE Trans. Communications, vol. 53, no. 11, pp. 1906-1918, Nov. 2005.
[5] Rodica Stoian, L. A. Perisoara, and Radu Stoica, Random network coding for wireless ad-hoc networks, in Proc. of Int. Symp. on Signals, Circuits and Systems (ISSCS 2009), Iasi, Romania, vol. 2, pp. 469-472, July 9-10, 2009.
[6] G. Lu, N. Sadagopan, B. Krishnamachari, and A. Goel, Delay efficient sleep scheduling in wireless sensor networks, in Proc. of IEEE INFOCOM 2005, March 2005.
[7] Ming Xiao, Tor M. Aulin, Energy-efficient network coding for the noisy channel network, ISIT 2006, Seattle, USA, pp. 778-782, July 9-14, 2006.
[8] W. Ye, J. Heidemann, and D. Estrin, An energy-efficient MAC protocol for wireless sensor networks, in Proc. of IEEE INFOCOM, June 2002.
[9] Y. Yang, B. Krishnamachari, and V. K. Prasanna, Energy-latency trade-offs for data gathering in wireless sensor networks, in Proc. of IEEE INFOCOM, March 2004.
[10] J. H. Chang and L. Tassiulas, Energy conserving routing in wireless ad-hoc networks, in Proc. of IEEE INFOCOM, March 2000.
[11] SlimSim simulator, http://cs.anu.edu.au/ aaron/sim.php
[12] J. Widmer, C. Fragouli, and J.-Y. Le Boudec, Low-complexity energy-efficient broadcasting in wireless ad-hoc networks using network coding, First Network Coding Workshop (NetCod), Riva del Garda, Italy, 2005.
[13] C. Fragouli, J. Widmer, J.-Y. Le Boudec, A network coding approach to energy efficient broadcasting: from theory to practice, in IEEE INFOCOM, Barcelona, Spain, Apr. 2006.
[14] P. Gupta and P. R. Kumar, The capacity of wireless networks, IEEE Transactions on Information Theory, 46(2):388-404, March 2000.
[15] Rodica Stoian and A.
Raileanu, How to choose a model for wireless networks, Scientific Bulletin of the "Politehnica" University of Timişoara, Romania, Tom 51(65), Fasc. 2, pp. 109-112, Sept. 2006.

International Journal of Computers, Communications & Control Vol. II (2007), No. 1, pp. 94-102

MAS_UP-UCT: A Multi-Agent System for University Course Timetable Scheduling

Mihaela Oprea

Abstract: Many real-world applications are mapped into combinatorial problems. An example of such a problem is timetable scheduling. In this case, its two basic characteristics are its distributed and dynamic environment. One efficient solution to this problem could be provided by an agent-based approach. A timetable scheduling problem can be modelled as a multi-agent system that provides the final schedule by taking into account all the restrictions. This paper presents preliminary research work that involves the development of a multi-agent system for university course timetable scheduling, named MAS_UP-UCT. We focus on the architecture of the multi-agent system and on the evaluation of the communication process by using interaction diagrams.

Keywords: intelligent agents, multi-agent systems, timetable scheduling

1 Introduction

In the last decade, several artificial intelligence (AI) technologies and methods have been applied in the educational domain, at high school or university level. Most of the applications that use AI solve tutoring/teaching and/or examination tasks, while fewer of them also try to solve the administrative tasks (e.g. course timetabling, examination timetabling, student attendance control, student registration) related to an educational institution. A recently developed educational system that also models such tasks is the e-Class personalized prototype system presented in [10].
This system is an extension of the widely available open source learning content management system e-Class, which has a component named School Administration that provides the tools handling timetables, financial matters, personal student data, student registration, etc. In this paper we focus on the timetabling problem and discuss MAS_UP-UCT, an agent-based system that we have designed, which provides solutions to university course timetabling. The general task of solving timetable scheduling problems is iterative and time consuming. In real-world applications, the participants in the timetable scheduling have conflicting preferences, which makes the search for an optimal solution an NP-hard problem. In order to solve the problem it is necessary to find a compromise between all the professors' requirements, which are usually conflicting (e.g. day, time). The constraints are related to the availability, timetabling and preferences of each professor, to room availability, to the number of students, and to the curricula. In order to solve this problem for the particular case of university course timetable scheduling, we have adopted the agent-based approach. Multi-agent systems (MAS) are concerned with coordinating behavior among a collection of autonomous intelligent agents (e.g. software agents) that work in an environment. Sometimes, software agents are designed to reconcile their own interests with the constraints implied by other agents. One type of software agent is the expert assistant, which enables us to automate certain manual tasks and to work more efficiently. Expert assistant is a term given to an intelligent software agent that performs certain tasks on our behalf [15], [14]. For example, our daily organiser is such an assistant. The complexity of multi-agent systems is generally higher than that of conventional software systems, and their success relies on properly designed and well tested subsystems.
Also, in the particular case of timetable scheduling, the MAS could find an optimal or a sub-optimal solution using mainly inter-agent communication (with minimal message passing). This paper presents the architecture of a multi-agent system, MAS_UP-UCT, that is under development and has as its main purpose the modelling of university course timetable scheduling. We shall describe the architecture of the multi-agent system, focusing on the mapping of course timetable scheduling in terms of intelligent agents, and, finally, we shall make a preliminary evaluation of the multi-agent system.

Copyright © 2006-2007 by CCC Publications. Selected paper from ICVL 2006.

2 University Course Timetabling Problem

The scheduling problem can be defined as the problem of finding the optimal sequence for executing a finite set of operations (tasks or jobs) under a certain set of constraints that must be satisfied. A scheduler usually attempts to maximize the utilization of individuals and/or resources and to minimize the time required to complete the entire process being scheduled. There exist a number of different types of scheduling problems, such as job shop problems, sports league game scheduling, timetabling, the service timetable problem for transportation networks, etc. Many scheduling problems share some features with the timetabling problem. A survey of automated timetabling is presented in [12]. In the educational context, scheduling is the problem of assigning a set of events (courses and/or exams) to limited lengths of periods and to rooms, subject to certain conditions. There are two types of academic schedules: the course schedule and the examination schedule. For both types of problems the resources include students, staff, rooms, courses, time and equipment. Several AI-based educational scheduling systems have been reported in the literature (see e.g.
the system presented in [13] for examination scheduling in universities). The formulation of the university course timetabling problem (as given in [4] and [12]) is the following:

Input data: q courses K_1, ..., K_q; for each i, course K_i consists of k_i lectures; r curricula S_1, ..., S_r, which are groups of courses that have common students; p, the number of periods; l_k, the maximum number of lectures that can be scheduled at period k (i.e. the number of rooms available at period k).

Goal: find y_ik (i = 1, ..., q; k = 1, ..., p), such that

(1) Σ_{k=1}^{p} y_ik = k_i, for i = 1, ..., q
(2) Σ_{i=1}^{q} y_ik ≤ l_k, for k = 1, ..., p
(3) Σ_{i ∈ S_l} y_ik ≤ 1, for l = 1, ..., r and k = 1, ..., p
(4) y_ik = 0 or 1

The constraints are the following: each course is composed of the correct number of lectures (relation (1)); at each period there are not more lectures than rooms (relation (2)); conflicting lectures are not scheduled at the same period (relation (3)).

The objective function: max Σ_{i=1}^{q} Σ_{k=1}^{p} d_ik y_ik, where d_ik is the desirability of having a lecture of course K_i at period k.

Different solutions, manual or automated, have been proposed in the literature. Some automated solutions are given by tabu search [3], constraint satisfaction [12], genetic algorithms [2], logic programming [5], and combinations of different methods [9].

3 The Architecture of the MAS_UP-UCT System

We have designed the architecture of a multi-agent system, MAS_UP-UCT, that tries to solve optimally the university course timetable scheduling. Figure 1 shows the architecture of the multi-agent scheduling system, while Figure 2 presents the general overview of the university course timetabling.

Figure 1: The architecture of MAS_UP-UCT

We briefly describe how the manual university course timetabling is usually done. Suppose the university includes five faculties, each of them having a number of specializations. The timetabling for each specialization is done by a person who is dedicated to this job, whom we shall name the specialization course scheduler.
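The integer programming formulation given in Section 2 can be exercised directly in code. The sketch below is a toy feasibility and objective checker for constraints (1)-(4), not the system described in the paper; the variable names mirror the formulation (y[i][k] is the 0/1 decision, lectures[i] is k_i, curricula lists the sets S_l, room_cap[k] is l_k, d[i][k] is the desirability).

```python
def is_feasible(y, lectures, curricula, room_cap):
    """Check constraints (1)-(3) for a 0/1 schedule y (courses x periods)."""
    q, p = len(y), len(y[0])
    # (1) each course gets exactly its number of lectures
    if any(sum(y[i]) != lectures[i] for i in range(q)):
        return False
    # (2) never more simultaneous lectures than rooms in a period
    if any(sum(y[i][k] for i in range(q)) > room_cap[k] for k in range(p)):
        return False
    # (3) courses sharing students never clash in the same period
    for group in curricula:
        if any(sum(y[i][k] for i in group) > 1 for k in range(p)):
            return False
    return True

def desirability(y, d):
    """Objective value: total desirability of the schedule."""
    return sum(d[i][k] * y[i][k]
               for i in range(len(y)) for k in range(len(y[0])))
```

For a real instance one would hand this model to an ILP solver or one of the metaheuristics cited above, but the checker makes the meaning of each constraint concrete.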
This person will provide five, four or three timetables, corresponding to the specialization's number of study years. The specialization course scheduler will receive a list of options from each professor that teaches a course to a certain year of study at that specialization. The list of options will include the professor's options ordered by their desirability, and will also include the list of impossible timetable slots. After course timetable scheduling is done at every faculty, the activity of room allocation at university level is started. Thus, the university course timetable scheduling problem is divided into two subproblems: 1. faculty course timetable scheduling (which involves only the allocation of course day and time), and 2. university course room allocation (which involves the allocation of rooms for courses). When all courses have allocated time intervals (day and time) and rooms, the university course timetable scheduling has ended with success. Whenever a problem occurs, a communication process is started, which will mainly involve a negotiation activity. In most Romanian universities, the university course timetable scheduling is done either manually or partially automatically. In order to improve the efficiency of the whole activity, we have mapped the course timetabling in terms of autonomous intelligent agents. Each faculty has a scheduler multi-agent system (MAS-Fi), which has to schedule the courses of that faculty. The main scheduler agent (the university scheduler agent), which will allocate the rooms, is the MScheduler agent. Because most professors teach courses at different faculties, every faculty scheduler agent has to communicate with the other scheduler agents, in order to solve some critical situations that may arise. The negotiation strategy used by the agents is similar to that described in [8].
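The two-subproblem decomposition just described (faculty-level day-and-time scheduling, then university-level room allocation) can be sketched with a greedy second phase; the courses left without a room are exactly the conflicts that trigger the negotiation process the paper describes. The names and the greedy policy here are illustrative, not the paper's implementation.

```python
def allocate_rooms(slot_assignments, rooms):
    """Phase 2: greedily assign rooms within each (day, time) slot.
    slot_assignments maps course -> slot (the output of phase 1);
    courses left without a room are returned as conflicts to negotiate."""
    roomed, conflicts = {}, []
    by_slot = {}
    for course, slot in slot_assignments.items():
        by_slot.setdefault(slot, []).append(course)
    for slot, courses in by_slot.items():
        for course, room in zip(courses, rooms):
            roomed[course] = (slot, room)
        conflicts.extend(courses[len(rooms):])  # more courses than rooms
    return roomed, conflicts
```

In the multi-agent setting, the MScheduler agent would hand each entry of `conflicts` back to the faculty scheduler agents instead of failing outright.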
Figure 3 presents the MAS at faculty level, which includes a faculty scheduler agent and expert assistants (EA) for each specialization of that faculty. For each specialization an expert assistant is developed, which has to do all the activities connected with that specialization (e.g. keeping track of students, course curricula, etc.). An important activity that should be done by an expert assistant is course timetable scheduling (day and time). A lot of constraints should be satisfied in order to solve the course timetabling.

Figure 2: The general overview of the university course timetabling task

Figure 3: MAS at faculty level

For example, one constraint is that all courses of a specialization are taught for all the groups of that specialization, and this constraint may become more severe in the case of courses that are taught for more than one specialization (this case appears for specializations that have courses with the same curricula). The faculty scheduler agents, who act autonomously, can schedule the university course timetable on a professor's individual behalf. Ideally, all professors' preferences should be accepted. Unfortunately, we cannot reach an agreement among agents taking into consideration all professors' preferences. In course timetable scheduling, agents must quantify the professors' subjective preferences. In the worst cases (when a classical negotiation yields no results), we can reach a collective agreement by using a persuasion protocol (similar to that presented in [7]). The persuasion protocol is based on the rationality of agents. Agents should satisfy some criteria of rationality (e.g. maintaining logical consistency). The advantage is that negotiation using a persuasion protocol can reach more agreements than existing negotiation protocols, and it can improve the rate of agreement in course timetable scheduling.
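A minimal sketch of the negotiation-with-persuasion-fallback idea, under our own simplifying assumption (not the paper's) that each professor's preferences are a ranked list of slots: agents reveal proposals round by round, and if no agreement emerges within the round limit, a persuasion step imposes the slot with the best combined rank.

```python
def combined_rank(slot, prefs_a, prefs_b):
    """Sum of the slot's positions in both preference lists (missing = worst)."""
    ra = prefs_a.index(slot) if slot in prefs_a else len(prefs_a)
    rb = prefs_b.index(slot) if slot in prefs_b else len(prefs_b)
    return ra + rb

def negotiate(prefs_a, prefs_b, max_rounds=3):
    """Round r reveals each agent's r+1 top choices; the first slot both have
    revealed wins. Otherwise a persuasion step imposes the jointly best slot."""
    for r in range(max_rounds):
        revealed_a, revealed_b = prefs_a[:r + 1], prefs_b[:r + 1]
        common = [s for s in revealed_a if s in revealed_b]
        if common:
            return common[0], "agreement"
    slots = list(dict.fromkeys(prefs_a + prefs_b))  # deduplicate, keep order
    return min(slots, key=lambda s: combined_rank(s, prefs_a, prefs_b)), "persuasion"
```

The persuasion step mirrors the paper's rationality criterion in a crude way: the imposed slot is the one a rational pair of agents loses least by accepting.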
The analysis and design phases of the MAS_UP-UCT development were done by using the Gaia v.2 methodology [1]. In this methodology, during the design step three models are built: the agent model, the service model, and the acquaintance model. The agent model specifies the types of agents that compose the system. Basically, the assignment of roles to agent types creates the agent model. The service model specifies the services that have to be implemented by the agent types. In Gaia, a service is a coherent block of functionality, neutral with respect to implementation details. The acquaintance model shows the communication links between agent types. A model of the environment is also built. Summarizing, four types of agents are used by our system: the main scheduler agent (MSA), the faculty scheduler agent (FSA), the expert assistant agent (EAA), and the personal agent (PA). Figure 4 shows the roles and responsibilities of each type of agent that composes the system MAS_UP-UCT.

Figure 4: Roles for the agents that compose the system MAS_UP-UCT

We briefly discuss two critical situations that may arise during course timetabling:

1) At faculty course timetabling: a day and time timetable conflict (two or more professors' options are identical). Solution: start a negotiation process between the expert assistant of that specialization and the professors involved (or their personal agents). A message is sent by the specialization expert assistant to all those professors that are involved in the conflict, and it will wait for a solution of the negotiation. If it receives an answer it will do a rescheduling. If it receives no solution, it will start a persuasion process of negotiation, suggesting a solution.

2) At university course timetabling: no room is available for a certain day and time course. In this case the MScheduler agent will start a negotiation process between the faculty scheduler agents that are involved in the conflict, by giving some options.
Each faculty scheduler involved in the conflict will pass the message to the corresponding expert assistants or, in some cases, will continue to pass the message to the professors' personal agents, who will then negotiate directly. If after this negotiation no solution is found (e.g. some courses cannot be moved to another module or day), the main scheduler agent will start a persuasion dialog between the faculty agents that are in conflict, which in turn will transfer the problem to the lower level.

4 Evaluation of the Multi-Agent System

As an evaluation method for our MAS we have chosen the interaction diagram method [11]. An interaction diagram is a graph showing the processing of each agent symbolically as one or more vertical bars, and the messaging between agents as horizontal or oblique arrows between agents (from sender to receiver), decorated with message indications. Figure 5 presents an example of an interaction diagram, which illustrates a negotiation process at faculty level between two expert assistants (EAi and EAj). Figure 6 shows the interaction diagram in the case of a critical situation.

Figure 5: Example of interaction diagram

In order to evaluate the multi-agent system we can use interaction diagrams to design the communication process between agents (expert assistants, personal agents, etc.) and to verify that the system executes the correct communication sequences. We have used message flow fragmentation in order to carry out an analysis of the communication process. The direct sequence and a part of the inverse sequence of the message flow fragmentation that corresponds to the negotiation process shown in Figure 6 are given below.
beg(MAS), beg(FSA_l), beg(FSA_k), beg(EA_lt), beg(EA_lr), beg(EA_ki), beg(EA_kj), snd(MAS, m1), split(m1, m1k, m1l), rcv(FSA_k, m1k), rcv(FSA_l, m1l), split(m1k, m_ki, m_kj), split(m1l, m_lt, m_lr), rcv(EA_ki, m_ki), rcv(EA_kj, m_kj), rcv(EA_lt, m_lt), rcv(EA_lr, m_lr), snd(EA_ki, m_ki^-1), snd(EA_kj, m_kj^-1), join(m_ki^-1, m_kj^-1, m1k^-1), ..., end(MAS), end(FSA_l), end(FSA_k), end(EA_lt), end(EA_lr), end(EA_ki), end(EA_kj)

The inter-agent communication is done by using the agent communication language FIPA ACL. Figure 7 shows an example of such a message, ma12 ea-ime-ac, exchanged during a negotiation. Let us consider a course timetabling conflict at the level of a faculty. This conflict consists in the situation of identical day and time options for two professors (PA1, PA2) that teach different courses at the same specialization (computer science). This situation is described in Figure 8. The information exchanged during conflict resolution can be modeled with protocol diagrams using the AUML notation [6]. Figure 8 shows a sample negotiation protocol for resolving a day and time timetable conflict. As can be seen, the expert assistant of the computer science specialization (EAcs) will inform the two personal agents corresponding to the two professors about the conflict. After this message is sent to both personal agents, a negotiation protocol starts between them that involves a sequence of proposals and counter-proposals until a solution is accepted by the two agents. At the end of the negotiation process, PA1 will inform the expert assistant agent about the solution found.

Figure 6: An example of negotiation in a critical situation

Figure 7: Example of a FIPA ACL message

Figure 8: Negotiation protocol for a day and time conflict in course timetabling
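The correctness check described in Section 4, namely that the system executes the expected communication sequences, can be automated over a recorded trace of messages. The sketch below uses illustrative agent names and is not the paper's tooling; it checks that the expected (sender, receiver, performative) messages appear in order, allowing extra messages in between.

```python
def verify_trace(trace, expected):
    """True if `expected` is an in-order subsequence of the recorded `trace`.
    Both are lists of (sender, receiver, performative) tuples."""
    it = iter(trace)  # a single shared iterator enforces ordering
    return all(any(msg == want for msg in it) for want in expected)

# Example trace for the day-and-time conflict of Figure 8 (names illustrative)
sample_trace = [
    ("EAcs", "PA1", "inform"),   # conflict announced
    ("PA1", "PA2", "propose"),   # proposal / counter-proposal phase
    ("PA2", "PA1", "accept"),
    ("PA1", "EAcs", "inform"),   # solution reported back
]
```

Running such a check against the interaction diagrams of Figures 5 and 6 gives a mechanical version of the manual message-flow fragmentation analysis.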
5 Conclusion

The paper presented the current state of a research work that involves the development of a multi-agent system for university course timetable scheduling. The purpose of our work was to analyse the benefits of using an agent-based approach for university course timetable scheduling, which involves a lot of communication, cooperation and negotiation processes. We have described the architecture of a multi-agent system for university course timetable scheduling, MAS_UP-UCT, and briefly discussed the evaluation of the multi-agent system. We can conclude that the main benefits of the agent-based approach adopted for university course timetabling are given by the possibility of negotiation between agents as a solution to the conflicts that may arise, and by the analysis of the flow of messages exchanged between agents with the interaction diagrams.

References

[1] L. Cernuzzi, T. Juan, L. Sterling, F. Zambonelli, The Gaia methodology: basic concepts and extensions, in Methodologies and Software Engineering for Agent Systems, eds. Bergenti, F., Gleizes, M.-P., Zambonelli, F., Kluwer Academic Publishers, pp. 69-88, 2004.
[2] D. Corne, P. Ross, H.-L. Fang, Fast practical evolutionary timetabling, Lecture Notes in Computer Science, LNCS 865, pp. 251-263, 1994.
[3] D. Costa, A tabu search algorithm for computing an operational timetable, European Journal of Operational Research, vol. 76, pp. 98-110, 1994.
[4] D. de Werra, An introduction to timetabling, European Journal of Operational Research, vol. 19, pp. 151-162, 1985.
[5] R. Fahrion, G. Dollanski, Construction of university faculty timetables using logic programming techniques, Discrete Applied Mathematics, vol. 35, no. 3, pp. 221-236, 1992.
[6] M.-F. Huget, J. Odell, B. Bauer, The AUML approach, in Methodologies and Software Engineering for Agent Systems, eds. Bergenti, F., Gleizes, M.-P., Zambonelli, F., Kluwer Academic Publishers, pp. 237-257, 2004.
[7] T. Ito, T.
Shintani, An agenda-scheduling system based on persuasion among agents, Technical Report, Nagoya Institute of Technology, 1997.
[8] M. Oprea, The use of adaptive negotiation by a shopping agent in agent-mediated electronic commerce, Lecture Notes in Artificial Intelligence, LNAI 2691, Springer-Verlag, Berlin Heidelberg, pp. 594-605, 2003.
[9] G. Picard, C. Bernon, M.-P. Gleizes, Cooperative agent model within ADELFE framework: an application to a timetabling problem, Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multi Agent Systems, New York, USA, pp. 1506-1507, 2004.
[10] E. G. Prodromou, N. Avouris, e-Class personalized: design and evaluation of an adaptive learning content management system, in Artificial Intelligence Applications and Innovations, eds. Maglogiannis, I., Karpouzis, K., Bramer, M., Springer, pp. 409-416, 2006.
[11] R. Ronnquist, C. K. Low, Analysing expert assistants through interaction diagrams, Proceedings of Autonomous Agents 97, ACM Press, pp. 500-501, 1997.
[12] A. Schaerf, A survey of automated timetabling, Technical Report CS-R9567, Centrum voor Wiskunde en Informatica, 1995.
[13] C. C. Wei, A. Lim, Automated examination scheduling problem, Proceedings of the IASTED International Conference Applied Informatics, Innsbruck, ACTA Press, pp. 93-96, 2002.
[14] G. Weiss, Multiagent Systems, The MIT Press, Cambridge, Massachusetts, 1999.
[15] M. Wooldridge, N. R. Jennings, Intelligent agents: theory and practice, The Knowledge Engineering Review, vol. 10, no. 2, pp. 115-152, 1995.
Mihaela Oprea
University Petroleum-Gas of Ploiesti
Department of Informatics
Address: B-dul Bucuresti 39, 100680 Ploiesti, Romania
E-mail: mihaela@upg-ploiesti.ro
Received: November 8, 2006

Editor's note about the author: Mihaela Oprea (born on February 20, 1967) graduated from the Faculty of Automatics and Computer Science at the University Politehnica Bucharest in 1990, and got her PhD at the Department of Automatics and Computer Science of the University Petroleum-Gas of Ploiesti in 1996. Currently, she is a full professor at the Department of Informatics of the University Petroleum-Gas of Ploiesti. Her main research interests include pattern recognition algorithms, machine learning, knowledge modelling, and applications of multi-agent systems and artificial intelligence techniques in various domains such as environmental protection, engineering, and education. She has published more than 70 research papers in the area of artificial intelligence in international journals and in the proceedings of prestigious international conferences, printed by important publishers (Springer, Kluwer Academic, IOS Press, IEEE Computer Society Press). Since 1995 she has visited, as a research visitor, universities and artificial intelligence research institutes in the UK, Austria, Spain, Greece, Sweden, the Czech Republic, Hungary, and France. She is a member of several professional associations and of the international programme committees of several conferences held in Europe, the USA, Canada, and South America, also participating in some of them as a tutorial presenter.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 1, pp. 20-41

A Software System Development Life Cycle Model for Improved Stakeholders' Communication and Collaboration

S. Cohen, D. Dori, U. de Haan

Shalom Cohen, Uzi de Haan
Technion, Israel Institute of Technology, Haifa, Israel
E-mail: {shalom1,uzid}@tx.technion.ac.il

Dov Dori
1. Technion, Israel Institute of Technology, Haifa, Israel
2.
Massachusetts Institute of Technology, Cambridge, MA, USA
E-mail: dori@ie.technion.ac.il

Abstract: Software vendors and entrepreneurs, who try to introduce an innovative software product to a specific organization or an entire market, enter a long and tedious process. During this process, the market and various organizations evaluate the product from different perspectives, such as software robustness, manufacturer reliability, and corporate need for the product. The vendors and entrepreneurs engaged in this process encounter decision crossroads for which no relevant guidance exists in the literature. The research closely monitored the processes associated with the introduction and assimilation of an innovative off-the-shelf (OTS) software product into five different organizations in different vertical market segments. Observations were carried out to assess organizational and marketing processes and to document and analyze what the software product undergoes before it is accepted for acquisition or full implementation within the organization. The research outcomes offer a unified, collaborative multi-tier system development life cycle (SDLC) framework and methodology for packaged OTS software products that greatly improves communication and collaboration among the stakeholders. Each tier addresses a different force or stakeholder involved in the software market: vendor, customer, consultants and integrators. All stakeholders refer to the same time-line; thus, the tasks of the various stakeholders are streamlined. Adherence to the unified time-line brings about an increased amount of stakeholder interaction, communication and collaboration.
Newly found tasks that improve communication and collaboration among stakeholders include (1) offering the OTS software product together with personnel as a bundle, (2) an improvisation-intensive iterative task of weaving potential customers' requirements into the prototype, and (3) a third sale milestone, representing the successful diffusion of the product. The significance of this interdisciplinary research stems from its unique position at a crossroad between software engineering, marketing, and business administration, which has not yet been sufficiently explored or cultivated.

Keywords: collaboration, system development life cycle model, stakeholders.

Copyright © 2006-2010 by CCC Publications

1 Introduction

Two major trends dominate the software development world today. The first is the shift of organizations from fulfilling their own software requirements in-house to buying them on the market, either as an off-the-shelf packaged software product or from a company tailoring a specific solution [1]. The second trend is the shift from developing tailor-made software to purchasing packaged software from vendors, either in stores or directly from the vendors [2]. Here we assume that the acquisition of packaged software is done by an organizational consumer.1 When relating to a packaged software product, which may be seen as a system in its own right, one should make a clear distinction between a software product and an information system (IS) [3]. An IS is made up of a number of software products or modules put together [1]. This research examines an off-the-shelf (OTS) packaged software product as a system2 which goes through the various stages of the system development life cycle (SDLC).3
Many software development processes and models use stages outlined in the SDLC.[4] We relate to the SDLC not only in its traditional "waterfall" sense, but also to other models outlining the stages in software development. Since these models, including the spiral and rapid application development (RAD) models, are not as broadly known as the "waterfall" model and are less useful for explaining the market effects on software development delineated hereafter [1], we focus on the waterfall model as a reference. The lifecycle of an information system includes the various phases that a software product goes through, starting with its conception all the way to the stage when it is no longer available for use.[5] The software lifecycle, depicted in Figure 1, typically includes the following phases: requirements, analysis, design, construction (or coding), testing (validation), installation, operation, maintenance, and the less emphasized retirement.[6]

Figure 1: The traditional phases of the system development life cycle model

These basic phases have also been adopted for IS acquisition purposes. Although the names of the phases were changed where appropriate, the basic structure and timeline have been kept. The phases of IS acquisition, shown in Figure 2, are project justification, financial evaluation of the project, preparations for acquisition, request for proposals, vendor evaluation, contract negotiations, and implementation and maintenance.[7]

Figure 2: The SDLC model adapted to the acquisition process of information systems

The literature research, summarized below, indicates that no significant attempt has been made to extend the SDLC model to other situations encountered by many software vendors and software-developing entrepreneurs.[2] Software vendors and entrepreneurs who try to introduce an innovative software product to a specific organization or an entire market enter a long and tedious process.
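The two phase sequences above are simple ordered progressions along one timeline. As a minimal illustrative sketch (the phase names are taken from the text; the `next_phase` helper is our own illustration, not part of any cited model), they can be encoded and queried like this:

```python
# Illustrative sketch: the development and acquisition phase sequences
# named in the text, modeled as ordered tuples.
DEVELOPMENT_PHASES = (
    "requirements", "analysis", "design", "construction",
    "testing", "installation", "operation", "maintenance", "retirement",
)
ACQUISITION_PHASES = (
    "project justification", "financial evaluation",
    "preparations for acquisition", "request for proposals",
    "vendor evaluation", "contract negotiations",
    "implementation and maintenance",
)

def next_phase(current, phases=DEVELOPMENT_PHASES):
    """Return the phase following `current`, or None at end of life."""
    i = phases.index(current)
    return phases[i + 1] if i + 1 < len(phases) else None

print(next_phase("testing"))                                 # installation
print(next_phase("vendor evaluation", ACQUISITION_PHASES))   # contract negotiations
```

Keeping both sequences as explicit tuples makes the shared-timeline idea concrete: the acquisition phases were renamed, but the ordered, one-directional structure of the waterfall reference model is preserved.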
During this process, the market and various organizations evaluate the product from different perspectives, such as software robustness, manufacturer reliability, and corporate need for the product. The vendors and entrepreneurs engaged in this process encounter decision crossroads for which no relevant guidance exists in the literature. This lack of guidance is somewhat surprising, since information systems development is best understood as a market phenomenon. It is a perspective which highlights how software is developed, who performs the development, who sells the related products, and how they are introduced to users.[1]

¹ The scope of the research is limited to describing the organizational consumer and not the private home user, which is a discussion in its own right and differs in many ways from that outlined hereafter.
² "The system," "OTS software product," and "packaged software product" will be used from now on interchangeably.
³ The acronym SDLC will denote here the system development lifecycle model as it relates to software products. Its implementation with regard to OTS software products will be discussed here.

S. Cohen, D. Dori, U. de Haan

2 Research Goal and Objectives

The goal of this research is to develop and evaluate a collaborative multi-tier lifecycle development model for packaged off-the-shelf (OTS) software products. The proposed model accounts for market and organizational factors and the way they are woven into the traditional phases of software development. To this end, the research has monitored, outlined, characterized, defined, and mapped the specific phases which OTS software products typically go through. The resulting comprehensive model relates to the development, marketing, assimilation, and other organizational aspects of the OTS software product. The research has identified and defined new, modifiable software lifecycle processes, the adoption of which might benefit various stakeholders under various marketing conditions.
Our hope is that the resulting model will fulfill this aim by creating a task-based learning community, that is, a group of people organized around a task (the stakeholders) who collaborate for a specified period of time to produce a product.[8] Here, we attempt to create a unified, exhaustive SDLC framework on one timeline with a number of tiers, creating a new collaborative multi-tier system development life cycle methodology. Each tier addresses a different force or stakeholder involved in the software market, such as producers, consumers, consultants, and integrators.[1] The basic time frame of the SDLC, especially the beginning (inception) and end (implementation and maintenance), is kept. The various milestones along the SDLC time line indicate an appropriate task for each tier and an explanation of that task. Tasks on the same vertical axis are to be performed concurrently and collaboratively. Figure 3 depicts a possible scheme of the proposed collaborative multi-tier market- and organization-oriented SDLC model to be fleshed out as a result of the field study outcomes. The list of stakeholders stacked in Figure 3 is by no means exhaustive.

Figure 3: A possible scheme of the proposed collaborative multi-tier market- and organization-oriented SDLC model to be fleshed out as a result of the experiment outcomes

The main novelty of this research is that it is the first field-based study aimed at the establishment of a collaborative multi-tier SDLC model and a methodology based upon it. In addition, in most IS field studies, researchers have access to a limited amount of evidence and observations in participating organizations. In contrast, this research takes advantage of the fact that the software that was examined has been developed at the Technion and was fortunately being positioned at the point of time required for this research.
Due to the special ties between the Technion and the software vendor, this study has had access to evidence and observations that are normally out of reach for researchers. The validity of these unique findings was tested against the more abridged findings of the control case studies.

3 Literature Review

The only academic development model incorporating any type of market effects is that of Carmel and Becker.[9] They developed a process model for packaged software development which was partly empirical. Carmel and Becker [9] point to a few market-related actions that must be performed in some, but not all, of the described stages. Actions like "assessing product differentiation considerations" are attached to the "initial screening stage" of the "requirements loop" with no explanation of how they can be achieved. In summary, Carmel and Becker [9] were the first to attempt a complete process model which adds marketing tasks. Their model, however, was only partially based on empirical findings, and instead of having the market define software market needs and SDLC phases, they suggested them a priori. Moreover, as Cusumano et al.[10] noted, their reliance on and justification of a pivotal "freeze specification" stage is problematic in a highly volatile market.[10] As Carmel [2] noted, no major study had been conducted on market introductory effects on packaged software innovations before 1995.[2] Nevertheless, the idea of introducing a market-based perspective into information systems development was introduced later on by Sawyer.[1] He juxtaposed a market-oriented approach with a simplification of the traditional "waterfall" model. At the basis of his idea is a separation of the traditional SDLC model (from development to user introduction) into two separate parallel models, one for the software developer and the other for the software consumer. In addition, Sawyer had several interesting assertions as to the growing importance of additional stakeholders in the development process.
For example:

• Third parties (consultants, vendor representatives, etc.) have an increasing role in the initial stages of the SDLC. Consultants/integrators are now also part of the information system development (ISD) process, as they enable and mediate the software market. This contributes to widening the chasm between users and vendors. This chasm is bridged only by indirect links between customers and developers via intermediaries or customer surrogates.[11]
• System installation requires a third party in charge of installing the product, customization, and training.
• The development process is of smaller importance to the consumer than the final product.

Although most of the assertions in Sawyer's model may make sense, they are in no way based on empirical evidence and do not have a direct connection to an SDLC model currently in use. His model lacks due reference to the producer's side, an aspect which this research has elaborated on. In order to cover a large number of organizational and market-related SDLC influencing factors, we searched for academic and professional models in seven domains. We began by looking at the above-mentioned few existing market-based IS/software development models to see how an innovative OTS software product is produced in the market. We then continued by looking at works on software cycles and structured development studies to uncover the new OTS software manufacturing methods. Leaving the IS domain, we followed Moore's technology diffusion theory to look for models on technology adoption, innovation introduction, and marketing diffusion theory which may relevantly describe the diffusion process of an OTS software product too.[12] We then reviewed research in the cross domain of organizational decision making on IS/software-related issues to learn how the common IS/software-related decision-making processes in various organizations are performed.
The maturity of the software product, as well as that of the organization, is a matter of much interest to industrial engineers and business administrators; it influences entrepreneurial vendors tremendously and is therefore reviewed here too. The relatively young academic field of entrepreneurship was searched for adequate models and research on innovation, innovation exploration, and entrepreneurship in the software market. Finally, we surveyed the market for best practices and existing methodologies for OTS software development by entrepreneurial vendors. Table 1 summarizes the main studies related to this research topic under the various categories. From the above literature review we learned about the possible variables and added them to the examined research model as described in the following section.

Table 1. Summary of main studies on the SDLC and related subjects

Aspect | Article | Empirical
Market-based IS/software development | Carmel and Becker [9] | very partial
 | Keil and Carmel [11] | yes
 | Sawyer [1] | no
 | Sing and Kotze [34] | partial
Software cycle and structured development | Cusumano et al. [10] | yes
 | Cusumano [35] | no
 | Carmel [2] | yes
 | Clark and Wheelwright [36][37] | partial
 | Wheelwright and Clark [38] | partial
 | Boehm and Bose [39] | yes
 | Fine [40] | no
 | Avison and Fitzgerald [4] | no
 | Ebert [42] | yes
Technology/innovation introduction/diffusion theory marketing⁴ | Mustonen-Ollila and Lyytinen [43] | yes
 | Lucas and Spitler [44] | yes
 | Davis [45] | yes
 | Moore and Benbasat [46] | partial
 | Brancheau and Wetherbe [47] | yes
 | Cooper and Zmud [48] | yes
 | Fichman and Kemerer [25] | yes
IS-related decision-making and software acquisition processes | Verville and Halingten [49] | yes
 | Nelson et al. [50] | yes
 | Iivari and Ervasti [51] | yes
Software product and organization maturity | Paulk [52] | no
 | Nordman [53] | report
 | Lee and O'Connor [54] | no
 | Montaguti et al. [55] | no
Hi-tech entrepreneurship | Shane [56] | partial
 | Shane [57] | no
 | Murray and Tripsas [58] | yes
 | Baker et al. [23] | yes
 | Vera and Crossan [27] | yes
Best practices | NIH matrix [59] | best practice
 | Agile [61] | best practice

⁴ For the most part, the writings in this discipline have not distinguished between the more general definition of an IT product and a specific IS/software-like product.

4 Methods and Experiments

In this section, we first provide a short explanation of the research method used (a), and then we describe the case study sites selected (b). Data collection efforts are described in (c), and finally, in (d), we cover the preliminary research model.

a. Case study methodology

Yin identified three main types of case studies based on the purpose for which they are used [13]: (1) explanatory: a case study intending to explain the causal links in real-life interventions that are too complex for survey or experimental strategies; (2) descriptive: a case study that emphasizes the formation of hypotheses of cause-effect relationships, where a descriptive theory must cover the depth and scope of the case under study [14]; (3) exploratory: a case study in which the fieldwork and data collection may be undertaken prior to the definition of the research questions and hypotheses. The framework of the study must be created ahead of time to maximize what can be learned, knowing that time is limited. The selected cases should be easy and should include willing subjects.[14] The research strategy utilizes a combination of exploratory and descriptive case studies. The study does not explain an existing theory, so it cannot be categorized as explanatory. Rather, it tries to describe and explore emerging software development and marketing processes. Fieldwork and data collection were done prior to the exact definition of the research questions and the generation of hypotheses.[15] The research includes multiple exploratory/descriptive case studies, using replication logic. Replication logic is a logic by which case studies are selected to create a multiple-case design.
Cases are selected so that they can produce either typical, negative, or disconfirming results, or exceptional/discrepant results. This form of case selection is also known as theoretical sampling of cases, as opposed to the normal sampling logic used in quantitative methods. The outcomes of this design are improved theory, generalization ability, and cross-case analysis. The latter is achieved by the use of two additional case studies which serve as control or baseline studies. Each case study is treated as an independent experiment, and the entire study is composed of and based upon a sequence of multiple experiments.[16] When a case study strategy is agreed upon, it permits both qualitative and quantitative sources of evidence to be collected and analyzed.[15] The collection and analysis of these two complementary forms of evidence has enabled triangulation. Various methods of data collection and fact retrieval were utilized, as described in section (c) of this chapter. The field-study strategy, by which this research obtained insights into the processes that innovative software products go through, is an empirical study with multiple case studies. The OPCAT software was introduced into five organizations operating in mostly different vertical market sectors, so that the lessons learned from them cut across sectors. The list of sectors included banking, military, avionics, software, and banking software. One additional off-the-shelf software product (Product B) was introduced to a telecom company, and the final product (Product C) was introduced to a software organization. The number of case studies chosen (7) reflects a practical balance between the need for sufficient ground for generalization of the findings and the research time and capacity constraints. The number corresponds to the recommended range of 4 to 10 cases for theory-building purposes.[15]

b. Case study sites

As the choice of organizations in which to perform the case studies is not a pure random sample, we tried to compensate for this by using theoretical sampling [13] designed to cover a broad spectrum of sectors, company sizes, and locations, as depicted in Table 2. The customer organizations chosen for the case study sites were:

Org.1-E is a large airborne avionics systems manufacturer that employs over 3,000 employees.

Table 2: Profile of firms in the exploratory/descriptive multi-firm study

Company | Sector | Size (employees) | Location | Introduced product
Org.4-B | banking | >10,000 | Israel | OPCAT
Org.1-E | military/avionics | >10,000 | Israel | OPCAT
Org.2-EL | military | <1,500 | Israel | OPCAT
Org.5-Q | software/cellular | ~100 | Israel | OPCAT
Org.3-S | software/banking | ~1,000 | Singapore | OPCAT
Org.6-C | telecom | >1,000 | Israel | non-OPCAT
Org.7-BM | software | >7,000 | Israel | non-OPCAT

Org.2-EL is a medium-sized division (<1,000 employees) of a high-technology military products manufacturer of ground, air, and space-related products. Org.3-S: six of the seven case studies were held in the same country, and the remaining study took place abroad, where the sale procedure to a large Asian banking software developer employing around 1,000 people was followed. Org.4-B is one of the largest banks in Israel, employing around 10,000 employees. The introduction attempt was made at the bank's business software applications division. Org.5-Q is a small software company with fewer than 100 employees, developing software for the cellular phone industry. In addition to introducing OPCAT into five organizations, two additional case studies, Org.6-C and Org.7-BM, were held, in which similar OTS software products were followed through their introductory phases into the market. These two additional case studies served as baseline case studies and assisted both with building the validity and analysis of the findings from the first five studies and with building a more robust and accurate SDLC model.⁴
The case study sites were monitored periodically, according to the type of evidence being collected. For routine correspondence and product-related documentation, ongoing collection was used. Researcher observations, such as attendance at meetings with adopting organizations, were held according to each case study's natural timetable. Evidence collection sessions initiated by the researcher, such as questionnaires/surveys and interviews, were held at fixed time intervals across all case studies, so that a matrix of observation periods versus companies was created. These mixed monitoring methods enabled the evaluation of evidence against specific reference points in time and the description of continuous events as they were unfolding.

c. Data collection

The software products that were introduced into each of these organizations, and the processes that they underwent thereafter until successful adoption, installation, and acquisition, or possibly rejection by the organization, were monitored and meticulously documented. Four different types of evidence collection were utilized for the monitoring and documentation of the abovementioned processes: direct passive observations, documentation collection, open-ended and focused interviews, and physical artifacts, i.e., generated computer code or diagrams. Table 3 summarizes the data collection efforts in the five main case studies held with the OPCAT vendor. The table also clearly indicates that the most extensive case studies, as far as data collection was concerned, were Org.1-E and Org.3-S. Two of the remaining three studies were shorter, mainly because they represent failed implementation attempts of OPCAT and thus spanned a shorter life cycle.

⁴ Due to strict non-disclosure restrictions, the information regarding the two baseline case studies, as well as the products, organizations, and customers examined, has been kept confidential, notwithstanding its use for hypothesis building and generalization purposes.
Table 3: General data collection types and data collection summary statistics

Case site | Duration (months) | Interviews | Direct passive meetings | Questionnaires | Documentation (emails / hard copy / PPTs / other) | Physical artifacts
Org.1-E | 14 | 4 | 12 + 5-day brainstorming session | - | 2 / 5 / 10 / 5 | 3
Org.2-EL | 14 | 4 | 7 | 1 | 22 / 3 / 6 / 30 | -
Org.3-S | 20 | 6 | 12 + 5-day course session | 0 | 157 / 3 / 8 / 3 | 8
Org.4-B | 9 | 2 | 8 | 1 | 12 / - / 4 / 2 | -
Org.5-Q | 6 | 2 | 5 | 1 | 10 / - / 3 / 2 | -

The evidence gathered from these case studies was then compared and analyzed, and eventually enabled the identification and definition of common phases that the software went through in the various organizations and industries.

d. The preliminary model: a general view

Per the research method described above, we defined our own set of a priori basic constructs for this research. The various preliminary variable groups and their interactions are modeled using Object-Process Methodology (OPM), which provides a variety of complexity management tools that help diagram the model clearly and efficiently.[17] The top-most diagram, seen in Figure 4, demonstrates the main process of successful OTS software product implementation, which this research examined. This process and its impact on the SDLC of OTS software product innovations was our dependent variable. This process is handled and impacted by the various stakeholders in today's OTS software product market, i.e., the vendor, the adopting organization, third-party integrators/consultants and, indirectly, other market and industry effects, which constitute the intervening and contextual variables of this model, respectively. The stakeholders interact via an external, non-systemic, environmental social networking process, in which they exchange leads, assign projects, etc.
The impact of this process as a whole was of concern to this research, but its internal components and intricacies were not further elaborated, as the issue of social networking has already attracted extensive writing and research.[18] The independent variable of this model is the OTS software product. For the sake of simplicity, the object representing the product includes only four basic states: specified, developed, acquired, and implemented. These are the most important states in a software product's lifecycle, from the initial undeveloped product state, i.e., the product in specification format only, to the product successfully implemented by an adopting organization. Research model links in general represent possible hypotheses resulting from these relationships. Thus, the bidirectional effect links connecting the various attribute groups in Figure 4 mark the possible influence each group may have on others. The bidirectional links generalize unidirectional and bidirectional influences and suggest that the variable group undergoes certain changes once the process is performed. The preliminary model spanned 38 variables brought together from the various IS, OR, and marketing domains discussed in Section 3. For the sake of brevity we do not include here a full discussion of the reasons for their inclusion and of the variables comprising each variable group (the full explanation is readily available from the authors). Furthermore, the full list of initial variables, as well as the final ones, is given in Table 4.

Figure 4: The proposed preliminary logical model (legend: process, environmental process, object, environmental object, object state; result/consumption, effect, instrument, agent, bi-directional, and general-purpose structure links)
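The four product states above form a strict forward progression from specification to implementation. The following minimal sketch (the state names come from the preliminary model; the `OTSProduct` class is our own illustration, not part of OPM or the authors' tooling) encodes that progression:

```python
# Illustrative sketch: the four basic product states of the preliminary
# model, with transitions allowed only in forward order.
PRODUCT_STATES = ("specified", "developed", "acquired", "implemented")

class OTSProduct:
    def __init__(self):
        # every product starts as a specification only
        self.state = PRODUCT_STATES[0]

    def advance(self):
        """Move the product to its next lifecycle state."""
        i = PRODUCT_STATES.index(self.state)
        if i + 1 >= len(PRODUCT_STATES):
            raise ValueError("product is already implemented")
        self.state = PRODUCT_STATES[i + 1]
        return self.state

p = OTSProduct()
p.advance()      # developed
p.advance()      # acquired
print(p.state)   # acquired
```

Collapsing the lifecycle to four ordered states mirrors the model's simplification: intermediate marketing and organizational processes act on the product, but only these four states are tracked as the dependent variable's milestones.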
5 Intermediate Findings and Conclusions

The findings of this research include the final collaborative multi-tier SDLC research model and specifications of how the original research model has been modified throughout the research by omitting, adding, and merging variable groups and variables. The aim here was to show how the model and its variables were validated through the various case studies conducted. A table is used to show the original model's variables versus the final model's variables, and how each variable gained or lost validity based on evidence from the case studies. Therefore, in this section, we begin by describing the changes to the original research model in (a), and continue in (b) by explaining lead-driven development (LDD), brought as a partial, downscaled example of the much larger and full collaborative multi-tier system development lifecycle model. We end the section by giving a short explanation of the contents of the full collaborative multi-tier system development lifecycle model in (c).

a. Changes to the original research model

Following the guidelines of the case study research methodology [16], we entered the case sites using the preliminary suggested research model described in Section 4. As a case study proceeds, the research model is often updated by adding previously missed-out variables and deleting unnecessary or irrelevant ones. Table 4 lists the variables in the original and final models, their inclusion or exclusion in the preliminary and final models, the case studies upon which the exclusion or inclusion were based, and the nature of the impact of the variable. The nature is depicted using three symbols: +, −, and OT, as explained next. The plus symbol is used to denote a positive influence on the successful sale and implementation of the OTS software product.
The minus sign is used to denote a negative influence on the successful sale and implementation of the OTS software product. OT, which denotes "other" reasons, is used when the impact is of a compound or qualitative nature. The last "based on" column provides a supporting reason from the literature for the inclusion or exclusion of a specific variable. An explanation of the unique impacts and findings mentioned above follows. The point in time at which the implementation process is completed has been found to be, for the discussed product type, the third sale point. This is so because an initial first sale is either an impulsive buy or an exploratory attempt and is bundled with a human implementer. The second sale is a post-sale buy that tries to achieve the product's associated benefits independently of attached external vendor human resources. The third sale constitutes reconfirmation of the product's benefits to the adopting organization and is characterized by the purchase of licenses and a long-term support plan. The political factors were observed in only one case study, which happened to take place during the Second Lebanon War in northern Israel in the summer of 2006. However, we attributed this to coincidence and did not otherwise find any political issues affecting the market or industry, and therefore deleted these two variables from the final model. A unique influence was found in two of the larger case study sites, i.e., the partially government-owned organizations. In these sites, a very strong influence of outsourced personnel, sometimes even positioned within the adopting organization's decision-making units, was noticed. The addition of outsourced human resources as a descriptive characteristic of the customer's users was done in tandem with the addition of the same variable to the vendor's descriptive variables.
This is possible because, in many closed and highly specialized industries, many of the organizational employees today are outsourced employees, who are often sent from a common pool of HR outsourcing companies and employees.

b. Lead-driven development

Based on the case studies carried out as part of this research, a new approach to software development for off-the-shelf (OTS) products of entrepreneurial vendors has been identified. The new model, called lead-driven development (LDD), includes detailed guidelines for entrepreneurial vendors developing OTS software. These include directions for pure development procedures (at the coding level), along with organizational steps to be taken in conjunction with the coding process to support successful product implementation. This model relies on and revolves around an innovative procedure of improvisation, which is new to this industry. Improvisation counters many current trends which state that increased formality yields successful implementation. As Table 5 indicates, LDD may be highly beneficial to the vendor. Examining Table 5, we see that in all five case studies some form of LDD was followed. The level of LDD correspondence of each vendor at each case study site was scored on a scale of 1 to 5, where 5 represents complete correspondence. The classification was made by a number of uninvolved parties who checked for a clear-cut correspondence of the software introduction and development process to LDD. Since we examined the development process performed by the vendors and did not follow other stakeholders, we isolated the benefit associated with the use of LDD as the influence of LDD on OTS product sales in the corresponding case study. Benefit was therefore observed as the influence of LDD in achieving a preliminary sale with an organization (first sale). A higher level of benefit was achieving a second sale, and the highest level a third sale.
As explained later, a third sale is a measure of successful implementation. Since the observed vendor is of an entrepreneurial character, associated benefits of an efficient development process, such as shorter coding times or increased flexibility, were not accounted for.

Table 4. Original vs. final research model variables

Top group | Variable group | Variable | Original model | Final model | Case studies | Nature of impact/significance | Based on
Product | SDLC maturity | introduction stage | x | x | EL,E,S,B,Q | − |
Product | SDLC maturity | growth stage | x | x | EL,B,Q | + |
Product | SDLC maturity | maturity stage | x | x | NA | + |
Product | SDLC maturity | decline stage | x | x | NA | − |
Product | Trialability | | x | x | EL,B,Q | + |
Product | Complexity | | x | x | EL,E,S,B,Q | − |
Product | Compatibility | | x | x | EL,E,S,B | + |
Product | Relative advantage | functional | x | x | EL,E,S,B,Q | + |
Product | Relative advantage | economic | x | x | EL,E,S,B,Q | ? |
Product | Relative advantage | emotional | x | × | not in any | NA |
Product | Whole product factor | | x | x | EL,E,S,B | + |
Product | Specification flexibility | | x | x | E,S | + |
Vendor | HR structure | DMU/stakeholders | × | x | EL,B,Q | OT |
Vendor | HR structure | personnel/outsourced HR | × | x* | EL,B,Q | OT | [22]
Vendor | Service | | x | x | EL,B,Q | + |
Vendor | Business model | lock-in | x | x | EL,E,S,B,Q | + |
Vendor | Business model | novelty | x | x | EL,E,S,B,Q | + |
Vendor | Business model | efficiency | x | x | EL,E,S,B,Q | + |
Vendor | Business model | complementarities | x | x | EL,E,S,B,Q | + |
Vendor | Marketing strategy (4Ps) | price | x | x | EL,E,S | − |
Vendor | Marketing strategy (4Ps) | promotion | x | x | EL,E,S,B,Q | + |
Vendor | Marketing strategy (4Ps) | place | x | x | EL,E,S,B,Q | OT |
Vendor | Marketing strategy (4Ps) | product | x | *x | EL,E,S,B,Q | |

Table 4. Original vs. final research model variables (cont.)

Top group | Variable group | Variable | Original model | Final model | Case studies | Nature of impact/significance | Based on
Market and industry | PEST | political | x | × | EL | NA | [19]
Market and industry | PEST | economic | x | x | EL,E,S,B,Q | + |
Market and industry | PEST | sociological | x | × | not in any | NA | [19]
Market and industry | PEST | technological | x | x | EL,E,S,B,Q | + |
Market and industry | Industry type | | x | x | EL,E,S,B,Q | OT |
Adopting organization | Type | innovators | x | x | EL,E,S | + |
Adopting organization | Type | early adopters | x | x | B,Q | + |
Adopting organization | Type | early majority | x | x | NA | +/− |
Adopting organization | Type | late majority | x | x | NA | − |
Adopting organization | Type | laggards | x | x | NA | − |
Adopting organization | Users | change resistance | × | x | E,B | − | [20]
Adopting organization | Users | profession | x | x | EL,E,S,B,Q | + |
Adopting organization | Users | position | x | x | EL,E,S,B,Q | + |
Adopting organization | Users | learning curve | × | x | E,B | + | [21]
Adopting organization | Users | outsourced HR* | × | x | B,E | OT | [22]
Adopting organization | DMU | key events | x | x | EL,S | OT |
Adopting organization | DMU | time | x | x | EL,E,S,B,Q | + |
Adopting organization | DMU | power position of employees | x | x | EL,E,S,B | + |
Adopting organization | DMU | no. of decision makers involved in process | x | x | EL,E,B,Q | − |
3rd party integrator/consultant | Outsourced HR* | | × | x | B,E | OT | [22]

Table 5: Level of lead-driven development implemented by vendors vs. benefit in sales

Site | LDD level | 1st sale | 2nd sale | 3rd sale
Org.1-E | 4 | + | + | +
Org.2-S | 3 | + | + | −
Org.3-EL | 5 | + | not yet | not yet
Org.4-Q | 1 | − | − | −
Org.5-B | 1 | − | − | −

We entered the more comprehensive figures behind Table 5 into a statistical software tool and found that the correlation between sales, second sales, and third sales (i.e., vendor benefit) and the level of LDD implementation is clearly significant, positive, and high. After establishing that the use of LDD is beneficial for the vendor, we carefully documented this process and generalized it over all the case studies. The description of the full LDD process now follows.
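Before turning to the process description, the correlation claim above can be made concrete with a small illustrative re-computation. This is our own sketch, not the authors' analysis: we encode each site's benefit simply as the number of sales milestones reached in Table 5 (an assumed simplification; the authors used more comprehensive figures) and compute a Spearman rank correlation against the LDD level in pure Python:

```python
# Illustrative re-analysis of Table 5 (our own simplified encoding):
# benefit = number of sales milestones reached per site.
ldd_level = [4, 3, 5, 1, 1]   # Org.1-E, Org.2-S, Org.3-EL, Org.4-Q, Org.5-B
sales     = [3, 2, 1, 0, 0]   # three sales, two sales, first sale only, none, none

def ranks(xs):
    """Return 1-based average ranks of xs, handling ties."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    r = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1            # average rank of the tie group
        for k in range(i, j + 1):
            r[order[k]] = avg
        i = j + 1
    return r

def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    rx, ry = ranks(xs), ranks(ys)
    n = len(xs)
    mx, my = sum(rx) / n, sum(ry) / n
    num = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    den = (sum((a - mx) ** 2 for a in rx) * sum((b - my) ** 2 for b in ry)) ** 0.5
    return num / den

rho = spearman(ldd_level, sales)
print(round(rho, 2))   # 0.68 under this encoding
```

Under this crude encoding the coefficient comes out positive (about 0.68), but with only five sites it should be read as suggestive at best; the significance reported in the text rests on the fuller, unpublished figures.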
the emerging software development process elaborated below comprises 12 main steps, of which at least 4 include some form of improvisation. moreover, the most distinctive phase of this suggested model, the lead gathering task, is improvisation intensive. in addition, the model may serve as a strict continuous model similar to the waterfall model, or may be used as a spiral model involving repetitive tasks. at the core of this model are the following 12 steps:
step 1: initiation. this stage is a formal stage in the regular standard sdlc model. with entrepreneurial firms, however, this step tends to be an informal one with no accurate starting point in time. this stage includes structuring the will and intent to begin the project, giving the go-ahead instruction, and providing the limited resources necessary to start exploring the venture.
step 2: high level concept development. this second step includes forming the high level concept of the product: the problem it aims to solve, its associated benefits, etc. depending on the scope of the project/system, this development phase may be fulfilled using limited resources, spare time, and sometimes even academic resources. a substantial level of improvisation is used at this early stage, as part of the founding process [23]. improvisation is carried out in the development of the suggested product in a quick and result-oriented fashion, using minor or no documentation and no testing at all.
step 3: prototype. the first important milestone for the entrepreneurial vendor is the ability to deliver a functional prototype. the prototype should clearly convey the problem it is solving, its abilities and its associated benefits in an easy and understandable manner, with a friendly user interface. the number of moderate bugs, missing features and load balancing issues is of little importance at this stage, as the product will be used mostly for demo and pilot purposes in the near future.
we call this first prototype release a "bugged release". the prototype should further include a number of working examples from various domains.
step 4: minor testing in non-profit environments, academic demos and use. after completing the prototype, which should by this time be a powerful demonstration tool, the vendor should strive to demonstrate the tool in non-profit environments. the aim of these demonstrations is to find a limited installation bed for the product. these installations provide the developers with important feedback on bugs, missing features and general use of the tool, serving as a preliminary focus group for it. the academic scene is extremely beneficial for these purposes, as it also hosts great untapped commercial potential through conventions and conferences and access to a vast number of current or future professionals; see, for example, the penetration attempts in academia by established firms like philips, ibm and sap.
step 5: market introduction, benefit oriented demonstrations and mini pilots. an additional task which is improvisation intensive is demonstrating the tool to potential customers. the suggested form of product demonstration, which we call a "benefit oriented demonstration", is a special type of marketing method so far unused in the world of software. the equivalent in the non-software world is a vacuum cleaner demonstration in the customer's home, showing the customer the immediate benefits of the product [24]. thus, in this situation we suggest vendors demonstrate their new tools by implementing a form of improvisation at the customer's site. the vendor should use the tool to perform, on the spot, a real work task brought in by the customer, for which neither the vendor nor the customer was prepared. if this session exceeds one meeting, it may be considered a mini pilot. it is also in this step that the main improvisational task of the entire proposed model is undertaken.
from meeting to meeting, the vendor's marketing representatives must try to anticipate, using preliminary talks, phone conversations, social networking ties or emails, the needs of the potential customer as well as his existing environment. this highly informal improvisational task is used to build software requirements for the development team. the requirements gathered are for features which will be needed in the marketing sessions with the potential customers. these requirements are then addressed and coded immediately by the development team. the new features developed are neither documented nor tested thoroughly, as they will be used mainly for demonstration purposes and may ultimately be dropped. however, the added features must be mentioned and documented at least in a "what's new" file accompanying the product. this step is of a strictly repetitive, improvisational nature, and involves gathering lead requirements between marketing meetings and converting them into semi-operational software features.
step 6: offer ots + hr = project. after one of the leads materializes, the vendor is asked to prepare a formal proposal for sale. many of our case studies have indicated that ots products for the professional organizational realm are rarely sold if they come from unknown vendors, due to risk factors, mainly the risk of the product being abandoned. hence, offering the ots together with human resources (an implementer) who will assist the adopting organization, prepare the initial material and then tutor its users is usually beneficial for reducing the uncertainty in these situations. after the customer agrees upon the terms of the project (and not only the product) contract, engagement commences and the customer is now considered the baseline customer. hence, we call this sale the "first sale" en route to successful implementation.
step 7: bug fixing due to baseline customer requirements and marketing requests. after a certain amount of work is done using the tool at the customer's site, either by the customer's users or by the hr coupled to the ots, important feedback regarding the product begins to accumulate. this information enables bug fixing and improving the tool's robustness. in addition, important missing features required by the baseline customer and the marketing department are added to the software and provide for the first commercial release. this release is still highly saturated with bugs, but is already a commercially usable, "non-frustrating" version of the product.
step 8: constrain features, further commercial releases and support plan. the unbridled addition of new features, in the format suggested in steps 5 and 7 above, creates overwhelming, monstrous software. at this point the vendor should start to funnel out features which cater to a smaller audience and which have not been found to be part of the needs of the vendor's targeted audience. furthermore, the vendor should try to find a common thread or theme connecting and guiding all the other features. this decision enables further product releases, each containing additional noteworthy features, bug corrections and feature enhancements. with the continuous use of the product in at least one baseline customer, and before the move to the next implementation step, a support plan (or plans) for the product should be created.
step 9: develop complexity management tools, train integrator personnel, interface with the customer's software, find additional benefit oriented projects, embed within organizational deliverables. within the baseline customer's everyday work, issues of model complexity arise very quickly. these issues, which are different from testing the product or load-balancing it, should be addressed and solved early on.
in addition, this is also the time to deepen the roots within the customer, both by trying to embed the product deliverables within the customer's overall deliverables, by interfacing (physically) with the company's organizational information systems, and by finding additional projects within the customer's company to be involved with. deepening the vendor's role within the customer's site is a highly improvisational task by nature, and hence requires adequate skills.
step 10: second sale (licenses). the first sale is by no means an indication of successful implementation or diffusion of the innovation within the adopting organization [25]. moreover, the coupling of the hr with the ots does not really enable a real diffusion of the independent product. hence, a second sale to the same organization marks an important future commitment by the organization to drop the tutoring relationship and proceed to license purchases for independent use. license purchases signify that the organization now associates a positive benefit with the use of the product.
step 11: maintenance, user training, tutoring, software support. after a second sale is made, the relationship between the vendor and the customer moves into the maintenance phase. as the previous phase was coupled with hr, this is where the customer takes his first steps with the software alone. these first steps include formal user training sessions, one-on-one user tutoring sessions and general software support.
step 12: third sale, support. the earliest point one might consider as the point of successful diffusion or implementation of the innovation is, as seen in this study, the point of the third sale. after the second sale, the customer has used the software independently, learnt the product's advantages and disadvantages, and may now associate self benefit with the product more accurately.
therefore, a third sale is the first true mark that the customer is truly realizing the benefits of the product and is preparing to use it in the long term. support of the product continues from now on an annual basis, with milestones for new upgrades and releases.
figure 5: lead driven development: the 12 step timeline
given that a vendor adopts this new 12 stage development process, including and especially the improvisation-intensive stages, the questions which arise are how the organization builds and enhances the skills required for improvisation, and what these skills are. in the context of our study we identified three main factors which influence improvisational skills:
teamwork skills: the ability of the entrepreneurial team to communicate with one another and relay timely information, and to get things done easily and quickly, with no inhibitors or outside impeding factors.
experience: the entrepreneurial team members' experience in similar circumstances, and their ability to recall what they did right and wrong there.
experimental culture: the culture of the team, which encourages frequently trying out risky and/or innovative solutions.
these three main factors, measured and calibrated according to the characteristics described in [26], coincide with the improvisational skills factors in the literature. for example, vera and crossan [27] created a theoretical framework based on improvisation and innovative performance in teams. identifying variables from improvisational theatre, they tested the impact of 16 different related variables on improvisational skills in the environment of a local municipality. they found 4 of the 16 variables to be more influential than the others. the four factors they isolated were: expertise, teamwork skills, experimental culture, and real-time information and communication.
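the 12-step process, together with its waterfall-versus-spiral reading, can be sketched as a small data structure. this is purely illustrative: the step names are shortened from the text, and the improvisation flags reflect our reading of which steps the paper singles out as improvisation-intensive, not an explicit list given by the authors:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Step:
    number: int
    name: str
    improvised: bool  # improvisation-intensive per our reading of the text

LDD_STEPS = [
    Step(1,  "initiation", True),  # informal start in entrepreneurial firms
    Step(2,  "high level concept development", True),
    Step(3,  "prototype ('bugged release')", False),
    Step(4,  "minor testing in non-profit/academic environments", False),
    Step(5,  "market introduction, benefit-oriented demos, mini pilots", True),
    Step(6,  "offer ots + hr = project (first sale)", False),
    Step(7,  "bug fixing from baseline customer and marketing", False),
    Step(8,  "constrain features, commercial releases, support plan", False),
    Step(9,  "complexity tools, integrator training, embedding", True),
    Step(10, "second sale (licenses)", False),
    Step(11, "maintenance, user training, tutoring, support", False),
    Step(12, "third sale, ongoing support", False),
]

def waterfall(steps):
    """strict continuous reading: each step exactly once, in order."""
    yield from steps

def spiral(steps, iterations=2):
    """spiral reading: the step sequence is traversed repeatedly."""
    for _ in range(iterations):
        yield from steps

improvised = [s.number for s in LDD_STEPS if s.improvised]
assert len(improvised) >= 4  # "at least 4 include some form of improvisation"
```

the two generator functions correspond to the text's remark that the model may be run as a strict continuous (waterfall-like) process or as a spiral with repetitive tasks.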
in the context of our study we can therefore translate and apply their insights as follows:
expertise: gaining a higher level of software expertise in the intricacies of the development environments enables software developers to find out-of-the-sleeve solutions and bypasses for the many unpredicted software difficulties encountered.
teamwork skills: in general, software teams with a higher sense of collaboration usually tend to innovate more. to further clarify this point, we can propose to software teams the following teamwork skills which we encountered: development collaboration; information sharing via email, shared drives and knowledge management portals; inner group dynamics and communication; etc.
experimental culture: the experimental culture includes the ability to import new ideas and procedures from the world wide web, forums, groups and software development associates, and to try them out. furthermore, experimentation in the software industry, which is not usually backed by management, should be backed by top management and should also include experimentation on code developed using a number of alternative mechanisms.
real-time information and communication: the need for real-time information and communication is even more compelling in the software industry than in any other industry, because the software industry is built upon and relies heavily on the backbone of the internet. therefore, when improvising, it is crucial to gain real-time updated information over the lan or the internet and to have a variety of channels for communicating with the customer and other team members. each of these channels specializes in a different type of content that may be passed: audio, video, documents, emails, etc.
c.
the collaborative multi-tier system development life cycle
the lead-driven development (ldd) paradigm described above represents a list of tasks from the vendor's point of view. this list helped build the vendor's tier in the complete collaborative multi-tier system development life-cycle task matrix that caters to all stakeholders. the collaborative multi-tier model takes into consideration, through the nature of the tasks suggested, pure is development tasks (e.g., prototyping, bug fixing), market influences (e.g., market introduction techniques for entrepreneurial ots software vendors, such as offering the first sale of such a product as a combined project with human resources), and organizational recommendations aimed mainly at avoiding the customer's internal organizational obstacles. the collaborative multi-tier system development life-cycle (cmsdlc) model, which combines common tasks from the various case studies, is consistent in that it avoids contradicting tasks. one of its unique features is that it distinguishes between benefit-oriented tasks and standard waterfall-type formal tasks and milestones. when a task or a set of tasks is performed in an iterative manner, it is marked as an on-going task or specifically mentioned in the explanation section as one that needs to be performed iteratively. one such example is the task dealing with gathering lead requirements from potential customers for the purpose of prototype and feature building. the cmsdlc model is also suitable for mature organizations that seek to develop a new ots software product. it is even more suitable when the mature organization separates this entire operation from its existing core operations through various methods, such as founding a new subsidiary, which is akin to an entrepreneurial firm from the development and market perspectives.
however, the financial backing of the parent organization and its reputation may shorten the duration of many of the tasks and may make them more easily achievable. in cases where a mature organization seeks to improve existing product development methods, the suggested model may not be as applicable, since the level of uncertainty such an organization encounters regarding market and organizational effects (especially regarding the customers) is lower. the chart of the full model depicts, in a single poster-style view, the multi-tier model. the timeline of the system development life cycle and the tasks for each stakeholder are stacked on separate horizontal lines. since all the stakeholders share one timeline, the interaction between the stakeholders and the interdependency of tasks are clearly visible. only through extensive collaboration can the effort proceed as a whole. for each task, a list of the case studies the task was based upon is provided: one-letter abbreviations corresponding to the case study sites which demonstrated the use of the specific task appear below each task. the logic behind gathering all the tasks into the model includes checking tasks for contradictions across the various case studies and merging similar tasks into unifying generic tasks.
6 conclusion and future research
at this point we would like to introduce the hypotheses which were derived from the findings of our case studies. the hypotheses suggest a plethora of possible future research on the process of validating them. the first hypothesis emerged from research question 2. we found that the two parts of an ots software product, the product and its underlying methodology, played an important role in all the case studies, and this justified the hypothesis. we therefore phrased the first hypothesis as follows.
h1: the more a customer is inclined to adopt a methodology, the more he or she is inclined to adopt the ots software tool associated with that methodology.
in other words, we will be looking for a possible correlation reflecting a "bundling" relationship between the methodology and its supporting tool. we can explain this phenomenon by looking both at the marketing literature regarding characteristics of products and at the is literature, which explains and recommends various modes of is product sale. the marketing literature [28], [12] and [29] clearly defines the "whole product concept", which was introduced by theodore levitt [30]. levitt defined the whole product as follows: a product is, to the potential buyer, a complex cluster of value satisfactions. the whole product factor denotes the completeness of the product being marketed at a certain point in time with regard to a complete solution. the components of the whole product include the core product, the tangible product, the augmented product and the total solution. the relationship between the ots product used in our study and its methodology typifies the product as being close to an augmented product.
the second hypothesis involved a distinction between organizations using a formal software development procedure and those using improvisation-intensive development techniques. as we started noticing in the case studies, this distinction is related to the level of uncertainty the vendor organization runs into. hence we defined the following hypothesis:
h2: the higher the level of uncertainty, the higher the vendor's use of improvisation-intensive development methods.
improvisation has been shown to be used by entrepreneurial vendors when faced with time pressure, complexity and uncertainty [31]. future research may extend this assertion to include and emphasize its relevance to the development process as well.
the third hypothesis concerns the direct benefits associated with utilizing improvisation within the software development process, i.e., shorter time-to-market for the product and increased sales. the hypothesis was defined as follows.
h3: when uncertainty is high, the more a vendor uses improvisation, the shorter his market response time becomes and the better his ability to make a first sale.
the organizational change resistance to new technologies in general, and to software in particular, was found in our case studies to be overcome by applying a marketing technique which couples a human resource implementer to the product [32]. the human resource implementer accompanies the implementation and even performs most of the initial work for the customer using the tool. this triggered the following hypothesis:
h4: the higher the level of hr participation in the ots software sale attempt to the change-resistant customer, the higher the chances for successful implementation.
similarly to the entrepreneurial vendor's efforts to overcome change resistance among the adopting organization's users, the vendor has to build its legitimacy with the adopting organization's dmu [33]. we started noticing that this legitimacy buildup was being done via affiliation with accredited scientists/academics and/or through established third-party integrators. stinchcombe [33] also specified three reasons for the impediments companies face when entering into a business relationship with, or buying a product from, a new organization. he calls this effect the "liability of newness". the reasons cited for this liability are lack of experience, lack of size and lack of legitimacy. the latter was addressed by opcat, which built its legitimacy in all the case studies through the use of a reputable scientist to compensate for its lack of legitimacy and external reputation.
furthermore, some legitimacy buildup was achieved through the use of a large third-party integrator with a proven reputation and experience in the industry. this gave rise to the following hypothesis:
h5: the more a new vendor firm affiliates with a distinguished scientist or an established third party, the higher the chances for successful implementation.
7 summary and recommendations
we have proposed and evaluated a software system development life cycle model which aims to improve successful system implementation and adoption through communication and collaboration among stakeholders. the new model for software development that emerged, lead-driven development, was discovered, validated against the case studies, and refined via observations in five industry case studies and two additional control studies regarding successful implementations of ots products by entrepreneurial developers. the proposed lead-driven development model accounts for market and organizational factors and the way they are woven into the traditional phases of software development. it offers the basis for a unified, comprehensive multi-tier sdlc framework and methodology that contributes to improved stakeholders' communication and collaboration through the use of a common reference model for all stakeholders. each tier addresses a different force or stakeholder involved in the software market: vendor, customer, consultants and integrators. the model is potentially beneficial for improving communication and collaboration among life cycle stakeholders in that it embeds action items from the is, marketing and organizational realms. many of these action items are performed using improvisational skills. to excel in lead-driven development in general, and in software development improvisation in particular, entrepreneurial vendors should enhance their improvisational skills.
in line with previous studies [27], we found three main factors that influence improvisational skills: experience, teamwork skills, and experimental culture. focusing on these facets, organizational training should have a clear positive effect on improvisational skills and hence on innovation abilities.
bibliography
[1] s. sawyer, a market-based perspective on information systems development, communications of the acm (44:11), pp.97-102, 2001.
[2] e. carmel, cycle time in packaged software firms, journal of product innovation, (12), pp.110-123, 1995.
[3] d. dori, object-process methodology, springer-verlag, 2002.
[4] w. royce, managing the development of large software systems, in proceedings of ieee wescon, pp.1-9, 1970.
[5] techtarget, viewed at http://searchvb.techtarget.com/sdefinition/0,,sid8_gci755068,00.html, february 2004.
[6] d. howe (ed.), the free on-line dictionary of computing, http://foldoc.doc.ic.ac.uk/foldoc/foldoc.cgi?query=lifecycle, 1996.
[7] gao, gao/imtec-8.1.4 assessing acquisition risks, http://www.gao.gov/special.pubs/im814.pdf, 1992.
[8] b. krogstie and b. bygstad, cross-community collaboration and learning in customer-driven software engineering student projects, in proceedings of the 20th conference on software engineering education & training (cseet), 2007.
[9] e. carmel and s. becker, a process model for packaged software development, ieee transactions on engineering management (41:5), pp.50-56, 1990.
[10] m. cusumano, a. maccormack, c. f. kemerer and b. crandall, software development worldwide: the state of the practice, ieee software, pp.28-34, 2003.
[11] m. keil and e. carmel, customer-developer links in software development, communications of the acm (38:5), pp.33-44, may 1995.
[12] g. moore, crossing the chasm, harper collins, 1999.
[13] r. k. yin, the abridged version of case study research: design and method, in l. bickman and d. j. rog (eds.), handbook of applied social research methods, thousand oaks, ca: sage, pp.229-259, 1998.
[14] w. tellis, introduction to case study, the qualitative report (3:2), july 1997. (http://www.nova.edu/ssss/qr/qr3-2/tellis1.html)
[15] k. m. eisenhardt, building theories from case study research, academy of management review (14:4), pp.532-550, 1989.
[16] r. yin, case study research: design and methods, thousand oaks, london, new delhi: sage, 1984.
[17] d. dori, i. reinhartz-berger and a. sturm, opcat: a bimodal case tool for object-process based system development, proc. ieee/acm 5th international conference on enterprise information systems (iceis 2003), école supérieure d'électronique de l'ouest, angers, france, pp.286-291, april 23-26, 2003.
[18] c. typaldos, social networking, viewed at http://www.typaldos.com/, 2003.
[19] pest, viewed at http://www.marketingteacher.com/lessons/lesson_pest.htm, november 2004.
[20] k. r. stam, j. m. stanton and i. r. guzman, employee resistance to digital information and information technology change in a social service agency: a membership category approach, journal of digital information, vol. 5, issue 4, 2004.
[21] c. f. kemerer, how the learning curve affects case tool adoption, ieee software, vol. 9, pp.23-28, 1992.
[22] m. lacity and l. willcocks, information technology sourcing reflections, wirtschaftsinformatik, special issue on outsourcing, vol. 45(2), pp.115-125, 2003.
[23] t. baker, a. s. miner and d. t. eesley, improvising firms: bricolage, account giving and improvisational competencies in the founding process, research policy, vol. 32, issue 2, pp.255-276, 2003.
[24] vacuum marketing strategy, viewed at http://www.marketingsurvivalkit.com/advertising-salesstrategy.htm, october 2006.
[25] r. g. fichman and c. f. kemerer, the illusory diffusion of innovation: an examination of assimilation gaps, information systems research (10:3), pp.255-275, 1999.
[26] s. cohen, a multi-tier system development life cycle model for off-the-shelf software with market and organizational effects, ph.d. dissertation, faculty of industrial engineering and management, technion, haifa, israel, 2007.
[27] d. vera and m. crossan, improvisation and innovative performance in teams, organization science, vol. 16, no. 3, pp.203-224, may-june 2005.
[28] a. n. alderman, implementing the whole product concept in strategic sector marketing, proceedings of the fifteenth annual ieee applied power electronics conference and exposition, new orleans, la, usa, pp.27-30, 2000.
[29] jwr, viewed at: http://jwr.strategictechnology.com/pages/choosingsoftware.pdf, november 2005.
[30] t. levitt, marketing intangible products and product intangibles, harvard business review, vol. 59, no. 3, pp.94-102, 1981.
[31] s. l. brown and k. m. eisenhardt, the art of continuous change: linking complexity theory and time-paced evolution in relentlessly shifting organizations, administrative science quarterly, vol. 42, no. 1, pp.1-34, 1997.
[32] s&m, how software firms spend their s&m dollars?, http://www.softletter.com/pdfs/fhp10p13.pdf, 2006.
[33] a. l. stinchcombe, social structure and organizations, in james g. march (ed.), handbook of organizations, chicago: rand mcnally, 1965.
[34] s. singh and p. kotze, an overview of systems design and development methodologies with regard to the involvement of users and other stakeholders, proceedings of saicsit, pp.37-47, 2003.
[35] m. cusumano, the business of software: what every manager, programmer, and entrepreneur must know to thrive and survive in good times and bad, free press, 2004.
[36] k. b. clark and s. c. wheelwright, managing new product and process development: text and cases, new york: free press, 1993.
[37] k. b. clark and s. c. wheelwright, the product development challenge: competing through speed, quality, and creativity, boston: harvard business school press, 1995.
[38] s. c. wheelwright and k. b. clark, accelerating the design-build-test cycle for effective product development, international marketing review (11:1), pp.32-46, 1994.
[39] b. boehm and p. bose, a collaborative spiral software process model based on theory w, proceedings, 3rd international conference on the software process, applying the software process, ieee, 1994.
[40] c. h. fine, clockspeed: winning industry control in the age of temporary advantage, perseus publishing, 1999.
[41] d. avison and g. fitzgerald, where now for development methodologies?, communications of the acm, vol. 46, no. 1, pp.78-82, 2003.
[42] c. ebert, requirements engineering: understanding the product life cycle: four key requirements engineering techniques, ieee software, pp.19-25, may/june 2006.
[43] e. mustonen-ollila and k. lyytinen, why organizations adopt information system process innovations: a longitudinal study using diffusion of innovation theory, information systems journal (13:3), pp.275-297, july 2003.
[44] h. c. lucas jr. and v. spitler, technology acceptance and performance: a field study of broker workstations, decision sciences journal, (30:2), 1999.
[45] f. d. davis, perceived usefulness, perceived ease of use, and user acceptance of information technology, mis quarterly, pp.319-339, 1989.
[46] g. c. moore and i. benbasat, development of an instrument to measure the perceptions of adopting an information technology innovation, information systems research (2:3), pp.192-220, september 1991.
[47] j. c. brancheau and j. c. wetherbe, the adoption of spreadsheet software: testing innovation diffusion theory in the context of end-user computing, information systems research (1:2), pp.115-142, 1990.
[48] r. b. cooper and r. w. zmud, information technology implementation research: a technological diffusion approach, management science (36:2), pp.123-139, 1990.
[49] j. verville and a. halingten, an investigation of the decision making process for selecting an erp software: the case of esc, management decision (40:3), pp.206-216, 2002.
[50] p. nelson, w. richmon and a. seidmann, two dimensions of software acquisition, communications of the acm (39:7), pp.29-35, 1996.
[51] j. iivari and i. ervasti, the impact of alternative is acquisition options upon the is implementation and success, in proceedings of the 1992 acm sigcpr conference on computer personnel research, cincinnati, ohio, united states, pp.338-349.
[52] m. c. paulk, structured approaches to managing change, crosstalk: the journal of defense software engineering (12:11), pp.4-7, november 1999.
[53] nordman, commercialization success in early stage technology companies, viewed at www.rocketbuilders.com/commercialization/rb_commercialization_presentation_jun2004.pdf, 2004.
[54] y. lee and g. colarelli o'connor, new product launch strategy for network effects products, journal of the academy of marketing science (31:3), pp.241-255, 2003.
[55] e. montaguti, s. kuester and t. s. robertson, entry strategy for radical product innovations: a conceptual model and propositional inventory, international journal of research in marketing (19), pp.21-42, 2002.
[56] s. shane and s. venkataraman, the promise of entrepreneurship as a field of research, academy of management review, vol. 25, no. 1, pp.217-226, 2000.
[57] s. shane, entrepreneurship: a process perspective, south-western college pub, 1st edition, january 2004.
[58] f. murray and m. tripsas, the exploratory process of entrepreneurial firms: the role of purposeful experimentation, advances in strategic management, vol. 21, pp.45-75, 2004.
[59] NIH System Development Life Cycle (SDLC) IT Security Activities Matrix, viewed on http://irm.cit.nih.gov/security/nih-sdlc.html, December 2006. [60] DOJ, The Department of Justice Systems Development Life Cycle Guidance Document, viewed on http://www.usdoj.gov/jmd/irm/lifecycle/table.htm, January 2003. [61] Agile software development: A definition from WhatIs.com, viewed on: http://whatis.techtarget.com/definition/0,,sid9_gci936457,00.html, November 2006.

Shalom Cohen (b. December 16, 1973) completed his MSc in Operations Research at Tel Aviv University, Israel, and his PhD in the Faculty of Industrial Engineering and Management at the Technion, Israel. In addition to lecturing at the Technion on information system related topics, Shalom holds the position of chief system architect in a high-tech company in the homeland security sector.

Dov Dori (b. September 2, 1953) is Information and Systems Engineering Professor at the Faculty of Industrial Engineering and Management, Technion, Israel, and Research Affiliate with the Engineering Systems Division at Massachusetts Institute of Technology. He received his BSc in Industrial Engineering and Management from the Technion in 1975, MSc in Operations Research from Tel Aviv University in 1982, and PhD in Computer Science from the Weizmann Institute of Science in 1988.

Uzi de Haan (b. September 5, 1943) completed his MSc in Aeronautical Engineering at the University of Delft, Holland, and his PhD in the Faculty of Industrial Engineering and Management at the Technion, Israel. He joined the Technion as a professor in the area of strategic management and entrepreneurship at the Faculty of Industrial Engineering after many years in the high-tech industry.

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. III (2008), No. 2, pp. 196-203

Virtual Training Centre for Computer Numerical Control

M. Sahin, S. Yaldiz, F. Unsacar, B. Yaldiz, N. Bilalis, E. Maravelakis, A.
Antoniadis

Abstract: Virtual training has long been of interest for vocational training; however, it needs more time to become common in all specific training fields. Focusing on computer numerical control (CNC) training, new developments in CNC machinery create a continuous demand for enhancing the programming and technical capabilities of the involved personnel. CNC training should keep pace with these developments, in particular with the machines' programming capabilities, the automation they offer, and their technical capabilities. Based on these main objectives, a Virtual Training Centre (VTC) for CNC has been developed, and it is presented in this paper. The VTC is the main result of a multilateral Leonardo da Vinci project which aims to promote and reinforce vocational training in CNC machines.

Keywords: virtual training centre, CNC, VET.

1 Introduction

In most Member States, there has been a growing awareness over the past two decades of the importance of quality in vocational education and training (VET). The changing demands of the knowledge-based society and the overall trend to increase the efficiency and effectiveness of VET systems constitute major driving forces behind these developments. Major European funding frameworks and programmes, such as Leonardo da Vinci (LdV), have contributed to improving education and VET systems by raising the level of the services they offer. One of the objectives of innovative VET systems is the transparency and distribution of information. This function concerns the potential and actual use of information. There may be different systems and structures of information distribution among the various actors and in the public, and these are preconditions for creating transparency in the VET system. To improve quality, there must be systems for distributing information and mechanisms to ensure that the circulated information can be used by the various actors in the policy process.
The more widespread the distribution, the better the potential use of the data will be; in turn, better quality data can be expected, as the actors are able to check the information against their experience and provide feedback to the systems for gathering data. One of the concrete future strategic objectives of education and training systems in the EU (Council of the European Union, 2001) is the improvement of the quality and effectiveness of education and training systems in the EU. This includes improving education and training for teachers and trainers, developing skills for the knowledge society, ensuring access to ICT for everyone, increasing recruitment to scientific and technical studies, and making the best use of resources. The second strategic objective is facilitating the access of all to education and training systems. This objective includes an open learning environment, making learning more attractive, and supporting active citizenship, equal opportunities and social cohesion. This paper describes the main results of an LdV project that addresses the strategic objectives mentioned above.

2 From the Blackboard to the Virtual Training Centre

During the 1960s and 1970s, teaching and learning tools were nothing but a piece of chalk and a blackboard eraser, and teachers and students met each other face to face inside the classroom. In the 1980s, videotape programs were used as teaching aids. In the 1990s, one-way teaching by computer arrived. Finally, today's advanced computer and information network technology has introduced radical innovative breakthroughs in our teaching and learning methods as well as in the learning environment. Students can listen to their teachers or trainers in distant classrooms through PCs and get a simultaneous view of their teachers and texts as well.
They can ask questions and record the "class" for repeated viewing. Training organizations can conduct professional training directly via the computer network. These learning environments are not so different from a teacher-guided class, with discussions and tests as well. In the report "Studies in the Context of the E-Learning Initiative: Virtual Models of European Universities" [1], a key concern was how virtual mobility is being supported in European universities through ICT integration and e-learning. The study found that the majority of universities face major challenges in promoting ICT integration. An ICT strategy is very important, and those universities that have an ICT strategy are significantly ahead in the integration of ICT in administration, organisation and networking. Integration of ICT and e-learning is politically important in the EU in terms of internationalisation and globalisation of education, student demand, and interest in increasing the quality of education through ICT [2], [3], [4]. At the national level, integration of ICT should become a key priority, with national and regional institutions making a commitment to ICT and the development of networks. There must be increased national flexibility, with a commitment to support common standards of quality and assessment and to develop national and international metadata standards. In the last three decades, a large number of vocational training centres and technical universities have been giving priority to CNC training. New developments in CNC machines create a continuous need for an updated CNC training curriculum. CNC training should keep pace with these developments, in particular with the machines' programming capabilities, the automation they offer, and their technical capabilities. In addition, CNC programming is becoming more and more automated through the use of CAD/CAM systems. This requires programmers to acquire CAD operation capabilities on top of their CNC operation and programming knowledge.
The major objective in the field of CNC training is to improve the qualifications and competences of the trainees, which is directly related to a well-designed and effective curriculum to be carried out on CNCs. The facilities for CNC training vary a lot, and this has a direct impact on the experience that trainees acquire during their apprenticeship. This paper presents the development and promotion of a Virtual Training Centre (VTC), an Internet-based e-learning facility specifically aimed at computer numerical control (CNC) training. The centre includes a virtual space (a CNC training portal) on the Internet which allows the constant sharing of e-learning based CNC teaching material, created so as to foster the further development of e-learning based CNC educational content. This Virtual Training Centre aims at setting the standard for CNC virtual learning in vocational training systems [5].

3 Developing a Common CNC Curriculum

During the first stages of the project, the equipment, methods, curricula and techniques currently used for CNC training by the organisations in the partner countries were observed, collected and evaluated [6], [7]. The selected materials were used to create a new and common curriculum.
Five important factors that contribute to learning were taken into account in order to prepare the CNC curriculum:

• Motivation
• Aptitude
• Presentation
• Repetition
• Practice with reinforcement

The approach for developing the appropriate training material was based on the following key concepts:

• Motivation
• Know your machine (from a programmer's viewpoint)
• Prepare to write programs
• Understand the motion types
• Know the compensation types
• Format your programs in a safe, convenient, and efficient manner
• Know the special features of programming
• Know your machine (from an operator's viewpoint)
• Understand the three modes of operation
• Know the procedures related to operation
• Be able to verify programs safely

This approach, combined with the important learning factors, finally led to a CNC training curriculum comprising 28 sessions:

1. Machine configuration
2. Speeds and feeds
3. Visualizing program execution
4. Understanding program zero
5. Measuring program zero
6. Assigning program zero
7. Flow of program processing
8. Introduction to programming words
9. Preparation for programming
10. Types of motion
11. Introduction to compensation
12. Dimensional (wear) tool offsets
13. Geometry offsets
14. Tool nose radius compensation
15. Program formatting
16. The four kinds of program format
17. Simple canned cycles
18. Rough turning and boring multiple repetitive cycle
19. More multiple repetitive cycles
20. Threading multiple repetitive cycle
21. Subprogramming techniques
22. Control model differences
23. Other special features of programming
24. Control model differences
25. Machine panel functions
26. Three modes of operation
27. The key operation procedures
28.
Verifying new programs safely

4 The Structure of the Virtual Training Centre

To develop the Virtual Training Centre, a communication website was first developed in order to manage the activities and tasks to be carried out by the partners. Then, an interactive teaching program was developed and put on a website to form the Virtual Training Centre (Figure 1). The common curriculum developed for this purpose was the basis of this training centre. The site, along with the interactive teaching program, was divided into four main areas: "News", "Exchange of Views", "Projects and Networks", and "Information Resources". With these, users are able to access a newsletter, a bulletin board, online surveys and survey reports, information on VET networks, an electronic library with references, a bookshop with downloadable publications, and a number of databases.

Figure 1: The interface of the interactive CNC training centre (http://www.vtcforcnc.com)

Figure 2: Example of 3-phase animated CNC training material

In the main core of the CNC training material, simulations and practical exercises are included in the interactive training centre (Figure 2). The feedback from the implementation of the VTC in training centres has been recorded and evaluated in order to produce the final version. The evaluation procedure covered content (topics, language used, modules), methods (progress, different levels of difficulty, and range of resources, situations and practical cases) and technology (ease of installation, interactive nature, and use without a tutor). The main aim of the VTC for CNC is to be an interactive platform, a meeting point for policymakers, social partners, practitioners, researchers and all those with an interest in the CNC field of vocational education and training.
Experts in the field are able to share and exchange knowledge and experience with associates within and outside the European Union. This will foster the long-term viability of the centre.

5 Aims and Target Groups of the VTC

The VTC aims to improve the skills and competences of people in order to promote and reinforce the contribution of vocational training to the process of innovation, with a view to improving competitiveness and entrepreneurship, also in view of new employment possibilities. The specific aims of the VTC can be defined as follows:

• Training the trainers, trainees, technicians, apprentices and all those enthusiastic about CNC.
• Preparing technicians as intermediates having the common measurable qualities the industry is seeking.
• Helping to form a labour force that can use current knowledge and technology, and is thus engaged in life-long learning.
• Supporting sectoral communication through the national centres in the partner countries.
• Setting up a website to publish the data collected.
• Adapting the collected materials to enhance the new curriculum so that it satisfies requirements in a modern sense.
• Helping to improve and upgrade the competences and skills of the involved institutions' didactic staff and to exchange experiences over the Virtual Training Centre.
• Enabling the participants to extend the common educational qualifications of CNC technologies, and the accreditation of the skills and knowledge of CNC technologies acquired within the network created between participating institutions and organizations.
• Increasing the quality of employment through qualified workers.
• Helping to increase the active use of the technology acquired and thus to raise standards.
• Contributing to individuals' engagement in life-long learning.
• Providing a labour force in accordance with common design and production standards.
• Contributing to the labour market by using common technology and equipment effectively.
• Helping to enhance the available potential of human resources.

Target groups include trainers, trainees, technicians, apprentices and all enthusiasts about CNC. The final and potential users of the project's results are training organisations, SMEs dealing with metal products by CNC usage, and universities, colleges, vocational schools and training centres.

6 Conclusions

The integration of ICT in this virtual learning environment for CNC, the development of the VTC and the common training curriculum are focused on the EU goals of internationalisation and globalisation of education, student demand, and interest in increasing the quality of education through ICT. At the national level, the integration of ICT should become a key priority, with national and regional institutions making a commitment to ICT and the development of networks. There must be increased national flexibility, with a commitment to support common standards of quality and assessment and to develop national and international metadata standards. This centre addresses the priorities expressed here. Furthermore, this Virtual Training Centre addresses the strategic objectives mentioned above: improving the quality and effectiveness of education and training systems in the EU by developing skills for the knowledge society, ensuring access to ICT for everyone, increasing recruitment to scientific and technical studies, and making the best use of resources. Facilitating the access of all to education and training systems, by providing an open learning environment, making learning more attractive, and supporting active citizenship, equal opportunities and social cohesion, is the other strategic objective that can be achieved through this Virtual Training Centre.
The experiences and knowledge gained during the implementation of this centre can be used in developing and improving other training programmes, in particular in the area of new information technology applications in related sectors.

Bibliography

[1] Ramboll, PLS: Studies in the Context of the E-Learning Initiative: Virtual Models of European Universities (Lot 1). Draft Final Report to the European Commission, DG Education and Culture. Available at http://elearningeuropa.info, 2004.
[2] A. Andreatos, Virtual communities and their importance for informal learning, International Journal of Computers, Communications & Control, Vol. II, No. 1, pp. 39-47, 2007.
[3] A. Styliadis, I. Karamitsos, D. Zachariou, Personalized e-learning implementation - the GIS case, International Journal of Computers, Communications & Control, Vol. I, No. 1, pp. 59-67, 2007.
[4] C. Zhengxin, Learning about learners: System learning in virtual learning environment, International Journal of Computers, Communications & Control, Vol. I (2008), No. 1, 2007.
[5] M. Sahin, N. Bilalis, S. Yaldiz, A. Antoniadis, F. Unsacar, E. Maravelakis, Revisiting CNC training - a virtual training centre for CNC, International Conference on e-Portfolio Process in Vocational Education (EPVET), Bucharest, Romania, 2007.
[6] W. Xiaoling, Z. Peng, W. Zhifang, S. Yan, L. Bin, L. Yangchun, Development of an interactive VR training for CNC machining, International Conference on Virtual Reality Continuum and Its Applications in Industry, Proceedings VRCAI 2004, ACM SIGGRAPH, pp. 131-133, 2004.
[7] L. Yadong, G. Xingui, L. Wei, Y. Kazuo, K. Keizo and M. Fujishima, An intelligent NC program processor for CNC system of machine tool, Robotics and Computer-Integrated Manufacturing, Vol. 23 (2), pp. 160-169, 2007.
Mehmet Sahin, Suleyman Yaldiz, Faruk Unsacar, Burak Yaldiz
Technical Science College, Selcuk University, 42031 Konya, Turkey
E-mail: {mesahin, syaldiz, funsacar}@selcuk.edu.tr

Nikolaos Bilalis
Department of Production Engineering & Management, Technical University of Crete, 73100 Chania, Greece
E-mail: bilalis@dpem.tuc.gr

Emmanuel Maravelakis, Aristomenis Antoniadis
Department of Natural Resources & Environment, Design & Manufacturing Laboratory, Technological Educational Institute of Crete, 73133 Chania, Greece
E-mail: {marvel, antoniadis}@chania.teicrete.gr

Received: December 20, 2007

Int. J. of Computers, Communications & Control, ISSN 1841-9836, E-ISSN 1841-9844, Vol. V (2010), No. 3, pp. 280-291

Extreme Data Mining: Inference from Small Datasets

Răzvan Andonie
Computer Science Department, Central Washington University, Ellensburg, USA
and Department of Electronics and Computers, Transylvania University of Braşov, Romania
E-mail: andonie@cwu.edu

Abstract: Neural networks have been applied successfully in many fields. However, satisfactory results can only be obtained under large-sample conditions. With small training sets, the performance may not be so good, or the learning task may not be accomplished at all. This deficiency severely limits the applications of neural networks. The main reason why small datasets cannot provide enough information is that there exist gaps between samples, and even the domain of the samples cannot be ensured. Several computational intelligence techniques have been proposed to overcome the limits of learning from small datasets. We have the following goals: (i) to discuss the meaning of "small" in the context of inferring from small datasets; (ii) to overview computational intelligence solutions for this problem; (iii) to illustrate the introduced concepts with a real-life application.
1 Introduction

Small dataset conditions exist in many applications, such as disease diagnosis, fault diagnosis or deficiency detection in biology and biotechnology, mechanics, flexible manufacturing system scheduling, drug design, and short-term load forecasting (an activity conducted on a daily basis by electrical utilities). In this section, we describe a computational chemistry problem, review a class of neural networks to be used, and summarize our previous work in this area.

1.1 A Real-World Problem: Assist Drug Discovery

Current treatments for HIV/AIDS consist of co-administering a protease inhibitor and two reverse transcriptase inhibitors (usually referred to as combination therapy). This therapy is effective in reducing viremia to very low levels; however, in 30-50% of patients it is ineffective due to resistance development, often caused by viral mutations. Due to resistance and poor bioavailability¹ profiles, as well as the toxicity associated with these therapies, there is an urgent need for more efficient design of drugs. We focus on inhibitors of the HIV-1 protease enzyme, using the IC50 as the target value. A detailed description of the problem, from a computational chemistry point of view, can be found in our papers [1-3]. The IC50 value represents the concentration of a compound that is required to reduce enzyme activity by 50%. A low IC50 value indicates good inhibitory activity. The available dataset consists of 196 compounds with experimentally determined IC50 values. Twenty of these molecules are used as an external test set after the training is completed. The remaining 176 molecules are used for training and cross-validation. Our practical goal is to predict the (unknown) IC50 values for 26 novel compounds which are candidates for HIV-1 protease inhibitors. We use two IC50 prediction accuracy measures: the RMSE (root mean squared error) and the symmetric mean absolute percentage error (SMAPE).
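The two accuracy measures can be sketched in a few lines of Python (a minimal illustration, not the authors' code; SMAPE has several variants in the literature, and the symmetric-denominator form below is only one common choice):

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error over paired observations and predictions."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def smape(y_true, y_pred):
    """Symmetric mean absolute percentage error, in percent.

    Uses the common definition |t - p| / ((|t| + |p|) / 2), averaged
    and scaled by 100; it is bounded above by 200%.
    """
    terms = [abs(t - p) / ((abs(t) + abs(p)) / 2.0) for t, p in zip(y_true, y_pred)]
    return 100.0 * sum(terms) / len(terms)
```

For example, under these definitions `rmse([0, 0], [3, 4])` is about 3.54 and `smape([1], [3])` is 100.0.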
¹Bioavailability is the rate at which the drug reaches the systemic circulation.

The easiest way to represent a molecule is by a vector of features (molecular descriptors), which may be both topological indices and physico-chemical properties. The resulting features may be numerous and inter-correlated. Using the complete set of descriptors may lead to overfitting if it is too large compared to the size of the training set. We select 35 molecular descriptors based on their contribution to molecular entity. Although biological activity data has been obtained for many more chemical structures at various pharmaceutical companies and academic laboratories, they are not available in the public domain. In fact, most classical studies for a specific enzyme system have been performed on small datasets, due to the limited experimentally determined biological activity values in the public domain. The dimensionality (the number of physico-chemical features) characterizing these molecules is relatively high. Our dataset shares these undesired characteristics: it is small, with relatively many features, and highly overlapping.

1.2 Prerequisites: FAMR for IC50 Prediction

The FAMR is a fuzzy ARTMAP (FAM) incremental learning system used for classification, probability estimation, and function approximation. We review the basic FAMR notation; details can be found in [4]. A FAM consists of a pair of fuzzy ART modules, ARTa and ARTb, connected by an inter-ART module called the mapfield. The fuzzy ARTa module contains the input layer, F_1^a, and the competitive layer, F_2^a [5]. A preprocessing layer, F_0^a, is also added before F_1^a. The ART modules create stable recognition categories in response to arbitrary sequences of input patterns. The ARTa and ARTb vigilance parameters, ρa and ρb, control the matching mechanism inside the modules.
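The matching mechanism controlled by the vigilance parameters can be illustrated with a small sketch (our own illustrative code; it assumes the standard fuzzy ART match criterion |x ∧ w| / |x| ≥ ρ, where ∧ is the component-wise minimum and |·| the L1 norm):

```python
def fuzzy_and(x, w):
    """Component-wise minimum: the fuzzy-AND used by fuzzy ART."""
    return [min(a, b) for a, b in zip(x, w)]

def passes_vigilance(x, w, rho):
    """Fuzzy ART vigilance test for input x against category weights w.

    Returns True when the match |x AND w| / |x| reaches the vigilance
    level rho (resonance); False means the category is reset and the
    search continues with another category.
    """
    return sum(fuzzy_and(x, w)) / sum(x) >= rho
```

Higher vigilance forces closer matches, so the module creates more, finer-grained categories.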
During learning, the mapfield weights are updated: the strength of the weight projecting from the selected ARTa category to the correct ARTb category is increased, while the strengths of the weights to other ARTb categories are decreased. A mapfield vigilance parameter ρab calibrates the degree of predictive mismatch necessary to trigger the search for a different ARTa category. If the weight projecting from the active ARTa category through the mapfield to the active ARTb category is smaller than ρab (vigilance test), then the system responds to the unexpected outcome through the so-called match tracking. This triggers an ARTa search for a new input category. After choosing an ARTa category whose prediction of the correct ARTb category is strong enough, match tracking is disengaged, and the network is said to be in a resonance state. In this case, the mapfield learns by updating the weights w_jk^ab of the associations between each j-th ARTa category and each k-th ARTb category. The FAMR uses the following iterative updating scheme, where J and K denote the active ARTa and ARTb categories:

w_jk^ab(new) = w_jk^ab(old),                                        if j ≠ J;
w_Jk^ab(new) = w_JK^ab(old) + (q_t / q_J^new)(1 - w_JK^ab(old)),    if j = J and k = K;    (1)
w_Jk^ab(new) = w_Jk^ab(old)(1 - q_t / q_J^new),                     if j = J and k ≠ K,

where q_t is the relevance assigned to the t-th input pattern (t = 1, 2, ...) and q_J^new = q_J^old + q_t. The relevance q_t is a real positive finite number directly proportional to the importance of the experiment considered at step t. This w_jk^ab approximation is a correct biased estimator of the posterior probability P(k|j), the probability of selecting the k-th ARTb category after having selected the j-th ARTa category. FAM (and FAMR) networks map subsets of R^n to R^m and can be used for function approximation. The FAM has been proven to be a universal function approximator [6]. We use the FAMR to predict functions that are known only at a certain number of points. More specifically, we predict IC50 values.
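One resonance step of this updating scheme can be sketched as follows (a minimal illustration using plain lists; the variable names are ours, not from [4]):

```python
def famr_update(w_ab, q, J, K, q_t):
    """One FAMR mapfield update for the active categories J (ARTa) and K (ARTb).

    w_ab : list of lists, w_ab[j][k] estimates P(k | j)
    q    : list of accumulated relevances, one per ARTa category
    q_t  : relevance of the current training pattern
    """
    q[J] += q_t                                  # q_J^new = q_J^old + q_t
    rate = q_t / q[J]
    for k in range(len(w_ab[J])):
        if k == K:                               # strengthen the correct association
            w_ab[J][k] += rate * (1.0 - w_ab[J][k])
        else:                                    # weaken all other associations
            w_ab[J][k] *= (1.0 - rate)
    # rows j != J are left unchanged
```

Because the update is a convex combination, row J still sums to 1 afterwards, consistent with its interpretation as an estimate of P(k|j).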
1.3 Our Previous Work

The present paper is based on a sequence of results, each describing new computational intelligence tools for biological activity (IC50) prediction. In [7], we investigated the use of a fuzzy neural network (FNN) for IC50 prediction. In [1] and [2], we improved this model by adding a two-stage genetic algorithm (GA) optimizer: the first stage selects the best subset of features, and the second optimizes the FNN parameters. We refer to this GA-optimized FNN as FS-GA-FNN. In [8], we also focused on the IC50 prediction task, using the FAMR model. During the learning phase, each sample pair is assigned a relevance factor proportional to the importance of that pair. The prediction method consists of two stages. First, GA optimization incorporating cross-validation is used to modify the training dataset. This modification consists of finding the best relevances for the data, according to some fitness criterion. The fitness criterion measures the FAMR IC50 prediction accuracy for a given training/validation dataset with given relevances. In stage two, the final FAMR is obtained by training it on the dataset with optimized relevances. In other words, stage one improves the generalization capability of the FAMR that will be obtained in stage two. We refer to this model with GA-optimized relevances as GA-FAMR. We compared the GA-FAMR and the Ordered FAMR (a FAMR algorithm which optimizes the order of training data presentation) in [9]. Both methods compensate for insufficient training data through additional optimizations; a trade-off between computational overhead and generalization capability is obtained. Recently, we performed rule extraction from the trained FAMR model [10]. We post-processed the set of generated rules in order to improve generalization. We eliminated overfitting by heuristic generalization of rules and by adding new rules. This method proved to be efficient for small training sets.
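The stage-one idea (searching for training-pattern relevances that minimize a validation-error fitness) can be illustrated with a toy random-search loop standing in for the GA, and any trainable learner standing in for the FAMR. Everything below is an illustrative sketch under those substitutions, not the GA-FAMR implementation:

```python
import random

def fitness(relevances, train, valid, learner):
    """Validation RMSE of a model trained with the given relevances."""
    model = learner(train, relevances)
    return (sum((model(x) - t) ** 2 for x, t in valid) / len(valid)) ** 0.5

def optimize_relevances(train, valid, learner, iters=300, seed=0):
    """Stage one: search for relevances that minimize validation error."""
    rng = random.Random(seed)
    best = [1.0] * len(train)                  # start from uniform relevances
    best_err = fitness(best, train, valid, learner)
    for _ in range(iters):
        # mutate each relevance multiplicatively, keeping it positive
        cand = [max(1e-3, r * rng.uniform(0.5, 1.5)) for r in best]
        err = fitness(cand, train, valid, learner)
        if err < best_err:                     # greedy acceptance of improvements
            best, best_err = cand, err
    return best, best_err                      # stage two would retrain on these
```

With a toy learner that predicts the relevance-weighted mean of the training targets, the loop shifts relevance toward the samples that agree with the validation data, mirroring how stage one reshapes the training set before the final model is trained in stage two.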
The present paper results from several invited talks [9, 11, 12]. In Section 2, we discuss the capability of neural networks to infer from rare samples. Section 3 describes two methods for neural training on small datasets. After presenting and discussing experimental results in Section 4, we conclude with our final remarks (Section 5).

2 Neural Networks Trained on Small Datasets

We aim to discuss the difficulties of inferring a neural network (NN) from small, or non-representative, training sets. We will look closer at the overfitting and generalization aspects of the network. But first, we need to define formally what we understand by a "small training set".

2.1 What is "Small"?

In many multivariable classification or regression (e.g., estimation or forecasting) problems, we have a training set T_p = (x_i, t_i) of p pairs of input/output vector x ∈